Results 1 - 10
of
209
Estimation of probabilities from sparse data for the language model component of a speech recognizer
- IEEE Transactions on Acoustics, Speech and Signal Processing
, 1987
"... Abstract-The description of a novel type of rn-gram language model is given. The model offers, via a nonlinear recursive procedure, a com-putation and space efficient solution to the problem of estimating prob-abilities from sparse data. This solution compares favorably to other proposed methods. Wh ..."
Abstract
-
Cited by 799 (2 self)
- Add to MetaCart
, and it is a problem that one always encounters while collecting fre-quency statistics on words and word sequences (m-grams) from a text of finite size. This means that even for a very large data col-lection, the maximum likelihood estimation method does not allow Turing’s estimate PT for a probability of a
Íslenskur Orðasjóður- Building a Large Icelandic Corpus
"... We introduce an Icelandic corpus of more than 250 million running words and de-scribe the methodology to build it. The re-source is available for use free of charge. We provide automatically generated mono-lingual lexicon entries, comprising fre-quency statistics, samples of usage, co-occurring word ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
We introduce an Icelandic corpus of more than 250 million running words and de-scribe the methodology to build it. The re-source is available for use free of charge. We provide automatically generated mono-lingual lexicon entries, comprising fre-quency statistics, samples of usage, co
Comparing corpora
- International Journal of Corpus Linguistics
"... Corpus linguistics lacks strategies for describing and compar-ing corpora. Currently most descriptions of corpora are textual, and questions such as ‘what sort of a corpus is this?’, or ‘how does this corpus compare to that? ’ can only be answered impressionistically. This paper considers various wa ..."
Abstract
-
Cited by 107 (6 self)
- Add to MetaCart
one performs best. All methods considered in this paper are based on word and ngram fre-quencies; the strategy is defended. 1
Dictionary Acquisition using Parallel Text and Co-occurrence Statistics
"... We present a simple and efficient approach for deriving bilingual dic-tionaries from sentence-aligned par-allel text by extending the notion of co-occurrences to a cross-lingual setting. Dictionaries are evaluated against gold standards and manu-ally; the analysis accounts for fre-quency and corpus ..."
Abstract
-
Cited by 2 (2 self)
- Add to MetaCart
We present a simple and efficient approach for deriving bilingual dic-tionaries from sentence-aligned par-allel text by extending the notion of co-occurrences to a cross-lingual setting. Dictionaries are evaluated against gold standards and manu-ally; the analysis accounts for fre-quency and corpus
A statistical basis for speech sound discrimination
- Language and Speech
, 2003
"... Infants under six months are able to discriminate native and non-native con-sonant contrasts equally well, but as they learn the phonological systems of their native language, this ability declines. Current explanations of this phenomenon agree that the decline in discrimination ability is linked to ..."
Abstract
-
Cited by 33 (1 self)
- Add to MetaCart
-erties of the native language. A simple attractor model suffices to account for these and previous results on loss of discrimination of non-native-language contrasts and suggests that the technique of measuring graded loss of multiple contrasts, in combination with observation of input fre-quencies, can offer a
© Institute of Mathematical Statistics, 2009 ASYMPTOTICS FOR SPHERICAL NEEDLETS
"... We investigate invariant random fields on the sphere using a new type of spherical wavelets, called needlets. These are compactly supported in fre-quency and enjoy excellent localization properties in real space, with quasi-exponentially decaying tails. We show that, for random fields on the sphere, ..."
Abstract
- Add to MetaCart
We investigate invariant random fields on the sphere using a new type of spherical wavelets, called needlets. These are compactly supported in fre-quency and enjoy excellent localization properties in real space, with quasi-exponentially decaying tails. We show that, for random fields on the sphere
Natural image statistics mediate brightness ‘filling-in
- Proceedings of the Royal Society of London Series B – Biological Sciences
, 2003
"... Although the human visual system can accurately estimate the reflectance (or lightness) of surfaces under enormous variations in illumination, two equiluminant grey regions can be induced to appear quite differ-ent simply by placing a light–dark luminance transition between them. This illusion, the ..."
Abstract
-
Cited by 11 (0 self)
- Add to MetaCart
of the low spatial fre-quency (SF) structure of the image. We develop a simple computational model that relies on the statistics of natural scenes actively to reconstruct the image that is most likely to have caused an observed series of responses across SF channels. This principle is tested psychophysically
FACULTY FORUM Superiority of Women in Statistics Achievement
"... Contrary to Buck's (1 985) recent report, in my introductory sta-tistics course, female students make higher grades than male stu-dents. This article compares my experience with Buck's and men-tions some of my anecdotal observations concerning the women's superior performance. As Buck ..."
Abstract
- Add to MetaCart
. As Buck (1985) recently suggested, most studies indicate that, compared to men, women express greater mathemat-ics anxiety and perform less well in mathematics and mathematics-related courses. Comparing letter grade fre-quencies for men and women in her statistics classes, she found "no significant
ORIGINAL PAPER Joint statistics of natural frequencies of stochastic dynamic systems
"... structural systems is usually associated with some amount of uncertainty in specifying material proper-ties, geometric parameters and boundary conditions. In the context of structural dynamics it is necessary to consider joint probability distribution of the natural frequencies in order to account f ..."
Abstract
- Add to MetaCart
frequencies of linear sto-chastic systems is derived. The proposed method does not employ the small-randomness andGaussian random variable assumption usually used in the perturbation based methods. Joint distributions of the natural fre-quencies are investigated using numerical examples and the results
Statistical Thinking: No One Left Behind
"... Abstract Is the mind an “intuitive statistician”? Or are humans biased and error-prone when it comes to probabilistic thinking? While researchers in the 1950s and 1960s suggested that people reason approximately in accordance with the laws of probability theory, research conducted in the heuristics- ..."
Abstract
- Add to MetaCart
this line of research is the power of representation formats. For instance, information presented by means of natural fre-quencies, numerical or pictorial, fosters the understanding of statistical information and improves probabilistic reasoning, whereas conditional probabilities tend to im
Results 1 - 10
of
209