Results 1 - 10
of
51,418
Table 3. E ect of vocabulary size (seconds)
"... In PAGE 8: ....2. Vocabulary Sizes In the second set of experiments, we xed the values of D (= 5000) and L (= 10) and varied S from 20 to 500. Table3 shows that the e ect of the number of di erent symbols, on both the sequential matching and the 2-D-S-tree, is not very signi cant. The reasons that the performance of the 2-D-S-tree improves slightly as the vocabulary size grows are as follows.... ..."
Table 3. Vocabulary size of each test set.
"... In PAGE 3: ...resulting from the texts used as training material for the language models (TRAIN). On Table3 we can compare the size of the vocabulary of each test set before and after the application of the decomposition method. As we can verify the reduction on the vo- cabulary size increases for larger vocabularies.... In PAGE 3: ... Vocabulary size of each test set. To evaluate the e ect of the decomposition on the number of OOVs we took all the training texts of the language models (which yields a vocabulary of 139,758 words as we can see from Table3 ) and esti- mated the percentage of words not covered by the vocabulary sets of 5K and 20K. From Table 4 we see that this percentage gets lower with the decom- posed vocabulary.... ..."
Table 6 Language Model Vocabulary Size and Out of Vocabulary Ratio
2003
"... In PAGE 6: ... Word error rate reduction due to the unsupervised stem acquisition is 38% for the segmenter developed from the 10K word manually segmented corpus and 32% for the segmenter developed from 110K word manually segmented corpus. Language model vocabulary size (LM VOC Size) and the unknown stem ratio (OOV ratio) of various segmenters is given in Table6 . For unsupervised stem acquisition, we have set the frequency threshold at 10 for every 10-15 million word corpus, i.... ..."
Cited by 15
Table 6 Language Model Vocabulary Size and Out of Vocabulary Ratio
2003
"... In PAGE 6: ... Word error rate reduction due to the unsupervised stem acquisition is 38% for the segmenter developed from the 10K word manually segmented corpus and 32% for the segmenter developed from 110K word manually segmented corpus. Language model vocabulary size (LM VOC Size) and the unknown stem ratio (OOV ratio) of various segmenters is given in Table6 . For unsupervised stem acquisition, we have set the frequency threshold at 10 for every 10-15 million word corpus, i.... ..."
Cited by 15
Table 1: Vocabulary sizes for two Turkish corpora.
2000
Cited by 1
Table 2: Vocabulary sizes and number of trig-
Table 1: Transcription quality vs vocabulary size
Table 4. F score for 11 datasets for each of the vocabulary size. Vocabulary Sizes
2006
"... In PAGE 24: ... Normal) We compared the clustering results using both F score and H score (computed as described in Expressions (34) and (37) respectively). The scores at different vocabulary sizes are given in the Table4 and Table 58. Vocabulary Sizes F Data k1a k1b re0 re1 tr11 tr12 tr23 tr31 tr41 tr45 wap Winner in 50 K N 0.... ..."
Cited by 2
Table 1: Result of bench mark test (V. size: Vocabulary size)
Results 1 - 10
of
51,418