New Tools for Web-Scale N-grams

by Dekang Lin , Kenneth Church , Heng Ji , Satoshi Sekine , David Yarowsky , Shane Bergsma , Kailash Patil , Emily Pitler , Rachel Lathbury , Vikram Rao , Kapil Dalwani , Sushant Narsale
Citations:28 - 11 self

Documents Related by Co-Citation

833 Word Association Norms, Mutual Information, and Lexicography – Kenneth Ward Church, Patrick Hanks - 1990
63 Web-based models for natural language processing – Mirella Lapata, Frank Keller - 2005
547 LIBLINEAR: A Library for Large Linear Classification – Rong-en Fan, Kai-wei Chang, Cho-jui Hsieh, Xiang-rui Wang, Chih-jen Lin - 2008
2102 Building a Large Annotated Corpus of English: The Penn Treebank – Mitchell P. Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz - 1993
10 Durme and Ashwin Lall. 2010. Online generation of locality sensitive hash signatures – Benjamin Van
137 Paraphrasing with Bilingual Parallel Corpora – Colin Bannard, Chris Callison-burch - 2005
44 Web 1T 5-gram corpus version 1.1 – T BRANTS, A FRANZ - 2006
34 Search engine statistics beyond the n-gram: Application to noun compound bracketing – Preslav Nakov - 2005
42 Corpus Statistics Meet the Noun Compound: Some Empirical Results – Mark Lauer - 1995
41 Web 1T 5-gram version 1 – Thorsten Brants, Alex Franz - 2006
19 Large Scale Acquisition of Paraphrases for Learning Surface Patterns – Rahul Bhagat, Deepak Ravichandran
18 algorithms and nlp: using locality sensitive hash function for high speed noun clustering – Randomized
10 Large-scale supervised models for noun phrase bracketing – D Vadas, J Curran
698 Class-Based n-gram Models of Natural Language – Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai - 1992
25 Adding noun phrase structure to the Penn treebank – D Vadas, J Curran - 2007
137 Using the web to obtain frequencies for unseen bigrams. Comput. Linguist – Frank Keller, Mirella Lapata - 2003
20 The Linguistic Structure of English Web-Search Queries – Cory Barr, Rosie Jones, Moira Regelson
118 Large language models in machine translation – Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean - 2007
876 Automatic Acquisition of Hyponyms from Large Text Corpora – Marti A. Hearst - 1992