Orthographic errors in web pages: Towards cleaner web corpora (2006)

by Christoph Ringlstetter, Klaus U. Schulz, Stoyan Mihov
Venue: Computational Linguistics
Citations: 5 - 2 self

Documents Related by Co-Citation

294 Techniques for automatically correcting words in text – K Kukich - 1992
6 Text-Induced Spelling Correction – Martin Reynaert - 2005
11 OCRSpell: an interactive spelling correction system for OCR errors in text – Kazem Taghva, Eric Stofsky - 2001
5 A visual and interactive tool for optimizing lexical postcorrection of OCR results – Christian Strohmaier, Christoph Ringlstetter, Klaus U. Schulz, Stoyan Mihov - 2003
4 Rule-based search in text databases with nonstandard orthography – Thomas Pilz, Wolfram Luther, Norbert Fuhr, Ulrich Ammon
2 Syntax des Frühneuhochdeutschen – Johannes Erben - 2000
3 A toolbox for record linkage – Rainer Schnell, Tobias Bachteler, Stefan Bender - 2004
2 Fast String Correction with Levenshtein-Automata – Klaus U. Schulz, Stoyan Mihov
2 Untersuchungen zur Druckersprache in den Flugschriften Martin Bucers – Christina Stockmann-Hovekamp - 1991
2 Information retrieval for languages that lack a fixed orthography – Jan Strunk - 2003
2 Wortbildung des Frühneuhochdeutschen – Klaus-Peter Wegera, Hans-Peter Prell - 2000
2 Lexikologie und Lexikographie des Frühneuhochdeutschen – Dieter Wolf - 2000
21 A search engine for historical manuscript images – T Rath, R Manmatha, V Lavrenko - 2004
9 VARD versus Word: A comparison of the UCREL variant detector and modern spell checkers on English historic corpora – P Rayson, D Archer, N Smith - 2005
6 Content-Aware DataGuides: Interleaving IR and DB Indexing Techniques for Efficient Retrieval of Textual XML Data – Felix Weigel, Holger Meuss, François Bry, Klaus U. Schulz - 2003
8 Fast Approximate Search in Large Dictionaries – Stoyan Mihov, Klaus U. Schulz - 2004
5 The identification of spelling variants in English and German historical texts: manual or automatic? – D Archer, A Ernst-Gerlach, S Kempken, T Pilz, P Rayson - 2006
10 A segmentation-free approach for keyword search in historical typewritten documents – B. Gatos, T. Konidaris, K. Ntzios, I. Pratikakis, S. J. Perantonis - 2005
4 Generating search term variants for text collections with historic spellings – A Ernst-Gerlach, N Fuhr - 2006