A Machine Learning Based Approach for Table Detection on the Web (2002)

by Yalin Wang, Jianying Hu
Venue: In Proceedings of the 11th Int’l Conf. on World Wide Web (WWW’02)
Citations: 47 - 0 self

Documents Related by Co-Citation

38 Mining tables from large scale HTML texts – Hsin-hsi Chen, Shih-chung Tsai, Jin-he Tsai - 2000
83 A Flexible Learning System for Wrapping Tables and Lists in HTML Documents – William W. Cohen, Matthew Hurst, Lee S. Jensen - 2002
40 A Survey of Table Recognition: Models, Observations, Transformations, and Inferences – R. Zanibbi, D. Blostein, J.R. Cordy - 2003
49 Towards Domain-Independent Information Extraction from Web Tables – Wolfgang Gatterbauer, Paul Bohunsky, Marcus Herzog, Bernhard Krüpl, Bernhard Pollak - 2007
104 Table Extraction Using Conditional Random Fields – David Pinto, Andrew McCallum, Xing Wei, W. Bruce Croft - 2003
98 Web-scale information extraction in KnowItAll: (preliminary results) – O Etzioni, M Cafarella, D Downey, S Kok, A-M Popescu, T Shaked, S Soderland, D S Weld, A Yates - 2004
214 Extracting structured data from web pages – Arvind Arasu - 2003
299 RoadRunner: Towards Automatic Data Extraction from Large Web Sites – Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo - 2001
25 Table Structure Recognition Based On Robust Block Segmentation – Thomas Kieninger - 1998
19 A method to integrate tables of the World Wide Web – Minoru Yoshida, Kentaro Torisawa - 2001
21 Layout and Language: Challenges for Table Understanding on the Web – Matthew Hurst - 2001
13 Flexible web document analysis for delivery to narrow-bandwidth devices – Gerald Penn, Jianying Hu, Hengbin Luo, Ryan McDonald - 2001
46 HTML page analysis based on visual cues – Y Yang, H Zhang - 2001
174 Mining the Web for Synonyms: PMI-IR Versus LSA on TOEFL – Peter D. Turney - 2001
70 WebTables: Exploring the Power of Tables on the Web – Michael J. Cafarella, Eugene Wu, Alon Halevy, Yang Zhang, Daisy Zhe Wang - 2008
22 Using Visual Cues for Extraction of Tabular Data from Arbitrary HTML Documents – Bernhard Krüpl, Marcus Herzog, Wolfgang Gatterbauer - 2005
10 Table extraction using spatial reasoning on the CSS2 visual box model – Wolfgang Gatterbauer, Paul Bohunsky - 2006
341 Extracting patterns and relations from the world wide web – Sergey Brin - 1998
43 Tabular abstraction, editing, and formatting – Xinxin Wang - 1996