A Machine Learning Based Approach for Table Detection on the Web (2002)

by Yalin Wang , Jianying Hu
Venue:In Proceedings of the 11th Int’l Conf. on World Wide Web (WWW’02
Citations:38 - 0 self

Documents Related by Co-Citation

31 Mining tables from large scale html texts – Hsin-hsi Chen, Shih-chung Tsai, Jin-he Tsai - 2000
71 A Flexible Learning System for Wrapping Tables and Lists in HTML Documents – William W. Cohen, Matthew Hurst, Lee S. Jensen - 2002
34 Towards Domain-Independent Information Extraction from Web Tables – Wolfgang Gatterbauer, Paul Bohunsky, Marcus Herzog, Bernhard Krüpl, Bernhard Pollak - 2007
32 A Survey of Table Recognition: Models, Observations, Transformations, and Inferences – R. Zanibbi, D. Blostein, J.R. Cordy - 2003
164 Extracting structured data from web pages – Arvind Arasu - 2003
248 RoadRunner: Towards Automatic Data Extraction from Large Web Sites – Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo, Università Roma, Tre Università, Basilicata Università, Roma Tre - 2001
12 Flexible web document analysis for delivery to narrow-bandwidth devices – Gerald Penn, Jianying Hu, Hengbin Luo, Ryan Mcdonald - 2001
76 Table Extraction Using Conditional Random Fields – David Pinto, Andrew Mccallum, Xing Wei, W. Bruce Croft - 2003
74 Web-scale information extraction in knowitall (preliminary results – O Etzioni, M Cafarella, D Downey, S Kok, A Popescu, T Shaked, S Soderland, D S Weld, A Yates - 2004
41 HTML Page Analysis Based On Visual Cues – Y Yang, H Zhang - 2001
7 Table Extraction Using Spatial Reasoning on the CSS2 Visual Box Model – Wolfgang Gatterbauer, Paul Bohunsky - 2006
37 Tabular abstraction, editing, and formatting – Xinxin Wang - 1996
65 Web data extraction based on partial tree alignment – Yanhong Zhai - 2005
25 TINTIN: A System for Retrieval in Text Tables – Pallavi Pyreddy, W. Bruce Croft - 1997
93 Active Learning for Natural Language Parsing and Information Extraction – Cynthia A. Thompson, Mary Elaine Califf, Raymond J. Mooney - 1999
21 Table Structure Recognition Based On Robust Block Segmentation – Thomas Kieninger - 1998
16 Structured Data Meets the Web: A Few Observations – Jayant Madhavan, Alon Halevy, Shirley Cohen, Xin (luna Dong, Shawn R. Jeffery, David Ko, Cong Yu, Google Inc
15 Snowball: A Prototype System for Extracting Relations from Large Text Collections – Eugene Agichtein, Luis Gravano, Viktoriya Sokolovna, Aleksandr Voskoboynik - 2001
7 Knocking the Door to the Deep Web: Integrating Web Query Interfaces – Bin He, Zhen Zhang, Kevin Chen-chuan Chang - 2004