Mining tables from large scale html texts (2000)

by Hsin-hsi Chen , Shih-chung Tsai , Jin-he Tsai
Venue:In Proceedings of the 18th International Conference on Computational Linguistics (COLING’00
Citations:38 - 0 self

Documents Related by Co-Citation

47 A Machine Learning Based Approach for Table Detection on the Web – Yalin Wang, Jianying Hu - 2002
19 A method to integrate tables of the World Wide Web – Minoru Yoshida, Kentaro Torisawa - 2001
19 Learning to Recognize Tables in Free Text – Hwee Tou Ng - 1999
40 A Survey of Table Recognition: Models, Observations, Transformations, and Inferences – R. Zanibbi, D. Blostein, J.R. Cordy - 2003
17 Layout and Language: Preliminary investigations in recognizing the structure of tables – Matthew Hurst, Shona Douglas
212 Extracting structured data from web pages – Arvind Arasu - 2003
296 RoadRunner: Towards Automatic Data Extraction from Large Web Sites – Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo, Università Roma, Tre Università, Basilicata Università, Roma Tre - 2001
15 Detecting Tables in HTML Documents – Yalin Wang, Jianying Hu - 2002
25 Table Structure Recognition Based On Robust Block Segmentation – Thomas Kieninger - 1998
27 Recursive x-y cut using bounding boxes of connected components – J Ha, I T Phillips, R M Haralick - 1993
9 Applying the t-rec table recognition system to the business letter domain – T Kieninger, A Dengel - 2001
16 A Retargetable Table Reader – John H. Shamilian, Henry S. Baird, Thomas L. Wood - 1997
21 Layout and Language: Challenges for Table Understanding on the Web – Matthew Hurst - 2001
43 Tabular abstraction, editing, and formatting – Xinxin Wang - 1996
83 A Flexible Learning System for Wrapping Tables and Lists in HTML Documents – William W. Cohen, Matthew Hurst, Lee S. Jensen - 2002
98 Web-scale information extraction in knowitall: (preliminary results – O Etzioni - 2004
103 Table Extraction Using Conditional Random Fields – David Pinto, Andrew Mccallum, Xing Wei, W. Bruce Croft - 2003
20 Model-based Analysis of Printed Tables – Edward Green, Mukki S. Krishnamoorthy - 1995
49 Towards Domain-Independent Information Extraction from Web Tables – Wolfgang Gatterbauer, Paul Bohunsky, Marcus Herzog, Bernhard Krüpl, Bernhard Pollak - 2007