|
31
|
Mining tables from large scale html texts
– Hsin-hsi Chen, Shih-chung Tsai, Jin-he Tsai
- 2000
|
|
71
|
A Flexible Learning System for Wrapping Tables and Lists in HTML Documents
– William W. Cohen, Matthew Hurst, Lee S. Jensen
- 2002
|
|
34
|
Towards Domain-Independent Information Extraction from Web Tables
– Wolfgang Gatterbauer, Paul Bohunsky, Marcus Herzog, Bernhard Krüpl, Bernhard Pollak
- 2007
|
|
32
|
A Survey of Table Recognition: Models, Observations, Transformations, and Inferences
– R. Zanibbi, D. Blostein, J.R. Cordy
- 2003
|
|
164
|
Extracting structured data from web pages
– Arvind Arasu
- 2003
|
|
248
|
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
– Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo, Università Roma, Tre Università, Basilicata Università, Roma Tre
- 2001
|
|
12
|
Flexible web document analysis for delivery to narrow-bandwidth devices
– Gerald Penn, Jianying Hu, Hengbin Luo, Ryan Mcdonald
- 2001
|
|
76
|
Table Extraction Using Conditional Random Fields
– David Pinto, Andrew Mccallum, Xing Wei, W. Bruce Croft
- 2003
|
|
74
|
Web-scale information extraction in knowitall (preliminary results
– O Etzioni, M Cafarella, D Downey, S Kok, A Popescu, T Shaked, S Soderland, D S Weld, A Yates
- 2004
|
|
41
|
HTML Page Analysis Based On Visual Cues
– Y Yang, H Zhang
- 2001
|
|
7
|
Table Extraction Using Spatial Reasoning on the CSS2 Visual Box Model
– Wolfgang Gatterbauer, Paul Bohunsky
- 2006
|
|
37
|
Tabular abstraction, editing, and formatting
– Xinxin Wang
- 1996
|
|
65
|
Web data extraction based on partial tree alignment
– Yanhong Zhai
- 2005
|
|
25
|
TINTIN: A System for Retrieval in Text Tables
– Pallavi Pyreddy, W. Bruce Croft
- 1997
|
|
93
|
Active Learning for Natural Language Parsing and Information Extraction
– Cynthia A. Thompson, Mary Elaine Califf, Raymond J. Mooney
- 1999
|
|
21
|
Table Structure Recognition Based On Robust Block Segmentation
– Thomas Kieninger
- 1998
|
|
16
|
Structured Data Meets the Web: A Few Observations
– Jayant Madhavan, Alon Halevy, Shirley Cohen, Xin (luna Dong, Shawn R. Jeffery, David Ko, Cong Yu, Google Inc
|
|
15
|
Snowball: A Prototype System for Extracting Relations from Large Text Collections
– Eugene Agichtein, Luis Gravano, Viktoriya Sokolovna, Aleksandr Voskoboynik
- 2001
|
|
7
|
Knocking the Door to the Deep Web: Integrating Web Query Interfaces
– Bin He, Zhen Zhang, Kevin Chen-chuan Chang
- 2004
|