Extracting data records from the web using tag path clustering (2009)

by Gengxin Miao , Junichi Tatemura , Wang-pin Hsiung , Arsany Sawires , Louise E. Moser
Venue:In WWW ’09: Proceedings of the 18th international conference on World wide web
Citations:16 - 0 self

Active Bibliography

1 A Generalized Tree Matching Algorithm Considering Nested Lists for Web Data Extraction – Nitin Jindal, Bing Liu
Visually Extracting Data Records from the Deep Web – Neil Anderson, Jun Hong
96 A Survey of Web Information Extraction Systems – Chia-Hui Chang, Mohammed Kayed, Moheb Ramzy Girgis, Khaled Shaalan - 2006
1 RecipeCrawler: Collecting Recipe Data from WWW Incrementally – Yu Li, Xiaofeng Meng, Liping Wang, Qing Li
and – Weifeng Su, Frederick H. Lochvsky
13 Automatic Extraction of Dynamic Record Sections From Search Engine Result Pages. VLDB – Hongkun Zhao, Weiyi Meng - 2006
6 FiVaTech: Page-level web data extraction from template pages – Mohammed Kayed, Chia-hui Chang - 2010
10 NET - A System for Extracting Web Data from Flat and Nested Data Records – Bing Liu, Yanhong Zhai - 2005
Noname manuscript No. (will be inserted by the editor) Harvesting Relational Tables from Lists on the Web – Hazem Elmeleegy, Jayant Madhavan Alon, J. Madhavan, A. Halevy
19 Harvesting Relational Tables from Lists on the Web – Hazem Elmeleegy, Jayant Madhavan, Alon Halevy
3 Efficient Record-Level Wrapper Induction – Shuyi Zheng, Ruihua Song, Ji-rong Wen, C. Lee Giles
7 Dynamic hierarchical Markov random fields for integrated web data extraction – Jun Zhu, Zaiqing Nie, Bo Zhang, Ji-rong Wen, John Lafferty
ABSTRACT – Marilena Oita - 2012
6 Can We Learn a Template-Independent Wrapper for News Article Extraction from a Single Training Site? ∗ – Junfeng Wang, Chun Chen
1 From One Tree to a Forest: a Unified Solution for Structured Web Data Extraction – Qiang Hao, Rui Cai, Yanwei Pang, Lei Zhang
1 Annotating Search Results from Web Databases – Yiyao Lu, Hai He, Hongkun Zhao, Weiyi Meng, Clement Yu, Senior Member
1 ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data – Talel Abdessalem, Bogdan Cautis, Nora Derouiche
1 Hybrid Method for Automated News Content Extraction from the Web – Yu Li, Xiaofeng Meng, Qing Li, Liping Wang
11 Joint optimization of wrapper generation and template detection – Shuyi Zheng, Di Wu, Ruihua Song