Extracting data records from the web using tag path clustering (2009)

by Gengxin Miao, Junichi Tatemura, Wang-pin Hsiung, Arsany Sawires, Louise E. Moser
Venue: In WWW ’09: Proceedings of the 18th international conference on World wide web
Citations: 16 - 0 self

Documents Related by Co-Citation

69 WebTables: Exploring the Power of Tables on the Web – Michael J. Cafarella, Eugene Wu, Alon Halevy, Yang Zhang, Daisy Zhe Wang - 2008
91 Web data extraction based on partial tree alignment – Yanhong Zhai - 2005
61 Mining Data Records in Web Pages – Bing Liu, Robert Grossman, Y. Zhai - 2003
100 IEPAD: Information Extraction Based on Pattern Discovery – Chia-hui Chang, Shao-Chen Lui - 2001
296 RoadRunner: Towards Automatic Data Extraction from Large Web Sites – Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo - 2001
519 Wrapper Induction for Information Extraction – Nicholas Kushmerick - 1997
212 Extracting structured data from web pages – Arvind Arasu - 2003
96 A Survey of Web Information Extraction Systems – Chia-Hui Chang, Mohammed Kayed, Moheb Ramzy Girgis, Khaled Shaalan - 2006
49 Towards Domain-Independent Information Extraction from Web Tables – Wolfgang Gatterbauer, Paul Bohunsky, Marcus Herzog, Bernhard Krüpl, Bernhard Pollak - 2007
21 Answering table augmentation queries from unstructured lists on the web – R Gupta, S Sarawagi - 2009
5 Ondux: on-demand unsupervised learning for information extraction – E Cortez, A S da Silva, M A Gonçalves, E S de Moura - 2010
85 Data extraction and label assignment for web databases – J Wang, F H Lochovsky
7 CETR: Content Extraction via Tag Ratios – Tim Weninger, William H. Hsu, Jiawei Han
73 A Fully Automated Object Extraction System for the World Wide Web – David Buttler, Ling Liu, Calton Pu - 2001
53 Extracting content structure for web pages based on visual representation – Deng Cai, Shipeng Yu, Ji-rong Wen, Wei-ying Ma - 2003
3 Open information extraction from the Web (IJCAI ’07) – M Banko, M J Cafarella, S Soderland, M Broadhead, O Etzioni
63 Organizing and searching the world wide web of facts – step two: harnessing the wisdom of the crowds – Dekang Lin, Jeffrey Bigham, Andrei Lifchits, Alpa Jain - 2007
14 Mining Data Records – B Liu, R Grossman, Y Zhai - 2003
2 Exploiting anchor text for the navigational web retrieval at NTCIR-5 – A Fujii, K Itou, T Akiba, T Ishikawa - 2005