Wrapper Induction for Information Extraction (1997)

by Nicholas Kushmerick
Citations:519 - 30 self

Documents Related by Co-Citation

332 Bottom-Up Relational Learning of Pattern Matching Rules for Information Extraction – Mary Elaine Califf, Raymond J. Mooney, David Cohn - 2003
297 A Scalable Comparison-Shopping Agent for the World-Wide Web – Robert B. Doorenbos, Oren Etzioni, Daniel S. Weld - 1997
475 The TSIMMIS Project: Integration of Heterogeneous Information Sources – Sudarshan Chawathe, Hector Garcia-Molina, Joachim Hammer, Kelly Ireland, Yannis Papakonstantinou, Jeffrey Ullman, Jennifer Widom
75 Cut and Paste – Giansalvatore Mecca, Paolo Atzeni - 1998
342 Learning Information Extraction Rules for Semi-structured and Free Text – Stephen Soderland, Claire Cardie, Raymond Mooney - 1999
152 Information Extraction from HTML: Application of a General Machine Learning Approach – Dayne Freitag - 1998
116 Semi-automatic Wrapper Generation for Internet Information Sources – Naveen Ashish, Craig Knoblock - 1997
114 Modeling Web Sources for Information Integration – Craig A. Knoblock, Steven Minton, Jose Luis Ambite, Naveen Ashish, Pragnesh Jay Modi, Ion Muslea, Andrew G. Philpot, Sheila Tejada - 1997
138 Generating Finite-State Transducers For Semi-Structured Data Extraction From The Web – Chun-nan Hsu, Ming-Tzung Dung - 1998
491 Querying Semi-Structured Data – Serge Abiteboul - 1997
662 The Lorel Query Language for Semistructured Data – Serge Abiteboul, Dallan Quass, Jason Mchugh, Jennifer Widom, Janet Wiener - 1997
114 Learning to Extract Text-based Information from the World Wide Web – Stephen Soderland - 1997
243 Querying the World Wide Web – Alberto O. Mendelzon, George A. Mihaila, Tova Milo - 1997
178 Wrapper generation for semistructured internet sources – Naveen Ashish, Craig A. Knoblock - 1997
856 Learning logical definitions from relations – J. R. Quinlan - 1990
35 Learning Text Analysis Rules For Domain-Specific Natural Language Processing – Stephen G. Soderland - 1997
175 Extracting Semistructured Information from the Web – J. Hammer, H. Garcia-molina, J. Cho, R. Aranha, A. Crespo - 1997
214 Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity – William W. Cohen - 1998
181 Query Caching and Optimization in Distributed Mediator Systems – S. Adali, K. S. Candan, Y. Papakonstantinou, V. S. Subrahmanian - 1996