myPortal: Robust Extraction and Aggregation of Web Content

by Marek Kowalkiewicz
Citations: 1 - 0 self

Documents Related by Co-Citation

7 Robust pointing by XPath language: Authoring support and empirical evaluation – Mari Abe, Masahiro Hori - 2003
1 Documentum ECI Self-Repairing Wrappers: Performance Analysis – Boris Chidlovskii
4 Learning Metrics between Tree Structured Data: Application to Image Recognition – Laurent Boyer, Amaury Habrard, Marc Sebban
4 Learning stochastic tree edit distance – Marc Bernard, Amaury Habrard, Marc Sebban - 2006
1 Robust web content extraction – Marek Kowalkiewicz, Maria E Orlowska, Tomasz Kaczmarek, Witold Abramowicz - 2006
8 Wrapping Web Data into XML – Wei Han, David Buttler, Calton Pu - 2001
7 Robust Web Data Extraction with XML Path Expressions – Jussi Myllymaki, Jared Jackson - 2002
10 XPath-wrapper induction by generating tree traversal patterns – T Anton - 2005
63 Building light-weight wrappers for legacy web data-sources using W4F – Arnaud Sahuguet - 1999
71 A Flexible Learning System for Wrapping Tables and Lists in HTML Documents – William W. Cohen, Matthew Hurst, Lee S. Jensen - 2002
9 Learning Stochastic Edit Distance: application in handwritten character recognition – Jose Oncina, Marc Sebban
54 Wrapper Maintenance: A Machine Learning Approach – Kristina Lerman, Steven N. Minton, Craig A. Knoblock - 2003
126 Generating Finite-State Transducers For Semi-Structured Data Extraction From The Web – Chun-nan Hsu, Ming-Tzung Dung - 1998
460 Wrapper Induction for Information Extraction – Nicholas Kushmerick - 1997
41 STALKER: Learning extraction rules for semistructured, Web-based information sources – I Muslea, S Minton, C Knoblock - 1998
255 Simple fast algorithms for the editing distance between trees and related problems – K Zhang, D Shasha - 1989
254 On Discriminative vs. Generative classifiers: A comparison of logistic regression and naive Bayes – Andrew Y. Ng, Michael I. Jordan - 2001
157 Visual Web Information Extraction with Lixto – Robert Baumgartner, Sergio Flesca, Georg Gottlob - 2001
248 RoadRunner: Towards Automatic Data Extraction from Large Web Sites – Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo - 2001