Analysis of lexical signatures for improving information persistence on the World Wide Web (2004)

by Seung-Taek Park, David M. Pennock, C. Lee Giles, Robert Krovetz
Venue: ACM Transactions on Information Systems
Citations: 13 - 0 self

Active Bibliography

11 Analysis of Lexical Signatures for Finding Lost or Related Documents – Seung-Taek Park, David M. Pennock, C. Lee Giles, Robert Krovetz - 2002
4 Methodologies for Crawler Based Web – Mike Thelwall - 2002
15 Methodologies for Crawler Based Web Surveys – Mike Thelwall - 2002
External Search Engine Mining – Maxim Gurevich - 2007
1 Random Sampling from a Search Engine’s Corpus – Ziv Bar-Yossef, Maxim Gurevich - 2006
2 Focused Sampling: Computing Topical Web Statistics – Ziv Bar-Yossef, Tapas Kanungo, Robert Krauthgamer - 2005
63 Random sampling from a search engine’s index – Ziv Bar-Yossef, Maxim Gurevich - 2006
20 Efficient search engine measurements – Ziv Bar-Yossef, Maxim Gurevich - 2007
A Comparison of Techniques for Sampling Web Pages – Eda Baykan, Monika Henzinger, Stefan F. Keller - 2009
6 A Comparison of Techniques for Sampling Web Pages – Eda Baykan, Monika Henzinger, Stefan F. Keller, Sebastian De Castelberg, Markus Kinzler
Web Structure in 2005 – Yu Hirate, Shin Kato, Hayato Yamana
7 Combining Text-, Link-, and Classification-based Retrieval Methods to Enhance Information Discovery on the Web – Kiduk Yang - 2002
Decoding the structure of the WWW: A comparative analysis of Web crawls – M. Ángeles Serrano, Ana Maguitman, Marián Boguñá, Santo Fortunato, Alessandro Vespignani - 2006
7 Decoding the structure of the WWW: A comparative analysis of Web crawls – M. Ángeles Serrano, Ana Maguitman
63 Information retrieval on the Web – Mei Kobayashi, Koichi Takeda - 2000
Background Readings for Collection Synthesis – n.n. - 2002
6 Lazy Preservation: Reconstructing Websites from the Web Infrastructure – Frank McCown (director: Michael L. Nelson; committee members: William Y. Arms, Johan Bollen, Kurt Maly, Ravi Mukkamala) - 2007
Effective Web Crawling - Chapter 2 – Carlos Castillo - 2004
23 Effective Web Crawling – Carlos Castillo, Dr. Alistair Moffat, Dr. Gonzalo Navarro - 2004