In this paper we survey algorithmic aspects of Web information retrieval. As an example, we discuss ranking of search engine results using connectivity analysis.
|
1839
|
The Anatomy of a Large-Scale Hypertextual Web Search Engine
– Brin, Page
- 1998
|
|
1669
|
Authoritative sources in a hyperlinked environment
– Kleinberg
- 1999
|
|
1439
|
Modern Information Retrieval
– Baeza-Yates, Ribeiro
- 1999
|
|
1064
|
The PageRank Citation Ranking: Bringing Order to the Web
– Page, Brin, et al.
- 1999
|
|
272
|
A Scalable Comparison-Shopping Agent for the World Wide Web
– Doorenbos, Etzioni, et al.
- 1997
|
|
263
|
Syntactic clustering of the Web
– Broder, Glassman, et al.
|
|
254
|
Enhanced hypertext categorization using hyperlinks
– Chakrabarti, Dom, et al.
- 1998
|
|
218
|
O.: Web document clustering: A feasibility demonstration
– Zamir, Etzioni
- 1998
|
|
208
|
The web as a graph: Measurements, models and methods
– Kleinberg, Kumar, et al.
- 1999
|
|
200
|
Efficient crawling through URL ordering
– Cho, Garcia-Molina, et al.
- 1998
|
|
158
|
Latent semantic indexing: A probabilistic analysis
– Papadimitriou, Tamaki, et al.
- 1998
|
|
136
|
The evolution of the Web and implications for an incremental crawler
– Cho, Garcia-Molina
- 2000
|
|
134
|
Analysis of a very large AltaVista query log
– Silverstein, Henzinger, et al.
- 1998
|
|
128
|
A.: A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines
– Bharat, Broder
- 1998
|
|
127
|
Bibliographic coupling between scientific papers
– KESSLER
- 1963
|
|
113
|
H.: Copy detection mechanisms for digital documents
– Brin, Davis, et al.
- 1995
|
|
102
|
Efficient computation of PageRank
– Haveliwala
- 1999
|
|
91
|
The Connectivity Server: Fast access to linkage information on the Web
– Bharat, Bröder, et al.
- 1998
|
|
86
|
WebQuery: Searching and visualizing the Web through connectivity
– Carrière, Kazman
|
|
66
|
A new status index derived from sociometric analysis
– Katz
- 1953
|
|
64
|
On nearuniform URL sampling
– Henzinger, Heydon, et al.
- 2000
|
|
55
|
Optimal robot scheduling for web search engines
– man, Liu, et al.
- 1997
|
|
50
|
Finding replicated web collections
– Cho, Shivakumar, et al.
- 2000
|
|
49
|
Finding near-replicas of documents on the Web
– SHIVAKUMAR, GARCIA-MOLINA
- 1998
|
|
48
|
A comparison of techniques to find mirrored hosts on the WWW
– Bharat, Broder, et al.
- 2000
|
|
48
|
What is this page known for? computing web page reputations
– RAFIEI, MENDELZON
|
|
38
|
Clustering hypertext with applications to web searching
– Modha, Spangler
- 2000
|
|
21
|
Real Life Information Retrieval: A
– Jansen, Spink, et al.
- 1998
|
|
17
|
Finding related Web pages in the World Wide Web
– Dean, Henzinger
- 1998
|
|
11
|
Techniques for disaggregating centrality scores in social networks
– Mizruchi, Mariolis, et al.
- 1986
|
|
9
|
Measuring search engine quality using random walks on the Web
– Henzinger, Heydon, et al.
- 1999
|
|
9
|
Co-citation in the scienti literature: A new measure of the relationship between two documents
– Small
- 1973
|
|
4
|
Algorithmic aspects of information retrieval on the web
– Broder, Henzinger
- 2001
|
|
2
|
Citation analysis as a tool in journal evaluation
– Gar
- 1972
|
|
1
|
Citation Indexing
– Gar
- 1979
|