The link structure of a hypermedia environment can be a rich source of information about the content of the environment, provided we have effective means for understanding it. Versions of this principle have been studied in the hypertext research community and (in a context predating hypermedia) through journal citation analysis in the field of bibliometrics. But for the problem of searching in hyperlinked environments such as the World Wide Web, it is clear from the prevalent techniques that the information inherent in the links has yet to be fully exploited. In this work we develop a new method for automatically extracting certain types of information about a hypermedia environment from its link structure, and we report on experiments that demonstrate its effectiveness for a variety of search problems on the www. The central problem we consider is that of determining the relative "authority" of pages in such environments. This issue is central to a number of basic hypertext search t...
|
1839
|
The Anatomy of a Large-Scale Hypertextual Web Search Engine
– Brin, Page
- 1998
|
|
1636
|
Indexing by latent semantic analysis
– Deerwester, Dumais, et al.
- 1990
|
|
970
|
Principal Component Analysis
– Jolliffe
- 1986
|
|
565
|
Automatic Text Processing
– Salton
- 1989
|
|
430
|
Scatter/gather: a cluster-based approach to browsing large document collections
– Cutting, Karger, et al.
- 1992
|
|
422
|
Spectral Graph Theory
– Chung
- 1997
|
|
349
|
Improved algorithms for topic distillation in hyperlinked environments
– Bharat, Henzinger
- 1998
|
|
253
|
Inferring Web communities from link topology
– Gibson, Kleinberg, et al.
- 1998
|
|
244
|
Automatic resource compilation by analyzing hyperlink structure and associated text
– Chakrabarti, Dom, et al.
- 1998
|
|
239
|
Analysis of a Complex of Statistical Variables into Principal Components
– Hotelling
- 1993
|
|
205
|
Silk from a sow’s ear: Extracting usable structures from the Web
– Pirolli, Pitkow, et al.
- 1996
|
|
171
|
Co-citation in the scientific literature: A new measu re of the relationship between two documents
– Small
- 1973
|
|
158
|
Latent semantic indexing: A probabilistic analysis
– Papadimitriou, Tamaki, et al.
- 1998
|
|
137
|
Structural analysis of hypertexts: Identifying hierarchies and useful metrics
– BOTAFOGO, RIVLIN, et al.
- 1992
|
|
127
|
Bibliographic coupling between scientific papers
– KESSLER
- 1963
|
|
114
|
Lower bounds for the partitioning of graphs
– Donath, Hoffman
- 1973
|
|
105
|
Clustering categorical data: An approach based on dynamic systems
– GIBSON, KLEINBERG, et al.
- 1998
|
|
93
|
GENVL and WWWW: Tools for Taming the Web
– McBryan
- 1994
|
|
87
|
Parasite: Mining structural information on the web
– Spertus
- 1997
|
|
86
|
WebQuery: Searching and visualizing the Web through connectivity
– Carrière, Kazman
|
|
86
|
Hypursuit: A hierarchical network search engine that exploits content-link hypertext clustering
– Weiss, Velez, et al.
- 1996
|
|
82
|
How to personalize the Web
– Barrett, Maglio, et al.
- 1997
|
|
82
|
Fast Monte-Carlo algorithms for finding low rank approximations
– Frieze, Kannan, et al.
- 1998
|
|
82
|
Bibliometrics of the World Wide Web: an exploratory analysis of the intellectual structure of cyberspace
– Larson
|
|
79
|
Searching for information in a hypertext medical handbook
– Frisse
|
|
73
|
The quest for correct information on the Web: Hyper search engines
– Marchiori
- 1997
|
|
67
|
Citation analysis as a tool in journal evaluation
– Garfield
- 1972
|
|
66
|
A new status index derived from sociometric analysis
– Katz
- 1953
|
|
65
|
Applications of a Web query language
– Arocena, Mendelzon, et al.
|
|
46
|
Citation influence for journal aggregates of scientific publications: Theory, with applications to the literature of physics
– PINSKI, NARIN
- 1976
|
|
41
|
Introduction to Informetrics
– Egghe, Rousseau
- 1990
|
|
39
|
The structure of scientific literatures I: Identifying and graphing specialties
– SMALL, GRIFFITH
- 1974
|
|
16
|
lawfulness on the electronic frontier
– Pitkow, Pirolli, et al.
- 1997
|
|
14
|
An input-output approach to clique identification
– Hubbell
- 1965
|
|
13
|
Cocited author mapping as a valid representation of intellectual structure
– McCain
- 1986
|
|
8
|
The analysis of square matrices of scientometric transactions
– Price
- 1981
|
|
8
|
An improved method for analyzing square scientometric transaction matrices
– Noma
- 1982
|
|
5
|
Connectivity Server: Fast Access to
– Bharat, Broder, et al.
- 1998
|
|
5
|
The synthesis of specialty narratives from co-citation clusters
– SMALL
- 1986
|
|
4
|
Mathematical relations between impact factors and average number of citations
– EGGHE
- 1988
|
|
4
|
Algebraic connectivity of graphs
– Fielder
- 1973
|
|
4
|
Joint-space analysis of ‘pick-any’ data: Analysis of choices from an unconstrained set of alternatives
– LEVINE
- 1979
|
|
4
|
Subject and citation indexing. Part I: The clustering structure of composite representations in the cystic fibrosis document collection
– Shaw
- 1991
|
|
4
|
Subject and citation indexing. Part II: The optimal, cluster-based retrieval performance of composite representations
– Shaw
- 1991
|
|
3
|
Measuring the Relative Standing of Disciplinary Journals
– Doreian
- 1988
|
|
3
|
A measure of standing for citation networks within a wider environment
– DOREIAN
- 1994
|
|
3
|
Co-citation analysis and the invisible college
– NOMA
- 1984
|
|
2
|
Flowinterception problems, Facility Location
– Berman, Hodgson, et al.
- 1995
|
|
1
|
Upfal “Web search using automated classification,” poster at
– Chekuri, Goldwasser, et al.
- 1997
|
|
1
|
On the citation influence methodology of Pinski and
– Geller
|