Results 1 -
9 of
9
Extracting Macroscopic Information from Web Links
- Journal of the American Society for Information Science and Technology
, 2001
"... Much has been written about the potential and pitfalls of macroscopic web-based link analysis, yet there have been no studies that have provided clear statistical evidence that any of the proposed calculations can produce results over large areas of the web that correlate with phenomena external to ..."
Abstract
-
Cited by 51 (28 self)
- Add to MetaCart
Much has been written about the potential and pitfalls of macroscopic web-based link analysis, yet there have been no studies that have provided clear statistical evidence that any of the proposed calculations can produce results over large areas of the web that correlate with phenomena external to the Internet. This article attempts to provide such evidence through an evaluation of Ingwersen’s (1998) proposed external Web Impact Factor (WIF) for the original use of the web: the interlinking of academic research. In particular, it studies the case of the relationship between academic hyperlinks and research activity for universities in Britain, a country chosen for its variety of institutions and the existence of an official government rating exercise for research. After reviewing the numerous reasons why link counts may be unreliable, it demonstrates that four different WIFs do, in fact, correlate with the conventional academic research measures. The WIF delivering the greatest correlation with research rankings was the ratio of web pages with links pointing at research-based pages to faculty numbers. The scarcity of links to electronic academic papers in the data set suggests that, in contrast to citation analysis, this WIF is measuring the reputations of universities and their scholars, rather than the quality of their publications.
A comparison of sources of Links for academic Web Impact Factor Calculations
- Journal of Documentation
, 2002
"... There has been much recent interest in extracting information from collections of web links. One tool that has been used is Ingwersen’s Web Impact Factor. It has been demonstrated that several versions of this metric can produce results that correlate with research ratings of British universities sh ..."
Abstract
-
Cited by 18 (10 self)
- Add to MetaCart
There has been much recent interest in extracting information from collections of web links. One tool that has been used is Ingwersen’s Web Impact Factor. It has been demonstrated that several versions of this metric can produce results that correlate with research ratings of British universities showing that, despite being a measure of a purely Internet phenomenon, the results are susceptible to a wider interpretation. This paper addresses the question of which is the best possible domain to count backlinks from, if research is the focus of interest. WIFs for British universities calculated from several different source domains are compared, primarily the.edu,.ac.uk and.uk domains, and the entire web. The results show that all four areas produce WIFs that correlate strongly with research ratings, but that none produce incontestably superior figures. It was also found that the WIF was less able to differentiate in more homogenous subsets of universities, although positive results are still possible.
Methodologies for Crawler Based Web Surveys
, 2002
"... There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and analysed, each justifiable in its own right, but a simple experiment is presented that demonstrates concrete differences ..."
Abstract
-
Cited by 13 (6 self)
- Add to MetaCart
There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and analysed, each justifiable in its own right, but a simple experiment is presented that demonstrates concrete differences between them. The concept of crawling the web also bears further inspection, including the scope of the pages to crawl, the method used to access and index each page, and the algorithm for the identification of duplicate pages. The issues involved here will be well-known to many computer scientists but, with the increasing use of crawlers and search engines in other disciplines, they now require a public discussion in the wider research community. This paper concludes that any scientific attempt to crawl the web must make available the parameters under which it is operating so that researchers can, in principle, replicate experiments or be aware of and take into account differences between methodologies. A new hybrid random page selection methodology is also introduced.
A research and institutional size based model for national university web site interlinking
- Journal of Documentation
, 2002
"... Web links are a phenomenon of interest to bibliometricians by analogy with citations, and to others because of their use in web navigation and search engines. It is known that very few links on university web sites are targeted at scholarly expositions and yet, at least in the UK and Australia, a co ..."
Abstract
-
Cited by 11 (9 self)
- Add to MetaCart
Web links are a phenomenon of interest to bibliometricians by analogy with citations, and to others because of their use in web navigation and search engines. It is known that very few links on university web sites are targeted at scholarly expositions and yet, at least in the UK and Australia, a correlation has been established between link count metrics for universities and measures of institutional research. This paper operates on a finer-grained level of detail, focussing on counts of links between pairs of universities. It provides evidence of an underlying linear relationship with the quadruple product of the size and research quality of both source and target institution. This simple model is proposed as applying generally to national university systems, subject to a series of constraints to identify cases where it is unlikely to be applicable. It is hoped that the model, if confirmed by studies of other countries, will open the door to deeper mining of academic web link data.
An initial exploration of the link relationship between UK university Web sites
- ASLIB Proceedings
"... Aggregates of links are of interest to information scientists in the same way as citation counts are: as potential sources of data from which new knowledge can be mined. The recent discovery of a correlation between a web link count measure and the research quality of British universities is built u ..."
Abstract
-
Cited by 6 (4 self)
- Add to MetaCart
Aggregates of links are of interest to information scientists in the same way as citation counts are: as potential sources of data from which new knowledge can be mined. The recent discovery of a correlation between a web link count measure and the research quality of British universities is built upon by applying a range of multivariate statistical techniques to counts of links between pairs of universities. This represents an initial attempt at developing an understanding of this phenomenon. Plausible results are extracted, including the high degree of similarity between the Scottish universities and limited evidence of a dichotomy between new and traditional universities. Outliers in the data were also identified by the techniques, some of which were verified by being tracked down to identifiable web phenomena. This is an important outcome because successful anomaly identification is a precondition to more effective analysis of this kind of data. The identification of groupings is encouraging evidence that web links between universities can be mined for significant results, although it is clear that more methodological development is needed if any but the simplest patterns are to be extracted. Finally, based upon the types of patterns extracted it is argued that none of the methods used are capable of fully analysing link structures on their own.
The Web Impact Factor: a critical review
- The Electronic Library
"... Purpose – We analyse the link-based web site impact measure known as the Web Impact Factor (WIF). It is a quantitative tool for evaluating and ranking web sites, top-level domains and sub-domains. We also discuss the WIF's advantages and disadvantages, data collection problems, and validity and reli ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
Purpose – We analyse the link-based web site impact measure known as the Web Impact Factor (WIF). It is a quantitative tool for evaluating and ranking web sites, top-level domains and sub-domains. We also discuss the WIF's advantages and disadvantages, data collection problems, and validity and reliability of WIF results. Design/methodology/approach – A key to webometric studies has been the use of largescale search engines, such as Yahoo and AltaVista that allow measurements to be made of the total number of pages in a web site and the total number of backlinks to the web site. These search engines provide similar possibilities for the investigation of links between web sites/pages to those provided by the academic journals citation databases from the Institute of Scientific Information (ISI). But the content of the Web is not of the same nature and quality as the databases maintained by the ISI. Findings – This paper reviews how the WIF has been developed and applied. It has been suggested that Web Impact Factors can be calculated as a way of comparing the attractiveness of web sites or domains on the Web. It is concluded that, while the WIF is arguably useful for quantitative intra-country comparison, application beyond this (i.e., to inter-country assessment) has little value. Originality/value – The paper attempts to make a critical review over literature on the WIF and associated indicators.
Commercial web site links
- Internet Research: Electronic Networking Applications and Policy
"... Every hyperlink pointing at a web site is a potential source of new visitors, especially one near the top of a results page from a popular search engine. The order of the links in a search results page is often decided upon by an algorithm that takes into account the number and quality of links to a ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Every hyperlink pointing at a web site is a potential source of new visitors, especially one near the top of a results page from a popular search engine. The order of the links in a search results page is often decided upon by an algorithm that takes into account the number and quality of links to all matching pages. The number of standard links targeted at a site is therefore doubly important, yet little research has touched on the actual interlinkage between business web sites, which numerically dominate the web. This paper discusses business use of the web and related search engine design issues as well as research on general and academic links before reporting on a survey of the links published by a relatively random collection of business web sites. The results indicate that around 66 % of web sites do carry external links, most of which are targeted at a specific purpose, but that about 17 % publish general links, with implications for those designing and marketing web sites.
human intervention
"... A publicly accessible database of UK university website links and a discussion of the need for ..."
Abstract
- Add to MetaCart
A publicly accessible database of UK university website links and a discussion of the need for

