• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 2,119
Next 10 →

Table 6: Crawler CBMG.

in Analyzing Web Robots and Their Impact on Caching
by Virglio Almeida, Daniel Menascé, Rudolf Riedi, Flávia Peligrinelli, Rodrigo Fonseca, Wagner Meira, Jr. 2001
"... In PAGE 7: ... The behavior indicates that the ShopBots perform (almost) exclusively searches, and that usually they have long sessions. Table6 shows the averaged CBMG for the various crawlers. The rst observation is the much broader pool of states that are visited.... ..."
Cited by 3

Table 1. Focused Crawler Evaluation

in Domain-specific web site identification: The crossmarc focused web crawler
by Konstantinos Stamatakis, Vangelis Karkaletsis, Georgios Paliouras 2003
"... In PAGE 3: ...We performed a number of experiments, the results of which appear in Table1 . The first two rows show results for Version 1 with two sets of start points in the search engine hierarchies, general points relating to computer hardware retailers and more narrow points relating to notebooks and laptops.... ..."
Cited by 4

Table 4: Sources of information used to initialize the crawler

in Crawling the Hidden Web
by Sriram Raghavan, Hector Garcia-molina 2001
"... In PAGE 21: ...Table 4: Sources of information used to initialize the crawler To configure the crawler, we required a target set of sites as well as some information to initialize the LVS table. Table4 lists the online sources we used to generate some of the basic LVS entries required by the crawler. These entries included partial lists of names of semiconductor manufacturing companies as well as list of sub-sectors (or areas) within the semiconductor industry.... In PAGE 21: ... These entries included partial lists of names of semiconductor manufacturing companies as well as list of sub-sectors (or areas) within the semiconductor industry. The first two sources listed in Table4 were (manually) used only once, to extract information for explicit initialization. The remaining two sources were wrapped by custom wrappers to interface with the LVS manager and automatically provide values at run-time.... ..."
Cited by 124

Table 9: Performance comparison between crawlers.

in The Viuva Negra crawler
by Daniel Gomes, Mário J. Silva 2006
"... In PAGE 28: ... 7.2 Performance comparison Table9 presents a performance comparison between the previously analyzed crawlers extracted from published results. This comparison must be taken with a grain of salt because the experiments were run using different setups and in different periods of time.... In PAGE 29: ... The speed of the connection to the Internet must be considered to analyze crawling performance. The 3rd, 4th and 5th columns of Table9 show that the most performant crawlers used the fastest connections to the Internet. So, this might be also a reason why VN presented a lower download rate than the most performant crawlers.... In PAGE 29: ... In the %dups. column of Table9 , we present the percentage of duplicates harvested by the crawlers. The results show that our efforts to minimize the download of duplicates, saving on bandwidth and storage space, yield good results in practice.... In PAGE 29: ... A crawler should also minimize the number of visits to URLs that do not reference a downloadable content. The downloads/URLs column of Table9 presents the ratio between the number of downloads and the URLs visited. VN was configured as a focused crawler of the Portuguese web and discarded contents considered irrelevant.... ..."

Table 7: Crawler Function Distribution and Visits

in Analyzing Web Robots and Their Impact on Caching
by Virglio Almeida, Daniel Menascé, Rudolf Riedi, Flávia Peligrinelli, Rodrigo Fonseca, Wagner Meira, Jr. 2001
Cited by 3

TABLE II MEDIA CRAWLER STARTING PAGES

in Characteristics of Streaming Media Stored on the Internet
by Mingzhe Li, Mark Claypool, Robert Kinicki, James Nichols 2003
Cited by 1

TABLE II MEDIA CRAWLER STARTING PAGES

in Characteristics of Streaming Media Stored on the Internet
by Mingzhe Li, Mark Claypool, Robert Kinicki, James Nichols 2003
Cited by 1

Table 2: Top-Five Crawlers

in Workload-Aware Web Crawling and Server Workload Detection
by Shaozhi Ye, Guohan Lu, Xing Li 2004
Cited by 1

Table VII. Performance comparison between crawlers.

in The Viúva Negra crawler:
by Daniel Gomes, Mário J. Silva

Table 1 The main weblog crawler algorithm 1

in Clustering and Retrieval models. General Terms
by Ying Zhou, Joseph Davis
"... In PAGE 2: ... To construct the blogspace, we can repeat the basic algorithm for each distinct weblogs appear in the target end of the edges discovered so far. Table1 gives the crawling algorithm. In implementation, step 1 and step 2 are executed in pipeline style for efficiency and a variable depth is used to control the size of the blogspace.... ..."
Next 10 →
Results 1 - 10 of 2,119
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University