The Evolution of the Web and Implications for an Incremental Crawler

by Junghoo Cho, Hector Garcia-Molina, 1999
"... In this paper we study how to build an effective incremental crawler. The crawler selectively and incrementally updates its index and/or local collection of web pages, instead of periodically refreshing the collection in batch mode. The incremental crawler can improve the "freshness" of th ..."
Cited by 281 (18 self)
"freshness" of the collection significantly and bring in new pages in a more timely manner. We first present results from an experiment conducted on more than half million web pages over 4 months, to estimate how web pages evolve over time. Based on these experimental results, we compare various design choices

An Experimental Comparison of Click Position-Bias Models

by Nick Craswell, Onno Zoeter, Michael Taylor, Bill Ramsey, 2008
"... Search engine click logs provide an invaluable source of relevance information, but this information is biased. A key source of bias is presentation order: the probability of click is influenced by a document’s position in the results page. This paper focuses on explaining that bias, modelling how p ..."
Cited by 193 (1 self)

WebMate: A Personal Agent for Browsing and Searching

by Liren Chen, Katia Sycara - In Proceedings of the Second International Conference on Autonomous Agents, 1998
"... The World-Wide Web is developing very fast. Currently, finding useful information on the Web is a time consuming process. In this paper, we present WebMate, an agent that helps users to effectively browse and search the Web. WebMate extends the state of the art in Web-based information retrieval in ..."
Cited by 239 (10 self)
provide multiple pages as similarity/relevance guidance for the search. The system extracts and combines relevant keywords from these relevant pages and uses them for keyword refinement. Using these techniques, WebMate provides effective browsing and searching help and also compiles and sends to users

Machine-independent virtual memory management for paged uniprocessor and multiprocessor architectures

by Richard Rashid, Avadis Tevanian, Michael Young, David Golub, Robert Baron, David Black, William Bolosky, Jonathan Chew - IEEE Transactions on Computers, 1988
"... This paper describes the design and implementation of virtual memory management within the CMU Mach Operating System and the experiences gained by the Mach kernel group in porting that system to a variety of architectures. As of this writing, Mach runs on more than half a dozen uniprocessors and mul ..."
Cited by 190 (10 self)
and multiprocessors including the VAX family of uniprocessors and multiprocessors, the IBM RT PC, the SUN 3, the Encore MultiMax, the Sequent Balance 21000 and several experimental computers. Although these systems vary considerably in the kind of hardware support for memory management they provide, the machine

Optimizing the migration of virtual computers

by Constantine P Sapuntzakis, Ramesh Chandra, Ben Pfaff, Jim Chow, Monica S Lam, Mendel Rosenblum - In Proceedings of the 5th Symposium on Operating Systems Design and Implementation, 2002
"... This paper shows how to quickly move the state of a running computer across a network, including the state in its disks, memory, CPU registers, and I/O devices. We call this state a capsule. Capsule state is hardware state, so it includes the entire operating system as well as applications ..."
Cited by 238 (5 self)
techniques to reduce the amount of data sent over the network: copy-on-write disks track just the updates to capsule disks, "ballooning" zeros unused memory, demand paging fetches only needed blocks, and hashing avoids sending blocks that already exist at the remote end. We demonstrate

Towards Adaptive Web Sites: Conceptual Framework and Case Study

by Mike Perkowitz, Oren Etzioni - Artificial Intelligence, 2000
"... The creation of a complex web site is a thorny problem in user interface design. In this paper we explore the notion of adaptive web sites: sites that semi-automatically improve their organization and presentation by learning from visitor access patterns. It is easy to imagine and implement web sit ..."
Cited by 198 (4 self)
that facilitate navigation of a web site. We present the PageGather algorithm, which automatically identifies candidate link sets to include in index pages based on user access logs. We demonstrate experimentally that PageGather outperforms the Apriori data mining algorithm on this task. In addition, we compare

Learning Algorithms for Keyphrase Extraction

by Peter D. Turney - Information Retrieval, 2000
"... Many academic journals ask their authors to provide a list of about five to fifteen keywords, to appear on the first page of each article. Since these key words are often phrases of two or more words, we prefer to call them keyphrases. There is a wide variety of tasks for which keyphrases are useful ..."
Cited by 213 (3 self)

The Pyramid-Technique: Towards Breaking the Curse of Dimensionality

by Stefan Berchtold, Christian Böhm, Hans-Peter Kriegel, 1998
"... In this paper, we propose the Pyramid-Technique, a new indexing method for high-dimensional data spaces. The Pyramid-Technique is highly adapted to range query processing using the maximum metric L_max. In contrast to all other index structures, the performance of the Pyramid-Technique does not dete ..."
Cited by 208 (2 self)
second step, the single pyramids are cut into slices parallel to the basis of the pyramid. These slices form the data pages. Furthermore, we show that this partition provides a mapping from the given d-dimensional space to a 1-dimensional space. Therefore, we are able to use a B+-tree to manage

From user access patterns to dynamic hypertext linking

by Tak Woon Yan, Matthew Jacobsen, Hector Garcia-Molina, Umeshwar Dayal, 1996
"... This paper describes an approach for automatically classifying visitors of a web site according to their access patterns. User access logs are examined to discover clusters of users that exhibit similar information needs; e.g., users that access similar pages. This may result in a better understandi ..."
Cited by 194 (2 self)

Protocol independent multicast-sparse mode (PIM-SM): Protocol specification

by D. Farinacci, A. Helmy, D. Thaler, S. Deering, M. Handley, V. Jacobson, C. Liu, P. Sharma, L. Wei, 1998
"... This memo defines an Experimental Protocol for the Internet community. This memo does not specify an Internet standard of any kind. Discussion and suggestions for improvement are requested. Distribution of this memo is unlimited. Acknowledgements The author list has been reordered to reflect the inv ..."
Cited by 163 (13 self)
RFC 2117, PIM-SM, June 1997. Introduction: This document describes a protocol for efficiently routing to multicast groups that may span wide-area (and inter-domain) internets. We refer to the approach as Protocol Independent Multicast--Sparse Mode (PIM
