• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 99
Next 10 →

Multi-Faceted Information Retrieval System for Large Scale Email Archives

by Jukka Perki Ville, Ville Tuulos, Wray Buntine
"... We profile a system for search and analysis of largescale email archives. The system builds around four facets: Content-based search engine, statistical topic model, automatically inferred social networks and time-series analysis. The facets correspond to the types of information available in email ..."
Abstract - Add to MetaCart
We profile a system for search and analysis of largescale email archives. The system builds around four facets: Content-based search engine, statistical topic model, automatically inferred social networks and time-series analysis. The facets correspond to the types of information available in email

The Science of Large-Scale Information Retrieval

by Gregory B. Newby - Internet Archive 2000 Colloquium
"... Abstract: Information scientists have investigated potentially useful methods for Web search engines and archives, yet relatively few of these methods are in active use. These scientists tend to operate without the imperative for large-scale speed and performance, and may not fully evaluate their in ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
their innovations. This work presents some promising approaches to information retrieval with suggestions for the practicality of applying them to contemporary large-scale retrieval systems. It is suggested that Web search engines and archives may benefit from incorporating these approaches. For information

Supporting Collaboration by Large Scale Email Analysis *

by Michal Laclavík, Martin Šeleng, Marek Ciglan, Ladislav Hluchý
"... Abstract. Email has become the most widespread Internet application. It is a tool supporting not only communication but also cooperation, task management, archiving, or information and knowledge management. Furthermore Email is a source of information on personal, job or community network of an indi ..."
Abstract - Cited by 7 (6 self) - Add to MetaCart
of information retrieval and semantic annotation. We describe an experiment on the Hadoop distributed architecture designed to process large email archives. Key words: email, social network, information extraction, metadata 1

Spatialized Browsing in Large Data Archives

by Sara I Fabrikant , 2000
"... Exponentially growing data archives emphasize the need for efficient techniques and novel approaches to find and extract information. Information visualization has emerged in the Information Retrieval domain to facilitate access to large databases. This development acknowledges the need to focus on ..."
Abstract - Cited by 17 (2 self) - Add to MetaCart
Exponentially growing data archives emphasize the need for efficient techniques and novel approaches to find and extract information. Information visualization has emerged in the Information Retrieval domain to facilitate access to large databases. This development acknowledges the need to focus

Indexing and Retrieval of Broadcast News

by Steve Renals, Dave Abberley, David Kirby, Tony Robinson - Speech Communication , 2000
"... This paper describes a spoken document retrieval (SDR) system for British and North American Broadcast News. The system is based on a connectionist large vocabulary speech recognizer and a probabilistic information retrieval system. We discuss the development of a realtime Broadcast News speech r ..."
Abstract - Cited by 33 (7 self) - Add to MetaCart
, and we discuss the application of these developments to a large scale SDR task based on an archive of British English broadcast news. Keywords: Spoken Document Retrieval; Information Retrieval; Broadcast Speech; Large Vocabulary Speech Recognition. 1 Introduction Retrieval of audio segments

Rendering an Archive in Three Dimensions

by David Leiman Claire, David A. Leiman *a, Claire Twose B, Teresa Y. H. Lee C, Alex Fletcher D, Terry S. Yoo E , 2003
"... We examine the requirements for a publicly accessible, online collection of three-dimensional biomedical image data, including those yielded by radiological processes such as MRI, ultrasound and others. Intended as a repository and distribution mechanism for such medical data, we created the Nationa ..."
Abstract - Add to MetaCart
the National Online Volumetric Archive (NOVA) as a case study aimed at identifying the multiple issues involved in realizing a large-scale digital archive. In the paper we discuss such factors as the current legal and health information privacy policy affecting the collection of human medical images, retrieval

Austrian On-Line Archive Processing: Analyzing Archives of the World Wide Web

by Andreas Rauber, Andreas Aschenbrenner, Oliver Witvoet - In Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2002 , 2002
"... With the popularity of the World Wide Web and the recognition of its worthiness of being archived we find numerous projects aiming at creating large-scale repositories containing excerpts and snapshots of Web data. Interfaces are being created that allow users to surf through time, analyzing the evo ..."
Abstract - Cited by 14 (4 self) - Add to MetaCart
With the popularity of the World Wide Web and the recognition of its worthiness of being archived we find numerous projects aiming at creating large-scale repositories containing excerpts and snapshots of Web data. Interfaces are being created that allow users to surf through time, analyzing

Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction

by Florian Eyben, Felix Weninger, Nicolas Lehment, Gerhard Rigoll , 2013
"... Without doubt general video and sound, as found in large multimedia archives, carry emotional information. Thus, audio and video retrieval by certain emotional categories or dimensions could play a central role for tomorrow’s intelligent systems, enabling search for movies with a particular mood, co ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Without doubt general video and sound, as found in large multimedia archives, carry emotional information. Thus, audio and video retrieval by certain emotional categories or dimensions could play a central role for tomorrow’s intelligent systems, enabling search for movies with a particular mood

Intelligent Parsing of Scanned Volumes for Web Based Archives

by Xiaonan Lu, James Z. Wang, C. Lee Giles
"... The proliferation of digital libraries and the large amount of existing documents raise important issues in efficient handling of documents. Printed texts in documents need to be converted into digital format and semantic information need to be parsed and managed for effective retrieval. In this wor ..."
Abstract - Add to MetaCart
The proliferation of digital libraries and the large amount of existing documents raise important issues in efficient handling of documents. Printed texts in documents need to be converted into digital format and semantic information need to be parsed and managed for effective retrieval

Cumulative Query Count A Large-scale Study of Automated Web Search Traffic

by Greg Buehrer, Jack W. Stokes
"... As web search providers seek to improve both relevance and response times, they are challenged by the ever-increasing tax of automated search query traffic. Third party systems interact with search engines for a variety of reasons, such as monitoring a website's rank, augmenting online games, o ..."
Abstract - Add to MetaCart
, or as behavioral patterns of automated interactions. We believe these features formulate a basis for a production-level query stream classifier. of research. One such dimension is adversarial informal retrieval. Popular challenges in this arena are email spam and web link spam. Email spam is designed to return
Next 10 →
Results 1 - 10 of 99
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University