Results 1 - 10 of 99
Multi-Faceted Information Retrieval System for Large Scale Email Archives
"... We profile a system for search and analysis of large-scale email archives. The system builds around four facets: content-based search engine, statistical topic model, automatically inferred social networks and time-series analysis. The facets correspond to the types of information available in email ..."
The Science of Large-Scale Information Retrieval
- Internet Archive 2000 Colloquium
"... Abstract: Information scientists have investigated potentially useful methods for Web search engines and archives, yet relatively few of these methods are in active use. These scientists tend to operate without the imperative for large-scale speed and performance, and may not fully evaluate their in ..."
Cited by 1 (1 self)
their innovations. This work presents some promising approaches to information retrieval with suggestions for the practicality of applying them to contemporary large-scale retrieval systems. It is suggested that Web search engines and archives may benefit from incorporating these approaches. For information
Supporting Collaboration by Large Scale Email Analysis
"... Abstract. Email has become the most widespread Internet application. It is a tool supporting not only communication but also cooperation, task management, archiving, or information and knowledge management. Furthermore Email is a source of information on personal, job or community network of an indi ..."
Cited by 7 (6 self)
of information retrieval and semantic annotation. We describe an experiment on the Hadoop distributed architecture designed to process large email archives. Key words: email, social network, information extraction, metadata
Spatialized Browsing in Large Data Archives
2000
"... Exponentially growing data archives emphasize the need for efficient techniques and novel approaches to find and extract information. Information visualization has emerged in the Information Retrieval domain to facilitate access to large databases. This development acknowledges the need to focus on ..."
Cited by 17 (2 self)
Indexing and Retrieval of Broadcast News
- Speech Communication, 2000
"... This paper describes a spoken document retrieval (SDR) system for British and North American Broadcast News. The system is based on a connectionist large vocabulary speech recognizer and a probabilistic information retrieval system. We discuss the development of a realtime Broadcast News speech r ..."
Cited by 33 (7 self)
, and we discuss the application of these developments to a large scale SDR task based on an archive of British English broadcast news. Keywords: Spoken Document Retrieval; Information Retrieval; Broadcast Speech; Large Vocabulary Speech Recognition. 1 Introduction Retrieval of audio segments
Rendering an Archive in Three Dimensions
2003
"... We examine the requirements for a publicly accessible, online collection of three-dimensional biomedical image data, including those yielded by radiological processes such as MRI, ultrasound and others. Intended as a repository and distribution mechanism for such medical data, we created the Nationa ..."
the National Online Volumetric Archive (NOVA) as a case study aimed at identifying the multiple issues involved in realizing a large-scale digital archive. In the paper we discuss such factors as the current legal and health information privacy policy affecting the collection of human medical images, retrieval
Austrian On-Line Archive Processing: Analyzing Archives of the World Wide Web
- In Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2002), 2002
"... With the popularity of the World Wide Web and the recognition of its worthiness of being archived we find numerous projects aiming at creating large-scale repositories containing excerpts and snapshots of Web data. Interfaces are being created that allow users to surf through time, analyzing the evo ..."
Cited by 14 (4 self)
Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
2013
"... Without doubt general video and sound, as found in large multimedia archives, carry emotional information. Thus, audio and video retrieval by certain emotional categories or dimensions could play a central role for tomorrow’s intelligent systems, enabling search for movies with a particular mood, co ..."
Cited by 1 (0 self)
Intelligent Parsing of Scanned Volumes for Web Based Archives
"... The proliferation of digital libraries and the large amount of existing documents raise important issues in efficient handling of documents. Printed texts in documents need to be converted into digital format and semantic information needs to be parsed and managed for effective retrieval. In this wor ..."
A Large-scale Study of Automated Web Search Traffic
"... As web search providers seek to improve both relevance and response times, they are challenged by the ever-increasing tax of automated search query traffic. Third party systems interact with search engines for a variety of reasons, such as monitoring a website's rank, augmenting online games, o ..."
, or as behavioral patterns of automated interactions. We believe these features formulate a basis for a production-level query stream classifier. ... of research. One such dimension is adversarial information retrieval. Popular challenges in this arena are email spam and web link spam. Email spam is designed to return