Abstract:
The field of information retrieval has traditionally focused on textbases consisting of titles and abstracts. As a consequence, many underlying assumptions must be altered for retrieval from full-length text collections. This paper argues for making use of text structure when retrieving from full text documents, and presents a visualization paradigm, called TileBars, that demonstrates the usefulness of explicit term distribution information in Boolean-type queries. TileBars simultaneously and compactly indicate relative document length, query term frequency, and query term distribution. The patterns in a column of TileBars can be quickly scanned and deciphered, aiding users in making judgments about the potential relevance of the retrieved documents. KEYWORDS: Information retrieval, Full-length text, Visualization. INTRODUCTION Information access systems have traditionally focused on retrieval of documents consisting of titles and abstracts. As a consequence, the underlying assumpt...
Citations
|
988
|
Automatic Text Processing -- The Transformation, Analysis, and Retrieval of Information by Computer Addison-Wesley
– Salton
- 1989
|
|
709
|
The Visual Display of Quantitative Information
– Tufte
- 2001
|
|
225
|
Information visualization using 3D interactive animation
– Robertson, Card, et al.
- 1993
|
|
219
|
Multi-paragraph segmentation of expository text
– Hearst
- 1994
|
|
149
|
Overview of the first text retrieval conference (TREC-1
– Harman
- 1992
|
|
128
|
Edit wear and read wear
– Hill, Hollan, et al.
- 1992
|
|
120
|
An X11 toolkit based on the tcl language
– Ousterhout
- 1991
|
|
113
|
An Information System for Corporate Users: Wide Area Information Servers, Thinking Machines technical report TMC-99
– Kahle
- 1991
|
|
107
|
Constant interaction-time Scatter/Gather browsing of large document collections
– Cutting, Karger, et al.
- 1993
|
|
95
|
InfoCrystal: A Visual Tool for Information Retrieval and Management
– Spoerri
- 1993
|
|
73
|
Formative Design-Evaluation of SuperBook
– Egan, Remde, et al.
- 1989
|
|
48
|
Automating the Design of Graphical Presentations
– Mackinlay
- 1986
|
|
46
|
Understanding charts and graphs
– Kosslyn
- 1989
|
|
45
|
To see, or not to see - Is that the query
– Korfhage
- 1991
|
|
38
|
Context And Structure In Automated Full-Text Information Access". Doctor of Philosophy Thesis
– Hearst
- 1994
|
|
36
|
Efficient retrieval of partial documents
– Zobel, Moffat, et al.
- 1995
|
|
35
|
An Object-Oriented Architecture for Text Retrieval
– Cutting, Pedersen, et al.
- 1991
|
|
27
|
Value Bars: An information visualization and navigation tool for multi-attribute listings
– Chimera
- 1992
|
|
24
|
Text retrieval and inference
– Croft, B, et al.
- 1992
|
|
23
|
Rules and principles of scientific data visualization
– Senay, Ignatius
- 1996
|
|
19
|
A performance evaluation of similarity measures, document term weighting schemes and representations in a boolean environment
– Noreault, McGill, et al.
- 1981
|
|
18
|
Semiology of Graphics, University of Wisconsin Press
– Bertin
- 1983
|
|
14
|
Optimizing Document Indexing and Search Term Weighting Based on Probabilistic Models
– Fuhr, Buckley
- 1993
|
|
11
|
Querying a hypertext information retrieval system by the use of classi cation
– Aboud, Chrisment, et al.
- 1990
|
|
11
|
Concept-based retrieval of hypermedia information -- from term indexing to semantic hyperindexing
– Arents, Bogaerts
- 1993
|
|
10
|
Probabilistic retrieval in the TIPSTER collections: An application of staged logistic regression
– Cooper, Gey, et al.
- 1994
|
|
7
|
Practical enhanced Boolean retrieval
– Fox, Koll
- 1988
|
|
4
|
Per-Kristian Halvorsen, and Meg Withgott. Information theater versus information refinery
– Cutting
- 1990
|
|
3
|
An investigation of term distribution effects on individual queries
– Hearst
- 1995
|