Hypertext: The Importance of being Different (1997)
| Citations: | 7 - 0 self |
BibTeX
@MISC{Amitay97hypertext:the,
author = {Einat Amitay},
title = {Hypertext: The Importance of being Different},
year = {1997}
}
OpenURL
Abstract
This document layout approach answers the need for dealing with documents which were only processed by an OCR program. This technique also makes use of font types and bold print, identifying these as better candidates for anchors. One problem with this approach is that it depends on the linear structure of the documents and that it maintains this very same structure after the text to hypertext conversion (Myka, Argenton and Güntzer, 1996). Following Richmond, Smith and Amitay (1997), a combined approach might be taken. The authors suggest a simple algorithm for detecting subject boundaries within flat texts. The algorithm also finds topical words in each segment, enabling the detection of content and context. It also disambiguates the necessary words by allowing access to their immediate 32







