• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

DMCA

Silk from a sow’s ear: Extracting usable structures from the web (1996)

Cached

  • Download as a PDF

Download Links

  • [www.parc.xerox.com]
  • [www2.parc.com]
  • [www2.parc.com]
  • [www2.parc.com]
  • [arbor.ee.ntu.edu.tw]
  • [smartech.gatech.edu]
  • [www2.parc.com]
  • [smartech.gatech.edu]
  • [smartech.gatech.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Peter Pirolli , James Pitkow , Ramana Rao
Citations:267 - 9 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@INPROCEEDINGS{Pirolli96silkfrom,
    author = {Peter Pirolli and James Pitkow and Ramana Rao},
    title = {Silk from a sow’s ear: Extracting usable structures from the web},
    booktitle = {},
    year = {1996},
    pages = {118--125},
    publisher = {ACM Press}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

In its current implementation, the World-Wide Web lacks much of the explicit structure and strong typing found in many closed hypertext systems. While this property probably relates to the explosive acceptance of the Web, it further complicates the already difficult problem of identifying usable structures and aggregates in large hypertext collections. These reduced structures, or localities, form the basis for simplifying visualizations of and navigation through complex hypertext systems. Much of the previous research into identifying aggregates utilize graph theoretic algorithms based upon structural topology, i.e., the linkages between items. Other research has focused on content analysis to form document collections. This paper presents our exploration into techniques that utilize both the topology and textual similarity between items as well as usage data collected by servers and page meta-information lke title and size. Linear equations and spreading activation models are employed to arrange Web pages based upon functional categories, node types, and relevancy.

Keyphrases

extracting usable structure    sow ear    world-wide web    document collection    linear equation    explosive acceptance    reduced structure    explicit structure    previous research    functional category    content analysis    textual similarity    difficult problem    strong typing    aggregate utilize graph theoretic algorithm    hypertext system    node type    structural topology    current implementation    activation model    meta-information lke title    large hypertext collection    complex hypertext system    usable structure    web page    usage data   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University