• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

DMCA

P.: Automatic meaning discovery using google (2004)

Cached

  • Download as a PDF

Download Links

  • [www.cs.montana.edu]
  • [labs.rightnow.com]
  • [www.cs.montana.edu]
  • [www.cs.montana.edu]
  • [arxiv.org]
  • [homepages.cwi.nl]
  • [homepages.cwi.nl]
  • [www.cwi.nl]
  • [homepages.cwi.nl]
  • [homepages.cwi.nl]
  • [drops.dagstuhl.de]
  • [studia.elka.pw.edu.pl]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Rudi Cilibrasi , Paul Vitanyi
Venue:Centrum Wiskunde & Informatica (CWI
Citations:41 - 3 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@TECHREPORT{Cilibrasi04p.:automatic,
    author = {Rudi Cilibrasi and Paul Vitanyi},
    title = {P.: Automatic meaning discovery using google},
    institution = {Centrum Wiskunde & Informatica (CWI},
    year = {2004}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

We have found a method to automatically extract the meaning of words and phrases from the world-wide-web using Google page counts. The approach is novel in its unrestricted problem domain, simplicity of implementation, and manifestly ontological underpinnings. The world-wide-web is the largest database on earth, and the latent semantic context information entered by millions of independent users averages out to provide automatic meaning of useful quality. We demonstrate positive correlations, evidencing an underlying semantic structure, in both numerical symbol notations and number-name words in a variety of natural languages and contexts. Next, we demonstrate the ability to distinguish between colors and numbers, and to distinguish between 17th century Dutch painters; the ability to understand electrical terms, religious terms, emergency incidents, and we conduct a massive experiment in understanding WordNet categories; the ability to do a simple automatic English-Spanish translation. 1

Keyphrases

automatic meaning discovery    useful quality    unrestricted problem domain    understanding wordnet category    ontological underpinnings    automatic meaning    natural language    simple automatic english-spanish translation    underlying semantic structure    google page count    emergency incident    numerical symbol notation    massive experiment    number-name word    17th century dutch painter    electrical term    religious term    positive correlation    latent semantic context information    independent user   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University