MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Extending, Trimming and Fusing WordNet for Technical Documents (2001) [11 citations — 0 self]

by Piek Vossen
NAACL 2001 workshop on WordNet and Other Lexical Resources, Pittsbourgh
Add To MetaCart

Abstract:

This paper describes a tool for the automatic extension and trimming of a multilingual WordNet database for cross-lingual retrieval and multilingual ontology building in intranets and domain-specific document collections. Hierarchies, built from automatically extracted terms and combined with the WordNet relations, are trimmed with a disambiguation method based on the document salience of the words in the glosses. The disambiguation is tested in a cross-lingual retrieval task, showing considerable improvement (7%-11%). The condensed hierarchies can be used as browse-interfaces to the documents complementary to retrieval.

Citations

167 Technical Terminology: Some Linguistic Properties and an Algorithm for Identification in Text, Natural Language Engineering – Justeson, Katz - 1995
143 Deriving concept hierarchies from text – Sanderson, Croft - 1999
136 One Sense per Discourse – Gale, Church, et al. - 1992
128 EuroWordNet: A Multilingual Database with Lexical Semantic Networks – Vossen - 1998
47 A method for word sense disambiguation of unrestricted text – Mihalcea, Moldovan - 1999
21 Natural language processing and information retrieval – VOORHEES - 1999
19 SQLET: Short query linguistic expansion techniques, palliating one-word queries by providing intermediate structure to text – Grefenstette - 1997
17 Towards a Universal Index of Meaning – VOSSEN, PETERS, et al. - 1999
14 Projecting Corpusbased Semantic Links on a Thesaurus – Morin, Jacquemin - 1999
1 Automatic Sense Clustering – Peters, Peters, et al. - 1998