Extending, Trimming and Fusing WordNet for Technical Documents (2001) [11 citations — 0 self]
Abstract:
This paper describes a tool for the automatic extension and trimming of a multilingual WordNet database for cross-lingual retrieval and multilingual ontology building in intranets and domain-specific document collections. Hierarchies, built from automatically extracted terms and combined with the WordNet relations, are trimmed with a disambiguation method based on the document salience of the words in the glosses. The disambiguation is tested in a cross-lingual retrieval task, showing considerable improvement (7%-11%). The condensed hierarchies can be used as browse-interfaces to the documents complementary to retrieval.
Citations
| 167 | Technical Terminology: Some Linguistic Properties and an Algorithm for Identification in Text, Natural Language Engineering – Justeson, Katz - 1995 |
| 143 | Deriving concept hierarchies from text – Sanderson, Croft - 1999 |
| 136 | One Sense per Discourse – Gale, Church, et al. - 1992 |
| 128 | EuroWordNet: A Multilingual Database with Lexical Semantic Networks – Vossen - 1998 |
| 47 | A method for word sense disambiguation of unrestricted text – Mihalcea, Moldovan - 1999 |
| 21 | Natural language processing and information retrieval – VOORHEES - 1999 |
| 19 | SQLET: Short query linguistic expansion techniques, palliating one-word queries by providing intermediate structure to text – Grefenstette - 1997 |
| 17 | Towards a Universal Index of Meaning – VOSSEN, PETERS, et al. - 1999 |
| 14 | Projecting Corpusbased Semantic Links on a Thesaurus – Morin, Jacquemin - 1999 |
| 1 | Automatic Sense Clustering – Peters, Peters, et al. - 1998 |

