Searching for authors named "Gregory Grefenstette" – sorted by Relevance.
-
Light Parsing as Finite State Filtering
- . For a number of language processing tasks, such as information retrieval and information extraction tasks, pertinent information can be extracted from text without doing a full parse of the individual sentences. The most common restriction of the parser is to adopt a non-recursive model of the lan
- Cited by 3 (0 self) – Add To MetaCart
-
Evaluation Techniques for Automatic Semantic Extraction: Comparing Syntactic and Window Based Approaches
- As large on-line corpora become more prewlent, a number of attempts have been made to automatically extract thesaurus-like relations directly from text using knowledge poor methods. In the absence of any specific application, comparing the results of these attempts is difficult. Here we propose an e
- Cited by 34 (0 self) – Add To MetaCart
-
SQLET: Short Query Linguistic Expansion Techniques, Palliating One-Word Queries by Providing Intermediate Structure to Text
- Most people using the WWW try to find information using one or two word queries. The information retrieval systems derived from research models were designed for longer queries and do not provide an adequate response to the user's needs. On the other hand, recent advances in natural language process
- Cited by 19 (0 self) – Add To MetaCart
-
Corpus-Derived First, Second and Third-Order Word Affinities
- A number of corpus-based extraction techniques have been successfully implemented which
- Cited by 23 (1 self) – Add To MetaCart
-
Sextant: Exploring Unexplored Contexts For Semantic Extraction from Syntactic Analysis
- For a very long time, it has been considered that the only way of automatically extracting similar groups of words from a text collection for which no semantic information exists is to use document co-occurrence data. But, with robust syntactic parsers that are becoming more frequently available, sy
- Cited by 12 (1 self) – Add To MetaCart
-
Translating Chinese Romanized name into Chinese idiographic characters via corpus and web validation
- ABSTRACT. Cross-language information retrieval performance depends on the quality of the translation resources used to pass from a user’s source language query to target language documents. Translation lists of proper names are rare but vital resources for cross-language retrieval between languages
- Cited by 1 (0 self) – Add To MetaCart
-
Web as corpus
- The web, teeming as it is with language data, of all manner of varieties and languages, in vast quantity and freely available, is a fabulous linguists ’ playground. The Special Issue explores ways in which this dream is being explored. 1
- Cited by 14 (0 self) – Add To MetaCart
-
Introduction to the special issue on the web as corpus
- Cited by 49 (1 self) – Add To MetaCart
-
What is a word, What is a sentence? Problems of Tokenization
- Any linguistic treatment of freely occurring text must provide an answer to what is considered as a token. In artificial languages, the definition of what is considered as a token can be precisely and unambiguously defined. Natural languages, on the other hand, display such a rich variety that there
- Cited by 41 (3 self) – Add To MetaCart
-
Corpus-based Method for Automatic Identification of Support Verbs for Nominalizations
- Nominalization is a highly productive phenomena in most languages.
- Cited by 21 (0 self) – Add To MetaCart

