Results 1 - 10
of
18
Predicting the Semantic Orientation of Adjectives
, 1997
"... We identify and validate from a large corpus constraints from conjunctions on the positive or negative semantic orientation of the conjoined adjectives. A log-linear regression model uses these constraints to predict whether conjoined adjectives are of same or different orientations, achiev- ..."
Abstract
-
Cited by 200 (5 self)
- Add to MetaCart
We identify and validate from a large corpus constraints from conjunctions on the positive or negative semantic orientation of the conjoined adjectives. A log-linear regression model uses these constraints to predict whether conjoined adjectives are of same or different orientations, achiev- ing 82% accuracy in this task when each conjunction is considered independently.
Optimizing Ranking Functions: A Connectionist Approach to Adaptive Information Retrieval
- DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING, THE UNIVERSITY OF CALIFORNIA, SAN DIEGO
, 1994
"... This dissertation examines the use of adaptive methods to automatically improve the performance of ranked text retrieval systems. The goal of a ranked retrieval system is to manage a large collection of text documents and to order documents for a user based on the estimated relevance of the document ..."
Abstract
-
Cited by 26 (5 self)
- Add to MetaCart
This dissertation examines the use of adaptive methods to automatically improve the performance of ranked text retrieval systems. The goal of a ranked retrieval system is to manage a large collection of text documents and to order documents for a user based on the estimated relevance of the documents to the user's information need (or query). The ordering enables the user to quickly find documents of interest. Ranked retrieval is a difficult problem because of the ambiguity of natural language, the large size of the collections, and because of the varying needs of users and varying collection characteristics. We propose and empirically validate general adaptive methods which improve the ability of a large class of retrieval systems to rank documents effectively. Our main adaptive method is to numerically optimize free parameters in a retrieval system by minimizing a non-metric criterion function. The criterion measures how well the system is ranking documents relative to a target ordering, defined by a set of training queries which include the users' desired document orderings. Thus, the system learns parameter settings which better enable it to rank relevant documents before irrelevant. The non-metric approach is interesting because it is a general adaptive method, an alternative to supervised methods for training neural networks in domains in which rank order or prioritization is important. A second adaptive method is also examined, which is applicable to a restricted class of retrieval systems but which permits an analytic solution. The adaptive methods are applied to a number of problems in text retrieval to validate their utility and practical efficiency. The applications include: A dimensionality reduction of vector-based document representations to a vector spa...
Lexical Semantics of Adjectives: A Microtheory Of Adjectival Meaning
, 1995
"... . This work belongs to a family of research efforts, called microtheories and aimed at describing the static meaning of all lexical categories in several languages in the framework of the MikroKosmos project on computational semantics. The latter also involves other static microtheories describin ..."
Abstract
-
Cited by 20 (5 self)
- Add to MetaCart
. This work belongs to a family of research efforts, called microtheories and aimed at describing the static meaning of all lexical categories in several languages in the framework of the MikroKosmos project on computational semantics. The latter also involves other static microtheories describing world knowledge and syntax-semantics mapping as well as dynamic microtheories connected with the actual process of text analysis. This paper describes our approach to determining and representing adjectival meaning, compares it with the body of knowledge on adjectives in literature and presents a detailed, practically tested methodology and heuristics for the acquisition of lexical entries for adjectives. The work was based on the set of over 6,000 English and about 1,500 Spanish adjectives obtained from task-oriented corpora. Introduction The topic of this paper is the information about adjectival meaning which should be included in a computational lexicon. Thus, we concentrate on...
Principled Disambiguation: Discriminating Adjective Senses with . . .
- COMPUTATIONAL LINGUISTICS
, 1995
"... ... In this paper we argue for a linguistically principled approach to disambiguation, in which relevant contextual clues are narrowly defined, in syntactic and semantic terms, and in which only highly reliable clues are exploited. Statistical methods play a definite role in this work, helping to or ..."
Abstract
-
Cited by 19 (0 self)
- Add to MetaCart
... In this paper we argue for a linguistically principled approach to disambiguation, in which relevant contextual clues are narrowly defined, in syntactic and semantic terms, and in which only highly reliable clues are exploited. Statistical methods play a definite role in this work, helping to organize and analyze data, but the disambiguation method itself does not employ statistical data or decision criteria. This approach results in improved understanding of the disambiguation problem both in general and on a word-specific basis and leads to broadly applicable and nearly errorless clues to word sense. The approach is illustrated by an experiment discriminating among the senses of adjectives, which have been relatively neglected in work on sense disambiguation. In particular, the paper assesses the potential of nouns for discriminating among the senses of adjectives that modify them. This assessment is based on an empirical study of five of the most frequent ambiguous adjectives in English: hard, light, old, right, and short. About three-quarters of all instances of these adjectives can be disambiguated almost errorlessly by the nouns they modify or by the syntactic constructions in which they occur. Such disambiguation requires only simple rules, which can be automated easily. Furthermore, a small number of semantic attributes supply a compact means of representing the noun clues in a very few rules. Clues other than nouns are required when modified nouns are not useable. The sense of an ambiguous modified noun may be needed to determine the relevant semantic attribute for disambiguation of a target adjective; and other adjectives, verbs, and grammatical constructions all show evidence of high reliability, and sometimes of high applicability, when they stand in specific, ...
A quantitative evaluation of linguistic tests for the automatic prediction of semantic markedness
- In Proceedings of the 33rd Annual Meeting of the ACL
, 1995
"... {vh, kathy}~cs, columbia, edu We present a corpus-based study of methods that have been proposed in the linguistics literature for selecting the semantically unmarked term out of a pair of antonymous adjectives. Solutions to this problem are applicable to the more general task of selecting the posit ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
{vh, kathy}~cs, columbia, edu We present a corpus-based study of methods that have been proposed in the linguistics literature for selecting the semantically unmarked term out of a pair of antonymous adjectives. Solutions to this problem are applicable to the more general task of selecting the positive term from the pair. Using automatically collected data, the accuracy and applicability of each method is quantified, and a statistical analysis of the significance of the results is performed. We show that some simple methods are indeed good indicators for the answer to the problem while other proposed methods fail to perform better than would be attributable to chance. In addition, one of the simplest methods, text frequency, dominates all others. We also apply two generic statistical learning methods for combining the indications of the individual methods, and compare their performance to the simple methods. The most sophisticated complex learning method offers a small, but statistically significant, improvement over the original tests.
2008) Computing Word-Pair Antonymy
- In Proceedings of the Conference on Empirical Methods in Natural Language Processing
"... Knowing the degree of antonymy between words has widespread applications in natural language processing. Manually-created lexicons have limited coverage and do not include most semantically contrasting word pairs. We present a new automatic and empirical measure of antonymy that combines corpus stat ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
Knowing the degree of antonymy between words has widespread applications in natural language processing. Manually-created lexicons have limited coverage and do not include most semantically contrasting word pairs. We present a new automatic and empirical measure of antonymy that combines corpus statistics with the structure of a published thesaurus. The approach is evaluated on a set of closest-opposite questions, obtaining a precision of over 80%. Along the way, we discuss what humans consider antonymous and how antonymy manifests itself in utterances. 1
Hernán.” Knowledge Representation for Software Architecture Domain by
- Manual and Automatic Methodologies”. Electronic Journal of CLEI 2006 (EJCLEI2006
, 2006
"... At the moment, there is a need for new knowledge representation using Thesaurus or Ontologies because of the need to reuse knowledge. In this paper, a Software Architecture knowledge representation is created, for that purpose a manual and automatic methodology for creating it is used. A new manual ..."
Abstract
-
Cited by 2 (2 self)
- Add to MetaCart
At the moment, there is a need for new knowledge representation using Thesaurus or Ontologies because of the need to reuse knowledge. In this paper, a Software Architecture knowledge representation is created, for that purpose a manual and automatic methodology for creating it is used. A new manual methodology is provided in the paper. CAKE (Computer Aided Knowledge Environment) is the automatic process used as automatic methodology. The result is the first thesaurus in English for the Software Architecture Domain using the new manual methodology presented in the paper and the first ontology in Spanish for the Software Architecture Domain using the automatic methodology.
Clause Aggregation: An Approach to Generating Concise Text
- COLUMBIA UNIVERSITY
, 2002
"... This dissertation identifies and resolves constraints related to the task of combining related clauses to formulate fluent and concise sentences. To incorporate complex linguistic constructions into text generation systems, novel algorithms were designed to systematically generate conjunction, ellip ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
This dissertation identifies and resolves constraints related to the task of combining related clauses to formulate fluent and concise sentences. To incorporate complex linguistic constructions into text generation systems, novel algorithms were designed to systematically generate conjunction, ellipsis, and quantification constructions. Casper a submodule in a text generation system, was designed and implemented. It can convey the same information using fewer words by taking advantage of redundancies in the input based on syntactic, semantic, and discourse information. In addition to these symbol approaches, my research also employs corpus-based statistical approaches to enhance the fluency of the generated text. By employing advance linguistic constructions and removing redundancies through clause aggregations, the generated text or speech is more fluent and concise and thus improves human-computer interface.
Lexical RUles for Deverbal Adjectives
"... This work belongs to a family of research efforts, called microtheories and aimed at describing the static meaning of all lexical categories in several languages in the framework of the MikroKosmos project on computational semantics. The latter also involves other static microtheories describing ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
This work belongs to a family of research efforts, called microtheories and aimed at describing the static meaning of all lexical categories in several languages in the framework of the MikroKosmos project on computational semantics. The latter also involves other static microtheories describing world knowledge and syntax-semantics mapping as well as dynamic microtheories connected with the actual process of text analysis.
Ordering Among Premodifiers
- In Procs. of acl99, Univ
, 1999
"... We present a corpus-based study of the sequential ordering among premodifiers in noun phrases. This information is important for the fluency of generated text in practical applications. We propose and evaluate three approaches to identify sequential order among pre- modifiers: direct evidence, trans ..."
Abstract
- Add to MetaCart
We present a corpus-based study of the sequential ordering among premodifiers in noun phrases. This information is important for the fluency of generated text in practical applications. We propose and evaluate three approaches to identify sequential order among pre- modifiers: direct evidence, transitive closure, and clustering. Our implemented system can make over 94% of such ordering decisions correctly, as evaluated on a large, previously un- seen test corpus.

