Results 1 - 5 of 5
Automatic Recognition of Multi-Word Terms: the C-value/NC-value Method
, 2000
Cited by 56 (6 self)
Technical terms (henceforth called terms) are important elements for digital libraries. In this paper we present a domain-independent method for the automatic extraction of multi-word terms from machine-readable special language corpora. The method (C-value/NC-value) combines linguistic and statistical information. The first part, C-value, enhances the common statistical measure of frequency of occurrence for term extraction, making it sensitive to a particular type of multi-word terms, the nested terms. The second part, NC-value, gives: 1) a method for the extraction of term context words (words that tend to appear with terms), 2) the incorporation of information from term context words to the extraction of terms.
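The abstract above gives no formula, so the following is only a minimal sketch of the C-value score as it is commonly stated: a candidate's frequency is weighted by log2 of its length in words, and a candidate that occurs nested inside longer candidates has the average frequency of those longer candidates subtracted first. The function names (is_nested, c_value) and the toy counts are assumptions made for illustration. The linguistic filter and the NC-value context step are left out.

```python
import math
from collections import defaultdict


def is_nested(short, long):
    """True if `short` occurs as a contiguous word sequence inside `long`."""
    n = len(short)
    return any(long[i:i + n] == short for i in range(len(long) - n + 1))


def c_value(freq):
    """Score candidate terms given as {tuple_of_words: corpus_frequency}.

    Simplified C-value (an assumption of this sketch, not taken from the
    abstract):
      not nested: log2(|a|) * f(a)
      nested:     log2(|a|) * (f(a) - mean frequency of longer candidates
                               that contain a)
    """
    # Map each candidate to the longer candidates that contain it.
    containers = defaultdict(list)
    for a in freq:
        for b in freq:
            if len(b) > len(a) and is_nested(a, b):
                containers[a].append(b)

    scores = {}
    for a, f in freq.items():
        longer = containers[a]
        if longer:
            f = f - sum(freq[b] for b in longer) / len(longer)
        scores[a] = math.log2(len(a)) * f
    return scores


# Toy counts, invented purely for illustration.
toy_counts = {
    ("soft", "contact", "lens"): 5,
    ("contact", "lens"): 8,
    ("basal", "cell"): 4,
}
scores = c_value(toy_counts)
```

In this toy input, "contact lens" is discounted because five of its eight occurrences are inside "soft contact lens", which is the sensitivity to nested terms that the abstract describes.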
An Approach to Program Understanding by Natural Language Understanding
- Natural Language Engineering
, 1999
Cited by 11 (2 self)
An automated tool to assist in the understanding of legacy code components can be useful both in the areas of software reuse and software maintenance. Most previous work in this area has concentrated on functionally-oriented code. Whereas object-oriented code has been shown to be inherently more reusable than functionally-oriented code, in many cases the eventual reuse of the object-oriented code was not considered during development. A knowledge-based, natural language processing approach to the automated understanding of object-oriented code as an aid to the reuse of object-oriented code is described. A system called the PATRicia system (Program Analysis Tool for Reuse) that implements the approach is examined. The natural language processing/information extraction system that comprises a large part of the PATRicia system is discussed, and the knowledge base of the PATRicia system, in the form of conceptual graphs, is described. Reports provided by natural language-generation in the ...
What is Technical Text?
, 1997
Cited by 3 (1 self)
Beyond labeling it easier to process than other types, few researchers who use technical text in their work try to define what it is. This paper describes a study that investigates the character of texts typically considered technical.
Using Dia-MoLE For Unsupervised Learning Of Domain-Specific Dialogue Acts From Spontaneous Language
Cited by 3 (0 self)
This report introduces DIA-MOLE, a tool that supports an engineering-oriented approach towards dialogue modelling for a spoken-language interface. Our approach is applied to the domain of appointment scheduling. A major step towards dialogue models is to know about the basic units that are used to construct a dialogue model. DIA-MOLE does not employ theory-based dialogue units because they are subject to human interpretation and often cannot be recognized from data available in a spoken-language system. We pursue a data-driven approach and apply unsupervised learning to a sample set of spontaneous dialogues using multiple knowledge sources, i.e. domain and task knowledge, word recognition and prosodic information. Using these data, DIA-MOLE supports segmentation of turns and interpretation of their illocutionary force based on a model of the task. For this purpose we had to develop a model of interactive problem solving in the domain of appointment scheduling. As a result of learning...
Parsing for Targeted Errors in Controlled Languages
- In Proceedings of Recent Advances in Natural Language Processing
, 1995
Cited by 2 (2 self)
The use of Controlled Languages in technical documentation is becoming a large concern for many organisations. Authoring texts which conform to these specifications is a problematic process. Technological support for the writing process may offer a number of aids, including style, or grammar, checkers. The ability to recognise variations to the prescribed grammar is at the heart of such systems. This paper presents a variation on the chart parsing method which encodes the grammar as finite state automata productions instead of a linear description of constituents. The system allows the grammar writer to define a number of variations to a grammar rule which are represented as transformations to the automata.
1 Introduction
The SEATS (Specialised English Author Training System) project aims to create technology capable of supporting the process of writing technical documentation according to the stylistic requirements of a Controlled Language. Central to this support is a style checker ...

