## An annotated corpus and a grammar model of theorem description (1980)

Venue: | In [MKM03], 2003. [GHK + 80 |

Citations: | 1 - 0 self |

### BibTeX

@INPROCEEDINGS{Baba80anannotated,

author = {Yusuke Baba and Masakazu Suzuki},

title = {An annotated corpus and a grammar model of theorem description},

booktitle = {In [MKM03], 2003. [GHK + 80},

year = {1980},

publisher = {Springer-Verlag}

}

### OpenURL

### Abstract

Abstract. Digitizing documents is becoming increasingly popular in various fields, and training computers to understand the contents of digitized documents is of growing interest. Since the early 90’s, research of natural language processing using large annotated corpora such as the Penn TreeBank has developed. Applying the methods of corpus-based research, we built a syntactically annotated corpus of theorem descriptions, using a book of set theory, and extracted a grammar model of theorems from the obtained corpus, as the first step to understanding mathematical documents by computer. 1

### Citations

538 | Model theory
- Hodges
- 1993
(Show Context)
Citation Context ...P -> RB JJ NN NP -> RB RB JJ NP ···s4 Experiment To evaluate the descriptive power of the grammar obtained in section 3(let G be the grammar), we used 100 theorems collected from two books A[11] and B=-=[12]-=- which are written about Galois theory and model theory. We assigned the correct structure of S, IFC, NP and POS to each theorem, and extracted generation rules using algorithm sec:al. If all the rule... |

73 | A Corpus-based Probabilistic Grammar with Only Two Non-terminals
- Sekine, Grishman
- 1995
(Show Context)
Citation Context ...lity of large, syntactically annotated corpora such as the University of Pennsylvania Tree Bank(Penn TreeBank,[7]) lead to rapid developments in the field of natural language processing. Sekine et al.=-=[8]-=- extracted rules of grammar from the Penn TreeBank and released a parser of English, the “Apple Pie Parser[9]”, using the grammar. To extract a grammar from the Penn TreeBank, the first approach of Se... |

15 | Mathematical formula recognition using virtual link network
- Eto, Suzuki
- 2001
(Show Context)
Citation Context ...e first step to understanding mathematical documents by computer. 1 Introduction In recent years, digitizing documents has become increasingly popular in various fields, for example, in mathematics[1]=-=[2]-=-[3][4]. In connection with this movement, understanding the contents of digitized documents by computer is of growing interest[5][6]. The technology of understanding documents is applicable to useful ... |

7 |
Logic and Categories
- Cameron
- 1999
(Show Context)
Citation Context ...tion rules. In order to evaluate the descriptive power of the obtained grammar, we performed an experiment as described in section 4. 2 Building a Corpus 2.1 Preliminaries We used a book of set theory=-=[10]-=- to collect samples of theorem descriptions as the source of the corpus. To build an annotated corpus of theorems, we used 28 categories of PartsOf-Speech(POS) symbols, and 3 categories of phrase and ... |

6 |
et al. Building a Large Annotated Corpus of English: The Penn TreeBank
- Marcus
- 1993
(Show Context)
Citation Context ...g for natural language give more accurate results for mathematical documents. The availability of large, syntactically annotated corpora such as the University of Pennsylvania Tree Bank(Penn TreeBank,=-=[7]-=-) lead to rapid developments in the field of natural language processing. Sekine et al.[8] extracted rules of grammar from the Penn TreeBank and released a parser of English, the “Apple Pie Parser[9]”... |

4 |
Optical recognition of printed mathematical documents
- Inoue, Miyazaki, et al.
- 1998
(Show Context)
Citation Context ... the first step to understanding mathematical documents by computer. 1 Introduction In recent years, digitizing documents has become increasingly popular in various fields, for example, in mathematics=-=[1]-=-[2][3][4]. In connection with this movement, understanding the contents of digitized documents by computer is of growing interest[5][6]. The technology of understanding documents is applicable to usef... |

1 |
A prototype of a combined digital and retrodigitaized searchable mathematical journal
- Michler
- 1999
(Show Context)
Citation Context ...irst step to understanding mathematical documents by computer. 1 Introduction In recent years, digitizing documents has become increasingly popular in various fields, for example, in mathematics[1][2]=-=[3]-=-[4]. In connection with this movement, understanding the contents of digitized documents by computer is of growing interest[5][6]. The technology of understanding documents is applicable to useful sys... |

1 |
Report on the retrodigitiization project “Archiv der Mathemark”. Archiv der Mathemark
- Michler
(Show Context)
Citation Context ...t step to understanding mathematical documents by computer. 1 Introduction In recent years, digitizing documents has become increasingly popular in various fields, for example, in mathematics[1][2][3]=-=[4]-=-. In connection with this movement, understanding the contents of digitized documents by computer is of growing interest[5][6]. The technology of understanding documents is applicable to useful system... |