Results 1 -
4 of
4
Efficient Training of Conditional Random Fields
, 2002
"... This thesis explores a number of parameter estimation techniques for conditional random fields, a recently introduced probabilistic model for labelling and segmenting sequential data. Theoretical and practical disadvantages of the training techniques reported in current literature on CRFs are discus ..."
Abstract
-
Cited by 43 (2 self)
- Add to MetaCart
This thesis explores a number of parameter estimation techniques for conditional random fields, a recently introduced probabilistic model for labelling and segmenting sequential data. Theoretical and practical disadvantages of the training techniques reported in current literature on CRFs are discussed. We hypothesise that general numerical optimisation techniques result in improved performance over iterative scaling algorithms for training CRFs. Experiments run on a a subset of a well-known text chunking data set confirm that this is indeed the case. This is a highly promising result, indicating that such parameter estimation techniques make CRFs a practical and efficient choice for labelling sequential data, as well as a theoretically sound and principled probabilistic framework.
Introduction to Special Issue on Machine Learning Approaches to Shallow Parsing
- Journal of Machine Learning Research
, 2002
"... This article introduces the problem of partial or shallow parsing (assigning partial syntactic structure to sentences) and explains why it is an important natural language processing (NLP) task. The complexity of the task makes Machine Learning an attractive option in comparison to the handcrafti ..."
Abstract
-
Cited by 10 (0 self)
- Add to MetaCart
This article introduces the problem of partial or shallow parsing (assigning partial syntactic structure to sentences) and explains why it is an important natural language processing (NLP) task. The complexity of the task makes Machine Learning an attractive option in comparison to the handcrafting of rules. On the other hand, because of the same task complexity, shallow parsing makes an excellent benchmark problem for evaluating machine learning algorithms. We sketch the origins of shallow parsing as a specific task for machine learning of language, and introduce the articles accepted for this special issue, a representative sample of current research in this area. Finally, future directions for machine learning of shallow parsing are suggested.
Applying maximum entropy to robust chinese shallow parsing
- In: Proceedings of ROCLING2005. (2005
, 2005
"... Recently, shallow parsing has been applied to various information processing systems, such as information retrieval, information extraction, question answering, and automatic document summarization. A shallow parser is suitable for online applications, because it is much more efficient and less dema ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
Recently, shallow parsing has been applied to various information processing systems, such as information retrieval, information extraction, question answering, and automatic document summarization. A shallow parser is suitable for online applications, because it is much more efficient and less demanding than a full parser. In this research, we formulate shallow parsing as a sequential tagging problem and use a supervised machine learning technique, Maximum Entropy (ME), to build a Chinese shallow parser. The major features of the ME-based shallow parser are POSs and the context words in a sentence. We adopt the shallow parsing results of Sinica Treebank as our standard, and select 30,000 and 10,000 sentences from Sinica Treebank as the training set and test set respectively. We then test the robustness of the shallow parser with noisy data. The experiment results show that the proposed shallow parser is quite robust for sentences with unknown proper nouns. 1.
2.2 Hidden Markov Models....................... 9
"... This thesis explores a number of parameter estimation techniques for con-ditional random fields, a recently introduced [31] probabilistic model for la-belling and segmenting sequential data. Theoretical and practical disadvan-tages of the training techniques reported in current literature on CRFs ar ..."
Abstract
- Add to MetaCart
This thesis explores a number of parameter estimation techniques for con-ditional random fields, a recently introduced [31] probabilistic model for la-belling and segmenting sequential data. Theoretical and practical disadvan-tages of the training techniques reported in current literature on CRFs are dis-cussed. We hypothesise that general numerical optimisation techniques result in improved performance over iterative scaling algorithms for training CRFs. Experiments run on a a subset of a well-known text chunking data set [28] confirm that this is indeed the case. This is a highly promising result, indi-cating that such parameter estimation techniques make CRFs a practical and efficient choice for labelling sequential data, as well as a theoretically sound and principled probabilistic framework. iii Acknowledgements I would like to thank my supervisor, Miles Osborne, for his support and en-couragement throughout the duration of this project. iv Declaration I declare that this thesis was composed by myself, that the work contained herein is my own except where explicitly stated otherwise in the text, and that this work has not been submitted for any other degree or professional qualifi-cation except as specified.

