• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

A conditional random field word segmenter (2005)

Cached

  • Download as a PDF

Download Links

  • [research.microsoft.com]
  • [www.stanford.edu]
  • [www.stanford.edu]
  • [www-nlp.stanford.edu]
  • [nlp.stanford.edu]
  • [nlp.stanford.edu]
  • [nlp.stanford.edu]
  • [nlp.stanford.edu]
  • [www-nlp.stanford.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Huihsin Tseng
Venue:In Fourth SIGHAN Workshop on Chinese Language Processing
Citations:16 - 1 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@INPROCEEDINGS{Tseng05aconditional,
    author = {Huihsin Tseng},
    title = {A conditional random field word segmenter},
    booktitle = {In Fourth SIGHAN Workshop on Chinese Language Processing},
    year = {2005}
}

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

We present a Chinese word segmentation system submitted to the closed track of Sighan bakeoff 2005. Our segmenter was built using a conditional random field sequence model that provides a framework to use a large number of linguistic features such as character identity, morphological and character reduplication features. Because our morphological features were extracted from the training corpora automatically, our system was not biased toward any particular variety of Mandarin. Thus, our system does not

Citations

1548 BConditional random fields: Probabilistic models for segmenting and labeling sequence data - Lafferty, McCallum, et al.
35 The first international Chinese word segmentation bakeoff - Sproat, Emerson - 2003
9 Tou and Jin Kiat Low. 2004. Chinese part-of-speech tagging: One-at-a-time or all-at-once? word-based or character-based - Ng
7 Chinese unknown word identification using character-based tagging and chunking - Goh, Asahara, et al. - 2003
4 Fangfang Feng and Andrew McCallum. 2004. Chinese segmentation and new word detection using conditional random fields - Peng - 2004
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University