A finite-state architecture for tokenization and grapheme-to-phoneme conversion for multilingual text analysis (1995)

by R Sproat
Venue:In Proceedings of the EACL SIGDAT Workshop