NADA: A Robust System for Non-Referential Pronoun Detection
BibTeX
@MISC{Bergsma_nada:a,
author = {Shane Bergsma and David Yarowsky},
title = {NADA: A Robust System for Non-Referential Pronoun Detection},
year = {}
}
OpenURL
Abstract
Nada is a novel, publicly-available program that accurately distinguishes between the referential and non-referential pronoun it in raw English text. Like recent state-of-the-art approaches, Nada uses very large-scale web N-gram features, but Nada makes these features practical by compressing the N-gram counts so they can fit into computer memory. Nada therefore operates as a fast, stand-alone system. Nada also improves over previous web-scale systems by considering the entire sentence, rather than narrow context windows, via long-distance lexical features. Nada very substantially outperforms other state-of-the-art systems in nonreferential detection accuracy. 1







