Results 1 -
1 of
1
More Accurate Tests for the Statistical Significance of Result Differences
, 2000
"... Sl,ai,isl,ica,1 significance, kest,ing of diflkn'ences in values of metrics like recall, i)rccision mM batmined F-s(x)re is a ne(:cssm'y tmrt of eml)iri(:a.l ua.t;ural bmguage 1)ro(;easing. Unfi)rtunat,ely we inertly used tests of'ten ulnlerestimake i,he significm ce mM so a.re less likely to detect ..."
Abstract
-
Cited by 25 (0 self)
- Add to MetaCart
Sl,ai,isl,ica,1 significance, kest,ing of diflkn'ences in values of metrics like recall, i)rccision mM batmined F-s(x)re is a ne(:cssm'y tmrt of eml)iri(:a.l ua.t;ural bmguage 1)ro(;easing. Unfi)rtunat,ely we inertly used tests of'ten ulnlerestimake i,he significm ce mM so a.re less likely to detect, difihrences l,hat exist between difM'eni techniques. This 1111deresl;illla(;ioll comes from an independcn(;e asSmnl)tion that is offten violated. Wc l)oint, out sonic ltse]'Hl l.es(,s (,]mL do nol, make lhis assmnl)- tion, including contput;a, tionally--iltcnsive domizai,ion tests.

