Text extraction from the web via text-totag ratio (2008)

by T Weninger, W H Hsu
Venue:in DEXA Workshops. IEEE Computer Society