Text Categorization with Many Redundant Features: Using Aggressive Feature Selection to Make SVMs Competitive with C4.5 (2004)

by Evgeniy Gabrilovich, Shaul Markovitch
Venue: In ICML’04
Citations: 49 - 4 self

Active Bibliography

28 Wikipedia-based semantic interpretation for natural language processing – Evgeniy Gabrilovich, Shaul Markovitch
12 Harnessing the Expertise of 70,000 Human Editors: Knowledge-Based Feature Generation for Text Categorization – Evgeniy Gabrilovich, Shaul Markovitch
24 Parameterized Generation of Labeled Datasets for Text Categorization Based on a Hierarchical Directory – Dmitry Davidov, Evgeniy Gabrilovich, Shaul Markovitch - 2004
Text Categorization with a Small Number of . . . – Kang Hyuk Lee - 2003
14 Augmenting Wikipedia with Named Entity Tags – Wisam Dakka
5 An Evaluation of Text Classification Methods for Literary Study – Bei Yu - 2006
Deliverable: D8.1c (3/5) – Derek Greene
1 Machine Learning of Web Documents – David R. Karger, Lawrence Kai Shih - 2004
1090 Machine Learning in Automated Text Categorization – Fabrizio Sebastiani - 2002
1 A Hybrid Attribute Selection Approach for Text Classification – Chen-Huei Chou, Atish P. Sinha, Huimin Zhao - 2010
74 Feature generation for text categorization using world knowledge – Evgeniy Gabrilovich, Shaul Markovitch - 2005
8 Classifying web documents in a hierarchy of categories: a comprehensive study – Michelangelo Ceci, Donato Malerba - 2007
Entropy based feature selection for text categorization – Christine Largeron, Christophe Moulin, Mathias Géry - 2011
Active Learning Query Selection with Historical Information – Michael Davy - 2009
7 Phrases and Feature Selection in E-Mail Classification – Elisabeth Crawford - 2004
Classifying Structured Web Sources Using Aggressive Feature Selection – Hieu Quang Le, Stefan Conrad
31 Complex linguistic features for text classification: a comprehensive study – Alessandro Moschitti, Roberto Basili - 2004
47 Augmenting Naive Bayes Classifiers with Statistical Language Models – Fuchun Peng, Dale Schuurmans, Shaojun Wang - 2003
78 Overcoming the brittleness bottleneck using wikipedia: enhancing text categorization with encyclopedic knowledge – Evgeniy Gabrilovich, Shaul Markovitch - 2006