Ensemble of feature selection techniques for high dimensional data. MSc. Theses, Vege, Sri Harsha (2012)

by S H Vege

A COMPARATIVE STUDY OF COMBINED FEATURE SELECTION METHODS FOR ARABIC TEXT CLASSIFICATION

by Aisha Adel, Nazlia Omar, Adel Al-shabi
Text classification is a very important task due to the huge amount of electronic documents. One of the problems of text classification is the high dimensionality of the feature space. Researchers have proposed many algorithms to select relevant features from text. These algorithms have been studied extensively for English text, while studies for Arabic are still limited. This study investigates the performance of five widely used feature selection methods, namely Chi-square, Correlation, GSS Coefficient, Information Gain and Relief F. In addition, this study introduces an approach that combines feature selection methods based on the average weight of the features. The experiments are conducted using Naïve Bayes and Support Vector Machine classifiers to classify a published Arabic corpus. The results show that, among the individual methods, the best results were obtained using Information Gain. The results also show that the combination of multiple feature selection methods outperforms the best results obtained by the individual methods.
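The abstract describes combining feature selection methods through the average weight of each feature but gives no further detail. The following is a minimal sketch of one way such a combination could work, not the authors' implementation. The min-max normalization step, the treatment of missing features as weight 0, the function names, and the toy scores are all assumptions made for illustration.

```python
from typing import Dict, List


def min_max_normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale one method's scores to [0, 1] so methods are comparable.

    Assumption: the source does not say how weights are put on a common scale.
    """
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # Every feature ties under this method, so it carries no ranking signal.
        return {term: 0.0 for term in scores}
    return {term: (s - lo) / (hi - lo) for term, s in scores.items()}


def average_weights(per_method: Dict[str, Dict[str, float]]) -> Dict[str, float]:
    """Average each feature's normalized weight across all methods."""
    normalized = [min_max_normalize(s) for s in per_method.values()]
    terms = set().union(*(n.keys() for n in normalized))
    # Assumption: a feature that a method did not score counts as weight 0.
    return {
        t: sum(n.get(t, 0.0) for n in normalized) / len(normalized)
        for t in terms
    }


def top_k(weights: Dict[str, float], k: int) -> List[str]:
    """Return the k highest-weighted features (ties broken alphabetically)."""
    ranked = sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Hypothetical scores for three terms under two of the methods named above.
toy_scores = {
    "chi_square": {"goal": 12.0, "market": 3.0, "the": 0.0},
    "information_gain": {"goal": 0.25, "market": 0.5, "the": 0.0},
}
combined = average_weights(toy_scores)
selected = top_k(combined, k=2)
```

In this toy case the two methods disagree on which term ranks first, and the averaged weight settles the order using both. The selected terms would then form the reduced feature space passed to a classifier such as Naïve Bayes or a Support Vector Machine.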

Citation Context

... Machine. In literature, the studies that tried to combine the feature selection methods using different strategies, they combine either two or five feature selection methods like (Wang et al., 2010; Vege, 2012). The key idea behind combining feature selection methods are that every individual method produces different types of errors and feature selection methods are combined to exploit their strengths. Co...
