• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 59
Next 10 →

AN IMPROVED ARABIC WORD’S ROOTS EXTRACTION METHOD USING N-GRAM TECHNIQUE

by Nidal Yousef, Aymen Abu-errub, Ashraf Odeh, Hayel Khafajeh
"... Arabic language is distinguished by its morphological richness, which forces the workers in the field of Arabic language Processing (i.e., information retrieval, document’s classification, text summarizing) to deal with many words that seem to be different but in reality they came from an identical ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
root word. One of the methods to overcome this problem is to return the words to their roots. This research aims to provide a new algorithm, that returns roots of Arabic words using n-gram technique without using morphological rules in order to avoid the complexity arising from the morphological

Text Classification Improved through Multigram Models

by Dou Shen, Jian-tao Sun, Qiang Yang, Zheng Chen - In Proceedings of the ACM Fifteenth Conference on Information and Knowledge Management (CIKM'06
"... Classification algorithms and document representation approaches are two key elements for a successful document classification system. In the past, much work has been conducted to find better ways to represent documents. However, most of the attempts rely on certain extra resources such as WordNet, ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
language models, we put forward two text classification algorithms. The experiments on RCV1 show that our proposed algorithm based on n-multigram models alone can achieve the similar or even better classification performance compared with the classifier based on n-gram models but the model size of our

Combinatorial approach for Text Classification Algorithm

by Namekar Shirish Manohar, Deipali V. Gore
"... Abstract — There are large number of websites, web portals available in market places in order to buy or sell objects that means any kind of products. For examples filpkart, ebay, Amazon etc. due increase number of this web portals as well as its application Text mining, integration is become import ..."
Abstract - Add to MetaCart
there category. In this paper we used powerful method based on parallel text classification. To attack above problem availability of source data or document could help to find out better prediction. We formulated above problem to the best from our knowledge and study we showed classifier with parallel approach

Concept indexing: A fast dimensionality reduction algorithm with applications to document retrieval and categorization

by George Karypis, Eui-Hong (Sam) Han - IN CIKM’00 , 2000
"... In recent years, we have seen a tremendous growth in the volume of text documents available on the Internet, digital libraries, news sources, and company-wide intranets. This has led to an increased interest in developing meth-ods that can efficiently categorize and retrieve relevant information. Re ..."
Abstract - Cited by 81 (5 self) - Add to MetaCart
In recent years, we have seen a tremendous growth in the volume of text documents available on the Internet, digital libraries, news sources, and company-wide intranets. This has led to an increased interest in developing meth-ods that can efficiently categorize and retrieve relevant information

Sentiment Mining for Natural Language Documents Wikipedia-based Text Classification

by Joseph Christian, G. Noel , 2009
"... The Internet and even our personal computers are full of unlabeled text content and we could benefit from tools that automatically organize and tag this text content so that we can quickly access the appropriate content when needed. Supervised learning algorithms for text classification that learn f ..."
Abstract - Add to MetaCart
The Internet and even our personal computers are full of unlabeled text content and we could benefit from tools that automatically organize and tag this text content so that we can quickly access the appropriate content when needed. Supervised learning algorithms for text classification that learn

A Feature Weight Adjustment Algorithm for Document Categorization

by Shrikanth Shankar, George Karypis , 2000
"... In recent years we have seen a tremendous growth in the volume of text documents available on the Internet, digital libraries, news sources, and company-wide intra-nets. Automatic text categorization, which is the task of assigning text documents to pre-specified classes (topics or themes) of docume ..."
Abstract - Cited by 11 (1 self) - Add to MetaCart
In recent years we have seen a tremendous growth in the volume of text documents available on the Internet, digital libraries, news sources, and company-wide intra-nets. Automatic text categorization, which is the task of assigning text documents to pre-specified classes (topics or themes

Pruning Non-Informative Text Through Non-Expert Annotations to Improve Aspect-Level Sentiment Classification. COLING

by Ji Fang, Bob Price, Lotti Price , 2010
"... Sentiment analysis attempts to extract the author’s sentiments or opinions from unstructured text. Unlike approaches based on rules, a machine learning approach holds the promise of learning robust, highcoverage sentiment classifiers from labeled examples. However, people tend to use different ways ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
-sentiment information cannot be identified as noise by the learning algorithm and can easily become correlated with the sentiment label, thereby confusing sentiment classifiers. In this paper, we present a highly effective procedure for using crowd-sourcing techniques to label informative and non

A Detailed Study on Text Mining using Genetic Algorithm

by Shivani Patel, Prof Purnima Gandhi
"... Abstract-- The text mining studies are gaining more importance recently because of the availability of the increasing number of the electronic documents from a variety of sources. Few examines direct on applying Genetic Algorithm to text classification, summarization and information retrieval system ..."
Abstract - Add to MetaCart
Abstract-- The text mining studies are gaining more importance recently because of the availability of the increasing number of the electronic documents from a variety of sources. Few examines direct on applying Genetic Algorithm to text classification, summarization and information retrieval

Soft indexing of speech content for search in spoken documents

by Ciprian Chelba , Jorge Silva , Alex Acero - Computer Speech and Language
"... Abstract The paper presents the Position Specific Posterior Lattice (PSPL), a novel lossy representation of automatic speech recognition lattices that naturally lends itself to efficient indexing and subsequent relevance ranking of spoken documents. This technique explicitly takes into consideratio ..."
Abstract - Cited by 18 (2 self) - Add to MetaCart
into consideration the content uncertainty by means of using soft-hits. Indexing position information allows one to approximate N-gram expected counts and at the same time use more general proximity features in the relevance score calculation. In fact, one can easily port any state-of-the-art text

Semantic Smoothing for Text Clustering

by Jamal A. Nasira, Iraklis Varlamisb, Asim Karima, George Tsatsaronisc
"... In this paper we present a new semantic smoothing vector space kernel (S-VSM) for text documents clustering. In the suggested approach semantic relatedness between words is used to smooth the similarity and the represen-tation of text documents. The basic hypothesis examined is that considering sema ..."
Abstract - Add to MetaCart
semantic relatedness between two text documents may improve the perfor-mance of the text document clustering task. For our experimental evaluation we analyze the performance of several semantic relatedness measures when embedded in the proposed (S-VSM) and present results with respect to dif
Next 10 →
Results 1 - 10 of 59
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University