Results 1 - 10
of
17,916
Does Semantic Information Help in the Text Categorization Task? In
- 91–107. of the 2007 International NooJ Conference 17 LABBE C., LABBE D. (2001): Inter-textual
, 2008
"... In this paper, we investigate how effective the use of semantic infor-mation could be in text categorization tasks. To this end, we consider distinct representations of documents differing in the kind of information incorporated: (a) information about terms only, (b) semantic information, and (c) a ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
In this paper, we investigate how effective the use of semantic infor-mation could be in text categorization tasks. To this end, we consider distinct representations of documents differing in the kind of information incorporated: (a) information about terms only, (b) semantic information, and (c) a
NLP Found Helpful (at least for one Text Categorization Task)
- In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP-02
, 2002
"... Attempts to use natural language processing (NLP) for text categorization and information retrieval (IR) have had mixed results. Nevertheless, there is a strong intuition that NLP is important, at least for some tasks. In this paper, we discuss a task involving captioned images for which the subject ..."
Abstract
-
Cited by 9 (1 self)
- Add to MetaCart
Attempts to use natural language processing (NLP) for text categorization and information retrieval (IR) have had mixed results. Nevertheless, there is a strong intuition that NLP is important, at least for some tasks. In this paper, we discuss a task involving captioned images for which
Integrating and Evaluating WSD in the Adaptation of a Lexical Database in Text Categorization Task
"... Abstract. Improvement in the accuracy of identifying the correct word sense (WSD) will give better results for many natural language processing tasks. In this paper, we present a new approach using WSD as an aid for Text Categorization (TC). This approach integrates a set of linguistics resources as ..."
Abstract
- Add to MetaCart
Abstract. Improvement in the accuracy of identifying the correct word sense (WSD) will give better results for many natural language processing tasks. In this paper, we present a new approach using WSD as an aid for Text Categorization (TC). This approach integrates a set of linguistics resources
BoosTexter: A Boosting-based System for Text Categorization
"... This work focuses on algorithms which learn from examples to perform multiclass text and speech categorization tasks. Our approach is based on a new and improved family of boosting algorithms. We describe in detail an implementation, called BoosTexter, of the new boosting algorithms for text catego ..."
Abstract
-
Cited by 667 (20 self)
- Add to MetaCart
This work focuses on algorithms which learn from examples to perform multiclass text and speech categorization tasks. Our approach is based on a new and improved family of boosting algorithms. We describe in detail an implementation, called BoosTexter, of the new boosting algorithms for text
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
, 1997
"... The Rocchio relevance feedback algorithm is one of the most popular and widely applied learning methods from information retrieval. Here, a probabilistic analysis of this algorithm is presented in a text categorization framework. The analysis gives theoretical insight into the heuristics used in the ..."
Abstract
-
Cited by 456 (1 self)
- Add to MetaCart
in the Rocchio algorithm, particularly the word weighting scheme and the similarity metric. It also suggests improvements which lead to a probabilistic variant of the Rocchio classifier. The Rocchio classifier, its probabilistic variant, and a naive Bayes classifier are compared on six text categorization tasks
N-grambased text categorization
- In Proc. of SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval
, 1994
"... Text categorization is a fundamental task in document processing, allowing the automated handling of enormous streams of documents in electronic form. One difficulty in handling some classes of documents is the presence of different kinds of textual errors, such as spelling and grammatical errors in ..."
Abstract
-
Cited by 445 (0 self)
- Add to MetaCart
Text categorization is a fundamental task in document processing, allowing the automated handling of enormous streams of documents in electronic form. One difficulty in handling some classes of documents is the presence of different kinds of textual errors, such as spelling and grammatical errors
A Sequential Algorithm for Training Text Classifiers
, 1994
"... The ability to cheaply train text classifiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classifiers was ..."
Abstract
-
Cited by 631 (10 self)
- Add to MetaCart
was developed and tested on a newswire text categorization task. This method, which we call uncertainty sampling, reduced by as much as 500-fold the amount of training data that would have to be manually classified to achieve a given level of effectiveness. 1 Introduction Text classification is the automated
Inductive learning algorithms and representations for text categorization,”
- in Proceedings of the International Conference on Information and Knowledge Management,
, 1998
"... ABSTRACT Text categorization -the assignment of natural language texts to one or more predefined categories based on their content -is an important component in many information organization and management tasks. We compare the effectiveness of five different automatic learning algorithms for text ..."
Abstract
-
Cited by 652 (8 self)
- Add to MetaCart
ABSTRACT Text categorization -the assignment of natural language texts to one or more predefined categories based on their content -is an important component in many information organization and management tasks. We compare the effectiveness of five different automatic learning algorithms for text
Text Categorization with Support Vector Machines: Learning with Many Relevant Features
, 1998
"... This paper explores the use of Support Vector Machines (SVMs) for learning text classifiers from examples. It analyzes the particular properties of learning with text data and identifies, why SVMs are appropriate for this task. Empirical results support the theoretical findings. SVMs achieve substan ..."
Abstract
-
Cited by 2303 (9 self)
- Add to MetaCart
This paper explores the use of Support Vector Machines (SVMs) for learning text classifiers from examples. It analyzes the particular properties of learning with text data and identifies, why SVMs are appropriate for this task. Empirical results support the theoretical findings. SVMs achieve
Machine Learning in Automated Text Categorization
- ACM COMPUTING SURVEYS
, 2002
"... The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this p ..."
Abstract
-
Cited by 1734 (22 self)
- Add to MetaCart
The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach
Results 1 - 10
of
17,916