Term-weighting approaches in automatic text retrieval
INFORMATION PROCESSING AND MANAGEMENT, 1988
"... The experimental evidence accumulated over the past 20 years indicates that text indexing systems based on the assignment of appropriately weighted single terms produce retrieval results that are superior to those obtainable with other more elaborate text representations. These results depend crucia ..."
Cited by 2189 (10 self)
crucially on the choice of effective term-weighting systems. This article summarizes the insights gained in automatic term weighting, and provides baseline single-term-indexing models with which other more elaborate content analysis procedures can be compared.
A Learning-based Term-Weighting Approach for Information Retrieval
"... One of the core components in information retrieval (IR) is the document-term-weighting scheme. In this paper, we will propose a novel learning-based term-weighting approach to improve the retrieval performance of vector space model in homogeneous collections. We first introduce a simple learning syst ..."
An Investigation of Term Weighting Approaches for Microblog Retrieval
"... The use of effective term frequency weighting and document length normalisation strategies have been shown over a number of decades to have a significant positive effect for document retrieval. When dealing with much shorter documents, such as those obtained from microblogs, it would see ..."
Cited by 1 (0 self)
An Extension of Topic Models for Text Classification: a Term Weighting Approach
"... Text classification has become a critical step in big data analytics. For supervised machine learning approaches to text classification, availability of sufficient training data with classification labels attached to individual text units is essential to the performance. Since labeled dat ..."
"... A study of term weighting approaches for short-length documents ..."
An Axiomatic Study of Learned Term-Weighting Schemes
"... At present, there exists many term-weighting schemes each based on different underlying models of retrieval. Learning approaches are increasingly being applied to the term-weighting problem, further increasing the number of useful term-weighting approaches available. Many of these term-weighting schem ..."
Cited by 3 (1 self)
Indexing by latent semantic analysis
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1990
"... A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries. The p ..."
Cited by 3779 (35 self)
Linear models and empirical Bayes methods for assessing differential expression in microarray experiments
Stat. Appl. Genet. Mol. Biol., 2004
"... The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior odds of differential expression in a replicated two-color experiment using a simple hierarchical parametric model. ..."
Cited by 1321 (24 self)
from spot filtering or spot quality weights. The posterior odds statistic is reformulated in terms of a moderated t-statistic in which posterior residual standard deviations are used in place of ordinary standard deviations. The empirical Bayes approach is equivalent to shrinkage of the estimated
Estimating the Support of a High-Dimensional Distribution
1999
"... Suppose you are given some dataset drawn from an underlying probability distribution P and you want to estimate a "simple" subset S of input space such that the probability that a test point drawn from P lies outside of S is bounded by some a priori specified value between 0 and 1. We propo ..."
Cited by 783 (29 self)
propose a method to approach this problem by trying to estimate a function f which is positive on S and negative on the complement. The functional form of f is given by a kernel expansion in terms of a potentially small subset of the training data; it is regularized by controlling the length
Loopy belief propagation for approximate inference: An empirical study
Proceedings of Uncertainty in AI, 1999
"... Recently, researchers have demonstrated that "loopy belief propagation", the use of Pearl's polytree algorithm in a Bayesian network with loops, can perform well in the context of error-correcting codes. The most dramatic instance of this is the near Shannon-limit performanc ..."
Cited by 676 (15 self)
likelihood weighting 3.1 The PYRAMID network All nodes were binary and the conditional probabilities were represented by tables; entries in the conditional probability tables (CPTs) were chosen uniformly in the range (0, 1]. 3.2 The toyQMR network All nodes were binary and the conditional probabilities
