Results 1 - 10
of
51
Incorporating Contextual Information in Recommender Systems Using a Multidimensional Approach
- ACM Transactions on Information Systems
, 2005
"... The paper presents a multidimensional (MD) approach to recommender systems that can provide recommendations based on additional contextual information besides the typical information on users and items used in most of the current recommender systems. This approach supports multiple dimensions, exten ..."
Abstract
-
Cited by 61 (3 self)
- Add to MetaCart
The paper presents a multidimensional (MD) approach to recommender systems that can provide recommendations based on additional contextual information besides the typical information on users and items used in most of the current recommender systems. This approach supports multiple dimensions, extensive profiling, and hierarchical aggregation of recommendations. The paper also presents a multidimensional rating estimation method capable of selecting two-dimensional segments of ratings pertinent to the recommendation context and applying standard collaborative filtering or other traditional two-dimensional rating estimation techniques to these segments. A comparison of the multidimensional and two-dimensional rating estimation approaches is made, and the tradeoffs between the two are studied. Moreover, the paper introduces a combined rating estimation method that identifies the situations where the MD approach outperforms the standard two-dimensional approach and uses the MD approach in those situations and the standard two-dimensional approach elsewhere. Finally, the paper presents a pilot empirical study of the combined approach, using a multidimensional movie recommender system that was developed for implementing this approach and testing its performance. 1 1.
Adaptive Performance Prediction for Distributed Data-Intensive Applications
, 1999
"... The computational grid is becoming the platform of choice for large-scale distributed data-intensive applications. Accurately predicting the transfer times of remote data les, a fundamental component of such applications, is critical to achieving application performance. In this paper, we introduce ..."
Abstract
-
Cited by 34 (3 self)
- Add to MetaCart
The computational grid is becoming the platform of choice for large-scale distributed data-intensive applications. Accurately predicting the transfer times of remote data les, a fundamental component of such applications, is critical to achieving application performance. In this paper, we introduce a performance prediction method, ARM (Adaptive Regression Modeling), to determine data transfer times for network-bound distributed dataintensive applications. We demonstrate the eectiveness of the ARM method on two distributed data applications, SARA (Synthetic Aperture Radar Atlas) and SRB (Storage Resource Broker) , and discuss how it can be used for application scheduling. Our experiments demonstrate that applying the ARM method to these applications predicted data transfer times in wide-area multi-user grid environments with accuracy of 88% or better. 1 Introduction Ensembles of distributed computational, storage, and other resources, also known as computational grids [12, 14], are...
Categorical Data Analysis: Away from ANOVAs (transformation or not) and towards Logit Mixed Models
"... This paper identifies several serious problems with the widespread use of ANOVAs for the analysis of categorical outcome variables such as forced-choice variables, question-answer accuracy, choice in production (e.g. in syntactic priming research), et cetera. I show that even after applying the arc ..."
Abstract
-
Cited by 23 (4 self)
- Add to MetaCart
This paper identifies several serious problems with the widespread use of ANOVAs for the analysis of categorical outcome variables such as forced-choice variables, question-answer accuracy, choice in production (e.g. in syntactic priming research), et cetera. I show that even after applying the arcsine-square-root transformation to proportional data, ANOVA can yield spurious results. I discuss conceptual issues underlying these problems and alternatives provided by modern statistics. Specifically, I introduce ordinary logit models (i.e. logistic regression), which are well-suited to analyze categorical data and offer many advantages over ANOVA. Unfortunately, ordinary logit models do not include random effect modeling. To address this issue, I describe mixed logit models (Generalized Linear Mixed Models for binomially distributed outcomes, Breslow & Clayton, 1993), which combine the advantages of ordinary logit models with the ability to account for random subject and item effects in one step of analysis. Throughout the paper, I use a psycholinguistic data set to compare the different statistical methods.
Solving Electrical Distribution Problems Using Hybrid Evolutionary Data Analysis Techniques
- APPLIED INTELLIGENCE
, 1998
"... Real-world electrical engineering problems can take advantage of the last Data Analysis methodologies. In this paper we will show that Genetic Fuzzy Rule-Based Systems and Genetic Programming techniques are good choices for tackling with some practical modeling problems. We claim that both evolution ..."
Abstract
-
Cited by 22 (17 self)
- Add to MetaCart
Real-world electrical engineering problems can take advantage of the last Data Analysis methodologies. In this paper we will show that Genetic Fuzzy Rule-Based Systems and Genetic Programming techniques are good choices for tackling with some practical modeling problems. We claim that both evolutionary processes may produce good numerical results while providing us with a model that can be interpreted by a human being. We will analyze in detail the characteristics of these two methods and we will compare them to the some of the most popular classical statistical modeling methods and neural networks.
Strings: Variational deformable models of multivariate ordered features
- IEEE PAMI
, 2001
"... Abstract—We propose a new image segmentation technique called strings. A string is a variational deformable model that is learned from a collection of example objects rather than built from a priori analytical or geometrical knowledge. As opposed to existing approaches, an object boundary is represe ..."
Abstract
-
Cited by 12 (5 self)
- Add to MetaCart
Abstract—We propose a new image segmentation technique called strings. A string is a variational deformable model that is learned from a collection of example objects rather than built from a priori analytical or geometrical knowledge. As opposed to existing approaches, an object boundary is represented by a one-dimensional multivariate curve in functional space, a feature function, rather than by a point in vector space. In the learning phase, feature functions are defined by extraction of multiple shape and image features along continuous object boundaries in a given learning set. The feature functions are aligned, then subjected to functional principal components analysis and functional principal regression to summarize the feature space and to model its content, respectively. Also, a Mahalanobis distance model is constructed for evaluation of boundaries in terms of their feature functions, taking into account the natural variations seen in the learning set. In the segmentation phase, an object boundary in a new image is searched for with help of a curve. The curve gives rise to a feature function, a string, that is weighted by the regression model and evaluated by the Mahalanobis model. The curve is deformed in an iterative procedure to produce feature functions with minimal Mahalanobis distance. Strings have been compared with active shape models on 145 vertebra images, showing that strings produce better results when initialized close to the target boundary, and comparable results otherwise. Index Terms—Machine learning, deformable models, energy minimization, multivariate statistics, shape analysis, functional data analysis, chemometrics, active shape models. 1
SAMPLING ALGORITHMS AND CORESETS FOR ℓp REGRESSION
- SIAM J. COMPUT. VOL. 38, NO. 5, PP. 2060–2078
, 2009
"... The ℓp regression problem takes as input a matrix A ∈ Rn×d, a vector b ∈ Rn, and a number p ∈ [1, ∞), and it returns as output a number Z and a vector xopt ∈ Rd such that Z =minx∈Rd‖Ax − b‖p = ‖Axopt − b‖p. In this paper, we construct coresets and obtain an efficient two-stage sampling-based approx ..."
Abstract
-
Cited by 12 (5 self)
- Add to MetaCart
The ℓp regression problem takes as input a matrix A ∈ Rn×d, a vector b ∈ Rn, and a number p ∈ [1, ∞), and it returns as output a number Z and a vector xopt ∈ Rd such that Z =minx∈Rd‖Ax − b‖p = ‖Axopt − b‖p. In this paper, we construct coresets and obtain an efficient two-stage sampling-based approximation algorithm for the very overconstrained (n ≫ d) version of this classical problem, for all p ∈ [1, ∞). The first stage of our algorithm nonuniformly samples ˆr1 = O(36pdmax{p/2+1,p}+1)rowsofAand the corresponding elements of b, and then it solves the ℓp regression problem on the sample; we prove this is an 8-approximation. The second stage of our algorithm uses the output of the first stage to resample ˆr1/ɛ2 constraints, and then it solves the ℓp regression problem on the new sample; we prove this is a (1 + ɛ)-approximation. Our algorithm unifies, improves upon, and extends the existing algorithms for special cases of ℓp regression, namely,
Does More Intense Competition Lead to Higher Growth
- CEPR Discussion Paper Series No
, 1999
"... An earlier version of this paper was presented at a conference on “Industrial Organisation and ..."
Abstract
-
Cited by 11 (0 self)
- Add to MetaCart
An earlier version of this paper was presented at a conference on “Industrial Organisation and
Morphological influences on the recognition of monosyllabic monomorphemic words
- Journal of Memory and Language
, 2006
"... Balota, Cortese, Sergent-Marschall, Spieler, and Yap (2004) have cautioned researchers in the field about the drawbacks of factorial designs where variables are manipulated in a noncontinuous manner and effects are assessed in terms of the presence or absence of a significant effect. They have eloqu ..."
Abstract
-
Cited by 9 (3 self)
- Add to MetaCart
Balota, Cortese, Sergent-Marschall, Spieler, and Yap (2004) have cautioned researchers in the field about the drawbacks of factorial designs where variables are manipulated in a noncontinuous manner and effects are assessed in terms of the presence or absence of a significant effect. They have eloquently demonstrated for us the power of regression analyses based on hundreds or even thousands of data points and the potential
Articulatory Methods for Speech Production and Recognition
, 1996
"... roduction-based knowledge into the recognition framework. By using an explicit time-domain articulatory model of the mechanisms of co-articulation, it is hoped to obtain a more accurate model of contextual effects in the acoustic signal, while using fewer parameters than traditional acoustically-dri ..."
Abstract
-
Cited by 9 (0 self)
- Add to MetaCart
roduction-based knowledge into the recognition framework. By using an explicit time-domain articulatory model of the mechanisms of co-articulation, it is hoped to obtain a more accurate model of contextual effects in the acoustic signal, while using fewer parameters than traditional acoustically-driven approaches. Separate articulatory and acoustic models are provided, and in each case the parameters of the models are automatically optimised over a training data set. A predictive statistically-based model of co-articulation is described, and found to yield improved articulatory modelling accuracy compared with X-ray articulatory traces. Parameterised acoustic vectors are synthesised by a set of artificial neural networks, and the resulting acoustic representations are used to re-score N-best recognition hypothesis lists produced by an HMM-based recogniser. The system is evaluated on two test databases, one including speaker-specific X-ray training data and the other aco
Direct: a system for mining data value conversion rules from disparate sources, Decision Support Systems
, 2001
"... may be quoted without explicit permission, provided that full credit including © notice is given to the source. This paper also can be downloaded without charge from the ..."
Abstract
-
Cited by 7 (4 self)
- Add to MetaCart
may be quoted without explicit permission, provided that full credit including © notice is given to the source. This paper also can be downloaded without charge from the

