Results 1 - 10
of
20
Learning sentiment-specific word embedding for twitter sentiment classification.
- In ACL,
, 2014
"... Abstract We present a method that learns word embedding for Twitter sentiment classification in this paper. Most existing algorithms for learning continuous word representations typically only model the syntactic context of words but ignore the sentiment of text. This is problematic for sentiment a ..."
Abstract
-
Cited by 25 (1 self)
- Add to MetaCart
(Show Context)
Abstract We present a method that learns word embedding for Twitter sentiment classification in this paper. Most existing algorithms for learning continuous word representations typically only model the syntactic context of words but ignore the sentiment of text. This is problematic for sentiment analysis as they usually map words with similar syntactic context but opposite sentiment polarity, such as good and bad, to neighboring word vectors. We address this issue by learning sentimentspecific word embedding (SSWE), which encodes sentiment information in the continuous representation of words. Specifically, we develop three neural networks to effectively incorporate the supervision from sentiment polarity of text (e.g. sentences or tweets) in their loss functions. To obtain large scale training corpora, we learn the sentiment-specific word embedding from massive distant-supervised tweets collected by positive and negative emoticons. Experiments on applying SS-WE to a benchmark Twitter sentiment classification dataset in SemEval 2013 show that (1) the SSWE feature performs comparably with hand-crafted features in the top-performed system; (2) the performance is further improved by concatenating SSWE with existing feature set.
Evaluation Datasets for Twitter Sentiment Analysis A survey and a new dataset, the STS-Gold
"... Abstract. Sentiment analysis over Twitter offers organisations and individuals a fast and effective way to monitor the publics ’ feelings towards them and their competitors. To assess the performance of sentiment analysis methods over Twitter a small set of evaluation datasets have been released in ..."
Abstract
-
Cited by 13 (4 self)
- Add to MetaCart
(Show Context)
Abstract. Sentiment analysis over Twitter offers organisations and individuals a fast and effective way to monitor the publics ’ feelings towards them and their competitors. To assess the performance of sentiment analysis methods over Twitter a small set of evaluation datasets have been released in the last few years. In this paper we present an overview of eight publicly available and manually annotated evaluation datasets for Twitter sentiment analysis. Based on this review, we show that a common limitation of most of these datasets, when assessing sentiment analysis at target (entity) level, is the lack of distinctive sentiment annotations among the tweets and the entities contained in them. For example, the tweet “I love iPhone, but I hate iPad ” can be annotated with a mixed sentiment label, but the entity iPhone within this tweet should be annotated with a positive sentiment label. Aiming to overcome this limitation, and to complement current evaluation datasets, we present STS-Gold, a new evaluation dataset where tweets and targets (entities) are annotated individually and therefore may present different sentiment labels. This paper also provides a comparative study of the various datasets along several dimensions including: total number of tweets, vocabulary size and sparsity. We also investigate the pair-wise correlation among these dimensions as well as their correlations to the sentiment classification performance on different datasets.
Global Belief Recursive Neural Networks
"... Recursive Neural Networks have recently obtained state of the art performance on several natural language processing tasks. However, because of their feedforward architecture they cannot correctly predict phrase or word labels that are deter-mined by context. This is a problem in tasks such as aspec ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
(Show Context)
Recursive Neural Networks have recently obtained state of the art performance on several natural language processing tasks. However, because of their feedforward architecture they cannot correctly predict phrase or word labels that are deter-mined by context. This is a problem in tasks such as aspect-specific sentiment classification which tries to, for instance, predict that the word Android is positive in the sentence Android beats iOS. We introduce global belief recursive neural networks (GB-RNNs) which are based on the idea of extending purely feedfor-ward neural networks to include one feedbackward step during inference. This allows phrase level predictions and representations to give feedback to words. We show the effectiveness of this model on the task of contextual sentiment analy-sis. We also show that dropout can improve RNN training and that a combination of unsupervised and supervised word vector representations performs better than either alone. The feedbackward step improves F1 performance by 3 % over the standard RNN on this task, obtains state-of-the-art performance on the SemEval 2013 challenge and can accurately predict the sentiment of specific entities. 1
Semantic Patterns for Sentiment Analysis of Twitter
"... Version: Accepted Manuscript Link(s) to article on publisher’s website: ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
(Show Context)
Version: Accepted Manuscript Link(s) to article on publisher’s website:
Biocom Usp: Tweet Sentiment Analysis with Adaptive Boosting Ensemble
- The 8th International Workshop on Semantic Evaluation (SemEval 2014). Proceedings of the Workshop (pp 123128
, 2014
"... We describe our approach for the SemEval-2014 task 9: Sentiment Analy-sis in Twitter. We make use of an en-semble learning method for sentiment classification of tweets that relies on varied features such as feature hash-ing, part-of-speech, and lexical fea-tures. Our system was evaluated in the Twi ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
We describe our approach for the SemEval-2014 task 9: Sentiment Analy-sis in Twitter. We make use of an en-semble learning method for sentiment classification of tweets that relies on varied features such as feature hash-ing, part-of-speech, and lexical fea-tures. Our system was evaluated in the Twitter message-level task. 1
Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora
"... We address the problem of performing po-larity classification on Twitter over differ-ent languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows the language in which the opinion is written, (2) a monolingual model that acts based on the decision p ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
We address the problem of performing po-larity classification on Twitter over differ-ent languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows the language in which the opinion is written, (2) a monolingual model that acts based on the decision provided by a language iden-tification tool and (3) a multilingual model trained on a multilingual dataset that does not need any language recognition step. Results show that multilingual models are even able to outperform the monolingual models on some monolingual sets. We introduce the first code-switching corpus with sentiment labels, showing the robust-ness of a multilingual approach. 1
lsislif: Feature extraction and label weighting for sentiment analysis in twitter
- In In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval
, 2015
"... This paper describes our sentiment analysis systems which have been built for SemEval-2015 Task 10 Subtask B and E. For sub-task B, a Logistic Regression classifier has been trained after extracting several groups of features including lexical, syntactic, lexicon-based, Z score and semantic features ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
(Show Context)
This paper describes our sentiment analysis systems which have been built for SemEval-2015 Task 10 Subtask B and E. For sub-task B, a Logistic Regression classifier has been trained after extracting several groups of features including lexical, syntactic, lexicon-based, Z score and semantic features. A weighting schema has been adapted for pos-itive and negative labels in order to take into account the unbalanced distribution of tweets between the positive and negative classes. This system is ranked third over 40 partici-pants, it achieves average F1 64.27 on Twit-ter data set 2015 just 0.57 % less than the first system. We also present our participation in Subtask E in which our system has got the sec-ond rank with Kendall metric but the first one with Spearman for ranking twitter terms ac-cording to their association with the positive sentiment. 1
From Speaker Identification to Affective Analysis: A Multi-Step System for Analyzing Children Stories
"... We propose a multi-step system for the analysis of children stories that enables several types of analysis. A hybrid approach is adopted, where pattern-based and statistical methods are used along with utilization of external knowledge sources. This system performs the following story analysis tasks ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
We propose a multi-step system for the analysis of children stories that enables several types of analysis. A hybrid approach is adopted, where pattern-based and statistical methods are used along with utilization of external knowledge sources. This system performs the following story analysis tasks: identification of characters in each story; attribution of quotes to specific story characters; identification of character age, gender and other salient personality attributes; and finally, affective analysis of the quoted material. The different types of analyses were evaluated using several datasets. For the quote attribution, as well as for the gender and age estimation, substantial improvement over baseline was realized, whereas results for personality attribute estimation and valence estimation are more modest. 1
SU-FMI: System Description for SemEval-2014 Task 9 on Sentiment Analysis in Twitter
"... Abstract We describe the submission of the team of the Sofia University to SemEval-2014 Task 9 on Sentiment Analysis in Twitter. We participated in subtask B, where the participating systems had to predict whether a Twitter message expresses positive, negative, or neutral sentiment. We trained an S ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract We describe the submission of the team of the Sofia University to SemEval-2014 Task 9 on Sentiment Analysis in Twitter. We participated in subtask B, where the participating systems had to predict whether a Twitter message expresses positive, negative, or neutral sentiment. We trained an SVM classifier with a linear kernel using a variety of features. We used publicly available resources only, and thus our results should be easily replicable. Overall, our system is ranked 20th out of 50 submissions (by 44 teams) based on the average of the three 2014 evaluation data scores, with an F1-score of 63.62 on general tweets, 48.37 on sarcastic tweets, and 68.24 on LiveJournal messages.
POPmine: Tracking Political Opinion on the Web
"... Abstract-The automatic content analysis of mass media in the social sciences has become necessary and possible with the raise of social media and computational power. One particularly promising avenue of research concerns the use of opinion mining. We design and implement the POPmine system which i ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract-The automatic content analysis of mass media in the social sciences has become necessary and possible with the raise of social media and computational power. One particularly promising avenue of research concerns the use of opinion mining. We design and implement the POPmine system which is able to collect texts from web-based conventional media (news items in mainstream media sites) and social media (blogs and Twitter) and to process those texts, recognizing topics and political actors, analyzing relevant linguistic units, and generating indicators of both frequency of mention and polarity (positivity/negativity) of mentions to political actors across sources, types of sources, and across time.