• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 11 - 20 of 43
Next 10 →

M.: Tags vs shelves: from social tagging to social classification

by Arkaitz Zubiaga, Christian Körner, Markus Strohmaier - In: Proceedings of the 22nd ACM Conference on Hypertext and Hypermedia, HT 2011 , 2011
"... Recent research has shown that different tagging motivation and user behavior can effect the overall usefulness of social tagging systems for certain tasks. In this paper, we provide further evidence for this observation by demonstrating that tagging data obtained from certain types of users- so-cal ..."
Abstract - Cited by 8 (4 self) - Add to MetaCart
Recent research has shown that different tagging motivation and user behavior can effect the overall usefulness of social tagging systems for certain tasks. In this paper, we provide further evidence for this observation by demonstrating that tagging data obtained from certain types of users- so-called Categorizers- outperforms data from other users on a social classification task. We show that segmenting users based on their tagging behavior has significant impact on the performance of automated classification of tagged data by using (i) tagging data from two different social tagging systems, (ii) a Support Vector Machine as a classification mechanism and (iii) existing classification systems such as the Library of Congress Classification System as ground truth. Our results are relevant for scientists studying pragmatics and semantics of social tagging systems as well as for engineers interested in influencing emerging properties of deployed social tagging systems.

Is Unlabeled Data Suitable for Multiclass SVM-based Web Page Classification?

by Arkaitz Zubiaga, Víctor Fresno, Raquel Martínez, Lenguajes Y Sistemas Informáticos
"... Support Vector Machines present an interesting and effective approach to solve automated classification tasks. Although it only handles binary and supervised problems by nature, it has been transformed into multiclass and semi-supervised approaches in several works. A previous study on supervised an ..."
Abstract - Cited by 2 (2 self) - Add to MetaCart
Support Vector Machines present an interesting and effective approach to solve automated classification tasks. Although it only handles binary and supervised problems by nature, it has been transformed into multiclass and semi-supervised approaches in several works. A previous study on supervised and semi-supervised SVM classification over binary taxonomies showed how the latter clearly outperforms the former, proving the suitability of unlabeled data for the learning phase in this kind of tasks. However, the suitability of unlabeled data for multiclass tasks using SVM has never been tested before. In this work, we present a study on whether unlabeled data could improve results for multiclass web page classification tasks using Support Vector Machines. As a conclusion, we encourage to rely only on labeled data, both for improving (or at least equaling) performance and for reducing the computational cost. 1

Twitter has...

by Hong Den, G Dong, Wan G Hieu, Le Tarek, Abde Lzah Er, Jiawei Han, Alice Leun, G John, Han Cock, Clare Voss
"... Ranking tweets is a fundamental task to make it easier to distill the vast amounts of information shared by users. In this paper, we explore the novel idea of ranking tweets on a topic using heterogeneous networks. We construct heterogeneous networks by harnessing cross-genre linkages between tweets ..."
Abstract - Add to MetaCart
Ranking tweets is a fundamental task to make it easier to distill the vast amounts of information shared by users. In this paper, we explore the novel idea of ranking tweets on a topic using heterogeneous networks. We construct heterogeneous networks by harnessing cross-genre linkages between tweets and semantically-related web documents from formal genres, and inferring implicit links between tweets and users. To rank tweets effectively by capturing the semantics and importance of different linkages, we introduce Tri-HITS, a model to iteratively propagate ranking scores across heterogeneous networks. We show that integrating both formal genre and inferred social networks

Tweet Ranking Based on Heterogeneous Networks

by Hong Den, G Dong, Wan G Hieu, Le Tarek, Abde Lzah Er, Jiawei Han, Alice Leun, G John, Han Cock, Clare Voss
"... Ranking tweets is a fundamental task to make it easier to distill the vast amounts of information shared by users. In this paper, we explore the novel idea of ranking tweets on a topic using heterogeneous networks. We construct heterogeneous networks by harnessing cross-genre linkages between tweets ..."
Abstract - Add to MetaCart
Ranking tweets is a fundamental task to make it easier to distill the vast amounts of information shared by users. In this paper, we explore the novel idea of ranking tweets on a topic using heterogeneous networks. We construct heterogeneous networks by harnessing cross-genre linkages between tweets and semantically-related web documents from formal genres, and inferring implicit links between tweets and users. To rank tweets effectively by capturing the semantics and importance of different linkages, we introduce Tri-HITS, a model to iteratively propagate ranking scores across heterogeneous networks. We show that integrating both formal genre and inferred social networks

WarwickDCS: From Phrase-Based to Target-Specific Sentiment Recognition

by Richard Townsend, Adam Tsakalidis, Yiwei Zhou, Bo Wang, Maria Liakata, Arkaitz Zubiaga, Alexandra Cristea, Rob Procter
"... We present and evaluate several hybrid sys-tems for sentiment identification for Twit-ter, both at the phrase and document (tweet) level. Our approach has been to use a novel combination of lexica, traditional NLP and deep learning features. We also analyse tech-niques based on syntactic parsing and ..."
Abstract - Add to MetaCart
We present and evaluate several hybrid sys-tems for sentiment identification for Twit-ter, both at the phrase and document (tweet) level. Our approach has been to use a novel combination of lexica, traditional NLP and deep learning features. We also analyse tech-niques based on syntactic parsing and token-based association to handle topic specific sen-timent in subtask C. Our strategy has been to identify subphrases relevant to the designated topic/target and assign sentiment according to our subtask A classifier. Our submitted subtask A classifier ranked fourth in the Se-mEval official results while our BASELINE and µPARSE classifiers for subtask C would have ranked second. 1

This is a preprint of an article accepted for publication in Social Network Analysis and Mining copyright @ 2013 (Springer). The final publication is available at link.springer.com. Tweet, but Verify: Epistemic Study of Information Verification on Twitter

by Arkaitz Zubiaga, Heng Ji
"... While Twitter provides an unprecedented opportunity to learn about breaking news and current events as they happen, it often produces skepticism among users as not all the information is accurate but also hoaxes are sometimes spread. While avoiding the diffusion of hoaxes is a major concern during f ..."
Abstract - Add to MetaCart
While Twitter provides an unprecedented opportunity to learn about breaking news and current events as they happen, it often produces skepticism among users as not all the information is accurate but also hoaxes are sometimes spread. While avoiding the diffusion of hoaxes is a major concern during fast-paced events such as natural disasters, the study of how users trust and verify information from tweets in these contexts has received little attention so far. We survey users on credibility perceptions regarding witness pictures posted on Twitter related to Hurricane Sandy. By examining credibility perceptions on features suggested for information verification in the field of Epistemology, we evaluate their accuracy in determining whether pictures were real or fake compared to professional evaluations performed by experts. Our study unveils insight about tweet presentation, as well as features that users should look at when assessing the veracity of tweets in the context of fast-paced events. Some of our main findings include that while author details not readily available on Twitter feeds should be emphasized in order to facilitate verification of tweets, showing multiple tweets corroborating a fact misleads users to trusting what actually is a hoax. We contrast some of the behavioral patterns found on tweets with literature in Psychology research.

de Tuits en la SEPLN 2015

by Nora Aranberri, Pablo Gamallo, Hugo Gonçalo Oliveira, Iñaki San Vicente, Antonio Toral, Arkaitz Zubiaga
"... Resumen: Este art́ıculo presenta un resumen de la tarea conjunta que tuvo lugar en el marco del taller TweetMT celebrado junto con SEPLN 2015, que consiste en traducir diversas colecciones de tweets en varios lenguajes. El art́ıculo describe el proceso de recolección y anotación de datos, el desar ..."
Abstract - Add to MetaCart
Resumen: Este art́ıculo presenta un resumen de la tarea conjunta que tuvo lugar en el marco del taller TweetMT celebrado junto con SEPLN 2015, que consiste en traducir diversas colecciones de tweets en varios lenguajes. El art́ıculo describe el proceso de recolección y anotación de datos, el desarrollo y evaluación de la tarea y los resultados obtenidos por los participantes. Palabras clave: Traducción Automática, Microblogs, Tuits, Social Media Abstract: This article presents an overview of the shared task that took place as part of the TweetMT workshop held at SEPLN 2015. The task consisted in translating collections of tweets from and to several languages. The article outlines the data collection and annotation process, the development and evaluation of the shared task, as well as the results achieved by the participants.

Overview of TweetLID: Tweet Language Identification at SEPLN 2014 Introducción a TweetLID: Tarea Compartida sobre Identificación de Idioma de Tuits en SEPLN 2014

by Arkaitz Zubiaga, Iñaki San Vicente, Pablo Gamallo, Jose ́ Ramom Pichel, Nora Aranberri, Aitzol Ezeiza
"... Resumen: Este art́ıculo presenta un resumen de la tarea compartida y taller TweetLID, organizado junto a SEPLN 2014. Resume brevemente el proceso de colección y anotación de datos, el desarrollo y evaluación de la tarea compartida, y por último, los resultados obtenidos por los participantes. Pa ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Resumen: Este art́ıculo presenta un resumen de la tarea compartida y taller TweetLID, organizado junto a SEPLN 2014. Resume brevemente el proceso de colección y anotación de datos, el desarrollo y evaluación de la tarea compartida, y por último, los resultados obtenidos por los participantes. Palabras clave: identificación de idioma, tuits, textos cortos, multilingüismo Abstract: This article presents a summary of the TweetLID shared task and workshop held at SEPLN 2014. It briefly summarizes the data collection and annotation process, the development and evaluation of the shared task, as well as the results achieved by the participants.

87 The Eighth AAAI Conference on Arti!cial Intelligence and Interactive

by Markus Fromherz, Héctor Muñoz-avila, Steve Kelling, Carl Lagoze, Weng-keen Wong, Jun Yu, Theodoros Damoulas, Jeff Gerbracht, Daniel Fink, Carla Gomes, Anders Holst, Markus Bohlin, Jan Ekman, Ola Sellin, Björn Lindström, Stefan Larsen, Nestor Rychtyckyj, Craig Plesco, Stephanie Valentine, Francisco Vides, George Lucchese, David Turner, Hong-hoe Kim, Wenzhe Li Julie Linsey, Mark O. Riedl, Vadim Bulitko, H. Levent Akin, Nobuhiro Ito, Adam Jacoff, Er Kleiner, Johannes Pellenz, Arnoud Visser, Mark Riedl, Gita Sukthankar, Arnav Jhala, Jichen Zhu, Santiago Ontañón, Stephen G, Rezarta Islamaj Dogan, A Gil, Haym Hirsh, Narayanan C. Krishnan, Michael Lewis, Cetin Mericli, Parisa Rashidi, Victor Raskin, Samarth Swarup, Wei Sun, Julia M. Taylor, Lana Yeganova, Mor Naaman, Daniele Quercia, Damiano Spina, Markus Strohmaier, Arkaitz Zubiaga , 2013
"... 101 Reports on the Workshops Held at the Sixth International ..."
Abstract - Add to MetaCart
101 Reports on the Workshops Held at the Sixth International

Analysis and enhancement of Wikification . . .

by Taylor Cassidy, Heng Ji, Lev Ratinov, et al. , 2012
"... Disambiguation to Wikipedia (D2W) is the task of linking mentions of concepts in text to their corresponding Wikipedia entries. Most previous work has focused on linking terms in formal texts (e.g. newswire) to Wikipedia. Linking terms in short informal texts (e.g. tweets) is difficult for systems a ..."
Abstract - Add to MetaCart
Disambiguation to Wikipedia (D2W) is the task of linking mentions of concepts in text to their corresponding Wikipedia entries. Most previous work has focused on linking terms in formal texts (e.g. newswire) to Wikipedia. Linking terms in short informal texts (e.g. tweets) is difficult for systems and humans alike as they lack a rich disambiguation context. We first evaluate an existing Twitter dataset as well as the D2W task in general. We then test the effects of two tweet context expansion methods, based on tweet authorship and topic-based clustering, on a state-of-the-art D2W system and evaluate the results.
Next 10 →
Results 11 - 20 of 43
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University