Results 1 - 10
of
15
Contextualising tags in collaborative tagging systems
- In Proc. of the Twentieth ACM Conference on Hypertext and Hypermedia (HT ’09
, 2009
"... Collaborative tagging systems are now popular tools for or-ganising and sharing information on the Web. While col-laborative tagging offers many advantages over the use of controlled vocabularies, they also suffer from problems such as the existence of polysemous tags. We investigate how the differe ..."
Abstract
-
Cited by 22 (0 self)
- Add to MetaCart
(Show Context)
Collaborative tagging systems are now popular tools for or-ganising and sharing information on the Web. While col-laborative tagging offers many advantages over the use of controlled vocabularies, they also suffer from problems such as the existence of polysemous tags. We investigate how the different contexts in which individual tags are used can be revealed automatically without consulting any external resources. We consider several different network represen-tations of tags and documents, and apply a graph cluster-ing algorithm on these networks to obtain groups of tags or documents corresponding to the different meanings of an ambiguous tag. Our experiments show that networks which explicitly take the social context into account are more likely to give a better picture of the semantics of a tag.
The metadata triumvirate: Social annotations, anchor texts and search queries
- In IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
, 2008
"... In this paper, we study and compare three different but related types of “metadata ” about web documents: social annotations provided by readers of web documents, hyper-link anchor text provided by authors of web documents, and search queries of users trying to find web documents. We in-troduce a la ..."
Abstract
-
Cited by 13 (0 self)
- Add to MetaCart
(Show Context)
In this paper, we study and compare three different but related types of “metadata ” about web documents: social annotations provided by readers of web documents, hyper-link anchor text provided by authors of web documents, and search queries of users trying to find web documents. We in-troduce a large research data set called CABS120k08 which we have created for this study from a variety of informa-tion sources such as AOL500k, the Open Directory Project, del.icio.us/Yahoo!, Google and the WWW in general. We use this data set to investigate several characteristics of said metadata including length, novelty, diversity, and similarity and discuss theoretical and practical implications. 1
M.: Tags vs shelves: from social tagging to social classification
- In: Proceedings of the 22nd ACM Conference on Hypertext and Hypermedia, HT 2011
, 2011
"... Recent research has shown that different tagging motivation and user behavior can effect the overall usefulness of social tagging systems for certain tasks. In this paper, we provide further evidence for this observation by demonstrating that tagging data obtained from certain types of users- so-cal ..."
Abstract
-
Cited by 9 (3 self)
- Add to MetaCart
(Show Context)
Recent research has shown that different tagging motivation and user behavior can effect the overall usefulness of social tagging systems for certain tasks. In this paper, we provide further evidence for this observation by demonstrating that tagging data obtained from certain types of users- so-called Categorizers- outperforms data from other users on a social classification task. We show that segmenting users based on their tagging behavior has significant impact on the performance of automated classification of tagged data by using (i) tagging data from two different social tagging systems, (ii) a Support Vector Machine as a classification mechanism and (iii) existing classification systems such as the Library of Congress Classification System as ground truth. Our results are relevant for scientists studying pragmatics and semantics of social tagging systems as well as for engineers interested in influencing emerging properties of deployed social tagging systems.
Document Word Clouds: Visualising Web Documents as Tag Clouds to Aid Users in Relevance Decisions
- In Research and Advanced Technology for Digital Libraries, 13th European Conference, Proceedings, volume 5714 of Lecture Notes in Computer Science
, 2009
"... Abstract. Information Retrieval systems spend a great effort on de-termining the significant terms in a document. When, instead, a user is looking at a document he cannot benefit from such information. He has to read the text to understand which words are important. In this paper we take a look at t ..."
Abstract
-
Cited by 6 (0 self)
- Add to MetaCart
(Show Context)
Abstract. Information Retrieval systems spend a great effort on de-termining the significant terms in a document. When, instead, a user is looking at a document he cannot benefit from such information. He has to read the text to understand which words are important. In this paper we take a look at the idea of enhancing the perception of web documents with visualisation techniques borrowed from the tag clouds of Web 2.0. Highlighting the important words in a document by using a larger font size allows to get a quick impression of the relevant concepts in a text. As this process does not depend on a user query it can also be used for explorative search. A user study showed, that already simple TF-IDF values used as notion of word importance helped the users to decide quicker, whether or not a document is relevant to a topic. 1
Meaning as Collective Use: Predicting Semantic Hashtag Categories on Twitter ABSTRACT
"... This paper sets out to explore whether data about the usage of hashtags on Twitter contains information about their semantics. Towards that end, we perform initial statistical hypothesis tests to quantify the association between usage patterns and semantics of hashtags. To assess the utility of prag ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
This paper sets out to explore whether data about the usage of hashtags on Twitter contains information about their semantics. Towards that end, we perform initial statistical hypothesis tests to quantify the association between usage patterns and semantics of hashtags. To assess the utility of pragmatic features – which describe how a hashtag is used over time – for semantic analysis of hashtags, we conduct various hashtag stream classification experiments and compare their utility with the utility of lexical features. Our results indicate that pragmatic features indeed contain valuable information for classifying hashtags into semantic categories. Although pragmatic features do not outperform lexical features in our experiments, we argue that pragmatic features are important and relevant for settings in which textual information might be sparse or absent (e.g., in social video streams).
Optimizing academic conference classification using social tags
- Computational Science and Engineering, IEEE International Conference on
"... Abstract- Automatically classifying academic conference into semantic topic promises improved academic search and browsing for users. Social tagging is an increasingly popular way of describing the topic of academic conference. However, no attention has been devoted to academic conference classifica ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
Abstract- Automatically classifying academic conference into semantic topic promises improved academic search and browsing for users. Social tagging is an increasingly popular way of describing the topic of academic conference. However, no attention has been devoted to academic conference classification by making use of social tags. Motivated by this observation, this paper proposes a method which utilizes social tags as well as the content of academic conference in order to improve automatically identifying academic conference classification. The proposed method applies different automatic classification algorithms to improve classification quality by using social tags. Experimental results show that this method mentioned above performs better than the method which only utilizes the content to classify academic conference with 1 % Precision measure score increase and 1.64 % F1 measure score increase, which demonstrates the effectiveness of the proposed method. Keywords-classification; academic conference; feature selection. I.
Policy Aware Social Miner
"... Abstract—There is a wealth of sensitive information available on the Web about any individual that is generated either by her or by others on social networking sites. This information could be used to make important decisions about that individual. The problem is that although people know that searc ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
(Show Context)
Abstract—There is a wealth of sensitive information available on the Web about any individual that is generated either by her or by others on social networking sites. This information could be used to make important decisions about that individual. The problem is that although people know that searches for their personal information are possible, they have no way to either control the data that is put on the Web by others or indicate how they would like to restrict usage of their own data. We describe a framework called Policy Aware Social Miner (PASM) that would provide a solution to these problems by giving users a way to semantically annotate data on the Web using policies to guide how searches about them should be executed. PASM accepts search queries and applies the user’s policies on the results. It filters results over data the user owns and provides the user’s refutation link on search results that the user does not own. These usage control mechanisms for privacy allow users to break away from siloed data privacy management and have their privacy settings applied to all their data available on the Web. Keywords-Usage restrictions; Refutations; Social network mining; Web mining
Security Techniques for Prevention of Rank Manipulation in Social Tagging Services including Robotic Domains
"... With smartphone distribution becoming common and robotic applications on the rise, social tagging services for various applications including robotic domains have advanced significantly. Though social tagging plays an important role when users are finding the exact information through web search, r ..."
Abstract
- Add to MetaCart
(Show Context)
With smartphone distribution becoming common and robotic applications on the rise, social tagging services for various applications including robotic domains have advanced significantly. Though social tagging plays an important role when users are finding the exact information through web search, reliability and semantic relation between web contents and tags are not considered. Spams are making ill use of this aspect and put irrelevant tags deliberately on contents and induce users to advertise contents when they click items of search results. Therefore, this study proposes a detection method for tag-ranking manipulation to solve the problem of the existing methods which cannot guarantee the reliability of tagging. Similarity is measured for ranking the grade of registered tag on the contents, and weighted values of each tag are measured by means of synonym relevance, frequency, and semantic distances between tags. Lastly, experimental evaluation results are provided and its efficiency and accuracy are verified through them.
TAXONOMY CONSTRUCTION TECHNIQUES – ISSUES AND CHALLENGES
"... For any information to be organized, taxonomy is essential. Taxonomy plays a very important role for information and content management. Also it helps in searching of content. The most common method for constructing taxonomy was the manual construction. As the information available today is huge, co ..."
Abstract
- Add to MetaCart
(Show Context)
For any information to be organized, taxonomy is essential. Taxonomy plays a very important role for information and content management. Also it helps in searching of content. The most common method for constructing taxonomy was the manual construction. As the information available today is huge, constructing taxonomy for such information manually was time consuming and maintenance was difficult. This paper presents an overview of various taxonomy construction techniques available for easier construction of taxonomy or generating taxonomy automatically. Also this paper describes the advantages and disadvantages of each technique used.
Evaluating Tag Filtering Techniques for Web Resource
"... Social or collaborative tagging systems emerged as a novel classification scheme on the Web based on the collective knowledge of people. In sites such as Del.icio.us, Technorati or Flickr, users annotate a variety of resources, in-cluding Web pages, blogs, pictures, videos or bibliographic reference ..."
Abstract
- Add to MetaCart
(Show Context)
Social or collaborative tagging systems emerged as a novel classification scheme on the Web based on the collective knowledge of people. In sites such as Del.icio.us, Technorati or Flickr, users annotate a variety of resources, in-cluding Web pages, blogs, pictures, videos or bibliographic references; using freely chosen textual labels or tags. Underlying collaborative tagging sys-tems are ternary data structures known as folksonomies relating resources and users through tags, this information facilitate accessing and browsing massive repositories of resources. Collective annotations provided by people in the form of tags can also be exploited to organize resources on-line in a more formal classification scheme such as the ones provided by hierarchies or directories, alleviating the task of manual classification commonly required by systems such as directories on the Web. In this paper we present an empir-ical study carried out to determine the value of tags in resource classification. Furthermore, the use of several filtering and pre-processing operations to re-∗Corresponding author