Semantic grounding of tag relatedness in social bookmarking systems (2008)
Venue: In The Semantic Web – ISWC 2008, Proc. Intl. Semantic Web Conference 2008, volume 5318 of LNAI
Citations: 61 (10 self)
Citations
4668 | The Anatomy of a Large-Scale Hypertextual Web Search Engine
- Brin, Page
- 1998
Citation Context ...ity is thus independent of the length of the vectors. Its value ranges from 0 (for totally orthogonal vectors) to 1 (for vectors pointing into the same direction). 4.3 FolkRank The PageRank algorithm [34] reflects the idea that a web page is important if there are many pages linking to it, and if those pages are important themselves. We employed the same principle for folksonomies [8]: a resource whic...
3742 | WordNet: An Electronic Lexical Database
- Fellbaum
- 1998
Citation Context ...ty and relatedness are semantic notions, one way of defining them for a folksonomy is to map the tags to a thesaurus or lexicon like Roget’s thesaurus (http://www.gutenberg.org/etext/22) or WordNet [2], and to measure the relatedness there by means of well-known metrics. The other option is to define measures of relatedness directly on the network structure of the folksonomy. One important reason f...
3267 | The PageRank citation ranking: bringing order to the web
- Page, Brin, et al.
- 1999
1509 | Automatic Text Processing – The Transformation, Analysis, and Retrieval of Information by Computer
- Salton
- 1989
Citation Context ...of harvesting the emergent semantics of a folksonomy. In this paper we analyse five measures of tag relatedness: the co-occurrence count, three distributional measures which use the cosine similarity [7] in the vector spaces spanned by users, tags, and resources, respectively, and FolkRank [8], a graph-based measure that is an adaptation of PageRank [9] to folksonomies. Our analysis is based on data ...
1219 | Formal Concept Analysis, Mathematical Foundations
- Ganter, Wille
- 1999
Citation Context ...nd E = {{u, t, r} | (u, t, r) ∈ Y } is the set of hyper-edges. Alternatively, the folksonomy hyper-graph can be represented as a three-dimensional (binary) adjacency matrix. In Formal Concept Analysis [32] this structure is known as a triadic context [33]. All these equivalent notions make explicit that folksonomies are special cases of three-mode data. Since measures of similarity and relatedness are ...
1096 | Using information content to evaluate semantic similarity in a taxonomy
- Resnik
- 1995
Citation Context ...e taxonomic path length in WordNet. Black bars (top) show the Jiang-Conrath measure of semantic distance. combines the taxonomic path length with an information-theoretic similarity measure by Resnik [35]. We use the implementation of those measures available in the WordNet::Similarity library [36]. It is important to remark that [1] provides a pragmatic grounding of the Jiang-Conrath measure by means ...
872 | Semantic similarity based on corpus statistics and lexical taxonomy
- Jiang, Conrath
- 1997
Citation Context ...ons of WordNet to infer corresponding semantic relations in the folksonomy. In WordNet, we measure the similarity by using both the taxonomic path length and a similarity measure by Jiang and Conrath [10] that has been validated through user studies and applications [1]. The use of taxonomic path lengths, in particular, allows us to inspect the edge composition of paths leading from one tag to the cor...
466 | Ontologies Are Us: A Unified Model of Social Networks and Semantics
- Mika
- 2005
Citation Context ...studies about folksonomies is Ref. [11], where several concepts of bottom-up social annotation are introduced. Ref. [12, 13, 11] provide overviews of the strengths and weaknesses of such systems. Ref. [14, 15] introduce a tri-partite graph representation for folksonomies, where nodes are users, tags and resources. Ref. [16] provides a first quantitative analysis of del.icio.us. We investigated the distribu...
388 | WordNet::Similarity: Measuring the Relatedness of Concepts
- Pedersen, Patwardhan, et al.
- 2004
Citation Context ... distance. combines the taxonomic path length with an information-theoretic similarity measure by Resnik [35]. We use the implementation of those measures available in the WordNet::Similarity library [36]. It is important to remark that [1] provides a pragmatic grounding of the Jiang-Conrath measure by means of user studies and by its superior performance in the context of a spell-checking application....
327 | Folksonomies – cooperative classification and communication through shared metadata (2004). Computer Mediated Communication. Available at: www.adammathes.com/academic/computermediated-communication/folksonomies.html (accessed 20
- Mathes
- 2008
Citation Context ...on 6. We discuss our results in the context of ontology learning and related tasks in Section 7, where we also point to future work. 2 Related Work One of the first studies about folksonomies is Ref. [11], where several concepts of bottom-up social annotation are introduced. Ref. [12, 13, 11] provide overviews of the strengths and weaknesses of such systems. Ref. [14, 15] introduce a tri-partite graph ...
321 | Evaluating WordNet-based measures of lexical semantic relatedness
- Budanitsky, Hirst
- 2006
Citation Context ...llows for an evaluation with well-established measures of similarity in existing lexical databases. Budanitsky and Hirst pointed out that similarity can be considered as a special case of relatedness [1]. As both similarity and relatedness are semantic notions, one way of defining them for a folksonomy is to map the tags to a thesaurus or lexicon like Roget’s thesaurus 3 3 http://www.gutenberg.org/et...
311 | Course in General Linguistics.
- Saussure
- 1986
Citation Context ... which states that words found in similar contexts tend to be semantically similar. From a linguistic point of view, these two families of measures focus on orthogonal aspects of structural semiotics [5, 6]. The co-occurrence measures address the so-called syntagmatic relation, where words are considered related if they occur in the same part of text. The contextual measures address the paradigmatic rel...
306 | The structure of collaborative tagging systems.
- Golder, Huberman
- 2006
Citation Context ...ncept of bottom-up social annotation are introduced. Ref. [15, 18] introduce a tri-partite graph representation for folksonomies, where nodes are users, tags and resources. Ref. [9] provides a first quantitative analysis of del.icio.us (http://del.icio.us/). A considerable number of investigations is motivated by the vision of “bridging the gap” between the Semantic Web and Web 2.0 by means of ontolo...
238 | Information retrieval in folksonomies: Search and ranking.
- Hotho
- 2006
Citation Context ...s of tag relatedness: the co-occurrence count, three distributional measures which use the cosine similarity [7] in the vector spaces spanned by users, tags, and resources, respectively, and FolkRank [8], a graph-based measure that is an adaptation of PageRank [9] to folksonomies. Our analysis is based on data from a large-scale snapshot of the popular social bookmarking system del.icio.us. To prov...
227 | Mathematical Structures of Language.
- Harris
- 1968
Citation Context ...d in several ways. Most of these definitions use statistical information about different types of co-occurrence between tags, resources and users. Other approaches adopt the distributional hypothesis [3, 4], which states that words found in similar contexts tend to be semantically similar. From a linguistic point of view, these two families of measures focus on orthogonal aspects of structural semiotics...
148 | Collaborative creation of communal hierarchical taxonomies in social tagging systems.
- Heymann, Garcia-Molina
- 2006
Citation Context ...udge term similarity. In order to adapt these approaches to folksonomies, several distributional measures of tag relatedness have been introduced in theoretical studies or implemented in applications [23, 24]. However, the choice of a specific measure of relatedness is often made without justification and often it appears to be rather ad hoc. A task which depends heavily on quantifying tag relatedness is ...
143 | Towards the semantic web: Collaborative tag suggestions. Collaborative Web Tagging Workshop at WWW2006
- Xu, Fu, et al.
- 2006
Citation Context ...in approaches that analyze the content of the tagged resources with information retrieval techniques [25, 26] and approaches that use collaborative filtering methods based on the folksonomy structure [27]. An example of the latter class of approaches is Ref. [28], where we used our FolkRank algorithm [8] for tag recommendation. FolkRank-based measures will be also covered in this paper. Relatedness mea...
137 | Ontology Learning and Population from Text: Algorithms, Evaluation and Applications
- Cimiano
- 2006
Citation Context ...ne similarity of contextually associated word pairs more appropriately [. . . ].” The distributional hypothesis is also at the basis of a number of approaches to synonym acquisition from text corpora [22]. As in other ontology learning scenarios, clustering techniques are often applied to group similar terms extracted from a corpus, and a core building block of such procedure is the metric used to jud...
123 | Tag recommendations in folksonomies
- Jäschke, Marinho, et al.
- 2007
Citation Context ...ces with information retrieval techniques [25, 26] and approaches that use collaborative filtering methods based on the folksonomy structure [27]. An example of the latter class of approaches is Ref. [28], where we used our FolkRank algorithm [8] for tag recommendation. FolkRank-based measures will be also covered in this paper. Relatedness measures also play a role in assisting users who browse the co...
120 | Improved annotation of the blogosphere via autotagging and hierarchical clustering.
- Brooks, Montanez
- 2006
Citation Context ...ies. Scientific publications in this domain are still sparse. Existing work can be broadly divided in approaches that analyze the content of the tagged resources with information retrieval techniques [25, 26] and approaches that use collaborative filtering methods based on the folksonomy structure [27]. An example of the latter class of approaches is Ref. [28], where we used our FolkRank algorithm [8] for...
103 | Social bookmarking tools (i): A general review. D-Lib Magazine
- Hammond, Hannay, et al.
- 2005
Citation Context ...ks in Section 7, where we also point to future work. 2 Related Work One of the first studies about folksonomies is Ref. [11], where several concepts of bottom-up social annotation are introduced. Ref. [12, 13, 11] provide overviews of the strengths and weaknesses of such systems. Ref. [14, 15] introduce a tri-partite graph representation for folksonomies, where nodes are users, tags and resources. Ref. [16] pr...
79 | AutoTag: A Collaborative Approach to Automated Tag Assignment for Weblog Posts.
- Mishne
- 2006
Citation Context ...ies. Scientific publications in this domain are still sparse. Existing work can be broadly divided in approaches that analyze the content of the tagged resources with information retrieval techniques [25, 26] and approaches that use collaborative filtering methods based on the folksonomy structure [27]. An example of the latter class of approaches is Ref. [28], where we used our FolkRank algorithm [8] for...
75 | Semiotic dynamics and collaborative tagging.
- Cattuto, Loreto, et al.
- 2007
Citation Context ...nodes are users, tags and resources. Ref. [16] provides a first quantitative analysis of del.icio.us (http://del.icio.us/). We investigated the distribution of tag co-occurrence frequencies in Ref. [17] and the network structure of folksonomies in Ref. [18]. Tag-based metrics for resource distance have been introduced in Ref. [19]. To the best of our knowledge, no systematic characterization of tag ...
56 | Network properties of folksonomies
- Cattuto, Schmitz, et al.
Citation Context ... a first quantitative analysis of del.icio.us (http://del.icio.us/). We investigated the distribution of tag co-occurrence frequencies in Ref. [17] and the network structure of folksonomies in Ref. [18]. Tag-based metrics for resource distance have been introduced in Ref. [19]. To the best of our knowledge, no systematic characterization of tag relatedness in folksonomies is available in the literat...
53 | A triadic approach to formal concept analysis.
- Lehmann, Wille
- 1995
Citation Context ...hyper-edges. Alternatively, the folksonomy hyper-graph can be represented as a three-dimensional (binary) adjacency matrix. In Formal Concept Analysis [32] this structure is known as a triadic context [33]. All these equivalent notions make explicit that folksonomies are special cases of three-mode data. Since measures of similarity and relatedness are not well developed for three-mode data yet, we wil...
46 | The Anatomy of a Large-Scale Hypertextual Web
- Brin, Page
- 1998
Citation Context ...two tags t1 and t2 are represented by $v_1, v_2 \in \mathbb{R}^n$, then their cosine similarity is defined as: $\mathrm{cossim}(t_1, t_2) := \cos \angle(v_1, v_2) = \frac{v_1 \cdot v_2}{\|v_1\|_2 \cdot \|v_2\|_2}$. FolkRank The PageRank algorithm [1] reflects the idea that a web page is important if there are many pages linking to it, and if those pages are important themselves. The same principle was employed for folksonomies in [13]: a resource...
43 | A synopsis of linguistic theory 1930–55.
- Firth
- 1957
Citation Context ...d in several ways. Most of these definitions use statistical information about different types of co-occurrence between tags, resources and users. Other approaches adopt the distributional hypothesis [3, 4], which states that words found in similar contexts tend to be semantically similar. From a linguistic point of view, these two families of measures focus on orthogonal aspects of structural semiotics...
33 | The Dynamics and Semantics of Collaborative Tagging.
- Halpin, Robu, et al.
- 2006
Citation Context ...5] provides a model of semantic-social networks for extracting lightweight ontologies from del.icio.us. Other approaches for learning taxonomic relations from tags are provided by Ref. [23, 24]. Ref. [30] presents a generative model for folksonomies and also addresses the learning of taxonomic relations. Ref. [31] applies statistical methods to infer global semantics from a folksonomy. The results of ...
31 | Collaborative tagging as a tripartite network
- Lambiotte, Ausloos
Citation Context ...studies about folksonomies is Ref. [11], where several concepts of bottom-up social annotation are introduced. Ref. [12, 13, 11] provide overviews of the strengths and weaknesses of such systems. Ref. [14, 15] introduce a tri-partite graph representation for folksonomies, where nodes are users, tags and resources. Ref. [16] provides a first quantitative analysis of del.icio.us. We investigated the distribu...
29 | Distributional measures as proxies for semantic relatedness
- Mohammad, Hirst
- 2005
Citation Context ...emantic similarity to the case where documents are classified in the nodes of an ontology with non-hierarchical components. The measures introduced there were validated by means of a user study. Ref. [21] analyses distributional measures of word relatedness and compares them with measures of semantic relatedness in thesauri like WordNet. They concluded that “even though ontological measures are likely...
24 | Integrating collaborative tagging and emergent semantics for image retrieval
- Aurnhammer, Hanappe, et al.
- 2006
Citation Context ...algorithm [8] for tag recommendation. FolkRank-based measures will be also covered in this paper. Relatedness measures also play a role in assisting users who browse the contents of a folksonomy. Ref. [29] shows that navigation in a folksonomy can be enhanced by suggesting tag relations grounded in content-based features. A considerable number of investigations are motivated by the vision of “bridging ...
23 | Algorithmic computation and approximation of semantic similarity
- Maguitman, Menczer, et al.
Citation Context ...d metrics for resource distance have been introduced in Ref. [19]. To the best of our knowledge, no systematic characterization of tag relatedness in folksonomies is available in the literature. Ref. [20] generalizes standard tree-based measures of semantic similarity to the case where documents are classified in the nodes of an ontology with non-hierarchical components. The measures introduced there ...
16 | Social bookmarking tools (II). A case study - Connotea. D-Lib Magazine
- Lund, Hammond, et al.
- 2005
Citation Context ...ks in Section 7, where we also point to future work. 2 Related Work One of the first studies about folksonomies is Ref. [11], where several concepts of bottom-up social annotation are introduced. Ref. [12, 13, 11] provide overviews of the strengths and weaknesses of such systems. Ref. [14, 15] introduce a tri-partite graph representation for folksonomies, where nodes are users, tags and resources. Ref. [16] pr...
16 | Vocabulary Growth in Collaborative Tagging Systems. Arxiv e-print
- Cattuto, Baldassarri, et al.
- 2007
15 | Inducing ontology from flickr tags. In: Collaborative Web Tagging Workshop at WWW2006
- Schmitz
- 2006
Citation Context ...udge term similarity. In order to adapt these approaches to folksonomies, several distributional measures of tag relatedness have been introduced in theoretical studies or implemented in applications [23, 24]. However, the choice of a specific measure of relatedness is often made without justification and often it appears to be rather ad hoc. A task which depends heavily on quantifying tag relatedness is ...
15 | Emergent Semantics from Folksonomies: A Quantitative Study
- Zhang, Wu, et al.
- 2006
Citation Context ...pproaches for learning taxonomic relations from tags are provided by Ref. [23, 24]. Ref. [30] presents a generative model for folksonomies and also addresses the learning of taxonomic relations. Ref. [31] applies statistical methods to infer global semantics from a folksonomy. The results of our paper are especially relevant to inform the design of such learning methods. 3 Folksonomy Definition and Da...
14 | Emergent Community Structure In Social Tagging Systems
- Cattuto, Baldassarri, et al.
- 2008
Citation Context ...tion of tag co-occurrence frequencies in Ref. [17] and the network structure of folksonomies in Ref. [18]. Tag-based metrics for resource distance have been introduced in Ref. [19]. To the best of our knowledge, no systematic characterization of tag relatedness in folksonomies is available in the literature. Ref. [20] generalizes standard tree-based measures of semantic similar...
3 | Information retrieval in folksonomies: Search and ranking
- Hotho, Jäschke, et al.
- 2006
Citation Context ... of a folksonomy. In this paper, we consider the three following measures for the relatedness of tags: the co-occurrence count, the cosine similarity [23] of co-occurrence distributions, and FolkRank [13], a graph-based measure that is an adaptation of PageRank [20] to folksonomies. Our analysis is based on data from a large-scale snapshot of the popular social bookmarking system del.icio.us. To pr...
1 | Semiotics: The Basics. Second edn
- Chandler
- 2007
Citation Context ... which states that words found in similar contexts tend to be semantically similar. From a linguistic point of view, these two families of measures focus on orthogonal aspects of structural semiotics [5, 6]. The co-occurrence measures address the so-called syntagmatic relation, where words are considered related if they occur in the same part of text. The contextual measures address the paradigmatic rel...
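Illustration: several of the citation contexts above describe a folksonomy as a set Y of (user, tag, resource) assignments and measure tag relatedness by the cosine similarity of tag vectors. The following is a minimal sketch of that idea, not the authors' implementation. The toy data, the function names `tag_vectors` and `cossim`, and the choice to weight each resource by the number of assignments of the tag to it are all assumptions made for illustration.

```python
import math
from collections import defaultdict

# Hypothetical toy folksonomy: (user, tag, resource) assignments.
Y = [
    ("alice", "python", "r1"),
    ("bob", "python", "r1"),
    ("alice", "programming", "r1"),
    ("bob", "programming", "r2"),
    ("carol", "python", "r2"),
    ("carol", "cooking", "r3"),
]


def tag_vectors(assignments):
    """Map each tag to a sparse vector {resource: number of assignments}.

    Assumption: this is one possible 'resource context' weighting; the
    source only says the vector space is spanned by resources.
    """
    vectors = defaultdict(lambda: defaultdict(int))
    for _user, tag, resource in assignments:
        vectors[tag][resource] += 1
    return vectors


def cossim(v1, v2):
    """Cosine similarity: (v1 . v2) / (||v1||_2 * ||v2||_2)."""
    dot = sum(weight * v2.get(key, 0) for key, weight in v1.items())
    norm1 = math.sqrt(sum(w * w for w in v1.values()))
    norm2 = math.sqrt(sum(w * w for w in v2.values()))
    if norm1 == 0 or norm2 == 0:
        return 0.0  # convention chosen here for empty vectors
    return dot / (norm1 * norm2)


vecs = tag_vectors(Y)
related = cossim(vecs["python"], vecs["programming"])
unrelated = cossim(vecs["python"], vecs["cooking"])
```

With non-negative counts the value stays between 0 (no shared resources) and 1 (same direction), which matches the range quoted in the contexts above. In this toy data `related` is 3/√10 and `unrelated` is 0.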