• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Expertise networks in online communities: structure and algorithms. (2007)

by J Zhang
Venue:In WWW,
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 181
Next 10 →

Knowledge sharing and Yahoo Answers: Everyone knows something

by Lada A. Adamic, Jun Zhang, Eytan Bakshy, Mark S. Ackerman - Proceedings of WWW'08 , 2008
"... Yahoo Answers (YA) is a large and diverse question-answer forum, acting not only as a medium for sharing technical knowledge, but as a place where one can seek advice, gather opinions, and satisfy one’s curiosity about a countless number of things. In this paper, we seek to understand YA’s knowledge ..."
Abstract - Cited by 185 (4 self) - Add to MetaCart
Yahoo Answers (YA) is a large and diverse question-answer forum, acting not only as a medium for sharing technical knowledge, but as a place where one can seek advice, gather opinions, and satisfy one’s curiosity about a countless number of things. In this paper, we seek to understand YA’s knowledge sharing activity. We analyze the forum categories and cluster them according to content characteristics and patterns of interaction among the users. While interactions in some categories resemble expertise sharing forums, others incorporate discussion, everyday advice, and support. With such a diversity of categories in which one can participate, we find that some users focus narrowly on specific topics, while others participate across categories. This not only allows us to map related categories, but to characterize the entropy of the users ’ interests. We find that lower entropy correlates with receiving higher answer ratings, but only for categories where factual expertise is primarily sought after. We combine both user attributes and answer characteristics
(Show Context)

Citation Context

...voted to answering one another’s technical questions. We analyzed a technical question answering community (Java Forum) and explored algorithms using network structure to evaluate expertise levels in =-=[23]-=-. Using simulations, we explored possible social settings and dynamics that may affect the interaction patterns and network structures in online communities [24]. The goal of these studies was to desi...

Finding high-quality content in social media with an application to community-based question answering

by Eugene Agichtein, Aristides Gionis, Carlos Castillo, Gilad Mishne, Debora Donato - In Proceedings of WSDM , 2008
"... The quality of user-generated content varies drastically from excellent to abuse and spam. As the availability of such content increases, the task of identifying high-quality content in sites based on user contributions—social media sites— becomes increasingly important. Social media in general exhi ..."
Abstract - Cited by 184 (14 self) - Add to MetaCart
The quality of user-generated content varies drastically from excellent to abuse and spam. As the availability of such content increases, the task of identifying high-quality content in sites based on user contributions—social media sites— becomes increasingly important. Social media in general exhibit a rich variety of information sources: in addition to the content itself, there is a wide array of non-content information available, such as links between items and explicit quality ratings from members of the community. In this paper we investigate methods for exploiting such community feedback to automatically identify high quality content. As a test case, we focus on Yahoo! Answers, a large community question/answering portal that is particularly rich in the amount and types of content and social interactions available in it. We introduce a general classification framework for combining the evidence from different sources of information, that can be tuned automatically for a given social media type and quality definition. In particular, for the community question/answering domain, we show that our system is able to separate high-quality items from the rest with an accuracy close to that of humans. Categories and Subject Descriptors H.3 [Information Storage and Retrieval]: H.3.1 Content Analysis and Indexing – indexing methods, linguistic
(Show Context)

Citation Context

...responding to the users of a question/answer system and having a directed edge e = (u,v) ∈ E from a user u ∈ V to a user v ∈ V if user u has answered to at least one question of user v. ExpertiseRank =-=[31]-=- corresponds to PageRank over the transposed graph G ′ = (V,E ′ ), that is, a score is propagated from the person receiving the answer to the person giving the answer. The recursion implies that if pe...

Design Lessons from the Fastest Q&A Site in the West

by Lena Mamykina, Bella Manoim, Manas Mittal, George Hripcsak, Björn Hartmann - CHI 2011 , 2011
"... This paper analyzes a Question & Answer site for programmers, Stack Overflow, that dramatically improves on the utility and performance of Q&A systems for technical domains. Over 92 % of Stack Overflow questions about expert topics are answered — in a median time of 11 minutes. Using a mixed ..."
Abstract - Cited by 68 (1 self) - Add to MetaCart
This paper analyzes a Question & Answer site for programmers, Stack Overflow, that dramatically improves on the utility and performance of Q&A systems for technical domains. Over 92 % of Stack Overflow questions about expert topics are answered — in a median time of 11 minutes. Using a mixed methods approach that combines statistical data analysis with user interviews, we seek to understand this success. We argue that it is not primarily due to an a priori superior technical design, but also to the high visibility and daily involvement of the design team within the community they serve. This model of continued community leadership presents challenges to both CSCW systems research as well as to attempts to apply the Stack Overflow model to other specialized knowledge domains.

Identifying Topical Authorities in Microblogs

by Aditya Pal, Scott Counts , 2011
"... Content in microblogging systems such as Twitter is produced by tens to hundreds of millions of users. This diversity is a notable strength, but also presents the challenge of finding the most interesting and authoritative authors for any given topic. To address this, we first propose a set of featu ..."
Abstract - Cited by 58 (1 self) - Add to MetaCart
Content in microblogging systems such as Twitter is produced by tens to hundreds of millions of users. This diversity is a notable strength, but also presents the challenge of finding the most interesting and authoritative authors for any given topic. To address this, we first propose a set of features for characterizing social media authors, including both nodal and topical metrics. We then show how probabilistic clustering over this feature space, followed by a within-cluster ranking procedure, can yield a final list of top authors for a given topic. We present results across several topics, along with results from a user study confirming that our method finds authors who are significantly more interesting and authoritative than those resulting from several baseline conditions. Additionally our algorithm is computationally feasible in near real-time scenarios making it an attractive alternative for capturing the rapidly changing dynamics of microblogs.
(Show Context)

Citation Context

... model of the two texts, their non-stop word overlap, and the ratio between their lengths. Others have modeled CQA as a graph induced as a result of a users’ interactions with other community members =-=[12, 21]-=-. Zhang et. al. [21] modeled CQA as an expertise graph and proposed Expertise Ranking, similar to PageRank [4]. Jurczyk et. al. [12] identified authorities in Q&A communities using link analysis by co...

Questions in, knowledge in?: a study of naver’s question answering community

by Kevin K. Nam, Mark S. Ackerman, Lada A. Adamic - In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI ’09 , 2009
"... Large general-purposed community question-answering sites are becoming popular as a new venue for generating knowledge and helping users in their information needs. In this paper we analyze the characteristics of knowledge generation and user participation behavior in the largest question-answering ..."
Abstract - Cited by 55 (2 self) - Add to MetaCart
Large general-purposed community question-answering sites are becoming popular as a new venue for generating knowledge and helping users in their information needs. In this paper we analyze the characteristics of knowledge generation and user participation behavior in the largest question-answering online community in South Korea, Naver Knowledge–iN. We collected and analyzed over 2.6 million question/answer pairs from fifteen categories between 2002 and 2007, and have interviewed twenty six users to gain insights into their motivations, roles, usage and expertise. We find altruism, learning, and competency are frequent motivations for top answerers to participate, but that participation is often highly intermittent. Using a simple measure of user performance, we find that higher levels of participation correlate with better performance. We also observe that users are motivated in part through a point system to build a comprehensive knowledge database. These and other insights have significant implications for future knowledge generating online communities.
(Show Context)

Citation Context

...ve semantically similar questions that may already contain answers. The validity of such answers is uncertain, a problem that has been addressed by identifying the level of expertise of the answerers =-=[11, 22]-=-, and by using community-generated ratings, such as best answer ratings [2, 1, 3]. In the present paper, rather than tryingto identify high quality answers, we instead examine how such Q&A systems ca...

Learning to Recognize Reliable Users and Content in Social Media with Coupled Mutual Reinforcement

by Jiang Bian, Eugene Agichtein, Yandong Liu, Hongyuan Zha, Ding Zhou - WWW 2009 MADRID! TRACK: DATA MINING / SESSION: GRAPH ALGORITHMS , 2009
"... Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yahoo! Answers have exploded in popularity, and now provide a viable alternative to general purpose Web search. At the same ..."
Abstract - Cited by 50 (2 self) - Add to MetaCart
Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yahoo! Answers have exploded in popularity, and now provide a viable alternative to general purpose Web search. At the same time, the answers to past questions submitted in CQA sites comprise a valuable knowledge repository which could be a gold mine for information retrieval and automatic question answering. Unfortunately, the quality of the submitted questions and answers varies widely- increasingly so that a large fraction of the content is not usable for answering queries. Previous approaches for retrieving relevant and high quality content have been proposed, but they require large amounts of manually labeled data – which
(Show Context)

Citation Context

...content quality in CQA either require large amounts of supervision (e.g., [2]) or focus on the network properties of the CQA without considering the actual content of the information exchanged (e.g., =-=[17]-=-). We observe that user reputation and the quality of the content they produce are often tied together in a mutually reinforcing relationship. Building on this observation, we propose a general framew...

Ecient top-k querying over social-tagging networks

by Ralf Schenkel , Tom Crecelius , Mouna Kacimi , Sebastian Michel , Thomas Neumann , Josiane Xavier Parreira , Gerhard Weikum - In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '08 , 2008
"... ABSTRACT Online communities have become popular for publishing and searching content, as well as for finding and connecting to other users. User-generated content includes, for example, personal blogs, bookmarks, and digital photos. These items can be annotated and rated by different users, and the ..."
Abstract - Cited by 49 (4 self) - Add to MetaCart
ABSTRACT Online communities have become popular for publishing and searching content, as well as for finding and connecting to other users. User-generated content includes, for example, personal blogs, bookmarks, and digital photos. These items can be annotated and rated by different users, and these social tags and derived user-specific scores can be leveraged for searching relevant content and discovering subjectively interesting items. Moreover, the relationships among users can also be taken into consideration for ranking search results, the intuition being that you trust the recommendations of your close friends more than those of your casual acquaintances. Queries for tag or keyword combinations that compute and rank the top-k results thus face a large variety of options that complicate the query processing and pose efficiency challenges. This paper addresses these issues by developing an incremental top-k algorithm with two-dimensional expansions: social expansion considers the strength of relations among users, and semantic expansion considers the relatedness of different tags. It presents a new algorithm, based on principles of threshold algorithms, by folding friends and related tags into the search space in an incremental on-demand manner. The excellent performance of the method is demonstrated by an experimental evaluation on three real-world datasets, crawled from deli.cio.us, Flickr, and LibraryThing.

Discovering Value from Community Activity on Focused Question Answering Sites: A Case Study of Stack Overflow

by Ashton Anderson, Daniel Huttenlocher, Jon Kleinberg, Jure Leskovec - Proc. KDD, 2012
"... Question answering (Q&A) websites are now large repositories of valuable knowledge. While most Q&A sites were initially aimed at providing useful answers to the question asker, there has been a marked shift towards question answering as a community-driven knowledge creation process whose end ..."
Abstract - Cited by 39 (0 self) - Add to MetaCart
Question answering (Q&A) websites are now large repositories of valuable knowledge. While most Q&A sites were initially aimed at providing useful answers to the question asker, there has been a marked shift towards question answering as a community-driven knowledge creation process whose end product can be of enduring value to a broad audience. As part of this shift, specific expertise and deep knowledge of the subject at hand have become increasingly important, and many Q&A sites employ voting and reputation mechanisms as centerpieces of their design to help users identify the trustworthiness and accuracy of the content. To better understand this shift in focus from one-off answers to a group knowledge-creation process, we consider a question together with its entire set of corresponding answers as our fundamental unit of analysis, in contrast with the focus on individual questionanswer pairs that characterized previous work. Our investigation considers the dynamics of the community activity that shapes the set of answers, both how answers and voters arrive over time and how this influences the eventual outcome. For example, we observe significant assortativity in the reputations of co-answerers, relationships between reputation and answer speed, and that the probability of an answer being chosen as the best one strongly depends on temporal characteristics of answer arrivals. We then show that our understanding of such properties is naturally applicable to predicting several important quantities, including the long-term value of the question and its answers, as well as whether a question requires a better answer. Finally, we discuss the implications of these results for the design of Q&A sites.
(Show Context)

Citation Context

...tigated users, their interests and motivation for contribution [1, 19, 4]. Insights from such studies informed the design of network-based ranking algorithms for identifying users with high expertise =-=[10, 17, 25, 26]-=-. The second is the perspective of information retrieval where a question is viewed as a “query” and answers could be thought of as “results” [15, 8, 16, 2, 9]. One goal of this line of work is to tak...

Modeling documents as mixtures of persons for expert finding

by Pavel Serdyukov, Djoerd Hiemstra
"... ..."
Abstract - Cited by 37 (5 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...entcentric approach is still better than HITS based ranking. Query-independent experts discovery using links acquired from posts and replies at specialized forums was studied recently by Zhang et al. =-=[27]-=-. A number of advanced pseudo-relevance feedback techniques are consistently applied to the expert search task. Query expansion from the top retrieved documents performed quite well [2,21]. Macdonald ...

Crowdsourcing and knowledge sharing: Strategic user behavior on taskcn

by Jiang Yang , Lada A Adamic , Mark S Ackerman - In Proceedings of the 9th ACM conference on electronic commerce, EC ’08 , 2008
"... ABSTRACT Witkeys are a thriving type of web-based knowledge sharing market in China, supporting a form of crowdsourcing. In a Witkey site, users offer a small award for a solution to a task, and other users compete to have their solution selected. In this paper, we examine the behavior of users on ..."
Abstract - Cited by 34 (3 self) - Add to MetaCart
ABSTRACT Witkeys are a thriving type of web-based knowledge sharing market in China, supporting a form of crowdsourcing. In a Witkey site, users offer a small award for a solution to a task, and other users compete to have their solution selected. In this paper, we examine the behavior of users on one of the biggest Witkey websites in China, Taskcn.com. On Taskcn, we observed several characteristics in users' activity over time. Most users become inactive after only a few submissions. Others keep attempting tasks. Over time, users tend to select tasks where they are competing against fewer opponents to increase their chances of winning. They will also, perhaps counterproductively, select tasks with higher expected rewards. Yet, on average, they do not increase their chances of winning, and in some categories of tasks, their chances actually decrease. This does not paint the full picture, however, because there is a very small core of successful users who manage not only to win multiple tasks, but to increase their win-tosubmission ratio over time. This core group proposes nearly 20% of the winning solutions on the site. The patterns we observe on Taskcn, we believe, hold clues to the future of crowdsourcing and freelance marketplaces, and raise interesting design implications for such sites.
(Show Context)

Citation Context

...ipants. Nonetheless, quality is high: for example, Wikipedia has been shown to be close in accuracy to Encyclopedia Britannica [13]. Interestingly, a recent trend has been toward increased contribution by non-elite users to both Wikipedia and Del.icio.us [14], with interesting implications for the quality and growth of the systems. Recent work has examined the dynamics of expertise sharing in other online forums besides Taskcn. These systems were quite different from Taskcn, and expose other points in the design space of expertise sharing systems. We examined the relatively focused Java Forum [15] hosted by Sun, where users request and share programming expertise, and the very diverse Yahoo! Answers [16], where many questions are prompts for discussion or support, rather than pure information or advice seeking. In Java Forum, we found that the most active users also tend to be the most expert. They are also likely to be helping newbie questions as well as technical questions requiring high levels of expertise. For Yahoo Answers, we found that in top level categories, such as science and math, where most questions are of a factual nature, specializing within a subcategory correlates wit...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University