Results 1 - 10
of
15
The link-prediction problem for social networks
- J. American Society for Information Science and Technology
"... Given a snapshot of a social network, can we infer which new interactions among its members are likely to occur in the near future? We formalize this question as the link-prediction problem, and we develop approaches to link prediction based on measures for analyzing the “proximity” of nodes in a ne ..."
Abstract
-
Cited by 269 (4 self)
- Add to MetaCart
Given a snapshot of a social network, can we infer which new interactions among its members are likely to occur in the near future? We formalize this question as the link-prediction problem, and we develop approaches to link prediction based on measures for analyzing the “proximity” of nodes in a network. Experiments on large co-authorship networks suggest that information about future interactions can be extracted from network topology alone, and that fairly subtle measures for detecting node proximity can outperform more direct measures. 1
Coauthorship Networks and Patterns of Scientific Collaboration
- Proceedings of the National Academy of Sciences
, 2004
"... Using data from three bibliographic databases in biology, physics, and mathematics respectively, networks are constructed in which the nodes are scientists and two scientists are connected if they have coauthored a paper together. We use these networks to answer a broad variety of questions abou ..."
Abstract
-
Cited by 88 (0 self)
- Add to MetaCart
Using data from three bibliographic databases in biology, physics, and mathematics respectively, networks are constructed in which the nodes are scientists and two scientists are connected if they have coauthored a paper together. We use these networks to answer a broad variety of questions about collaboration patterns, such as the numbers of papers authors write, how many people they write them with, what the typical distance between scientists is through the network, and how patterns of collaboration vary between subjects and over time. We also summarize a number of recent results by other authors on coauthorship patterns.
A probabilistic similarity metric for medline records: A model for author name disambiguation
- Journal of the American Society for Information Science and Technology
, 2005
"... We present a model for estimating the probability that a pair of author names (sharing last name and first initial), appearing on two different Medline articles, refer to the same individual. The model uses a simple yet powerful similarity profile between a pair of articles, based on title, journal ..."
Abstract
-
Cited by 12 (1 self)
- Add to MetaCart
We present a model for estimating the probability that a pair of author names (sharing last name and first initial), appearing on two different Medline articles, refer to the same individual. The model uses a simple yet powerful similarity profile between a pair of articles, based on title, journal name, coauthor names, medical subject headings (MeSH), language, affiliation, and name attributes (prevalence in the literature, middle initial, and suffix). The similarity profile distribution is computed from reference sets consisting of pairs of articles containing almost exclusively author matches versus nonmatches, generated in an unbiased manner. Although the match set is generated automatically and might contain a small proportion of nonmatches, the model is quite robust against contamination with nonmatches. We have created a free, public service (“Author-ity”:
The structure of the genetic programming collaboration network. Genetic Programming and Evolvable Machines
- Genetic Programming and Evolvable Machines
, 2007
"... The genetic programming bibliography aims to be the most complete reference of papers on genetic programming. In addition to locating publications, it contains coauthor and coeditor relationships which have not previously been studied. These reveal some similarities and differences between our field ..."
Abstract
-
Cited by 10 (6 self)
- Add to MetaCart
The genetic programming bibliography aims to be the most complete reference of papers on genetic programming. In addition to locating publications, it contains coauthor and coeditor relationships which have not previously been studied. These reveal some similarities and differences between our field and collaborative social networks in other scientific fields.
Asynchronous discussion groups as Small Worlds and Scale Free Networks
- First Monday
, 2004
"... What is the network form of online discussion groups? What are the topological parameters delineating the interaction on such groups? We report an empirical examination of the form of online discussion groups. We are interested in examining whether such groups conform to the Small World and the Scal ..."
Abstract
-
Cited by 9 (3 self)
- Add to MetaCart
What is the network form of online discussion groups? What are the topological parameters delineating the interaction on such groups? We report an empirical examination of the form of online discussion groups. We are interested in examining whether such groups conform to the Small World and the Scale Free models of networks. Support for these expectations provides a formal expression of growth, survival potential and preferential attachment in the connection patterns in discussion groups. The research questions were tested with a sample of over 8,000 active participants, and over 30,000 messages. We find that the social network resulting from discussion groups is indeed a Scale Free Network, based on In, Out and All Degree distributions. We also find that, for the same sample, discussion groups are a Small World Network too. As expected, the clustering coefficients for these groups differ significantly from random networks, while their characteristic path lengths are similar to random networks. Implications of the topology for the design and understanding of discussion groups include the stability and control of such groups, as well as their
An Algorithmic Approach to Social Networks
- PhD thesis at MIT References 118 Science and Artificial Intelligence Laboratory
, 2005
"... ..."
Search in the Formation of Large Networks: How Random are Socially Generated Networks?
, 2005
"... We present a model of network formation where entering nodes find other nodes to link to both completely at random and through search of the neighborhoods of these randomly met nodes. We show that this model exhibits the full spectrum of features that have been found to characterize large socially g ..."
Abstract
-
Cited by 8 (3 self)
- Add to MetaCart
We present a model of network formation where entering nodes find other nodes to link to both completely at random and through search of the neighborhoods of these randomly met nodes. We show that this model exhibits the full spectrum of features that have been found to characterize large socially generated networks. Moreover, we derive the distribution of degree (number of links) across nodes, and show that while the upper tail of the distribution is approximately “scale-free,” the lower tail may exhibit substantial curvature, just as in observed networks. We then fit the model to data from six networks. Besides offering a close fit of these diverse networks, the model allows us to impute the relative importance of search versus random attachment in link formation. We find that the fitted ratio of random meetings to search-based meetings varies dramatically across these applications. Finally, we show that as this random/search ratio varies, the resulting degree distributions can be completely ordered in the sense of second order stochastic dominance. This allows us to infer how the relative randomness in the formation process affects average utility in the network.
The Genetic Programming Collaboration Network and its Communities
"... Useful information about scientific collaboration structures and patterns can be inferred from computer databases of published papers. The genetic programming bibliography is the most complete reference of papers on genetic programming. In addition to locating publications, it contains coauthor and ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Useful information about scientific collaboration structures and patterns can be inferred from computer databases of published papers. The genetic programming bibliography is the most complete reference of papers on genetic programming. In addition to locating publications, it contains coauthor and coeditor relationships from which a more complete picture of the field emerges. This comprises several statistics and includes the study of the community structure considered as a collaborative social network. The communities found correspond closely to the actual ones and some new facets are highlighted by their automatic analysis. The investigation reveals many similarities between our field and coauthorship networks in other scientific fields but also some

