Results 1 - 10
of
3,051
Mixed membership stochastic block models for relational data with application to protein-protein interactions
- In Proceedings of the International Biometrics Society Annual Meeting
, 2006
"... We develop a model for examining data that consists of pairwise measurements, for example, presence or absence of links between pairs of objects. Examples include protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing such data with p ..."
Abstract
-
Cited by 378 (52 self)
- Add to MetaCart
We develop a model for examining data that consists of pairwise measurements, for example, presence or absence of links between pairs of objects. Examples include protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing such data
TREC 2009 at the University at Buffalo: Interactive Legal E-Discovery With Enron Emails
"... For the TREC 2009, the team from University at Buffalo, the State University of New York participated in the Legal E-Discovery track, working on the interactive search task. We explored indexing and searching at both the record level and the document level with the Enron email collection. We studied ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
For the TREC 2009, the team from University at Buffalo, the State University of New York participated in the Legal E-Discovery track, working on the interactive search task. We explored indexing and searching at both the record level and the document level with the Enron email collection. We
Communication networks from the enron email corpus ”it’s always about the people. enron is no different
- Computational & Mathematical Organization Theory
, 2005
"... The Enron email corpus is appealing to researchers because it is a) a large scale email collection from b) a real organization c) over a period of 3.5 years. In this paper we contribute to the initial investigation of the Enron email dataset from a social network analytic perspective. We report on h ..."
Abstract
-
Cited by 111 (11 self)
- Add to MetaCart
The Enron email corpus is appealing to researchers because it is a) a large scale email collection from b) a real organization c) over a period of 3.5 years. In this paper we contribute to the initial investigation of the Enron email dataset from a social network analytic perspective. We report
Recommending recipients in the enron email corpus
, 2007
"... Email is the most popular communication tool of the internet. In this paper we investigate how email systems can be enhanced to work as recipient recommendation systems, i.e., sug-gesting who recipients of a message might be, while the message is being composed, given its current contents and given ..."
Abstract
-
Cited by 11 (0 self)
- Add to MetaCart
recommendation can also prevent a user from forgetting to add an important collaborator or manager as recipient, preventing costly misunderstandings and communication delays. In this report we present the first study of recipient recommendation in a real large-scale corporate email collection, the Enron Email
The Enron Corpus:
- In ECML
, 2004
"... Automated classification of email messages into user-specific folders and information extraction from chronologically ordered email streams have become interesting areas in text learning research. However, the lack of large benchmark collections has been an obstacle for studying the problems and ..."
Abstract
- Add to MetaCart
Automated classification of email messages into user-specific folders and information extraction from chronologically ordered email streams have become interesting areas in text learning research. However, the lack of large benchmark collections has been an obstacle for studying the problems
The Enron email dataset database schema and brief statistical report
, 2004
"... Email logs have been considered as a useful resource for research in fields like link analysis, social network analysis and textual analysis. Most of the experiments in these fields of research are performed on synthetic data due to lack of an adequate and real life benchmark. The Enron email datase ..."
Abstract
-
Cited by 87 (0 self)
- Add to MetaCart
Email logs have been considered as a useful resource for research in fields like link analysis, social network analysis and textual analysis. Most of the experiments in these fields of research are performed on synthetic data due to lack of an adequate and real life benchmark. The Enron email
A Test Collection for Email Entity Linking
"... Most prior work on entity linking has focused on linking name mentions found in third-person communication (e.g., news) to broad-coverage knowledge bases (e.g., Wikipedia). A restricted form of domain-specific entity linking has, how-ever, been tried with email, linking mentions of people to specifi ..."
Abstract
- Add to MetaCart
to specific email addresses. This paper introduces a new test collection for the task of linking mentions of peo-ple, organizations, and locations to Wikipedia. Annotation of 200 randomly se-lected entities of each type from the Enron email collection indicates that domain-specific knowledge bases are indeed
Making Sense of Archived Email: Exploring the Enron Collection with NetLens
"... Informal communications media pose new challenges for information systems design, but the nature of informal interaction offers new opportunities as well. This paper describes NetLens-Email, a system designed to support exploration of the content-actor network in large email collection. Unique featu ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Informal communications media pose new challenges for information systems design, but the nature of informal interaction offers new opportunities as well. This paper describes NetLens-Email, a system designed to support exploration of the content-actor network in large email collection. Unique
Understanding the NetworkLevel Behavior of Spammers
, 2006
"... This paper studies the network-level behavior of spammers, including: IP address ranges that send the most spam, common spamming modes (e.g., BGP route hijacking, bots), how persistent across time each spamming host is, and characteristics of spamming botnets. We try to answer these questions by ana ..."
Abstract
-
Cited by 290 (22 self)
- Add to MetaCart
by analyzing a 17-month trace of over 10 million spam messages collected at an Internet “spam sinkhole”, and by correlating this data with the results of IP-based blacklist lookups, passive TCP fingerprinting information, routing information, and botnet “command and control ” traces. We find that most spam
Email as spectroscopy: Automated discovery of community structure within organizations
, 2003
"... Abstract. We describe a methodology for the automatic identification of communities of practice from email logs within an organization. We use a betweenness centrality algorithm that can rapidly find communities within a graph representing information flows. We apply this algorithm to an email corpu ..."
Abstract
-
Cited by 205 (7 self)
- Add to MetaCart
Abstract. We describe a methodology for the automatic identification of communities of practice from email logs within an organization. We use a betweenness centrality algorithm that can rapidly find communities within a graph representing information flows. We apply this algorithm to an email
Results 1 - 10
of
3,051