• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 3,177
Next 10 →

Mining Interesting Itemsets in Graph Datasets

by Boris Cule, Bart Goethals, Tayena Hendrickx
"... Abstract. Traditionally, pattern discovery in graphs has been mostly limited to searching for frequent subgraphs, reoccurring patterns within which nodes with certain labels are frequently interconnected in exactly the same way. We relax this requirement by claiming that a set of labels is interesti ..."
Abstract - Add to MetaCart
is interesting if they often occur in each other’s vicinity, but not necessarily always interconnected by exactly the same structures. Searching for such itemsets can be done both in datasets consisting of one large graph, and in datasets consisting of many graphs. We present novel methods dealing with both

Discovering Frequent Topological Structures from Graph Datasets

by R. Jin, C. Wang, D. Polshakov, S. Parthasarathy, G. Agrawal
"... The problem of finding frequent patterns from graph-based datasets is an important one that finds applications in drug discovery, protein structure analysis, XML querying, and social network analysis among others. In this paper we propose a framework to mine frequent large-scale structures, formally ..."
Abstract - Cited by 26 (2 self) - Add to MetaCart
The problem of finding frequent patterns from graph-based datasets is an important one that finds applications in drug discovery, protein structure analysis, XML querying, and social network analysis among others. In this paper we propose a framework to mine frequent large-scale structures

Exploration Using Signatures in Annotation Graph Datasets

by Louiqa Raschid, Guillermo Palma, Maria-esther Vidal, Andreas Thor
"... The widespread development and adoption of ontologies to capture semantic domain knowledge and the growth of annotation graph datasets has created many opportunities for large scale Linked Data analytics. Ontologies are developed by domain experts to capture knowledge specific to some domain. The bi ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
The widespread development and adoption of ontologies to capture semantic domain knowledge and the growth of annotation graph datasets has created many opportunities for large scale Linked Data analytics. Ontologies are developed by domain experts to capture knowledge specific to some domain

gSpan: Graph-Based Substructure Pattern Mining

by Xifeng Yan, Jiawei Han , 2002
"... We investigate new approaches for frequent graph-based pattern mining in graph datasets and propose a novel algorithm called gSpan (graph-based Substructure pattern mining) , which discovers frequent substructures without candidate generation. gSpan builds a new lexicographic order among graphs, and ..."
Abstract - Cited by 650 (34 self) - Add to MetaCart
We investigate new approaches for frequent graph-based pattern mining in graph datasets and propose a novel algorithm called gSpan (graph-based Substructure pattern mining) , which discovers frequent substructures without candidate generation. gSpan builds a new lexicographic order among graphs

Frequent Subgraph Discovery

by Michihiro Kuramochi, George Karypis , 2001
"... Over the years, frequent itemset discovery algorithms have been used to solve various interesting problems. As data mining techniques are being increasingly applied to non-traditional domains, existing approaches for finding frequent itemsets cannot be used as they cannot model the requirement of th ..."
Abstract - Cited by 406 (10 self) - Add to MetaCart
computationally efficient algorithm for finding all frequent subgraphs in large graph databases. We evaluated the performance of the algorithm by experiments with synthetic datasets as well as a chemical compound dataset. The empirical results show that our algorithm scales linearly with the number of input

Agglomerative Clustering of a Search Engine Query Log

by Doug Beeferman, Adam Berger - In Proceedings of the sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , 2000
"... This paper introduces a technique for mining a collection of user transactions with an Internet search engine to discover clusters of similar queries and similar URLs. The information we exploit is "clickthrough data": each record consists of a user's query to a search engine along wi ..."
Abstract - Cited by 330 (0 self) - Add to MetaCart
with the URL which the user selected from among the candidates offered by the search engine. By viewing this dataset as a bipartite graph, with the vertices on one side corresponding to queries and on the other side to URLs, one can apply an agglomerative clustering algorithm to the graph's vertices

Latent space approaches to social network analysis.

by Peter D Hoff , Adrian E Raftery , Mark S Handcock - Journal of the American Statistical Association, , 2002
"... Network models are widely used to represent relational information among interacting units. In studies of social networks, recent emphasis has been placed on random graph models where the nodes usually represent individual social actors and the edges represent the presence of a speci ed relation be ..."
Abstract - Cited by 318 (20 self) - Add to MetaCart
Network models are widely used to represent relational information among interacting units. In studies of social networks, recent emphasis has been placed on random graph models where the nodes usually represent individual social actors and the edges represent the presence of a speci ed relation

Discovering Statistically Significant Biclusters in Gene Expression Data

by Amos Tanay, Roded Sharan, Ron Shamir - In Proceedings of ISMB 2002 , 2002
"... In gene expression data, a bicluster is a subset of the genes exhibiting consistent patterns over a subset of the conditions. We propose a new method to detect significant biclusters in large expression datasets. Our approach is graph theoretic coupled with statistical modelling of the data. Under p ..."
Abstract - Cited by 302 (4 self) - Add to MetaCart
In gene expression data, a bicluster is a subset of the genes exhibiting consistent patterns over a subset of the conditions. We propose a new method to detect significant biclusters in large expression datasets. Our approach is graph theoretic coupled with statistical modelling of the data. Under

Spectral hashing

by Yair Weiss, Antonio Torralba, Rob Fergus , 2009
"... Semantic hashing [1] seeks compact binary codes of data-points so that the Hamming distance between codewords correlates with semantic similarity. In this paper, we show that the problem of finding a best code for a given dataset is closely related to the problem of graph partitioning and can be sho ..."
Abstract - Cited by 284 (4 self) - Add to MetaCart
Semantic hashing [1] seeks compact binary codes of data-points so that the Hamming distance between codewords correlates with semantic similarity. In this paper, we show that the problem of finding a best code for a given dataset is closely related to the problem of graph partitioning and can

A Min-max Cut Algorithm for Graph Partitioning and Data Clustering

by Chris H. Q. Ding, Xiaofeng He, Hongyuan Zha, Ming Gu, Horst Simon , 2001
"... An important application of graph partitioning is data clustering using a graph model -- the pairwise similarities between all data objects form a weighted graph adjacency matrix that contains all necessary information for clustering. Here we propose a new algorithm for graph partition with an objec ..."
Abstract - Cited by 213 (15 self) - Add to MetaCart
An important application of graph partitioning is data clustering using a graph model -- the pairwise similarities between all data objects form a weighted graph adjacency matrix that contains all necessary information for clustering. Here we propose a new algorithm for graph partition
Next 10 →
Results 1 - 10 of 3,177
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University