Relational Approach for Shortest Path Discovery over Large Graphs. (2012)

by J Gao, R M Jin, J S Zhou
Venue:PVLDB

Results 1 - 9 of 9

HLDB: Location-based services in databases

by Ittai Abraham, Daniel Delling, Amos Fiat, Andrew V. Goldberg, Renato F. Werneck - In Proceedings of the 20th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems (GIS’12), 339–348. ACM Press. Best Paper Award, 2012
Abstract - Cited by 7 (4 self)
This paper introduces HLDB, the first practical system that can answer exact spatial queries on continental road networks entirely within a database. HLDB is based on hub labels (HL), the fastest point-to-point algorithm for road networks, and its queries are implemented (quite naturally) in standard SQL. Within the database, HLDB answers exact distance queries and retrieves full shortest-path descriptions in real time, even on networks with tens of millions of vertices. The basic algorithm can be extended in a natural way (still in SQL) to answer much more sophisticated queries, such as finding the ten closest fast-food restaurants. We also introduce efficient new HL-based algorithms for even harder problems, such as best via point, ride sharing, and point of interest prediction. The HLDB framework makes it easy to implement these algorithms in SQL, enabling interactive applications on continental road networks.
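
The abstract above says that hub-label queries can be written naturally in standard SQL. As a rough illustration only, and not the authors' schema or code, the sketch below stores a hand-made labeling for a tiny undirected toy graph in an in-memory SQLite table and answers a distance query with a self-join on the shared hub. The table name `labels`, its columns, the helper names, and the toy data are all assumptions introduced for this example. It relies on the usual cover property of hub labels: for every pair of vertices, some hub on a shortest path between them appears in both labels.

```python
import sqlite3

# Hypothetical toy labeling for the path graph 1 -(4)- 2 -(3)- 3.
# Each vertex maps to {hub: distance from the vertex to that hub}.
TOY_LABELS = {
    1: {1: 0, 2: 4},
    2: {2: 0},
    3: {3: 0, 2: 3},
}


def build_label_table(labels):
    """Load a labeling into an in-memory SQLite table (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE labels ("
        " v INTEGER, hub INTEGER, dist REAL,"
        " PRIMARY KEY (v, hub))"
    )
    rows = [(v, hub, d) for v, hubs in labels.items() for hub, d in hubs.items()]
    conn.executemany("INSERT INTO labels VALUES (?, ?, ?)", rows)
    return conn


def hl_distance(conn, s, t):
    """Return the minimum of dist(s, h) + dist(h, t) over hubs h shared by s and t.

    Returns None when the two labels share no hub.
    """
    row = conn.execute(
        "SELECT MIN(a.dist + b.dist)"
        " FROM labels a JOIN labels b ON a.hub = b.hub"
        " WHERE a.v = ? AND b.v = ?",
        (s, t),
    ).fetchone()
    return row[0]


if __name__ == "__main__":
    conn = build_label_table(TOY_LABELS)
    print(hl_distance(conn, 1, 3))
```

A directed graph would need separate forward and backward labels; this sketch uses a single table only because the toy graph is undirected.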

Citation Context

...echniques surveyed by Delling et al. [12] to SQL is hard. They rely on sophisticated data structures (such as graphs and priority queues) that cannot be implemented nearly as efficiently in databases [18]. The best previous database oracle we are aware of is due to Sankaranarayanan and Samet [35]. On a graph with n vertices, it can answer ε-approximate queries in O(log n) time after a preprocessing sta...

Horton+: A Distributed System for Processing Declarative Reachability Queries over Partitioned Graphs

by Mohamed Sarwat, Sameh Elnikety, Yuxiong He, Mohamed F. Mokbel
Abstract - Cited by 3 (1 self)
Horton+ is a graph query processing system that executes declarative reachability queries on a partitioned attributed multi-graph. It employs a query language, query optimizer, and a distributed execution engine. The query language expresses declarative reachability queries, and supports closures and predicates on node and edge attributes to match graph paths. We introduce three algebraic operators, select, traverse, and join, and a query is compiled into an execution plan containing these operators. Because reachability queries access graph elements in a random access pattern, the graph is maintained in the main memory of a cluster of servers to reduce query execution time. We develop a distributed execution engine that processes a query plan in parallel on the graph servers. Since the query language is declarative, we build a query optimizer that uses graph statistics to estimate predicate selectivity. We experimentally evaluate the system performance on a cluster of 16 graph servers using synthetic graphs as well as a real graph from an application that uses reachability queries. The evaluation shows (1) the efficiency of the optimizer in reducing query execution time, (2) system scalability with the size of the graph and with the number of servers, and (3) the convenience of using declarative queries.

Citation Context

...ern matching [14], where the emphasis is on having reasonable latency for problems that are likely to be NP-complete. Online graph algorithms support simple graph queries, e.g., shortest path queries [17, 47], reachability queries [8, 15, 24], smaller versions of complex queries, e.g., pattern matching queries [15, 50], or approximate queries on a streaming environment [2, 49]. Horton+ focuses on processi...

Shortest Path Computation over Disk-resident Large Graphs based on Extended Bulk Synchronous Parallel Methods

by Zhigang Wang, Yu Gu, Roger Zimmermann, Ge Yu
Abstract
The Single Source Shortest Path (SSSP) computation over large graphs has raised significant challenges to memory capacity and processing efficiency. Utilizing disk-based parallel iterative computing is an economical solution. However, the costs of disk I/O and communication heavily affect performance. This paper proposes a state-transition model for SSSP and then designs two optimization strategies based on it. First, we introduce a tunable hash index to reduce the scale of wasteful data loaded from the disk. Second, we propose a new iterative mechanism and design an Across-step Message Pruning (ASMP) policy to deal with the communication bottleneck. The experimental results illustrate that our SSSP computation is 2 times faster than a basic Giraph (a memory-resident parallel framework) implementation. Compared with Hadoop and Hama (disk-resident parallel frameworks), the speedup ranges from 21 to 43 times.

Citation Context

... than a basic Giraph (a memory-resident parallel framework) implementation. Compared with Hadoop and Hama (disk-resident parallel frameworks), the speedup is 21 to 43. 1 Introduction The Single Source Shortest Path (SSSP) computation is a classical problem with numerous applications and has been well-studied over the past decades. However, new challenges have been raised by the rapid growth of graph data. For instance, up to March 2012, Facebook has owned about 900 million vertices (i.e., users) and over 100 billion edges. Such large graphs have exceeded the memory capacity of a single machine [1]. Even for memory-resident parallel frameworks [2, 3], the data processing capacity of a given cluster is also limited [4]. This problem can be relieved by enlarging the cluster scale, but the consumption will also increase. It is an economic solution if we extend memory-resident parallel frameworks by spilling data on the disk [5]. In this case, how to reduce costs of disk I/O and message communication becomes challenging especially for the iterative computation tasks, such as SSSP. For in-memory algorithms on SSSP, some are difficult to be executed in parallel due to the inherent priority or...

PrefixSolve: Efficiently Solving Multi-Source Multi-Destination Path Queries on RDF Graphs by Sharing Suffix Computations

by Sidan Gao
Abstract
Uncovering the “nature” of the connections between a set of entities, e.g., passengers on a flight and organizations on a watchlist, can be viewed as a Multi-Source Multi-Destination (MSMD) Path Query problem on labeled graph data models such as RDF. Using existing graph-navigational path finding techniques to solve MSMD problems requires queries to be decomposed into multiple single-source or single-destination path subqueries, each of which is solved independently. Navigational techniques typically generate very poor I/O access patterns on large, disk-resident graphs, and for MSMD path queries such poor access patterns may be repeated if common graph exploration steps exist across subqueries. In this paper, we propose an optimization technique for general MSMD path queries that generalizes an efficient algebraic approach for solving a variety of single-source path problems. The generalization enables holistic evaluation of MSMD path queries without the need for query decomposition. We present a conceptual framework for sharing computation in the algebraic framework that is based on “suffix equivalence”. Suffix equivalence amongst subqueries captures the fact that multiple subqueries with different prefixes can share a suffix, so that prefix path computations can share the computation of their common suffix paths. This approach offers orders of magnitude better performance than existing techniques, as demonstrated by a comprehensive experimental evaluation over real and synthetic datasets.

Citation Context

...e applications such as drug target require a view into the entire scope of possible relationships, not just the most immediate ones. Unfortunately, most existing techniques primarily support shortest [18, 15] or bounded length paths/subgraphs [21] or pattern-based subgraphs [12] rather than generalized path querying. For most applications interested in finding paths or connections, the focus is often on s...

Evaluation of SPARQL Property Paths via Recursive SQL

by Nikolay Yakovets, Parke Godfrey, Jarek Gryz
Abstract
Property paths, a part of the proposed SPARQL 1.1 standard, allow for non-trivial navigation in RDF graphs. We investigate the evaluation of SPARQL queries with property paths in a relational RDF store. We propose a translation of SPARQL property paths into recursive SQL and discuss possible optimization strategies.

Citation Context

...hat assumes acyclicity. Finally, we shall compare the performance of our approach to other proposed methods of property path evaluation. In particular, we consider the recently proposed FEM framework [11]. FEM was originally developed to answer shortest paths queries on graphs stored in a relational database by iteratively applying Frontier-Expand-Merge operations. In our work, we plan to adapt FEM to...
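
The citation context above describes FEM only as iteratively applying Frontier-Expand-Merge operations over a graph stored in a relational database. The sketch below is one possible reading of that single sentence, written for illustration, and should not be taken as the FEM authors' implementation. It assumes non-negative edge weights and a hypothetical schema (`edges`, `visited`): each round selects the unfinished nodes with the smallest tentative distance (frontier), joins them with the edge table (expand), and folds the candidate distances back into the visited table (merge).

```python
import sqlite3


def fem_shortest_distances(edges, source):
    """Single-source shortest distances via repeated frontier/expand/merge steps.

    `edges` is an iterable of (src, dst, weight) with non-negative weights.
    Returns {node: distance} for every node reachable from `source`.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE edges (src INTEGER, dst INTEGER, w REAL)")
    conn.execute(
        "CREATE TABLE visited (node INTEGER PRIMARY KEY, dist REAL, done INTEGER)"
    )
    conn.executemany("INSERT INTO edges VALUES (?, ?, ?)", list(edges))
    conn.execute("INSERT INTO visited VALUES (?, 0, 0)", (source,))

    while True:
        # Frontier: smallest tentative distance among unfinished nodes.
        d = conn.execute(
            "SELECT MIN(dist) FROM visited WHERE done = 0"
        ).fetchone()[0]
        if d is None:
            break  # every reachable node is finalized

        # Expand: join the frontier with the edge table, set-at-a-time.
        candidates = conn.execute(
            "SELECT e.dst, MIN(v.dist + e.w)"
            " FROM visited v JOIN edges e ON e.src = v.node"
            " WHERE v.done = 0 AND v.dist = ?"
            " GROUP BY e.dst",
            (d,),
        ).fetchall()
        conn.execute(
            "UPDATE visited SET done = 1 WHERE done = 0 AND dist = ?", (d,)
        )

        # Merge: add new nodes, and lower the distance of unfinished ones.
        for node, dist in candidates:
            conn.execute(
                "INSERT OR IGNORE INTO visited VALUES (?, ?, 0)", (node, dist)
            )
            conn.execute(
                "UPDATE visited SET dist = ?"
                " WHERE node = ? AND done = 0 AND dist > ?",
                (dist, node, dist),
            )

    return dict(conn.execute("SELECT node, dist FROM visited").fetchall())


if __name__ == "__main__":
    toy_edges = [(1, 2, 4), (2, 3, 3), (1, 3, 10), (3, 4, 1)]
    print(fem_shortest_distances(toy_edges, 1))
```

The merge step is driven from a Python loop here for readability; a database-resident version would more likely express it as a single set-oriented statement.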

Towards Unified Ad-hoc Data Processing

by Xiaogang Shi, Bin Cui, Gillian Dobbie, Beng Chin Ooi
Abstract
It is important to provide efficient execution for ad-hoc data processing programs. In contrast to constructing complex declarative queries, many users prefer to write their programs using procedural code with simple queries. As many users are not expert programmers, their programs usually exhibit poor performance in practice and it is a challenge to automatically optimize these programs and efficiently execute the programs. In this paper, we present UniAD, a system designed to simplify the programming of data processing tasks and provide efficient execution for user programs. We propose a novel intermediate representation named UniQL which utilizes HOQs to describe the operations performed in programs. By combining both procedural and declarative logics, we can perform various optimizations across the boundary between procedural and declarative codes. We describe optimizations and conduct extensive empirical studies using UniAD. The experimental results on four benchmarks demonstrate that our techniques can significantly improve the performance of a wide range of data processing programs.

Citation Context

...ese paradigms are usually limited in expressiveness and functionality. As many problems targeted by ad-hoc data processing are very complex, problem formulation in these systems is not a trivial task [35, 11]. Very often, users feel more comfortable writing their programs in procedural languages with embedded declarative primitives. Though many systems offer general purpose programming languages integrate...

Raghavan Raman Oracle Labs

by Adam Welc, Zhe Wu, Sungpack Hong, Hassan Chafi, Jay Banerjee
Abstract
The problem of efficiently analyzing graphs of various shapes and sizes has recently been enjoying an increased level of attention in both academia and industry. This trend prompted the creation of specialized graph databases that have been rapidly gaining popularity of late. In this paper we argue that there exist alternatives to graph databases, providing competitive or superior performance, that do not require replacement of the entire existing storage infrastructure by the companies wishing to deploy them.

Citation Context

... These include the choice of direction to expand at every step, different termination conditions, constraints to avoid expansion to nodes that could never lead to shortest paths, etc. Please refer to [4, 9, 15] for the description of some of these optimizations. 2.1.3 Bi-directional set-based Dijkstra In the non-set based version of Dijkstra’s bi-directional shortest path algorithm, by analogy to the uni-di...

Finding the Cost-Optimal Path with Time Constraint over Time-Dependent Graphs

by Yajun Yang, Hong Gao, Jeffrey Xu Yu, Jianzhong Li
Abstract
Shortest path query is an important problem and has been well studied in static graphs. However, in practice, the costs of edges in graphs always change over time. We call such graphs time-dependent graphs. In this paper, we study how to find a cost-optimal path with time constraint in time-dependent graphs. Most existing works regarding the Time-Dependent Shortest Path (TDSP) problem focus on finding a shortest path with the minimum travel time. All these works are based on the following fact: the earliest arrival time at a vertex v can be derived from the earliest arrival time at v’s neighbors. Unfortunately, this fact does not hold for our problem. In this paper, we propose a novel algorithm to compute a cost-optimal path with time constraint in time-dependent graphs. We show that the time and space complexities of our algorithm are O(kn log n + mk) and O((n + m)k) respectively. We confirm the effectiveness and efficiency of our algorithm through conducting experiments on real datasets with synthetic cost.

Citation Context

...phs and has been well studied in static graphs. The existing works for the shortest path problem propose various index techniques to enhance the efficiency of the shortest path query for large graphs [23, 1, 8, 22, 5, 19]. All these works make a trade-off between the querying time and index size. The main idea of these works are maintaining some shortest paths in index. Given a query, algorithms first retrieve the...

Continuous query processing; Temporal analytics; Dynamic social

by Jayanta Mondal, Amol Deshpande
Abstract
networks; Incremental computation.

Citation Context

...hrough strategic traversal of the underlying graph, e.g., reachability [103, 63, 68, 108], keyword search queries [55, 102, 29, 70], subgraph pattern matching [64, 51, 35, 65], shortest paths queries [106, 53, 58], etc. However, distributed management of dynamic graph data is not as well-studied, especially in the data management research community. Recently there has been an increasing interest in large-scale...
