Results 1 - 10
of
379
Bayeux: An architecture for scalable and fault-tolerant wide-area data dissemination
, 2001
"... The demand for streaming multimedia applications is growing at an incredible rate. In this paper, we propose Bayeux, an efficient application-level multicast system that scales to arbitrarily large receiver groups while tolerating failures in routers and network links. Bayeux also includes specific ..."
Abstract
-
Cited by 465 (12 self)
- Add to MetaCart
mechanisms for load-balancing across replicate root nodes and more efficient bandwidth consumption. Our simulation results indicate that Bayeux maintains these properties while keeping transmission overhead low. To achieve these properties, Bayeux leverages the architecture of Tapestry, a fault-tolerant
1 Bayeux: An Architecture for Scalable and Fault-tolerant Wide-Area Data Dissemination
"... Abstract- The demand for streaming multimedia applications is growing at an incredible rate. In this paper, we propose Bayeux, an efficient application-level multicast system that scales to arbitrarily large receiver groups while tolerating failures in routers and network links. Our simulation resul ..."
Abstract
- Add to MetaCart
results indicate that Bayeux maintains these properties while keeping transmission overhead low. Bayeux leverages the architecture of Tapestry, a fault-tolerant, wide-area overlay routing and location network. I.
Tapestry: An infrastructure for fault-tolerant wide-area location and routing
, 2001
"... In today’s chaotic network, data and services are mobile and replicated widely for availability, durability, and locality. Components within this infrastructure interact in rich and complex ways, greatly stressing traditional approaches to name service and routing. This paper explores an alternative ..."
Abstract
-
Cited by 1250 (31 self)
- Add to MetaCart
and directory information within this infrastructure is purely soft state and easily repaired. Tapestry is self-administering, fault-tolerant, and resilient under load. This paper presents the architecture and algorithms of Tapestry and explores their advantages through a number of experiments.
Exploiting Data-Flow for Fault-Tolerance in a Wide-Area Parallel System
- Proceedings of the 15th International Symposium on Reliable and Distributed Systems
, 1996
"... Wide-area parallel processing systems will soon be available to researchers to solve a range of problems. In these systems, it is certain that host failures and other faults will be a common occurrence. Unfortunately, most parallel processing systems have not been designed with fault-tolerance in mi ..."
Abstract
-
Cited by 17 (8 self)
- Add to MetaCart
Wide-area parallel processing systems will soon be available to researchers to solve a range of problems. In these systems, it is certain that host failures and other faults will be a common occurrence. Unfortunately, most parallel processing systems have not been designed with fault-tolerance
System Architecture: Fault-Tolerant, Wide-Area Access to Computing and Data Resources
"... Nile is a multi-disciplinary project building a distributed computing environment for HEP. It provides wide-area, fault-tolerant, integrated access to processing and data resources for collaborators of the CLEO experiment, though the goals and principles are applicable to many domains. Nile has thre ..."
Abstract
-
Cited by 8 (2 self)
- Add to MetaCart
Nile is a multi-disciplinary project building a distributed computing environment for HEP. It provides wide-area, fault-tolerant, integrated access to processing and data resources for collaborators of the CLEO experiment, though the goals and principles are applicable to many domains. Nile has
Fault Tolerant Wide-Area Parallel Computing
- IPDPS 2000 Workshop
, 2000
"... . Executing parallel applications across distributed networks introduces the problem of fault tolerance. A viable solution for fault tolerance must keep overhead manageable and not compromise the high performance objective of parallel processing. In this paper, we explore two options for achievin ..."
Abstract
-
Cited by 14 (1 self)
- Add to MetaCart
for achieving fault tolerance for a common class of parallel applications, single-program-multiple-data (SPMD). We quantitatively compare checkpoint-recovery and wide-area replication as a means of achieving fault tolerance. The experimental results obtained for a canonical SPMD application suggest
A Survey and Comparison of Peer-to-Peer Overlay Network Schemes
- IEEE COMMUNICATIONS SURVEYS AND TUTORIALS
, 2005
"... Over the Internet today, computing and communications environments are significantly more complex and chaotic than classical distributed systems, lacking any centralized organization or hierarchical control. There has been much interest in emerging Peer-to-Peer (P2P) network overlays because they ..."
Abstract
-
Cited by 302 (1 self)
- Add to MetaCart
or guarantees, hierarchical naming, trust and authentication, and, anonymity. P2P networks potentially offer an efficient routing architecture that is self-organizing, massively scalable, and robust in the wide-area, combining fault tolerance, load balancing and explicit notion of locality. In this paper, we
for Fault-Tolerant Storage Systems
, 2010
"... cleared through the authors ’ institutions. Approximate word count: 8,300. Abstract—Large scale, archival and wide-area storage systems use erasure codes to protect users from losing data due to the inevitable failures that occur. All but the most basic erasure codes employ bit-matrices to perform e ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
cleared through the authors ’ institutions. Approximate word count: 8,300. Abstract—Large scale, archival and wide-area storage systems use erasure codes to protect users from losing data due to the inevitable failures that occur. All but the most basic erasure codes employ bit-matrices to perform
Fault-tolerance in the network storage stack
- In IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, Ft. Lauderdale, FL
, 2002
"... This paper addresses the issue of fault-tolerance in applications that make use of network storage. A network storage abstraction called the Network Storage Stack is presented, along with its constituent parts. In particular, a data type called the exNode is detailed, along with tools that allow it ..."
Abstract
-
Cited by 9 (6 self)
- Add to MetaCart
it to be used to implement a wide-area, striped and replicated file. Using these tools, we evaluate the fault-tolerance of several exNode “files, ” composed of variable-size blocks stored on 14 different machines at five locations throughout the United States. The results demonstrate that while failures
A ddscompliant infrastructure for fault-tolerant and scalable data dissemination
- in: Proceedings of the The IEEE symposium on Computers and Communications, ISCC
"... A DDS-compliant infrastructure for fault-tolerant and scalable data dissemination ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
A DDS-compliant infrastructure for fault-tolerant and scalable data dissemination
Results 1 - 10
of
379