Unreliable Failure Detectors for Reliable Distributed Systems (1996)

by Tushar Deepak Chandra, Sam Toueg
Venue: Journal of the ACM
Citations: 807 - 17 self

BibTeX

@ARTICLE{Chandra96unreliablefailure,
    author = {Tushar Deepak Chandra and Sam Toueg},
    title = {Unreliable Failure Detectors for Reliable Distributed Systems},
    journal = {Journal of the ACM},
    year = {1996},
    volume = {43},
    pages = {225--267}
}

Abstract

We introduce the concept of unreliable failure detectors and study how they can be used to solve Consensus in asynchronous systems with crash failures. We characterise unreliable failure detectors in terms of two properties — completeness and accuracy. We show that Consensus can be solved even with unreliable failure detectors that make an infinite number of mistakes, and determine which ones can be used to solve Consensus despite any number of crashes, and which ones require a majority of correct processes. We prove that Consensus and Atomic Broadcast are reducible to each other in asynchronous systems with crash failures; thus the above results also apply to Atomic Broadcast. A companion paper shows that one of the failure detectors introduced here is the weakest failure detector for solving Consensus [Chandra et al. 1992].
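The two properties named in the abstract can be illustrated with a small timeout-based detector. This is only a minimal sketch under stated assumptions, not the construction from the paper: the class name `TimeoutFailureDetector`, its methods, and the integer clock are all hypothetical, and heartbeats are assumed to be delivered by some outside mechanism. The detector suspects a process that has been silent longer than its timeout (aiming at completeness), and when a suspected process is heard from again it retracts the suspicion and lengthens that process's timeout (aiming at eventual accuracy).

```python
class TimeoutFailureDetector:
    """Hypothetical sketch: suspect a process whose heartbeat is overdue."""

    def __init__(self, processes, initial_timeout=2):
        # Per-process timeout, so a slow process can be given more slack.
        self.timeout = {p: initial_timeout for p in processes}
        self.last_heard = {p: 0 for p in processes}
        self.suspects = set()

    def on_heartbeat(self, p, now):
        """Record a heartbeat from process p at logical time `now`."""
        self.last_heard[p] = now
        if p in self.suspects:
            # The earlier suspicion was a mistake: retract it and be
            # more patient with p from now on.
            self.suspects.discard(p)
            self.timeout[p] += 1

    def tick(self, now):
        """Re-evaluate suspicions at time `now` and return a copy of them."""
        for p, heard in self.last_heard.items():
            if now - heard > self.timeout[p]:
                self.suspects.add(p)
        return set(self.suspects)


# Assumed scenario: p sends one heartbeat and then crashes; q is slow but correct.
fd = TimeoutFailureDetector(["p", "q"], initial_timeout=2)
fd.on_heartbeat("p", 1)
suspects_at_3 = fd.tick(3)   # q has been silent too long and is wrongly suspected
fd.on_heartbeat("q", 4)      # q is heard from: suspicion retracted, timeout raised
suspects_at_5 = fd.tick(5)   # p has been silent since time 1 and is now suspected
```

In this run the detector makes one mistake (suspecting `q`) and later corrects it, while the crashed process `p` ends up suspected. Whether such a detector actually satisfies a given accuracy property depends on timing assumptions about the underlying system, which this sketch does not model.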

Citations

1300 Impossibility of distributed consensus with one faulty process - Fischer, Lynch, et al. - 1985
1069 The Byzantine generals problem - Lamport, Shostak, et al. - 1982
716 Implementing fault-tolerant services using the state machine approach: a tutorial - Schneider - 1990
480 Reliable Communication in the Presence of Failures - Birman, Joseph - 1987
450 Reaching agreement in the presence of faults - Pease, Shostak, et al. - 1980
440 Knowledge and common knowledge in a distributed environment - Halpern, Moses - 1990
400 Consensus in the presence of partial synchrony - Dwork, Lynch, et al. - 1988
374 The weakest failure detector for solving consensus - Chandra, Hadzilacos, et al. - 1996
351 Reliable Broadcast Protocols - Chang, Maxemchuk - 1984
337 Transis: A communication sub-system for high availability - Amir, Dolev, et al. - 1992
282 Fault-tolerant Broadcasts and Related Problems. Chapter 5 - Hadzilacos, Toueg - 1993
217 On the minimal synchronism needed for distributed consensus - Dolev, Dwork, et al. - 1987
210 Atomic broadcast: from simple message diffusion to Byzantine agreement - Cristian, Aghili, et al. - 1995
210 Preserving and using context information in interprocess communication - Peterson, Buchholz, et al. - 1989
178 Another advantage of free choice: completely asynchronous agreement protocols - Ben-Or - 1983
157 Using process groups to implement failure detection in asynchronous environments - Ricciardi, Birman - 1991
150 Memory requirements for agreement among unreliable asynchronous processes - Loui, Abu-Amara - 1987
149 A modular approach to fault-tolerant broadcasts and related problems - Hadzilacos, Toueg - 1994
148 Delta-4: A Generic Architecture for Dependable Distributed Computing, Distributed Fault-Tolerance - Powell, Verissimo - 1991
132 Asynchronous consensus and broadcast protocols - Bracha, Toueg - 1985
102 The Consensus Problem in Unreliable Distributed Systems (A Brief Survey) - Fischer - 1983
89 Reaching approximate agreement in the presence of faults - Dolev, Lynch, et al. - 1986
76 SIFT: Design and Analysis of a Fault-Tolerant Computer for Aircraft Control - Wensley - 1978
69 Revisiting the relationship between Non-blocking Atomic Commitment and Consensus - Guerraoui - 1995
69 Automatically increasing the fault-tolerance of distributed algorithms - Neiger, Toueg - 1990
66 The implementation of reliable distributed multiprocess systems - Lamport - 1978
48 Randomization in Byzantine agreement - Chor, Dwork - 1989
44 Fault-tolerance in the advanced automation system - Cristian, Dancey, et al. - 1990
43 Using failure detectors to solve consensus in asynchronous shared-memory systems (extended abstract) - Lo, Hadzilacos - 1994
41 Bounds on the time to reach agreement in the presence of timing uncertainty - Attiya, Dwork, et al. - 1991
35 Achievable cases in an asynchronous environment - Attiya, Bar-Noy, et al. - 1987
32 A combinatorial characterization of the distributed tasks which are solvable in the presence of one faulty processor - Biran, Moran, et al. - 1988
31 Towards optimal distributed consensus - Berman, Garay, et al. - 1989
27 Cheating husbands and other stories: a case study of knowledge, action, and communication - Moses, Dolev, et al. - 1986
25 Election vs. consensus in asynchronous systems - Sabel, Marzullo - 1995
23 Failure detectors and the wait-free hierarchy (extended abstract) - Neiger - 1995
20 Reliable scheduling in a TMR database system - Pittelli, Garcia-Molina - 1989
18 Fault-tolerant decision making in totally asynchronous distributed systems - Bridgland, Watro - 1987
17 Time and message efficient reliable broadcasts - Chandra, Toueg - 1990
16 A combinatorial characterization of the distributed 1-solvable tasks - Biran, Moran, Zaks - 1990
15 Impossibility of group membership in asynchronous systems - Chandra, Hadzilacos, et al. - 1995
14 A New Solution for the Byzantine Generals Problem - Reischuk - 1985
11 Early-delivery atomic broadcast - Gopal, Strong, et al. - 1990
8 The Amoeba distributed operating system: Selected papers 1984–1987 - Mullender
7 Issues in the design of highly available computing services - Cristian - 1987
5 Isis - A Distributed Programming Environment - Birman - 1990
3 Early-stopping distributed bidding and applications - Budhiraja, Gopal, et al. - 1990
1 E-mail correspondence. Showed that ◇W cannot be used to solve non-blocking atomic commit - Chandra, Larrea - 1994
1 Time and message efficient reliable broadcasts - Chandra, Toueg - 1990
1 Achievable cases in an asynchronous environment - Attiya, Bar-Noy, et al. - 1987