Why Do Computers Stop And What Can Be Done About It? (1985)

by Jim Gray
Citations: 194 - 0 self

Documents Related by Co-Citation

58 Measuring system and software reliability using an automated data collection process. Quality and Reliability Engineering – B Murphy, T Gent - 1995
149 A Census of Tandem System Availability Between 1985 and 1990 – J Gray - 1990
96 Recursive Restartability: Turning the Reboot Sledgehammer into a Scalpel – George Candea, Armando Fox - 2001
45 Failure data analysis of a LAN of Windows NT based computers – M Kalyanakrishnam, Z Kalbarczyk - 1999
50 Networked Windows NT System Field Failure Data Analysis – Jun Xu, Zbigniew Kalbarczyk, Ravishankar K. Iyer - 1999
154 Software rejuvenation: Analysis, module and applications – Yennun Huang, Chandra Kintala, Nick Kolettis, N. Dudley Fulton - 1995
246 An Empirical Study of Operating System Errors – Andy Chou, Junfeng Yang, Benjamin Chelf, Seth Hallem, Dawson Engler - 2001
51 Exploring Failure Transparency and the Limits of Generic Recovery – David E. Lowell - 2000
229 Why do Internet services fail, and what can be done about it? – David Oppenheimer, Archana Ganapathi, David A. Patterson - 2003
211 Hypervisor-based fault tolerance – Thomas C. Bressoud, Fred B. Schneider - 1995
44 To Err is Human – Aaron B. Brown, David A. Patterson - 2001
155 Lessons from Giant-Scale Services – Eric A. Brewer - 2001
12 Analysis of workload influence on dependability – J Meyer, L Wei - 1988
7 Failure analysis and modelling of a VAX cluster system – D Tang, R K Iyer, S S Subramani - 1990
62 Error Log Analysis: Statistical Modeling and Heuristic Trend Analysis – Ting-ting Y. Lin, Daniel P. Siewiorek - 1990
55 Measurement and Modeling of Computer Reliability as Affected by System Activity – R K Iyer, D J Rossetti, M C Hsueh - 1986
45 Failure data analysis of a large-scale heterogeneous server environment – Ramendra K. Sahoo, Mark S. Squillante - 2004
59 Modeling Machine Availability in Enterprise and Wide-area Distributed Computing Environments – Daniel Nurmi, John Brevik, Rich Wolski - 2003
542 A Survey of Rollback-Recovery Protocols in Message-Passing Systems – E. N. (Mootaz) Elnozahy, Lorenzo Alvisi, Yi-min Wang, David B. Johnson - 1996