Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? (2007)

Cached

Download Links

by Bianca Schroeder , Garth A. Gibson
Citations:155 - 8 self

Active Bibliography

52 Understanding Failures in Petascale Computers – Bianca Schroeder, Garth A. Gibson
108 A large-scale study of failures in highperformance computing systems – Bianca Schroeder, Garth A. Gibson - 2006
Characteristics, Impact, and Tolerance of Partial Disk Failures – Lakshmi Narayanan Bairavasundaram - 2008
Towards reliable storage systems – Haryadi S. Gunawi - 2009
M. Gallet et al. Wp Space-Correlated Failures in Distributed SystemsWp – Matthieu Gallet, Nezih Yigitbasi, Bahman Javadi, Derrick Kondo, Ru Iosup, Dick Epema
28 Using Fault Injection and Modeling to Evaluate the Performability of Cluster-Based Services – Kiran Nagaraja, Xiaoyan Li, Ricardo Bianchini, Richard P. Martin, Thu D. Nguyen - 2003
53 Improving Cluster Availability Using Workstation Validation – Taliver Heath, Richard P. Martin, Thu D. Nguyen - 2002
N. Yigitbasi. et al. Wp Time-Correlated Failures in Distributed SystemsWp – Nezih Yigitbasi, Matthieu Gallet, Derrick Kondo, Ru Iosup, Dick Epema
4 A model for space-correlated failures in large-scale distributed systems – Matthieu Gallet, Nezih Yigitbasi, Bahman Javadi, Derrick Kondo, Alexandru Iosup, D. Epema - 2010
10 Understanding Customer Problem Troubleshooting . . . – Weihang Jiang, Chongfeng Hu, Shankar Pasupathy, Arkady Kanevsky, Zhenmin Li, Yuanyuan Zhou
31 An Analysis of Traces from a Production MapReduce Cluster – Soila Kavulya, Jiaqi Tan, Rajeev G, Priya Narasimhan - 2009
94 IRON File Systems – Vijayan Prabhakaran, Nitin Agrawal, Lakshmi Bairavasundaram, Haryadi Gunawi, Andrea C. Arpaci-dusseau, Remzi H. Arpaci-dusseau - 2005
30 Are Disks the Dominant Contributor for Storage Failures? A Comprehensive Study of Storage Subsystem Failure Characteristics – Weihang Jiang, Chongfeng Hu, Yuanyuan Zhou, Arkady Kanevsky
3 An Adaptive Semantic Filter for Blue Gene/L Failure Log Analysis – Yinglung Liang, Yanyong Zhang, Hui Xiong, Ramendra Sahoo - 2007
1 An Adaptive Semantic Filter for Blue Gene/L Failure Log Analysis – Yinglung Liang, Yanyong Zhang, Hui Xiong, Ramendra Sahoo
8 Evaluating the Impact of Communication Architecture on the Performability of Cluster-Based Services – Kiran Nagaraja, Neeraj Krishnan, Ricardo Bianchini, Richard P. Martin, Thu D. Nguyen - 2003
18 Exploring event correlation for failure prediction in coalitions of clusters – Song Fu, Cheng-zhong Xu - 2007
Appears in 4th Usenix Symposium on Internet Technologies and Systems (USITS ‘03), 2003. Why do Internet services fail, and what can be done about it? – Archana Ganapathi, David A. Patterson
4 Why Does Windows Crash – Archana Ganapathi, Archana Ganapathi - 2005