MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

A Tutorial on Reed-Solomon Coding for Fault-Tolerance in RAID-like Systems (1997) [119 citations — 24 self]

Abstract:

It is well-known that Reed-Solomon codes may be used to provide error correction for multiple failures in RAID-like systems. The coding technique itself, however, is not as well-known. To the coding theorist, this technique is a straightforward extension to a basic coding paradigm and needs no special mention. However, to the systems programmer with no training in coding theory, the technique may be a mystery. Currently, there are no references that describe how to perform this coding that do not assume that the reader is already well-versed in algebra and coding theory. This paper is intended for the systems programmer. It presents a complete specification of the coding algorithm plus details on how it may be implemented. This specification assumes no prior knowledge of algebra or coding theory. The goal of this paper is for a systems programmer to be able to implement Reed-Solomon coding for reliability in RAID-like systems without needing to consult any external references.

Citations

581 A case for redundant arrays of inexpensive disks (RAID – Patterson, Gibson, et al. - 1988
405 An Introduction to Disk Drive Modeling – Ruemmler, Wilkes - 1994
377 Efficient Dispersal of Information for Security, Load Balancing and Fault Tolerance – Rabin - 1989
283 Algebraic Coding Theory – Berlekamp - 1968
257 RAID: HighPerformance, Reliable Secondary Storage – Chen, L, et al. - 1994
231 The zebra striped network file system – HARTMAN, OUSTERHOUT - 1995
191 An Introduction to Coding Theory – Lint - 1992
124 Redundant Disk Arrays: Reliable, Parallel Secondary Storage – Gibson - 1990
80 The TickerTAIP Parallel RAID Architecture – Cao, Lim, et al. - 1993
70 On secret sharing systems – Karnin, Green, et al. - 1983
62 Disk array storage system reliability – BURKHARD, MENON - 1993
52 Some applications of Rabin's fingerprinting method – Broder - 1993
50 Evenodd: an optimal scheme for tolerating double disk failures in RAID architectures – BLAUM, BRUCK, et al. - 1993
42 Reed-Solomon Codes and Their Applications – Wicker, Bhargava - 1999
27 Error-Correcting Codes, second edition – Peterson, Weldon - 1972
22 Failure correction techniques for large disk arrays – Gibson, Hellerstein, et al. - 1989
18 Improving the performance of coordinated checkpointers on networks of workstations using RAID techniques – Plank - 1996
17 Algorithm-based diskless checkpointing for fault-tolerant matrix computations – Plank, Kim, et al. - 1995
16 Faster checkpointing with N + 1 parity – Plank, Li - 1994
15 RAID organization and performance – Schwarz, Burkhard - 1992
14 Holographic dispersal and recovery of information – Preparata - 1989
12 The Theory of Error-Correcting Codes, Part I – MACWILLIAMS, SLOANE - 1977
12 Evaluation of checkpoint mechanisms for massively parallel machines – Chiueh, Deng - 1996
12 on-line failure recovery in redundant disk arrays – Fast - 1993
11 Efficient placement of parity and data to tolerate two disk failures in disk array systems – PARK - 1995
5 Maximal and NearMaximal Shift Register Sequences: Efficient Event Counters and Easy Discrete Logarithms – Clark, Weng - 1994
3 Codes for Error Control and Synchronization – Wiggert - 1988
1 Applied Parallel Research – Plank, Li - 1994