Abstract:
It is well-known that Reed-Solomon codes may be used to provide error correction for multiple failures in RAID-like systems. The coding technique itself, however, is not as well-known. To the coding theorist, this technique is a straightforward extension to a basic coding paradigm and needs no special mention. However, to the systems programmer with no training in coding theory, the technique may be a mystery. Currently, there are no references that describe how to perform this coding that do not assume that the reader is already well-versed in algebra and coding theory. This paper is intended for the systems programmer. It presents a complete specification of the coding algorithm plus details on how it may be implemented. This specification assumes no prior knowledge of algebra or coding theory. The goal of this paper is for a systems programmer to be able to implement Reed-Solomon coding for reliability in RAID-like systems without needing to consult any external references.
Citations
|
581
|
A case for redundant arrays of inexpensive disks (RAID
– Patterson, Gibson, et al.
- 1988
|
|
405
|
An Introduction to Disk Drive Modeling
– Ruemmler, Wilkes
- 1994
|
|
377
|
Efficient Dispersal of Information for Security, Load Balancing and Fault Tolerance
– Rabin
- 1989
|
|
283
|
Algebraic Coding Theory
– Berlekamp
- 1968
|
|
257
|
RAID: HighPerformance, Reliable Secondary Storage
– Chen, L, et al.
- 1994
|
|
231
|
The zebra striped network file system
– HARTMAN, OUSTERHOUT
- 1995
|
|
191
|
An Introduction to Coding Theory
– Lint
- 1992
|
|
124
|
Redundant Disk Arrays: Reliable, Parallel Secondary Storage
– Gibson
- 1990
|
|
80
|
The TickerTAIP Parallel RAID Architecture
– Cao, Lim, et al.
- 1993
|
|
70
|
On secret sharing systems
– Karnin, Green, et al.
- 1983
|
|
62
|
Disk array storage system reliability
– BURKHARD, MENON
- 1993
|
|
52
|
Some applications of Rabin's fingerprinting method
– Broder
- 1993
|
|
50
|
Evenodd: an optimal scheme for tolerating double disk failures in RAID architectures
– BLAUM, BRUCK, et al.
- 1993
|
|
42
|
Reed-Solomon Codes and Their Applications
– Wicker, Bhargava
- 1999
|
|
27
|
Error-Correcting Codes, second edition
– Peterson, Weldon
- 1972
|
|
22
|
Failure correction techniques for large disk arrays
– Gibson, Hellerstein, et al.
- 1989
|
|
18
|
Improving the performance of coordinated checkpointers on networks of workstations using RAID techniques
– Plank
- 1996
|
|
17
|
Algorithm-based diskless checkpointing for fault-tolerant matrix computations
– Plank, Kim, et al.
- 1995
|
|
16
|
Faster checkpointing with N + 1 parity
– Plank, Li
- 1994
|
|
15
|
RAID organization and performance
– Schwarz, Burkhard
- 1992
|
|
14
|
Holographic dispersal and recovery of information
– Preparata
- 1989
|
|
12
|
The Theory of Error-Correcting Codes, Part I
– MACWILLIAMS, SLOANE
- 1977
|
|
12
|
Evaluation of checkpoint mechanisms for massively parallel machines
– Chiueh, Deng
- 1996
|
|
12
|
on-line failure recovery in redundant disk arrays
– Fast
- 1993
|
|
11
|
Efficient placement of parity and data to tolerate two disk failures in disk array systems
– PARK
- 1995
|
|
5
|
Maximal and NearMaximal Shift Register Sequences: Efficient Event Counters and Easy Discrete Logarithms
– Clark, Weng
- 1994
|
|
3
|
Codes for Error Control and Synchronization
– Wiggert
- 1988
|
|
1
|
Applied Parallel Research
– Plank, Li
- 1994
|