Metadata Efficiency in a Comprehensive Versioning File System (2002)
Cached
Download Links
| Venue: | In Proc. of FAST 2003 |
| Citations: | 21 - 2 self |
BibTeX
@INPROCEEDINGS{Soules02metadataefficiency,
author = {Craig A. N. Soules and Garth R. Goodson and John D. Strunk and Gregory R. Ganger},
title = {Metadata Efficiency in a Comprehensive Versioning File System},
booktitle = {In Proc. of FAST 2003},
year = {2002}
}
Years of Citing Articles
OpenURL
Abstract
A comprehensive versioning file system creates and retains a new file version for every WRITE or other modification request. The resulting history of file modifications provides a detailed view to tools and administrators seeking to investigate a suspect system state. Conventional versioning systems do not efficiently record the many prior versions that result. In particular, the versioned metadata they keep consumes almost as much space as the versioned data. This paper examines two space-efficient metadata structures for versioning file systems and describes their integration into the Comprehensive Versioning File System (CVFS). Journal-based metadata encodes each metadata version into a single journal entry; CVFS uses this structure for inodes and indirect blocks, reducing the associated space requirements by 80%. Multiversion b-trees extend the per-entry key with a timestamp and keep current and historical entries in a single tree; CVFS uses this structure for directories, reducing the associated space requirements by 99%. Experiments with CVFS verify that its current-version performance is similar to that of non-versioning file systems. Although access to historical versions is slower than conventional versioning systems, checkpointing is shown to mitigate this effect. We thank the members and companies of the PDL Consortium (including EMC, Hewlett-Packard, Hitachi, IBM, Intel, Network Appliance, Panasas, Seagate, Sun, and Veritas) for their interest, insights, feedback, and support. We thank IBM and Intel for hardware grants supporting our







