Extensible, Scalable Monitoring For Clusters of Computers (1997)
Cached
Download Links
- [usenix.org]
- [www.usenix.org]
- [now.cs.berkeley.edu]
- DBLP
Other Repositories/Bibliography
| Venue: | Proc. 1997 Large Installation System Administration Confere (LISA XI |
| Citations: | 25 - 3 self |
BibTeX
@INPROCEEDINGS{Anderson97extensible,scalable,
author = {Eric Anderson},
title = {Extensible, Scalable Monitoring For Clusters of Computers},
booktitle = {Proc. 1997 Large Installation System Administration Confere (LISA XI},
year = {1997},
pages = {9--16}
}
Years of Citing Articles
OpenURL
Abstract
We describe the CARD (Cluster Administration using Relational Databases) system 1 for monitoring large clusters of cooperating computers. CARD scales both in capacity and in visualization to at least 150 machines, and can in principle scale far beyond that. The architecture is easily extensible to monitor new cluster software and hardware. CARD detects and automatically recovers from common faults. CARD uses a Java applet as its primary interface allowing users anywhere in the world to monitor the cluster through their browser.







