MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Host Load Prediction Using Linear Models (2000) [37 citations — 14 self]

by Peter A. Dinda ,  David R. O'Hallaron
Add To MetaCart

Abstract:

This paper evaluates linear models for predicting the Digital Unix five-second host load average from 1 to 30 seconds into the future. A detailed statistical study of a large number of long, fine grain load traces from a variety of real machines leads to consideration of the Box-Jenkins models (AR, MA, ARMA, ARIMA), and the ARFIMA models (due to self-similarity.) We also consider a simple windowed-mean model. The computational requirements of these models span a wide range, making some more practical than others for incorporation into an online prediction system. We rigorously evaluate the predictive power of the models by running a large number of randomized testcases on the load traces and then data-mining their results. The main conclusions are that load is consistently predictable to a very useful degree, and that the simple, practical models such as AR are sufficient for host load prediction. We recommend AR(16) models or better for host load prediction. We implement an online host load prediction system around the AR(16) model and evaluate its overhead, finding that it uses miniscule amounts of CPU time and network bandwidth

Citations

686 The Art of Computer System Performance Analysis – Jain - 1991
343 Adaptive load sharing in homogeneous distributed systems – Eager, Lazowska, et al. - 1986
272 Architectural Support for Quality of Service for CORBA Objects,” Theory and Practice of Object Systems – Zinky, Bakken, et al. - 1997
255 Exploiting process lifetime distributions for load balancing – Harchol-Balter, Downey - 1997
194 Forecasting network performance to support dynamic scheduling using the network weather service – Wolski - 1997
163 The Common Object Request Broker Architecture: Core Specification, Revision 3.0 – Group - 2002
154 Introduction to Time Series and Forecasting – Brockwell, Davis - 1996
124 An introduction to long-memory time series models and fractional differencing – Granger, Joyeux - 1980
115 Load-balancing heuristics and process behavior – Leland, Ott - 1986
95 Java Remote Method Invocation Specification – Microsystems - 1997
94 The Limited Performance Benefits of Migrating Active Processes for Load Sharing – Eager, Lazowska, et al. - 1988
86 Scheduling from the perspective of the application – BERMAN, WOLSKI - 1996
86 The Available Capacity of a Privately Owned Workstation Environment – Mutka, Livny - 1991
77 Statistical methods for data with long-range dependence – Beran - 1992
67 Space-time modelling with long-memory dependence: assessing Ireland’s wind power resource – Haslett, Raftery - 1989
65 Predicting the CPU availability of time-shared unix systems on the computational grid – Wolski, Spring, et al.
43 An Extensible Toolkit for Resource Prediction in Distributed Systems – Dinda, O’Hallaron - 1999
40 O'Hallaron. An evaluation of linear models for host load prediction.In – Dinda, R - 1999
39 Time Series Models for Internet Traffic – Basu, Mukherjee - 1999
34 The performance of a service for network-aware applications – Obraczka, Georghiu - 1998
33 Preliminary report on the design of a framework for distributed visualization – Aeschlimann, Dinda, et al. - 1999
30 A time series model of long-term NSFNET backbone traffic – Groschwitz, Polyzos - 1994
26 The statistical properties of host load. Scientific Programming, 7(3,4 – DINDA - 1999
23 A resource monitoring system for network-aware applications – LOWEKAMP, MILLER, et al. - 1998
16 The case for prediction-based best-effort real-time systems – Dinda, Lowekamp, et al. - 1999
15 On distributed scheduling using load prediction from past information. Abstracts published – SAMADANI, KALTHOFEN - 1996
11 Resource Signal Prediction and Its Application to Real-time Scheduling Advisors – Dinda - 2000
8 Fractal structures and processes – BASSINGTHWAIGHTE, BEARD, et al. - 1995
7 Fracdiff: Maximum likelihood estimation of the parameters of a fractionally differenced ARIMA(p; d; q) model. Computer Program – FRALEY - 1991
4 Fractional differencing, Biometrika 68 – HOSKING - 1981