ZENTURIO: An Experiment Management System for Cluster and Grid Computing
- In Proceedings of the 4th International Conference on Cluster Computing (CLUSTER 2002)
, 2002
Cited by 15 (3 self)
The need to conduct and manage large sets of experiments for scientific applications has increased dramatically over the last decade. However, there is still very little tool support for this complex and tedious process. In this paper we introduce the ZENTURIO experiment management system for parameter studies, performance analysis, and software testing for cluster and Grid architectures. ZENTURIO uses the ZEN directive-based language to specify arbitrarily complex program executions. ZENTURIO is designed as a collection of Grid services that comprise: (1) a registry service which supports registering and locating Grid services; (2) an experiment generator that parses files with ZEN directives and instruments applications for performance analysis and parameter studies; (3) an experiment executor that compiles and controls the execution of experiments on the target machine. A graphical user portal allows the user to control and monitor the experiments and to automatically visualise performance and output data across multiple experiments. ZENTURIO has been implemented based on Java/Jini distributed technology. It supports experiment management on cluster architectures via PBS and on Grid infrastructures through GRAM. We report results of using ZENTURIO for performance analysis of an ocean simulation application and a parameter study of a computational finance code.
ZEN: A Directive-based Language for Automatic Experiment Management of Distributed and Parallel Programs
, 2002
Cited by 8 (5 self)
Performance-oriented code development, software testing, performance analysis, and parameter studies for distributed and parallel systems commonly require a large number of executions. Every execution of an application can be viewed as a scientific experiment. So far there exists very little support for specifying and controlling the execution of a large number of experiments. Various problems must be addressed, such as which input files to read, where to store the program's output, what performance metrics to measure, and what range of problem parameters to observe. This paper describes ZEN, a directive-based language to support automatic experiment management for a wide variety of parallel and distributed architectures. It is used to specify arbitrarily complex program executions in the context of performance analysis and tuning, parameter studies, and software testing. ZEN introduces directives to substitute strings and insert assignment statements inside arbitrary files, such as program files, input files, scripts, or makefiles. This enables the programmer to invoke experiments for arbitrary value ranges of any problem parameter, including program variables, file names, compiler options, target machines, machine sizes, scheduling strategies, data distributions, etc. The number of experiments can be controlled through ZEN constraint directives. Finally, the programmer may request a large set of performance metrics to be computed for any code region of interest. The scope of ZEN directives can be restricted to arbitrary file or code regions.
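To make the idea of value-range substitution and constraints concrete, here is a minimal Python sketch. It does not use ZEN syntax, which the abstract does not show; the placeholder names `NODES` and `SCHEDULE`, the sample script line, and the function `generate_experiments` are all hypothetical. It only illustrates how a cross product of parameter values can be substituted into a file and pruned by a constraint.

```python
from itertools import product
from string import Template


def generate_experiments(template_text, value_ranges, constraint=None):
    """Return one instantiated text per admissible parameter combination."""
    names = list(value_ranges)
    experiments = []
    # The cross product of all value ranges defines the candidate experiments.
    for values in product(*(value_ranges[name] for name in names)):
        binding = dict(zip(names, values))
        # A constraint filters out combinations that should not be run.
        if constraint is not None and not constraint(binding):
            continue
        experiments.append(Template(template_text).substitute(binding))
    return experiments


# Hypothetical job script line with two placeholders to be substituted.
script = "run --nodes $NODES --schedule $SCHEDULE"
ranges = {"NODES": [1, 2, 4], "SCHEDULE": ["static", "dynamic"]}

# Hypothetical constraint: dynamic scheduling only on more than one node.
runs = generate_experiments(
    script,
    ranges,
    lambda b: b["SCHEDULE"] == "static" or b["NODES"] > 1,
)
for line in runs:
    print(line)
```

Without the constraint the two ranges would yield six experiments; the constraint removes the single-node dynamic run, which is the kind of pruning the abstract attributes to constraint directives.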
Automatic Search for Performance Problems in Parallel and Distributed Programs by Using Multi-Experiment Analysis
- In International Conference on High Performance Computing (HiPC 2002)
, 2002
Cited by 7 (2 self)
We introduce Aksum, a novel system for performance analysis that helps programmers to locate and to understand performance problems in message passing, shared memory and mixed parallel programs.
Teuta: Tool Support for Performance Modeling of Distributed and Parallel Applications
- International Conference on Computational Science, Tools for Program Development and Analysis in Computational Science, Krakow
, 2004
Cited by 6 (2 self)
In this paper we describe Teuta, which we have developed to provide tool support for the UML-based performance modeling of distributed and parallel applications. Teuta's features include model checking and model traversing. Model checking is used to verify whether the model conforms to the UML specification. In addition, Teuta supports semantic model checking for the domain of high performance computing. Model traversing is used to generate different model representations. We also present our methodology for automatically generating a simulation model from the UML model of an application. This simulation model is used to evaluate the performance of the application. We demonstrate the usefulness of Teuta by modeling LAPW0, a distributed material science application.
On Utilizing Experiment Data Repository for Performance Analysis of Parallel Applications
- In 9th International Europar Conference (EuroPar 2003), Lecture Notes in Computer Science
, 2003
Cited by 6 (3 self)
Performance data usually must be archived for various performance analysis and optimization tasks such as multi-experiment analysis, performance comparison, and automated performance diagnosis. However, little effort has been made to employ data repositories to organize and store performance data. This lack of systematic organization of data has hindered several aspects of performance analysis tools, such as performance comparison, performance data sharing, and tool integration. In this paper we describe our approach to exploiting a relational experiment data repository in SCALEA, which is a performance instrumentation, measurement, analysis, and visualization tool for parallel programs.
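As a rough illustration of why a relational repository helps with multi-experiment comparison, the following Python sketch stores measurements from two runs in an in-memory SQLite database and derives speedups with a single join. The schema, the table and column names, the application name `demo_app`, and the sample values are all invented for this example; the abstract does not describe SCALEA's actual repository layout.

```python
import sqlite3

# Hypothetical two-table schema: one row per experiment, one row per measurement.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE experiment (
        id INTEGER PRIMARY KEY,
        application TEXT NOT NULL,
        processors INTEGER NOT NULL
    );
    CREATE TABLE measurement (
        experiment_id INTEGER NOT NULL REFERENCES experiment(id),
        code_region TEXT NOT NULL,
        metric TEXT NOT NULL,
        value REAL NOT NULL
    );
    """
)

# Invented sample data: the same code region measured on 1 and 4 processors.
conn.executemany(
    "INSERT INTO experiment VALUES (?, ?, ?)",
    [(1, "demo_app", 1), (2, "demo_app", 4)],
)
conn.executemany(
    "INSERT INTO measurement VALUES (?, ?, ?, ?)",
    [
        (1, "main_loop", "wall_time_s", 80.0),
        (2, "main_loop", "wall_time_s", 25.0),
    ],
)


def speedups(connection, region, metric="wall_time_s"):
    """Map processor count to speedup relative to the smallest run."""
    rows = connection.execute(
        """
        SELECT e.processors, m.value
        FROM measurement AS m
        JOIN experiment AS e ON e.id = m.experiment_id
        WHERE m.code_region = ? AND m.metric = ?
        ORDER BY e.processors
        """,
        (region, metric),
    ).fetchall()
    baseline = rows[0][1]
    return {procs: baseline / value for procs, value in rows}


result = speedups(conn, "main_loop")
print(result)
```

Because every experiment lands in the same tables, comparing runs becomes a query rather than a custom parser per output file, which is the kind of benefit the abstract points to.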
From Web Services to OGSA: Experiences in Implementing an OGSA-based Grid Application
- 4th International Workshop on Grid Computing
, 2003
Cited by 5 (0 self)
In previous work we have presented the ZENTURIO experiment management system for performance and parameter studies of parallel and distributed applications on cluster and Grid architectures. In this paper we describe experiences from ongoing work targeting the implementation of ZENTURIO on top of the Open Grid Services Architecture (OGSA). We analyse the opportunities offered by a Web services toolkit to develop Grid services as required by OGSA and compare them with the solutions offered by the Open Grid Services Infrastructure (OGSI) specification. Issues regarding proxy management, service lifecycle, UDDI service repository, firewall management, Factory and Registry services, service throughput, and security are comparatively analysed in both implementations.
A Web Service-based Experiment Management System for the Grids
- In 17th International Parallel and Distributed Processing Symposium (IPDPS 2003)
, 2002
Cited by 4 (1 self)
We have developed ZENTURIO, which is an experiment management system for performance and parameter studies as well as software testing for cluster and Grid architectures. In this paper we describe our experience with developing ZENTURIO as a collection of Web services. A directive-based language called ZEN is used to annotate arbitrary files and specify arbitrary application parameters. An Experiment Generator Web service parses annotated application files and generates appropriate codes for experiments. An Experiment Executor Web service compiles, executes, and monitors experiments on a single local machine or a set of local machines on the Grid. Factory and Registry services are employed to create and register Web services, respectively. An event infrastructure has been customised to support high-level events under ZENTURIO in order to avoid expensive polling and to detect important system and application status information. A graphical user portal allows the user to generate, control, and monitor experiments. We compare our design with the Open Grid Service Architecture (OGSA) and highlight similarities and differences. We report results of using ZENTURIO to conduct performance analysis of a material science code that executes on the Grid under the Globus Grid infrastructure.
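The contrast between event notification and polling can be sketched in a few lines of Python. This is a generic publish/subscribe pattern, not ZENTURIO's event infrastructure; the class `EventBus`, the event names, and the payload fields are hypothetical. The point is that a client registers a callback once and is told about status changes, instead of repeatedly asking a service for its state.

```python
from collections import defaultdict


class EventBus:
    """Tiny in-process publish/subscribe registry (illustrative only)."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, event_type, callback):
        # A client registers interest once instead of polling for status.
        self._subscribers[event_type].append(callback)

    def publish(self, event_type, payload):
        # The producer pushes the event to every registered callback.
        for callback in self._subscribers.get(event_type, []):
            callback(payload)


log = []
bus = EventBus()

# Hypothetical high-level events a portal might care about.
bus.subscribe("experiment_finished", lambda p: log.append(("finished", p["id"])))
bus.subscribe("experiment_failed", lambda p: log.append(("failed", p["id"])))

# Hypothetical executor reporting status changes as they happen.
bus.publish("experiment_finished", {"id": 7})
bus.publish("experiment_failed", {"id": 8})
bus.publish("experiment_queued", {"id": 9})  # nobody subscribed; ignored

print(log)
```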
Performance Analysis for MPI Applications with SCALEA
, 2002
Cited by 2 (0 self)
The performance of message passing programs can be challenging to comprehend. In previous work we have introduced SCALEA, which is a performance instrumentation, measurement, analysis, and visualization tool for parallel and distributed programs. In this paper we report on experiences with SCALEA for performance analysis of two realistic MPI codes taken from laser physics and material science. SCALEA has been used to automatically instrument the source codes based on user-provided directives, to compute performance overheads, to relate them to the source code, and to provide a range of performance diagrams in order to explain performance problems as part of a graphical user interface. Multiple-experiment performance analysis makes it possible to compare and evaluate the performance outcome of several experiments which have been conducted on an SMP cluster architecture.
Monitoring and Performance Analysis of Grid Application
- In: Computational Science - ICCS 2003
, 2003
Cited by 2 (1 self)
This paper presents the main ideas and design details of a performance analysis tool, G-PM, and a Grid application monitoring system, OCM-G, for applications running on the Grid; both are under development within the EU CrossGrid project. Besides the operation of G-PM’s components, we give an overview of its internal interfaces. G-PM not only enables standard measurements but also supports application-specific metrics and high-level measurements. The OCM-G aims to provide services through which tools supporting application development can gather information, perform manipulations, and detect events that occur while applications are running. The functionality of the OCM-G is available via a standardized interface, On-line Monitoring
Evaluating OpenMP Performance Analysis Tools with the APART Test Suite
- Fifth European Workshop on OpenMP (EWOMP ’03), RWTH Aachen
, 2003
Cited by 2 (1 self)
This paper outlines the design of ATS (the APART Test Suite) for evaluating (automatic) performance analysis tools with respect to their effectiveness in detecting actual performance problems, with focus on the ATS test programs related to OpenMP. It reports on results from applying two OpenMP performance analysis tools to the test cases generated from ATS.