Results 1 - 10 of 145,622
Job Management Systems Analysis
"... Job Management System (JMS) is a system responsible for managing users’ jobs on a computer cluster. In this paper, we describe the common JMS architecture and the functionality that a JMS has to implement. Furthermore, we describe in detail the features of, and our own experiences with, the following systems: Condor ..."
Abstract
Job Management System (JMS) is a system responsible for managing users’ jobs on a computer cluster. In this paper, we describe the common JMS architecture and the functionality that a JMS has to implement. Furthermore, we describe in detail the features of, and our own experiences with, the following systems
A System Model for Distributed Job Scheduling: The Distributed Job Management System
, 1993
"... A system model for distributed job scheduling: the distributed job management system ..."
Performance evaluation of selected job management systems
- In Proc. of 16th IPDPS
, 2002
"... One important component of grid software infrastructure and parallel systems management is the Job Management System (JMS). With many JMSs available commercially and in the public domain, it is difficult to choose the most efficient JMS for a given computing environment. All previous comparisons of JMSs ..."
Abstract
-
Cited by 2 (0 self)
One important component of grid software infrastructure and parallel systems management is the Job Management System (JMS). With many JMSs available commercially and in the public domain, it is difficult to choose the most efficient JMS for a given computing environment. All previous comparisons
Effective Utilization and Reconfiguration of Distributed Hardware Resources Using Job Management Systems
- In Proceedings of Parallel and Distributed Processing Symposium, International
, 2003
"... Reconfigurable hardware resources are very expensive, and yet can be underutilized. This paper describes a middleware capable of discovering underutilized computing nodes with FPGA-based accelerator boards in a networked environment. Using an extended Job management system (JMS), this middleware per ..."
Abstract
-
Cited by 3 (0 self)
Reconfigurable hardware resources are very expensive, and yet can be underutilized. This paper describes a middleware capable of discovering underutilized computing nodes with FPGA-based accelerator boards in a networked environment. Using an extended Job management system (JMS), this middleware
Exploring Distributed Resource Allocation Techniques in the SLURM Job Management System
"... With the exponential growth of distributed computing systems in both flops and cores, scientific applications are growing more diverse with a variety of workloads. These workloads include traditional large-scale High Performance Computing MPI jobs, and ensemble workloads, such as Many-T ..."
Abstract
management system that is magnitudes more scalable and available than today’s centralized batch-scheduled job management systems. In this paper, we present a distributed job launch prototype SLURM++, which extends the SLURM resource manager by integrating the ZHT zero-hop distributed key-value store
Next Generation Job Management Systems for Extreme Scales
- under review at ACM HPDC
, 2014
"... With the exponential growth of supercomputers in parallelism, applications are growing more diverse, including traditional large-scale HPC MPI jobs, and ensemble workloads such as finer-grained many-task computing (MTC) applications. Delivering high throughput and low latency for both workloads requ ..."
Abstract
-
Cited by 2 (2 self)
requires developing a distributed job management system that is magnitudes more scalable than today’s centralized ones. In this paper, we present a distributed job launch prototype, SLURM++, which is comprised of multiple controllers with each one managing a partition of SLURM daemons, while ZHT (a
Evaluating scalability and efficiency of the Resource and Job Management System on large HPC Clusters
"... The Resource and Job Management System (RJMS) is the middleware in charge of delivering computing power to applications in HPC systems. The increasing number of computational resources in modern supercomputers brings new levels of parallelism and complexity. To maximize the global throug ..."
Abstract
The Resource and Job Management System (RJMS) is the middleware in charge of delivering computing power to applications in HPC systems. The increasing number of computational resources in modern supercomputers brings new levels of parallelism and complexity. To maximize the global
Knowledge management and knowledge management systems: Conceptual foundations and an agenda . . .
, 1998
Decentralized Trust Management
- In Proceedings of the 1996 IEEE Symposium on Security and Privacy
, 1996
"... We identify the trust management problem as a distinct and important component of security in network services. Aspects of the trust management problem include formulating security policies and security credentials, determining whether particular sets of credentials satisfy the relevant policies, an ..."
Abstract
-
Cited by 1025 (24 self)
, and deferring trust to third parties. Existing systems that support security in networked applications, including X.509 and PGP, address only narrow subsets of the overall trust management problem and often do so in a manner that is appropriate to only one application. This paper presents a comprehensive
Federated database systems for managing distributed, heterogeneous, and autonomous databases
- ACM Computing Surveys
, 1990
"... A federated database system (FDBS) is a collection of cooperating database systems that are autonomous and possibly heterogeneous. In this paper, we define a reference architecture for distributed database management systems from system and schema viewpoints and show how various FDBS architectures c ..."
Abstract
-
Cited by 1218 (34 self)
A federated database system (FDBS) is a collection of cooperating database systems that are autonomous and possibly heterogeneous. In this paper, we define a reference architecture for distributed database management systems from system and schema viewpoints and show how various FDBS architectures