Results 1 - 10
of
13
LegionFS: A Secure and Scalable File System Supporting Cross-Domain High-Performance Applications
- in Proceddings of the ACM/IEEE SuperComputing 2001 (SC 2001), Computational Grid I/O
, 2001
"... Realizing that current file systems can not cope with the diverse requirements of wide-area collaborations, researchers have developed data access facilities to meet their needs. Recent work has focused on comprehensive data access architectures. In order to fulfill the evolving requirements in this ..."
Abstract
-
Cited by 46 (4 self)
- Add to MetaCart
Realizing that current file systems can not cope with the diverse requirements of wide-area collaborations, researchers have developed data access facilities to meet their needs. Recent work has focused on comprehensive data access architectures. In order to fulfill the evolving requirements in this environment, we suggest a more fully-integrated architecture built upon the fundamental tenets of naming, security, scalability, extensibility, and adaptability. These form the underpinning of the Legion File System (LegionFS). This paper motivates the need for these requirements and presents benchmarks that highlight the scalability of LegionFS. LegionFS aggregate throughput follows the linear growth of the network, yielding an aggregate read bandwidth of 193.8 MB/s on a 100 Mbps Ethernet backplane with 50 simultaneous readers. The serverless architecture of LegionFS is shown to benefit important scientific applications, such as those accessing the Protein Data Bank, within both local- and wide-area environments. 1.
VMPlants: Providing and Managing Virtual Machine Execution Environments for Grid Computing
- in SC ’04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing
, 2004
"... Virtual machines provide flexible, powerful execution environments for Grid computing, offering isolation and security mechanisms complementary to operating systems, customization and encapsulation of entire application environments, and support for legacy applications. This paper describes a Grid s ..."
Abstract
-
Cited by 45 (3 self)
- Add to MetaCart
Virtual machines provide flexible, powerful execution environments for Grid computing, offering isolation and security mechanisms complementary to operating systems, customization and encapsulation of entire application environments, and support for legacy applications. This paper describes a Grid service – VMPlant – that provides for automated configuration and creation of flexible VMs that, once configured to meet application needs, can then subsequently be copied (“cloned”) and dynamically instantiated to provide homogeneous execution environments across distributed Grid resources. In combination with complementary middleware for user, data and resource management, the functionality enabled by VMPlant allows for problem-solving environments to deliver Grid applications to users with unprecedented flexibility. VMPlant supports a graph-based model for the definition of customized VM configuration actions; partial graph matching, VM state storage and “cloning ” for efficient creation. This paper presents the VMPlant architecture, describes a prototype implementation of the service, and presents an analysis of its performance. 1
User-Level Remote Data Access in Overlay Metacomputers
- In Proceedings of the 4th IEEE International Conference on Cluster Computing
, 2002
"... A practical problem faced by users of metacomputers and computational grids is: If my computation can move from one system to another, how can I ensure that my data will still be available to my computation? Depending on the level of software, technical, and administrative support available, a data ..."
Abstract
-
Cited by 14 (4 self)
- Add to MetaCart
A practical problem faced by users of metacomputers and computational grids is: If my computation can move from one system to another, how can I ensure that my data will still be available to my computation? Depending on the level of software, technical, and administrative support available, a data grid or a distributed file system would be reasonable solutions. However, it is not always possible (or practical) to have a diverse group of systems administrators agree to adopt a common infrastructure to support remote data access. Yet, having transparent access to any remote data is an important, practical capability. We have developed the Trellis File System (Trellis FS) to allow programs to access data files on any file system and on any host on a network that can be named by a Secure Copy Locator (SCL) or a Uniform Resource Locator (URL). Without requiring any new protocols or infrastructure, Trellis can be used on practically any POSIX-based system on the Internet. Read access, write access, sparse access, local caching of data, prefetching, and authentication are supported. Trellis is implemented as a user-level C library, which mimics the standard stream I/O functions, and is highly portable. Trellis is not a replacement for traditional file systems or data grids; it provides new capabilities by overlaying on top of other file systems, including grid-based file systems. And, by building upon an alreadyexisting infrastructure (i.e., Secure Shell and Secure Copy), Trellis can be used in situations where a suitable data grid or distributed file system does not yet exist.
Distributed File System Support for Virtual Machines in Grid Computing
- In Proceedings of High Performance Distributed Computing (HPDC
, 2004
"... This paper presents a data management solution which allows fast Virtual Machine (VM) instantiation and efficient run-time execution to support VMs as execution environments in Grid computing. It is based on novel distributed file system virtualization techniques and is unique in that: 1) it provide ..."
Abstract
-
Cited by 14 (3 self)
- Add to MetaCart
This paper presents a data management solution which allows fast Virtual Machine (VM) instantiation and efficient run-time execution to support VMs as execution environments in Grid computing. It is based on novel distributed file system virtualization techniques and is unique in that: 1) it provides on-demand access to VM state for unmodified VM monitors; 2) it supports user-level and write-back disk caches, per-application caching policies and middleware-driven consistency models; and 3) it supports the use of meta-data associated with files to expedite data transfers. The paper reports on its performance in a WAN setup using VMware-based VMs. Results show that the solution delivers performance over 30 % better than native NFS and can bring application-perceived overheads below 10 % relatively to a local disk setup. The solution also allows a VM with 1.6GB virtual disk and 320MB virtual memory to be cloned within 160 seconds when it is first instantiated (and within 25 seconds for subsequent clones). 1.
VP/GFS: an Architecture for Virtual Private Grid File Systems
- ACIS Laboratory, Department of Electrical and Computer Engineering, University of Florida
, 2003
"... This paper describes techniques for establishing private distributed file system sessions for computational grids. These techniques build on previous work on proxy-based virtualization of Network File Systems (NFS); novel in this paper are the support for multiple proxies, encrypted tunneling via th ..."
Abstract
-
Cited by 6 (5 self)
- Add to MetaCart
This paper describes techniques for establishing private distributed file system sessions for computational grids. These techniques build on previous work on proxy-based virtualization of Network File Systems (NFS); novel in this paper are the support for multiple proxies, encrypted tunneling via the Secure Shell protocol (SSH), and user-level, proxy-based client disk caching. These techniques can be implemented through middleware-controlled creation of proxies and tunnels, and can be integrated with existing grid infrastructures. Results from performance analyses conducted in wide- and local-area setups show that the application-perceived overhead of this solution is small (10% or less) for the compute-intensive application SPECseis. For the remaining studied applications -- Andrew, virtual machine (VM) boot, and VM resume -- results show that efficient caching is important to achieve good performance. With VM state cached in disk by a client file system proxy, subsequent instantiations of a resumable non-persistent classic VM take 6 to 16 seconds.
Supporting Application-Tailored Grid File System Sessions with WSRF-Based Services
- in: Proc. the 14th IEEE International Symposium on High Performance Distributed Computing
, 2005
"... This paper presents novel service-based Grid data management middleware that leverages standards defined by WSRF specifications to create and manage dynamic Grid file system sessions. A unique aspect of the service is that the sessions it creates can be customized to address application data transfe ..."
Abstract
-
Cited by 6 (5 self)
- Add to MetaCart
This paper presents novel service-based Grid data management middleware that leverages standards defined by WSRF specifications to create and manage dynamic Grid file system sessions. A unique aspect of the service is that the sessions it creates can be customized to address application data transfer needs. Application-tailored configurations enable selection of both performance-related features (block-based partial file transfers and/or whole-file transfers, cache parameters and consistency models) and reliability features (file system copy-on-write checkpointing to aid recovery of client-side failures; replication, autonomous failure detection and data access redirection for server-side failures). These enhancements, in addition to cross-domain user identity mapping and encrypted communication, are implemented via user level proxies managed by the service, requiring no changes to existing kernels. Sessions established using the service are mounted as distributed file systems and can be used transparently by unmodified binary applications. The paper analyzes the use of the service to support virtual machine based Grid systems and workflow execution, and also reports on the performance and reliability of service managed wide-area file system sessions with experiments based on scientific applications (NanoMOS/Matlab, CH1D, GAUSS and SPECseis). 1.
Active Yellow Pages: A Pipelined Resource Management Architecture for Wide-Area Network Computing
- In Proceedings of the 10th IEEE International Symposium on High Performance Distributed Computing (HPDC'01
, 2001
"... management architecture for computational grids. The design is based on two key realizations. One is that resource management involves a sequence of tasks that is best handled by a pipeline. As shown in the paper, this approach results in a scalable architecture for decentralized scheduling. The oth ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
management architecture for computational grids. The design is based on two key realizations. One is that resource management involves a sequence of tasks that is best handled by a pipeline. As shown in the paper, this approach results in a scalable architecture for decentralized scheduling. The other realization is that static aggregation of resources for improved scheduling is inadequate in wide-area computing environments because the needs of users and jobs change with both, location and time. The described architecture addresses this problem by dynamically aggregating resources in a manner that continuously optimizes system response. This is accomplished by way of an active yellow pages directory that allows aggregation constraints to be (re)defined on the fly. An initial prototype of the active yellow pages service has been deployed in the PUNCH network computing environment. Experiences with the production PUNCH system and preliminary results from controlled experiments indicate that the active yellow pages service performs well.
Towards autonomic virtual applications in the in-vigo system
- In Proceedings of the 2nd IEEE 117terrlational Corlference on Autor7ornic Computing (ICA C05
, 2005
"... Grid environments enable users to share non-dedicated resources that lack performance guarantees. This paper describes the design of application-centric middleware components to automatically recover from failures and dynamically adapt to grid environments with changing resource availabilities, impr ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Grid environments enable users to share non-dedicated resources that lack performance guarantees. This paper describes the design of application-centric middleware components to automatically recover from failures and dynamically adapt to grid environments with changing resource availabilities, improving fault-tolerance and performance. The key components of the application-centric approach are a global per-application execution history and an autonomic component that tracks the performance of a job on a grid resource against predictions based on the application execution history, to guide rescheduling decisions. Performance models of unmodified applications built using their execution history are used to predict failure as well as poor performance. A prototype of the proposed approach, an Autonomic Virtual Application Manager (AVAM), has been implemented in the context of the In-VIGO grid environment and its effectiveness has been evaluated for applications that generate CPU-intensive jobs with relatively short execution times (ranging from tens of seconds to less than an hour) on resources with highly variable loads-- a workload generated by typical educational usage scenarios of In-VIGO-like grid environments. A memory-based learning algorithm is used to build the performance models for CPU-intensive applications that are used to predict the need for rescheduling. Results show that In-VIGO jobs managed by the AVAM consistently meet their execution deadlines under varying load conditions and gracefully recover from unexpected failures. 1.
Support for Data-Intensive, Variable-Granularity Grid Applications via Distributed File System Virtualization - A Case Study of Light Scattering Spectroscopy
"... A key challenge faced by large-scale, distributed applications in Grid environments is efficient, seamless data management. In particular, for applications that can benefit from access to data at variable granularities, data management can pose additional programming burdens to an application develo ..."
Abstract
-
Cited by 2 (2 self)
- Add to MetaCart
A key challenge faced by large-scale, distributed applications in Grid environments is efficient, seamless data management. In particular, for applications that can benefit from access to data at variable granularities, data management can pose additional programming burdens to an application developer. This paper presents a case for the use of virtualized distributed file systems as a basis for data management for data-intensive, variable-granularity applications. The approach leverages on-demand transfer mechanisms of existing, de-facto network file system clients and servers that support transfers of partial data sets in an application-transparent fashion, and complement them with user-level performance and functionality enhancements such as caching and encrypted communication channels. The paper uses a nascent application from the medical imaging field (Light Scattering Spectroscopy – LSS) as a motivation for the approach, and as a basis for evaluating its performance. Results from performance experiments that consider the 16-processor parallel execution of LSS analysis and database generation programs show that, in the presence of data locality, a virtualized wide-area distributed file system setup and configured by Grid middleware can achieve performance levels close (13 % overhead or less) to that of a local disk, and superior (up to 680 % speedup) to non-virtualized distributed file systems. 1.
unknown title
"... Governments increasingly need to collaborate with each other in order to solve global problems [1]. Information gathering, sharing, processing and retrieval are inherent to such collaborative efforts. Information systems that support international collaborations among governments face challenges tha ..."
Abstract
- Add to MetaCart
Governments increasingly need to collaborate with each other in order to solve global problems [1]. Information gathering, sharing, processing and retrieval are inherent to such collaborative efforts. Information systems that support international collaborations among governments face challenges that require research on how to • Protect, control access, filter, summarize, correlate and share information across agencies and organizations

