Results 1 -
7 of
7
Efficient overloading techniques for primary-backup scheduling in real-time systems
, 2004
"... ..."
QoS-aware fault-tolerant scheduling for real-time tasks on heterogeneous clusters
- IEEE Trans. Comput
, 2011
"... Abstract—Fault-tolerant scheduling plays a significant role in improving system reliability of clusters. Although extensive fault-tolerant scheduling algorithms have been proposed for real-time tasks in parallel and distributed systems, quality of service (QoS) requirements of tasks have not been ta ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
(Show Context)
Abstract—Fault-tolerant scheduling plays a significant role in improving system reliability of clusters. Although extensive fault-tolerant scheduling algorithms have been proposed for real-time tasks in parallel and distributed systems, quality of service (QoS) requirements of tasks have not been taken into account. This paper presents a fault-tolerant scheduling algorithm called QAFT that can tolerate one node’s permanent failures at one time instant for real-time tasks with QoS needs on heterogeneous clusters. In order to improve system flexibility, reliability, schedulability, and resource utilization, QAFT strives to either advance the start time of primary copies and delay the start time of backup copies in order to help backup copies adopt the passive execution scheme, or to decrease the simultaneous execution time of the primary and backup copies of a task as much as possible to improve resource utilization. QAFT is capable of adaptively adjusting the QoS levels of tasks and the execution schemes of backup copies to attain high system flexibility. Furthermore, we employ the overlapping technology of backup copies. The latest start time of backup copies and their constraints are analyzed and discussed. We conduct extensive experiments to compare our QAFT with two existing schemes—NOQAFT and DYFARS. Experimental results show that QAFT significantly improves the scheduling quality of NOQAFT and DYFARS. Index Terms—Heterogeneous clusters, real-time, scheduling, fault tolerance, quality of service (QoS), heuristic. Ç 1
Exploiting redundancies to enhance schedulability in fault-tolerant and real-time distributed systems
- IEEE Trans. Syst., Man, Cybern. A, Syst., Humans
, 2009
"... Abstract—In the past decades, distributed systems have been widely applied to real-time applications, most of which have fault-tolerance requirements to assure high reliability. Due to the stringent space constraints of real-time systems, the issue of schedulability becomes a major concern in the de ..."
Abstract
-
Cited by 5 (1 self)
- Add to MetaCart
(Show Context)
Abstract—In the past decades, distributed systems have been widely applied to real-time applications, most of which have fault-tolerance requirements to assure high reliability. Due to the stringent space constraints of real-time systems, the issue of schedulability becomes a major concern in the design of faulttolerant and real-time distributed systems. Most existing real-time and fault-tolerant scheduling algorithms, which are based on the primary–backup scheme for periodic real-time tasks, introduce unnecessary redundancies by aggressively using active-backup copies. To solve this problem, we propose two novel fault-tolerant techniques, which are seamlessly integrated with fixed-prioritybased scheduling algorithms. These techniques leverage redundancies to enhance schedulability in fault-tolerant and real-time distributed systems. Our fault-tolerant techniques make use of the primary–backup scheme to tolerate permanent hardware failures. The first technique (referred to as Tercos) terminates the execution of active-backup copies, when corresponding primary copies are successfully completed. Tercos is designed to reduce scheduling lengths in fault-free scenarios to enhance schedulability by virtue of executing portions of active-backup copies in passive forms. The second technique (referred to as Debus) uses a deferred-active-backup scheme to further minimize schedule lengths to improve the schedulability performance. Debus schedules active-backup copies as late as possible, while terminating active-backup copies when their primary copies are completed. Experimental results show that, compared with existing algorithms in literature, Tercos can significantly improve schedulability by up to 17.0 % (with an average of 9.7%). Furthermore, empirical results reveal that Debus can enhance schedulability over Tercos by up to 12 % (with an average of 7.8%). Index Terms—Distributed systems, fault tolerance, rate– monotonic (RM) algorithm, real-time task scheduling, primary– backup copy. I.
THE PROCESS OF SYNCHRONIZATION IN DUAL REDUNDANT FAULT-TOLERANT SYSTEM
"... Abstract: Synchronization is used in dual redundant fault-tolerant system to make two computers work jointly. It determines the work mode and controls the operations of the system. The paper presents a dual redundant fault-tolerant system and proposes its process of synchronization based on task. T ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract: Synchronization is used in dual redundant fault-tolerant system to make two computers work jointly. It determines the work mode and controls the operations of the system. The paper presents a dual redundant fault-tolerant system and proposes its process of synchronization based on task. The synchronization treats task as the minimal operation unit. And it is implemented with the assist of dual-computer-controller and outer memory, the latter includes task buffer and global data region. Dual-computer-controller controls the input and output of tasks that are stored in task buffer. The switch in of backup computer is implemented by duplicating global data from global data region used by host computer. Additional resolutions for key points, such as the switch between two computers, are also put forward in conclusion. It is proved that the task-level synchronization can make two computers work in phase, and the system is applicable to critical fields requiring high dependability.
Reliability-Driven Scheduling of Periodic Tasks in Heterogeneous Real-Time Systems
"... In this paper we comprehensively investigated the issue of reliability-driven real-time scheduling for periodic tasks in heterogeneous systems. First, we built a reliability model in which the concept of reliability cost is introduced in the context of heterogeneous realtime systems. Next, we propos ..."
Abstract
- Add to MetaCart
(Show Context)
In this paper we comprehensively investigated the issue of reliability-driven real-time scheduling for periodic tasks in heterogeneous systems. First, we built a reliability model in which the concept of reliability cost is introduced in the context of heterogeneous realtime systems. Next, we proposed a novel reliabilitydriven scheduling algorithm (referred to as Repars) for periodic tasks in heterogeneous systems. Third, after extending the reliability model to meet the needs of our fault-tolerant scheme, we developed a fault-tolerant scheduling algorithm or Refine. Refine aims to enhance system reliability while being able to tolerate one-processor failures in heterogeneous real-time systems. Experimental results showed that Repars is superior to RMFF in terms of both schedulability and reliability. When compared with Repars, Refine significantly reduced the reliability cost by up to 34% with graceful degradation in schedulability. 1.
Nashville, Tennessee Approved:
, 2009
"... Dr. Janos SztipanovitsTo Amma and Appa for their love and encouragement over the years ..."
Abstract
- Add to MetaCart
(Show Context)
Dr. Janos SztipanovitsTo Amma and Appa for their love and encouragement over the years
A Fault-Tolerant Scheduling Algorithm using Hybrid Overloading Technology for Dynamic Grouping based Multiprocessor Systems
"... In order to extend the application area of fault-tolerant scheduling algorithm based on hybrid overloading for multiprocessor and increase the fault-tolerant num-ber of processors, we propose a new fault-tolerant scheduling algorithm, which is based on hybrid overloading and dynamic grouping for mul ..."
Abstract
- Add to MetaCart
In order to extend the application area of fault-tolerant scheduling algorithm based on hybrid overloading for multiprocessor and increase the fault-tolerant num-ber of processors, we propose a new fault-tolerant scheduling algorithm, which is based on hybrid overloading and dynamic grouping for multiprocessor by combining logic grouping strategy for processors in primary backup overloading and backup backup overloading.This algorithm presents the formalization of the dynamic grouping for processors in fault-tolerant scheduling based on hybrid overloading and enlarges the task number included in overloading task link. In the process of fault-tolerant schedul-ing the processors are dynamically divided into some groups based on overloading task link, so as to keep good scheduling success ratio and enhance the fault-tolerant per-formance of processors. Both theoretical analysis and simulation experiment prove this algorithm’s effectiveness respectively.