5 May 20051 CmpE 516 Fault Tolerant Scheduling in Multiprocessor Systems Betül Demiröz.

Slides:



Advertisements
Similar presentations
Real Time Scheduling.
Advertisements

EE5900 Advanced Embedded System For Smart Infrastructure
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Fault-Tolerant Scheduling Techniques.
Computer Structure 2014 – Out-Of-Order Execution 1 Computer Structure Out-Of-Order Execution Lihu Rappoport and Adi Yoaz.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
CPE555A: Real-Time Embedded Systems
Carnegie Mellon R-BATCH: Task Partitioning for Fault-tolerant Multiprocessor Real-Time Systems Junsung Kim, Karthik Lakshmanan and Raj Rajkumar Electrical.
Task Allocation and Scheduling n Problem: How to assign tasks to processors and to schedule them in such a way that deadlines are met n Our initial focus:
Module 2 Priority Driven Scheduling of Periodic Task
CS 582 / CMPE 481 Distributed Systems Fault Tolerance.
1 Chapter Fault Tolerant Design of Digital Systems.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
7. Fault Tolerance Through Dynamic or Standby Redundancy 7.5 Forward Recovery Systems Upon the detection of a failure, the system discards the current.
By Group: Ghassan Abdo Rayyashi Anas to’meh Supervised by Dr. Lo’ai Tawalbeh.
CprE 458/558: Real-Time Systems
Real-Time Operating System Chapter – 8 Embedded System: An integrated approach.
MULTIPROCESSOR SYSTEMS OUTLINE  Coordinated job Scheduling  Separate Systems  Homogeneous Processor Scheduling  Master/Slave Scheduling.
Distributed Real-Time systems 1 By: Mahdi Sadeghizadeh Website: Sadeghizadeh.ir Advanced Computer Networks.
Task Scheduling By Dr. Amin Danial Asham.
A S CHEDULABILITY A NALYSIS FOR W EAKLY H ARD R EAL - T IME T ASKS IN P ARTITIONING S CHEDULING ON M ULTIPROCESSOR S YSTEMS Energy Reduction in Weakly.
© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.
Scheduling policies for real- time embedded systems.
Multiprocessor and Real-Time Scheduling Chapter 10.
Chapter 101 Multiprocessor and Real- Time Scheduling Chapter 10.
1 Nasser Alsaedi. The ultimate goal for any computer system design are reliable execution of task and on time delivery of service. To increase system.
Reference: Ian Sommerville, Chap 15  Systems which monitor and control their environment.  Sometimes associated with hardware devices ◦ Sensors: Collect.
Real-Time Scheduling CS4730 Fall 2010 Dr. José M. Garrido Department of Computer Science and Information Systems Kennesaw State University.
SENG521 (Fall SENG 521 Software Reliability & Testing Fault Tolerant Software Systems: Techniques (Part 4b) Department of Electrical.
Real-Time Scheduling CS 3204 – Operating Systems Lecture 20 3/3/2006 Shahrooz Feizabadi.
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Introduction to Real-Time Systems.
CprE 458/558: Real-Time Systems
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
6. Application mapping 6.1 Problem definition
1 11/29/2015 Chapter 6: CPU Scheduling l Basic Concepts l Scheduling Criteria l Scheduling Algorithms l Multiple-Processor Scheduling l Real-Time Scheduling.
Real-Time Scheduling CS 3204 – Operating Systems Lecture 13 10/3/2006 Shahrooz Feizabadi.
CSCI1600: Embedded and Real Time Software Lecture 24: Real Time Scheduling II Steven Reiss, Fall 2015.
DISTRIBUTED COMPUTING
Real time scheduling G.Anuradha Ref:- Stallings. Real time computing Correctness of the system depends not only on the logical result of computation,
Introduction to Embedded Systems Rabie A. Ramadan 5.
Tolerating Communication and Processor Failures in Distributed Real-Time Systems Hamoudi Kalla, Alain Girault and Yves Sorel Grenoble, November 13, 2003.
CSCI1600: Embedded and Real Time Software Lecture 23: Real Time Scheduling I Steven Reiss, Fall 2015.
Dynamic Priority Driven Scheduling of Periodic Task
Introduction to Real-Time Systems
1.  System Characteristics  Features of Real-Time Systems  Implementing Real-Time Operating Systems  Real-Time CPU Scheduling  An Example: VxWorks5.x.
1 Uniprocessor Scheduling Chapter 3. 2 Alternating Sequence of CPU And I/O Bursts.
Chapter 4 CPU Scheduling. 2 Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation.
Fault-Tolerant Rate- Monotonic Scheduling Sunondo Ghosh, Rami Melhem, Daniel Mosse and Joydeep Sen Sarma.
Lecture 6: Real-Time Scheduling
Distributed Process Scheduling- Real Time Scheduling Csc8320(Fall 2013)
Real-Time Operating Systems RTOS For Embedded systems.
Embedded System Scheduling
Fault Tolerance In Operating System
CprE 458/558: Real-Time Systems
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Fault Tolerance Distributed Web-based Systems
CSCI1600: Embedded and Real Time Software
Chapter 6: CPU Scheduling
Outline Scheduling algorithms Multi-processor scheduling
Multiprocessor and Real-Time Scheduling
CSCI1600: Embedded and Real Time Software
Chapter 10 Multiprocessor and Real-Time Scheduling
Operating System , Fall 2000 EA101 W 9:00-10:00 F 9:00-11:00
CS703 - Advanced Operating Systems
Shortest-Job-First (SJR) Scheduling
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Anand Bhat*, Soheil Samii†, Raj Rajkumar* *Carnegie Mellon University
Presentation transcript:

5 May CmpE 516 Fault Tolerant Scheduling in Multiprocessor Systems Betül Demiröz

5 May Outline General consepts about tasks and scheduling Real time systems Fault Tolerant Scheduling Basic approaches used in Fault Tolerant Scheduling Algorithms and their execution details

5 May Task Deadline time the task should be finished Preemptive tasks can be stopped during execution restarted Nonpreemptive tasks cannot be restarted interrupted during execution

5 May Task Properties Periodic Aperiodic activated only when certain events occur arrival times are not known scheduled dynamically Dependent Independent

5 May Task Scheduling Distribution of tasks to the processors according to a given policy. Major goals of task scheduling: distribute the system load reduce total execution time

5 May Static & Dynamic Scheduling Static Scheduling compile time scheduling an accurate weight estimation is needed schedules of all tasks are predetermined Dynamic Scheduling scheduling at run time uses actual values of execution times of processes and communication times

5 May Real Time Systems Hard Real Time Correctness depends on logical results the result production time missing a deadline may be catastrophic mission-critical or life-critical applications fault tolerance is extremely important Soft Real Time

5 May Processors In The System Uniprocessor there is a single processor Multiprocessor there are n processors in the system can be identical (homogenous) can have different properties (heterogenous)

5 May Hard Real Time Systems Use multiprocessor Advantages more reliable unless a processor failure causes the whole system to fail can happen if no fault-tolerant capability is provided one processor failure does not cause the whole system to fail more computational power Disadvantage the probability of processor failure is higher

5 May Fault Tolerant System The system should produce correct results even in the presence of faults Important for most real time applications Tasks can have deadlines, and should be finished before the deadline fault tolerance required hard real time systems

5 May Error Detection in Fault Tolerant Scheduling Fail-Signal notify other processors of a detected fault Alarms or watchdogs detection of timing failures Signatures detection of HW/SW faults Acceptance Tests test results for HW/SW faults

5 May Fault Tolerance In Multiprocessor Systems Multiple copies of tasks scheduled on different processors Aim: the task completes before its deadline

5 May Fault Tolerante Scheduling In Multiprocessor Systems (Cont.) Multiple copies of tasks are scheduled to different processors One or more copies can run to ensure task completion before deadline PB (Primary/Backup Approach) TMR (Triple Modular Redundancy) Error checking is done by comparing results

5 May PB (Primary/Backup Approach) If incorrect results are generated from primary processor, backup processor is activated Small HW resource requirements Tasks are nonpreemptive, aperiodic, real-time

5 May An Algorithm For Real Time Fault Tolerant Scheduling in Multiprocessor Systems N periodic tasks are scheduled on a number of processors For each task i, there is a primary copy P i and a backup copy B i If primary copy fails, backup copy is activated Enough time needed to execute backup copies Static scheduling of tasks

5 May Scheduling Requirements Each task is executed by one processor at a time All tasks should meet their deadlines Maximize the number of processor failures to be tolerated P i and B i are assigned to only one processor which are different. Tasks are preemptive The number of processors used should be minimized

5 May Scheduling Algorithm Primary tasks are arranged in order of decreasing computation times Primary copies are scheduled (m processors are used) assign each copy to existing processors Primary schedule is dublicated for the backup copies (m processors are used) Any pair of primary and backup copies should not overlap

5 May An Example Distribution S={T 1, T 2, T 3, T 4, T 5 } C={5, 4, 4, 3, 2} T 1 -> P 1 T 2 -> P 2 T 3 -> P 1 T 4 -> P 2 T 5 -> P 2

5 May Example Cont.

5 May Another Algorithm Two copies of tasks allowed to start execution on different times Improves schedulability of tasks N identical processors and a scheduling processor are used Dynamic scheduling

5 May System Model A task is scheduled if Previously scheduled and the arrived task meet their deadlines Otherwise Task is rejected because its deadline is not met despite of a fault

5 May Techniques Used Backup copies are activated only when a fault occurs on the processor executing the primary copy Backup Overloading overlaping multiple slots for backups Backup De-allocation Release the slot for a backup copy when its primary copy is completed successfully

5 May Backup Overloading

5 May Backup Deallocation

5 May Proposed Technique The primary copy and backup copy are scheduled and executed in parallel The backup copy is divided into preceding part executed together with primary copy (redundant part) remaining part executed after the primary copy is completed (backup part) Backup overloading and backup deallocation is used

5 May Proposed Technique (Cont.)

5 May Scheduling Algorithm Schedule primary copy try to find a free slot between arrival time and deadline time Schedule backup copy schedule both redundant and backup parts

5 May System Overwiev

5 May Experiments Basic parameters used in experiments system load number of processors and tasks used computation time window size Analysing results rejection rate

5 May Experimental Results

5 May Thank You ANY QUESTIONS?