QoPS: A QoS based Scheme for Parallel Job Scheduling M. IslamP. Balaji P. Sadayappan and D. K. Panda Computer and Information Science The Ohio State University.

Slides:



Advertisements
Similar presentations
Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.
Advertisements

Real Time Scheduling.
Evaluating the Cost-Benefit of Using Cloud Computing to Extend the Capacity of Clusters Presenter: Xiaoyu Sun.
1 EE5900 Advanced Embedded System For Smart Infrastructure RMS and EDF Scheduling.
CS5270 Lecture 31 Uppaal, and Scheduling, and Resource Access Protocols CS 5270 Lecture 3.
CPE555A: Real-Time Embedded Systems
Mehdi Kargahi School of ECE University of Tehran
Towards Provision of Quality of Service Guarantees in Job Scheduling Mohammad IslamPavan Balaji P. SadayappanD. K. Panda Computer Science and Engineering.
Opportune Job Shredding: An Efficient Approach for Scheduling Parameter Sweep Applications Rohan Kurian, Pavan Balaji, P. Sadayappan The Ohio State University.
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems (m, k)-firm tasks and QoS enhancement.
Task Allocation and Scheduling n Problem: How to assign tasks to processors and to schedule them in such a way that deadlines are met n Our initial focus:
Module 2 Priority Driven Scheduling of Periodic Task
Scheduling of parallel jobs in a heterogeneous grid environment Scheduling of parallel jobs in a heterogeneous grid environment Each site has a homogeneous.
Service Level Agreement based Allocation of Cluster Resources: Handling Penalty to Enhance Utility Chee Shin Yeo and Rajkumar Buyya Grid Computing and.
All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.
Managing Risk of Inaccurate Runtime Estimates for Deadline Constrained Job Admission Control in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing.
Senior Design Project: Parallel Task Scheduling in Heterogeneous Computing Environments Senior Design Students: Christopher Blandin and Dylan Machovec.
Towards Feasibility Region Calculus: An End-to-end Schedulability Analysis of Real- Time Multistage Execution William Hawkins and Tarek Abdelzaher Presented.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
Scheduling for Embedded Real-Time Systems Amit Mahajan and Haibo.
Parallel Job Scheduling Algorithms and Interfaces Research Exam for Cynthia Bailey Lee Department of Computer Science and Engineering University of California,
Project 2 – solution code
Optimization for QoS on Systems with Tasks Deadlines Luis Fernando Orleans Pedro Nuno Furtado.
Cs238 CPU Scheduling Dr. Alan R. Davis. CPU Scheduling The objective of multiprogramming is to have some process running at all times, to maximize CPU.
EE 249, Fall Discussion: Scheduling Haibo Zeng Amit Mahajan.
Embedded Systems Exercise 3: Scheduling Real-Time Periodic and Mixed Task Sets 18. May 2005 Alexander Maxiaguine.
By Group: Ghassan Abdo Rayyashi Anas to’meh Supervised by Dr. Lo’ai Tawalbeh.
Integrated Risk Analysis for a Commercial Computing Service Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Lab. Dept.
CPU Scheduling Introduction to Operating Systems: Module 13.
Operating System Concepts and Techniques Lecture 5 Scheduling-1 M. Naghibzadeh Reference M. Naghibzadeh, Operating System Concepts and Techniques, First.
MM Process Management Karrie Karahalios Spring 2007 (based off slides created by Brian Bailey)
Scheduling of Parallel Jobs In a Heterogeneous Multi-Site Environment By Gerald Sabin from Ohio State Reviewed by Shengchao Yu 02/2005.
Marcos Dias de Assunção 1,2, Alexandre di Costanzo 1 and Rajkumar Buyya 1 1 Department of Computer Science and Software Engineering 2 National ICT Australia.
Task Scheduling By Dr. Amin Danial Asham.
Dynamic Slack Reclamation with Procrastination Scheduling in Real- Time Embedded Systems Paper by Ravindra R. Jejurikar and Rajesh Gupta Presentation by.
A S CHEDULABILITY A NALYSIS FOR W EAKLY H ARD R EAL - T IME T ASKS IN P ARTITIONING S CHEDULING ON M ULTIPROCESSOR S YSTEMS Energy Reduction in Weakly.
Scheduling policies for real- time embedded systems.
Multiprocessor Real-time Scheduling Jing Ma 马靖. Classification Partitioned Scheduling In the partitioned approach, the tasks are statically partitioned.
CS Spring 2011 CS 414 – Multimedia Systems Design Lecture 31 – Multimedia OS (Part 1) Klara Nahrstedt Spring 2011.
Real-Time Scheduling CS4730 Fall 2010 Dr. José M. Garrido Department of Computer Science and Information Systems Kennesaw State University.
Real-Time Scheduling CS 3204 – Operating Systems Lecture 20 3/3/2006 Shahrooz Feizabadi.
Prepare by : Ihab shahtout.  Overview  To give an overview of fixed priority schedule  Scheduling and Fixed Priority Scheduling.
Static Process Scheduling Section 5.2 CSc 8320 Alex De Ruiter
Object-Oriented Design and Implementation of the OE-Scheduler in Real-time Environments Ilhyun Lee Cherry K. Owen Haesun K. Lee The University of Texas.
5 May CmpE 516 Fault Tolerant Scheduling in Multiprocessor Systems Betül Demiröz.
Real-Time Support for Mobile Robotics K. Ramamritham (+ Li Huan, Prashant Shenoy, Rod Grupen)
Distributed Process Scheduling : A Summary
Special Class on Real-Time Systems
Real-Time Scheduling CS 3204 – Operating Systems Lecture 13 10/3/2006 Shahrooz Feizabadi.
1 Real-Time Scheduling. 2Today Operating System task scheduling –Traditional (non-real-time) scheduling –Real-time scheduling.
CSCI1600: Embedded and Real Time Software Lecture 24: Real Time Scheduling II Steven Reiss, Fall 2015.
Real time scheduling G.Anuradha Ref:- Stallings. Real time computing Correctness of the system depends not only on the logical result of computation,
6.1 CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation.
Introduction to Embedded Systems Rabie A. Ramadan 5.
Job Scheduling P. (Saday) Sadayappan Ohio State University.
CSCI1600: Embedded and Real Time Software Lecture 23: Real Time Scheduling I Steven Reiss, Fall 2015.
Dynamic Priority Driven Scheduling of Periodic Task
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 31 – Process Management (Part 1) Klara Nahrstedt Spring 2009.
CS333 Intro to Operating Systems Jonathan Walpole.
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Energy-aware QoS packet scheduling.
For a good summary, visit:
Real-Time Scheduling --- An Overview Real-Time Scheduling --- An Overview Xiaoping He.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 32 – Multimedia OS Klara Nahrstedt Spring 2010.
Chapter 4 CPU Scheduling. 2 Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation.
Distributed Process Scheduling- Real Time Scheduling Csc8320(Fall 2013)
CSCI1600: Embedded and Real Time Software
P. (Saday) Sadayappan Ohio State University
A Characterization of Approaches to Parrallel Job Scheduling
CSCI1600: Embedded and Real Time Software
Processes and operating systems
Presentation transcript:

QoPS: A QoS based Scheme for Parallel Job Scheduling M. IslamP. Balaji P. Sadayappan and D. K. Panda Computer and Information Science The Ohio State University Presented by Gerald Sabin

06/24/2003The Ohio State University2  Independent Parallel Job Scheduling Model –Dynamically arriving Independent Parallel Jobs –Resource mapping: Submitted Jobs to Resources present  Number of Techniques studied over the years –Backfilling (Ex: Conservative, EASY, No Guarantee) –Priority based scheduling  Differentiated service to different classes of jobs  Soft Real-time or Best Effort guarantees to the completion time  Hard Real-Time or “Deadline-based” scheduling –Allow Users to specify the deadline they desire –Cost model based on Resources Used AND Deadline Specified –Requires a deadline-based scheduling algorithm: LONG OVERDUE ! Job Schedulers Today

06/24/2003The Ohio State University3 QoS for Job Scheduling  Two Components in providing QoS –Cost Model Component  Based on Resources Used AND Deadline Specified  More urgent jobs are charged more  Guarantees the service requested –Job Scheduling Component  Admission Control –Can we meet the specified deadline?  Once admitted, cannot miss the specified deadline  We only deal with the Job Scheduling Component

06/24/2003The Ohio State University4 Overview  Related Work  The QoPS Algorithm  Simulation Approach  Experimental Results  Conclusions and Future Work

06/24/2003The Ohio State University5 Related Work  Feitelson’s Slack-Based (SB) Scheduling [feit97] –Focused on improving Utilization and Turnaround time –Jobs have an associated slack, based on their priority  This determines how much they can be delayed  Ramamritham’s Real-Time (RT) Scheduling [krithi90] –Deadline-based scheduling algorithm –Non-periodic Single Processor Jobs –Statically available at start time [feit97]: “Supporting Priorities and Improving Utilization of the IBM SP2 Scheduler using Slack based Backfilling”, D. Talby, D. G. Feitelson, IPPS, Apr ’97 [krithi90]: “Efficient Scheduling Algorithms for Real-Time Multiprocessor Systems”, K. Ramamritham, J. A. Stankovic, P-F. Shiah, TPDS, Apr ‘90

06/24/2003The Ohio State University6 Slack-Based (SB) Scheduling Algorithm  When a job (J N+1 ) arrives –Calculate its slack (based on its priority) –If J 1, J 2, …, J N are already present and scheduled in that order –Try placing the job (J N+1 ) in each possible position in this list –For each of the N+1 schedules feasible, calculate a cost function ‘f’  A schedule is feasible if no job exceeds the slack given to it –Choose the schedule with the “best cost function value” J1J1 J2J2 J3J3 J4J4 J5J5 J6J6 J7J7 J1J1 J2J2 J3J3 J4J4 J5J5 J6J6 J7J7 J1J1 J2J2 J3J3 J4J4 J5J5 J6J6 J7J7 J1J1 J2J2 J3J3 J4J4 J5J5 J6J6 J7J7 Cost Function Evaluation f0f0 f1f1 f2f2 f3f3 f4f4 f5f5 f6f6 f best

06/24/2003The Ohio State University7 Real-Time (RT) Scheduling Algorithm  Static Scheme, so there’s no concept of new jobs arriving  Sort jobs based on a heuristic function  Start from a NULL schedule  For each of the jobs –If placing the job in the current schedule misses its deadline  Backtrack to the last known feasible schedule –If (number of backtracks > p) Discard the Schedule  If all jobs have been placed within their deadlines –Accept the Schedule

06/24/2003The Ohio State University8 J4J4 J4J4 J2J2 J2J2 J4J4 J4J4 J3J3 J3J3 J2J2 J2J2 J1J1 J1J1 Working of the RT Algorithm J N J N-1 J N-2... J 3 J 2 J 1 Sorted by Earliest Deadline first (EDF) NULL J3J3 J3J3

06/24/2003The Ohio State University9 Modified Slack-Based (MSB) Algorithm  Modified Slack-Based (MSB) Algorithm –Motivation of SB: To improve Utilization and Response Time –SB assigns slack to jobs based on job priorities –MSB assigns slack to jobs based on the deadline specified –Rest of the algorithm is unchanged

06/24/2003The Ohio State University10 Modified Real-Time (MRT) Algorithm  Modified Real-Time (MRT) Algorithm –RT was designed for non-periodic uni-processor jobs –All jobs are Statically available at the start of the execution –MRT involves two modifications to RT  To allow dynamically arriving jobs –Run the algorithm every time a job arrives  To allow scheduling of parallel jobs –Allowing backfilling of jobs

06/24/2003The Ohio State University11 Overview  Related Work  The QoPS Algorithm  Simulation Approach  Experimental Results  Conclusions and Future Work

06/24/2003The Ohio State University12 The Basic QoPS Algorithm  Similar to the MSB algorithm, but… –Provides more flexibility in reordering scheduled jobs  When a job (J N+1 ) arrives –If J 1, J 2, …, J N are already present and scheduled in that order –Place the job (J N+1 ) at the start of all jobs  Try scheduling the jobs in that order –If all jobs are able to meet their deadlines, Great ! Admit it ! –If some job fails, we have two options: –Option1:  Consider the failed job as a critical job  Push the failed job to the start of the schedule and retry  ‘k’ number of such re-orderings of existing jobs are allowed  If (number of re-orderings > k) switch to option 2 –Option2:  Back off exponentially in the position at which you try placing job (J N+1 ) and retry

06/24/2003The Ohio State University13 J3J3 J2J2 J2J2 J1J1 Working of the QoPS Algorithm J 12 J 11 J7J7 J8J8 J6J6 J5J5 J 10 J9J9 J4J4 J1J1 J 13 J1J1 J2J2 J3J3 J3J3 J1J1 J2J2 J3J3 J1J1 J2J2 J3J3 J1J1 J4J4 J3J3 J2J2 J6J6 J5J5 J4J4 J1J1 J2J2 J3J3 Max. Violations Allowed = 2 Current Violations = 0Current Violations = 1Current Violations = 2Current Violations = 0

06/24/2003The Ohio State University14 Overview  Related Work  The QoPS Algorithm  Simulation Approach  Experimental Results  Conclusions and Future Work

06/24/2003The Ohio State University15 Simulation Approach CTC/SDSC Trace Load Variation Deadline Calculator Deadline-based Trace QoPS Simulation MSB Simulation MRT Simulation EASY Simulation Duplication/Expansion

06/24/2003The Ohio State University16 Trace Generation  Many job logs available, but no associated deadlines  Synthetic Deadline Generation –Generate a schedule for the job trace using EASY –For any job J, if the Turnaround time in this schedule is T –Deadline for J = Arrival Time + max (runtime, (1-SF) x T) –SF is the “Stringency factor” (0 < SF < 1)  0 would give the least stringent deadlines and 1 the most stringent  Some jobs might not come with deadlines –Very lax deadlines to prevent starvation –If ‘T’ is the current expected Turnaround time,  Deadline = Arrival Time + max (24hrs, R x T) –R is the “Relaxation Factor” of the schedule

06/24/2003The Ohio State University17 Overview  Related Work  The QoPS Algorithm  Simulation Approach  Experimental Results  Conclusions and Future Work

06/24/2003The Ohio State University18 Experimental Results  Two evaluation scenarios –Scenario1:  All jobs have deadlines  Pure comparison of the three algorithms –Scenario2:  Mixed jobs: Some have deadlines, others are artificially provided  More realistic  Tests Conducted: –Job Acceptance rate –Impact on Non-deadline Jobs –Utilization Variation, etc

06/24/2003The Ohio State University19 Admittance Capacity Comparison All jobs have deadlines; Stringency Factor = 0.2; CTC Trace QoPS admits the most number of jobs (and Processor Seconds)

06/24/2003The Ohio State University20 Utilization Comparison All jobs have deadlines; CTC Trace Deadline-based schemes lose about 10% Utilization

06/24/2003The Ohio State University21 Admittance Capacity Comparison (Mixed Jobs) 20% jobs have deadlines; Stringency Factor = 0.2; CTC Trace QoPS admits the most number of jobs (and Processor Seconds)

06/24/2003The Ohio State University22 Response Time and Slow Down Vs Load 20% jobs have deadlines; Stringency Factor = 0.2; CTC Trace QoPS gives the best slow-down in spite of accepting more jobs; Unfair to EASY

06/24/2003The Ohio State University23 Utilization Vs Load (Mixed Jobs) EASY has a higher Utilization Accepts more (all) jobs; Unfair to the deadline-based schemes

06/24/2003The Ohio State University24 Response Time and Slow Down Vs Utilization 20% jobs have deadlines; Stringency Factor = 0.2; CTC Trace Fairer Comparison; QoPS still performs better in most cases, especially Slow Down

06/24/2003The Ohio State University25 Overview  Related Work  The QoPS Algorithm  Simulation Approach  Experimental Results  Conclusions and Future Work

06/24/2003The Ohio State University26 Conclusions  “Deadline-based” scheduling is desirable –No such scheme for parallel jobs –Previous schemes can be extended, but…  Not proposed for this kind of scheduling  Might not fit in perfectly –Proposed the QoPS algorithm  Allows jobs to specify required deadlines –Admission control checks admissibility –Job Scheduler schedules admitted jobs  Outperforms extended previous schemes (MSB and MRT) –But, the main idea is not performance –Deadline Scheduling is a necessity and QoPS is an effort to meet it

06/24/2003The Ohio State University27 Future Work  Cost Metric component in QoS  Currently using a first fit mechanism –Best fit is expected to do much better  Job Shedding Vs Non Job Shedding –If deadline can’t be met  Should we reject the job (will the user try again?)  Should we give it the best available deadline  Grid based extensions to QoPS

06/24/2003The Ohio State University28 Thank You !

Backup Slides

06/24/2003The Ohio State University30 Admittance Capacity for SDSC trace All jobs have deadlines; Stringency Factor = 0.2; CTC Trace QoPS admits the most number of jobs (and Processor Seconds)

06/24/2003The Ohio State University31 Admittance Capacity with Job Expansion All jobs have deadlines; Stringency Factor = 0.2; CTC Trace QoPS admits the most number of jobs (and Processor Seconds)

06/24/2003The Ohio State University32 Impact of Relaxation Factor 80% jobs have deadlines; Stringency Factor = 0.2; CTC Trace With low “R”, Longer jobs perform better (reflects in Resp. Time)