P. (Saday) Sadayappan Ohio State University

Slides:



Advertisements
Similar presentations
Evaluating the Cost-Benefit of Using Cloud Computing to Extend the Capacity of Clusters Presenter: Xiaoyu Sun.
Advertisements

Scheduling Criteria CPU utilization – keep the CPU as busy as possible (from 0% to 100%) Throughput – # of processes that complete their execution per.
Towards Provision of Quality of Service Guarantees in Job Scheduling Mohammad IslamPavan Balaji P. SadayappanD. K. Panda Computer Science and Engineering.
Opportune Job Shredding: An Efficient Approach for Scheduling Parameter Sweep Applications Rohan Kurian, Pavan Balaji, P. Sadayappan The Ohio State University.
Scheduling of parallel jobs in a heterogeneous grid environment Scheduling of parallel jobs in a heterogeneous grid environment Each site has a homogeneous.
Service Level Agreement based Allocation of Cluster Resources: Handling Penalty to Enhance Utility Chee Shin Yeo and Rajkumar Buyya Grid Computing and.
Managing Risk of Inaccurate Runtime Estimates for Deadline Constrained Job Admission Control in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing.
Senior Design Project: Parallel Task Scheduling in Heterogeneous Computing Environments Senior Design Students: Christopher Blandin and Dylan Machovec.
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Integrated Risk Analysis for a Commercial Computing Service Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Lab. Dept.
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
Scheduling of Parallel Jobs In a Heterogeneous Multi-Site Environment By Gerald Sabin from Ohio State Reviewed by Shengchao Yu 02/2005.
Marcos Dias de Assunção 1,2, Alexandre di Costanzo 1 and Rajkumar Buyya 1 1 Department of Computer Science and Software Engineering 2 National ICT Australia.
Meta Scheduling Sathish Vadhiyar Sources/Credits/Taken from: Papers listed in “References” slide.
CPU S CHEDULING Lecture: Operating System Concepts Lecturer: Pooja Sharma Computer Science Department, Punjabi University, Patiala.
Scheduling policies for real- time embedded systems.
Operating Systems Scheduling. Bursts of CPU usage alternate with periods of waiting for I/O. (a) A CPU-bound process. (b) An I/O-bound process. Scheduling.
Job Scheduling P. (Saday) Sadayappan Ohio State University.
QoPS: A QoS based Scheme for Parallel Job Scheduling M. IslamP. Balaji P. Sadayappan and D. K. Panda Computer and Information Science The Ohio State University.
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Chapter 4 CPU Scheduling. 2 Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Best Practices Consortium
Copyright ©: Nahrstedt, Angrave, Abdelzaher
CPU SCHEDULING.
Dan C. Marinescu Office: HEC 439 B. Office hours: M, Wd 3 – 4:30 PM.
Introduction to Load Balancing:
Routers and Redundancy
1 The roles of actuaries & general operating environment
Copyright ©: Nahrstedt, Angrave, Abdelzaher
April 6, 2001 Gary Kimura Lecture #6 April 6, 2001
Scheduling Jobs Across Geo-distributed Datacenters
AWS Batch Overview A highly-efficient, dynamically-scaled, batch computing service May 2017.
Process Scheduling B.Ramamurthy 9/16/2018.
Chapter 6: CPU Scheduling
Chapter 10 Verification and Validation of Simulation Models
Intel® PCC Proposal Presentation
ICS 143 Principles of Operating Systems
CS 143A - Principles of Operating Systems
Process Scheduling B.Ramamurthy 11/18/2018.
CPU Scheduling G.Anuradha
Module 5: CPU Scheduling
Lecture 21: Introduction to Process Scheduling
3: CPU Scheduling Basic Concepts Scheduling Criteria
Chapter5: CPU Scheduling
Supplement D Waiting Line Models
Chapter 6: CPU Scheduling
A Characterization of Approaches to Parrallel Job Scheduling
CPU SCHEDULING.
Chapter 5: CPU Scheduling
Chapter 5: CPU Scheduling
The Heart of Student Success
Chavit Denninnart, Mohsen Amini Salehi and Xiangbo Li
Lecture 21: Introduction to Process Scheduling
Project Name - Testing Iteration 1 UAT Kick-off
Uniprocessor scheduling
Operating System , Fall 2000 EA101 W 9:00-10:00 F 9:00-11:00
Scheduling & Dispatching
Uniprocessor Process Management & Process Scheduling
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
CPU SCHEDULING CPU SCHEDULING.
Scheduling 21 May 2019.
Chapter 6: CPU Scheduling
Process/Code Migration and Cloning
Scheduling & Dispatching
Uniprocessor Process Management & Process Scheduling
Cloud Resource Scheduling for Online and Batch Applications
Module 5: CPU Scheduling
Presentation transcript:

P. (Saday) Sadayappan Ohio State University Job Scheduling P. (Saday) Sadayappan Ohio State University

Problem Statement Given a stream of parallel jobs and a set of computing resources, determine when and where to execute each job In the form that the job scheduling problem is addressed at most supercomputer centers: Homogeneous set of processors Each job asks for a specific, fixed number of processors

Job Scheduling Today Earliest job schedulers (Intel iPSC) used a simple FCFS strategy; low utilization (50%) Back-filling was implemented at Argonne Give an earliest-possible reservation to job at head of the queue, but allow a later arriving job to bypass it, if the reservation is not violated Utilization improves to ~90% Used at most production facilities today

Can Performance be Improved? Metrics: System Metric: Utilization User Metrics: Response time (wait+run time), Slowdown (response-time/run-time) Over a hundred papers published: Focus mainly on improving user metrics: much greater potential for its improvement than utilization Question: How important is it to squeeze an additional 5-10% utilization on a system that is already achieving over 85% utilization?

Improving Response Time Question: How important is it to evaluate alternatives to standard back-fill scheduling, with a goal of improved user response-time? Many studies have reported simulation studies showing significant improvement of slowdown or response-time with new schemes; but most production schedulers simply use aggressive back-fill. Why?

Possible Reasons for Non-Adoption Academic studies do not model specific policy issues of a center, e.g. “good citizen rules,” multiple queues etc. Most results are based on job log traces at Feitelson’s archive, with many logs from academic centers exhibiting low system utilization (< 70%). Most studies report overall averages over entire trace: insufficient to assess impact of change: E.g., using a Shortest-Job-First queue policy instead of the usual FCFS policy significantly improves overall average slowdown by a factor of 4; but increases response time for 24 hour jobs to 50 hours instead of 26 hours.

QoS for Job Scheduling Job schedulers do not provide QoS: No response time guarantees No equitable way of offering different service for urgent versus non-urgent jobs Technical and Accounting issues: Develop job schedulers that can do deadline-based scheduling Develop accounting models to charge based on urgency of job: Charge = f1(resource-usage) + f2(wait-time-limit) Question: How desirable is it to develop job schedulers with QoS functionality?

Questions?