P. (Saday) Sadayappan Ohio State University

Job Scheduling

Problem Statement
- Given a stream of parallel jobs and a set of computing resources, determine when and where to execute each job.
- The form in which the job scheduling problem is addressed at most supercomputer centers:
  - Homogeneous set of processors
  - Each job asks for a specific, fixed number of processors

Job Scheduling Today
- The earliest job schedulers (Intel iPSC) used a simple FCFS strategy, with low utilization (50%).
- Back-filling was implemented at Argonne:
  - Give an earliest-possible reservation to the job at the head of the queue, but allow a later-arriving job to bypass it if the reservation is not violated.
  - Utilization improves to ~90%.
- Used at most production facilities today.
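The back-filling rule above can be sketched as a single scheduling step. This is only an illustrative sketch, not the Argonne implementation: the names `Job` and `backfill_step`, and the use of user-supplied runtime estimates to decide whether a job finishes before the reservation, are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    procs: int      # fixed number of processors requested
    runtime: float  # runtime estimate (assumed to be supplied by the user)


def backfill_step(now, total_procs, running, queue):
    """Return the jobs that may start at time `now` (hypothetical helper).

    running: list of (end_time, procs) for jobs already executing.
    queue:   waiting jobs in arrival (FCFS) order.
    """
    running = list(running)
    queue = list(queue)
    free = total_procs - sum(p for _, p in running)
    started = []

    # Plain FCFS: start jobs from the head of the queue while they fit.
    while queue and queue[0].procs <= free:
        job = queue.pop(0)
        free -= job.procs
        running.append((now + job.runtime, job.procs))
        started.append(job)
    if not queue:
        return started

    # Earliest-possible reservation for the blocked head job: walk the
    # running jobs in order of completion until enough processors are free.
    head = queue[0]
    avail = free
    reservation = None
    for end, procs in sorted(running):
        avail += procs
        if avail >= head.procs:
            reservation = end
            break
    if reservation is None:  # head job asks for more than the machine has
        return started
    extra = avail - head.procs  # processors the head job will not need

    # A later job may bypass the head job if it fits now and either finishes
    # before the reservation or uses only processors the head job leaves over.
    for job in queue[1:]:
        if job.procs > free:
            continue
        ends_in_time = now + job.runtime <= reservation
        if ends_in_time or job.procs <= extra:
            free -= job.procs
            if not ends_in_time:
                extra -= job.procs
            started.append(job)
    return started


# 10 processors, 6 busy until t=100. Job A (8 procs) must wait until t=100.
waiting = [Job("A", 8, 50), Job("B", 4, 200), Job("C", 2, 80), Job("D", 2, 300)]
print([j.name for j in backfill_step(0, 10, [(100, 6)], waiting)])  # ['C', 'D']
```

In this example B is held back because it would still occupy processors that A needs at t=100, while C finishes before the reservation and D only uses processors A leaves free.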

Can Performance be Improved?
- Metrics:
  - System metric: utilization
  - User metrics: response time (wait time + run time), slowdown (response time / run time)
- Over a hundred papers published:
  - Focus mainly on improving user metrics, which have much greater potential for improvement than utilization.
- Question: How important is it to squeeze an additional 5-10% utilization out of a system that is already achieving over 85% utilization?
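As a small worked example of the metrics above, the following sketch computes them for a single job and for a whole schedule. The function names are illustrative, and the utilization formula (processor-time used divided by processor-time available) is the usual definition rather than one stated on the slide.

```python
def response_time(wait, run):
    """Response time = wait time + run time."""
    return wait + run


def slowdown(wait, run):
    """Slowdown = response time / run time (1.0 means no waiting at all)."""
    return response_time(wait, run) / run


def utilization(jobs, total_procs, makespan):
    """Fraction of available processor-time actually used.

    jobs: iterable of (procs, run) pairs; makespan: length of the period.
    """
    used = sum(procs * run for procs, run in jobs)
    return used / (total_procs * makespan)


# A 1-hour job and a 24-hour job that both wait 2 hours:
print(response_time(2, 1), slowdown(2, 1))    # 3 3.0
print(response_time(2, 24), slowdown(2, 24))  # 26, slowdown about 1.08

# Two jobs on a 10-processor machine over a 10-hour period:
print(utilization([(6, 10), (4, 5)], total_procs=10, makespan=10))  # 0.8
```

The same 2-hour wait triples the response time of the short job but barely changes the slowdown of the long one, which is why slowdown is reported alongside response time.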

Improving Response Time
- Question: How important is it to evaluate alternatives to standard back-fill scheduling, with the goal of improved user response time?
- Many simulation studies have reported significant improvement in slowdown or response time with new schemes, but most production schedulers simply use aggressive back-fill. Why?

Possible Reasons for Non-Adoption
- Academic studies do not model the specific policy issues of a center, e.g. “good citizen rules,” multiple queues, etc.
- Most results are based on job log traces from Feitelson’s archive, with many logs from academic centers exhibiting low system utilization (< 70%).
- Most studies report overall averages over the entire trace, which is insufficient to assess the impact of a change:
  - E.g., using a Shortest-Job-First queue policy instead of the usual FCFS policy significantly improves overall average slowdown, by a factor of 4, but increases response time for 24-hour jobs to 50 hours instead of 26 hours.
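The last point, that an overall average can hide what happens to long jobs, can be illustrated with a per-class breakdown of slowdown. This is a sketch under stated assumptions: the helper name, the one-hour cutoff between "short" and "long" jobs, and the short-job numbers are hypothetical. Only the 24-hour job's response times (26 and 50 hours) come from the slide.

```python
from collections import defaultdict


def mean_slowdown_by_class(jobs, short_limit=1.0):
    """Mean slowdown overall and per job class (hypothetical helper).

    jobs: iterable of (wait_hours, run_hours) pairs.
    short_limit: run time in hours up to which a job counts as "short"
                 (an arbitrary cutoff chosen for this illustration).
    """
    groups = defaultdict(list)
    for wait, run in jobs:
        value = (wait + run) / run  # slowdown = response time / run time
        groups["short" if run <= short_limit else "long"].append(value)
        groups["all"].append(value)
    return {label: sum(vals) / len(vals) for label, vals in groups.items()}


# Three made-up half-hour jobs plus one 24-hour job under each policy.
fcfs = [(3.0, 0.5), (3.0, 0.5), (3.0, 0.5), (2.0, 24.0)]   # long job: 26 h response
sjf = [(0.25, 0.5), (0.25, 0.5), (0.25, 0.5), (26.0, 24.0)]  # long job: 50 h response

print(mean_slowdown_by_class(fcfs))
print(mean_slowdown_by_class(sjf))
```

With these illustrative numbers the "all" average drops sharply under Shortest-Job-First while the "long" average nearly doubles, which is the effect an overall average conceals.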

QoS for Job Scheduling
- Job schedulers do not provide QoS:
  - No response-time guarantees
  - No equitable way of offering different service for urgent versus non-urgent jobs
- Technical and accounting issues:
  - Develop job schedulers that can do deadline-based scheduling
  - Develop accounting models to charge based on urgency of job: Charge = f1(resource-usage) + f2(wait-time-limit)
- Question: How desirable is it to develop job schedulers with QoS functionality?
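One possible reading of the charging model above is sketched below. The slide does not specify f1 or f2, so both forms here are assumptions chosen for illustration: f1 is taken as linear in processor-hours, and f2 as an urgency fee that grows as the requested wait-time limit shrinks. The parameter names `rate` and `urgency_fee` are hypothetical.

```python
def charge(procs, run_hours, wait_limit_hours, rate=1.0, urgency_fee=100.0):
    """Charge = f1(resource-usage) + f2(wait-time-limit).

    f1 and f2 are illustrative choices, not taken from the slides:
      f1 = rate * processor-hours used
      f2 = urgency_fee / wait_limit_hours  (tighter limit, higher fee)
    """
    if wait_limit_hours <= 0:
        raise ValueError("wait_limit_hours must be positive")
    f1 = rate * procs * run_hours
    f2 = urgency_fee / wait_limit_hours
    return f1 + f2


# The same 16-processor, 2-hour job with different urgency:
print(charge(16, 2, wait_limit_hours=1))   # 132.0 (must start within 1 hour)
print(charge(16, 2, wait_limit_hours=10))  # 42.0  (can wait up to 10 hours)
```

Any f2 that decreases as the wait-time limit grows would serve the same purpose, namely letting users pay more for urgent jobs and less for jobs that can wait.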

Questions?