Resource Provision for Batch and Interactive Workloads in Data Centers Ting-Wei Chang, Pangfeng Liu Department of Computer Science and Information Engineering,

Slides:

Advertisements

Similar presentations

Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)

Advertisements

Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.

QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,

Hadi Goudarzi and Massoud Pedram

SLA-Oriented Resource Provisioning for Cloud Computing

Bag-of-Tasks Scheduling under Budget Constraints Ana-Maria Oprescu, Thilo Kielman Presented by Bryan Rosander.

GPU Virtualization Support in Cloud System Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer Science and Information.

Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.

Energy-efficient Virtual Machine Provision Algorithms for Cloud System Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer.

1 Swiss Federal Institute of Technology Computer Engineering and Networks Laboratory Embedded Systems Exercise 2: Scheduling Real-Time Aperiodic Tasks.

A Cyber-Physical Systems Approach to Energy Management in Data Centers Presented by Chen He Adopted form the paper authors.

Meeting Service Level Objectives of Pig Programs Zhuoyao Zhang, Ludmila Cherkasova, Abhishek Verma, Boon Thau Loo University of Pennsylvania Hewlett-Packard.

Service Level Agreement based Allocation of Cluster Resources: Handling Penalty to Enhance Utility Chee Shin Yeo and Rajkumar Buyya Grid Computing and.

Cloud Scheduling Dynamic Request Allocation with Respect to Context and SLA Charles Snyder.

Soft Real-Time Semi-Partitioned Scheduling with Restricted Migrations on Uniform Heterogeneous Multiprocessors Kecheng Yang James H. Anderson Dept. of.

Managing Risk of Inaccurate Runtime Estimates for Deadline Constrained Job Admission Control in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing.

Charles Reiss *, Alexey Tumanov †, Gregory R. Ganger †, Randy H. Katz *, Michael A. Kozuch ‡ * UC Berkeley† CMU‡ Intel Labs.

Automatic Resource Scaling for Web Applications in the Cloud Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer Science.

Gueyoung Jung, Nathan Gnanasambandam, and Tridib Mukherjee International Conference on Cloud Computing 2012.

Process Scheduling for Performance Estimation and Synthesis of Hardware/Software Systems Slide 1 Process Scheduling for Performance Estimation and Synthesis.

QoS-constrained List Scheduling Heuristics for Parallel Applications on Grids 16-th Euromicro PDP Toulose, February 2008 QoS-CONSTRAINED LIST SCHEDULING.

Embedded Systems Exercise 3: Scheduling Real-Time Periodic and Mixed Task Sets 18. May 2005 Alexander Maxiaguine.

A Prediction-based Real-time Scheduling Advisor Peter A. Dinda Prescience Lab Department of Computer Science Northwestern University

Integrated Risk Analysis for a Commercial Computing Service Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS) Lab. Dept.

November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.

Resource Provisioning based on Lease Preemption in InterGrid Mohsen Amini Salehi, Bahman Javadi, Rajkumar Buyya Cloud Computing and Distributed Systems.

Trust-Aware Optimal Crowdsourcing With Budget Constraint Xiangyang Liu 1, He He 2, and John S. Baras 1 1 Institute for Systems Research and Department.

Cloud Resource Scheduling for Online and Batch Applications Kick-off meeting.

Progress Report 2014/02/12. Previous in IPDPS’14 Energy-efficient task scheduling on per- core DVFS architecture ◦ Batch mode  Tasks with arrival time.

An Energy-Efficient Hypervisor Scheduler for Asymmetric Multi- core 1 Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer.

Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.

BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.

Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.

Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.

An Energy-efficient Task Scheduler for Multi-core Platforms with per-core DVFS Based on Task Characteristics Ching-Chi Lin Institute of Information Science,

June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.

“A cost-based admission control algorithm for digital library multimedia systems storing heterogeneous objects” – I.R. Chen & N. Verma – The Computer Journal.

Efficient Load Balancing Algorithm for Cloud Computing Network Che-Lun Hung 1, Hsiao-hsi Wang 2 and Yu-Chen Hu 2 1 Dept. of Computer Science & Communication.

Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.

Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY THERMAL-AWARE RESOURCE.

Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.

Cloud Resource Scheduling for Online and Batch Applications Midterm report 12/16.

Optimizing server placement in distributed systems in the presence of competition Jan-Jan Wu( 吳真貞 ), Shu-Fan Shih ( 施書帆 ), Pangfeng Liu ( 劉邦鋒 ), Yi-Min.

Multi-Task Assignment for CrowdSensing in Mobile Social Network Mingjun Xiao ∗, Jie Wu†, Liusheng Huang ∗, Yunsheng Wang‡, and Cong Liu§

Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,

Architecture for Resource Allocation Services Supporting Interactive Remote Desktop Sessions in Utility Grids Vanish Talwar, HP Labs Bikash Agarwalla,

1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.

Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.

Cloud-Assisted VR.

Introduction | Model | Solution | Evaluation

From Algorithm to System to Cloud Computing

Dynamic Graph Partitioning Algorithm

Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.

Ching-Chi Lin Institute of Information Science, Academia Sinica

Chapter 2 Scheduling.

Efficient Load Balancing Algorithm for Cloud

Cloud-Assisted VR.

Babak Sorkhpour, Prof. Roman Obermaisser, Ayman Murshed

A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.

Lecture 21: Introduction to Process Scheduling

Multi-hop Coflow Routing and Scheduling in Data Centers

Jason Neih and Monica.S.Lam

An Adaptive Middleware for Supporting Time-Critical Event Response

Smita Vijayakumar Qian Zhu Gagan Agrawal

CPU SCHEDULING.

Richard Anderson Lecture 6 Greedy Algorithms

Richard Anderson Lecture 7 Greedy Algorithms

Lecture 21: Introduction to Process Scheduling

Cloud Resource Scheduling for Online and Batch Applications

Richard Anderson Autumn 2019 Lecture 7

Presentation transcript:

Resource Provision for Batch and Interactive Workloads in Data Centers Ting-Wei Chang, Pangfeng Liu Department of Computer Science and Information Engineering, National Taiwan University Graduate Institute of Networking and Multimedia, Nation Taiwan University Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer Science and Information Engineering, National Taiwan University Jan-Jan Wu Institute of Information Science, Academia Sinica Research Center for Information Technology Innovation, Academia Sinica Chia-Chun Shih, Chao-Wen Huang Chunghwa Telecom Laboratories

Agenda Introduction Problem Definition Resource Provisioning Algorithm Evaluation Conclusion

Motivation Private cloud has limited amount of hardware resources. ◦ Fixed amount of servers for most of the time. Applications have varying characteristics and SLAs. ◦ SLA: service level agreement ◦ Insufficient resources allocation leads to SLA violation, which incurs penalty.

Goal Dynamically adjust the computing resources for different types of applications, such that the penalty is minimized. ◦ Penalty incurred by SLA violation. ◦ Private cloud, where hardware resources are considered to be fixed and limited.

Application Type – Batch Job A set of independent computation- intensive tasks with the similar execution time. ◦ [SLA]: finish within a (soft) deadline.

Application Type – Interactive Job Interactive job ◦ Long-running application that serves requests from users.  State-less ◦ [SLA]: response within a threshold.

Penalty of Jobs The penalty of a job j with m processing units: ◦ r : the penalty rate. ◦ v: the amount of SLA violation

SLA Violation For each job j  c(m): the completion time  d: the (soft) deadline  E: the expected fraction of satisfying requests.  f(m): the actual fraction of satisfying responses

Contribution Design a framework that allocate resources to batch and interactive jobs. Provide theoretical analyses on the expected penalty of jobs. Propose a heuristic algorithm that minimizes the total penalties.

Resource Provisioning Problem Given a set of batch jobs B, a set of interactive jobs I, the number of processing units M, and a penalty value C. Is there a schedule to run all jobs with the total penalty incurred before all batch jobs complete no more than C? ◦ Each job has a penalty rate r and corresponding SLA requirement.

Finding A Solution Given total M processing units. ◦ Dynamically determine M i and M b that minimize the total penalty. MiMi MbMb

Heuristic Estimate the penalty of interactive jobs. Estimate the penalty of batch jobs. Determine M i and M b. ◦ The number of processing units assigned to interactive and batch jobs.

Penalty Estimation – Interactive Jobs Given k interactive job For any given M i : Determine m 1 ~ m k that minimize …… m1m1 m2m2 mkmk

Minimizing Penalty of Interactive Jobs Compute the minimum penalty of all interactive jobs using dynamic programming. ◦ Define D(j, m) as the minimum penalty of running the first j jobs with m processing units. Minimum penalty:

Penalty Estimation – Batch Jobs Given k batch job For any given M b : b2b2 b2b2 b3b3 b3b3 b3b3 b1b1 Time

Job Execution Order and Scheduling Execution order: Greedy ◦ Select the job with the least effects to other unselected jobs until all jobs are selected. Scheduling: ◦ An available processing unit pick a task from the sorted job list for execution. Compute penalty

Determine M i and M b Penalty of interactive jobs: Penalty of batch jobs: Minimize penalty P:

Evaluation Environment Hardware ◦ Managing nodes and four worker nodes. ◦ Each worker node has 16 processing units. Workload trace ◦ Batch job: Samples from SDSC-Par96 trace log. ◦ Interactive job: ◦ Samples form Calgary-HTTP and Saskatchewan-HTTP trace log.

Evaluation Conducted two sets of experiments ◦ Batch jobs  Compares the average penalty of our greedy algorithm against other methods. ◦ Batch and interactive jobs  Compares the SLA violation penalty.

Penalty among Different Methods Apply different methods to schedule batch jobs, and compare the total penalty. ◦ Compare our Greedy method with Earliest Deadline First strategy(EDF), Least Slack Time First strategy (LST), Least Slack Time Rate First strategy (LSTR), and Highest Penalty Rate First strategy (HPRF)

Penalty of Mixed Jobs Apply different methods to determine M i and M b, and compare the total penalty. ◦ Compare our dynamic programming(DP) with static fraction(SF) and penalty proportion(PP).

Conclusion We propose a heuristic algorithm that minimize the total SLA violation penalty of different types of applications. ◦ Batch and interactive jobs. The experimental results suggest that our system effectively reduces total penalty by allocating proper amount of resources to heterogeneous jobs.

Thank you!

Penalty of Batch Job Penalty of a batch job: ◦ r : the penalty rate. ◦ c(m) : the completion time with m processing units. ◦ d :the (soft) deadline.

Penalty of Interactive Job Penalty of a interactive job: ◦ r : the penalty rate. ◦ E : the expected fraction of satisfying requests. ◦ f(m) : the fraction of satisfying requests when given m processing units.

Penalty Estimation – Interactive Jobs Penalty of an interactive job j: ◦ f(j, m) : the fraction of satisfying requests of job j when given m processing units. Estimate P(j, m) by computing f(j, m) for each j and m using queuing theory.

Penalty Estimation – Batch Jobs Penalty of an batch job i: ◦ c(i,m,s): the completion time of job i with m processing units. ◦ s: the starting time of job i.

Accuracy on Penalty Estimation Compute the ratio between P t and the actual final penalty. ◦ P t : the estimated penalty of all jobs during execution ◦ Converges quickly.