I think your suggestion is, Can we do two things at once? Well, we’re of the view that we can walk and chew gum at the same time. —Richard Armitage, deputy.

Slides:



Advertisements
Similar presentations
Network Design and Optimization Queueing Theory Overview Dr. Greg Bernstein Grotto Networking
Advertisements

S. D. Deshmukh OM V. Capacity Planning in Services u Matching Supply and Demand u The Service Process u Performance Measures u Causes of Waiting u Economics.
ANDREW MAO, STACY WONG Regrets and Kidneys. Intro to Online Stochastic Optimization Data revealed over time Distribution of future events is known Under.
Load Balancing of Elastic Traffic in Heterogeneous Wireless Networks Abdulfetah Khalid, Samuli Aalto and Pasi Lassila
1 Stochastic Event Capture Using Mobile Sensors Subject to a Quality Metric Nabhendra Bisnik, Alhussein A. Abouzeid, and Volkan Isler Rensselaer Polytechnic.
Queueing Theory: Recap
SES 2007 A Multiresolution Approach for Statistical Mobility Prediction of Unmanned Ground Vehicles 44 th Annual Technical Meeting of the Society of Engineering.
Queuing Theory For Dummies
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Model Fitting Jean-Yves Le Boudec 0. Contents 1 Virus Infection Data We would like to capture the growth of infected hosts (explanatory model) An exponential.
1 IOE/MFG 543 Chapter 11: Stochastic single machine models with release dates.
Dynamic Power Management for Systems with Multiple Power Saving States Sandy Irani, Sandeep Shukla, Rajesh Gupta.
Chapter 7 Sampling and Sampling Distributions
Nov 14 th  Homework 4 due  Project 4 due 11/26.
1 DYNAMIC SCHEDULING OF A PARALLEL SERVER SYSTEM IN HEAVY TRAFFIC UNDER A COMPLETE RESOURCE POOLING CONDITION Ruth J. Williams University of California,
Presenting: Assaf Tzabari
Join-the-Shortest-Queue (JSQ) Routing in Web Server Farms
June 6, 2002D.H.J. Epema/PDS/TUD1 Processor Co-Allocation in Multicluster Systems DAS-2 Workshop Amsterdam June 6, 2002 Anca Bucur and Dick Epema Parallel.
Effect of higher moments of job size distribution on the performance of an M/G/k system VARUN GUPTA Joint work with: Mor Harchol-Balter Carnegie Mellon.
Effect of higher moments of job size distribution on the performance of an M/G/k system VARUN GUPTA Joint work with: Mor Harchol-Balter Carnegie Mellon.
1 IOE/MFG 543 Chapter 10: Single machine stochastic models Sections 10.1 and 10.4 You may skip Sections
Buffer Management for Shared- Memory ATM Switches Written By: Mutlu Apraci John A.Copelan Georgia Institute of Technology Presented By: Yan Huang.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
1 Real-Time Queueing Network Theory Presented by Akramul Azim Department of Electrical and Computer Engineering University of Waterloo, Canada John P.
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
OPSM 301: Operations Management
Particle Filtering in Network Tomography
Standard Statistical Distributions Most elementary statistical books provide a survey of commonly used statistical distributions. The reason we study these.
Network Analysis A brief introduction on queues, delays, and tokens Lin Gu, Computer Networking: A Top Down Approach 6 th edition. Jim Kurose.
Queuing models Basic definitions, assumptions, and identities Operational laws Little’s law Queuing networks and Jackson’s theorem The importance of think.
1 Exponential distribution: main limitation So far, we have considered Birth and death process and its direct application To single server queues With.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Modelling and channel borrowing in mobile communications networks Richard J. Boucherie University of Twente Faculty of Mathematical Sciences University.
Networks of Queues Plan for today (lecture 6): Last time / Questions? Product form preserving blocking Interpretation traffic equations Kelly / Whittle.
Queuing Theory Basic properties, Markovian models, Networks of queues, General service time distributions, Finite source models, Multiserver queues Chapter.
Waiting Lines and Queuing Models. Queuing Theory  The study of the behavior of waiting lines Importance to business There is a tradeoff between faster.
Queuing Networks Jean-Yves Le Boudec 1. Contents 1.The Class of Multi-Class Product Form Networks 2.The Elements of a Product-Form Network 3.The Product-Form.
1 Chapters 8 Overview of Queuing Analysis. Chapter 8 Overview of Queuing Analysis 2 Projected vs. Actual Response Time.
Appointment Systems - a Stochastic and Fluid Approach Michal Penn The William Davidson Faculty of Industrial Engineering and Management Technion - Israel.
Michael J. Neely, University of Southern California CISS, Princeton University, March 2012 Asynchronous Scheduling for.
Opportunistic Traffic Scheduling Over Multiple Network Path Coskun Cetinkaya and Edward Knightly.
An Optimal Design of the M/M/C/K Queue for Call Centers
1 OUTPUT ANALYSIS FOR SIMULATIONS. 2 Introduction Analysis of One System Terminating vs. Steady-State Simulations Analysis of Terminating Simulations.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
1 Optimizing Decisions over the Long-term in the Presence of Uncertain Response Edward Kambour.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Modeling and Simulation CS 313
OPERATING SYSTEMS CS 3502 Fall 2017
Heuristic Optimization Methods
The Best Duo
Modeling and Simulation CS 313
Serve Assignment Policies
Deadline Scheduling and Heavy tail distributionS
Analytics and OR DP- summary.
Generalization and adaptivity in stochastic convex optimization
Greedy Algorithms / Interval Scheduling Yin Tat Lee
Our favorite simple stochastic process.
Queuing models Basic definitions, assumptions, and identities
Chapter 6. Large Scale Optimization
Queuing models Basic definitions, assumptions, and identities
Deadline Scheduling and Heavy tailED distributions
Buffer Management for Shared-Memory ATM Switches
Queuing Networks Mean Value Analysis
Quantifying Trade-Offs via Competitive Analysis (Clean Slate Seminar)
Resource Sharing with Subexponential Distributions
Optimal Control for Generalized Network-Flow Problems
Presentation transcript:

I think your suggestion is, Can we do two things at once? Well, we’re of the view that we can walk and chew gum at the same time. —Richard Armitage, deputy secretary of state, June 2, 2004 To do two things at once is to do neither. —Publilius Syrus, Roman slave, first century B.C. Therefore… Multitask a few…..let the rest queue.

Computer systems are not different… Multi-Programming Level (# concurrent tasks/threads) Efficiency (useful work per second) thrashing Therefore… Server Active tasks Tasks not-yet-started Q: Max number of tasks allowed to share server? Common Soln: K* Admission Control K*

Presenting How to Multitask Smartly

4 A Queueing-theoretic abstraction server # jobs at server (n) Total speed  (n) K* (known)

5 A Queueing-theoretic abstraction PS server 1 2 K K = MPL # jobs at server (n) Total speed  (n) K* FIFO buffer The G/G/PS-MPL model General arrival process for jobs Job sizes i.i.d. ~ X Sizes unknown, distribution of X known GOAL: Find MPL (i.e. K) to minimize mean response time response time n jobs each job gets rate  (n) n  :: (known)

Optimal MPL= K* ? # jobs at server (n) Total speed (jobs/sec) K*=5 Exponential job sizes (C 2 = 1) Weibull job sizes (C 2 = 19) Mean response time MPL Mean response time K* OPT MPL Example Poisson(0.8) arrival process 45% improvement

Optimal MPL= K* ? # jobs at server (n) Total speed (jobs/sec) K*=5 Exponential job sizes (C 2 = 1) Weibull job sizes (C 2 = 19) Mean response time MPL Mean response time K* OPT MPL Example Poisson(0.8) arrival process DEPENDS! 45% improvement

8 Intuition for the effect of MPL FIFO server Processor Sharing (PS)

9 Intuition for the effect of MPL FIFO server Processor Sharing (PS) High job-size variability (C 2 ) High arrival rate   # jobs at server (n) Total speed  (n) K* MPL

10 Intuition for the effect of MPL server FIFO Processor Sharing (PS) High job-size variability (C 2 ) High arrival rate Optimal MPL ↑ C 2 ↑    Arrival rate ↑  Optimal MPL → K* MPL

11 Why is finding the optimal MPL hard? - No analysis even for the case of Poisson arrivals (M/G/PS-MPL) Hurdle 1: State dependent service rates Hurdle 2: Finite Multi- Programming-Limit Hurdle 3: General (high C 2 ) job size distribution Mean response time only depends on E[X] [Kelly 79] Transform of response time known for Exponential job size distribution [Rege Sengupta 85] Heavy traffic diffusion approximation [Zhang Zwart 09] Our work

12 Where we are… GOAL: Optimal MPL for M/G/PS-MPL SUBGOAL: Approximation for M/G/PS-MPL What if the traffic rate is unknown? What if the traffic is not Poisson? GOAL #2: Traffic-oblivious dynamic MPL control policies ?

13 Where we are… GOAL: Optimal MPL for M/G/PS-MPL SUBGOAL: Approximation for M/G/PS-MPL What if the traffic rate is unknown? What if the traffic is not Poisson? GOAL #2: Traffic-oblivious dynamic MPL control policies ?

14 RESULT: M/G/PS-MPL approximation PS Server  (n) 1 2 K FIFO buffer Job sizes ~ X Poisson( ) Easy for X  Exp (C 2 =1)

15 RESULT: M/G/PS-MPL approximation Involves only the first two moments of job size distribution Generalizes -Lee and Longton approx. for M/G/K multi-server system:  (n)= n·  -Zhang and Zwart approx. for the case  (n)=  THEOREM: The above approx. is exact for the following degenerate hyper-exponential job size distribution: - 2 degrees of freedom - Can match any E[X] and C 2 ≥ 1

16 Where we are… GOAL: Optimal MPL for M/G/PS-MPL SUBGOAL: Approximation for M/G/PS-MPL What if the traffic rate is unknown? What if the traffic is not Poisson? GOAL #2: Traffic-oblivious dynamic MPL control policies ?

17 Where we are… GOAL: Optimal MPL for M/G/PS-MPL SUBGOAL: Approximation for M/G/PS-MPL What if the traffic rate is unknown? What if the traffic is not Poisson? GOAL #2: Traffic-oblivious dynamic MPL control policies ?

18 Static MPL policy for M/G/PS-MPL Set MPL = MPL*, where:

19 Approx accurately predicts the behavior of the curve as a function of MPL MPL* decreases as arrival rate increases (Poisson(0.8) arrivals) (Poisson(0.9) arrivals) # jobs at server (n) Total speed  (n) MPL Mean Resp. Time Our approx. Weibull (C 2 =19) K* K*K* MPL* K* gives 25% worse performance than MPL* K* gives 45% worse performance than MPL* Our approx.

20 Where we are… GOAL: Optimal MPL for M/G/PS-MPL SUBGOAL: Approximation for M/G/PS-MPL What if the traffic rate is unknown? What if the traffic is not Poisson? GOAL #2: Traffic-oblivious dynamic MPL control policies ?

21 Where we are… GOAL: Optimal MPL for M/G/PS-MPL SUBGOAL: Approximation for M/G/PS-MPL What if the traffic rate is unknown? What if the traffic is not Poisson? GOAL #2: Traffic-oblivious dynamic MPL control policies ?

22 Traffic-oblivious policies Straw man proposal 1: Choose a “robust” static MPL –Must choose MPL=K* : but suboptimal in light/moderate traffic Straw man proposal 2: Learn the arrival rate –Can’t adapt to changes on small time scale/correlations WE DEMONSTRATE: Dynamic MPL control policies which are 1. Traffic-oblivious: self-adapt to variations in the arrival process 2. Light-weight: make decisions based only on current queue length, Q(t), and current MPL, K(t) I don’t know the arrival rate!! My arrivals are not Poisson!!

23 Traffic-oblivious policies (KEY STEPS) GOAL: A robust MPL-control policy π OPT STEP 1:Approximate the orig. job size distribution X ≈ _____ STEP 2:Construct candidate policies π 1 π 2 π 3 … π N STEP 3:Pick the “best” candidate policy

24 Traffic-oblivious policies (KEY STEPS) GOAL: A robust MPL-control policy π OPT STEP 1:Approximate the orig. job size distribution X ≈ _____ STEP 2:Construct candidate policies π 1 π 2 π 3 … π N STEP 3:Pick the “best” candidate policy

25 GOAL: Approximate the job size distribution by a tractable distribution IDEA: The H* distribution: Models many small vs. few large jobs Tractable due to Exponential branch Traffic-oblivious policies (KEY STEPS) For age/size independent policies, H* provides a good approximation for mean response time p and γ chosen to match E[X] and C 2

26 Traffic-oblivious policies (KEY STEPS) GOAL: A robust MPL-control policy π OPT STEP 1:Approximate the orig. job size distribution X ≈ H* STEP 2:Construct candidate policies π 1 π 2 π 3 … π N STEP 3:Pick the “best” candidate policy

27 Traffic-oblivious policies (KEY STEPS) GOAL: A robust MPL-control policy π OPT STEP 1:Approximate the orig. job size distribution X ≈ H* STEP 2:Construct candidate policies π 1 π 2 π 3 … π N STEP 3:Pick the “best” candidate policy

28 GOAL: A family of candidate dynamic policies IDEA: 1. Assume arrival ~ Poisson( ) 2. Solve for 3. Obtain the family by varying Traffic-oblivious policies (KEY STEPS) Optimal Dynamic MPL policy to minimize mean response time under H* job sizes  Intuitively : What should this look like?

29 Structure of Current Queue Length Current MPL MPL ↓ MPL ↑ K*= obtained by combining policy iteration with some new tricks to handle infinite state space “Stochastic” component“Fluid” = Target MPL

30 Traffic-oblivious policies (KEY STEPS) GOAL: A robust MPL-control policy π OPT STEP 1:Approximate the orig. job size distribution X ≈ H* STEP 2:Construct candidate policies π(λ 1 ) π(λ 2 ) … π(λ N ) STEP 3:Pick the “best” candidate policy

31 Traffic-oblivious policies (KEY STEPS) GOAL: A robust MPL-control policy π OPT STEP 1:Approximate the orig. job size distribution X ≈ H* STEP 2:Construct candidate policies π(λ 1 ) π(λ 2 ) … π(λ N ) STEP 3:Pick the “best” candidate policy

32 GOAL: Robustly choose a member from family IDEA: Minimize the worst-case relative deviation in an adversarial setting Traffic-oblivious policies (KEY STEPS) a perf. ratio( , )  Mean resp. time of policy  under Poisson( ) Mean resp. time of OPT static MPL under Poisson( ) a* = argmin max perf. ratio(  (a), ) - You pick  - Adversary sees , picks - You lose perf. ratio( , ) - Optimize against worst adversary

33 Traffic-oblivious policies (KEY STEPS) GOAL: A robust MPL-control policy π OPT STEP 1:Approximate the orig. job size distribution X ≈ H* STEP 2:Construct candidate policies π(λ 1 ) π(λ 2 ) … π(λ N ) STEP 3:Pick the “best” candidate policy min max OPT

34 Back to picking optimal MPL… Scenario #1 Arrival process ~ Poisson(?) Scenario #2 Arrival process ~ ?(?) (results for Batch Poisson arrivals) Worst Perf. Loss (vs. OPT Static MPL) K* Our Dynamic Policy K* Our Dynamic Policy - Dynamic policies remain “robust” to non-Poisson arrival process - Static policies become worse 45%20% 14%50%

35 SUMMARY Efficiency ≠ Min. response time Static MPL policies via the first approximation for M/G/PS-MPL with state dependent service rates First traffic-oblivious MPL control policies for resource sharing systems See paper for more results MPL* MPL ↓ MPL ↑

36 True arrival rate ( )

37 True arrival rate ( )

38 Max. dev. < 14% True arrival rate ( )

39 vs. (Simulation results for Weibull job size dist.) rel. dev. Mean arrival rate Poisson arrivalsBatch Poisson (BPP) arrivals Dynamic policies remain robust to non-Poisson arrival processes Traffic-oblivious dynamic policies Traffic-aware static policies