System Performance & Scalability i206 Fall 2010 John Chuang.

Slides:



Advertisements
Similar presentations
Introduction to Queuing Theory
Advertisements

1 Chapter 8 Queueing models. 2 Delay and Queueing Main source of delay Transmission (e.g., n/R) Propagation (e.g., d/c) Retransmission (e.g., in ARQ)
Queueing Model 박희경.
1 Part II Web Performance Modeling: basic concepts © 1998 Menascé & Almeida. All Rights Reserved.
ECS 152A Acknowledgement: slides from S. Kalyanaraman & B.Sikdar
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Simple queuing models (Sec )
1 Performance Evaluation of Computer Networks Objectives  Introduction to Queuing Theory  Little’s Theorem  Standard Notation of Queuing Systems  Poisson.
Single queue modeling. Basic definitions for performance predictions The performance of a system that gives services could be seen from two different.
Data Communication and Networks Lecture 13 Performance December 9, 2004 Joseph Conron Computer Science Department New York University
1 Queueing Theory H Plan: –Introduce basics of Queueing Theory –Define notation and terminology used –Discuss properties of queuing models –Show examples.
Queueing Network Model. Single Class Model Open - Infinite stream of arriving customers Closed - Finite population eg Intranet users Indistinguishable.
Queuing. Elements of Waiting Lines  Population –Source of customers Infinite or finite.
CS533 Modeling and Performance Evaluation of Network and Computer Systems Queuing Theory (Chapter 30-31)
Queueing Theory.
Queuing Theory. Queuing theory is the study of waiting in lines or queues. Server Pool of potential customers Rear of queue Front of queue Line (or queue)
Introduction to Queuing Theory. 2 Queuing theory definitions  (Kleinrock) “We study the phenomena of standing, waiting, and serving, and we call this.
Internet Queuing Delay Introduction How many packets in the queue? How long a packet takes to go through?

Introduction to Queuing Theory
Queueing Theory I. Summary Little’s Law Queueing System Notation Stationary Analysis of Elementary Queueing Systems  M/M/1  M/M/m  M/M/1/K  …
Queuing Theory Summary of results. 2 Notations Typical performance characteristics of queuing models are: L : Ave. number of customers in the system L.
Network Analysis A brief introduction on queues, delays, and tokens Lin Gu, Computer Networking: A Top Down Approach 6 th edition. Jim Kurose.
Introduction to Queuing Theory
Queuing models Basic definitions, assumptions, and identities Operational laws Little’s law Queuing networks and Jackson’s theorem The importance of think.
Management of Waiting Lines McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
CS433 Modeling and Simulation Lecture 13 Queueing Theory Dr. Anis Koubâa 03 May 2009 Al-Imam Mohammad Ibn Saud University.
 Birth Death Processes  M/M/1 Queue  M/M/m Queue  M/M/m/B Queue with Finite Buffers  Results for other Queueing systems 2.
Introduction to Operations Research
Performance Analysis of Computer Systems and Networks Prof. Varsha Apte.
Lecture 10: Queueing Theory. Queueing Analysis Jobs serviced by the system resources Jobs wait in a queue to use a busy server queueserver.
NETE4631:Capacity Planning (2)- Lecture 10 Suronapee Phoomvuthisarn, Ph.D. /
Introduction to Queueing Theory
Queuing Theory Basic properties, Markovian models, Networks of queues, General service time distributions, Finite source models, Multiserver queues Chapter.
1 Queueing Theory Frank Y. S. Lin Information Management Dept. National Taiwan University
Queueing Theory What is a queue? Examples of queues: Grocery store checkout Fast food (McDonalds – vs- Wendy’s) Hospital Emergency rooms Machines waiting.
TexPoint fonts used in EMF.
CS433 Modeling and Simulation Lecture 12 Queueing Theory Dr. Anis Koubâa 03 May 2008 Al-Imam Mohammad Ibn Saud University.
1 Chapters 8 Overview of Queuing Analysis. Chapter 8 Overview of Queuing Analysis 2 Projected vs. Actual Response Time.
Queuing Theory and Traffic Analysis Based on Slides by Richard Martin.
CS352 - Introduction to Queuing Theory Rutgers University.
CSCI1600: Embedded and Real Time Software Lecture 19: Queuing Theory Steven Reiss, Fall 2015.
Maciej Stasiak, Mariusz Głąbowski Arkadiusz Wiśniewski, Piotr Zwierzykowski Model of the Nodes in the Packet Network Chapter 10.
Internet Applications: Performance Metrics and performance-related concepts E0397 – Lecture 2 10/8/2010.
Introduction Definition M/M queues M/M/1 M/M/S M/M/infinity M/M/S/K.
Queuing Theory.  Queuing Theory deals with systems of the following type:  Typically we are interested in how much queuing occurs or in the delays at.
COMT 4291 Queuing Analysis COMT Call/Packet Arrival Arrival Rate, Inter-arrival Time, 1/ Arrival Rate measures the number of customer arrivals.
Queueing Fundamentals for Network Design Application ECE/CSC 777: Telecommunications Network Design Fall, 2013, Rudra Dutta.
Management of Waiting Lines Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent.
Random Variables r Random variables define a real valued function over a sample space. r The value of a random variable is determined by the outcome of.
Mohammad Khalily Islamic Azad University.  Usually buffer size is finite  Interarrival time and service times are independent  State of the system.
Simple Queueing Theory: Page 5.1 CPE Systems Modelling & Simulation Techniques Topic 5: Simple Queueing Theory  Queueing Models  Kendall notation.
QUEUING THEORY 1.  - means the number of arrivals per second   - service rate of a device  T - mean service time for each arrival   = ( ) Utilization,
Queuing Theory Simulation & Modeling.
WAITING LINES AND SIMULATION
Al-Imam Mohammad Ibn Saud University
Queueing Theory What is a queue? Examples of queues:
B.Ramamurthy Appendix A
CSCI1600: Embedded and Real Time Software
Queueing Theory Carey Williamson Department of Computer Science
Introduction Notation Little’s Law aka Little’s Result
Queuing models Basic definitions, assumptions, and identities
Waiting Lines Queues.
System Performance: Queuing
Queuing models Basic definitions, assumptions, and identities
TexPoint fonts used in EMF.
Queuing Theory By: Brian Murphy.
Queueing Theory 2008.
Carey Williamson Department of Computer Science University of Calgary
Waiting Line Models Waiting takes place in virtually every productive process or service. Since the time spent by people and things waiting in line is.
Course Description Queuing Analysis This queuing course
Presentation transcript:

System Performance & Scalability i206 Fall 2010 John Chuang

2

John Chuang3 Computing Trends  Multi-core CPUs  Data centers  Cloud computing  What are the drivers? -scalability, availability, cost-effectiveness

John Chuang4 Lecture Outline  Performance Metrics  Availability  Queuing theory -M/M/1 queue  Scalability -M/M/m queue

John Chuang5 What is Performance?  Users want fast response time and high availability  Managers want happy users, and many of them, while minimizing cost  What are standard measures of system performance?

John Chuang6 Performance Metrics  Response time (seconds)  Throughput (MIPS, Mbps, TPS,...)  Resource utilization (%)  Availability (%)

John Chuang7 Availability Down-time per yearOne hour down-time per: 90%36 days9 hours 99%3.7 days4.1 days 99.9%9 hours41.6 days 99.99%53 minutes1.14 years %5 minutes11.41 years Availability = MTTF / (MTTF + MTTR) -Mean-time-to-failure (MTTF) -Mean-time-to-recover (MTTR)

John Chuang8 Response Time ClientServer Formulate request Message latency Processing time Interpret response Network Queuing time Adapted from: David Messerschmitt

John Chuang9 Queuing Theory 1. Arrival Process 2. Service Time Distribution 3. Number of Servers 4. System Capacity 5. Customer Population 6. Service Discipline Source: Raj Jain

John Chuang10 Kendall’s Notation (1953)  A/B/c/k/N/D -A: arrival process -B: service time distribution -c: number of servers -k: system capacity -N: population size -D: service discipline M: Markov (exponential, memoryless, random, Poisson) D: deterministic E: Erlang H: hyper-exponential G: general FCFS: first come first served FCLS: first come last served RR: round-robin etc. 1. Arrival Process 2. Service Time Distribution 3. Number of Servers 4. System Capacity 5. Customer Population 6. Service Discipline

John Chuang11 Example Systems  M/M/1/ / /FCFS (simplified as M/M/1) -Markovian (Poisson, memoryless) arrival -Markovian service time -1 server -Infinite server capacity -Infinite arrival stream -First-come-first-serve discipline  Other examples: -M/M/1/k (finite capacity) -M/M/m (m servers) -G/D/1 (arbitrary arrival, deterministic service time) 8 8

John Chuang12 M/M/1 Queue  Poisson arrival, with average arrival rate of jobs/sec  Poisson service, with average service rate of  jobs/sec  Single server with infinite queue  System utilization (hopefully < 1):  = /   Average number of jobs in system: N =  n·p n =  /(1 -  )  System throughput (if  < 1) : X =  Average response time (from Little’s Law): R = N/X = 1/(  - )

John Chuang13 Example: Web Server  Web server receives 40 requests/second  Web server can process 100 requests/second  What is server utilization?  At any given time, how many requests are at server (waiting plus being processed)?  What is the mean total delay at server (waiting plus processing)?  What happens when traffic rate doubles?

John Chuang14 Example: Web Server  = 40 requests/second   = 100 requests/second  Utilization =  = /  = 40/100 = 40%  # of requests = N =  /(1 -  ) = 0.67  Average time spent at server = R = N/X = 0.67/40 = 17ms

John Chuang15 Example: Traffic Doubled  = 80 requests/second   = 100 requests/second  Utilization =  = /  = 80/100 = 80%  # of requests = N =  /(1 -  ) = 4  Average time spent at server = R = N/X = 4/80 = 50ms (more than doubled!)

John Chuang16 Approaching Congestion  = 99 requests/second   = 100 requests/second  Utilization =  = /  = 99/100 = 99%  # of requests = N =  /(1 -  ) = 99  Average time spent at server = R = N/X = 99/99 = 1 second!

John Chuang17 Utilization Affects Performance

John Chuang18 M/M/1/k Queue (Finite Capacity)   = /   N =  /(1-  ) – (k+1)  k+1 /(1-  k+1 )  R = N/X = N/ eff -where eff = (1-P k ) = effective arrival rate -and P k =  k (1-  )/(1-  k+1 ) = probability of a full queue  Loss rate = - eff

John Chuang19 M/M/1/k Response Time

John Chuang20 M/M/1/k Throughput

John Chuang21 Lecture Outline  Performance Metrics  Availability  Queuing theory -M/M/1 queue  Scalability -M/M/m queue

John Chuang22 Scalability  The capability of a system to increase total throughput under an increased load when resources (typically hardware) are added -Cost of additional resource -Performance degradation under increased load

John Chuang23 Scalability Example  Original web server: can process  requests/sec; accepts requests at /sec  Now request rate increases to 10 /sec and web server is swamped (  = 10 /  )!  Need to add new hardware!

John Chuang24 Which is better?  Option 1: One big web server that can process 10  requests/sec  Option 2: Ten web servers, each can process  requests/sec; each accepts 10% of requests ( /sec per server)  Option 3: Ten web servers, each can process  requests/sec; share single queue (load balancer) that accepts requests at 10 /sec

John Chuang25                        Option 1: M/M/1 queue with big server Option 2: (ten M/M/1 queues) Option 3: M/M/10 queue

John Chuang26 M/M/m Queue (m Servers)   = /m   N = m  +  /(1-  ) where and 

John Chuang27 Which is Better? Option 1 (M/M/1 big) Option 2 (ten M/M/1) Option 3 (M/M/10) Utilization (  ) 0.5 Number of requests (N) 11* Response Time (R) 2ms20ms10.07ms m = 10;  = 100; = 50 Remember: Scalability is not just about performance!