© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Injecting Realistic Burstiness to.

Slides:



Advertisements
Similar presentations
Lookback Scheduling for Long-Term Quality-of-Service Over Multiple Cells Hatem Abou-zeid*, Stefan Valentin, Hossam S. Hassanein*, and Mohamed F. Feteiha.
Advertisements

IWQoS2006, New Haven, CT, June 19 – 21, 2006 Improving Performance of Internet Services Through Reward-Driven Request Prioritization Alexander Totok and.
1 Sizing the Streaming Media Cluster Solution for a Given Workload Lucy Cherkasova and Wenting Tang HPLabs.
MINERVA: an automated resource provisioning tool for large-scale storage systems G. Alvarez, E. Borowsky, S. Go, T. Romer, R. Becker-Szendy, R. Golding,
Doc.: IEEE /0604r1 Submission May 2014 Slide 1 Modeling and Evaluating Variable Bit rate Video Steaming for ax Date: Authors:
G. Alonso, D. Kossmann Systems Group
1 M. Teixeira and Pablo Sabadin Planning Databases Service Level.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part IV Capacity Planning Methodology.
1 10 Web Workload Characterization Web Protocols and Practice.
Performance Engineering Methodology Chapter 4. Performance Engineering Performance engineering analyzes the expected performance characteristics of a.
1 Part IV Capacity Planning Methodology © 1998 Menascé & Almeida. All Rights Reserved.
An Adaptable Benchmark for MPFS Performance Testing A Master Thesis Presentation Yubing Wang Advisor: Prof. Mark Claypool.
1 Part II Web Performance Modeling: basic concepts © 1998 Menascé & Almeida. All Rights Reserved.
1 Giuliano Casale, Eddy Z. Zhang, Evgenia Smirni {casale, eddy, Speaker: Giuliano Casale Numerical Methods for Structured Markov Chains,
Statistics & Modeling By Yan Gao. Terms of measured data Terms used in describing data –For example: “mean of a dataset” –An objectively measurable quantity.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Performance Evaluation
1 KPC-Toolbox Demonstration Eddy Zheng Zhang, Giuliano Casale, Evgenia Smirni Computer Science Department College of William & Mary.
1 Multiple class queueing networks Mean Value Analysis - Open queueing networks - Closed queueing networks.
UCF Noise Map, an Android Application Project Presentation Cindy Atherton and Deepa Shinde.
Adaptive Content Delivery for Scalable Web Servers Authors: Rahul Pradhan and Mark Claypool Presented by: David Finkel Computer Science Department Worcester.
01/22/09ICDCS20061 Load Unbalancing to Improve Performance under Autocorrelated Traffic Ningfang Mi College of William and Mary Joint work with Qi Zhang.
Yaksha: A Self-Tuning Controller for Managing the Performance of 3-Tiered Web Sites Abhinav Kamra, Vishal Misra CS Department Columbia University Erich.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
A Traffic Characterization of Popular On-Line Games Wu-Chang Feng, Francis Chang, Wu- Chi Feng, and Jonathan Walpole IEEE/ACM Trans. Networking, Jun
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Automated Workload Management in.
1 Part VI System-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
Using Standard Industry Benchmarks Chapter 7 CSE807.
Self-Similarity of Network Traffic Presented by Wei Lu Supervised by Niclas Meier 05/
Adaptive Control of Virtualized Resources in Utility Computing Environments HP Labs: Xiaoyun Zhu, Mustafa Uysal, Zhikui Wang, Sharad Singhal University.
Efficient Scheduling of Heterogeneous Continuous Queries Mohamed A. Sharaf Panos K. Chrysanthis Alexandros Labrinidis Kirk Pruhs Advanced Data Management.
1 An SLA-Oriented Capacity Planning Tool for Streaming Media Services Lucy Cherkasova, Wenting Tang, and Sharad Singhal HPLabs,USA.
Network Analysis A brief introduction on queues, delays, and tokens Lin Gu, Computer Networking: A Top Down Approach 6 th edition. Jim Kurose.
Ranking the Importance of Alerts for Problem Determination in Large Computer System Guofei Jiang, Haifeng Chen, Kenji Yoshihira, Akhilesh Saxena NEC Laboratories.
Software Performance Testing Based on Workload Characterization Elaine Weyuker Alberto Avritzer Joe Kondek Danielle Liu AT&T Labs.
Budget-based Control for Interactive Services with Partial Execution 1 Yuxiong He, Zihao Ye, Qiang Fu, Sameh Elnikety Microsoft Research.
A Regression-Based Analytic Model for Dynamic Resource Provisioning of Multi-Tier Applications Qi Zhang College of William and Mary Williamsburg, VA 23187,
Introduction to Web Applications Programming Lab : II MS (IT)
Microsoft Research1 Characterizing Alert and Browse Services for Mobile Clients Atul Adya, Victor Bahl, Lili Qiu Microsoft Research USENIX Annual Technical.
1 Challenges in Scaling E-Business Sites  Menascé and Almeida. All Rights Reserved. Daniel A. Menascé Department of Computer Science George Mason.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web (Book, chap. 6)
Queueing Models with Multiple Classes CSCI 8710 Tuesday, November 28th Kraemer.
Design and Evaluation of a Model for Multi-tiered Internet Applications Bhuvan Urgaonkar Internship project talk – Services Management Middleware Dept,
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
Empirical Quantification of Opportunities for Content Adaptation in Web Servers Michael Gopshtein and Dror Feitelson School of Engineering and Computer.
Authors: Mianyu Wang, Nagarajan Kandasamy, Allon Guez, and Moshe Kam Proceedings of the 3 rd International Conference on Autonomic Computing, ICAC 2006,
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web.
1 Part VII Component-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VI System-level Performance Models for the Web (Book, Chapter 8)
정하경 MMLAB Fundamentals of Internet Measurement: a Tutorial Nevil Brownlee, Chris Lossley, “Fundamentals of Internet Measurement: a Tutorial,” CMG journal.
When Average is Not Average: Large Response Time Fluctuations in n-Tier Applications Qingyang Wang, Yasuhiko Kanemasa, Calton Pu, Motoyuki Kawaba.
Emir Halepovic, Jeffrey Pang, Oliver Spatscheck AT&T Labs - Research
Development of a QoE Model Himadeepa Karlapudi 03/07/03.
Web Prefetching Lili Qiu Microsoft Research March 27, 2003.
Quality Is in the Eye of the Beholder: Meeting Users ’ Requirements for Internet Quality of Service Anna Bouch, Allan Kuchinsky, Nina Bhatti HP Labs Technical.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VI System-level Performance Models for the Web.
Abhinav Kamra, Vishal Misra CS Department Columbia University
Empirically Characterizing the Buffer Behaviour of Real Devices
Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.
Monkey See, Monkey Do A Tool for TCP Tracing and Replaying
Capacity Analysis, cont. Realistic Server Performance
Building a Database on S3
TimeTrader: Exploiting Latency Tail to Save Datacenter Energy for Online Search Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, and T. N. Vijaykumar.
Web switch support for differentiated services
Admission Control and Request Scheduling in E-Commerce Web Sites
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Multiple-resource Request Scheduling. for Differentiated QoS
Lecture 2 Part 3 CPU Scheduling
Presentation transcript:

© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Injecting Realistic Burstiness to a Traditional Client-Server Benchmark Ningfang Mi College of William and Mary Giuliano Casale SAP Research Ludmila Cherkasova Hewlett-Packard Labs Evgenia Smirni College of William and Mary Presenter: Lucy Cherkasova

2 International Conference on Autonomic Computing and Communications (ICAC) 2009 Origin of Burstiness Enterprise and Internet applications: Clients DB Server Front Server Web + Application Server HTTP request HTTP reply SQL query SQL reply Burstiness ?? Highly Correlated Arrivals ?

3 International Conference on Autonomic Computing and Communications (ICAC) 2009 Client-Server Benchmark E.g., TPC-W ( On-line bookstore Web site) Exponentially distributed user think times Exponentially distributed user think times Clients DB Server Front Server Web + Application Server HTTP request HTTP reply SQL query SQL reply Burstiness ?? Highly Correlated Arrivals ?

4 International Conference on Autonomic Computing and Communications (ICAC) 2009 Accounts for randomness and variability … but not for burstiness … but not for burstiness  Can we ignore burstiness in the arrival process? Typical Client-Server Benchmark BurstinessVariability Service time Request number

5 International Conference on Autonomic Computing and Communications (ICAC) 2009 Why Need to Inject Burstiness? Burstiness impacts the performance of resource allocation mechanisms. Example: Session-based admission control (SBAC) −User session: sequence of transaction requests −Session is a unit of work −Typically, long sessions are “sales”. −Useful system throughput is the number of completed sessions −Admission controller admits/rejects sessions based on observed CPU utilization of the server (a combination of last measurement and some history). L. Cherkasova, P. Phaal. Session Based Admission Control: a Mechanism for Peak Load Management of Commercial Web Sites. IEEE J. TOC, June 2002.

6 International Conference on Autonomic Computing and Communications (ICAC) 2009 SBAC Reject a new session when utilization is above the threshold Abort an accepted session when the server queue is full highly undesirable Front Server Web + Application Server DB Server New Client Arrival Requests from already accepted clients limited server queue

7 International Conference on Autonomic Computing and Communications (ICAC) 2009 Impact of Burstiness We performed experiments for the same workload with different arrival patterns: non-bursty vs bursty Aborted ratio = aborted sessions/accepted sessions highly undesirable Queue SizeNon-burstyBursty %11.37% %6.28% %2.50%

8 International Conference on Autonomic Computing and Communications (ICAC) 2009 Why Need to Inject Burstiness? (2) Service level agreement (SLA) −support given response time guarantees for accepted sessions SLA of 1.2s can be supported for 98% of requests with queue size =250 for non-bursty traffic Only 90% of requests meet SLA=1.2s bursty traffic. Queue Size Response Time (s) Non-Bursty Bursty

9 International Conference on Autonomic Computing and Communications (ICAC) 2009 Limitations of Standard TPC-W Think times are drawn randomly from the exponential distribution identical for all clients incompatible Exponential think times are incompatible with the notion of burstiness. Need to inject burstiness into user think times.

10 International Conference on Autonomic Computing and Communications (ICAC) 2009 Our Methodology Basic Idea: modify the distribution of client think time to create bursty arrivals Markovian Arrival Process −Regulate the arrivals by using a 2-phase Markovian Arrival Process (MAP). MAPs are variations of popular On/OFF traffic models that can be easily shaped to create correlated inter- arrival times All clients share a MAP(2) to draw think times A new module for client-server benchmarks index of dispersion −Regulate the intensity of traffic surges by using the index of dispersion. A simple tunable knob of burstiness

11 International Conference on Autonomic Computing and Communications (ICAC) 2009 Index of Dispersion (I) Popular burstiness index in networking Definition −SCV – the squared coefficient of variation (variance/mean 2 ) −ρ k – autocorrelation coefficients i.e., correlation of service times −Exponential: I = SCV = 1 variability burstiness BurstinessVariability Service time Request number

12 International Conference on Autonomic Computing and Communications (ICAC) 2009 Markovian Arrival Process (MAP) variability temporal locality MAPs have ability to provide variability and temporal locality. We use a class of MAPs with two states only Normal Traffic λ long Traffic Surge λ short 2 states: λ short > λ long p l,s p s,l p s,s p l,l time Num. of arrivals p l,s, p s,l, p s,s, p l,s shape correlation

13 International Conference on Autonomic Computing and Communications (ICAC) 2009 MAP Fitting Input −Estimated mean service demands at servers: E[D i ] −Mean user think time E[Z] −The pre-defined index of dispersion I Output −A MAP(2) to draw user think times

14 International Conference on Autonomic Computing and Communications (ICAC) 2009 MAP Fitting (2) Key: determine (λ short, λ long, p l,s, p s,l ) Condition for traffic surge Condition for normal traffic Mean think time We use non-linear optimizer to search for such f and p s,l and find a MAP(2) to best match the predefined I Departure > Arrival Arrival > Departure the arrival rate is f times higher than the throughput of the system the arrival rate is f times slower for balanced system throughput Balancing the height and the width of the burst

15 International Conference on Autonomic Computing and Communications (ICAC) 2009 Realistic values for Burstiness −What is the range of realistic values for defining burstiness via index of dispersion I ? Exponential: I = SCV = 1 Bursty: values of thousands, −e.g., FIFA World Cup 1998, one of the servers over 10 days, I = 6300

16 International Conference on Autonomic Computing and Communications (ICAC) 2009 TPC-W Testbed On-line bookstore Web site Testbed: clients + front server + DB server −Constant number of emulated browsers (EBs) User session −sequence of transaction requests −think time (mean=7 sec) between two transaction requests 14 transactions types grouped in three mixes: −Browsing mix −Shopping mix −Ordering mix

17 International Conference on Autonomic Computing and Communications (ICAC) 2009 Validation – Arrival Process Arrival clients to the system (front server) Think times drawn by a MAP(2) with I create the bursty conditions. Shopping Mix Non-bursty (I=1) Time (s) Number of active clients Bursty (I=4000) Time (s) Number of active clients

18 International Conference on Autonomic Computing and Communications (ICAC) 2009 Validation – Utilization Distribution Shopping Mix Non-bursty (I=1) Bursty (I=4000) pdf Utilization (%) Front DB

19 International Conference on Autonomic Computing and Communications (ICAC) 2009 Validation - Average Latency Browsing Mix Response time (ms) Shopping Mix Response time (ms)

20 International Conference on Autonomic Computing and Communications (ICAC) 2009 Validation – Latency Distributions Browsing Mix CDF Shopping Mix Response time (ms) CDF

21 International Conference on Autonomic Computing and Communications (ICAC) 2009 Conclusion Burstiness critical for autonomic system design −need representative benchmarks for system evaluation −need reproducible and controllable bursty workloads Traditional client-server benchmarks ignore burstiness in arrival flows −e.g., TPC-W with exponential think times Explicitly inject burstiness −a simple and tunable parameter: index of dispersion −can introduce different intensity of traffic surges Supported by NSF grants CNS and CCF and HPLabs gift.