1 Challenges in Scaling E-Business Sites  Menascé and Almeida. All Rights Reserved. Daniel A. Menascé Department of Computer Science George Mason.

Slides:



Advertisements
Similar presentations
Achieving Elasticity for Cloud MapReduce Jobs Khaled Salah IEEE CloudNet 2013 – San Francisco November 13, 2013.
Advertisements

Copyright © 2005 Department of Computer Science CPSC 641 Winter PERFORMANCE EVALUATION Often in Computer Science you need to: – demonstrate that.
Adapted from Menascé & Almeida.1 Workload Characterization for the Web.
Capacity Planning and Predicting Growth for Vista Amy Edwards, Ezra Freeloe and George Hernandez University System of Georgia 2007.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part IV Capacity Planning Methodology.
Performance Engineering Methodology Chapter 4. Performance Engineering Performance engineering analyzes the expected performance characteristics of a.
Adapted from Menascé & Almeida1 Web and Intranet Performance Issues.
1 Part IV Capacity Planning Methodology © 1998 Menascé & Almeida. All Rights Reserved.
1 Part II Web Performance Modeling: basic concepts © 1998 Menascé & Almeida. All Rights Reserved.
1 CSSE 477 – A bit more on Performance Steve Chenoweth Friday, 9/9/11 Week 1, Day 2 Right – Googling for “Performance” gets you everything from Lady Gaga.
1 Part III Web and Intranet Performance Issues © 1998 Menascé & Almeida. All Rights Reserved.
1 Web Performance Modeling Chapter New Phenomena in the Internet and WWW Self-similarity - a self-similar process looks bursty across several time.
A Hierarchical Characterization of a Live Streaming Media Workload E. Veloso, V. Almeida W. Meira, A. Bestavros, S. Jin Proceedings of Internet Measurement.
Transactions – T4.3 Title: Concurrency Control Performance Modeling: Alternatives and Implications Authors: R. Agarwal, M. J. Carey, M. Livny ACM TODS,
Web Server Software Architectures Author: Daniel A. Menascé Presenter: Noshaba Bakht.
1 PERFORMANCE EVALUATION H Often in Computer Science you need to: – demonstrate that a new concept, technique, or algorithm is feasible –demonstrate that.
Measuring Performance Chapter 12 CSE807. Performance Measurement To assist in guaranteeing Service Level Agreements For capacity planning For troubleshooting.
1 Multiple class queueing networks Mean Value Analysis - Open queueing networks - Closed queueing networks.
© Copyright , Blue Martini Software. San Mateo California, USA 1 1 Integrating E-Commerce and Data Mining: Architecture and Challenges Llew Mason.
1 Internet Management and Security We will look at management and security of networks and systems. Systems: The end nodes of the Internet Network: The.
Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.
1 Part VI System-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
Copyright © , Software Engineering Research. All rights reserved. Creating Responsive Scalable Software Systems Dr. Lloyd G. Williams Software.
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
Web application architecture
Computer Systems Performance Evaluation CSCI 8710 Kraemer Fall 2008.
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
Prof. Vishnuprasad Nagadevara Indian Institute of Management Bangalore
Computer System Lifecycle Chapter 1. Introduction Computer System users, administrators, and designers are all interested in performance evaluation. Whether.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
Computer Networks Performance Evaluation. Chapter 12 Single Class MVA Performance by Design: Computer Capacity Planning by Example Daniel A. Menascé,
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part III Web and Intranet Performance Issues.
AN INTRODUCTION TO THE OPERATIONAL ANALYSIS OF QUEUING NETWORK MODELS Peter J. Denning, Jeffrey P. Buzen, The Operational Analysis of Queueing Network.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science An Analytical Model for Multi-tier Internet Services and its Applications Bhuvan.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Identifying Application Impacts on Network Design Designing and Supporting Computer.
E-Commerce Architectures and Technologies Rob Oshana Southern Methodist University.
1 Performance Evaluation of Computer Systems and Networks Introduction, Outlines, Class Policy Instructor: A. Ghasemi Many thanks to Dr. Behzad Akbari.
Software Performance Testing Based on Workload Characterization Elaine Weyuker Alberto Avritzer Joe Kondek Danielle Liu AT&T Labs.
E-Commerce Architectures and Technologies Rob Oshana Southern Methodist University.
E-Commerce Architectures and Technologies Rob Oshana Southern Methodist University.
Computer Networks Performance Metrics. Performance Metrics Outline Generic Performance Metrics Network performance Measures Components of Hop and End-to-End.
The Performance mission is to ensure a positive user experience by monitoring and eliminating system bottlenecks, establish response time baselines for.
1 PREFETCHING INLINES TO IMPROVE WEB SERVER LATENCY Ronald Dodge US Army Daniel Menascé, Ph. D. George Mason University
1 Chapters 8 Overview of Queuing Analysis. Chapter 8 Overview of Queuing Analysis 2 Projected vs. Actual Response Time.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web (Book, chap. 6)
Online Music Store. MSE Project Presentation III
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VIII Concluding Remarks.
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Injecting Realistic Burstiness to.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web.
Chapter 3 System Performance and Models Introduction A system is the part of the real world under study. Composed of a set of entities interacting.
1 Part VII Component-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.
Capacity Planning Plans Capacity Planning Operational Laws
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VI System-level Performance Models for the Web (Book, Chapter 8)
Internet Applications: Performance Metrics and performance-related concepts E0397 – Lecture 2 10/8/2010.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part II System Performance Modeling: basic concepts, operational analysis (book, chap. 3)
1 Web Performance Modeling Issues Daniel A. Menascé Department of Computer Science George Mason University 
-1- System Software Research Lab. Resource Management Policies for E-commerce Servers Daniel A. Menasce, Virgilio A. F. Almeida 2 nd Workshop on Internet.
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VIII Web Performance Modeling (Book, Chapter 10)
Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VI System-level Performance Models for the Web.
OPERATING SYSTEMS CS 3502 Fall 2017
Application Software Platform Services Graphics Data Interchange
CIIT-Human Computer Interaction-CSC456-Fall-2015-Mr
Performance Engineering
Chapter 2 Scheduling.
Computer Systems Performance Evaluation
Admission Control and Request Scheduling in E-Commerce Web Sites
Queuing Analysis.
Computer Systems Performance Evaluation
Presentation transcript:

1 Challenges in Scaling E-Business Sites  Menascé and Almeida. All Rights Reserved. Daniel A. Menascé Department of Computer Science George Mason University

2 Impacts of Bad Performance  Bad performance: response time above 8 seconds (eight-second rule).  $43.5 billion lost each year in e-commerce due to bad performance (Zona Research, April 1999).  Holiday Season of 1998: over 1/3 of customers gave up due to slowness, 44% turned to conventional stores, 14% moved to another site.  Menascé and Almeida. All Rights Reserved.

3 Performance Problems for E-commerce tend to get worse!  Proliferation of mobile devices  Easier to use interfaces (VUI, wireless and Web services on cars and airplanes, novel browsing paradigms)  Increasing load placed by agents  Impacts of authentication protocols (e.g., TLS) on e-commerce site performance.  Menascé and Almeida. All Rights Reserved.

4 Typical Questions  Is the online trading site prepared to accommodate a 75% increase of trades/day?  Do I have enough servers to handle a peak demand 10x the average?  How fast can the site architecture be scaled up? What components should be upgraded? Database servers? Web servers? Application servers? Bandwidth?  How can I design a site that will meet its business goals?  Menascé and Almeida. All Rights Reserved.

5 Outline  Scalability  A Reference Model for E-Business  Workload Characterization  Customer behavior model graphs  Client/Server Interaction Diagrams  Concluding Remarks  Menascé and Almeida. All Rights Reserved.

6 A Reference Model for Electronic Business Business Model Functional Model Customer Model Resource Model Characteristics of the Business Navigational Structure of the Site Patterns of Customer Behavior Site Architecture and Service Demands External Metrics Internal Metrics Business View Technological View Descriptors  Menascé and Almeida. All Rights Reserved.

7 External Metrics and Descriptors  Cover the Nature of Business  Metrics:  Revenue throughput (dollars/sec)  Potential lost revenue/sec  Click-to-look ratio  Look-to-basket ratio  Basket-to-buy ratio  Click-to-buy ratio  Availability  Download times  Page views/day  Unique visitors/day  Menascé and Almeida. All Rights Reserved.

8 External Metrics and Descriptors  Descriptors:  Number of registered customers  Number of potential customers  Maximum number of simultaneous customers in the store  Number of items in the catalog  Estimated operational cost  Menascé and Almeida. All Rights Reserved.

9 Workload Characterization 1.Determine the e-business functions made available by the site. Associate URLs or URL patterns to each e-business function. 2.Analyze the site’s HTTP logs to determine customer sessions. 3.Cluster customer sessions into groups of “similar” customer sessions, i.e., sessions that exhibit similar behavior.  Menascé and Almeida. All Rights Reserved.

10 home browse search addselect pay entry 1.0 Customer Behavior Model Graph  Menascé and Almeida. All Rights Reserved.

11 Metrics Derived from the CBMG  Average Number of Visits Per State  E.g., average number of searches per visit to the site,  Average Buy (or open account) to Visit Ratio – also called conversion ratio.  Average Session Length Per Visit  Menascé and Almeida. All Rights Reserved.

12 Metrics Derived from the CBMG  Menascé and Almeida. All Rights Reserved.

13 Workload Characterization Methodology Merge and Filter Get Sessions Get CBMGs (clustering algorithm) HTTP Logs Session Log CBMGs Request Log  Menascé and Almeida. All Rights Reserved.

14 Result of Clustering Analysis Cluster 1: majority of sessions, short sessions, and highest BV ratio. Cluster 6: small fraction of sessions, large sessions, smallest BV ratio.  Menascé and Almeida. All Rights Reserved.

15 Buy to Visit Ratio vs. Session Length  Menascé and Almeida. All Rights Reserved.

16 Architecture of E-Commerce Sites  Menascé and Almeida. All Rights Reserved.

17 Remarks  For e-commerce, need to characterize the workload at higher levels of abstraction: sessions vs. requests.  Workload characterization has to be customer behavior-oriented as opposed to request-oriented.  Customer Behavior Model Graphs capture customer behavior and can be mapped to resource demands.  Menascé and Almeida. All Rights Reserved.

18  Capacity Planning and Performance Management:  key to EC: the competitors are just a click away!  require predictive models: avoid ROTs!  throwing more plumbing is not the solution.  models have to integrate the business, customer, and resource aspects of the problem. Remarks (cont’d)  Menascé and Almeida. All Rights Reserved.

19 Performance Models of E-Commerce Sites Daniel A. Menascé Dept. of Computer Science George Mason University

20 Component-level models server incoming link outgoing link server

21 Component-level models server cpu disk 1 disk 2 incoming link outgoing link server

22 Component-level models server cpu disk 1 disk 2 incoming link outgoing link server

23 Component-level Models  Each component is represented by a resource (e.g. CPU, disk, communication link) and a queue of requests waiting for the resource. resource queue

24 Basic Concepts  Utilization of a resource:  Fraction of time the resource is busy serving requests during a measurement interval.  Example: the CPU was busy during 40 minutes during a measurement period of one hour. It’s utilization is then:  40/60 = = 66.7%  Utilizations are measured as dimensionless numbers

25 Basic Concepts (cont’d)  A request may be using a resource (e.g., CPU, disk, etc) or waiting to use it.  The time spent using the resource does not depend on the number of resources waiting to use the resource.  The time spent waiting to use the resource depends on the load, i.e., on the number of requests in the queue waiting to use the resource.

26 Performance Model Parameters  Workload Intensity  HTPP Requests/sec  Transactions/sec  E-business functions/sec  Service demands for each resource and each type of request.

27 Service Demands cpu disk 1 disk 2 incoming link outgoing link server sec sec sec 0.08 sec 0.12 sec Service demands do not include any queuing time! It is just service time.

28 Computing Waiting Times cpu disk 1 disk 2 incoming link outgoing link server sec sec sec 0.08 sec 0.12 sec Waiting times depend on the load (arrival rate of requests) and on the service demands.

29 DiDi RiRi Service demand at resource i Utilization of resource i (U i ) Computing Residence Times Residence time at resource i

30 Residence Time at Incoming Link cpu disk 1 disk 2 incoming link outgoing link Web server sec sec sec 0.08 sec 0.12 sec  req/sec

31 Residence Time at Outgoing Link cpu disk 1 disk 2 incoming link outgoing link Web server sec sec sec 0.08 sec 0.12 sec  req/sec

32 Residence Time at the CPU cpu disk 1 disk 2 incoming link outgoing link Web server sec sec sec 0.08 sec 0.12 sec  req/sec

33 Residence Time at Disk 1 cpu disk 1 disk 2 incoming link outgoing link Web server sec sec sec 0.08 sec 0.12 sec  req/sec

34 Residence Time at Disk 2 cpu disk 1 disk 2 incoming link outgoing link Web server sec sec sec 0.08 sec 0.12 sec  req/sec

35 Summary of Results Average Response TimeSum of service demands

36 Response vs. Arrival Rate

37 Example: online trading site  Open QN for the online trading site: