Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ 97 2000.12.14 Koo-Min Ahn.

Slides:

Advertisements

Similar presentations

The TIME-WAIT state in TCP and its Effect on Busy Servers Theodore Faber University of Southern California Xindian Long.

Advertisements

Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.

Transport Layer – TCP (Part2) Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing, UNF.

Traffic Shaping Why traffic shaping? Isochronous shaping

Delay and Throughput in Random Access Wireless Mesh Networks Nabhendra Bisnik, Alhussein Abouzeid ECSE Department Rensselaer Polytechnic Institute (RPI)

Receiver-driven Layered Multicast S. McCanne, V. Jacobsen and M. Vetterli University of Calif, Berkeley and Lawrence Berkeley National Laboratory SIGCOMM.

Chapter 10 Congestion Control in Data Networks1 Congestion Control in Data Networks and Internets COMP5416 Chapter 10.

1 SEDA: An Architecture for Well- Conditioned, Scalable Internet Services Matt Welsh, David Culler, and Eric Brewer Computer Science Division University.

12 Nov 07 CS DHTTP: An Efficient and Cache- Friendly Transfer Protocol for the Web By Michael Rabinovich and Hua Wang Presented by Jerry Usery.

What’s the Problem Web Server 1 Web Server N Web system played an essential role in Proving and Retrieve information. Cause Overloaded Status and Longer.

The War Between Mice and Elephants LIANG GUO, IBRAHIM MATTA Computer Science Department Boston University ICNP (International Conference on Network Protocols)

Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part IV Capacity Planning Methodology.

Measurements of Congestion Responsiveness of Windows Streaming Media (WSM) Presented By:- Ashish Gupta.

1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.

1 Part IV Capacity Planning Methodology © 1998 Menascé & Almeida. All Rights Reserved.

The War Between Mice and Elephants Presented By Eric Wang Liang Guo and Ibrahim Matta Boston University ICNP

End-to-End Analysis of Distributed Video-on-Demand Systems Padmavathi Mundur, Robert Simon, and Arun K. Sood IEEE Transactions on Multimedia, February.

Buffer Sizing for Congested Internet Links Chi Yin Cheung Cs 395 Advanced Networking.

Reduced TCP Window Size for Legacy LAN QoS II Niko Färber Sept. 20, 2000.

ISCSI Performance in Integrated LAN/SAN Environment Li Yin U.C. Berkeley.

Behaviour and Performance of Interactive Multi-player Game Servers Ahmed Abdelkhalek, Angelos Bilas, and Andreas Moshovos.

Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?

OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.

A Distributed Proxy Server for Wireless Mobile Web Service Kisup Kim, Hyukjoon Lee, and Kwangsue Chung Information Network 2001, 15 th Conference.

Architectural Impact of SSL Processing Jingnan Yao.

Performance and Robustness Testing of Explicit-Rate ABR Flow Control Schemes Milan Zoranovic Carey Williamson October 26, 1999.

1 Internet Management and Security We will look at management and security of networks and systems. Systems: The end nodes of the Internet Network: The.

Wide Web Load Balancing Algorithm Design Yingfang Zhang.

Capacity planning for web sites. Promoting a web site Thoughts on increasing web site traffic but… Two possible scenarios…

Understanding Factors That Influence Performance of a Web Server Presentation CS535 Project By Thiru.

Using Standard Industry Benchmarks Chapter 7 CSE807.

Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.

Performance of Web Applications Introduction One of the success-critical quality characteristics of Web applications is system performance. What.

SEDA: An Architecture for Well-Conditioned, Scalable Internet Services

1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.

CORE KAIST EECS Computer Engineering Research Lab A General Purpose Proxy Filtering Mechanism Applied to the Mobile Environment Bruce Zenel Jupyung Lee.

Modeling and Performance Evaluation of Network and Computer Systems Introduction (Chapters 1 and 2) 10/4/2015H.Malekinezhad1.

Web Server Support for Tired Services Telecommunication Management Lab M.G. Choi.

Jozef Goetz, Application Layer PART VI Jozef Goetz, Position of application layer The application layer enables the user, whether human.

An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.

Software Performance Testing Based on Workload Characterization Elaine Weyuker Alberto Avritzer Joe Kondek Danielle Liu AT&T Labs.

Section 5: The Transport Layer. 5.2 CS Computer Networks John Mc Donald, Dept. of Computer Science, NUI Maynooth. Introduction In the previous section.

ACN: RED paper1 Random Early Detection Gateways for Congestion Avoidance Sally Floyd and Van Jacobson, IEEE Transactions on Networking, Vol.1, No. 4, (Aug.

1 Lecture 14 High-speed TCP connections Wraparound Keeping the pipeline full Estimating RTT Fairness of TCP congestion control Internet resource allocation.

Performance of HTTP Application in Mobile Ad Hoc Networks Asifuddin Mohammad.

A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.

Scalable Kernel Performance for Internet Servers under Realistic Loads. Gaurav Banga, etc... Western Research Lab : Research Report 1998/06 (Proceedings.

Advanced Network Architecture Research Group 2001/11/74 th Asia-Pacific Symposium on Information and Telecommunication Technologies Design and Implementation.

Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web (Book, chap. 6)

Authors: Haowei Yuan and Patrick Crowley Publisher: 2013 Proceedings IEEE INFOCOM Presenter: Chia-Yi Chu Date: 2013/08/14 1.

Providing Differentiated Levels of Service in Web Content Hosting Jussara Almeida, etc... First Workshop on Internet Server Performance, 1998 Computer.

1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,

Lecture (Mar 23, 2000) H/W Assignment 3 posted on Web –Due Tuesday March 28, 2000 Review of Data packets LANS WANS.

Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part V Workload Characterization for the Web.

1 Part VII Component-level Performance Models for the Web © 1998 Menascé & Almeida. All Rights Reserved.

Deadline-based Resource Management for Information- Centric Networks Somaya Arianfar, Pasi Sarolahti, Jörg Ott Aalto University, Department of Communications.

CSE Computer Networks Prof. Aaron Striegel Department of Computer Science & Engineering University of Notre Dame Lecture 19 – March 23, 2010.

Hyun-Jin Choi, CORE Lab. E.E. 1 httperf – A Tool for Measuring Web Server Performance Dec Choi, Hyun-Jin David Mosberger and Tai Jin HP Research.

Internet Applications: Performance Metrics and performance-related concepts E0397 – Lecture 2 10/8/2010.

Development of a QoE Model Himadeepa Karlapudi 03/07/03.

1 Transport Layer: Basics Outline Intro to transport UDP Congestion control basics.

Providing Differentiated Levels of Service in Web Content Hosting J ussara Almeida, Mihaela Dabu, Anand Manikutty and Pei Cao First Workshop on Internet.

1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.

Chapter 10 Congestion Control in Data Networks and Internets 1 Chapter 10 Congestion Control in Data Networks and Internets.

Ó 1998 Menascé & Almeida. All Rights Reserved.1 Part VIII Web Performance Modeling (Book, Chapter 10)

1 Design and Implementation of a High-Performance Distributed Web Crawler Polytechnic University Vladislav Shkapenyuk, Torsten Suel 06/13/2006 석사 2 학기.

Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting

Capacity Analysis, cont. Realistic Server Performance

CSE 461 HTTP and the Web.

Admission Control and Request Scheduling in E-Commerce Web Sites

Presentation transcript:

Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn

Contents Introduction Problem in generating synthetic HTTP request A scalable method for generating HTTP request (main idea) Quantitative evaluation Conclusion

Introduction Improving web performance –Web caching, HTTP protocol enhancement, better HTTP servers and proxies, server OS and s/w Measuring web s/w performance –Characterizing web server workload : file type, transfer size, other related statistics Recently, web server evaluation –Generation of synthetic HTTP client traffic –Problem : lead to deviation of benchmarking conditions from reality and fail to predict the performance of a given web server

Problems in generating synthetic HTTP request (1) inability to generate excess load –A huge number of clients –Think time dist ’ n with large mean and variance –Think time is not independent –Web content causes high correlation of requests Peak request rates can exceed the capa of server HTTP requests arriving at a server is bursty

Problems in generating synthetic HTTP request (2) Additional problem : WAN based Web –Simple method does not model high and variable WAN delays –Packet losses due to congestion are absent in LAN based method –With an increasing number of clients per client machine, client CPU and memory contention are likely to arise  the bottleneck in a web transaction is no longer the server but the client

A scalable method for generating HTTP request (to be continue) Basic architecture –If WAN effects are to be evaluated, the client machine should be connected to the server through a router

S-Client A S-client consists of a pair processes connected by a Unix domain socketpair Connection establishment process is responsible for generating HTTP request at a certain rate and a certain request dist ’ n After a connection is established, the connection establishment process sends a HTTP request to the server, then it passes on the connection to the connection handling process, which handles the HTTP response

Connection establishment process of S- Client (1) The process open D connections to the server using D sockets in non-blocking mode D connection requests are spaces out over T ms After the process executes a non-blocking connect() to initiate a connect The process checks if for any of its D active sockets, the connection is completed or if T ms have elapsed since a connect() was performed on this socket

Connection establishment process of S- Client (2) if for any of its D active sockets, the connection is completed : the process sends a HTTP request on the newly established connection, handoff off this connection to the other process of the S- Client through the Unix domain socketpair, closes the socket, and then initiates another connection to the server if T ms have elapsed since a connect() was performed on this socket : the process simply closes the socket and initiates another connection to the server

Connection handling process of S-Client The process waits for (1) data to arrive on any of the active connections or (2) for a new connection to arrive on the Unix domain socket connecting it to the other process In case of new data on an active socket, it read this data If this completes the server ’ s response, it closes the socket A new connection arriving at the Unix domain socket is simply added to the set of active connect

Two key idea of S-Client 1)Shorten TCP ’ s connection establishment timeout –Using Non-blocking connect and closing the socket if no connection was establishment after T second –The purpose is to allow the generation of request rates beyond the capacity of the server with a reasonable number of client sockets 2)Maintain a constant number of unconnected sockets that are trying to establish new connections –To ensure that generated request rate is independent of the rate at which server handles request –Once request rate matches the capa of server, the additional queuing delay in the server ’ s accept queue no longer reduces the request rate of simulated client

Think time distribution This scheme uses a constant think time chosen to achieve a certain constant request rate It is possible to generate more complex request processes by adding appropriate think periods between the point where a S- Client detects a connection was established and when it next attempt to initiate another connection

Experiment setup Client machine –4 Sun SPARC 20 model 61 workstation (60Mhz SuperSPARC+, 36KB L1, 1MB L2, SPEXint ) –32 MB memory and SunOS 4.1.3_U1 Server Machine –Dual processor SPARC 20 model 61 machines –64 MB memory and Solaris Mbps ATM Local area network NCSA httpd server software revision Not consider WAN delay

Simulation result : request generation rate

Simulation result : overload behavior of a web server

Simulation result : Throughput under bursty condition

Simulation result (1) Request generation rate –File size is 1294 byte, connection establishment timeout period T is 500ms –As we add more clients, the queue length at the accept queue of the server ’ s listen socket increases and the request rate remains nearly constant at the capacity of the server –S-clients enable the generation of request loads that greatly exceed the capacity of the server

Simulation result (2) Overload behavior of a web server –As before, the server saturates at about 130 transactions per second –This fall in throughput with increasing request rate is due to the CPU resources spent on protocol processing for incoming requests –This large drop in throughput of an overload server highlights the importance of evaluating the overload behavior of a web server

Simulation result (3) Throughput under bursty conditions –Bursty traffic parameter is (1)the ratio between the maximum request rate and the average request rate, (2) the fraction of time for which the request rate exceed the average rate –(6,5) refers to the case where for 5% of the time the request rate is 6 times the average request rate –High burstiness both in parameter (1) and (2) degrades the throughput substantially

Conclusion No benchmark we know of attempts to accurately model the effects of request overloads on server performance (Webstone, SPECWeb96) This method to evaluate a typical web server indicates that measuring web server performance under overload and bursty traffic conditions gives new and importance insights in web server performance