Measuring and Mitigating Web Performance Bottlenecks in Broadband Access Networks Srikanth Sundaresan, Nick Feamster (Georgia Tech) Renata Teixeira (Inria)

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

Let's say we want to access domain - reliablescribe.com First we need to buy a computer We need to subscribe to an Internet Service Provider (ISP) The.
Ningning HuCarnegie Mellon University1 Optimizing Network Performance In Replicated Hosting Peter Steenkiste (CMU) with Ningning Hu (CMU), Oliver Spatscheck.
Networking Problems in Cloud Computing Projects. 2 Kickass: Implementation PROJECT 1.
THE CASE FOR PREFETCHING AND PREVALIDATING TLS SERVER CERTIFICATES Emily Stark, Lin-Shung Huang, Dinesh Israni, Collin Jackson, Dan Boneh Presented by:
University of Michigan Electrical Engineering and Computer Science Anatomizing Application Performance Differences on Smartphones Junxian Huang, Qiang.
Broadband Internet Performance: A View from the Gateway Srikanth Sundaresan, Walter de Donato, Nick Feamster, Renata Teixeira, Sam Crawford, Antonio Pescapè.
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
Dynamic Adaptive Streaming over HTTP2.0. What’s in store ▪ All about – MPEG DASH, pipelining, persistent connections and caching ▪ Google SPDY - Past,
1 “Tracking the Evolution of Web Traffic: Felix Hernandez-Campos, Kevin Jeffay, F. Donelson Smith IEEE/ACM International Symposium on Modeling,
Peeking Behind the NAT An Empirical Study of Home Networks Sarthak Grover, Mi Seon Park, Srikanth Sundaresan, Sam Burnett, Hyojoon Kim, Bharath Ravi, Nick.
Broadband Internet Performance: A View from the Gateway Srikanth Sundaresan, Walter de Donato, Nick Feamster, Renata Teixeira, Sam Crawford, Antonio Pescapè.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
How’s My Network (HMN)? A Java approach to Home Network Measurement Alan Ritacco, Craig Wills, and Mark Claypool Computer Science Department Worcester.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
1 Prefetching the Means for Document Transfer: A New Approach for Reducing Web Latency 1. Introduction 2. Data Analysis 3. Pre-transfer Solutions 4. Performance.
Web Proxy Caching: The Devil is in the Details Ramon Cacere Fred Douglis Anja Feldmann Gideon Glass Michael Rabinovich AT&T Labs-Research Florham Park,
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
CSE 190: Internet E-Commerce Lecture 16: Performance.
Chapter 2: Application Layer
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
1 Web Proxies Dr. Rocky K. C. Chang 6 November 2005.
Tracking the Evolution of Web Traffic: Felix Hernandez-Campos, Kevin Jeffay F. Donelson Smith IEEE/ACM International Symposium on Modeling, Analysis.
HTTP Performance Objective: In this problem, we consider the performance of HTTP, comparing non-persistent HTTP with persistent HTTP. Suppose the page.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
APPLICATION AND NETWORK PERFORMANCE - TCP TUNING Alan Bodnar.
CS332 Ch. 28 Spring 2014 Victor Norman. Access delay vs. Queuing Delay Q: What is the difference between access delay and queuing delay? A: I think the.
Network. Add speed of light and speed in fiber Inspired by High Performance Browser Networking (Ilya Grigorik) Even Faster Web Sites (Steve Souders)
Broadband Internet Performance: A View from the Gateway Document Number: IEEE Smet”, per Date Submitted:
Page 18/25/2015 CSE 40373/60373: Multimedia Systems CSE 4/60373: Multimedia Systems  Outline for today  32: Y.-F. Chen, Y. Huang, R. Jana, H. Jiang,
Active Network Applications Tom Anderson University of Washington.
 Zhichun Li  The Robust and Secure Systems group at NEC Research Labs  Northwestern University  Tsinghua University 2.
DNS and HTTP. Finally, the application layer! We have learned about: – Signals being sent on wires – Frames carried over dumb local networks – Packets.
SAINT ‘01 Proactive DNS Caching: Addressing a Performance Bottleneck Edith Cohen AT&T Labs-Research Haim Kaplan Tel-Aviv University.
Paper Presentation – CAP Page 2 Outline Review - DNS Proposed Solution Simulation Results / Evaluation Discussion.
Infrastructure for Better Quality Internet Access & Web Publishing without Increasing Bandwidth Prof. Chi Chi Hung School of Computing, National University.
CDN Brokering* Presented By Nick Arnold Authors Alexandros Biliris, et. Al.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Software Performance Testing Based on Workload Characterization Elaine Weyuker Alberto Avritzer Joe Kondek Danielle Liu AT&T Labs.
Sharing Information across Congestion Windows CSE222A Project Presentation March 15, 2005 Apurva Sharma.
ECO-DNS: Expected Consistency Optimization for DNS Chen Stephanos Matsumoto Adrian Perrig © 2013 Stephanos Matsumoto1.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 8 Omar Meqdadi Department of Computer Science and Software Engineering University of.
“Load Testing Early and Often” By Donald Doane Presentation to the Rockville MDCFUG.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
Confidential Challenges of Internet Streaming Sloan Media Tech February 8, 2001.
Why Assumptions about Cloud Performance Can Be Dangerous to Your Business Ron Wilson, Director Cloud Strategy
Emir Halepovic, Jeffrey Pang, Oliver Spatscheck AT&T Labs - Research
1 NetProfiler: Profiling Networks From the Edge Venkat Padmanabhan Microsoft Research June 2005 With Sharad Agarwal (MSR), Jitu Padhye (MSR), Dilip Joseph.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Computer Communication & Networks Lecture # 10. Bandwidth ϞBand Width means the width of the band (channel) ϞIt is the maximum capacity of a link ϞThat.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
The Internet. Important Terms Network Network Internet Internet WWW (World Wide Web) WWW (World Wide Web) Web page Web page Web site Web site Browser.
1 COMP 431 Internet Services & Protocols HTTP Persistence & Web Caching Jasleen Kaur February 11, 2016.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
Web Proxy Caching: The Devil is in the Details Ramon Caceres, Fred Douglis, Anja Feldmann Young-Ho Suh Network Computing Lab. KAIST Proceedings of the.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
An Architecture for Internet Data Transfer Niraj Tolia, Michael Kaminsky, David G. Andersen, and Swapnil Patil NSDI ’ Eunsang Cho.
Whole Page Performance Leeann Bent and Geoffrey M. Voelker University of California, San Diego.
BISmark: the Broadband Internet Service benchmark
Peering at the Internet’s Frontier: A First Look at ISP Interconnectivity in Africa Arpit Gupta, Matt Calder, Nick Feamster, Marshini Chetty, Enrico.
Residential Internet Performance: A view from the Gateway
Evaluating Proxy Caching Algorithms in Mobile Environments
Inferring Queue Sizes in Access Networks by Active Measurement
Austin Hounsel* Kevin Borgolte* Paul Schmitt*
Presentation transcript:

Measuring and Mitigating Web Performance Bottlenecks in Broadband Access Networks Srikanth Sundaresan, Nick Feamster (Georgia Tech) Renata Teixeira (Inria) Nazanin Magharei (Cisco) 1

Every Millisecond Counts 500ms delay causes 1.2% decrease in Bing revenue [Souders 2009] 400ms delay causes 0.74% decrease in Google searches [Brutlag 2009] 2 100ms delay causes 1% decrease in Amazon revenue [Linden 2013] Many Web services companies spend considerable effort reducing Web response time.

Many Performance Optimizations What about the last-mile? Home Network ISP Internet Server-Side Persistent connections TCP ICW Content optimization ISP Edge CDNs DNS caching Client-Side Browser caching SPDY/QUIC 3

The Last-Mile Affects Performance Last-mile latency can be significant – More than 50% of AT&T DSL users have last-mile latency greater than 20 ms [Sundaresan 2011] Optimizations are affected by last-mile performance The effects of the last-mile on Web performance has not been specifically studied This paper: Measure and mitigate access link bottlenecks in Web performance 4

Two Contributions Measure last-mile Web bottlenecks – homes, from access point – Latency is bottleneck beyond 16 Mbps Mitigate Web performance bottlenecks – Popularity-based pre-fetching in the home – DNS pre-fetching and TCP connection caching in the home can improve page load time by up to 35% 5

Measuring Last-Mile Effects on Web Performance Challenges: Instrumenting end-hosts – No ground truth: Every browser is different – Confounding factors affect measurements Solution: Measure from router – Piggyback on existing deployments (FCC, BISmark) – Consistent measurements – similar hardware – Continuous measurements – better characterization 6

Mirage: Deployment and Data Emulates basic browser function – Estimates page load time – Breaks down fetch latencies Deployed by FCC/Samknows – 5,000+ homes in US: from 11 ISPs – Profiles 9 popular sites – Measurements are publicly available [FCC 2012] 7

Time to first byte Mirage Identifies Latency Bottlenecks 8 Time Objects Does not: – Resolve or fetch active objects – Establish ground truth (there isn’t any) Lookup Connect Server processing Object fetch Page load time

Popular Sites Have High Page Load Time 9 Median load time for Yahoo exceeds 2 seconds Page Load Time (ms)

Higher Throughput Doesn’t Always Help 10 Page load time stops improving above 16 Mbps. Page Load Time (ms) Throughput (Mbps)

Mirage Identifies Latency Bottlenecks 11 Time Objects Lookup Connect Server processing Object fetch Page load time Latency overhead can dominate fetch time, particularly for small Web objects

Common Last-Mile Latencies Result in High Page Load Times 12 Many DSL users may experience high page load times. Throughput (Mbps) Page Load Time (ms) Last Mile Latency (ms)

Two Contributions Measure last-mile Web bottlenecks – homes, from access point – Latency is bottleneck beyond 16 Mbps Mitigate Web performance bottlenecks – Popularity-based pre-fetching in the home – DNS pre-fetching and TCP connection caching in the home can improve page load time by up to 35% 13

Many Optimizations Don’t Help in Last Mile Many (CDNs, server-side) stop at the ISP edge Client-side optimizations are application-specific 14 Home Network ISP Internet Server-Side Persistent connections TCP ICW Content optimization ISP Edge CDNs DNS caching Client-Side Browser caching SPDY/QUIC

Solution: Pre-fetching in the Home 15 Home Network Internet Popular Domains List Refresh DNS record Maintain TCP connection Idea: Refresh DNS records and TCP connections to popular domains at the router

Popularity-based Pre-fetching: Evaluation 16 What is the improvement in the best case? – Mirage on BISmark platform (65 nodes worldwide) How do the benefits complement browser opimizations? – Phantomjs in controlled setting How can we make it practical? – Evaluate caching using real user traces from 12 homes

DNS pre-fetching improves page load time by up to 10% Effect of DNS Pre-fetching 17 Internet DNS Pre-fetching 1.Clear caches 2.Fetch page 3.Fetch page again DNS Resolver Fetch web page using Mirage

DNS pre-fetching can save up to 50 ms 18 Max DNS lookup time is reduced by ms!

TCP connection caching improves page load time by up to 35% Effect of TCP Connection Pre-fetching 19 Internet TCP Connection caching 1.Clear caches 2.Fetch page through HTTP proxy 3.Clear content cache 4.Fetch page again DNS Resolver HTTP Proxy Fetch web page using Mirage

TCP Connection Caching Reduces Page Load Times 20 TCP connection caching improves page load times by ms

Popularity-Based Prefetching Reduces Page Load by up to 50% 21

Popularity-Based Prefetching Reduces Page Load by up to 50% 22

Popularity-based Pre-fetching: Evaluation 23 What is the improvement in the best case? – Mirage, using BISmark (65 nodes worldwide) How do the benefits complement browser optimizations? – Phantomjs in controlled setting How can we make it practical? – Evaluate caching using user traces from 12 homes

Reducing the overhead of pre-fetching Solution: Pre-fetch only popular sites with timeout Analysis of passive usage traces in 12 homes – Simulation based on traces – Test list size and timeout intervals Cache hits improvement (with list size of 20 and timeout of 2 minutes): – DNS hit rate improves from 11-50% to 19-90% – TCP hit rate improves from 1-8% to 6-21% 24

Improvements in Browser Improvements with Phantomjs – up to 7% improvement with DNS pre- fetching and 20% with TCP caching; up to 60% with Home Proxy 25

Conclusion Page load times are high for popular sites – Latency is a bottleneck when downstream throughput is > 16 Mbps Popularity-based prefetching improves performance by up to 35%. – Complementary to existing optimizations Data and code are publicly available at