CoDeeN,Large Files, & CoDeploy KyoungSoo Park, Vivek Pai, Larry Peterson Princeton University.

Slides:



Advertisements
Similar presentations
Inktomi Confidential and Proprietary The Inktomi Climate Lab: An Integrated Environment for Analyzing and Simulating Customer Network Traffic Stephane.
Advertisements

Cache Storage For the Next Billion Students: Anirudh Badam, Sunghwan Ihm Research Scientist: KyoungSoo Park Presenter: Vivek Pai Collaborator: Larry Peterson.
Fun Fun Project One1 Building Your Very Own Web Server.
Scheduling in Web Server Clusters CS 260 LECTURE 3 From: IBM Technical Report.
By Vikas Debnath KV IT-Solutions Pvt. Ltd.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Spring 2003CS 4611 Content Distribution Networks Outline Implementation Techniques Hashing Schemes Redirection Strategies.
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
CCC/WNUG Exchange Update May 5, 2005 Nate Wilken Web and Messaging Applications Information Technology Arizona State University.
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
A Distributed Proxy Server for Wireless Mobile Web Service Kisup Kim, Hyukjoon Lee, and Kwangsue Chung Information Network 2001, 15 th Conference.
Anycast Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
Dynamic parallel access to replicated content in the Internet Pablo Rodriguez and Ernst W. Biersack IEEE/ACM Transactions on Networking, August 2002.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
AKAMAI Content Delivery Services AKAMAI Content Delivery Services CIS726 : PRESENTATION Avinash Ponugoti Avinash Ponugoti Nagarjuna Nagulapati Sathish.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Towards Understanding Modern Web Traffic
Copyright © 2002 Wensong Zhang. Page 1 Free Software Symposium 2002 Linux Virtual Server: Linux Server Clusters for Scalable Network Services Wensong Zhang.
Proxy servers By Akshit Y10. What is a proxy server O A proxy server is a computer that offers a computer network service to allow clients to make indirect.
1 Advanced Application and Web Filtering. 2 Common security attacks Finding a way into the network Exploiting software bugs, buffer overflows Denial of.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
EE616 Technical Project Video Hosting Architecture By Phillip Sutton.
Performance of Web Applications Introduction One of the success-critical quality characteristics of Web applications is system performance. What.
IMPLEMENTING F-SECURE POLICY MANAGER. Page 2 Agenda Main topics Pre-deployment phase Is the implementation possible? Implementation scenarios and examples.
NETWORK SERVERS Oliver Topping (with a little help from my Mum)
Healing the Web: An Overview of CoDeeN & Related Projects Vivek Pai, Larry Peterson + many others Princeton University.
The Dark Side of the Web: An Open Proxy’s View Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University.
Securing Web Service by Automatic Robot Detection KyoungSoo Park, Vivek S. Pai Princeton University Kang-Won Lee, Seraphin Calo IBM T.J. Watson Research.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Application of Content Computing in Honeyfarm Introduction Overview of CDN (content delivery network) Overview of honeypot and honeyfarm New redirection.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
TOWARDS UNDERSTANDING DEVELOPING WORLD TRAFFIC Sunghwan Ihm (Princeton) KyoungSoo Park (KAIST) Vivek S. Pai (Princeton)
Content distribution networks (CDNs) r The content providers are the CDN customers. Content replication r CDN company installs hundreds of CDN servers.
® IBM Software Group © 2007 IBM Corporation Best Practices for Session Management
Authors: Haowei Yuan and Patrick Crowley Publisher: 2013 Proceedings IEEE INFOCOM Presenter: Chia-Yi Chu Date: 2013/08/14 1.
OSDI 2002 Boston, MA 1 The Effectiveness of Request Redirection on CDN Robustness Limin Wang Vivek Pai and Larry Peterson Princeton University.
Empirical Quantification of Opportunities for Content Adaptation in Web Servers Michael Gopshtein and Dror Feitelson School of Engineering and Computer.
A P2P-Based Architecture for Secure Software Delivery Using Volunteer Assistance Purvi Shah, Jehan-François Pâris, Jeffrey Morgan and John Schettino IEEE.
New HTTP Protocols HTTP/0.9 -Earliest standard simple GET/PUT requests (no headers, constraints, resolution) HTTP/1.0 - Current standard Request For Comment.
DotSlash – or how to deal with 15 minutes of fame Weibin Zhao Henning Schulzrinne Columbia University CATT/WICAT Annual Research Review November 14, 2003.
1 MSRBot Web Crawler Dennis Fetterly Microsoft Research Silicon Valley Lab © Microsoft Corporation.
Content Distribution Network, Proxy CDN: Distributed Environment
The CoDeeN Content Distribution Network Vivek S. Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, Larry Peterson Princeton University August 12, 2003.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
1 The World Wide Web Architectural Overview Static Web Documents Dynamic Web Documents HTTP – The HyperText Transfer Protocol Performance Enhancements.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
MICROSOFT AJAX CDN (CONTENT DELIVERY NETWORK) Make Your ASP.NET site faster to retrieve.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
The Dark Side of the Web: An Open Proxy’s View Vivek Pai, Limin Wang, KyoungSoo Park, Ruoming Pang, and Larry Peterson Princeton University.
Presented by Michael Rainey South Mississippi Linux Users Group
Vivek Pai, Larry Peterson, & the CoDeeN group Princeton University
Building Your Very Own Web Server
Tiny http client and server
Clustered Web Server Model
Bentley Systems, Incorporated
Scale and Performance in the CoBlitz Large-File Distribution Service
Large-scale file systems and Map-Reduce
TYPES OF SERVER. TYPES OF SERVER What is a server.
Utilization of Azure CDN for the large file distribution
Co* Projects : CoDNS, CoDeploy, CoMon
Distributed Content in the Network: A Backbone View
Pub/sub-based Web Applications
Content Distribution Networks
AKAMAI Content Delivery Services
Presentation transcript:

CoDeeN,Large Files, & CoDeploy KyoungSoo Park, Vivek Pai, Larry Peterson Princeton University

What Is CoDeeN? Content Distribution Networks Web pages load faster if You’re contacting a nearby server That server isn’t overloaded The page is already in memory You use long-lived TCP connections right

CoDeeN By The Numbers In operation ~10 months 150 nodes (~120 live) 6.5 million reqs/day 5 million “good” reqs/day about 300GB/day (estimate) 7K-20K unique IPs per 24 hours Over 600,000 unique IPs served

Our “Strategy” Stay operational Build some credibility Exploit that + activity to branch out Involves doing sales pitches Tap into new consumers In particular, nonprofits, non-commercial

What Most CDNs (want to) Serve

But What About Big Files?

How Big? 200 TeraBytes of data total Interviews: about 3.5GB each Files: average of 700MB each

Problem: “Nobody” Handles 700MB CDNs designed for avg size 10KB 1MB = 100 files 700MB = 70,000 files Commercial disks ~ 100GB Our storage ~ 3GB

New Problems Why not replicate less? You’re farther away Why not merge requests? client readahead slow client

Our Approach AgentCDN Server Client file file0-1 file2-3 file4-5 file3-4 file1-2

Low-Level HTTP Stuff GET name/ranges Header: blah HTTP/ Partial Range: start-end/length Header: blah GET name Range: bytes ranges Header: blah HTTP/ OK Content-length: piece length New-header: obj length egress ingress

Benefits Transparent to client (no software) Server only needs byte-range support Every real server has it Will generate more log entries Can use/augment HTTP infrastructure Caching, redirection, etc Adding security controls Low incremental overhead Agent is about 300 semicolons CDN mods about 20 semicolons

Dual-Use Technology Other one-to-many problems Node/experiment installs Software updates Push model instead of pull Solution? Build “master” script Push to nodes Nodes pull as needed

CoDeploy Now in beta Small set of tools at source No (new) installation at target Needed tools at CoDeeN-hosting nodes Fun components Peer-review system of CoDeeN nodes Nearest CoDeeN finder Parallel ssh, scp

What To Expect Next Will redeploy auto-rewriting service Akamai-like URL mangling Was in testing before December upgrade Tie rewriter into “hosting” service Make it simpler for provider to use CoDeeN

More Info KyoungSoo Park Vivek Pai