Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.

Slides:



Advertisements
Similar presentations
Enabling Secure Internet Access with ISA Server
Advertisements

Neighbor Discovery for IPv6 Mangesh Kaushikkar. Overview Introduction Terminology Protocol Overview Message Formats Conceptual Model of a Host.
Umut Girit  One of the core members of the Internet Protocol Suite, the set of network protocols used for the Internet. With UDP, computer.
Socket Layer Security. In this Presentation: need for web security SSL/TLS transport layer security protocols HTTPS secure shell (SSH)
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
CSCI 4550/8556 Computer Networks Comer, Chapter 19: Binding Protocol Addresses (ARP)
CS335 Networking & Network Administration Tuesday, May 11, 2010.
Internet Networking Spring 2002 Tutorial 13 Web Caching Protocols ICP, CARP.
Hands-On Microsoft Windows Server 2003 Networking Chapter 7 Windows Internet Naming Service.
Freenet A Distributed Anonymous Information Storage and Retrieval System I Clarke O Sandberg I Clarke O Sandberg B WileyT W Hong.
Implementing ISA Server Caching. Caching Overview ISA Server supports caching as a way to improve the speed of retrieving information from the Internet.
Chapter 19 Binding Protocol Addresses (ARP) Chapter 20 IP Datagrams and Datagram Forwarding.
1Bloom Filters Lookup questions: Does item “ x ” exist in a set or multiset? Data set may be very big or expensive to access. Filter lookup questions with.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 7: Planning a DNS Strategy.
Web Caching Schemes For The Internet – cont. By Jia Wang.
1 The Mystery of Cooperative Web Caching 2 b b Web caching : is a process implemented by a caching proxy to improve the efficiency of the web. It reduces.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
COMPUTER TERMS PART 1. COOKIE A cookie is a small amount of data generated by a website and saved by your web browser. Its purpose is to remember information.
(part 3).  Switches, also known as switching hubs, have become an increasingly important part of our networking today, because when working with hubs,
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
Network Administration
ICMP (Internet Control Message Protocol) Computer Networks By: Saeedeh Zahmatkesh spring.
Redirection and Load Balancing
23-Support Protocols and Technologies Dr. John P. Abraham Professor UTPA.
How the Internet Works. The Internet and the Web The Web is actually just one of many computer applications that run on the Internet Among others are.
1 Computer Communication & Networks Lecture 28 Application Layer: HTTP & WWW p Waleed Ejaz
Behzad Akbari Spring 2012 (These slides are based on lecture slides by Lawrie Brown)
World Wide Web Caching: Trends and Technologys Gerg Barish & Katia Obraczka USC Information Sciences Institute, USA,2000.
10/8/2015CST Computer Networks1 IP Routing CST 415.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 17 This presentation © 2004, MacAvon Media Productions Multimedia and Networks.
Multicast Routing Algorithms n Multicast routing n Flooding and Spanning Tree n Forward Shortest Path algorithm n Reversed Path Forwarding (RPF) algorithms.
TCP/IP Transport and Application (Topic 6)
The Inter-network is a big network of networks.. The five-layer networking model for the internet.
© J. Liebeherr, All rights reserved 1 Multicast Routing.
The Intranet.
ICP and the Squid Web Cache Duanc Wessels k Claffy August 13, 1997 元智大學系統實驗室 宮春富 2000/01/26.
Multimedia and Networks. Protocols (rules) Rules governing the exchange of data over networks Conceptually organized into stacked layers – Application-oriented.
TCP/IP (Transmission Control Protocol / Internet Protocol)
HTTP evolution - TCP/IP issues Lecture 4 CM David De Roure
Implementing ISA Server Caching
ICP and the Squid Web Cache Duane Wessels and K. Claffy 산업공학과 조희권.
Setup and Management for the CacheRaQ. Confidential, Page 2 Cache Installation Outline – Setup & Wizard – Cache Configurations –ICP.
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 17 This presentation © 2004, MacAvon Media Productions Multimedia and Networks.
Web Services. 2 Internet Collection of physically interconnected computers. Messages decomposed into packets. Packets transmitted from source to destination.
Lecture 4 Mechanisms & Kernel for NOSs. Mechanisms for Network Operating Systems  Network operating systems provide three basic mechanisms that support.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Cache Digest Alex Rousskov Duane Wessels National Laboratory for Applied Network Research April 17, 1998 元智大學 資訊工程研究所 系統實驗室 陳桂慧 February 9, 1999.
Networks, Part 2 March 7, Networks End to End Layer  Build upon unreliable Network Layer  As needed, compensate for latency, ordering, data.
Hiearchial Caching in Traffic Server. Hiearchial Caching  A set of techniques and mechanisms to increase the size and performance of network caches.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Internet Cache Protocol Erez Tal Assaf Oren Avner Cohen Submission Date: 5/2/01 Guides: Ran Wolff and Itai Dabran.
TCP/IP1 Address Resolution Protocol Internet uses IP address to recognize a computer. But IP address needs to be translated to physical address (NIC).
The Intranet.
Internet and Intranet.
Web Caching? Web Caching:.
Switching Techniques In large networks there might be multiple paths linking sender and receiver. Information may be switched as it travels through various.
Internet Networking recitation #12
Internet and Intranet.
Computer Communication & Networks
Starting TCP Connection – A High Level View
Switching Techniques.
Internet and Intranet.
HyperText Transfer Protocol
Internet and Intranet.
Presentation transcript:

Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP

2 ICP - Internet Caching Protocol ICP is Web caching protocol ICP version 2 defined in RFC 2186 Message format used for communicating among Web caches Used to exchange hints about the existence of URLs in neighbor caches. Caches exchange ICP queries and replies gather information to use in selecting the most appropriate location from which to retrieve an object

3 ICPv2 Protocol specification Generally, Web caches use HTTP for the transfer of object data However, caches can benefit from a simpler, lighter communication protocol. ICP is primarily used in a cache mesh to locate specific Web objects in neighboring caches. One cache sends an ICP query to its neighbors. The neighbors send back ICP replies indicating a "HIT" or a "MISS."

4 ICP Implementation In current practice, ICP is implemented on top of UDP There is no requirement that it be limited to UDP. ICP over UDP offers features important to Web caching applications. Query/reply exchange needs to occur quickly. A cache cannot wait longer than that before beginning to retrieve an object. Failure to receive a reply message means the network path is either congested or broken. In either case we would not want to select that neighbor.

5 Cache selection ICP messages can also be used for cache selection Failure to receive a reply from a cache network or system failure. The ICP reply may include extra information Can assist selection of the most appropriate source from which to retrieve an object.

6 ICPv2 application specification RFC 2187RFC 2187 A single Web cache will reduce the amount of traffic generated by the clients behind it Similarly, a group of Web caches can benefit by sharing another cache in much the same way In a cache hierarchy (or mesh) one cache establishes peering relationships with its neighbor caches

7 Web Cache Hierarchies Two types of cache relationship: Parent A parent cache is essentially one level up in a cache hierarchy Sibling A sibling cache is on the same level Neighbor (peer) Is either parent or sibling which is a single “cache-hop” away

8 A Simple Web Cache Hierarchy Internet Parent Cache Local Cache Sibling Cache Cache Clients Hits Resolved Hits and Misses Resolved Direct Retrievals

9 Levels The general flow of document requests is up the hierarchy When a cache does not hold a requested object It may ask via ICP whether any of its neighbor caches has the object. If there is a ‘Hit’ then the cache will request it from them. Else the cache must forward the request either to a parent, or directly to the origin server.

10 Parent and Sibling Caches “Neighbor hit" may be fetched from either parent or sibling cache “Neighbor miss" may NOT be fetched from a sibling. In other words: sibling relationship - can retrieve objects the sibling already has cached. parent relationship - can retrieve any object regardless of whether or not it is cached.

11 ICP Additional Delay Caches are designed to return ICP requests quickly. The application does minimal processing of the ICP request Most ICP-related delay is due to transmission on the network. ICP serves to provide an indication of neighbor reachability. If ICP replies from a neighbor fail to arrive, it should not be used at this time Network path is congested (or down) Cache application is not running on the ICP-queried neighbor machine ICP provides also some form of load balancing, because an idle cache can reply faster than a busy one.

12 Determine whether to use ICP Not every HTTP request requires an ICP query to be sent Obviously, cache hits will not need ICP because the request is satisfied immediately For origin servers very close to the cache, we do not want to use any neighbor caches Some classes of requests the cache (or the administrator) may prefer to forward directly to the origin server all non-GET request methods URLs containing certain strings (e.g. “cgi_bin”)

13 Source Selection The cache sends queries to each peer. In order to maximize the chance to get a HIT reply from one of the peers, the cache waits for all ICP replies to be received (query timeout is applied). HIT reply - object retrieval commences immediately from the replying peer. When all peers MISS either parent cache or the origin server is selected.

14 Multicast for Efficient Distribution A cache may deliver ICP queries to a multicast address. Neighbor caches may join the multicast group to receive such queries. But for multicast we have no way to know exactly how many replies to expect ICP replies sent to unicast address: Multicasting ICP replies would not reduce the number of packets sent. It prevents other group members from receiving unexpected replies. The reply should follow unicast routing path to indicate connectivity between the receiver and the sender since the subsequent HTTP request will be unicast routed.

15 Differences Between ICP and HTTP HTTP supports a rich and sophisticated set of features. ICP was designed to be simple, small, and efficient. HTTP request and reply headers consist of lines of ASCII text. ICP uses a fixed size header and represents numbers in binary.

16 CARP - Cache Array Routing Protocol Microsoft® Proxy Server 2.0 uses the Cache Array Routing Protocol (CARP) Series of algorithms that are applied on top of HTTP Multiple proxy servers are arrayed as a single logical cache Does not require a new wire protocol Uses HTTP, compatible with existing firewalls and proxy servers

17 Hash-based Routing Provides a deterministic "request resolution path" through an array of proxies The request resolution path Hashing of proxy array member identities and URLs For any given URL request, the proxy server will know exactly where in the proxy array the information will be stored (or still not)

18 Benefits Deterministic request resolution path: No query messaging between proxy servers that existed in ICP Eliminates the duplication of contents that otherwise occurs on an array of proxy servers Has positive scalability, becomes faster and more efficient as more proxy servers are added

19 How CARP works A hash function is computed for the name of each proxy server A hash function is computed for the name of each requested URL The hash value of the URL is combined with the hash value for each proxy Whichever URL+Proxy Server hash comes up with the highest value, becomes "owner" of the information cache If a server fails, its URLs are automatically rerouted to the server with the next highest score

20 How CARP works (cont.) The result: Deterministic location for all cached information Web browser or downstream proxy server can know exactly where a requested URL either already is stored locally, or will be located after caching Because the hash functions used to assign values are so great: 2^32 = the result is a statistically distributed load balancing across the array

21 Updating Membership List Array manager maintains a current list of members of a particular proxy array All proxies servers in the array stores their own local copies of the array list and periodically send requests for updates to the array manager They also watches all HTTP requests to any array members and if a request fails, then marks that proxy member down until next update from the array manager