Presentation is loading. Please wait.

Presentation is loading. Please wait.

Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst.

Similar presentations


Presentation on theme: "Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst."— Presentation transcript:

1 Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst and Intel Research † Fifth USENIX OSDI, Boston, Dec 2002

2 Computer Science 2 Introduction r Proliferation of Internet applications m E-commerce, streaming media, online games, … r Commonly hosted on clusters of servers m Cheaper alternative to large multiprocessors Clients Internet Streaming Games E-commerce cluster

3 Computer Science 3 Hosting Platforms r Hosting platform: server cluster that runs third-party applications r Applications pay for server resources m CPU, network bandwidth, memory, disk r Platform provider guarantees resource availability r Challenge: Maximize # hosted applications while providing resource guarantees

4 Computer Science 4 Design Challenges r How to determine an application’s resource needs? r How to provision resources to meet these needs? r How to map applications to servers in the platform? r How to handle dynamic variations in load?

5 Computer Science 5 Talk Outline þ Introduction r Inferring Resource Requirements r Provisioning Resources r Mapping Applications to Servers r Experimental Evaluation r Related Work

6 Computer Science 6 Terminology r Hosting platform models m Dedicated: Applications get integral # nodes m Shared: Applications may get fractional # nodes Applications Platform nodes http App serv DB serv r Capsule: component of an application running on a node

7 Computer Science 7 Provisioning By Overbooking r Worst-case provisioning is wasteful m Low utilization of resources r Applications may be tolerant to occasional violations m E.g., CPU guarantees should be met 99% of the time r Possible to provide useful guarantees even after provisioning less than worst-case needs ð Overbook resources to improve utilization m E.g., Airline reservations

8 Computer Science 8 Application Profiling r Use the Linux Trace Toolkit [Yaghmour00] time Begin CPU quantumEnd CPU quantum ON OFF r Profiling: process of determining resource usage m Run the application on an isolated set of nodes m Subject the application to a real workload m Model CPU and network usage as ON-OFF processes

9 Computer Science 9 Resource Usage Distribution time Measurement Interval Cumulative Probability Fractional usage 01 1 A 0.99 B Fractional usage ON-OFF PROCESS Probability 01 1 PDF CDF

10 Computer Science 10 Profiles of Server Applications r Applications exhibit different degrees of burstiness r Need to capture variability in resource usage 0 0.02 0.04 0.06 0.08 0.1 00.20.40.60.81 Postgres Server, 10 clients Probability Fraction of CPU 0 0.05 0.1 0.15 0.2 0.25 0.3 00.10.20.30.40.50.60.7 Apache Web Server, 50% cgi-bin Probability Fraction of CPU 0 0.05 0.1 0.15 0.2 0.25 0.3 00.1.20.30.40.50.60.70.8 Streaming Media Server, 20 clients Probability Fraction of NW bandwidth

11 Computer Science 11 Capturing Burstiness: Token Bucket r Token Bucket (σ, ρ) m Resource usage over t ≤ σ.t + ρ ρ1ρ1 ρ2ρ2 time usage σ 1.t + ρ 1 σ 2.t + ρ 2 time ON-OFF PROCESS r Choose (σ, ρ) based on a high percentile

12 Computer Science 12 Resource Overbooking Mechanism r Applications specify overbooking tolerance O i m Probability with which capsule needs may be violated r Controlled overbooking via admission control: m Resource requirements of all capsules are met Σ K (σ k ·T min + ρ k )·(1 - O k ) ≤ C·T min m Overbooking tolerances of all capsules are met Pr (Σ K U k > C) ≤ min (O 1,…,O k ) r A node that has sufficient resources for a capsule is feasible for it

13 Computer Science 13 Mapping Capsules to Nodes r A bipartite graphs of capsules and feasible nodes r Greedy mapping: consider capsules in non-decreasing order of degrees r Multiple feasible nodes => random, best fit, worst fit… 1 2 3 1 2 3 4 capsules nodes capsules nodes 1 3 3 1 2 4 Final Mapping

14 Computer Science 14 Talk Outline þ Introduction þ Inferring Resource Requirements þ Provisioning Resources þ Mapping Applications to Servers r Experimental Evaluation r Related Work

15 Computer Science 15 The SHARC Prototype r A Linux-based Shared Hosting Platform m 6 Dell Poweredge 1550 servers m Gigabit Ethernet link r Software Components m Profiling  Vanilla Linux + Linux Trace Toolkit m Control plane  Overbooking, placement m QoS-enhanced Linux kernel  HSFQ schedulers

16 Computer Science 16 Experimental Setup r Prototype running on a 5 node cluster m Each server: 1 GHz PIII with 512MB RAM and Gigabit ethernet m Control plane runs on a dedicated node m Applications run on the other four nodes r Workload: mix of server applications m Apache web server with SPECWeb99 (static & dynamic HTTP) m PostgreSQL database server with pgbench (TPC-B) benchmark m MPEG streaming server with 1.5 Mb/s VBR MPEG-1 clients m Quake I game server with “terminator” bots

17 Computer Science 17 Resource Overbooking Benefits r Small amounts of overbooking can yield large gains r Bursty applications yield larger benefits Placement of Apache Web Servers 400 600 800 1000 1200 1400 020406080100120140 No Ovb Ovb=1% Ovb=5% Web Servers Placed Number of Nodes 0 50 100 150 200 250 300 350 020406080100120140 Placement of Streaming Media Servers No Ovb Ovb=1% Ovb=5% Media Servers Placed Number of Nodes

18 Computer Science 18 Performance with Overbooking 0 5 10 15 20 25 Isolated100th99th95thAverage Performance of Postgres Throughput (trans/s) CPU Provisioning 0 10 20 30 40 50 60 70 Isolated100th99th95thAverage Performance of Apache Throughput (req/s) CPU Provisioning r Performance degradation is within specified overbooking tolerance

19 Computer Science 19 Handling Flash Crowds r Detect overloads by online profiling r Reacting to overloads (ongoing work) m Compute new allocations m Change allocations, move capsules, add servers

20 Computer Science 20 Related Work r Single node resource management m Proportional share schedulers: WFQ, SFQ, BVT, … m Reservation based schedulers: Nemesis, Rialto, … r Cluster-based resource management m Cluster Reserves [Aron00] m MUSE [Chase01]: economic approach m Oceano [IBM], Planetary computing [HP] m Clusters for high availability: Porcupine [Saito99] m Grid computing [Globus]

21 Computer Science 21 Concluding Remarks r Resource management in shared hosting platforms m Application profiling to determine resource usage m Controlled overbooking to improve utilization m Mapping applications to servers r Future work m Handling dynamic workloads m Managing memory and disk bandwidth r URL: http://lass.cs.umass.edu


Download ppt "Computer Science 1 Resource Overbooking and Application Profiling in Shared Hosting Platforms Bhuvan Urgaonkar Prashant Shenoy Timothy Roscoe † UMASS Amherst."

Similar presentations


Ads by Google