We think you have liked this presentation. If you wish to download it, please recommend it to your friends in any social system. Share buttons are a little bit lower. Thank you!
Presentation is loading. Please wait.
Published bySonny Barley
Modified over 2 years ago
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 6 Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism: Computer Architecture A Quantitative Approach, Fifth Edition
2 Copyright © 2012, Elsevier Inc. All rights reserved. Introduction Warehouse-scale computer (WSC) Provides Internet services Search, social networking, online maps, video sharing, online shopping, , cloud computing, etc. Differences with HPC clusters: Clusters have higher performance processors and network Clusters emphasize thread-level parallelism, WSCs emphasize request-level parallelism Differences with datacenters: Datacenters consolidate different machines and software into one location Datacenters emphasize virtual machines and hardware heterogeneity in order to serve varied customers Introduction
3 Copyright © 2012, Elsevier Inc. All rights reserved. Introduction Important design factors for WSC: Cost-performance Small savings add up Energy efficiency Affects power distribution and cooling Work per joule Dependability via redundancy Network I/O Interactive and batch processing workloads Ample computational parallelism is not important Most jobs are totally independent Request-level parallelism Operational costs count Power consumption is a primary, not secondary, constraint when designing system Scale and its opportunities and problems Can afford to build customized systems since WSC require volume purchase Introduction
4 Copyright © 2012, Elsevier Inc. All rights reserved. Prgrmg Models and Workloads Batch processing framework: MapReduce Map: applies a programmer-supplied function to each logical input record Runs on thousands of computers Provides new set of key-value pairs as intermediate values Reduce: collapses values using another programmer-supplied function Programming Models and Workloads for WSCs
5 Copyright © 2012, Elsevier Inc. All rights reserved. Prgrmg Models and Workloads Example: map (String key, String value): // key: document name // value: document contents for each word w in value EmitIntermediate(w,1); // Produce list of all words reduce (String key, Iterator values): // key: a word // value: a list of counts int result = 0; for each v in values: result += ParseInt(v); // get integer from key-value pair Emit(AsString(result)); Programming Models and Workloads for WSCs
6 Copyright © 2012, Elsevier Inc. All rights reserved. Prgrmg Models and Workloads MapReduce runtime environment schedules map and reduce task to WSC nodes Availability: Use replicas of data across different servers Use relaxed consistency: No need for all replicas to always agree Workload demands Often vary considerably Programming Models and Workloads for WSCs
7 Copyright © 2012, Elsevier Inc. All rights reserved. Computer Architecture of WSC WSC often use a hierarchy of networks for interconnection Each 19 rack holds 48 1U servers connected to a rack switch Rack switches are uplinked to switch higher in hierarchy Uplink has 48 / n times lower bandwidth, where n = # of uplink ports Oversubscription Goal is to maximize locality of communication relative to the rack Computer Ar4chitecture of WSC
8 Copyright © 2012, Elsevier Inc. All rights reserved. Storage Storage options: Use disks inside the servers, or Network attached storage through Infiniband WSCs generally rely on local disks Google File System (GFS) uses local disks and maintains at least three relicas Computer Ar4chitecture of WSC
9 Copyright © 2012, Elsevier Inc. All rights reserved. Array Switch Switch that connects an array of racks Array switch should have 10 X the bisection bandwidth of rack switch Cost of n-port switch grows as n 2 Often utilize content addressible memory chips and FPGAs Computer Ar4chitecture of WSC
10 Copyright © 2012, Elsevier Inc. All rights reserved. WSC Memory Hierarchy Servers can access DRAM and disks on other servers using a NUMA-style interface Computer Ar4chitecture of WSC
11 Copyright © 2012, Elsevier Inc. All rights reserved. Infrastructure and Costs of WSC Location of WSC Proximity to Internet backbones, electricity cost, property tax rates, low risk from earthquakes, floods, and hurricanes Power distribution Physcical Infrastrcuture and Costs of WSC
12 Copyright © 2012, Elsevier Inc. All rights reserved. Infrastructure and Costs of WSC Cooling Air conditioning used to cool server room 64 F – 71 F Keep temperature higher (closer to 71 F) Cooling towers can also be used Minimum temperature is wet bulb temperature Physcical Infrastrcuture and Costs of WSC
13 Copyright © 2012, Elsevier Inc. All rights reserved. Infrastructure and Costs of WSC Cooling system also uses water (evaporation and spills) E.g. 70,000 to 200,000 gallons per day for an 8 MW facility Power cost breakdown: Chillers: 30-50% of the power used by the IT equipment Air conditioning: 10-20% of the IT power, mostly due to fans How man servers can a WSC support? Each server: Nameplate power rating gives maximum power consumption To get actual, measure power under actual workloads Oversubscribe cumulative server power by 40%, but monitor power closely Physcical Infrastrcuture and Costs of WSC
14 Copyright © 2012, Elsevier Inc. All rights reserved. Measuring Efficiency of a WSC Power Utilization Effectiveness (PEU) = Total facility power / IT equipment power Median PUE on 2006 study was 1.69 Performance Latency is important metric because it is seen by users Bing study: users will use search less as response time increases Service Level Objectives (SLOs)/Service Level Agreements (SLAs) E.g. 99% of requests be below 100 ms Physcical Infrastrcuture and Costs of WSC
15 Copyright © 2012, Elsevier Inc. All rights reserved. Cost of a WSC Capital expenditures (CAPEX) Cost to build a WSC Operational expenditures (OPEX) Cost to operate a WSC Physcical Infrastrcuture and Costs of WSC
16 Copyright © 2012, Elsevier Inc. All rights reserved. Cloud Computing WSCs offer economies of scale that cannot be achieved with a datacenter: 5.7 times reduction in storage costs 7.1 times reduction in administrative costs 7.3 times reduction in networking costs This has given rise to cloud services such as Amazon Web Services Utility Computing Based on using open source virtual machine and operating system software Cloud Computing
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 6 Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism: Computer.
1 现代计算机体系结构 主讲教师：张钢 教授 天津大学计算机学院 通信邮箱： 提交作业邮箱： 2015 年.
1 Lecture 20: WSC, Datacenters Topics: warehouse-scale computing and datacenters (Sections ) – the basics followed by a look at the future.
MS108 Computer System I Lecture 13 Warehouse Computing Prof. Xiaoyao Liang 2015/6/10 1.
1 Lecture 24: WSC, Datacenters Topics: network-on-chip wrap-up, warehouse-scale computing and datacenters (Sections )
CS203 – Advanced Computer Architecture Warehouse Scale Computing.
1 Lecture: WSC, Datacenters Topics: network-on-chip wrap-up, warehouse-scale computing and datacenters (Sections )
Introduction to the new mainframe: Large-Scale Commercial Computing © Copyright IBM Corp., All rights reserved. Chapter 2: Capacity.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
Cloud Computing Data Centers Dr. Sanjay P. Ahuja, Ph.D FIS Distinguished Professor of Computer Science School of Computing, UNF.
Energy in Cloud Computing and Renewable Energy Reza Farivar.
1 Copyright © 2011, Elsevier Inc. All rights Reserved. Chapter 6 Authors: John Hennessy & David Patterson.
Lecture 12: MapReduce: Simplified Data Processing on Large Clusters Xiaowei Yang (Duke University)
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
1 Chapter 11: Data Centre Administration Objectives Data Centre Structure Data Centre Structure Data Centre Administration Data Centre Administration Data.
POM - J. Galván 1 PRODUCTION AND OPERATIONS MANAGEMENT Ch. 8: Capacity Planning.
Data Mining on the Web via Cloud Computing COMS E6125 Web Enhanced Information Management Presented By Hemanth Murthy.
1 Chapter 40 - Physiology and Pathophysiology of Diuretic Action Copyright © 2013 Elsevier Inc. All rights reserved.
2 Industry trends and challenges Windows Server 2012: Beyond virtualization Complete virtualization platform Improved scalability and performance.
Data Centers and Cloud Computing 1. 2 Data Centers 3.
MapReduce: Simplified Data Processing on Large Clusters Jeffrey Dean and Sanjay Ghemawat.
MAP REDUCE PROGRAMMING Dr G Sudha Sadasivam. Map - reduce sort/merge based distributed processing Best for batch- oriented processing Sort/merge is primitive.
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
1 Sizing the Streaming Media Cluster Solution for a Given Workload Lucy Cherkasova and Wenting Tang HPLabs.
How AWS Pricing Works Jinesh Varia Technology Evangelist.
Sabyasachi Ghosh Mark Redekopp Murali Annavaram Ming-Hsieh Department of EE USC KnightShift: Enhancing Energy Efficiency by.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
Datacenter Power State-of-the-Art Randy H. Katz University of California, Berkeley LoCal 0 th Retreat “Energy permits things to exist; information, to.
Homework 2 In the docs folder of your Berkeley DB, have a careful look at documentation on how to configure BDB in main memory. In the docs folder of your.
Paint IT Green A Guide to Lowering Your ICT Carbon from Data Centre.
N. GSU Slide 1 Chapter 02 Cloud Computing Systems N. Xiong Georgia State University.
Introduction to Hadoop and HDFS. Table of Contents Hadoop – Overview Hadoop Cluster HDFS.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW Describe the clustering capabilities of Microsoft Windows.
Map Reduce for data-intensive computing (Some of the content is adapted from the original authors’ talk at OSDI 04)
1 Copyright © 2012, Elsevier Inc. All rights reserved. Chapter 5 Multiprocessors and Thread-Level Parallelism Computer Architecture A Quantitative Approach,
Sweet Storage SLOs with Frosting Andrew Wang, Shivaram Venkataraman, Sara Alspaugh, Ion Stoica, Randy Katz.
CHAPTER 2 OPERATING SYSTEM OVERVIEW 1. Operating System Operating System Definition A program that controls the execution of application programs and.
Background Computer System Architectures Computer System Software.
Government Technology Bay Area Technology Forum November 5, 2009 Presented By: Jon Fullinwider Director, Local Government Citrix Systems, Inc. Presented.
DATABASE SYSTEM CONCEPTS AND ARCHITECTURE CHAPTER 2 1.
By Jeff Dean & Sanjay Ghemawat Google Inc. OSDI 2004 Presented by : Mohit Deopujari.
Challenges towards Elastic Power Management in Internet Data Center.
1 Case Study of Demand Response: Con Edison Business Program by Eileen Egan-Annechino Consolidated Edison Company of New York, Inc.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. 5 Capacity Planning For Products and Services.
VMware vSphere Configuration and Management v6 1.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Introduction to Computer Administration Introduction.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3: Operating Systems Computer Science: An Overview Tenth Edition.
4a. Aula 2o. Período de Livro texto Copyright © 2012, Elsevier Inc. All rights reserved March 5, 2012 Prof. Kai Hwang, USC Cloud Roles in.
© 2017 SlidePlayer.com Inc. All rights reserved.