CoolAir Temperature- and Variation-Aware Management for Free-Cooled Datacenters Íñigo Goiri, Thu D. Nguyen, and Ricardo Bianchini 1.

Slides:



Advertisements
Similar presentations
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Green Datacenter Initiatives at SDSC Matt Campbell SDSC Data Center Services.
Advertisements

Data Center Design Issues Bill Tschudi, LBNL
Clair Christofersen Mentor – Aaron Andersen August 2, 2012
Parasol and GreenSwitch: Managing Datacenters Powered by Renewable Energy Íñigo Goiri, William Katsak, Kien Le, Thu D. Nguyen, and Ricardo Bianchini Department.
GreenSlot: Scheduling Energy Consumption in Green Datacenters Íñigo Goiri, Kien Le, Md. E. Haque, Ryan Beauchea, Thu D. Nguyen, Jordi Guitart, Jordi Torres,
A Comparison of Measure Avoided Cost Calculations using Utility TOU Load Shapes and DEER Hourly Measure Savings 14 March 2006.
Foundations of Real Estate Management TM BOMA International ® Module 3: Building Operations I Heating, Ventilating, and Cooling the Building.
Controlling Sutardja Dai Hall Andrew Krioukov Stephen Dawson-Haggerty, Jay Taneja David Culler.
1 * Other names and brands may be claimed as the property of others. Copyright © 2010, Intel Corporation. Data Center Efficiency with Optimized Cooling.
Using Benchmarking to Identify Energy Efficiency Opportunity in Cleanrooms; The Labs 21 Approach William Tschudi and Peter Rumsey June 29, 2004
Cooling Product Positioning
Climate Most important environmental factor and the first one that should be considered Can dictate what passive design strategies will be suitable and.
Utility-Function-Driven Energy- Efficient Cooling in Data Centers Authors: Rajarshi Das, Jeffrey Kephart, Jonathan Lenchner, Hendrik Hamamn IBM Thomas.
Datacenter Power State-of-the-Art Randy H. Katz University of California, Berkeley LoCal 0 th Retreat “Energy permits things to exist; information, to.
Heat Pump Clothes Dryer Presented on April 23, 2008 By Brian Pollica Michael Hildenbrand.
Transitioning unique NASA data and research technologies to the NWS 1 Evaluation of WRF Using High-Resolution Soil Initial Conditions from the NASA Land.
Intelligent Placement of Datacenters for Internet Services Íñigo Goiri, Kien Le, Jordi Guitart, Jordi Torres, and Ricardo Bianchini 1.
44 th Annual Conference & Technical Exhibition By Thomas Hartman, P.E. The Hartman Company Georgetown, Texas Sustainable Chilled Water.
 Site  Requirements  Local Resources  Initial layout ideas  Brief material selection  Supply options.
24 x 7 Energy Efficiency February, 2007 William Tschudi
September 18, 2009 Critical Facilities Round Table 1 Introducing the Heat Wheel to the Data Center Robert (Dr. Bob) Sullivan, Ph.D. Data Center Infrastructure.
Federal Data Center Consolidation Initiative Karen Petraska August 17, 2011.
University of Karlsruhe, System Architecture Group Balancing Power Consumption in Multiprocessor Systems Andreas Merkel Frank Bellosa System Architecture.
GreenHadoop: Leveraging Green Energy in Data-Processing Frameworks Íñigo Goiri, Kien Le, Thu D. Nguyen, Jordi Guitart, Jordi Torres, and Ricardo Bianchini.
Advanced Energy Management in Cloud Computing multi data center environments Giuliana Carello, DEI, Politecnico di Milano Danilo.
Important variables Water: Air: Conversion:
Data Centers - They’re Back… E SOURCE Forum September, 2007 William Tschudi
Air Conditioning and Computer Centre Power Efficiency The Reality Christophe Martel Tony Cass.
Lecture Objectives: Finish with HVAC Systems Discuss Final Project.
Sensor-Based Fast Thermal Evaluation Model For Energy Efficient High-Performance Datacenters Q. Tang, T. Mukherjee, Sandeep K. S. Gupta Department of Computer.
COMP 4923 A2 Data Center Cooling Danny Silver JSOCS, Acadia University.
Leveraging Renewable Energy in Data Centers Ricardo Bianchini on tour 2012.
Data centre air management Case studies Sophia Flucker.
Variable-Speed Heat Pump Model for a Wide Range of Cooling Conditions and Loads Tea Zakula Nick Gayeski Leon Glicksman Peter Armstrong.
GreenSoftware: Managing Datacenters Powered by Renewable Energy Íñigo Goiri, William Katsak, Md E Haque, Kien Le, Ryan Beauchea, Jordi Guitart, Jordi Torres,
Building Green Cloud Services at Low Cost Josep Ll. Berral, Íñigo Goiri, Thu D. Nguyen, Ricard Gavaldà, Jordi Torres, Ricardo Bianchini.
Thermal-aware Issues in Computers IMPACT Lab. Part A Overview of Thermal-related Technologies.
Software Architecture for Dynamic Thermal Management in Datacenters Tridib Mukherjee Graduate Research Assistant IMPACT Lab ( Department.
Thermal Aware Data Management in Cloud based Data Centers Ling Liu College of Computing Georgia Institute of Technology NSF SEEDM workshop, May 2-3, 2011.
The Data Center Challenge
ATAC: Ambient Temperature- Aware Capping for Power Efficient Datacenters Sungkap Yeo Mohammad M. Hossain Jen-cheng Huang Hsien-Hsin S. Lee.
We can…. 2 GLOBAL REFERENCES Rev: 00 References :
COMP 4923 A2 Data Center Cooling Danny Silver JSOCS, Acadia University.
1 Thermal Management of Datacenter Qinghui Tang. 2 Preliminaries What is data center What is thermal management Why does Intel Care Why Computer Science.
Authors: William Tschudi, Lawrence Berkeley National Lab Stephen Fok, Pacific Gas and Electric Company Stephen Fok, Pacific Gas and Electric Company Presented.
ApproxHadoop Bringing Approximations to MapReduce Frameworks
#watitis2015 TOWARD A GREENER HORIZON: PROPOSED ENERGY SAVING CHANGES TO MFCF DATA CENTERS Naji Alamrony
Increasing DC Efficiency by 4x Berkeley RAD Lab
Introduction to Energy Management. Week/Lesson 13 Control Strategies for Occupant Comfort.
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Energy Savings in CERN’s Main Data Centre
Data Center Energy Use, Metrics and Rating Systems Steve Greenberg Energy Management Engineer Environmental Energy Technologies Division Lawrence Berkeley.
All content in this presentation is protected – © 2008 American Power Conversion Corporation Row Cooling.
Second law of thermodynamics. The Second Law of Thermodynamics.
Bernard Boutherin 1 Feedback after four years of operation.
Data Center Energy Efficiency SC07 Birds of a Feather November, 2007 William Tschudi
Laws of Thermodynamics. Zeroth Law of Thermodynamics “If two thermodynamic systems are each in thermal equilibrium with a third, then they are in thermal.
Dell EMC Modular Data Centers
Free Air Cooling for Data Centres
Unit 2: Chapter 2 Cooling.
Forecasting with Cyber-physical Interactions in Data Centers (part 3)
The Data Center Challenge
Jacob R. Lorch Microsoft Research
Automated Cost-Aware Data Center Management (part 4)
Objectives Humidifying - steam - adiabatic Cooling towers
Lecture Objectives: Discuss HW4 parts
Objectives Psychometrics Examples in AHUs.
What is climate? TIME How does climate differ from weather?
Liebert DSE High efficiency thermal management
Objectives Processes in Cooling towers and AHUs.
Presentation transcript:

CoolAir Temperature- and Variation-Aware Management for Free-Cooled Datacenters Íñigo Goiri, Thu D. Nguyen, and Ricardo Bianchini 1

Hybrid: typical + free cooling Typical datacenter cooling Filters Evaporative cooler Fans Server racks Outside air Cooling tower Water chiller Air handling unit Server racks Microsoft DC in Chicago 2 Free cooling

Potentially negative impact on hardware reliability, especially disks High temperature Wide temperature variation High humidity Free cooling limitations 3 Outside Disk Inlet Outside temp directly impacts inlet and disk temps Daily temperature variation can be large

Roadmap Motivation and background CoolAir: Managing free-cooled datacenters Cooling modeling Cooling management Compute management CoolAir for Parasol Evaluation and general lessons Conclusions 4

Energy-aware management of cooling & workload Minimize hardware reliability issues Limit temperature and relative humidity Reduce temperature variation Major tasks 1.Predict conditions and energy 2.Select best cooling settings 3.Apply cooling settings 4.Place and schedule load CoolAir Datacenter CoolAir: Managing free-cooled datacenters Cooling Servers Cooling Manager Compute Manager Cooling Modeler Weather Forecast 5

Predictions based on linear regression model Datacenter Cooling modeling Historic Data Cooling Learner Cooling Model Temperature inside Humidity inside Cooling power Temperature outside Location in the datacenter Datacenter utilization Cooling setting Temperature inside/outside Humidity outside Cooling setting Cooling operation 6

Use predictions from cooling model Reduce variation with a temp band based on expected outside temp Maintain temperature within the band Middle: forecast outside temp + offset Periodically Predict environmentals and energy Select best settings using utility Apply cooling settings Cooling management Band selection example Temperature 7 Average Outside temperature forecast Hour Offset

Compute management Spatial placement Distribute load to servers Group servers into “pods” of similar behavior Reduce solving and modeling complexity Favor pods with higher heat recirculation Against common practice in non-free-cooled DCs Lower recirculation pods are closer to cooling → temperature variation Temporal scheduling When to execute deferrable loads (see paper) 8 Sensors Server Pod Front view of Parasol’s racks Rack 1 Rack 2

Roadmap Motivation and background CoolAir: Managing free-cooled datacenters CoolAir for Parasol Evaluation and general lessons Conclusions 9

Case study: Parasol Default cooling controller: Outside temperature ≤ 30⁰C → Free cooling with variable fan speed Outside temperature > 30⁰C → AC cycling with hysteresis External view Internal layout (top view) Exhaust Free Cooling Rack 1 Rack 2 Cold aisleHot aisle Cooling Controller Door Relays Partition Air duct 10 Air Conditioner

CoolAir for Parasol Data collection and model learning Historical sensor info for two months Generated extreme settings to learn faster Cooling configurer Interface with Parasol’s “thermostat” Control fan speed and AC Compute configurer for Hadoop Send idle worker nodes to sleep while keeping data available >90% with <0.5⁰C errors 11

Example of CoolAir on Parasol 12

Evaluation methodology Parasol as the baseline system 64 Atom servers: 8 pods in 2 racks Hadoop workloads: Non-deferrable Facebook (see paper for others) Real experiments and validated simulations Evaluated policies (see paper for others) 13 PolicyTemperatureHumidityEnergySpatial placement BaselineReactive <30⁰C ✔✔✘ CoolAirAdaptive band ✔✔ High recirculation

Baseline vs CoolAir 14 Warmer locations are more inefficient Up to 4⁰C reduction Up to 60% reduction

Multiple geographical locations Power efficiency (PUE) improvement Reduction in max temperature range Improves PUE in warmer locations where PUE is worse Reduces variation the most in colder locations where variation is highest Sacrifices PUE slightly in cold locations to -0.01

Principles and lessons learned Variation management requires fine-grain cooling and load control Management challenges depend on the climate Warm: managing absolute temperature costs more than variation Cold: managing temperature variation is more critical and successful Temp band and spatial placement are key; temporal scheduling is not Other lessons in the paper 16

Conclusions CoolAir successfully manages Absolute temperature and temperature variation Relative humidity Energy CoolAir broadens the set of areas where free cooling can be used Principles should apply to larger datacenters 17

CoolAir Temperature- and Variation-Aware Management for Free-Cooled Datacenters Íñigo Goiri, Thu D. Nguyen, and Ricardo Bianchini 18

Motivation Typical cooling in datacenters Chillers, cooling tower, air handlers Very energy hungry Bring cool air from outside: free cooling Reduces cooling energy Typically used in cooler and drier climates Warmer locations: hybrid Free cooling when external temperature and humidity are suitable 19 Microsoft DC in Chicago

Validation of models and simulations ~80% with <0.5⁰C errors Real behavior Simulation Simulation close to real behavior 20

Example of CoolAir on Parasol 21

Principles and lessons learned (long version) Absolute temperatures and variations are high in many locations Variability management requires fine-grain cooling and load control Management challenges depend on the climate Warm: managing absolute temperature costs more than variation Cold: managing temperature variation is more critical and successful Temp band and spatial placement are key; temporal scheduling is not Management is “easier” when allowed temperatures can be higher Weather forecast inaccuracy is not a problem (temp band) Energy cost of CoolAir is low even in hot climates 22

Why simulation? Limitations of a real system  simulation External conditions change → can’t compare runs Results for a whole year at multiple locations around the world Average simulation errors < 6% Parasol cooling changes are too abrupt  variable-speed AC 23

Absolute temperature violation No violations Small violations 24 Large violations CoolAir

Maximum daily temperature variation Reduced variation 25 CoolAir Show only maximum

Power efficiency Low efficiency PUE (Power Usage Efficiency) Warmer locations are more inefficient Small increase over the Energy version 26 CoolAir

Evaluated policies Policies isolate the impact of CoolAir characteristics PoliciesTemperatureHumidityEnergySpatial BaselineReactive <30⁰C ✔✔✘ TemperaturePredictive <30⁰C ✔✔ Low recirculation VariationAdaptive band ✔✘ High recirculation EnergyPredictive <30⁰C ✔✔ Low recirculation CoolAirAdaptive band ✔✔ High recirculation 27