Presentation is loading. Please wait.

Presentation is loading. Please wait.

System-level, Unified In-band and Out-of-band Dynamic Thermal Control Dong LiVirginia Tech Rong GeMarquette University Kirk CameronVirginia Tech.

Similar presentations


Presentation on theme: "System-level, Unified In-band and Out-of-band Dynamic Thermal Control Dong LiVirginia Tech Rong GeMarquette University Kirk CameronVirginia Tech."— Presentation transcript:

1 System-level, Unified In-band and Out-of-band Dynamic Thermal Control Dong LiVirginia Tech Rong GeMarquette University Kirk CameronVirginia Tech

2 Motivation Hot spots or elevated temperatures in areas of the data center are quite common Out-of-band techniques (e.g. CPU cooling fans) are less studied In-band and out-of-band techniques operate independently without cooperating with each other  Challenge 1: enforcing the same user control policy across diverse physical mechanisms  Challenge 2: in needs of a tunable controller

3 Temperature Characteristics of Parallel Applications

4 Three temperature characteristics  Sudden change and gradual change lead to actual temperature increase or decrease  Jitter lacks sustained increase or decrease following a spike Design a controller to recognize these types and respond accordingly

5 History-based Context-aware Temperature Control (Basic idea) Periodically profile temperature and use the historical information to predict future CPU temperature Identify the appropriate technique to perform thermal control and balance power and performance for the next interval based on the prediction

6 History-based Context-aware Temperature Control (Temperature Profiling and Prediction) Use a two-level window to track the changes in temperature in both long and short time periods Temperature samples The level-one temperature window to react to the “sudden” Average value to reduce jitter frontrear The level-two temperature window (FIFO) to react to the “gradual”

7 History-based Context-aware Temperature Control (Temperature Profiling and Prediction) We assume that temperature will change with the same rate for the next round of sampling The temperature difference (  t L1/L2 ) is then used to determine the appropriate temperature regulator response

8 History-based Context-aware Temperature Control (Target Mode Identification) Inputs:  Predicted temperature behavior based on the temperature profile (  t L1/L2 )  A parameter (P p ) specified by the user that indicates the aggressiveness of the temperature controller Outputs:  Fan speed  Frequency setting  The controls follow the thermal control policy (P p )

9 History-based Context-aware Temperature Control (Target Mode Identification) We maintain a “thermal control array” for each available thermal control technique on the system {g 1, g 2, g 3, …, g n p, …, g N } Each number represents a mode that controls temperature at a degree WeakStrong Effectiveness of controlling temperature

10 History-based Context-aware Temperature Control (Target Mode Identification) To coordinate multiple thermal management techniques, we fill out the arrays in a unified way {g 1, g 2, g 3, …, g n p, …, g N } WeakStrong Effectiveness of controlling temperature n p is determined by P p Filled with the most effective mode g N Filled with a subset of physically available modes evenly extracted from the full set

11 History-based Context-aware Temperature Control (Target Mode Identification) n p is determined by P p P MIN P MAX 1N Mapping npnp PpPp

12 History-based Context-aware Temperature Control (Target Mode Identification) P MIN P MAX 1N Mapping npnp PpPp {g 1,…,,…, g N }gnpgnp A smaller P p leads to a more aggressive thermal control  More array items store the most efficient temperature mode  A small increment in array index can lead to large increment in cooling effect

13 History-based Context-aware Temperature Control (Target Mode Identification) We use the predicted temperature variance (  t L1/L2 ) from the two-level window to identify an index in the thermal control array {g 1,…, g i,…g i+c*  t,…, g N } current modenext mode T MIN T MAX 1N Mapping C = (N-1)/(T max – T min )

14 Performance Evaluation (Platform) Implement a fan driver that dynamically set the fan speed according to processor temperature Collect temperature samples from digital thermal sensors embedded in the processor The processor can be scaled among 5 frequencies

15 Performance Evaluation (Dynamic Fan Control) Our dynamic fan control responds to temperature changes under different control policies (P p =25 (aggressive), P p =50(moderate), and P p =75(weak))

16 Performance Evaluation (Dynamic Fan Control) P p = 50; benchmark: bt.B.4 We compare our dynamic fan control method with the traditional static method and constant fan speed control

17 Performance Evaluation (Dynamic Fan Control) In general, larger maximum PWM duty cycle leads to lower temperature A less powerful fan is able to deliver similar cooling effects as a more powerful fan with our dynamic control

18 Performance Evaluation (Temperature Aware DVFS Control) Benchmark: LU.B.4; coupled with traditional static fan control; P p =50 Our DVFS control scales down frequency only when average temperature is stabilized Our DVFS control scales up frequency to its original value once the temperature is consistently below the threshold so as to avoid performance loss

19 Performance Evaluation (Temperature Aware DVFS Control) Our DVFS control performs better than CPUSPEED in terms power-saving and performance CPUSPEEDtDVFS Max allowed PWM duty cycle 75%50%25%75%50%25% # freq changes101122139223 Execution Time (s)219222223219233234 Ave Power(Watt)99.7899.30100.8097.9394.1992.78 Power-Delay Product (Watt*s) 21852.7822044.2122478.6421447.2721946.0321710.32

20 Performance Evaluation (Dynamic Hybrid Fan and DVFS Control) Our method effectively unifies different thermal control techniques and reacts to different user control policies with minimum performance impact

21 Conclusion We classify thermal characteristics of parallel applications and use a two-level temperature window to make our controller more effective We introduce a simple parameter (P p ) to allow the user to specify the aggressiveness of in-band and out- of-band techniques for thermal reductions We integrate an out-of-band method (fan control) and an in-band method (DVFS) We explore an efficient fan control method

22 Thank You


Download ppt "System-level, Unified In-band and Out-of-band Dynamic Thermal Control Dong LiVirginia Tech Rong GeMarquette University Kirk CameronVirginia Tech."

Similar presentations


Ads by Google