Presentation is loading. Please wait.

Presentation is loading. Please wait.

LEMap: Controlling Leakage in Large Chip-multiprocessor Caches via Profile-guided Virtual Address Translation Jugash Chandarlapati Mainak Chaudhuri Indian.

Similar presentations


Presentation on theme: "LEMap: Controlling Leakage in Large Chip-multiprocessor Caches via Profile-guided Virtual Address Translation Jugash Chandarlapati Mainak Chaudhuri Indian."— Presentation transcript:

1 LEMap: Controlling Leakage in Large Chip-multiprocessor Caches via Profile-guided Virtual Address Translation Jugash Chandarlapati Mainak Chaudhuri Indian Institute of Technology, Kanpur

2 Low Energy MapMotivation 37% L2 cache energy 10% dead time per page

3 Low Energy MapMotivation Past work has exploited this dead time at cache block grain Past work has exploited this dead time at cache block grain For large last-level caches the book- keeping overhead becomes enormous For large last-level caches the book- keeping overhead becomes enormous Good news: large potential of dead time exploitation at page grain Good news: large potential of dead time exploitation at page grain –By-product: smart involvement of OS Design smart VA to PA mapping to cluster virtual pages accessed together in time so that average size of idle region increases Design smart VA to PA mapping to cluster virtual pages accessed together in time so that average size of idle region increases

4 Low Energy MapHighlights Three major contributions Three major contributions –First proposal to exploit smart virtual address translation schemes for region-based leakage control in large multi-banked shared CMP NUCAs –A new application-directed page placement system call to realize the leakage-aware translation –7% total system energy saving, 50% L2 cache energy saving, 52% L2 cache power saving for an 8-core CMP with a 16 MB shared L2 cache at 65 nm on selected SPLASH-2, SPEC OMP, and DIS applications

5 Low Energy Map LEMap: Basic idea C0C1C2C3 C7C6C5C4 CROSSBAR B0B1B2B3B4B5B6B7 B15B14B13B12B11B10B9B8 Subbanks L2 bank control Idle subbank: drowsy Baseline Showing one time window

6 Low Energy Map LEMap: Basic idea C0C1C2C3 C7C6C5C4 CROSSBAR B0B1B2B3B4B5B6B7 B15B14B13B12B11B10B9B8 Subbanks L2 bank control Idle subbank: drowsy LEMap Showing one time window

7 Low Energy Map LEMap: Basic idea Map a cluster of virtual pages that are accessed together onto a few subbanks Map a cluster of virtual pages that are accessed together onto a few subbanks –Improves effectiveness of low power drowsy mode due to larger number of idle subbanks –Can power down a subbank after the last access to the cluster of virtual pages mapped onto it –Take care of proximity by choosing a subbank for a cluster of virtual pages such that average access latency is minimized (important for NUCAs)

8 Low Energy Map Implementing LEMap Collect (core id, virtual page id, timestamp) tuple for each L2 cache access via a profile run Collect (core id, virtual page id, timestamp) tuple for each L2 cache access via a profile run Cluster the virtual pages based on timestamp using a hierarchical agglomerative clustering algorithm Cluster the virtual pages based on timestamp using a hierarchical agglomerative clustering algorithm –Grow the birth and death times of a cluster gradually until the cluster size exceeds subbank size Map each cluster on a physical subbank via application-directed page placement system call that takes a vector of VPNs Map each cluster on a physical subbank via application-directed page placement system call that takes a vector of VPNs

9 Simulation results Done on an 8-core CMP with a 16 MB shared L2 cache with leakage controlled at 128 KB subbank grain (16 banks) Done on an 8-core CMP with a 16 MB shared L2 cache with leakage controlled at 128 KB subbank grain (16 banks) Models dynamic and leakage (gate and subthreshold) power of all on-chip components at 65 nm including memory controller (leakage model extracted from HSpice simulations) Models dynamic and leakage (gate and subthreshold) power of all on-chip components at 65 nm including memory controller (leakage model extracted from HSpice simulations) Models DRAM dynamic power following Micron technical notes Models DRAM dynamic power following Micron technical notes Executes eight explicitly parallel shared memory applications drawn from SPLASH-2, SPEC OMP, and DIS suites Executes eight explicitly parallel shared memory applications drawn from SPLASH-2, SPEC OMP, and DIS suites

10 Low Energy Map Simulation results: L2 cache power 52% L2 cache power saving

11 Low Energy Map Simulation results: L2 cache energy 50% L2 cache energy saving

12 Low Energy Map Simulation results: Total energy 7% total energy saving

13 Low Energy Map Simulation results: Execution time 3% loss on average

14 Low Energy MapSummary A novel virtual to physical address translation mechanism to control leakage in large shared caches in CMP A novel virtual to physical address translation mechanism to control leakage in large shared caches in CMP Uses profile information to optimize virtual page to physical subbank placement in parallel programs Uses profile information to optimize virtual page to physical subbank placement in parallel programs Controls leakage at subbank grain to reduce timekeeping overhead of drowsy Controls leakage at subbank grain to reduce timekeeping overhead of drowsy Achieves 50% L2 cache energy saving and 7% total energy saving on eight benchmark programs compared to drowsy Achieves 50% L2 cache energy saving and 7% total energy saving on eight benchmark programs compared to drowsy

15 Low Energy MapAcknowledgment Intel: graduate fellowship Intel: graduate fellowship IBM: faculty award IBM: faculty award

16 LEMap: Controlling Leakage in Large Chip-multiprocessor Caches via Profile-guided Virtual Address Translation Jugash Chandarlapati Mainak Chaudhuri Indian Institute of Technology, Kanpur THANK YOU!


Download ppt "LEMap: Controlling Leakage in Large Chip-multiprocessor Caches via Profile-guided Virtual Address Translation Jugash Chandarlapati Mainak Chaudhuri Indian."

Similar presentations


Ads by Google