Presentation is loading. Please wait.

Presentation is loading. Please wait.

Mellow Writes: Extending Lifetime in Resistive Memories through Selective Slow Write Backs Lunkai Zhang, Diana Franklin, Frederic T. Chong 1 Brian Neely,

Similar presentations


Presentation on theme: "Mellow Writes: Extending Lifetime in Resistive Memories through Selective Slow Write Backs Lunkai Zhang, Diana Franklin, Frederic T. Chong 1 Brian Neely,"— Presentation transcript:

1 Mellow Writes: Extending Lifetime in Resistive Memories through Selective Slow Write Backs Lunkai Zhang, Diana Franklin, Frederic T. Chong 1 Brian Neely, Dmitri Strukov, Yuan Xie

2 An Everyday Experience: Sharpening a Knife Do it harshly … – Takes a shorter time. – Bad for the knife. Do it gently … – Takes a longer time. – Good for the knife. 2

3 A Similar Trade-off Also Works for Resistive Memory Write Operations! 3 For typical Resistive Memory technologies, slower writes are predicted to have a quadratic endurance advantage! Citation: D. B. Strukov, “Endurance-write-speed tradeoffs in nonvolatile memories,” Applied Physics A, vol. 122, no. 4, pp. 1–4, 2016. Write with higher power… – Takes a shorter time. – Lower endurance, shorter lifetime Write with lower power … – Takes a longer time. – Higher endurance, longer lifetime

4 A Single Write Latency Is NOT Enough! A Single Shorter Write Latency => Memory Lifetime suffers for some applications. A Single Longer Write Latency => Performance suffers for some other applications. Is it possible to let a system adaptively use different write speeds, so we can improve the lifetime without loss of performance? 4

5 Relatively Low Bank Utilization Memory banks are idle for most of the time. Is it possible to use the bank idle time to slowly write back the data? 5

6 Schemes Mellow Writes – Bank-Aware Mellow Writes – Eager Mellow Writes Wear Quota 6

7 Schemes Mellow Writes – Bank-Aware Mellow Writes – Eager Mellow Writes Wear Quota 7

8 Motivation: Bank Level Imbalance Bank 0 has only 1 memory block to be written back. It is less likely that the write queue will be blocked by Bank 0. 8 Bank 2 has more memory blocks to be written back. It is more likely that the write queue will be blocked by Bank 2. # Awaiting Writes 3 1 2 1

9 Bank-Aware Mellow Writes 9 Slow Write Normal Write Approach : Slowly writing back a memory block only when there is no other memory block queued for the same bank. # Awaiting Writes 3 1 2 1 Write back the only memory block for Bank 0 in slow speed. Write back current memory block for Bank 2 in normal speed.

10 Simulated System OoO Alpha core. 32KB L1 I/D-$, 256KB L2$, 2MB L3$ (LLC). 4GB Resistive Main Memory (ReRAM technology), 16 Banks, 32-entry read/write queues, write drain, Start-Gap Wear Leveling, (1.0x latency = 150ns, 1.00x endurance = 5.0 * 10^6 ): – Norm Writes (1.0x): 1.00x latency, 1.00x endurance – Slow Writes (3.0x): 3.00x latency, 9.00x endurance Norm Writes with no write cancellation. Slow writes with write cancellation. Eight-Year lifetime requirement. 10

11 Effectiveness of Bank-Aware Mellow Writes 11 No Noticeable Performance Degradation. Geomean 87% lifetime improvement compared with All-Norm. 4 out of 11 applications meet the 8-year lifetime requirements.

12 Schemes Mellow Writes – Bank-Aware Mellow Writes – Eager Mellow Writes Wear Quota 12

13 So … is it possible to reschedule the writes? If we can evenly reschedule the writes … Wasted! Too Crowded! Motivation: Write Scheduling Imbalance in a Memory Bank With Bank-Aware Mellow Writes 13

14 Eager Mellow Writes We predict which dirty cache lines in the Last Level Cache will not be written again before their evictions, and eagerly and slowly write back these cache lines. In some sense, we treat Last Level Cache as a large write buffer, in which we find proper write backs to fill the idle memory intervals. 14

15 Choosing Cache Lines for Eager Mellow Writes In this paper, we choose dirty cache lines which are predicted to be useless as the candidates for Eager Mellow Writes. Those are, the cache lines will not be accessed again before their eviction. 15 Set 0 Set 1 Set 2 Set 3 Predicted uselessPredicted useful Candidates of Eager Mellow Writes if Dirty Last Level Cache

16 A Utility Based Approach To Predict Useless Cache Lines For an LRU Set-associative Last Level Cache (LLC): Add an access counter for each LRU stack position in LLC. Increase the corresponding access counter if there is an access hit on an LRU position. For every time slice (500,000 cycles), choose the consecutive least-used LRU positions with sum less than 1/32 LLC accesses. In the next time slice, consider these cache lines with these LRU positions as useless, and they can be eagerly written back. 16 Citation: Moinuddin K. Qureshi & Yale N. Patt, “Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches”, MICRO'06.

17 Architectural Level Modifications 17 + Eager Mellow Write Requests + Eager Mellow Queue Lowest Priority, No Write Drains, Just Slow Writes

18 Effectiveness of Eager Mellow Writes 18 No Performance Degradation, even some performance benefit Geomean 158% lifetime improvement compared with All-Norm. 6 out of 11 applications meet the 8-year lifetime requirements. 5 applications still suffer from short lifetime!

19 Schemes Mellow Writes – Bank-Aware Mellow Writes – Eager Mellow Writes Wear Quota 19

20 Partition the time into Time slices. Wear Quota (per bank): the average available wear of each time slice. 20 Expected Lifetime Total Amount of Available Wear of Resistive Main Memory Wear Quota Time Slice

21 Wear Quota 21 Wear Quota Time Slice 1 Wear Time Slice 1 Wear Quota Time Slice 2 Wear Quota Time Slice 3 Wear Quota Time Slice 4 Wear Time Slice 2 Wear Time Slice 3 Wear Time Slice 4 Within Wear Quota Exceeding Wear Quota Within Wear Quota Time Slice 1: Mellow Writes Policy Time Slice 2: Mellow Writes Policy Time Slice 3: All-Slow Writes Policy Time Slice 4: Mellow Writes Policy Within Wear Quota

22 Effectiveness of Wear Quota All 11 applications meet the 8-year lifetime requirements. 22 Does not degrade the performance if the lifetime requirement is already met. Degrades the performance only when necessary!

23 How About Energy? 23 Operation Level A 3x Slow Write consumes 66% more Energy Compared with a normal write. Total Memory Consumption of the Execution On Average Less than 50% more memory energy compared with All-Norm Policy An Affordable Cost Compared with the Lifetime Benefit.

24 Sensitivity to Analytic Model In a typical ReRAM technology, compared with default speed writes, slow writes are predicted to achieve a quadratic endurance benefit. Based on a wider range of device parameters, the endurance benefit could be linear to cubic. What will happen if we have a different endurance benefit? Even with a pessimistic linear endurance benefit, we can still achieve 47% lifetime improvement. 24 Citation: D. B. Strukov, “Endurance-write-speed tradeoffs in nonvolatile memories,” Applied Physics A, vol. 122, no. 4, pp. 1–4, 2016.

25 Conclusion – A new dynamic trade-off between write latency and endurance. – Two Mellow Writes schemes which improve the lifetime without sacrificing the performance. – Wear Quota scheme which guarantees a minimal lifetime with relatively small performance loss. – Low hardware overhead and easy to implement.

26 Thanks! Lunkai Zhang The University of Chicago lunkai@uchicago.edu 26


Download ppt "Mellow Writes: Extending Lifetime in Resistive Memories through Selective Slow Write Backs Lunkai Zhang, Diana Franklin, Frederic T. Chong 1 Brian Neely,"

Similar presentations


Ads by Google