Reliability Analysis of An Energy-Aware RAID System Shu Yin Xiao Qin Auburn University.

Slides:



Advertisements
Similar presentations
Demand Response Forecasting, Measurement & Verification Methods/Challenges/Considerations National Town Meeting on Demand Response Washington, DC - June.
Advertisements

Conserving Disk Energy in Network Servers ACM 17th annual international conference on Supercomputing Presented by Hsu Hao Chen.
HyLog: A High Performance Approach to Managing Disk Layout Wenguang Wang Yanping Zhao Rick Bunt Department of Computer Science University of Saskatchewan.
1 Conserving Energy in RAID Systems with Conventional Disks Dong Li, Jun Wang Dept. of Computer Science & Engineering University of Nebraska-Lincoln Peter.
Chapter 3 Presented by: Anupam Mittal.  Data protection: Concept of RAID and its Components Data Protection: RAID - 2.
Disk Scrubbing in Large Archival Storage Systems Thomas Schwarz, S.J. 1,2 Qin Xin 1,3, Ethan Miller 1, Darrell Long 1, Andy Hospodor 1,2, Spencer Ng 3.
ANALYZING STORAGE SYSTEM WORKLOADS Paul G. Sikalinda, Pieter S. Kritzinger {psikalin, DNA Research Group Computer Science Department.
1 Virtual Machine Resource Monitoring and Networking of Virtual Machines Ananth I. Sundararaj Department of Computer Science Northwestern University July.
6/5/ TRAP-Array: A Disk Array Architecture Providing Timely Recovery to Any Point-in-time Authors: Qing Yang,Weijun Xiao,Jin Ren University of Rhode.
End-to-End Analysis of Distributed Video-on-Demand Systems Padmavathi Mundur, Robert Simon, and Arun K. Sood IEEE Transactions on Multimedia, February.
Failure Trends in a Large Disk Drive Population Authors: Eduardo Pinheiro, Wolf- Dietrich Weber and Luiz Andr´e Barroso Presented by Vinuthna & Arjun.
Energy Efficient Prefetching – from models to Implementation 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software Engineering.
Energy Efficient Prefetching with Buffer Disks for Cluster File Systems 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software.
1 Validation and Verification of Simulation Models.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
1 Efficient Management of Data Center Resources for Massively Multiplayer Online Games V. Nae, A. Iosup, S. Podlipnig, R. Prodan, D. Epema, T. Fahringer,
Gordon: Using Flash Memory to Build Fast, Power-efficient Clusters for Data-intensive Applications A. Caulfield, L. Grupp, S. Swanson, UCSD, ASPLOS’09.
By : Nabeel Ahmed Superior University Grw Campus.
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing Kai Hwang, Hai Jin, and Roy Ho.
Cloud Data Center/Storage Power Efficiency Solutions Junyao Zhang 1.
RAID: High-Performance, Reliable Secondary Storage Mei Qing & Chaoxia Liao Nov. 20, 2003.
PARAID: The Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, An-I Andy Wang – Florida State University RuGang Xu, Peter Reiher – University.
PARAID: The Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, An-I Andy Wang – Florida State University Peter Reiher – University of California,
RAID REDUNDANT ARRAY OF INEXPENSIVE DISKS. Why RAID?
Exploiting Flash for Energy Efficient Disk Arrays Shimin Chen (Intel Labs) Panos K. Chrysanthis (University of Pittsburgh) Alexandros Labrinidis (University.
TRACK-ALIGNED EXTENTS: MATCHING ACCESS PATTERNS TO DISK DRIVE CHARACTERISTICS J. Schindler J.-L.Griffin C. R. Lumb G. R. Ganger Carnegie Mellon University.
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
1 Selecting LAN server (Week 3, Monday 9/8/2003) © Abdou Illia, Fall 2003.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Pavel Slavík, Marek Gayer, Frantisek Hrdlicka, Ondrej Kubelka Czech Technical University in Prague Czech Republic 2003 Winter Simulation Conference December.
1 PARAID: A Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, Jin Qian, An-I Andy Wang – Florida St. University Peter Reiher – University of.
PARAID: A Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, Jin Qian, An-I Andy Wang – Florida St. University Peter Reiher – University of.
Price Performance Metrics CS3353. CPU Price Performance Ratio Given – Average of 6 clock cycles per instruction – Clock rating for the cpu – Number of.
NAS 2011 Liang Kai,Xiaofang Zhang, Xiao Zhang Northwestern Polytechnical University Research on Energy Consumption of General Network Storage.
Example: Rumor Performance Evaluation Andy Wang CIS 5930 Computer Systems Performance Analysis.
Chapter 10 Verification and Validation of Simulation Models
Conquest-2: Improving Energy Efficiency and Performance Through a Disk/RAM Hybrid File System An-I Andy Wang Florida State University (NSF CCR ,
NUS.SOC.CS5248 A Time Series-based Approach for Power Management in Mobile Processors and Disks X. Liu, P. Shenoy and W. Gong Presented by Dai Lu.
Improving Disk Throughput in Data-Intensive Servers Enrique V. Carrera and Ricardo Bianchini Department of Computer Science Rutgers University.
DiskSim – Storage System Simulator Michigan-CMU
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
Near Real-Time Verification At The Forecast Systems Laboratory: An Operational Perspective Michael P. Kay (CIRES/FSL/NOAA) Jennifer L. Mahoney (FSL/NOAA)
Automatic Statistical Evaluation of Resources for Condor Daniel Nurmi, John Brevik, Rich Wolski University of California, Santa Barbara.
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
Thomas Schwarz, S.J. Qin Xin, Ethan Miller, Darrell Long, Andy Hospodor, Spencer Ng Summarized by Leonid Kibrik.
1 PARAID: A Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, Jin Qian, An-I Andy Wang – Florida St. University Peter Reiher – University of.
Overview and Comparison of Software Tools for Power Management in Data Centers Msc. Enida Sheme Acad. Neki Frasheri Polytechnic University of Tirana Albania.
Virtualization Supplemental Material beyond the textbook.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
CHARACTERIZING CLOUD COMPUTING HARDWARE RELIABILITY Authors: Kashi Venkatesh Vishwanath ; Nachiappan Nagappan Presented By: Vibhuti Dhiman.
Seminar on RAID TECHNOLOGY Redundant Array of Independent Disk By CHANDAN.R 8 TH ISE, 1ap05is013 Under the guidance of Mr.Mithun.B.N, Lecturer,Dept.ISE.
Distributed Process Discovery From Large Event Logs Sergio Hernández de Mesa {
COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques Dr. Xiao Qin Auburn University
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.
Fast Data Analysis with Integrated Statistical Metadata in Scientific Datasets By Yong Chen (with Jialin Liu) Data-Intensive Scalable Computing Laboratory.
1 PARAID: A Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, Jin Qian, An-I Andy Wang – Florida St. University Peter Reiher – University of.
Red, Green, Black & Blue: Western Digital Hard Drives Explained | ☎ (+1) | ☎ (+1)
Motivation Energy costs are rising –An increasing concern for servers –No longer limited to laptops Energy consumption of disk drives –24% of the power.
PARAID: A Gear-Shifting Power-Aware RAID
Exploring the Backblaze Hard Drive Data Big, Missing, Problematic Data
Bernd Panzer-Steindel, CERN/IT
Scaling for the Future Katherine Yelick U.C. Berkeley, EECS
Chapter 10 Verification and Validation of Simulation Models
PARAID: A Gear-Shifting Power-Aware RAID
Design Unit 26 Design a small or home office network
Qingbo Zhu, Asim Shankar and Yuanyuan Zhou
Energy-Efficient Storage Systems
CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
Implementation of a small-scale desktop grid computing infrastructure in a commercial domain    
Presentation transcript:

Reliability Analysis of An Energy-Aware RAID System Shu Yin Xiao Qin Auburn University

Presentation Outline Motivation; Related Work; MREED Model; Experimental Result; Conclusion/Future Work. 2

Mobile Multimedia Data-Intensive Applications 3 Motivation Bio- Informatics 3D GraphicWeather Forecast

Cluster System 4 Cluster in Data Center

Problem: Energy Dissipation EPA Report to Congress on Server and Data Center Energy Efficiency,

Problem: Energy Dissipation (cont.) Using 2010 Historical Trends Scenario – Server and Data Centers Consume 120 Billion kWh per year; –Assume average commercial end user is charged 9.46 kWh; –Disk systems can account for 27% of the computing energy cost of data centers. 6

Software- directed Power Management Dynamic Power Management Redundancy Technique Multi- speed Setting Existing Energy Conservation Techniques 7

Contradictory of Energy Efficiency and Reliability 8 Example: Disk spin up and down

MREED Model 9 R= R BaseValue [1] *τ+α*R(f) [2] [1] E. Pinheiro, W.-D. Weber, and L.A. Barroso. Failure trends in a large disk drive population. Proc. USENIX Conf. File and Storage Tech., February2007. [2] IDEMA Standards. Specification of hard disk drive reliability. R(f)=1.51e -6 f 2 – 1.09e -5 f e -2 Baseline Failure Rate Derived from Disk Utilization Temperature Factor Coefficient to R BaseValue, α=1 in our research

MREED Model (Temperature Factor τ [3] ) 10 Temperature (˚C) Acceleration Factor De-rating Factor Adjusted MTBF [3] G. Cole, “Estimating Drive Reliability in Desktop Computers and Consumer Electronics Systems” Seagate Personal Storage Group, 2000

MREED Model ( M ATHEMATICAL RELIABILITY MODELS FOR E NERGY- E FFICIENT R AID SYSTEMS) 11

MREED Model ( M ATHEMATICAL RELIABILITY MODELS FOR E NERGY- E FFICIENT R AID SYSTEMS) 12 Energy-Conservation RAID Technique Energy-Conservation RAID Technique Weibull Distribution Analysis Weibull Distribution Analysis Access Pattern Frequency Frequency Temperature Annual Failure Rate System Reliability System Level Reliability System Level Reliability

Weibull Analysis 13 A Leading Method for Fitting Life Date Advantages: Accurate Small Samples Widely Used

MREED Model ( Energy Conservation Techniques- PARAID ) Power-Aware RAID (PARAID) [4] System Structure [4] Charles Weddle, Mathew Oldhan, Jin Qian, An-I Andy Wang. PARAID- A Gear-Shifting Power-Aware RAID. USENIX FAST Soft state RAID Gears

Model Validation 15 Techniques Run the Systems for A Couple of Decades The Event Validity Validation Techniques [5] [5] R.G. Sargent, “Verification and Validation of Simulation Models”, in Proceedings of the 37 th conference on Winter Simulation, ser. WSC’05 Winter Simulation Conference, 2005.

Model Validation 16 Challenges Unable to Monitor PARAID Running for Years Sample Size is Small from A Validation Perspective (e.g. 100 Disks for Five Years)

Model Validation (DiskSim [6] Simulation) 17 [6] S.W.S John, S. Bucy, Jiri Schindler and G.R. Ganger, “The DiskSim Simulation Environment Version 4.0 Reference Manual”, 2008 Input Trace (File Level) File to Block Mapper Simulate File (Block Access) DiskSim (Block Level) File to Block Level Converter Outline

Model Validation (DiskSim Simulation) 18 Diagram of the Storage System Corresponding to the DiskSim RAID-0 Driver 0 Bus 0 CTLR 2 BUS 2 Driver 2 CTLR 3 BUS 3 Driver 3 CTLR 4 BUS 4 Driver 4 CTLR 1 BUS 1 Driver 1 CTLR 0 BUS 0 Driver 0

Model Validation (Result) 19 Utilization Comparison Between MREED and DiskSim Simulator

Model Validation (Result) 20 Gear Shifting Comparison Between MREED and DiskSim Simulator

Reliability Evaluation (Experimental Setup) 21 Disk TypeSeagate ST FC Capacity146 GB Cache SizeSata 16MB Buffer to Host Transfer Rate4Gb/s (Max) Total Number of Disks5 File Size100 MB Number of Files1000 Synthetic TracePoisson Distribution Time Period24 Hours Interval Time (Time Phase)1 Hour Power On Hour Per Year8760 Hours

Reliability Evaluation (Disk Utilization Comparison) 22 Disks Utilization Comparison Between PARAID-0 and RAID-0 at A Low Access Rate (20 Times Per Hour)

23 Disks Utilization Comparison Between PARAID-0 and RAID-0 at A High Access Rate (80 Times Per Hour) Reliability Evaluation (Disk Utilization Comparison)

24 AFR Comparison Between PARAID-0 and RAID-0 at A Low Access Rate (20 Times Per Hour) Reliability Evaluation (AFR Comparison)

25 AFR Comparison Between PARAID-0 and RAID-0 at A High Access Rate (80 Per Hour) Reliability Evaluation (AFR Comparison) AFR

Future Work Extend the MREED Model Power-Aware RAID-5; –Data Stripping Investigate Trade-off Between Reliability & Energy- Efficiency ; Evaluate and Compare an array of energy-saving techniques with respect to specific application domains; 26

Conclusion A Reliability Model (MREED) for Power-Ware RAID; Weibull Distribution Analysis to MREED; Validation of MREED; Impacts of the Gear-shifting on Reliability of PARAID. 27

Questions