RAID.

Slides:



Advertisements
Similar presentations
Storage Management Lecture 7.
Advertisements

What is RAID Redundant Array of Independent Disks.
1 Lecture 18: RAID n I/O bottleneck n JBOD and SLED n striping and mirroring n classic RAID levels: 1 – 5 n additional RAID levels: 6, 0+1, 10 n RAID usage.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID A RRAYS Redundant Array of Inexpensive Discs.
Triple-Parity RAID and Beyond Hai Lu. RAID RAID, an acronym for redundant array of independent disks or also known as redundant array of inexpensive disks,
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
RAID Redundant Array of Independent Disks
CSCE430/830 Computer Architecture
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
I/O Systems and Storage Systems May 22, 2000 Instructor: Gary Kimura.
CSE 451: Operating Systems Winter 2010 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
CSE 321b Computer Organization (2) تنظيم الحاسب (2) 3 rd year, Computer Engineering Winter 2015 Lecture #4 Dr. Hazem Ibrahim Shehata Dept. of Computer.
Two or more disks Capacity is the same as the total capacity of the drives in the array No fault tolerance-risk of data loss is proportional to the number.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
Disk Access. DISK STRUCTURE Sector: Smallest unit of data transfer from/to disk; 512B 2/4/8 adjacent sectors transferred together: Blocks Read/write heads.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
CSI-09 COMMUNICATION TECHNOLOGY FAULT TOLERANCE AUTHOR: V.V. SUBRAHMANYAM.
Basic File Structures and Hashing Lectured by, Jesmin Akhter, Assistant professor, IIT, JU.
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
Copyright © Curt Hill, RAID What every server wants!
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
The concept of RAID in Databases By Junaid Ali Siddiqui.
RAID Disk Arrays Hank Levy. 212/5/2015 Basic Problems Disks are improving, but much less fast than CPUs We can use multiple disks for improving performance.
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
Chapter 14: Mass-Storage Systems Disk Structure. Disk Scheduling. RAID.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
Magnetic Disks Have cylinders, sectors platters, tracks, heads virtual and real disk blocks (x cylinders, y heads, z sectors per track) Relatively slow,
CSE 451: Operating Systems Spring 2010 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center 534.
CS Introduction to Operating Systems
Storage HDD, SSD and RAID.
CMPE Database Systems Workshop June 16 Class Meeting
A Case for Redundant Arrays of Inexpensive Disks (RAID) -1988
A Case for Redundant Arrays of Inexpensive Disks (RAID)
Multiple Platters.
What every server wants!
Vladimir Stojanovic & Nicholas Weaver
CSE 451: Operating Systems Spring 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) John Zahorjan Allen Center.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani
ICOM 6005 – Database Management Systems Design
RAID Disk Arrays Hank Levy 1.
CSE 451: Operating Systems Spring 2005 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Autumn 2010 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Winter 2009 Module 13 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
TECHNICAL SEMINAR PRESENTATION
RAID Redundant Array of Inexpensive (Independent) Disks
UNIT IV RAID.
Disk Storage, Basic File Structures, and Hashing.
Mark Zbikowski and Gary Kimura
CSE 451: Operating Systems Autumn 2004 Redundant Arrays of Inexpensive Disks (RAID) Hank Levy 1.
1/17/2019.
CSE 451: Operating Systems Winter 2012 Redundant Arrays of Inexpensive Disks (RAID) and OS structure Mark Zbikowski Gary Kimura 1.
CSE 451: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
CSE 451: Operating Systems Autumn 2009 Module 19 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
RAID Disk Arrays Hank Levy 1.
RAID RAID Mukesh N Tekwani April 23, 2019
CSE 451: Operating Systems Winter 2004 Module 17 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
IT 344: Operating Systems Winter 2007 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Chia-Chi Teng CTB
Storage Management Lecture 7.
CSE 451: Operating Systems Winter 2006 Module 18 Redundant Arrays of Inexpensive Disks (RAID) Ed Lazowska Allen Center 570.
Seminar on Enterprise Software
Presentation transcript:

RAID

Not all exponential growth is created equal RAM capacities have quadrupled every 2-3 years. CPU performance has doubled every 2 years. Disk capacity has improved by 50% a year. Disk transfer rates have improved by 20% a year. Disk access times have improved by 10% a year. Although Solid State Drives (SSDs), have improved the performance of storage. Still, storage hasn't been keeping up with the improvement in the rest of computer technology.

What electronic technology has lagged even worst than disk performance? 1. Screen Resolution 2. Battery Capacity 3. Network Transfer Rates 4. Programmer Attention Spans

Redundant Arrays of Inexpensive Disks RAID is a combination of software and hardware meant to improve disk performance. Sometimes, the letter I in RAID means "independent" now-a-days. The main goal of RAID is to even out the widely different rates of performance improvement of disks against those in memory and microprocessors. http://videogamewriters.com/sunday-sidebar-mmo-fatigue-is-it-affecting-you-37055

Data Striping Data striping means breaking up contiguous data that would normally go on a single disk. The data is distributed to many disks, either by byte (a) or by block (b), see left. With each read or write, all the disks participate. Hence the data access can be parallelized across disks. To the OS (and DBMS), the storage looks like a single, huge, superfast disk.

Mirroring / Shadowing To make your life confusing, the concept of using replicates (copies of the data) are called mirroring or shadowing in the RAID literature. But they mean exactly the same thing. Why have disk replicates? The mean-time-to-failure (approximate lifespan) of a hard drive is about 20 years. But if you have 1000 drives in your datacenter, the mean- time-to-failure for at least one drive failure is 60 days. You probably don't want to lose important data every 60 days. So you have redundancy / replicates / mirrors / shadows / copies, so that you can recover in the event of failure. http://memory-beta.wikia.com/wiki/Mirror,_Mirror

Using an external hard drive for full disk backup (e. g Using an external hard drive for full disk backup (e.g. Apple's Time Machine) is which of the following? 1. Replication 2. Mirroring 3. Shadowing 4. Fanboying

Detecting Failure and Recovering In order to detect failure (especially failure smaller than an entire disk exploding), many systems employ error-correcting codes involving parity bits or specialized codes, like Hamming codes. The implementation of such systems is beyond the scope of this class. In the event of a failure, the damaged system is often physically replaced, and the new system is filled with the data that was "lost" by the other replicates of the data. http://www.slideshare.net/saikrishnatanguturu/computer-networks-34379994

Load Balancing Where do you put your replicates? Simplest option is to replicate the entire disk onto another entire disk. However, spreading out the replicates across many disks is better for load balancing (if many requests are for the same data). https://bowtiesandbooze.wordpress.com/2011/09/05/fatty-mcchicken-patty/

Why would distributing a replicate across multiple disks improve performance? 1. It doesn't 2. If the data is popular, many disks could serve up different parts. 3. It is safer to have a replicate broken into pieces 4. Don't do it, splinching might result.

RAID Levels Different raid organizations were defined based on different combinations of the two factors of granularity of data interleaving (striping) and pattern used to compute redundant information. Raid level 0 has no redundant data and hence has the best write performance at the risk of data loss, but it does use striping to improve read/write performance. Raid level 1 uses mirrored disks. Raid level 2 uses memory-style redundancy by using Hamming codes, which contain parity bits for distinct overlapping subsets of components. Level 2 includes both error detection and correction. Raid level 3 uses byte-level data striping and a single parity disk relying on the disk controller to figure out which disk has failed. Raid levels 4 and 5 use block-level data striping, with level 5 distributing data and parity information across all disks. Raid level 6 applies the so-called P + Q redundancy scheme using Reed-Soloman codes to protect against up to two disk failures by using just two redundant disks.

RAID Level Comparison Different raid organizations are being used under different situations Raid level 1 (mirrored disks) is the easiest for rebuild of a disk from other disks It is used for critical applications like logs Raid level 2 uses memory-style redundancy by using Hamming codes, which contain parity bits for distinct overlapping subsets of components. Level 2 includes both error detection and correction. Raid level 3 (single parity disks relying on the disk controller to figure out which disk has failed) and level 5 (block-level data striping) are preferred for Large volume storage, with level 3 giving higher transfer rates. Most popular uses of the RAID technology currently are: Level 0 (with striping), Level 1 (with mirroring) and Level 5 with an extra drive for parity. Design Decisions for RAID include: Level of RAID, number of disks, choice of parity schemes, and grouping of disks for block-level striping.

Popular RAID Levels The most common RAID levels in use are: RAID Level 1 (Complete Mirroring Of Disks) shown in (a). RAID Level 5 (Striping of data across disks with distributed parity) shown in (b). It is also common to nest and combine RAID levels.

Weakness of RAID Non-independent disk failures - If disks are more likely to fail with use, or from wear-and- tear, or from factors that affect other disks, then many failures may occur at the same time, overwhelming the RAID system. Disk Capacity Increasing Faster Than Disk Transfer Rates - When a disk fails, it can take many hours (even days) to copy the data to a replacement disk. During this time, the RAID system's performance is reduced (and depending on the amount of redundancy, its ability to recover from other errors). https://xkcd.com/612/

What RAID Level corresponds to Complete Replication across sites? 1. RAID 0 (striping) 2. RAID 1 (mirroring) 3. RAID 2-6 (striping and mirroring) 4.

What RAID Level corresponds to Partitioning across sites? 1. RAID 0 (striping) 2. RAID 1 (mirroring) 3. RAID 2-6 (striping and mirroring) 4.

What RAID Level corresponds to Partial Replication across sites? 1. RAID 0 (striping) 2. RAID 1 (mirroring) 3. RAID 2-6 (striping and mirroring) 4.