COMP381 by M. Hamdi 1 RAID (Redundant Array of Inexpensive Disks) & Storage Systems.

Slides:



Advertisements
Similar presentations
Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
Advertisements

Data Storage Solutions Module 1.2. Data Storage Solutions Upon completion of this module, you will be able to: List the common storage media and solutions.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
NAS vs. SAN 10/2010 Palestinian Land Authority IT Department By Nahreen Ameen 1.
Raid dr. Patrick De Causmaecker What is RAID Redundant Array of Independent (Inexpensive) Disks A set of disk stations treated as one.
CSCE430/830 Computer Architecture
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Disks and RAID.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
RAID- Redundant Array of Inexpensive Drives. Purpose Provide faster data access and larger storage Provide data redundancy.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
2P13 Week 11. A+ Guide to Managing and Maintaining your PC, 6e2 RAID Controllers Redundant Array of Independent (or Inexpensive) Disks Level 0 -- Striped.
CSE521: Introduction to Computer Architecture Mazin Yousif I/O Subsystem RAID (Redundant Array of Independent Disks)
Lecture 36: Chapter 6 Today’s topic –RAID 1. RAID Redundant Array of Inexpensive (Independent) Disks –Use multiple smaller disks (c.f. one large disk)
CSCE 212 Chapter 8 Storage, Networks, and Other Peripherals Instructor: Jason D. Bakos.
RAID Technology. Use Arrays of Small Disks? 14” 10”5.25”3.5” Disk Array: 1 disk design Conventional: 4 disk designs Low End High End Katz and Patterson.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
1 Recap (RAID and Storage Architectures). 2 RAID To increase the availability and the performance (bandwidth) of a storage system, instead of a single.
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
1 Storage (cont’d) Disk scheduling Reducing seek time (cont’d) Reducing rotational latency RAIDs.
1 Lecture 26: Storage Systems Topics: Storage Systems (Chapter 6), other innovations Final exam stats:  Highest: 95  Mean: 70, Median: 73  Toughest.
High Performance Computing Course Notes High Performance Storage.
Computer ArchitectureFall 2008 © November 12, 2007 Nael Abu-Ghazaleh Lecture 24 Disk IO.
S.1 Review: Major Components of a Computer Processor Control Datapath Memory Devices Input Output Cache Main Memory Secondary Memory (Disk)
Storage Area Network (SAN)
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Data Storage Willis Kim 14 May Types of storages Direct Attached Storage – storage hardware that connects to a single server Direct Attached Storage.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
By : Nabeel Ahmed Superior University Grw Campus.
File Systems and N/W attached storage (NAS) | VTU NOTES | QUESTION PAPERS | NEWS | VTU RESULTS | FORUM | BOOKSPAR ANDROID APP.
Redundant Array of Inexpensive Disks (RAID). Redundant Arrays of Disks Files are "striped" across multiple spindles Redundancy yields high data availability.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Chapter 6 RAID. Chapter 6 — Storage and Other I/O Topics — 2 RAID Redundant Array of Inexpensive (Independent) Disks Use multiple smaller disks (c.f.
RAID Shuli Han COSC 573 Presentation.
Storage & Peripherals Disks, Networks, and Other Devices.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
Redundant Array of Independent Disks
RAID: High-Performance, Reliable Secondary Storage Mei Qing & Chaoxia Liao Nov. 20, 2003.
N-Tier Client/Server Architectures Chapter 4 Server - RAID Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept RAID – Redundant Array.
I/O – Chapter 8 Introduction Disk Storage and Dependability – 8.2 Buses and other connectors – 8.4 I/O performance measures – 8.6.
1 Chapter 7: Storage Systems Introduction Magnetic disks Buses RAID: Redundant Arrays of Inexpensive Disks.
RAID COP 5611 Advanced Operating Systems Adapted from Andy Wang’s slides at FSU.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
Page 110/12/2015 CSE 30341: Operating Systems Principles Network-Attached Storage  Network-attached storage (NAS) is storage made available over a network.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
RAID SECTION (2.3.5) ASHLEY BAILEY SEYEDFARAZ YASROBI GOKUL SHANKAR.
Redundant Array of Independent Disks.  Many systems today need to store many terabytes of data.  Don’t want to use single, large disk  too expensive.
Using NAS as a Gateway to SAN Dave Rosenberg Hewlett-Packard Company th Street SW Loveland, CO 80537
RAID Systems Ver.2.0 Jan 09, 2005 Syam. RAID Primer Redundant Array of Inexpensive Disks random, real-time, redundant, array, assembly, interconnected,
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Storage Networking. Storage Trends Storage grows %/year, gets more complicated It’s necessary to pool storage for flexibility Intelligent storage.
Introduction to RAID Rogério Perino de Oliveira Neves Patrick De Causmaecker
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
RAID Technology By: Adarsha A,S 1BY08A03. Overview What is RAID Technology? What is RAID Technology? History of RAID History of RAID Techniques/Methods.
RAID TECHNOLOGY RASHMI ACHARYA CSE(A) RG NO
Network-Attached Storage. Network-attached storage devices Attached to a local area network, generally an Ethernet-based network environment.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 10: Mass-Storage Systems.
I/O Errors 1 Computer Organization II © McQuain RAID Redundant Array of Inexpensive (Independent) Disks – Use multiple smaller disks (c.f.
Video Security Design Workshop:
Disks and RAID.
Vladimir Stojanovic & Nicholas Weaver
Introduction to Networks
Storage Virtualization
RAID RAID Mukesh N Tekwani
TECHNICAL SEMINAR PRESENTATION
UNIT IV RAID.
RAID RAID Mukesh N Tekwani April 23, 2019
Improving performance
Presentation transcript:

COMP381 by M. Hamdi 1 RAID (Redundant Array of Inexpensive Disks) & Storage Systems

COMP381 by M. Hamdi 2 Disk Capacity Growth

COMP381 by M. Hamdi 3 Disk Latency & Bandwidth Improvements Disk latency is one average seek time plus the rotational latency Disk bandwidth is the peak transfer rate of formatted data In the time that the disk bandwidth doubles the latency improves by a factor of only 1.2 to 1.4

COMP381 by M. Hamdi 4 Media Bandwidth/Latency Demands Bandwidth requirements –High quality video Digital data = (30 frames/s) × (640 x 480 pixels) × (24-b color/pixel) = 221 Mb/s ( MB/s) –High quality audio Digital data = (44,100 audio samples/s) × (16-b audio samples) × (2 audio channels for stereo) = 1.4 Mb/s (0.175 MB/s) Latency issues –How sensitive is your eye (ear) to variations in video (audio) rates? –How can you ensure a constant rate of delivery? –How important is synchronizing the audio and video streams? 15 to 20 ms early to 30 to 40 ms late is tolerable

COMP381 by M. Hamdi 5 Storage Pressures Storage growth estimates: % per year –Growth of e-business, e-commerce, and  now common for organizations to manage hundreds of TB of data –Mission critical data must be continuously available –Regulations require long-term archiving –More storage-intensive applications on market #1 pain pointsStorage and Security are the #1 pain points for the IT community (shared the #1 spot) Managing storage growth effectively is a challenge

COMP381 by M. Hamdi 6 - 1,000,000 2,000,000 3,000,000 4,000,000 5,000,000 6,000,000 7,000, (in Terabytes) From 1999 to 2001 Storage Shipped grew at 78% CAGR From 2002 to 2006 Storage shipped grew at 83% CAGR Data Growth Trends

COMP381 by M. Hamdi 7 Storage Cost Storage cost as proportion of total IT spending as compared to server cost

COMP381 by M. Hamdi 8 Storage Management Cost Costs of managing storage can be 10X the cost of storage (Graph below: for every dollar spent how much you spend for management and maintenance) 90¢ 70¢ 50¢ 30¢ 10¢ DAS NAS SAN

COMP381 by M. Hamdi 9 Increasing Data Volume and Value Management GAP Increasing Storage Management Cost Decreasing Storage Technology Cost $3.00 Equipment $7.00 Management Storage Customers’ Issues Availability/Reliability and Performance are EXTREMLY important

COMP381 by M. Hamdi 10 Importance of Storage Reliability

COMP381 by M. Hamdi 11 RAID To increase the availability and the performance (bandwidth) of a storage system, instead of a single disk, a set of disks (disk arrays) can be used. Similar to memory interleaving, data can be spread among multiple disks (striping), allowing simultaneous access to the data and thus improving the throughput. However, the reliability of the system drops (n devices have 1/n the reliability of a single device).

COMP381 by M. Hamdi 12 Array Reliability Reliability of N disks = Reliability of 1 Disk ÷N 50,000 Hours ÷ 70 disks = 700 hours Disk system Mean Time To Failure (MTTF): Drops from 6 years to 1 month! Arrays without redundancy too unreliable to be useful!

COMP381 by M. Hamdi 13 RAID A disk array’s availability can be improved by adding redundant disks: –If a single disk in the array fails, the lost information can be reconstructed from redundant information. These systems have become known as RAID - Redundant Array of Inexpensive Disks. –Depending on the number of redundant disks and the redundancy scheme used, RAIDs are classified into levels. –6 levels of RAID (0-5) are accepted by the industry. –Level 2 and 4 are not commercially available, they are included for clarity

COMP381 by M. Hamdi 14 RAID-0 Strip 12 Strip 8 Strip 4 Strip 0 Strip 13 Strip 9 Strip 5 Strip 1 Strip 14 Strip 10 Strip 6 Strip 2 Strip 15 Strip 11 Strip 7 Strip 3 Striped, non-redundant –Parallel access to multiple disks  Excellent data transfer rate  Excellent I/O request processing rate (for large strips) if the controller supports independent Reads/Writes  Not fault tolerant (AID) Typically used for applications requiring high performance for non-critical data (e.g., video streaming and editing)

COMP381 by M. Hamdi 15 RAID 1 - Mirroring Called mirroring or shadowing, uses an extra disk for each disk in the array (most costly form of redundancy) Whenever data is written to one disk, that data is also written to a redundant disk: good for reads, fair for writes If a disk fails, the system just goes to the mirror and gets the desired data. Fast, but very expensive. Typically used in system drives and critical files –Banking, insurance data –Web (e-commerce) servers Strip 3 Strip 2 Strip 1 Strip 0 Strip 3 Strip 2 Strip 1 Strip 0

COMP381 by M. Hamdi 16 RAID 2: Memory-Style ECC f 0 (b) b2b2 b1b1 b0b0 b3b3 f 1 (b) P(b) Data Disks Multiple ECC Disks and a Parity Disk Multiple disks record the (error correcting code) ECC information to determine which disk is in fault A parity disk is then used to reconstruct corrupted or lost data Needs log 2 (number of disks) redundancy disks Least used since ECC is irrelevant because most new Hard drives support built-in error correction

COMP381 by M. Hamdi 17 RAID 3 - Bit-interleaved Parity Use 1 extra disk for each array of n disks. Reads or writes go to all disks in the array, with the extra disk to hold the parity information in case there is a failure. The parity is carried out at bit level: –A parity bit is kept for each bit position across the disk array and stored in the redundant disk. –Parity: sum modulo 2. parity of 1010 is 0 parity of 1110 is 1 Or use XOR of bits Logical record Striped physical records P Physical record

COMP381 by M. Hamdi 18 RAID 3 - Bit-interleaved Parity If one of the disks fails, the data for the failed disk must be recovered from the parity information: –This is achieved by subtracting the parity of good data from the original parity information: –Recovering from failures takes longer than in mirroring, but failures are rare, so is okay –Examples:

COMP381 by M. Hamdi 19 RAID 4 - Block-interleaved Parity In RAID 3, every read or write needs to go to all disks since bits are interleaved among the disks. Performance of RAID 3: –Only one request can be serviced at a time – Poor I/O request rate –Excellent data transfer rate –Typically used in large I/O request size applications, such as imaging or CAD RAID 4: If we distribute the information block-interleaved, where a disk sector is a block, then for normal reads different reads can access different segments in parallel. Only if a disk fails we will need to access all the disks to recover the data.

COMP381 by M. Hamdi 20 RAID 4: Block Interleaved Parity block 0 block 4 block 8 block 12 block 1 block 5 block 9 block 13 block 2 block 6 block 10 block 14 block 3 block 7 block 11 block 15 P(0-3) P(4-7) P(8-11) P(12-15) Allow for parallel access by multiple I/O requests Doing multiple small reads is now faster than before. A write, however, is a different story since we need to update the parity information for the block. Large writes (full stripe), update the parity: P’ = d0’ + d1’ + d2’ + d3’; Small writes (eg. write on d0), update the parity: P = d0 + d1 + d2 + d3 P’ = d0’ + d1 + d2 + d3 = P + d0’ + d0; However, writes are still very slow since parity disk is the bottleneck.

COMP381 by M. Hamdi 21 RAID 4: Small Writes

COMP381 by M. Hamdi 22 RAID 5 - Block-interleaved Distributed Parity To address the write deficiency of RAID 4, RAID 5 distributes the parity blocks among all the disks.

COMP381 by M. Hamdi 23 RAID 5 - Block-interleaved Distributed Parity This allows some writes to proceed in parallel –For example, writes to blocks 8 and 5 can occur simultaneously.

COMP381 by M. Hamdi 24 RAID 5 - Block-interleaved Distributed Parity However, writes to blocks 8 and 11 cannot proceed in parallel. Performance of RAID 5 I/O request rate: excellent for reads, good for writes Data transfer rate: good for reads, good for writes Typically used for high request rate, read-intensive data lookup

COMP381 by M. Hamdi 25 Performance of RAID 5 - Block- interleaved Distributed Parity Performance of RAID 5 –I/O request rate: excellent for reads, good for writes –Data transfer rate: good for reads, good for writes –Typically used for high request rate, read-intensive data lookup –File and Application servers, Database servers, WWW, , and News servers, Intranet servers The most versatile and widely used RAID.

COMP381 by M. Hamdi 26 Storage Area Networks (SAN)

COMP381 by M. Hamdi 27 Which Storage Architecture? DAS - Directly-Attached Storage NAS - Network Attached Storage SAN - Storage Area Network

COMP381 by M. Hamdi 28 Storage Architectures (Direct Attached Storage (DAS)) Unix NetWare NT/W2K NetWare Server Storage NetWare Server Storage NT Server Storage NT Server Storage Virtual Drive 3 Unix Server Storage Unix Server Storage

COMP381 by M. Hamdi 29 DAS CPUs Bus Memory SCSI Adaptor SCSI Disk Drive NIC SCSI protocol MS Windows Traditional Server Block I/O

COMP381 by M. Hamdi 30 Disk C ERP Server Disk B Exchange Server Disk A File Server Storage Architectures (Direct Attached Storage (DAS))

COMP381 by M. Hamdi 31 The Problem with DAS Direct Attached Storage (DAS) Data is bound to the server hosting the disk Expanding the storage may mean purchasing and managing another server In heterogeneous environments, management is complicated Windows NT/2KLinux/UnixNetware

COMP381 by M. Hamdi 32 Advantages Low cost Simple to use Easy to install Disadvantages No shared resources Difficult to backup Limited distance Limited, high-availability options Complex maintenance Solution for small organizations only Storage Architectures (Direct Attached Storage (DAS))

COMP381 by M. Hamdi 33 Storage Architectures (Network Attached Storage (NAS)) Hosts IP Network NAS Controller Disk subsystem Shared Information

COMP381 by M. Hamdi 34 NAS Network Attached Storage What is it? NAS devices contain embedded processors that run specialized OS or micro kernel that understands networking protocols and is optimized for particular tasks, such as file service. NAS devices usually deploy some level of RAID storage.

COMP381 by M. Hamdi 35 NAS CPUs Bus Memory NIC MS Windows “Diskless” App Server (or rather a “Less Disk” server) IP network File protocol (CIFS, NFS) CPUs Bus Memory NIC Optimised OS NAS appliance SCSI Adaptor SCSI Disk Drive SCSI Adaptor SCSI Disk Drive SCSI protocol Block I/O

COMP381 by M. Hamdi 36 The NAS Network IP network App Server NAS Appliance NAS - truly an appliance

COMP381 by M. Hamdi 37 More on NAS NAS Devices can easily and quickly attach to a LAN NAS is platform and OS independent and appears to applications as another server NAS Devices provide storage that can be addressed via standard file system (e.g., NFS, CIFS) protocols

COMP381 by M. Hamdi 38 Advantages Easy to install Easy to maintain Shared information Unix, Windows file sharing Remote access Disadvantages Not suitable for databases Storage islands Not-very-scalable solution NAS controller is a bottle neck Vendor-dependable Suitable for file based application Storage Architectures (Network Attached Storage (NAS))

COMP381 by M. Hamdi 39 Some NAS Problems Network Attached Storage (NAS) Each appliance represents a larger island of storage Data is bound to the NAS device hosting the disk and cannot be accessed if the system hosting the drive fails Storage is labor-intensive and thus expensive Network is bottleneck Windows NT/2KLinux/UnixNetware NAS

COMP381 by M. Hamdi 40 Some Benefits of NAS Files are easily shared among users at high demand and performance Files are easily accessible by the same user from different locations Demand for local storage at the desktop is reduced Storage can be added more economically and partitioned among users— reasonably scalable Data can be backed up form the common repository more efficiently than from desktops Multiple file servers can be consolidated into a single managed storage pool

COMP381 by M. Hamdi 41 Storage Architectures (Storage Area Networks (SAN)) Storage Network Hosts IP Network Clients Shared Storage

COMP381 by M. Hamdi 42 SAN Storage Area Network what is it? In short, SAN is essentially just another type of network, consisting of storage components (instead of computers), one or more interfaces, and interface extension technologies. The storage units communicate in much the same form and function as computers communicate on a LAN.

COMP381 by M. Hamdi 43 Advantages of SANs Superior Performance Reduces Network bottlenecks Highly Scalable Allows backup of storage devices with minimal impact on production operations Flexibility in configuration

COMP381 by M. Hamdi 44 Additional Benefits of SANs Storage Area Network (SAN) Server Consolidation Storage Consolidation Storage Flexibility and Management LAN Free backup and archive Modern data protection (change from traditional tape backup to snap-shot, archive, geographically separate mirrored storage) Windows NT/2K Linux/Unix Netware SAN Switch

COMP381 by M. Hamdi 45 Additional Benefits of SANs Disks appear to be directly attached to each host Provides potential of direct attached performance over Fibre Channel distances (Uses block level I/O) Provides flexibility of multiple host access –Storage can be partitioned, with each partition dedicated to a particular host computer –Storage can be shared among a heterogeneous set of host computers Economies of scale can reduce management costs by allowing administration of a centralized pool of storage and allocating storage to projects on an as-needed basis SAN can be implemented within a single computer room environment, across a campus network, or across a wide area network