1 Chapter 6: Storage. 2 Magnetic media  Magnetic disks: are suitable for dynamic data that requires frequent changes provide good access time and high.

Slides:



Advertisements
Similar presentations
Redundant Array of Independent Disks (RAID) Striping of data across multiple media for expansion, performance and reliability.
Advertisements

I/O Management and Disk Scheduling Chapter 11. I/O Driver OS module which controls an I/O device hides the device specifics from the above layers in the.
Tertiary Storage Devices
Faculty of Information Technology Department of Computer Science Computer Organization Chapter 7 External Memory Mohammad Sharaf.
RAID (Redundant Arrays of Independent Disks). Disk organization technique that manages a large number of disks, providing a view of a single disk of High.
Secondary Storage Rohit Khokher
CSCE430/830 Computer Architecture
Lesson #8 Optical Storage Media
Faculty of Information Technology Department of Computer Science Computer Organization and Assembly Language Chapter 6 External Memory.
R.A.I.D. Copyright © 2005 by James Hug Redundant Array of Independent (or Inexpensive) Disks.
CSE521: Introduction to Computer Architecture Mazin Yousif I/O Subsystem RAID (Redundant Array of Independent Disks)
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
Computer ArchitectureFall 2007 © November 28, 2007 Karem A. Sakallah Lecture 24 Disk IO and RAID CS : Computer Architecture.
Read “engineering” here as “Engineering & Computer Science”
Other Disk Details. 2 Disk Formatting After manufacturing disk has no information –Is stack of platters coated with magnetizable metal oxide Before use,
1 Chapter 6 Storage and Multimedia: The Facts and More.
Computer Organization and Architecture External Memory.
1 Storing Digital Audio. 2 Storage  There are many different types of storage medium and encoding methods for the storage of digital audio  CD  DVD.
CS 333 Introduction to Operating Systems Class 16 – Secondary Storage Management Jonathan Walpole Computer Science Portland State University.
CD-ROM. The CD Family GStandards äRed Book (1982) covers CD & CD-DA äYellow Book (1985) covers CD-ROM Ûextended to cover CD-ROM/XA äGreen Book (1988)
Backing Storage Chapter 18.
RAID Systems CS Introduction to Operating Systems.
Storage Device Computer Component : Storage Device (External Memory, Secondary Memory, Secondary Storage) Storage Types Magnetic Types Optical Types.
Computer SCIENCE Data Representation and Machine Concepts Section 1.3
AS Level ICT Selection and use of storage requirements, media, and devices: Devices and media.
Presented by: Manjeet Singh. Secondary Storage Devices Magnetic media Tape Disks Optical Media Compact Discs CD-R, WORM (Write Once, Read Many) CD-RW.
Information Technology Storage Devices Prof. Adnan Khalid.
Transactions and Reliability. File system components Disk management Naming Reliability  What are the reliability issues in file systems? Security.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez-Martínez Electrical and Computer Engineering Department Lecture 6 – RAID ©Manuel Rodriguez.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Storage Systems CSE 598d, Spring 2007 Lecture 5: Redundant Arrays of Inexpensive Disks Feb 8, 2007.
CS 352 : Computer Organization and Design University of Wisconsin-Eau Claire Dan Ernst Storage Systems.
External Memory.
L/O/G/O External Memory Chapter 3 (C) CS.216 Computer Architecture and Organization.
SECONDARY STORAGE Secondary storage devices are used to save, to back up, and to transport files Over the past several years, data storage capacity has.
1 Chapter 3 Understanding Computers, 11 th Edition Storage Medium The physical material on which a computer keeps data, instructions and information. Can.
A Secondary Storage: CD – ROM Dr. Robert J. Hammell Assistant Professor Towson University Computer and Information Sciences Department 8000 York Road -
RAID COP 5611 Advanced Operating Systems Adapted from Andy Wang’s slides at FSU.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
IT- 101 Introduction to Information Technology Lecture #9.
 Around billion CDs have been sold all over the world  CD’s revolutionized the way that data could be stored and moved  DVDs replaced VHS  CDs.
4.Basic Storage 4.Basic Storage. 2 Basic storage units have problems dealing with large multimedia data  Single Hard Drives - SCSI/IDE Drives. So called.
COEN 180 Optical Storage. Store data based on the optical properties of a device. Strong, established market for removable media. Small market for archival.
Raymond Washington Secondary Storage Chapter 8. Traditional Floppy Disk The Traditional Floppy Disk is the 1.44 MB 3 1/2 - inch disk. They came out about.
Disks Chapter 5 Thursday, April 5, Today’s Schedule Input/Output – Disks (Chapter 5.4)  Magnetic vs. Optical Disks  RAID levels and functions.
1 CP586 © Peter Lo 2003 Multimedia Communication Multimedia Storage.
CDs and DVDs  Are types of optical storage media that consists of a flat, round, portable disc made of metal plastic and lacquer.  Store software, data,
Overview of Physical Storage Media
8 SECONDARY STORAGE CHAPTER
1 Multimedia Storage Chapter 8. 2 Magnetic Media Magnetic disks are: n Suitable for dynamic data that requires frequent changes. n Good access time and.
Media. Media Compact Disk A Compact Disc (CD) is an optical disc used to store digital data, originally developed for storing digital audio. The CD, introduced.
File Structures. 2 Chapter - Objectives Disk Storage Devices Files of Records Operations on Files Unordered Files Ordered Files Hashed Files Dynamic and.
The concept of RAID in Databases By Junaid Ali Siddiqui.
Compact Disc (CD) Coding –
Chapter4: Memory External Memory (2).
Storage devices 1. Storage Storage device : stores data and programs permanently its retained after the power is turned off. The most common type of storage.
1 Lecture 27: Disks Today’s topics:  Disk basics  RAID  Research topics.
Disks. ●Circular-shaped storage medium ●Two main types: Magnetic and Optical ●Random access to memory ●The hardware must be controlled by driver software.
STORAGE DEVICES Storage devices are categorized by the method they use to store files.
What do you mean by external storage devices? External storage devices are the devices that temporarily store information for transporting from computer.
Engr: Sajida Introduction to computing Optical storage The storage devices which use laser to read data from or write data to the reflective surface store.
CS Introduction to Operating Systems
RAID Non-Redundant (RAID Level 0) has the lowest cost of any RAID
Vladimir Stojanovic & Nicholas Weaver
Overview Continuation from Monday (File system implementation)
TECHNICAL SEMINAR PRESENTATION
RAID Redundant Array of Inexpensive (Independent) Disks
UNIT IV RAID.
Computers: Tools for an Information Age
Chapter 5 External Memory
Presentation transcript:

1 Chapter 6: Storage

2 Magnetic media  Magnetic disks: are suitable for dynamic data that requires frequent changes provide good access time and high transfer rate are used for data that must be kept online during data capturing and processing. Data may be archived onto other magnetic or optical storage afterwards. are suitable for video-on-demand applications where large amounts of time dependent information must be transferred at high bit rate

3 RAID  Redundant Arrays of Inexpensive Disks. (developed at UC Berkeley in 1987)  Redefined to “...Independent...” (referring to their failure modes)  uses parallelism between multiple disks to improve aggregate I/O performance

4 RAID  Data is distributed across several physical disks.  As an alternative to single large expensive disk (SLED) in traditional mainframe systems.  Several levels of RAID, seeking to optimize among performance (data) availability cost

5 RAID  Advantages: high data transfer rate for large data accesses (multiple disks can be used to serve the same request at the same time) short queueing time on small data accesses (multiple requests can be served at the same time) uniform load balancing across all of the disks  Disadvantage: Large disk arrays are highly vulnerable to disk failures. E.g., single disk MTTF = 200,000 hrs  100-disk disk array MTTF = 2000 hrs, or 3 months.  need to add redundancy for better availability  write overhead! mean time to failure

6 Data striping a file

7 Data striping  distribute data transparently over multiple disks to make them appear as a single, fast, large disk.  multiple I/Os can be served in parallel parallel independent requests  shorter queueing time parallel accesses of a single request  higher transfer rate

8 Redundancy  Redundancy: many disks  low reliability redundancy is needed to tolerate disk failures

9 Redundancy  Two important factors in RAID design: granularity of data interleaving: fine: distribute the data so that all of the array’s disks cooperate in servicing every request  high I/O transfer rate, but disk requests will suffer from a longer queueing time and thus a higher access delay coarse: small requests  access few disks, small queueing time; large requests  all disks, good transfer rate data and parity distribution How redundant information is computed? Where shall the redundant information reside?

10 RAID: Level 0 (non-redundant)  data striping only  no data protection redundancy  no need to write redundant information  best write performance (among all RAID levels)  any disk failure  data loss  used in supercomputing environments where performance and capacity, rather than reliability, are the primary concerns

11 RAID: Level 1 (mirrored)  use twice as many disks as level 0  data is duplicated, called mirroring, or shadowing  read is faster, but write is slightly slower (why?)  If a disk fails, its mirror copy can still serve.  used in database application where availability and transaction rate are more important than storage efficiency redundant data

12 RAID 1  Q: Compare the following 3 RAID-1 configurations Simple Shadowing Declustering Chained Declustering 3a3a3b3b3c3c2a2a 1c1c 2b2b 0a0a 2c2c 0b0b 1a1a 0c0c 1b1b 0123

13 RAID: Level 2 (error correction)  uses Hamming code  For n disks, about log 2 (n) of them store redundant data. (More space efficient than mirroring.)  If a disk fails, multiple redundant disks need to be read to identify the bad one. However, only one redundant disk needs be read to recover the lost data.  RAID-2 assumes disk logic can’t tell which disk is returning faulty readings. No practical use.

14 RAID: Level 4 (block-interleaved parity)  Current disk controller logic can generally detect and identify disk failures.  Note that a single parity disk is enough to recover data lost due to single disk failure.  Performance: small read  access one data disk; large read  access many data disks; small write  4 I/O (read the data disk and the parity disk, compute the difference between the old and new images, update the data disk, update the parity disk; Q: what about large writes?

15 Parity disk a file parity disk 1:odd # of 1’s 0:even # of 1’s

16 RAID: Level 4 (block-interleaved parity)  2 update schemes: Read-modify-write Regenerate-write.

17 Read-modify-write a file parity disk change to 1111

18 Read-modify-write a file parity disk change to 1111 read

19 Read-modify-write a file parity disk change to 1111 modify 11 10

20 Read-modify-write a file parity disk change to 1111 write 11 10

21 Regenerate write a file parity disk change to

22 Regenerate write a file parity disk change to write

23 Enhancing RAID-4 B1B1 D1D1 A1A1 C1C1 B2B2 D2D2 A2A2 C2C2 B3B3 D3D3 A3A3 C3C3 Bp Dp Ap Cp B1B1 Dp A1A1 C1C1 B2B2 D1D1 A2A2 Cp Bp D2D2 A3A3 C2C2 B3B3 D3D3 Ap C3C3 Parity Disk Striped Parity

24 Distributing parity  Parity disk simplify the mapping of logical addresses to disk addresses. every write must update the associated bits on the single parity disk.  Striped parity can perform parallel parity update

25 RAID: Level 5 (block-interleaved distributed parity)  eliminates the parity disk bottleneck by distributing the parity uniformly over all of the disks.  improves read performance by allowing all disks be used to serve read requests.

26 RAID: Level 6 (P+Q redundancy)  uses 2 redundant disks to protect up to two disk failures  computes 2 different parities instead of 1  similar read performance as with Level 5, but write is slightly worse

27 Optical media  well accepted because: high storage capacity random access to data life span of more than 30 years (c.f. << 20 years for magnetic media) portable  Optical videodisk was invented by Friebus in Prototype using laser to record and read was demonstrated by Phillips and MCA in 1972.

28 Optical media  Videodisks developed by Philips has been commercially available since  Then compact disk technology for digital audio (CD-DA) came out in early 1980s. The use of optical disks for digital data storage came with the introduction and improvement of CD-ROM during the 1980s.

Optical disk technology  Optical storage media use the intensity of reflected laser light as an information source.

30  An optical disk consists of 3 layers: Protective layer (a very thin layer on the label side) Reflective layer (aluminum coating) Substrate layer (transparent) Optical disk technology  In the factory, depressions are cut on the disk surface, forming “lands” and “pits” (0.12  m different in heights).

31 Optical disk technology  Laser reflected from the lands and pits give different readings.  Do you know: that data are read from the disk “inside-out?” that the spiral track is about 3.5 miles long? that you should never write on the label side? that a CD should be cleaned “radially”?

32 Advantages of optical media  High density (for a portable device). Distance between tracks is 1.6  m, each track is 0.6  m wide, i.e., 1 bit per sq.  m or 1Mb per sq.mm. A floppy disk has 96 tracks per inch while an optical disk has tracks per inch.  Long life. Magnetization can decrease over time. ‘Lands’ and ‘pits’ not changed unless physically damaged. (However, mal-manufacturing/mis- handling may lead to “laser rot”.)

33 Advantages of optical media  Low wearing. Laser source in head can be positioned at 1mm from disk surface. Disk head does not have to be as close to the surface as with magnetic disks. This reduces friction and increases life span.

34 Digital optical disks  Audio CD was developed by Philips and Sony in  Basic technology extended to 650 MB CD-ROM in  At single speed, data rate is 150KBps.  CD-ROM/XA announced in 1986 to support applications of text, images, audio and FSFM video.  Recent developments include CD-R, CD-RW, MO (magneto optical), and DVD bytes 1024 bytes Full-screen full-motion

35 CD-DA (Compact Disk Digital Audio)  1982 by Philips and Sony  12cm diameter, 1.2 mm thick optical disk, stores/plays in CLV (constant linear velocity). Spiral tracks of about 20,000 windings in total.  Data are recorded such that pit-to-land and land-to- pit transitions are coding ‘1’s. ‘0’s are coded as no transition.  Redundancy added to break up consecutive ‘1’s and ‘0’s.

36 CD-DA  Data rate: 44.1KHz sampling, 16-bit quantization, KBps  Capacity: 747MB, 74 min high-quality sound  Capability of random access to tracks and index points.

37 8 to 14 modulation (EFM)  Pits and lands may not follow too closely one after another on a CD-DA. Rule 1: between any 2 ‘1’s, there are at least 2 ‘0’s.  For synchronization, pit or land sequences are not allowed to be too long. Rule 2: at most 10 ‘0’s can follow one after another.  Solution: Map every 8-bit pattern into a 14-bit pattern that satisfies the 2 rules. Among the 2 14 patterns, 267 of them are valid.  Also, between consecutive 14-bit sequences, 3 merging bits are added to enforce the rules.

38 8 to 14 modulation (Example)

39 Low level data encoding  Thus, an eight-bit byte of actual data is encoded into a total of 17 channel bits.  For synchronization and error correction, every 24 bytes of data is packaged into a frame: sync pattern ( bits) control byte (17 bits) 12 data bytes (12 * 17 bits) 4 error correction bytes (4 * 17 bits) 12 data bytes (12 * 17 bits) 4 error correction bytes (4 * 17 bits) Total: 588 channel bits for 192 actual data bits. numbers in brackets are channel bits

40 First level error correction  Recall that each frame contains 24 data bytes and 8 error correction bytes.  The first 4 correction bytes cover the frame’s data. The other 4 correction bytes cover data over 7 frames.  When a frame is read, the first 4 correction bytes are checked. If not ok, the decoder decodes the data bytes after subsequent correction codes are read.  7 frames = 7.7 mm track length. Try cover your CD with a small piece of paper and see if it still works.

41 Interleaved coding  An audio CD records samples from an audio wave. Successive sample values are closely related.  A scratch on a CD may wipe out a continuous segment of data.  With interleaving, the samples are dispersed: data on a CD

42 Interleaved coding  Suppose a burst error occurs destroying 4 samples  Interpolation to restore (approximately) the sample values:

43 CD-ROM (Compact Disk Read Only)  1985 by Philips and Sony.  Tracks are divided into audio and data types. Disks containing both types are called Mixed Mode Disks.  It operates in 2 modes: Mode 1: for computer data Mode 2: for media data

44 CD-ROM (Compact Disk Read Only)  Mode 1 Mode 1 achieves a better (lower) error rate by using a second level of error correction. Random access to sub-track units called blocks (2352 bytes). (For CD-DA, random access is on track level only.) Each block has the address of min:sec:block. This is specified in the Header field.

45 CD-ROM  Mode 1 for computer data. A capacity of 333,000 blocks to be played in 74 min, i.e., 650MB storage with a data rate of 150KBps. Sync 12 User Data 2048 Blanks 8 ECC 276 Header 4 EDC bytes Note: each 24 of the 2352 bytes are represented physically on the disc as a frame (i.e., 588 channel bits). Each block is thus represented by 98 frames.

46 CD-ROM  Mode 2 Mode 2 holds data of any media. Additional error correction not crucial, so not used. Disk has capacity of ~740MB and a data rate of 171KBps. Sync 12 User Data 2336 Header bytes

47 CD-ROM  CD-ROM is a very economical medium for publication and distribution.  Limitations of CD-ROM: Random access to a CD track can be anywhere from 200ms up to 1 sec in access time. Optical disk heads are heavier than magnetic heads. More inertia takes a longer seek time for head movements. Continuous media stored sequentially in CD-ROM tracks. Although important for multimedia applications, simultaneous playback of audio and other data is not possible.

48 CD-ROM/XA (Extended Architecture)  1989, established by Microsoft, Philips and Sony.  Goal: concurrent output of several media. Within 1 track, blocks of different media can be stored. It allows interleaved storage and retrieval of multimedia data.  A sub-header is added to each block to describe the block.  CD-ROM/XA uses CD-ROM mode 2 to define actual blocks. Two forms:

49 CD-ROM/XA Form 1 provides more error detection/correction at the expense of redundancy bytes (of 2352) are for user data. Form 2 allows 13% more storage for user data, but at the expense of error correction. Sync 12 User Data 2324 Subheader 8 EDC 4 Header bytes Sync 12 User Data 2048 Subheader 8 ECC 276 Header 4 EDC bytes

50 CD-R (Compact Disk Recordable)  CD-R allows tracks to be recorded once.  4 layers: protective, reflective, dye, and substrate. Polycarbonate Lacquer Aluminum Polycarbonate Lacquer Dye Gold Traditional CD-ROM “Molded” by a stamper CD-R Media Burned by high power laser beam Don’t leave out in sunlight 24K

51 CD-R  Land and pit reflections realized by irreversible thermal effect (above 250C) on the dye.  Playable on CD players.

52 CD-R  Recording sessions A CD has 3 areas: lead-in, actual data, lead-out. Lead-in includes the table of contents: directory, indices to individual tracks. Data area includes all tracks where actual data is stored. Lead-out marks the end of the data area. Information Lead-out Lead-in Lead-out Session 1Session 2...

53 CD-R  Recording sessions Multiple sessions of lead-in, data, lead-out can be written separately over time. During 1 write activity, all data for a session are written with their table of contents, after which the session can be read by a CDROM drive. Information Lead-out Lead-in Lead-out Session 1Session 2...

54 CD-MO (Compact Disk Magneto Optical)  Specification published by Philips and Sony in  Working principle is different from other CD technologies (incompatible with other CD formats.) Based on the polarization of light by magnetic field. Disk surface is light reflecting magnetic substrate. During writing, surface is heated to above 150C, and a magnetic field polarizes individual dipoles. During reading, surface is irradiated with a laser beam, polarization of laser light changed according to the magnetization.

55 CD-RW  An alloy is used in CD-RW that can take on two states: crystalline: reflects light well amorphous: doesn’t reflect light well  To change the alloy into the crystalline state or the amorphous state, two laser beams of different power are used: “Write power”: amorphous state “Erase power”: crystalline state “Read power”: no state change, used to pick up the readings from the disk

56 Sony MiniDisk  A small MO disk with a data capacity of about 1/5 of a CD.  An MD can record about the same length of music as a CD since the audio is compressed.  The compression technique used is called ATRAC (Adaptive Transform Acoustic Coding). Typical compression ratio is 5:1.  Like MP3, ATRAC uses psychoacoustics.

57 Digital Versatile Disk (DVD)  also called Digital Video Disk.  capacity: 4.7 to 17 billion bytes (25 CDs)  digital video can be stored and distributed cheaper than tape; also it allows interactivity  can be used to store up to 133 minutes (8-9 hrs for high capacity DVDs) of studio quality video and multi-channel surround-sound audio, or 30 hours of CD-quality audio

58 DVD  DVD achieves a greater capacity by reducing the minimum pit length from micron (CD) to 0.4 micron (DVD) reducing the inter-track space from 1.6 micron (CD) to 0.74 micron (DVD)

59 DVD  To read the condensed pits, DVD uses a laser of a shorter wavelength ( nm; for CD it is 780 nm).  Reducing the pit size and track distance increases the disc’s capacity to 4.7GB.  Dual layering. A semi-reflective layer on top of a fully reflective layer  8.5GB total.  Double side. Two substrates bonded back-to-back. Each side could have one layer or two layers  capacity ranges from 9.4GB to 17GB.

60 DVD 1.2mm 0.6mm single-sided, single layer, 4.7GB single-sided, dual layer, 8.5GB double-sided, dual layer, 17GB

61 DVD  Other factors that improve the storage capacity 8-16 modulation instead of merge bits slightly larger “usable” surface area more efficient error coding

62 DVD  Other features Error correction is about 10 times better than that of CD. Some DVDs are recorded using opposite track path (i.e., one spiral layer starts from the center, followed by another spiral layer that starts from the rim towards the center). OTP reduces the seek time when the player switch layer during video playback.