Tape is Dead Disk is Tape Flash is Disk RAM Locality is King Jim Gray Microsoft December 2006 Presented at CIDR2007 Gong Show

Slides:



Advertisements
Similar presentations
1 The 5 Minute Rule Jim Gray Microsoft Research Kilo10 3 Mega10 6 Giga10 9 Tera10 12 today,
Advertisements

Storing Data: Disk Organization and I/O
 Storage Capacity:3.75 MB  Physical Size:87.9 ft 3  Price for storage:15,000$/MB  Seek Time:600 ms  Rotational Speed:1200 RPM.
Agenda Types of Storage Media Semiconductor ROM and RAM Magnetic Tapes, Floppy Disks, and Hard Disks Optical CS C446 Data Storage Technologies & Networks.
Avishai Wool lecture Introduction to Systems Programming Lecture 8.3 Non-volatile Memory Flash.
Computer ArchitectureFall 2008 © November 12, 2007 Nael Abu-Ghazaleh Lecture 24 Disk IO.
CS4432: Database Systems II Lecture 2 Timothy Sutherland.
Overview: Memory Memory Organization: General Issues (Hardware) –Objectives in Memory Design –Memory Types –Memory Hierarchies Memory Management (Software.
Data Storage Technology
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Solid State Drive Feb 15. NAND Flash Memory Main storage component of Solid State Drive (SSD) USB Drive, cell phone, touch pad…
1 CS : Technology Trends Ion Stoica ( September 12, 2011.
The Impacts of SSD in Your Storage System Author: David Dale, SNIA (NetApp)
Nic Shulver, Fundamentals of Computer Networks Memory and Disc Drive hardware Working Memory and Permanent Storage.
“Five minute rule ten years later and other computer storage rules of thumb” Authors: Jim Gray, Goetz Graefe Reviewed by: Nagapramod Mandagere Biplob Debnath.
Operating Systems CMPSC 473 I/O Management (2) December Lecture 24 Instructor: Bhuvan Urgaonkar.
Storage.
Physical Memory By Gregory Marshall. MEMORY HIERARCHY.
Lecture 11: DMBS Internals
Flashing Up the Storage Layer I. Koltsidas, S. D. Viglas (U of Edinburgh), VLDB 2008 Shimin Chen Big Data Reading Group.
Introduction to Database Systems 1 Storing Data: Disks and Files Chapter 3 “Yea, from the table of my memory I’ll wipe away all trivial fond records.”
IT 344: Operating Systems Winter 2010 Module 13 Secondary Storage Chia-Chi Teng CTB 265.
Data Storage What is storage? Storage refers to the media and methods used to keep information available for later use. Some things will be needed right.
Logging in Flash-based Database Systems Lu Zeping
Persistent Storage (disk?) Requirements (For The Low End ==the bottom 99%of the market ) Jim Gray Microsoft Research.
DBI313. MetricOLTPDWLog Read/Write mixMostly reads, smaller # of rows at a time Scan intensive, large portions of data at a time, bulk loading Mostly.
Inside your computer. Hardware Review Motherboard Processor / CPU Bus Bios chip Memory Hard drive Video Card Sound Card Monitor/printer Ports.
Inside your computer. Hardware Motherboard Processor / CPU Bus Bios chip Memory Hard drive Video Card Sound Card Monitor/printer Ports.
A Detailed Discussion of SRAM Niels Asmussen Maggie Hamill William Hunt.
3 Computing System Fundamentals
OCR GCSE Computing © Hodder Education 2013 Slide 1 OCR GCSE Computing Chapter 2: Memory.
Computer Guts and Operating Systems CSCI 101 Week Two.
Computers Today Information taken from: Microsoft Office 97 Professional by Lawrence Press Microsoft Office 2003 by Glencoe Dell Dimension B110 2 iMac.
Introduction to Computer Architecture. What is binary? We use the decimal (base 10) number system Binary is the base 2 number system Ten different numbers.
Programming for GCSE Topic 5.1: Memory and Storage T eaching L ondon C omputing William Marsh School of Electronic Engineering and Computer Science Queen.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
1 CS : Technology Trends Ion Stoica and Ali Ghodsi ( August 31, 2015.
CS4432: Database Systems II Data Storage 1. Storage in DBMSs DBMSs manage large amounts of data How does a DBMS store and manage large amounts of data?
CSE 451: Operating Systems Winter 2015 Module 14 Secondary Storage Mark Zbikowski Allen Center 476 © 2013 Gribble, Lazowska, Levy,
By : Reem Hasayen. A storage device is a hardware device capable of storing information. There are two types of storage devices used in computers 1. Primary.
Computer Organization. The Five Hardware Units General purpose computers use the "Von Neumann" architecture Also referred to as "stored program" architecture.
CS 101 – Sept. 28 Main vs. secondary memory Examples of secondary storage –Disk (direct access) Various types Disk geometry –Flash memory (random access)
Storage Systems CSE 598d, Spring 2007 Lecture ?: Rules of thumb in data engineering Paper by Jim Gray and Prashant Shenoy Feb 15, 2007.
Memory The term memory is referred to computer’s main memory, or RAM (Random Access Memory). RAM is the location where data and programs are stored (temporarily),
DMBS Internals I February 24 th, What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
DMBS Architecture May 15 th, Generic Architecture Query compiler/optimizer Execution engine Index/record mgr. Buffer manager Storage manager storage.
What is it and why do we need it? Chris Ward CS147 10/16/2008.
Jérôme Jaussaud, Senior Product Manager
CS 140 Lecture Notes: Technology and Operating Systems Slide 1 Technology Changes Mid-1980’s2012Change CPU speed15 MHz2.5 GHz167x Memory size8 MB4 GB500x.
1 Lecture 16: Data Storage Wednesday, November 6, 2006.
1 Paolo Bianco Storage Architect Sun Microsystems An overview on Hybrid Storage Technologies.
TYPES OF MEMORY.
Join Processing for Flash SSDs: Remembering Past Lessons
Lecture 16: Data Storage Wednesday, November 6, 2006.
Database Management Systems (CS 564)
Join Processing for Flash SSDs: Remembering Past Lessons
CS : Technology Trends August 31, 2015 Ion Stoica and Ali Ghodsi (
Lecture 11: DMBS Internals
Repairing Write Performance on Flash Devices
Lecture 9: Data Storage and IO Models
CS 101 – Sept. 25 Continue Chapter 5
CS 140 Lecture Notes: Technology and Operating Systems
CS 140 Lecture Notes: Technology and Operating Systems
CSE 451: Operating Systems Autumn 2009 Module 13 Secondary Storage
Tape is Dead Disk is Tape Flash is Disk RAM Locality is King
Motherboard External Hard disk USB 1 DVD Drive RAM CPU (Main Memory)
CS 295: Modern Systems Storage Technologies Introduction
Tape is Dead Disk is Tape Flash is Disk RAM Locality is King
Dong Hyun Kang, Changwoo Min, Young Ik Eom
Presentation transcript:

Tape is Dead Disk is Tape Flash is Disk RAM Locality is King Jim Gray Microsoft December 2006 Presented at CIDR2007 Gong Show

Tape Is Dead Disk is Tape 1TB disks are available 10+ TB disks are predicted in 5 years Unit disk cost: ~$400 → ~$80 But: ~ hours to read (sequential) ~ days to read (random) Need to treat most of disk as Cold-storage archive

FLASH Storage? Mb NAND flash chips Gb NAND flash Doubled each year since 1995 Market driven by Phones, Cameras, iPod,… Low entry-cost, ~$30/chip → ~$3/chip Tb NAND flash == 128 GB chip == 1TB or 2TB “disk” for ~$400 or 128GB disk for $40 or 32GB disk for $5 Samsung prediction

FLASH Some Parameters Chip read ~ 20 MB/s write ~ 10 MB/s N chips have N x bandwidth Latency ~ 25 μs to start read, ~ 100 μs to read a “2K page” ~ 2,000 μs to erase ~ 200 μs to write a “2K page” Power ~ 1W for 8 chips and controller 5,000 IO/s per chip!

What’s Wrong With FLASH? Expensive: $/GB –50x more than disk today –Ratio may drop to 10x in 2012 Limited lifetime –~100k to 1M writes / page –requires “wear leveling” but, if you have 1B pages, then 15,000 years to “use” ½ the pages. Slow to write you can only write 0’s, so erase (set all 1) then write.

Obvious Uses For Flash PDAs, cameras, iPod, …. Laptop disks –power, rugged, quiet, big enough, … Not so obvious use: –ARCHIVE for photo/music/.. because it’s simple to understand. –Enterprise drives (lots of IO/s per $ per watt per liter )

One Could Make a Flash Disk (or a Flash File System) 6K random reads/sec, 3K random writes/sec The IO capacity of disks Uses 1 W vs 500W… Less space, … See “A Design for High-Performance Flash Disks” Birrell, Isard, Thacker, Wobber MSR-TR A Design for High-Performance Flash Disks replace with 1 10TB disk and 3 FLASH disks

We Are Not There Yet Current FLASH disks could do much better on writes (100x better (!)) Algorithms are known but… This changes many ratios Access time is 20x less (~200us) IOps is 100x more Re-evaluate page sizes MSR-TR FlashDB: Dynamic Self-tuning Database for NAND Flash, Suman Nath, Aman Kansal FlashDB: Dynamic Self-tuning Database for NAND Flash,

RAM Locality is King The cpu mostly waits for RAM Flash / Disk are 100,000 …1,000,000 clocks away from cpu RAM is ~100 clocks away unless you have locality (cache). If you want 1CPI (clock per instruction) you have to have the data in cache (program cache is “easy” ) This requires cache conscious data-structures and algorithms sequential (or predictable) access patterns Main Memory DB is going to be common. 10 years 100:1