File Processing : Storage Media


File Processing : Storage Media 2017, Spring Pusan National University Ki-Joune Li

Major Functions of a Computer
  Computation
  Storage
  Communication
  Presentation

Storage of Data: Major Challenges
  How to store and manage a large amount of data
    Example: more than 100 petabytes for the EOS Project
  How to represent sophisticated data

Modeling and Representation of the Real World
  Example: building a DB about Korean history
    Very complicated, and dependent on viewpoint
  Database Course: 2017 Fall semester
  [Figure: Real World, Computer World]

Managing Large Volume of Data
  Cost of storage media
    Not very important; negligible
  Processing time
    Time is the most valuable resource
  Comparison of main memory and disk access times
    RAM (Random Access Memory): several 10^-9 sec
    SSD (Solid State Drive): under 10^-4 sec
    HDD (Hard Disk Drive): several 10^-3 sec
    => HDD is about 10^6 times slower than RAM
  Difference between handling data in RAM and HDD
    Handling data in HDD: same way that we handle data in RAM
    How to handle this gap between RAM and disk memory

Managing Large Volume of Data: Management of Data
  Secure management
    From hacking
    From any kind of disaster
  Consistency of data
    Example: failure during a flight reservation transaction
    Concurrent transactions

Goals of File Systems
  To provide
    1. Efficient data structures for storing large and complex data
    2. Access methods for rapid search
    3. Query processing methods
    4. Robust management of transactions

Memory Hierarchy
  Large data volumes
    Cannot be stored in main memory, but in secondary memory
  Levels of the hierarchy (faster toward the top, cheaper toward the bottom)
    Cache memory: 8 Mbytes (Core i7, L3 cache)
    Main memory: 16 Gbytes
    Secondary memory: 1 Tbytes
    Tertiary memory: 10 Petabytes

SSD (Flash Memory)
  Solid State Drive
    Only electronic operations, unlike HDD
  Characteristics
    Aging problem: only a limited number of write/erase cycles (e.g. 1 M)
    Asymmetric read/write speed
      Read: fast; a byte (or word) can be read at a time
      Write: a little slower than reading; memory has to be erased an entire bank at a time
      Erase: slower
  NAND vs. NOR flash memory

Optical Storage
  Non-volatile
    CD, DVD
  Speed
    Slower than HDD
  Juke-box systems
    Large numbers of removable disks, few drives, and a mechanism for automatic loading/unloading of disks
    For storing large volumes of data

Tape
  Non-volatile and large volume (e.g. 15 TB per cartridge)
  Primarily used for backup
  Sequential access: much slower than disk
  But data transfer rate: up to 750 MB/s for some tape drives

Data Access with Secondary Memory
  Access request
    If in main memory: get data from main memory
    If not in main memory: access the disk, load the data into main memory, then get data
  Hit ratio: rh = nh / na (requests found in main memory over all access requests)
  How to increase the hit ratio?
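The hit ratio rh = nh / na can be measured by counting how many access requests are served from main memory. The following is only a minimal sketch: the one-block buffer, the names `struct buffer`, `buffer_init`, `access_byte`, and `hit_ratio`, and the 1024-byte block size are all illustrative assumptions, not part of the slides.

```c
#include <assert.h>
#include <stddef.h>

#define BLOCK_SIZE 1024  /* assumed block size in bytes */

/* Hypothetical single-block buffer with hit/access counters. */
struct buffer {
    long cached_block;   /* block number currently in memory, -1 if none */
    size_t n_access;     /* na: total access requests */
    size_t n_hit;        /* nh: requests served from main memory */
};

static void buffer_init(struct buffer *b) {
    b->cached_block = -1;
    b->n_access = 0;
    b->n_hit = 0;
}

/* Record an access to a byte offset; returns 1 on a hit, 0 on a miss. */
static int access_byte(struct buffer *b, long offset) {
    assert(offset >= 0);
    long block = offset / BLOCK_SIZE;
    b->n_access++;
    if (block == b->cached_block) {
        b->n_hit++;
        return 1;
    }
    b->cached_block = block;  /* miss: the block would be loaded from disk here */
    return 0;
}

/* rh = nh / na; defined as 0 when nothing has been accessed yet. */
static double hit_ratio(const struct buffer *b) {
    return b->n_access ? (double)b->n_hit / (double)b->n_access : 0.0;
}
```

Accessing offsets 0, 512, 1024, and 1536 in this sketch touches two blocks twice each, so two of the four requests are hits and the hit ratio is 0.5.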

Why is the Hit Ratio so Important?
  Example
    for(int i=0;i<1000;i++) Nbytes=read(fd,buf,100);
  1000 disk accesses?
    When rh = 0: 1000 * 10^-2 sec = 10 sec
    When rh = 1: 1000 * 10^-8 sec = 10^-5 sec
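The two extreme cases in this example can be reproduced with a small helper. This is a sketch under an assumption that goes beyond the slide: for hit ratios between 0 and 1 it weights the two access times linearly, which is the usual effective-access-time estimate. The function name `total_read_time` is hypothetical; the per-access times 10^-8 sec and 10^-2 sec are the ones used in the example.

```c
#include <assert.h>

/* Estimated total time in seconds for n read requests.
   rh     : hit ratio, between 0 and 1
   t_mem  : time per request served from main memory (seconds)
   t_disk : time per request that has to go to disk (seconds)
   Assumption: hits and misses are weighted linearly by rh. */
static double total_read_time(long n, double rh, double t_mem, double t_disk) {
    assert(n >= 0 && rh >= 0.0 && rh <= 1.0);
    return (double)n * (rh * t_mem + (1.0 - rh) * t_disk);
}
```

Calling `total_read_time(1000, 0.0, 1e-8, 1e-2)` gives about 10 seconds, and `total_read_time(1000, 1.0, 1e-8, 1e-2)` gives about 10^-5 seconds, matching the two cases on the slide.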

Physical Structure of Disk
  [Figure: disk structure. Labels: 200~400 sectors per track, 512 bytes per sector, 2 * nDF]

Disk Access Time
  t = tS + tR + tT, where
  tS: seek time
    Time to reposition the head over the correct track
    Average seek time is 1/2 the worst-case seek time
    4 to 10 milliseconds on typical disks
  tR: rotational latency
    Time to reposition the head over the correct sector
    Average rotational latency: 1/2 r (to find index point) + 1/2 r = r, where r is the time for one rotation
    In case of 15000 rpm: r = 1 * 60 sec / 15000 = 4 msec
  tT: transfer time
    Time to transfer data from disk to main memory via channel
    Proportional to the number of sectors to read
    Real transfer time is negligible
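The formula t = tS + tR + tT and the rotation time r = 60 sec / rpm can be evaluated directly. The sketch below is illustrative only: the function names `rotation_ms` and `disk_access_ms` are hypothetical, and all times are expressed in milliseconds.

```c
#include <assert.h>

/* Time for one full rotation in milliseconds: r = 60 sec / rpm. */
static double rotation_ms(double rpm) {
    assert(rpm > 0.0);
    return 60.0 * 1000.0 / rpm;
}

/* t = tS + tR + tT, with every term given in milliseconds. */
static double disk_access_ms(double seek_ms, double rot_latency_ms,
                             double transfer_ms) {
    assert(seek_ms >= 0.0 && rot_latency_ms >= 0.0 && transfer_ms >= 0.0);
    return seek_ms + rot_latency_ms + transfer_ms;
}
```

For a 15000 rpm disk, `rotation_ms(15000)` is 4 msec. Taking a 4 msec seek (the low end of the range above), the slide's average rotational latency of r, and a negligible transfer time, `disk_access_ms(4.0, 4.0, 0.0)` gives 8 msec per access.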

Block-Oriented Disk Access
  Example
    for(int i=0;i<1000;i++) Nbytes=read(fd,buf,10);
  The program reads 10 bytes, 1000 times
  The disk is accessed 1 block at a time (e.g. 1024 bytes), into a 1024-byte buffer in main memory
  About 100 reads are served from each buffered block
  Number of disk accesses: 10 times
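The count of 10 disk accesses follows from dividing the total number of bytes requested by the block size. A minimal sketch, assuming the reads are sequential from offset 0 and that a single block is buffered in main memory (the function name `disk_accesses` is hypothetical):

```c
#include <assert.h>

/* Number of block-sized disk accesses needed to serve n_reads sequential
   reads of read_size bytes each, starting at offset 0, when one block is
   buffered in main memory. Each block touched is loaded exactly once. */
static long disk_accesses(long n_reads, long read_size, long block_size) {
    assert(n_reads >= 0 && read_size > 0 && block_size > 0);
    long total_bytes = n_reads * read_size;
    return (total_bytes + block_size - 1) / block_size;  /* ceiling division */
}
```

With 1000 reads of 10 bytes and 1024-byte blocks, `disk_accesses(1000, 10, 1024)` is 10, whereas a block size equal to the read size would need all 1000 accesses.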

Disk Block
  Unit of disk access
  Block size
    Normally a multiple of sectors
    1K, 4K, 16K or 64K bytes depending on configuration
  Why not a large block?
    Limited by the size of available main memory
    Too large: unnecessary accesses of sectors
      e.g. only 100 bytes needed, when block size is given as 64K
      1 block: 128 sectors (about 1/2 track, 1/2 rotation, 2 msec)
      Too wasteful
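The cost of an oversized block can be made concrete with a little arithmetic. The sketch below assumes 512-byte sectors, as in the disk structure slide; the names `sectors_per_block` and `wasted_bytes` are hypothetical.

```c
#include <assert.h>

#define SECTOR_SIZE 512  /* bytes per sector, as in the slides */

/* Number of sectors that make up one block. */
static long sectors_per_block(long block_size) {
    assert(block_size > 0 && block_size % SECTOR_SIZE == 0);
    return block_size / SECTOR_SIZE;
}

/* Bytes transferred but not needed when `wanted` bytes fit in one block. */
static long wasted_bytes(long wanted, long block_size) {
    assert(wanted >= 0 && wanted <= block_size);
    return block_size - wanted;
}
```

For a 64K block, `sectors_per_block(64 * 1024)` is 128, and fetching only 100 bytes leaves `wasted_bytes(100, 64 * 1024)` = 65436 bytes transferred for nothing.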