Arjun Suresh S7, R College of Engineering Trivandrum.

Slides:



Advertisements
Similar presentations
Section 6.2. Record data by magnetizing the binary code on the surface of a disk. Data area is reusable Allows for both sequential and direct access file.
Advertisements

Adam Jorgensen Pragmatic Works Performance Optimization in SQL Server Analysis Services 2008.
Multi-Level Caches Vittorio Zaccaria. Preview What you have seen: Data organization, Associativity, Cache size Policies -- how to manage the data once.
Daniel Schall, Volker Höfner, Prof. Dr. Theo Härder TU Kaiserslautern.
Storing Data: Disks and Files: Chapter 9
A Scalable and Reconfigurable Search Memory Substrate for High Throughput Packet Processing Sangyeun Cho and Rami Melhem Dept. of Computer Science University.
Query Processing and Optimizing on SSDs Flash Group Qingling Cao
MEMS Based Mass Storage Systems. What is MEMS? (M)icro(E)lectric(M)echanical(S)ystems Consist of mech µ(structures, sensors, actuators), electronics,
1 HYRISE – A Main Memory Hybrid Storage Engine By: Martin Grund, Jens Krüger, Hasso Plattner, Alexander Zeier, Philippe Cudre-Mauroux, Samuel Madden, VLDB.
Boost Write Performance for DBMS on Solid State Drive Yu LI.
Lecture 12: DRAM Basics Today: DRAM terminology and basics, energy innovations.
SECTIONS 13.1 – 13.3 Sanuja Dabade & Eilbroun Benjamin CS 257 – Dr. TY Lin SECONDARY STORAGE MANAGEMENT.
Secondary Storage CSCI 444/544 Operating Systems Fall 2008.
1 CS222: Principles of Database Management Fall 2010 Professor Chen Li Department of Computer Science University of California, Irvine Notes 01.
SECTIONS 13.1 – 13.3 Sanuja Dabade & Eilbroun Benjamin CS 257 – Dr. TY Lin SECONDARY STORAGE MANAGEMENT.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
MEMS-BASED INTEGRATED-CIRCUIT MASS-STORAGE SYSTEMS L. R. Carley, G. R. Ganger, D. F. Nagle Carnegie-Mellon University.
Lecture 11: DMBS Internals
C-Store: Column Stores over Solid State Drives Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY Jun 19, 2009.
DBMSs On A Modern Processor: Where Does Time Go? by A. Ailamaki, D.J. DeWitt, M.D. Hill, and D. Wood University of Wisconsin-Madison Computer Science Dept.
Flashing Up the Storage Layer I. Koltsidas, S. D. Viglas (U of Edinburgh), VLDB 2008 Shimin Chen Big Data Reading Group.
Topic: Disks – file system devices. Rotational Media Sector Track Cylinder Head Platter Arm Access time = seek time + rotational delay + transfer time.
1 IT420: Database Management and Organization Storage and Indexing 14 April 2006 Adina Crăiniceanu
July, 2001 High-dimensional indexing techniques Kesheng John Wu Ekow Otoo Arie Shoshani.
© Stavros Harizopoulos 2006 Performance Tradeoffs in Read-Optimized Databases Stavros Harizopoulos MIT CSAIL joint work with: Velen Liang, Daniel Abadi,
CS 345: Topics in Data Warehousing Tuesday, October 19, 2004.
Microprocessor-based systems Curse 7 Memory hierarchies.
Speaker: 吳晋賢 (Chin-Hsien Wu) Embedded Computing and Applications Lab Department of Electronic Engineering National Taiwan University of Science and Technology,
1 CPS216: Advanced Database Systems Notes 04: Operators for Data Access Shivnath Babu.
Performance Tradeoffs in Read-Optimized Databases Stavros Harizopoulos * MIT CSAIL joint work with: Velen Liang, Daniel Abadi, and Sam Madden massachusetts.
© Stavros Harizopoulos 2006 Performance Tradeoffs in Read- Optimized Databases: from a Data Layout Perspective Stavros Harizopoulos MIT CSAIL Modified.
Chapter 6 1 © Prentice Hall, 2002 The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited) Project Identification and Selection Project Initiation.
DBI313. MetricOLTPDWLog Read/Write mixMostly reads, smaller # of rows at a time Scan intensive, large portions of data at a time, bulk loading Mostly.
Parallel Programming on the SGI Origin2000 With thanks to Igor Zacharov / Benoit Marchand, SGI Taub Computer Center Technion Moshe Goldberg,
Log-structured Memory for DRAM-based Storage Stephen Rumble, John Ousterhout Center for Future Architectures Research Storage3.2: Architectures.
Introduction to Computer Architecture. What is binary? We use the decimal (base 10) number system Binary is the base 2 number system Ten different numbers.
Yi-Lin, Tu 2013 IEE5011 –Fall 2013 Memory Systems Wide I/O High Bandwidth DRAM Yi-Lin, Tu Department of Electronics Engineering National Chiao Tung University.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
Weaving Relations for Cache Performance Anastassia Ailamaki Carnegie Mellon David DeWitt, Mark Hill, and Marios Skounakis University of Wisconsin-Madison.
Database Systems Disk Management Concepts. WHY DO DISKS NEED MANAGING? logical information  physical representation bigger databases, larger records,
Introduction: Memory Management 2 Ideally programmers want memory that is large fast non volatile Memory hierarchy small amount of fast, expensive memory.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Chapter 4 Logical & Physical Database Design
March, 2002 Efficient Bitmap Indexing Techniques for Very Large Datasets Kesheng John Wu Ekow Otoo Arie Shoshani.
1 CMP-MSI.07 CARES/SNU A Reusability-Aware Cache Memory Sharing Technique for High Performance CMPs with Private Caches Sungjune Youn, Hyunhee Kim and.
ICOM 5016 – Introduction to Database Systems Lecture 13- File Structures Dr. Bienvenido Vélez Electrical and Computer Engineering Department Slides by.
DMBS Internals I February 24 th, What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
MEMS and Caching for File Systems Andy Wang COP 5611 Advanced Operating Systems.
COSC 6340: Disks 1 Disks and Files DBMS stores information on (“hard”) disks. This has major implications for DBMS design! » READ: transfer data from disk.
1 Contents Memory types & memory hierarchy Virtual memory (VM) Page replacement algorithms in case of VM.
Storage Tuning for Relational Databases Philippe Bonnet – Spring 2015.
SQL Server Internals 101 AYMAN SENIOR MICROSOFT.
Select Operation Strategies And Indexing (Chapter 8)
Chin-Hsien Wu & Tei-Wei Kuo
CMSC 611: Advanced Computer Architecture Memory & Virtual Memory Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material.
Memory COMPUTER ARCHITECTURE
CPS216: Data-intensive Computing Systems
Lecture 16: Data Storage Wednesday, November 6, 2006.
Database Management Systems (CS 564)
Part V Memory System Design
BitWarp Energy Efficient Analytic Data Processing on Next Generation General Purpose GPUs Jason Power || Yinan Li || Mark D. Hill || Jignesh M. Patel.
Lecture 11: DMBS Internals
Lecture 9: Data Storage and IO Models
Sanuja Dabade & Eilbroun Benjamin CS 257 – Dr. TY Lin
Part V Memory System Design
Physical Database Design
KISS-Tree: Smart Latch-Free In-Memory Indexing on Modern Architectures
Contents Memory types & memory hierarchy Virtual memory (VM)
Presentation transcript:

Arjun Suresh S7, R College of Engineering Trivandrum

MEMS-based storage Relational database layout FRM (Flexible Retrieval Model) Query Processing on FRM Evaluation Conclusion Outline

Building Storage on MEMS MEMS are Micro Electrical Mechanical Systems.  Basic functions: sensors and actuators  Built by standard silicon processing  Combine mechanical and electrical components  Enable “systems-on-a-chip”

Why need MEMS-based storage?  Huge gaps between disks and RAM 1000,000 latency gap (10ms vs 50 ns), widening 50% yearly price gap per byte 1000 life gap  MEMS narrows gaps 10X smaller latency than disk 100X cheaper than RAM in the range 1 – 10 GB 100 MB - 1 GB/s bandwidth 10 GB capacity with a penny size  Desired for energy and volume critical systems

EEPROM/Flash DRAM Hard Disk Latency Cost per Byte MEMS-based Storage Technology Trends

Using pits in the polymers made by tip heating to store data IBM Millipede

MEMS Storage Architecture Read/write tips Read/write tips Media Bits stored underneath each tip Bits stored underneath each tip side view

MEMS Storage Architecture Media Sled X Y

Springs MEMS Storage Architecture

Anchors attach the springs to the chip. Anchors attach the springs to the chip. Anchor X Y MEMS Storage Architecture

Sled is free to move Sled is free to move X Y MEMS Storage Architecture

Sled is free to move Sled is free to move X Y MEMS Storage Architecture

Springs pull sled toward center Springs pull sled toward center X Y MEMS Storage Architecture

X Y Springs pull sled toward center Springs pull sled toward center MEMS Storage Architecture

Actuators pull sled in both dimensions Actuators pull sled in both dimensions Actuator X Y MEMS Storage Architecture

Actuators pull sled in both dimensions Actuators pull sled in both dimensions X Y MEMS Storage Architecture

Actuators pull sled in both dimensions Actuators pull sled in both dimensions X Y MEMS Storage Architecture

Actuators pull sled in both dimensions Actuators pull sled in both dimensions X Y MEMS Storage Architecture

Actuators pull sled in both dimensions Actuators pull sled in both dimensions X Y MEMS Storage Architecture

Probe tips are fixed Probe tips are fixed Probe tip X Y MEMS Storage Architecture

X Y Probe tips are fixed Probe tips are fixed MEMS Storage Architecture

X Y Sled only moves over the area of a single rectangle Sled only moves over the area of a single rectangle One probe tip per rectangle One probe tip per rectangle Each tip accesses data at the same relative position Each tip accesses data at the same relative position MEMS Storage Architecture

Properties of MEMS Storage Sweep area of One probe tip

Properties of MEMS Storage N bits M bits One tip region One tip sector

Physical Parameters Number of tips6400 Max number of active tips1280 Tip sector size8 bytes Bits per tip region2000X2000 X axis settle time0.125ms Average turnaround time0.06ms

Existing Work on Integration of MEMS Storage Solution proposed by CMU researchers Mapping MEMS storage into conventional disk Adapt I/O scheduling and data placement to MEMS Preliminary study shows Reduce the I/O stall times by 4 to 74 times over disks Improve the overall application run times by 1.9 to 4.4 Reduce the energy consumption by times

Better solutions?? The approach of mapping MEMS into disk Simplify the procedure of integration of MEMS Does not consider the physical properties of MEMS

Relational Data Placement StudentGrade recordID name char(16) perm ID int(8) age int(8) grade int(8) record1Mary record2John record3Bob record4Jane

N-ary Storage Model (NSM) Store records in a relation in slotted disk pages Organize records sequentially on the disk pages Page HeaderMary John Bob Jane P4P3P2p1

Decomposition Storage Model(DSM) Divide a relation into sub-relations based on the number of attributes Each sub-relation corresponds to each attribute Each sub-relation is organized into pages in the same way as NSM Page Header1Mary2 John3Bob4Jane P4P3P2P1 Page Header P4P3P2P1 Page Header P4P3P2P1 Page Header P4P3P2P1 grade nameperm ID age A disk page

Partition Attributes Across (PAX) Within each page, PAX groups all values of each attribute into a mini-page A page is divided into mini-pages based on the number of attributes It stores the same data as NSM in each page Page HeaderMaryJohn BobJane P4P3P2P P4P3P2P P4P3P2P P4P3P2P1 A disk page

Common Workload Requirements Relational data should be compatible with OLTP workloads Due to the update characteristics, relations need to be accessed in a row-wise manner OLAP workloads Only a subset of attributes is of interest, data placement should facilitate data retrieval on a column-wise fashion

Flexible Retrieval Model (FRM) Facilitates data retrieval in both row-wise and column-wise manner Retrieves the relevant subsets of the relations Uses two dimensional layout of MEMS storage Improves the I/O utilization Maximize the concurrent tips to only retrieve the necessary data It is also cache-friendly Use intra-record locality

FRM Data Placement and Retrieval Given a relation with three attributes, the size of each attribute is 8 bytes Placing this relation in 4x4 MEMS-based storage 4 concurrent tips (0, 0) (0, 1) (0, 0) (0, 1) Attr1 Attr2 Attr3 (0, 0) (0, 1) (0, 2) (1, 2)

Query Processing on FRM Selection and projection without index Two-dimensional table scan Selection and projection with index Encode data position as 4-tuple (tip-x, tip-y, offset-x, offset-y) Joins Index-based or hash

Experiment Setup MEMS storage: 1280 concurrent tips out of 6400 total tips Pentium II Celeron 433X2 processors L1 cache:16KB, 32-byte cache line, 20 ns delay L2 cache:128KB, 32-byte cache line, 200 ns delay A relation R with 1.28 million records Sixteen 8-byte attributes in each record Queries: SELECT A 1, A 2, …, A n FROM R WHERE A 1 > Bound;

Memory utilization The selected attributes in queries Consumed memory(MB)

I/O performance NSM and PAX have the same I/O time I/O time of FRM is proportional to the size of retrieved attributes

Selection Queries (2 attributes) Cache Utilization

Selection Queries (13 attributes)

Projection Queries Selectivity = 50%

Projection Queries Selectivity =10%

Conclusion and Future Work Proposed a relational data placement scheme for MEMS-based storage Take advantage of two-dimensional MEMS access feature Arrange MEMS rows and columns to relational attributes and records Save IO cost and improve cache performance Other cache friendly techniques for MEMS- based storage devices are to be explored

Thank You……