Data Storage Systems: A Survey Abdullah Aldhamin July 29, 2013 CMPT 880: Large-Scale Multimedia Systems and Cloud Computing Course Project.



Motivation
– Research interest in storage systems, specifically in SSDs

Outline
– Objective
– Overview
– Solid-State Drives
– Use Cases

Objective
– Storage system architectures in enterprise data centers
– What is cloud storage?
– Integrating flash-based solid-state drives in large-scale storage systems

Overview
Different storage architectures in data centers:
– Block I/O interface (DAS and SAN)
– File I/O interface (NAS)
– Is there a “better” solution?
– Shortcomings for today’s computing…

Overview
Cloud storage:
– What is cloud storage? Object-based storage
– Example: Windows Azure Storage (WAS)
– Some research problems

Overview
Flash-based solid-state drive:
– What is it?
– Pros and cons
– How can we integrate it in large-scale storage systems?
Future direction

Integrating SSDs in Large-Scale Storage Systems
Considerations and facts:
– Non-uniform read access latencies, correlated with the workload access pattern
– Internal drive-specific operations impact performance
– Internal fragmentation leads to performance degradation
– More writes → write amplification → bad wear leveling
– Performance vs. lifetime vs. cost
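The write amplification mentioned above is commonly expressed as the ratio of data physically written to flash to data the host asked to write. The following is a minimal sketch of that ratio; the function name and the byte counts are illustrative assumptions, not figures from the survey.

```python
def write_amplification(host_bytes_written: int, flash_bytes_written: int) -> float:
    """Return flash bytes written divided by host bytes written.

    A value of 1.0 means no amplification; larger values mean the drive
    performed extra internal writes (for example during garbage collection).
    """
    if host_bytes_written <= 0:
        raise ValueError("host_bytes_written must be positive")
    return flash_bytes_written / host_bytes_written


# Hypothetical example: the host writes 4 KiB, but the drive has to
# rewrite 16 KiB internally to service that request.
waf = write_amplification(4096, 16384)
print(waf)  # 4.0
```

A higher ratio means each host write consumes more of the flash cells' limited program/erase cycles, which is why it ties into the performance vs. lifetime trade-off listed above.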

Integrating SSDs in Large-Scale Storage Systems
SSD in storage system:
– SSD-only system
– Hybrid system
  – End-point
  – Accelerator
    – Write buffer
    – Read cache

Gordon: SSD-only HPC Cluster
– The first HPC cluster designed with SSD-only storage
– Optimized to exploit the high bandwidth of SSDs for data-intensive applications

Gordon (Cont’d)
Design goals:
– Reduce the performance gap between processor and I/O in large-scale data-intensive computing
– Improve system performance
– Lower power consumption

Gordon (Cont’d)
How is the SSD integrated?
– Conventional hard disks are replaced with SSDs
– Major device-level modification: a new flash translation layer

Gordon (Cont’d)
– Costly $$$
– Not suitable for widespread adoption
– Requires major device-level modification
– Optimized for specific workloads

Hybrid: Griffin
Griffin hybrid storage system
– SSD is an end-point store for the data
– Uses HDDs as write-back buffers
  – Log-structured HDDs buffer incoming writes
  – Extends SSD lifetime
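To illustrate why buffering writes in a log in front of the SSD can extend its lifetime, here is a toy model in Python. It is a sketch under simplifying assumptions, not Griffin's actual design: the class `LogBufferedStore`, its methods, and the explicit `migrate` step are hypothetical names, and in-memory structures stand in for the HDD log and the SSD.

```python
class LogBufferedStore:
    """Toy model: an append-only HDD log in front of an SSD end-point store."""

    def __init__(self):
        self.hdd_log = []    # append-only list of (block, data) records
        self.ssd = {}        # block -> data, the final home of the data
        self.ssd_writes = 0  # number of block writes that reached the SSD

    def write(self, block, data):
        # Incoming writes are only appended to the log (sequential on an HDD).
        self.hdd_log.append((block, data))

    def read(self, block):
        # The newest buffered version wins; otherwise fall back to the SSD.
        for logged_block, data in reversed(self.hdd_log):
            if logged_block == block:
                return data
        return self.ssd.get(block)

    def migrate(self):
        # Keep only the latest version of each block, then write it to the SSD once.
        latest = {}
        for block, data in self.hdd_log:
            latest[block] = data
        for block, data in latest.items():
            self.ssd[block] = data
            self.ssd_writes += 1
        self.hdd_log.clear()


store = LogBufferedStore()
for version in range(5):
    store.write(7, f"v{version}")  # five overwrites of the same block
store.write(8, "other")
store.migrate()
print(store.ssd_writes)  # 2
```

In this toy run six host writes result in only two SSD writes, because overwrites of the same block are coalesced in the log before migration.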

Hybrid: Hystor
Hybrid storage system
– SSD is used to improve I/O performance
  – Read cache
  – Write-back buffer
– Challenge: which data should be cached on the SSD to improve performance?
– Minimum system changes
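The read-cache role can be sketched with a small block cache in front of a slower backing store. This sketch deliberately uses plain least-recently-used (LRU) eviction as a stand-in; it is not Hystor's block-selection policy, and `SsdReadCache`, its fields, and the sample access pattern are hypothetical.

```python
from collections import OrderedDict


class SsdReadCache:
    """Toy LRU read cache (standing in for the SSD) over a slow backing store."""

    def __init__(self, capacity, backing):
        self.capacity = capacity    # maximum number of cached blocks
        self.backing = backing      # dict standing in for the HDD
        self.cache = OrderedDict()  # block -> data, oldest entry first
        self.hits = 0
        self.misses = 0

    def read(self, block):
        if block in self.cache:
            # Cache hit: mark the block as most recently used.
            self.cache.move_to_end(block)
            self.hits += 1
            return self.cache[block]
        # Cache miss: fetch from the backing store and cache the block.
        self.misses += 1
        data = self.backing[block]
        self.cache[block] = data
        if len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict the least recently used block
        return data


hdd = {n: f"data{n}" for n in range(10)}
cache = SsdReadCache(capacity=2, backing=hdd)
for block in [1, 2, 1, 3, 2]:
    cache.read(block)
print(cache.hits, cache.misses)  # 1 4
```

With a cache this small, the access pattern above yields a single hit, which hints at why the choice of which blocks to cache is the central challenge.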

Conclusion
– The choice of I/O interface determines which storage access features are available
– Cloud storage continues to grow to accommodate the flood of collected data
– The solid-state drive has become an instrumental player in storage systems, but how can we best use it?