Functional Assessment of Erasure Coded Storage Archive Blair Crossman Taylor Sanchez Josh Sackos LA-UR-13-25967 Computer Systems, Cluster, and Networking.

Presentation transcript:

Functional Assessment of Erasure Coded Storage Archive
Blair Crossman, Taylor Sanchez, Josh Sackos
LA-UR-13-25967
Computer Systems, Cluster, and Networking Summer Institute

Presentation Overview
  Introduction
  Caringo Testing
  Scality Testing
  Conclusions

Storage Mediums
  Tape
    o Priced for capacity, not bandwidth
  Solid State Drives
    o Priced for bandwidth, not capacity
  Hard Disk
    o Bandwidth scales with more drives

Object Storage: Flexible Containers
  Files are stored in data containers
  Metadata outside of file system
  Key-value pairs
  File system scales with machines
  METADATA EXPLOSIONS!!

What is the problem?
  RAID, replication, and tape systems were not designed for exascale computing and storage
  Hard disk capacity continues to grow
  Solution to multiple hard disk failures is needed

Erasure Coding: Reduce! Rebuild! Recalculate!

Project Description
  Erasure coded object storage file system is a potential replacement for LANL’s tape archive system
  Installed and configured two prototype archives
    o Scality
    o Caringo
  Verified the functionality of systems

Functionality Not Performance
  Caringo
    o SuperMicro admin node
    o 1GigE interconnect
    o 10 IBM System x3755
        o 4 x 1TB HDD
    o Erasure coding: n=3, k=3
  Scality
    o SuperMicro admin node
    o 1GigE interconnect
    o 6 HP Proliant (DL160 G6)
        o 4 x 1TB HDD
    o Erasure coding: n=3, k=3
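
To make the n=3, k=3 setting concrete, the arithmetic can be sketched as below. This is only an illustration, and it assumes that n counts data fragments and k counts parity fragments; the slides do not define the two letters. The function names `erasure_profile` and `usable_tb` are hypothetical, and the capacity figure ignores metadata and file system overhead.

```python
from fractions import Fraction


def erasure_profile(n: int, k: int) -> dict:
    """Summarize an n + k layout.

    Assumption (not stated in the slides): n data fragments, k parity fragments.
    """
    if n < 1 or k < 0:
        raise ValueError("need n >= 1 and k >= 0")
    return {
        "fragments_per_object": n + k,
        "tolerated_fragment_losses": k,
        "raw_bytes_per_stored_byte": Fraction(n + k, n),
    }


def usable_tb(raw_tb: float, n: int, k: int) -> float:
    """Capacity left for data after parity, ignoring all other overhead."""
    return raw_tb * n / (n + k)


if __name__ == "__main__":
    print(erasure_profile(3, 3))
    # Raw capacities taken from the testbed slide: nodes x drives x 1TB.
    print("Caringo usable TB:", usable_tb(10 * 4 * 1.0, 3, 3))
    print("Scality usable TB:", usable_tb(6 * 4 * 1.0, 3, 3))
```

Under that assumption, a 3 + 3 layout stores two raw bytes per byte of data and survives the loss of any three fragments of an object.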

Project Testing Requirements
  Data
    o Ingest : Retrieval : Balance : Rebuild
  Metadata
    o Accessibility : Customization : Query
  POSIX Gateway
    o Read : Write : Delete : Performance overhead

How We Broke Data
  Pulled out HDDs (Scality, kill daemon)
  Turned off nodes
  Uploaded files, downloaded files
  Used md5sum to compare originals to downloaded copies
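
The integrity check in the last step can be sketched in Python as follows. The presenters used the `md5sum` command line tool; this is an equivalent illustration using the standard `hashlib` module, not their actual test script, and the helper names `md5_of` and `same_content` are hypothetical.

```python
import hashlib


def md5_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archive files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_content(original: str, downloaded: str) -> bool:
    """True when the downloaded copy hashes to the same digest as the original."""
    return md5_of(original) == md5_of(downloaded)
```

A call such as `same_content("original.bin", "downloaded.bin")` (hypothetical file names) returning `False` after a disk pull or node shutdown would indicate that the archive returned damaged data.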

Caringo: The automated storage system
  Warewulf/Perceus-like diskless (RAM) boot
  Reconfigurable, requires reboot
  DHCP PXE boot provisioned
  Little flexibility or customizability

No Node Specialization
  Nodes "bid" for tasks
  Lowest latency wins
  Distributes the work
  Each node performs all tasks
  Administrator : Compute : Storage
  Automated Power management
  Set a sleep timer
  Set an interval to check disks
  Limited Administration Options
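
The "lowest latency wins" rule can be pictured with a minimal sketch. This is not Caringo's protocol: the node names, the latency values, and the `pick_winner` helper are all hypothetical, and the slides do not say how ties or bid collection are handled.

```python
from typing import Dict


def pick_winner(bids: Dict[str, float]) -> str:
    """Return the node whose bid (latency in milliseconds) is lowest.

    Assumption: on a tie, the first node in insertion order wins.
    """
    if not bids:
        raise ValueError("no node submitted a bid")
    return min(bids, key=bids.get)


if __name__ == "__main__":
    # Hypothetical bids from three storage nodes for one write request.
    print(pick_winner({"node-a": 4.2, "node-b": 1.7, "node-c": 3.0}))
```

Because every node can bid on every task, a busy node naturally answers more slowly and loses, which is one way the work ends up distributed.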

Caringo Rebuilds Data As It Is Written
  Balances data as written
    o Primary Access Node
    o Secondary Access Node
  Automated
    o New HDD/Node: auto balanced
    o New drives format automatically
    o Rebuilds Constantly
    o If any node goes down rebuild starts immediately
    o Volumes can go "stale"
    o 14 Day Limit on unused volumes

What’s a POSIX Gateway
  Content File Server
    o Fully Compliant POSIX object
    o Performs system administration tasks
    o Parallel writes
  Was not available for testing

“Elastic” Metadata
  Accessible
  Query: key values
    o By file size, date, etc.
  Indexing requires “Elastic Search” machine to do indexing
    o Can be the bottleneck in system

Minimum Node Requirements
  Needs a full n + k nodes to: rebuild, write, balance
  Does not need full n + k to: read, query metadata, administration
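
The split between operations that need a full set of nodes and those that do not can be sketched as a simple lookup. The slide only says which operations need all n + k nodes; the lower thresholds used here (n nodes for a read, one node for metadata queries and administration) are assumptions, as is the function name `allowed_operations`.

```python
from typing import List

# From the slide: these operations need a full n + k nodes.
FULL_SET_OPS = ("rebuild", "write", "balance")


def allowed_operations(available_nodes: int, n: int, k: int) -> List[str]:
    """List the operations that can proceed with the given number of live nodes."""
    ops: List[str] = []
    if available_nodes >= n + k:
        ops.extend(FULL_SET_OPS)
    if available_nodes >= n:
        # Assumption: any n fragments are enough to reconstruct an object.
        ops.append("read")
    if available_nodes >= 1:
        # Assumption: a single live node can serve these requests.
        ops.extend(["query metadata", "administration"])
    return ops


if __name__ == "__main__":
    for live in (6, 4, 2):
        print(live, allowed_operations(live, n=3, k=3))
```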

Static Disk Install
  Requires disk install
  Static IP addresses
  Optimizations require deeper knowledge

Virtual Ring Resilience
  Success until fewer virtual nodes are available than the n + k erasure configuration requires
  Data stored to ‘ring’ via distributed hash table
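
A ring backed by a distributed hash table is commonly built with consistent hashing, which can be sketched as below. This is a generic illustration and not Scality's implementation: the `HashRing` class, the SHA-1 hash, the number of virtual nodes per server, and the rule of placing fragments on consecutive virtual nodes are all assumptions.

```python
import bisect
import hashlib
from typing import List


class HashRing:
    """Toy consistent-hash ring mapping object keys to virtual nodes."""

    def __init__(self, servers: List[str], vnodes_per_server: int = 4) -> None:
        # Each server appears several times on the ring as virtual nodes.
        self._ring = sorted(
            (self._hash(f"{server}#{i}"), server)
            for server in servers
            for i in range(vnodes_per_server)
        )
        self._positions = [position for position, _ in self._ring]

    @staticmethod
    def _hash(value: str) -> int:
        return int(hashlib.sha1(value.encode("utf-8")).hexdigest(), 16)

    def placements(self, key: str, fragments: int) -> List[str]:
        """Return the servers owning the next `fragments` virtual nodes clockwise."""
        if fragments > len(self._ring):
            # Mirrors the slide: placement fails with fewer virtual nodes than n + k.
            raise ValueError("fewer virtual nodes than fragments to place")
        start = bisect.bisect_right(self._positions, self._hash(key))
        size = len(self._ring)
        return [self._ring[(start + i) % size][1] for i in range(fragments)]


if __name__ == "__main__":
    ring = HashRing(["server-1", "server-2", "server-3"], vnodes_per_server=2)
    print(ring.placements("object-42", fragments=6))  # n + k = 3 + 3
```

In this sketch several fragments of one object can land on virtual nodes of the same physical server; a real system would need a placement rule that spreads them across hardware.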

Manual Rebuilds, But Flexible
  Rebuilds on fewer than the required nodes
    o Lacks full protection
  Populates data back to additional node
  New Node/HDD: Manually add node
  Data is balanced during: writing, rebuilding

Indexer Sold Separately
  Query all erasure coding metadata per server
  Per item metadata
  User Definable
  Did not test Scality’s ‘Mesa’ indexing service
  Extra software

FUSE gives 50% overhead, but is scalable

On the right path
  Scality
    o Static installation, flexible erasure coding
    o Helpful
    o Separate indexer
    o 500MB file limit ('Unlimited' update coming)
  Caringo
    o Variable installation, strict erasure coding
    o Good documentation
    o Indexer included
    o 4TB file limit (addressing bits limit)

Very Viable
  Some early limitations
  Changes needed on both products
  Scality seems more ready to make those changes.

Questions?

Acknowledgements
  Special Thanks to:
    o Dane Gardner - NMC Instructor
    o Matthew Broomfield - NMC Teaching Assistant
    o HB Chen - HPC-5 - Mentor
    o Jeff Inman - HPC-1 - Mentor
    o Carolyn Connor - HPC-5, Deputy Director ISTI
    o Andree Jacobson - Computer & Information Systems Manager NMC
    o Josephine Olivas - Program Administrator ISTI
  Los Alamos National Labs, New Mexico Consortium, and ISTI