Software Defined storage, Big Data and Ceph. What is all the fuss about? Kamesh Pemmaraju, Sr. Product Mgr, Dell Neil Levine, Dir. of Product Mgmt, Red.

Slides:



Advertisements
Similar presentations
Symantec 2010 Windows 7 Migration EMEA Results. Methodology Applied Research performed survey 1,360 enterprises worldwide SMBs and enterprises Cross-industry.
Advertisements

Symantec 2010 Windows 7 Migration Global Results.
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
© 2004 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Installation & management of SUSE.
1/17/20141 Leveraging Cloudbursting To Drive Down IT Costs Eric Burgener Senior Vice President, Product Marketing March 9, 2010.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Processes and Operating Systems
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Evaluating Caching and Storage Options on the Amazon Web Services Cloud Gagan Agrawal, Ohio State University - Columbus, OH David Chiu, Washington State.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination. Introduction to the Business.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
Database Systems: Design, Implementation, and Management
Auto-scaling Axis2 Web Services on Amazon EC2 By Afkham Azeez.
Our Digital World Second Edition
Tom Hamilton – America’s Channel Database CSE
Lesson 8: Creating and Configuring Virtual Machine Storage
Scalable Multi-Access Flash Store for Big Data Analytics
Customer Experience Solutions. Delivered. 1 BANK 2.0 Making Banks Successful in the Era of Engagement Banking.
13 Copyright © 2005, Oracle. All rights reserved. Monitoring and Improving Performance.
Virtualization & Disaster Recovery
Deploying Virtualised Infrastructures for Improved Efficiency and Reduced Cost Adrian Groeneveld Senior Product Marketing Manager Adrian Groeneveld Senior.
Chapter 1: Introduction to Scaling Networks
Seungmi Choi PlanetLab - Overview, History, and Future Directions - Using PlanetLab for Network Research: Myths, Realities, and Best Practices.
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 1 Taiwan ITQ.
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
1 Cloud Services Professionals ReadySpace IDA Cloud Computing Call 6.
Operating Systems Operating Systems - Winter 2011 Dr. Melanie Rieback Design and Implementation.
Operating Systems Operating Systems - Winter 2012 Dr. Melanie Rieback Design and Implementation.
Sample Service Screenshots Enterprise Cloud Service 11.3.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
© 2009 VMware Inc. All rights reserved Confidential Overview: vCenter Server Heartbeat Q
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
2  Industry trends and challenges  Windows Server 2012: Modern workstyle, enabled  Access from virtually anywhere, any device  Full Windows experience.
Thanks to Microsoft Azure’s Scalability, BA Minds Delivers a Cost-Effective CRM Solution to Small and Medium-Sized Enterprises in Latin America MICROSOFT.
KAIST Computer Architecture Lab. The Effect of Multi-core on HPC Applications in Virtualized Systems Jaeung Han¹, Jeongseob Ahn¹, Changdae Kim¹, Youngjin.
1 Chapter 11: Data Centre Administration Objectives Data Centre Structure Data Centre Structure Data Centre Administration Data Centre Administration Data.
Analyzing Genes and Genomes
+ Thriving at Petabyte scale & beyond
Essential Cell Biology
PSSA Preparation.
Essential Cell Biology
The DDS Benchmarking Environment James Edmondson Vanderbilt University Nashville, TN.
Immunobiology: The Immune System in Health & Disease Sixth Edition
Energy Generation in Mitochondria and Chlorplasts
Ceph – scale-out storage for low latency cloud applications
Agile Infrastructure built on OpenStack Building The Next Generation Data Center with OpenStack John Griffith, Senior Software Engineer,
Cisco and NetApp Confidential. Distributed under non-disclosure only. Name Date FlexPod Entry-level Solution FlexPod Value, Sized Right for Smaller Workloads.
Module – 7 network-attached storage (NAS)
Understand what’s new for Windows File Server Understand considerations for building Windows NAS appliances Understand how to build a customized NAS experience.
© 2013 Mellanox Technologies 1 NoSQL DB Benchmarking with high performance Networking solutions WBDB, Xian, July 2013.
Opensource for Cloud Deployments – Risk – Reward – Reality
Ceph Storage in OpenStack Part 2 openstack-ch,
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
Microsoft Azure Storage. Networking Compute Storage Virtual Machine Operating System Applications Data & Access Runtime Provision.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
FlexPod Converged Solution. FlexPod is… A prevalidated flexible, unified platform featuring: Cisco Unified Computing System™ Programmable infrastructure.
OPENSTACK Presented by Jordan Howell and Katie Woods.
Course: Cluster, grid and cloud computing systems Course author: Prof
Organizations Are Embracing New Opportunities
Scalable SoftNAS Cloud Protects Customers’ Mission-Critical Data in the Cloud with a Highly Available, Flexible Solution for Microsoft Azure MICROSOFT.
Unitrends Enterprise Backup Solution Offers Backup and Recovery of Data in the Microsoft Azure Cloud for Better Protection of Virtual and Physical Systems.
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
Nolan Leake Co-Founder, Cumulus Networks Paul Speciale
OpenStack for the Enterprise
Presentation transcript:

Software Defined storage, Big Data and Ceph. What is all the fuss about? Kamesh Pemmaraju, Sr. Product Mgr, Dell Neil Levine, Dir. of Product Mgmt, Red Hat OpenStack Summit Atlanta, May 2014

CEPH

CEPH UNIFIED STORAGE FILE SYSTEM BLOCK STORAGE OBJECT STORAGE Keystone Geo-Replication Native API 3 Multi-tenant S3 & Swift OpenStack Linux Kernel iSCSI Clones Snapshots CIFS/NFS HDFS Distributed Metadata Linux Kernel POSIX Copyright © 2013 by Inktank | Private and Confidential

ARCHITECTURE 4 Copyright © 2013 by Inktank | Private and Confidential APPHOST/VMCLIENT

COMPONENTS 5 S3/SWIFTHOST/HYPERVISORiSCSICIFS/NFSSDK INTERFACES STORAGE CLUSTERS MONITORSOBJECT STORAGE DAEMONS (OSD) BLOCK STORAGE FILE SYSTEM OBJECT STORAGE Copyright © 2014 by Inktank | Private and Confidential

THE PRODUCT

7 INKTANK CEPH ENTERPRISE WHAT’S INSIDE? Ceph Object and Ceph Block Calamari Enterprise Plugins (2014) Support Services Copyright © 2013 by Inktank | Private and Confidential

USE CASE: OPENSTACK 9

Copyright © 2013 by Inktank | Private and Confidential USE CASE: OPENSTACK 10 Volumes Ephemeral Copy-on-Write Snapshots

Copyright © 2013 by Inktank | Private and Confidential USE CASE: OPENSTACK 11

Copyright © 2013 by Inktank | Private and Confidential USE CASE: CLOUD STORAGE 12 S3/Swift

Copyright © 2013 by Inktank | Private and Confidential USE CASE: WEBSCALE APPLICATIONS 13 Native Protocol Native Protocol Native Protocol Native Protocol

ROADMAP INKTANK CEPH ENTERPRISE 14 Copyright © 2013 by Inktank | Private and Confidential May 2014Q

Copyright © 2013 by Inktank | Private and Confidential USE CASE: PERFORMANCE BLOCK 15 CEPH STORAGE CLUSTER

Copyright © 2013 by Inktank | Private and Confidential USE CASE: PERFORMANCE BLOCK 16 CEPH STORAGE CLUSTER Read/Write

Copyright © 2013 by Inktank | Private and Confidential USE CASE: PERFORMANCE BLOCK 17 CEPH STORAGE CLUSTER Write Read

Copyright © 2013 by Inktank | Private and Confidential USE CASE: ARCHIVE / COLD STORAGE 18 CEPH STORAGE CLUSTER

ROADMAP INKTANK CEPH ENTERPRISE 19 Copyright © 2013 by Inktank | Private and Confidential April 2014September

Copyright © 2013 by Inktank | Private and Confidential USE CASE: DATABASES 20 Native Protocol Native Protocol Native Protocol Native Protocol

Copyright © 2013 by Inktank | Private and Confidential USE CASE: HADOOP 21 Native Protocol Native Protocol Native Protocol Native Protocol

22 Training for Proof of Concept or Production Users Online Training for Cloud Builders and Storage Administrators Instructor led with virtual lab environment INKTANK UNIVERSITY Copyright © 2014 by Inktank | Private and Confidential VIRTUALPUBLIC May 21 – 22 European Time-zone June US Time-zone

23 Ceph Reference Architectures and case study

24 Outline Planning your Ceph implementation Choosing targets for Ceph deployments Reference Architecture Considerations Dell Reference Configurations Customer Case Study

25 Business Requirements – Budget considerations, organizational commitment – Avoiding lock-in – use open source and industry standards – Enterprise IT use cases – Cloud applications/XaaS use cases for massive-scale, cost-effective storage – Steady-state vs. Spike data usage Sizing requirements – What is the initial storage capacity? – What is the expected growth rate? Workload requirements – Does the workload need high performance or it is more capacity focused? – What are IOPS/Throughput requirements? – What type of data will be stored? – Ephemeral vs. persistent data, Object, Block, File? Planning your Ceph Implementation

26 How to Choose Targets Use Cases for Ceph Virtualization and Private Cloud (traditional SAN/NAS) High Performance (traditional SAN) Performance Capacity NAS & Object Content Store (traditional NAS) Cloud Applications Traditional IT XaaS Compute Cloud Open Source Block XaaS Content Store Open Source NAS/Object Ceph Target

27 Tradeoff between Cost vs. Reliability (use-case dependent) Use the Crush configs to map out your failures domains and performance pools Failure domains – Disk (OSD and OS) – SSD journals – Node – Rack – Site (replication at the RADOS level, Block replication, consider latencies) Storage pools – SSD pool for higher performance – Capacity pool Plan for failure domains of the monitor nodes Consider failure replacement scenarios, lowered redundancies, and performance impacts Architectural considerations – Redundancy and replication considerations

28 Server Considerations Storage Node: – one OSD per HDD, 1 – 2 GB ram, and 1 Gz/core/OSD, – SSD’s for journaling and for using the tiering feature in Firefly – Erasure coding will increase useable capacity at the expense of additional compute load – SAS JBOD expanders for extra capacity (beware of extra latency and oversubscribed SAS lanes) Monitor nodes (MON): odd number for quorum, services can be hosted on the storage node for smaller deployments, but will need dedicated nodes larger installations Dedicated RADOS Gateway nodes for large object store deployments and for federated gateways for multi-site

29 Networking Considerations Dedicated or Shared network – Be sure to involve the networking and security teams early when design your networking options – Network redundancy considerations – Dedicated client and OSD networks – VLAN’s vs. Dedicated switches – 1 Gbs vs 10 Gbs vs 40 Gbs! Networking design – Spine and Leaf – Multi-rack – Core fabric connectivity – WAN connectivity and latency issues for multi-site deployments

30 Ceph additions coming to the Dell Red Hat OpenStack solution Pilot configuration Components Dell PowerEdge R620/R720/R720XD Servers Dell Networking S4810/S55 Switches, 10GB Red Hat Enterprise Linux OpenStack Platform Dell ProSupport Dell Professional Services Avail. w/wo High Availability Specs at a glance Node 1: Red Hat Openstack Manager Node 2: OpenStack Controller (2 additional controllers for HA) Nodes 3-8: OpenStack Nova Compute Nodes: 9-11: Ceph 12x3 TB raw storage Network Switches: Dell Networking S4810/S55 Supports ~ virtual machines Benefits Rapid on-ramp to OpenStack cloud Scale up, modular compute and storage blocks Single point of contact for solution support Enterprise-grade OpenStack software package Benefits Rapid on-ramp to OpenStack cloud Scale up, modular compute and storage blocks Single point of contact for solution support Enterprise-grade OpenStack software package Storage bundles

31 Example Ceph Dell Server Configurations TypeSizeComponents Performance20 TB R720XD 24 GB DRAM 10 X 4 TB HDD (data drives) 2 X 300 GB SSD (journal) Capacity44TB / 105 TB* R720XD 64 GB DRAM 10 X 4 TB HDD (data drives) 2 X 300 GB SSH (journal) MD X 4 TB HHD (data drives) Extra Capacity144 TB / 240 TB* R720XD 128 GB DRAM 12 X 4 TB HDD (data drives) MD3060e (JBOD) 60 X 4 TB HHD (data drives)

32 Dell & Red Hat & Inktank have partnered to bring a complete Enterprise-grade storage solution for RHEL-OSP + Ceph The joint solution provides: – Co-engineered and validated Reference Architecture – Pre-configured storage bundles optimized for performance or storage – Storage enhancements to existing OpenStack Bundles – Certification against RHEL-OSP – Professional Services, Support, and Training › Collaborative Support for Dell hardware customers › Deployment services & tools What Are We Doing To Enable?

33 UAB Case Study

34 Overcoming a data deluge Inconsistent data management across research teams hampers productivity Growing data sets challenged available resources Research data distributed across laptops, USB drives, local servers, HPC clusters Transferring datasets to HPC clusters took too much time and clogged shared networks Distributed data management reduced researcher productivity and put data at risk

35 Solution: a storage cloud Centralized storage cloud based on OpenStack and Ceph Flexible, fully open-source infrastructure based on Dell reference design − OpenStack, Crowbar and Ceph − Standard PowerEdge servers and storage − 400+ TBs at less than 41¢ per gigabyte Distributed scale-out storage provisions capacity from a massive common pool − Scalable to 5 petabytes Data migration to and from HPC clusters via dedicated 10Gb Ethernet fabric Easily extendable framework for developing and hosting additional services − Simplified backup service now enabled “We’ve made it possible for users to satisfy their own storage needs with the Dell private cloud, so that their research is not hampered by IT.” David L. Shealy, PhD Faculty Director, Research Computing Chairman, Dept. of Physics

36 Building a research cloud Project goals extend well beyond data management Designed to support emerging data-intensive scientific computing paradigm – 12 x 16-core compute nodes – 1 TB RAM, 420 TBs storage – 36 TBs storage attached to each compute node Virtual servers and virtual storage meet HPC − Direct user control over all aspects of the application environment − Ample capacity for large research data sets Individually customized test/development/ production environments − Rapid setup and teardown Growing set of cloud-based tools & services − Easily integrate shareware, open source, and commercial software “We envision the OpenStack-based cloud to act as the gateway to our HPC resources, not only as the purveyor of services we provide, but also enabling users to build their own cloud-based services.” John-Paul Robinson, System Architect

37 Research Computing System (Next Gen) A cloud-based computing environment with high speed access to dedicated and dynamic compute resources Open Stack node HPC Cluster HPC Storage DDR InfinibandQDR Infiniband 10Gb Ethernet Cloud services layer Virtualized server and storage computing cloud based on OpenStack, Crowbar and Ceph UAB Research Network

38 THANK YOU!

39 Contact Information Reach Kamesh and Neil for additional information: Dell.com/OpenStack Visit the Dell and Inktank booths in the OpenStack Summit Expo Hall

THANK YOU! Neil Levine Dir. of Product Mgmt Red

41