Challenges of deploying Wide-Area-Network Distributed Storage System under network and reliability constraints – A case study

Slides:



Advertisements
Similar presentations
Computing Infrastructure
Advertisements

4/11/2017 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
Ceph Distributed File System: Simulating a Site Failure s: {bazli.abkarim, mt.wong, Mohd Bazli Ab Karim, Ming-Tat Wong, Jing-Yuan.
© 2010 VMware Inc. All rights reserved Confidential Performance Tuning for Windows Guest OS IT Pro Camp Presented by: Matthew Mitchell.
The Budget Crunch And Our Virtual Datacenter Presented By Joe Lanager.
Linux Clustering A way to supercomputing. What is Cluster? A group of individual computers bundled together using hardware and software in order to make.
SQL Server on VMware Jonathan Kehayias (MCTS, MCITP) SQL Database Administrator Tampa, FL.
Novell Server Linux vs. windows server 2008 By: Gabe Miller.
1 stdchk : A Checkpoint Storage System for Desktop Grid Computing Matei Ripeanu – UBC Sudharshan S. Vazhkudai – ORNL Abdullah Gharaibeh – UBC The University.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Matei Ripeanu.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
Introduction to DoC Private Cloud
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Data Storage Willis Kim 14 May Types of storages Direct Attached Storage – storage hardware that connects to a single server Direct Attached Storage.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
Disk Array Performance Estimation AGH University of Science and Technology Department of Computer Science Jacek Marmuszewski Darin Nikołow, Marek Pogoda,
ITEC 451 Project 2 Charisse, Chris, Eileen. Build Small Office Network Three Servers Firewall Device Switch & Router 10 VPN Network Budget – $10,000 –
1 Management Pain points now Existing tools: Do not map to virtual environments Provisioning Backup Health monitoring Performance monitoring / management.
1. Outline Introduction Virtualization Platform - Hypervisor High-level NAS Functions Applications Supported NAS models 2.
Buying a Laptop. 3 Main Components The 3 main components to consider when buying a laptop or computer are Processor – The Bigger the Ghz the faster the.
Cluster computing facility for CMS simulation work at NPD-BARC Raman Sehgal.
1 Some Context for This Session…  Performance historically a concern for virtualized applications  By 2009, VMware (through vSphere) and hardware vendors.
© 2013 Mellanox Technologies 1 NoSQL DB Benchmarking with high performance Networking solutions WBDB, Xian, July 2013.
Edge Based Cloud Computing as a Feasible Network Paradigm(1/27) Edge-Based Cloud Computing as a Feasible Network Paradigm Joe Elizondo and Sam Palmer.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Advisor: Professor.
Evolved Virtual Data Center Reserved Resource Pools.
A Workflow-Aware Storage System Emalayan Vairavanathan 1 Samer Al-Kiswany, Lauro Beltrão Costa, Zhao Zhang, Daniel S. Katz, Michael Wilde, Matei Ripeanu.
Report : Zhen Ming Wu 2008 IEEE 9th Grid Computing Conference.
Yongzhi Wang, Jinpeng Wei VIAF: Verification-based Integrity Assurance Framework for MapReduce.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
Big Red II & Supporting Infrastructure Craig A. Stewart, Matthew R. Link, David Y Hancock Presented at IUPUI Faculty Council Information Technology Subcommittee.
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
Sandor Acs 05/07/
CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM S. A. Weil, S. A. Brandt, E. L. Miller D. D. E. Long, C. Maltzahn U. C. Santa Cruz OSDI 2006.
JLab Scientific Computing: Theory HPC & Experimental Physics Thomas Jefferson National Accelerator Facility Newport News, VA Sandy Philpott.
Investigating the Performance of Audio/Video Service Architecture II: Broker Network Ahmet Uyar & Geoffrey Fox Tuesday, May 17th, 2005 The 2005 International.
ITEP computing center and plans for supercomputing Plans for Tier 1 for FAIR (GSI) in ITEP  8000 cores in 3 years, in this year  Distributed.
Data Replication and Power Consumption in Data Grids Susan V. Vrbsky, Ming Lei, Karl Smith and Jeff Byrd Department of Computer Science The University.
Ceph: A Scalable, High-Performance Distributed File System
Business and Partnering Opportunities: “Windows Server 8” Continuous Availability Designing Systems for Continuous Availability and Scalability Session.
Virtualization Supplemental Material beyond the textbook.
A Silvio Pardi on behalf of the SuperB Collaboration a INFN-Napoli -Campus di M.S.Angelo Via Cinthia– 80126, Napoli, Italy CHEP12 – New York – USA – May.
 Introduction  Architecture NameNode, DataNodes, HDFS Client, CheckpointNode, BackupNode, Snapshots  File I/O Operations and Replica Management File.
ClinicalSoftwareSolutions Patient focused.Business minded. Slide 1 Opus Server Architecture Fritz Feltner Sept 7, 2007 Director, IT and Systems Integration.
Awesome distributed storage system
Data Evolution: 101. Parallel Filesystem vs Object Stores Amazon S3 CIFS NFS.
A Scalable Distributed Datastore for BioImaging R. Cai, J. Curnutt, E. Gomez, G. Kaymaz, T. Kleffel, K. Schubert, J. Tafas {jcurnutt, egomez, keith,
 System Requirements are the prerequisites needed in order for a software or any other resources to execute efficiently.  Most software defines two.
CPSC 231 Secondary storage (D.H.)1 Learning Objectives Understanding disk organization. Sectors, clusters and extents. Fragmentation. Disk access time.
AMS02 Software and Hardware Evaluation A.Eline. Outline  AMS SOC  AMS POC  AMS Gateway Computer  AMS Servers  AMS ProductionNodes  AMS Backup Solution.
1 Thierry Titcheu Chekam 1,2, Ennan Zhai 3, Zhenhua Li 1, Yong Cui 4, Kui Ren 5 1 School of Software, TNLIST, and KLISS MoE, Tsinghua University 2 Interdisciplinary.
New directions in storage | ISGC 2015, Taipei | Patrick Fuhrmann | 19 March 2015 | 1 Presenter: Patrick Fuhrmann dCache.org Patrick Fuhrmann, Paul Millar,
Sql Server Architecture for World Domination Tristan Wilson.
G. Russo, D. Del Prete, S. Pardi Frascati, 2011 april 4th-7th The Naples' testbed for the SuperB computing model: first tests G. Russo, D. Del Prete, S.
STORAGE EXPERIENCES AT MWT2 (US ATLAS MIDWEST TIER2 CENTER) Aaron van Meerten University of Chicago Sarah Williams Indiana University OSG Storage Forum,
S. Pardi Computing R&D Workshop Ferrara 2011 – 4 – 7 July SuperB R&D on going on storage and data access R&D Storage Silvio Pardi
Performance measurement of transferring files on the federated SRB
Sarah Diesburg Operating Systems COP 4610
Virtualization OVERVIEW
Diskpool and cloud storage benchmarks used in IT-DSS
Distributed Network Traffic Feature Extraction for a Real-time IDS
dCache “Intro” a layperson perspective Frank Würthwein UCSD
Section 7 Erasure Coding Overview
Windows Server* 2016 & Intel® Technologies
Introduction to HDFS: Hadoop Distributed File System
Grid Datafarm and File System Services
A Survey on Distributed File Systems
Design Unit 26 Design a small or home office network
Sarah Diesburg Operating Systems CS 3430
Presentation transcript:

Challenges of deploying Wide-Area-Network Distributed Storage System under network and reliability constraints – A case study Mohd Bazli Ab Karim Advanced Computing Lab MIMOS Berhad, Malaysia In PRAGMA Student Workshop PRAGMA26, Tainan, Taiwan 9-11 April 2014

Outline Distributed Storage System? PRAGMA25 – DFS over Local Area Network – Ceph vs. GlusterFS PRAGMA26 – DFS over Wide Area Network – DFS over WAN vs. DFS over LAN 2

Data Center SAN/NAS Disaster Recovery Site(s) SAN/NAS Distributed File System Data Center 1 Local/DAS Data Center 2 Local/DAS Data Center n Local/DAS... One/Multiple Virtual Volume(s) R/W Replications Data Striping and Parallel R/W 3

PRAGMA 25 – DFS on LAN 4 Dell PowerEdge T110 II Proc: Intel Xeon E3-1220v GHz Memory: 8 GB Hard Drives: Seagate Constellation ES 2TB 7200RPM SATA RAID Controller: LSI Logic SAS2008 Network: 1GbE Operating System: Ubuntu Ceph: (Cuttlefish) GlusterFS: Experiment Hardware Specification

PRAGMA 25 – Ceph vs GlusterFS 5

PRAGMA 26 – DFS over WAN 6 MIMOS Headquarters in Kuala Lumpur and its branch office in Kulim. From Google Map 350km

7OSDOSDOSD OSD MON/MDSCLIENT OSDOSD OSD OSD MON/MDSCLIENT OSDOSD OSD OSD MON/MDSCLIENT WAN Datacenter 1 in HQ office SGI ALTIX XE x Dual Core Intel Xeon CPU X GHz 16 Gb RAM 3.5 SATA Drive 250GB Datacenter 2 in HQ office DELL 2 x Dual Core Intel Xeon CPU GHz 12 Gb RAM 3.5 SAS Drive 73GB Datacenter 1 in Kulim office SGI ALTIX XE x Dual Core Intel Xeon CPU X GHz 16 Gb RAM 3.5 SATA Drive 250GB PRAGMA 26 – DFS over WAN (Setup) The storage pool was set with 3 replica counts with minimum number of replica counts required is 2.

8 PRAGMA 26 - DFS over WAN (Networking) Round-trip time in ms Bandwidth (Mbps) 2 TCP Iperf MinAvgMaxMdev DC1 KL to DC1 Kulim 25096% DC2 KL to DC1 Kulim 25096% DC1 KL to DC2 KL %

ceph osd tree # id weight type name up/down reweight root default host poc-tpm1-osd osd.0 up host poc-tpm1-osd osd.1 up host poc-tpm1-osd osd.2 up host poc-tpm1-osd osd.3 up host poc-tpm2-osd osd.4 up host poc-tpm2-osd osd.5 up host poc-tpm2-osd osd.6 up host poc-tpm2-osd osd.7 up host poc-khtp-osd osd.8 up host poc-khtp-osd osd.9 up host poc-khtp-osd osd.10 up host poc-khtp-osd osd.11 up 1 9 CRUSH Map - default

# rules rule data { ruleset 0 type replicated min_size 1 max_size 10 step take default step chooseleaf firstn 0 type host step emit } rule metadata { ruleset 1 type replicated min_size 1 max_size 10 step take default step chooseleaf firstn 0 type host step emit } 10 CRUSH Map Rules - default Pick one leaf node of type host Pick one leaf node of type host

ceph osd tree # id weight type name up/down reweight root default datacenter tpm host poc-tpm1-osd osd.0 up host poc-tpm1-osd osd.1 up host poc-tpm1-osd osd.2 up host poc-tpm1-osd osd.3 up datacenter tpm host poc-tpm2-osd osd.4 up host poc-tpm2-osd osd.5up host poc-tpm2-osd osd.6 up host poc-tpm2-osd osd.7 up datacenter khtp host poc-khtp-osd osd.8 up host poc-khtp-osd osd.9 up host poc-khtp-osd osd.10 up host poc-khtp-osd osd.11 up 1 11 CRUSH Map - New

# rules rule data { ruleset 0 type replicated min_size 2 max_size 10 step take default step chooseleaf firstn 0 type datacenter step emit } rule metadata { ruleset 1 type replicated min_size 2 max_size 10 step take default step chooseleaf firstn 0 type datacenter step emit } 12 CRUSH Map Rules – New Pick one leaf node of type datacenter Pick one leaf node of type datacenter

13 DFS on WAN vs. DFS on LAN

14 Hypothesis:  Write performance is slower than read performance. This is due to write operation requires a creation of new file and also to store overhead information known as metadata, which typically consists of directory information, and space allocation.  DFS IO performs better in LAN compared to WAN due to limited capacity of WAN bandwidth and its latency, jitter etc. Results:  DFS in LAN provides better overall I/O rates compared to DFS in WAN due to its better network connectivity and bandwidth size.  DFS in WAN scores better in writing 64K and 128K block sizes compared to DFS in LAN. Analysis:  DFS in WAN performances in I/O is still acceptable e.g. smaller files size with 16K, 32K, 64K, 128K block sizes, where DFS in LAN only performs slightly better than in WAN.

Summary 15  Distributed file system in wide area network works at acceptable I/O rates and it is ideal for usage of smaller file sizes.  Investigating distributed file system in wide area network, focusing on features like:  support cloud deployment architecture,  ability to provide parallel read and write operations on a distributed file system with different geographical locations.

16