A comparison of distributed data storage middleware for HPC, GRID and Cloud Mikhail Goldshtein 1, Andrey Sozykin 1, Grigory Masich 2 and Valeria Gribova.

Slides:



Advertisements
Similar presentations
Storage Workshop Summary Wahid Bhimji University Of Edinburgh On behalf all of the participants…
Advertisements

GridKa May 2004 Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Installing dCache into an existing Storage environment at GridKa Forschungszentrum.
Storage: Futures Flavia Donno CERN/IT WLCG Grid Deployment Board, CERN 8 October 2008.
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
C LOUD C OMPUTING Presented by Ye Chen. What is cloud computing? Cloud computing is a model for enabling ubiquitous, convenient, on- demand network access.
Institute for High Energy Physics ( ) NEC’2007 Varna, Bulgaria, September Activities of IHEP in LCG/EGEE.
Bondyakov A.S. Institute of Physics of ANAS, Azerbaijan JINR, Dubna.
A.V. Bogdanov Private cloud vs personal supercomputer.
Distributed High Performance Computing Environment of Ural Branch of RAS M.L.Goldshtein, A.V.Sozykin, Institute of Mathematics and Mechanics UrB RAS, Yekaterinburg.
A. Mohapatra, HEPiX 2013 Ann Arbor1 UW Madison CMS T2 site report D. Bradley, T. Sarangi, S. Dasu, A. Mohapatra HEP Computing Group Outline  Infrastructure.
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
StoRM Some basics and a comparison with DPM Wahid Bhimji University of Edinburgh GridPP Storage Workshop 31-Mar-101Wahid Bhimji – StoRM.
L ABORATÓRIO DE INSTRUMENTAÇÃO EM FÍSICA EXPERIMENTAL DE PARTÍCULAS Enabling Grids for E-sciencE Grid Computing: Running your Jobs around the World.
Large Scale Test of a storage solution based on an Industry Standard Michael Ernst Brookhaven National Laboratory ADC Retreat Naples, Italy February 2,
LCG Service Challenge Phase 4: Piano di attività e impatto sulla infrastruttura di rete 1 Service Challenge Phase 4: Piano di attività e impatto sulla.
Data Management The GSM-WG Perspective. Background SRM is the Storage Resource Manager A Control protocol for Mass Storage Systems Standard protocol:
GStore: GSI Mass Storage ITEE-Palaver GSI Horst Göringer, Matthias Feyerabend, Sergei Sedykh
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Grid Lab About the need of 3 Tier storage 5/22/121CHEP 2012, The need of 3 Tier storage Dmitri Ozerov Patrick Fuhrmann CHEP 2012, NYC, May 22, 2012 Grid.
EMI 1 Release The EMI 1 (Kebnekaise) release features for the first time a complete and consolidated set of middleware components from ARC, dCache, gLite.
Grid and Cloud Computing Globus Provision Dr. Guy Tel-Zur.
Your university or experiment logo here GridPP Storage Future Jens Jensen GridPP workshop RHUL, April 2010.
BNL Wide Area Data Transfer for RHIC & ATLAS: Experience and Plans Bruce G. Gibbard CHEP 2006 Mumbai, India.
Overview of grid activities in France in relation to FKPPL FKPPL Workshop Thursday February 26th, 2009 Dominique Boutigny.
July 29' 2010INDIA-CMS_meeting_BARC1 LHC Computing Grid Makrand Siddhabhatti DHEP, TIFR Mumbai.
Terascala – Lustre for the Rest of Us  Delivering high performance, Lustre-based parallel storage appliances  Simplifies deployment, management and tuning.
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Implementation of a reliable and expandable on-line storage for compute clusters Jos van Wezel.
WebFTS File Transfer Web Interface for FTS3 Andrea Manzi On behalf of the FTS team Workshop on Cloud Services for File Synchronisation and Sharing.
11 November 2010 Natascha Hörmann Computing at HEPHY Evaluation 2010.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
Scientific Storage at FNAL Gerard Bernabeu Altayo Dmitry Litvintsev Gene Oleynik 14/10/2015.
Computing Jiří Chudoba Institute of Physics, CAS.
LCG LCG-1 Deployment and usage experience Lev Shamardin SINP MSU, Moscow
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
EMI INFSO-RI European Middleware Initiative (EMI) Alberto Di Meglio (CERN)
Andrea Manzi CERN On behalf of the DPM team HEPiX Fall 2014 Workshop DPM performance tuning hints for HTTP/WebDAV and Xrootd 1 16/10/2014.
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
JINR WLCG Tier 1 for CMS CICC comprises 2582 Core Disk storage capacity 1800 TB Availability and Reliability = 99% 49% 44% JINR (Dubna)End of.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Overview of DMLite Ricardo Rocha ( on behalf of the LCGDM team.
European Middleware Initiative (EMI) Alberto Di Meglio (CERN) Project Director.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Storage Element Model and Proposal for Glue 1.3 Flavia Donno,
Development of a Tier-1 computing cluster at National Research Centre 'Kurchatov Institute' Igor Tkachenko on behalf of the NRC-KI Tier-1 team National.
BNL dCache Status and Plan CHEP07: September 2-7, 2007 Zhenping (Jane) Liu for the BNL RACF Storage Group.
J Jensen/J Gordon RAL Storage Storage at RAL Service Challenge Meeting 27 Jan 2005.
Andrea Manzi CERN EGI Conference on Challenges and Solutions for Big Data Processing on cloud 24/09/2014 Storage Management Overview 1 24/09/2014.
EMI INFSO-RI EMI 1, open source middleware and the road to sustainability Alberto Di Meglio (CERN) Project Director EGI User Forum EMI Technical.
EMI INFSO-RI Patrick Fuhrmann EMI Data area leader At the EGI Technical Forum 2011, in Lyon EMI-Data The second year.
An Analysis of Data Access Methods within WLCG Shaun de Witt, Andrew Lahiff (STFC)
EMI INFSO-RI /04/2011What's new in EMI 1: Kebnekaise What’s new in EMI 1 Kathryn Cassidy (TCD)‏ EMI NA2.
EMI is partially funded by the European Commission under Grant Agreement RI EMI Outlook and Open Source Activities Alberto DI MEGLIO, CERN Project.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Standard Protocols in DPM Ricardo Rocha.
The status of IHEP Beijing Site WLCG Asia-Pacific Workshop Yaodong CHENG IHEP, China 01 December 2006.
EMI is partially funded by the European Commission under Grant Agreement RI Future Proof Storage with DPM Oliver Keeble (on behalf of the CERN IT-GT-DMS.
Implementation of GLUE 2.0 support in the EMI Data Area Elisabetta Ronchieri on behalf of JRA1’s GLUE 2.0 Working Group INFN-CNAF 13 April 2011, EGI User.
Riccardo Zappi INFN-CNAF SRM Breakout session. February 28, 2012 Ingredients 1. Basic ingredients (Fabric & Conn. level) 2. (Grid) Middleware ingredients.
Distributed storage system with dCache E.Y.Kuklin, A.V.Sozykin, A.Y.Bersenev Institute of Mathematics and Mechanics UB RAS, Yekaterinburg G.F.Masich Institute.
Sviluppo middleware sostenibile Il caso di EMI
Overview of the Training Agenda
Grid Computing: Running your Jobs around the World
Ian Bird, CERN & WLCG CNAF, 19th November 2015
Managing Storage in a (large) Grid data center
Experiences with http/WebDAV protocols for data access in high throughput computing
Introduction to Data Management in EGI
Clouds of JINR, University of Sofia and INRNE Join Together
Christof Hanke, HEPIX Spring Meeting 2008, CERN
EGI UMD Storage Software Repository (Mostly former EMI Software)
Enabling High Speed Data Transfer in High Energy Physics
The INFN Tier-1 Storage Implementation
Nolan Leake Co-Founder, Cumulus Networks Paul Speciale
Presentation transcript:

A comparison of distributed data storage middleware for HPC, GRID and Cloud Mikhail Goldshtein 1, Andrey Sozykin 1, Grigory Masich 2 and Valeria Gribova 3 1 Institute of Mathematics and Mechanics UrB RAS, Russia, Yekaterinburg 2 Institute of Continuous Media Mechanics UrB RAS, Russia, Perm 3 Institute of Automation and Control Processes FEB RAS, Russia, Vladivostok

European Middleware Initiative EMI - Software platform for high performance distributed computing, Joint effort of the major European distributed computing middleware providers (ARC, dCache, gLite, UNICORE) Widely used in Europe, including Worldwide LHC Computing Grid (WLCG) Higgs boson: Alberto Di Meglio: Without the EMI middleware, such an important result could not have been achieved in such a short time 2

Storage solutions in EMI 3 dCache - Disk Pool Manager (DPM) - StoRM (STOrage Resource Manager) -

dCache 4

Disk Pool Manager 5

StoRM 6

Usage statistics in WLCG 7

Distributed storage systems Traditional approach: Grid Distributed file systems (IBM GPFS, Lustre File System, etc.) Modern technologies: Standard Internet Protocols (Parallel NFS, WebDAV, etc.) Cloud storage (Amazone S3, HDFS, etc.) 8

Classic NFS 9

Parallel NFS 10

Comparison results 11 FeaturedCacheDPMStoRM Grid protocolsSRM, xroot, dcap GridFTP SRM, RFIO, xroot, GridFTP SRM, RFIO, xroot, GridFTP, file Standard protocols NFS 4.1, WebDAV - Cloud backendHDFS (in development) HDFS, Amazon S3 - Quality of documentation HighMediumHigh Ease of administration EasyMediumEasy

Distributed dCache based Tire 1 WLCG storage 12

Implementation 13

Implementation details Hardware: 4 x Supermicro servers (3 in Yekaterinburg, 1 in Perm), 210 TB useful capacity (252 full capacity, RAID5 + Hotspare are used) ОС Scientific Linux 6.3 dCache 2.6 from EMI repository Protocol: NFS v4.1 (Parallel NFS) RHEL has a parallel NFS client, no need to install additional software to clusters 14

Performance testing 15 IOR test (

Future works Evaluation of NFS performance over 10GE and WAN Evaluation of dCache in the experiments (Particle Image Velocimetry and so on) Participation in GRID projects: Grid of Russian National Nanotechnology Network WLCG (through Joint Institute for Nuclear Research, Dubna, Russia) Connection to Hadoop Cluster (when dCache will support HDFS) 16

Thank you! Andrey Sozykin Institute of Mathematics and Mechanics UrB RAS, Russia, Yekaterinburg 17