1 SRM-Lite: overcoming the firewall barrier for data movement Arie Shoshani Alex Sim Viji Natarajan Lawrence Berkeley National Laboratory SDM Center All-Hands.

Slides:



Advertisements
Similar presentations
HEPiX GFAL and LCG data management Jean-Philippe Baud CERN/IT/GD.
Advertisements

Jens G Jensen CCLRC/RAL hepsysman 2005Storage Middleware SRM 2.1 issues hepsysman Oxford 5 Dec 2005.
1 The SciDAC Scientific Data Management Center: Infrastructure and Results Arie Shoshani Lawrence Berkeley National Laboratory SC 2004 November, 2004.
1 SRM-Lite: overcoming the firewall barrier for large scale file replication Arie Shoshani Alex Sim Lawrence Berkeley National Laboratory April, 2007.
Computing Sciences Directorate, L B N L 1 CHEP 2003 Storage Resource Management In the Grid Environment Alex Sim Junmin Gu Arie Shoshani Scientific Data.
1 CHEP 2003 Arie Shoshani Experience with Deploying Storage Resource Managers to Achieve Robust File replication Arie Shoshani Alex Sim Junmin Gu Scientific.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
Toni Saarinen, Tite4 Tomi Ruuska, Tite4 Earth System Grid - ESG.
A. Sim, CRD, L B N L 1 Data Management Foundations Workshop, Mar. 3, 2009 Storage in OSG and BeStMan Alex Sim Scientific Data Management Research Group.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Presented by The Earth System Grid: Turning Climate Datasets into Community Resources David E. Bernholdt, ORNL on behalf of the Earth System Grid team.
A. Sim, CRD, L B N L GIN-Data : SRM Island Inter-Op Testing With SRM-TESTER Alex Sim, Vijaya Natarajan Computational Research Division Lawrence Berkeley.
A. Sim, CRD, L B N L 1 Oct. 23, 2008 BeStMan Extra Slides.
Data management in grid. Comparative analysis of storage systems in WLCG.
SDM Center February 2, 2005 Progress on MPI-IO Access to Mass Storage System Using a Storage Resource Manager Ekow J. Otoo, Arie Shoshani and Alex Sim.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
A. Sim, CRD, L B N L 1 OSG Applications Workshop 6/1/2005 OSG SRM/DRM Readiness and Plan Alex Sim / Jorge Rodriguez Scientific Data Management Group Computational.
1 Use of SRMs in Earth System Grid Arie Shoshani Alex Sim Lawrence Berkeley National Laboratory.
Computing Sciences Directorate, L B N L 1 SC 2003 Storage Resource Managers: Essential Components for the Grid Arie Shoshani Staff: Alex Sim, Junmin Gu,
File and Object Replication in Data Grids Chin-Yi Tsai.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Bulk Data Movement: Components and Architectural Diagram Alex Sim Arie Shoshani LBNL April 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Andrew C. Smith – Storage Resource Managers – 10/05/05 Functionality and Integration Storage Resource Managers.
1 Meeting Location: LBNL Sept 18, 2003 The functionality of a Replica Registration Service Attendees Michael Haddox-Schatz, JLAB Ann Chervenak, USC/ISI.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
Intergrid KoM Santander 22 june, 2006 E-Infraestructure shared between Europe and Latin America José Manuel Gutiérrez
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
The Earth System Grid: A Visualisation Solution Gary Strand.
Web Portal Design Workshop, Boulder (CO), Jan 2003 Luca Cinquini (NCAR, ESG) The ESG and NCAR Web Portals Luca Cinquini NCAR, ESG Outline: 1.ESG Data Services.
The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
Fox 2 AISRP April 4-6, 2005  Earth System Grid  Grid-enabled OPeNDAP  Architecture - Server and Application access  Framework experience.
1 Grid File Replication using Storage Resource Management Presented By Alex Sim Contributors: JLAB: Bryan Hess, Andy Kowalski Fermi: Don Petravick, Timur.
GridFTP GUI: An Easy and Efficient Way to Transfer Data in Grid
Computing Sciences Directorate, L B N L 1 CHEP 2003 Standards For Storage Resource Management BOF Co-Chair: Arie Shoshani * Co-Chair: Peter Kunszt ** *
January 26, 2003Eric Hjort HRMs in STAR Eric Hjort, LBNL (STAR/PPDG Collaborations)
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
1 Research and Development. 2 R&D Agenda  Security  Bulk Data Movement  Data Replication and Mirroring  Monitoring  Metrics  Versioning  Product.
SDM Center Coupling Parallel IO to SRMs for Remote Data Access Ekow Otoo, Arie Shoshani and Alex Sim Lawrence Berkeley National Laboratory.
1 Use of SRM File Streaming by Gateway Alex Sim Arie Shoshani May 2008.
Computing Sciences Directorate, L B N L 1 SC 2003 Storage Resource Managers: Essential Components for the Grid Arie Shoshani Staff: Alex Sim, Junmin Gu,
Padova, 5 October StoRM Service view Riccardo Zappi INFN-CNAF Bologna.
ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
PPDG meeting, July 2000 Interfacing the Storage Resource Broker (SRB) to the Hierarchical Resource Manager (HRM) Arie Shoshani, Alex Sim (LBNL) Reagan.
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
Author - Title- Date - n° 1 Partner Logo WP5 Status John Gordon Budapest September 2002.
1 Xrootd-SRM Andy Hanushevsky, SLAC Alex Romosan, LBNL August, 2006.
Protocols and Services for Distributed Data- Intensive Science Bill Allcock, ANL ACAT Conference 19 Oct 2000 Fermi National Accelerator Laboratory Contributors:
Production Mode Data-Replication Framework in STAR using the HRM Grid CHEP ’04 Congress Centre Interlaken, Switzerland 27 th September – 1 st October Eric.
SRM-iRODS Interface Development WeiLong UENG Academia Sinica Grid Computing 1.
G RID D ATA M ANAGEMENT. D ATA M ANAGEMENT Distributed community of users need to access and analyze large amounts of data Requirement arises in both.
Current Globus Developments Jennifer Schopf, ANL.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
1 Scientific Data Management Group LBNL SRM related demos SC 2002 DemosDemos Robust File Replication of Massive Datasets on the Grid GridFTP-HPSS access.
A. Sim, CRD, L B N L 1 OSG Site Administrators Meeting, Dec. 13, 2007 Berkeley Storage Manager (BeStMan) Alex Sim Scientific Data Management Research Group.
Enabling Grids for E-sciencE EGEE-II INFSO-RI Status of SRB/SRM interface development Fu-Ming Tsai Academia Sinica Grid Computing.
Introduction to Storage Element Hsin-Wei Wu Academia Sinica Grid Computing Center, Taiwan.
9/20/04Storage Resource Manager, Timur Perelmutov, Jon Bakken, Don Petravick, Fermilab 1 Storage Resource Manager Timur Perelmutov Jon Bakken Don Petravick.
A. Sim, CRD, L B N L 1 Production Data Management Workshop, Mar. 3, 2009 BeStMan and Xrootd Alex Sim Scientific Data Management Research Group Computational.
Vincenzo Spinoso EGI.eu/INFN
Status of the SRM 2.2 MoU extension
Data Bridge Solving diverse data access in scientific applications
The Earth System Grid: A Visualisation Solution
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
Presentation transcript:

1 SRM-Lite: overcoming the firewall barrier for data movement Arie Shoshani Alex Sim Viji Natarajan Lawrence Berkeley National Laboratory SDM Center All-Hands Meeting November, 2007

2 Outline What are Resource Storage Managers (SRM) Requirement of using SRM behind firewalls Satisfying the Requirements Architecture Potential uses

3 Storage Resource Managers SRMs are middleware components whose function is to provide:SRMs are middleware components whose function is to provide: dynamic space allocation AND file management in spaces for storage components on the local or wide-area network Based on a common standard SRM (BeStMan) client/user applications Unix-based Disk Pools Examples of storage systems currently supported by SRMs dCache CASTOR CCLRC RAL GPFS SRM (DPM) SRM (StoRM) SRM/ dCache SRM/ CASTOR SRM (StoRM) Unix-based Disk Pools

4 Storage Resource Managers: Main concepts Non-interference with local policies Advance space reservations Dynamic space management Pinning file in spaces Support abstract concept of a file name: Site URL (SURL) Temporary assignment of file names for transfer: Transfer URL (TURL) Directory Management and ACLs Multi-file requests (srmRquestToPut, srmRequestToGet, srmCopy) Transfer protocol negotiation Peer to peer request support Support for asynchronous multi-file requests Support abort, suspend, and resume operations SRM relies on other services for data movement (GridFTP, HTTPS, SCP, …)

5 Concepts: Site URL and Transfer URL Provide: Site URL (SURL) URL known externally – e.g. in Replica Catalogs e.g. srm://ibm.cnaf.infn.it:8444/dteam/test Get back: transfer URL (TURL) Path can be different than SURL – SRM internal mapping Protocol chosen by SRM based on request protocol preference e.g. gsiftp://ibm139.cnaf.infn.it:2811//gpfs/dteam/test One SURL can have many TURL Files can be replicated in multiple storage components Files may be in near-line and/or on-line storage In light-weight SRM (a single file system on disk) SURL can be the same as TURL except protocol File sharing is possible Same physical file, but many requests Needs to be managed by SRM

6 Tomcat servlet engine Tomcat servlet engine MCS Metadata Cataloguing Services MCS Metadata Cataloguing Services RLS Replica Location Services RLS Replica Location Services SOAP RMI MyProxy server MyProxy server MCS client RLS client MyProxy client GRAM gatekeeper GRAM gatekeeper CAS Community Authorization Services CAS Community Authorization Services CAS client disk MSS Mass Storage System HPSS High Performance Storage System disk HPSS High Performance Storage System disk DRM Storage Resource Management DRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management HRM Storage Resource Management gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server gridFTP server openDAPg server openDAPg server gridFTP Striped server gridFTP Striped server LBNL LLNL ISI NCAR ORNL ANL DRM Storage Resource Management DRM Storage Resource Management Earth Science Grid Analysis Environment (in production for 4 years) >5000 users160 TBs managed SRMs are used and inter-communicate in several sites SRMs

7 Robust Data Movement provided by SRMs and DataMover Problem: move thousands of files robustlyProblem: move thousands of files robustly Takes many hours Need error recovery Mass storage systems failures Network failures Solution: Use Storage Resource Managers (SRMs)Solution: Use Storage Resource Managers (SRMs) File streaming paradigm By reserving and releasing storage space automatically Problem: too slowProblem: too slow Solution:Solution: in GridFTP Use parallel streams Use large FTP windows Pre-stage files from MSS Use concurrent transfers NCAR Anywhere LBNL Disk Cache Disk Cache SRM-COPY (thousands of files) SRM-GET (one file at a time) DataMover SRM (performs writes) SRM (performs reads) GridFTP GET (pull mode) stage files archive files Network transfer Get list of files MSS Example setup for Earth System Grid (ESG)

8 File tracking shows recovery from transient failures Total: 45 GBs

9 Requirements for SRM-Lite Run SRM behind a firewall Cannot have third party transfers (source/target is local) May not be able to run GridFTP Remote site may not support it Some communities choose not to use GSI Need support for multi-file transfer Or entire directory Need support for asynchronous request Also support for intermediate status of request Need to support concurrent file transfers

10 Satisfying the Requirements: SRM-Lite Run SRM behind a firewall Must have a client tool (SRM-Lite) May not be able to run GridFTP Support high-performance SCP: Use HPN-SSS from Pittsburgh supercomputing Center But, also use other transfer protocols (GridFTP, bbcp, https, …) Need support for multi-file transfer Manage queues for large requests Need support for asynchronous request SRM-Lite returns a “request token”; token can be used for “request status” Need to support concurrent file transfers Use multi-threading to manage concurrent transfers Monitor transfers and recover from mid-transfer interruptions

11 Scenario A: firewall at one site Disk Cache SSH Server NERSC SSH Channel (SCP) GridFTP/FTP/ BBCP/HTTP transfers Process StepsProcess Steps Login to ORNL using OTP At ORNL invoke SRM-Lite User composes XML input file, srmlite.xml for selected files/directories to copy from/to another site Or, user gives command line option for a selected file/directory SRM-Lite uses srmlite.xml or command line input to automatically Push/Pull files to/from NERSC Use multiple threads for concurrent transfers Disk Cache ORNL SRM- Lite OTP Login srmlite.xml Local Commands And Protocols Put example: Source: file:////my_directory/file_foo Target: scp://host/target_dir/file_foo Get example: Source: GridFTP://host/target_dir/file_foo Target: file:////my_directory/file_foo

12 Scenario B: one end has a firewall, The other end has SRM Disk Cache HPSS SRM NERSC GridFTP/FTP/ SCP transfers Disk Cache ORNL SRM- Lite OTP Login srmlite.txt SRM Request Put example: Source: file:////my_directory/file_foo Target: srm://host/target_dir/file_foo

13 Scenario C: firewalls at both ends Disk Cache SSH Server SSH Channel (SCP) Process StepsProcess Steps Login to Site1 using OTP At site1 invoke SRM-Lite SRM-Lite at site1 uses SSH to invoke SRM-Lite at site2 Use SSH channel for SCP Same as before: User composes XML input file, srmlite.xml for selected files/directories to copy from/to another site Or, user gives command line option for a selected file/directory Disk Cache SRM- Lite OTP Login srmlite.xml SRM- Lite site2site1

14 Scenario C: SRM-Lite manages MSS access SSH Server SSH Channel (SCP) SRM- Lite OTP Login srmlite.xml SRM- Lite site2site1 Disk Cache HPSS Disk Cache HPSS

15 GUI for SRM-Lite Used in ESG Special version for data movement to user workstations Called DataMover-Lite Versions exist for Linux, PC, Mac

16 Usage Combustion project The Applied Partial Differential Equations Center (APDEC) John Bell Efficient, robust data movement from sites behind firewalls At DoE and DoD sites Kepler-SRM-Lite actor To be used for managing multi-file transfers from sites behind firewalls Launch SRM-Lite remotely through SSH Initial version – help from NCSU: Pierre Mouallem Two modes Entire request Streaming file requests To be used in CPES workflows first with Norbert’s help