Core SRB Technology for 2005 NCOIC Workshop By Michael Wan And Wayne Schroeder SDSC SDSC/UCSD/NPACI.

Slides:



Advertisements
Similar presentations
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Advertisements

The Storage Resource Broker and.
The Storage Resource Broker and.
Overview of the SDSC Storage Resource Broker Wayne Schroeder (and other SRB team members) May, 2004 San Diego Supercomputer Center, University of California.
Peter Berrisford RAL – Data Management Group SRB Services.
Database System Concepts and Architecture
Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
San Diego Supercomputer CenterNational Partnership for Advanced Computational Infrastructure1 Grid Based Solutions for Distributed Data Management Reagan.
Security Requirements for Shared Collections Storage Resource Broker Reagan W. Moore
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Applying Data Grids to Support Distributed Data Management Storage Resource Broker Reagan W. Moore Ian Fisk Bing Zhu University of California, San Diego.
Jean-Yves Nief, CC-IN2P3 Wilko Kroeger, SCCS/SLAC Adil Hasan, CCLRC/RAL HEPiX, SLAC October 11th – 13th, 2005 BaBar data distribution using the Storage.
On Developing Data Grid Workflows using Storage Resource Broker (SRB) and Kepler Tim H. Wong - UC Davis Efrat Frank - SDSC Bertram Ludäscher - UC Davis.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Interactions with Firewalls Michael Wan Reagan Moore SDSC/UCSD/NPACI.
Linux Operations and Administration
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Chapter 9: Novell NetWare
National Partnership for Advanced Computational Infrastructure Digital Library Architecture Reagan Moore Chaitan Baru Amarnath Gupta George Kremenek Bertram.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Jan Storage Resource Broker Managing Distributed Data in a Grid A discussion of a paper published by a group of researchers at the San Diego Supercomputer.
Rule-Based Data Management Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar {moore, schroede, mwan, {moore, schroede, mwan,
2. Database System Concepts and Architecture
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
February 2-3, 2006SRB Workshop, San Diego P eter Cao, NCSA Mike Wan, SDSC Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration Object-level.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Grid Services/SRB/SRM & Practical Hai-Ning Wu Academia Sinica Grid Computing.
December 1, 2005HDF & HDF-EOS Workshop IX P eter Cao, NCSA December 1, 2005 Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration.
The Network Performance Advisor J. W. Ferguson NLANR/DAST & NCSA.
File and Object Replication in Data Grids Chin-Yi Tsai.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
BaBar Data Distribution using the Storage Resource Broker Adil Hasan, Wilko Kroeger (SLAC Computing Services), Dominique Boutigny (LAPP), Cristina Bulfon.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Integrating HDF5 with SRB The HDF5-SRB Architecture Peter Cao, HDF, NCSA February 24, 2005.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Author - Title- Date - n° 1 Partner Logo WP5 Summary Paris John Gordon WP5 6th March 2002.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
Experiment Management System CSE 423 Aaron Kloc Jordan Harstad Robert Sorensen Robert Trevino Nicolas Tjioe Status Report Presentation Industry Mentor:
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
SAN DIEGO SUPERCOMPUTER CENTER By: Roman Olschanowsky An Introduction to the.
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
Michael Doherty RAL UK e-Science AHM 2-4 September 2003 SRB in Action.
1 AHM -2-4 Sept 2003 e-Science Centre Running SRB Ananta Manandhar.
ITGS Network Architecture. ITGS Network architecture –The way computers are logically organized on a network, and the role each takes. Client/server network.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
Biomedical Informatics Research Network The Storage Resource Broker & Integration with NMI Middleware Arcot Rajasekar, BIRN-CC SDSC October 9th 2002 BIRN.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
The Storage Resource Broker and.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
Data Infrastructure in the TeraGrid Chris Jordan Campus Champions Presentation May 6, 2009.
1 eScience Grid Environments th May 2004 NESC - Edinburgh Deployment of Storage Resource Broker at CCLRC for E-science Projects Ananta Manandhar.
IRODS Advanced Features Michael Wan
Building Preservation Environments from Federated Data Grids Reagan W. Moore San Diego Supercomputer Center Storage.
High Performance Storage System (HPSS) Jason Hick Mass Storage Group HEPiX October 26-30, 2009.
An Overview of iRODS Integrated Rule-Oriented Data System
Introduction to Data Management in EGI
The Client/Server Database Environment
Chapter 3: Windows7 Part 4.
Arcot Rajasekar Michael Wan Reagan Moore (sekar, mwan,
Presentation transcript:

Core SRB Technology for 2005 NCOIC Workshop By Michael Wan And Wayne Schroeder SDSC SDSC/UCSD/NPACI

Outline Basic Concepts behind SRB SRB architecture SRB features SRB Usage Model Wayne: –SRB productization - Installation, Administration, etc –Security and Authentication –Examples and demo

Initial Design of SRB Transparency and Uniformity –Data are increasingly distributed –Design Goal – use a single interface and authorization mechanism to access data across: –Multiple hosts –Multiple OS platforms –Multiple resource type (UNIX FS, HPSS, UniTree, DBMS..)

Initial Design of SRB Global view –Global Logical Name space – Data organization UNIX like directories (collections) and files (data) Mapping of logical name to physical attributes - host address, physical path. UNIX like API and utilities –Single Global User Name Space Single sign-on No need for UNIX account on every systems Robust access control

SRB Architecture Federated middleware system Client/server model – –Federation of resource servers with uniform interfaces client-server server-server - Each request handler has 2 versions –Local –Remote – pass off to server that can handle the request –All Servers use same software Simplicity – easy to implement, easy to debug –Robust access control user level, grant access to multiple users group level tickets MCAT – –Metadata catalog

Federation of Servers MCAT Server1 Server2 Mcat Server

SRB as a Data Grid SRB MCAT DB SRB Data Grid has arbitrary number of servers Complexity is hidden from users

SRB server design Three layers design –Top layer Interacts with clients and other servers through tcp/ip sockets User authentication Handle function requests – parses requests and invokes handlers in middle and bottom layers.

SRB server design (cont2) Middle layer (logical layer) –Most requests pass through here –Input parameters are in their logical representations (logical path name, logical resource name) –Generally, two types of requests – Data access – –Queries MCAT, translates from logical to physical representations –Calls functions in the bottom (physical) layer to access data Metadata access – –Interacts with MCAT

SRB server design (con2) –Bottom layer (physical layer) Where all data I/O to/from resources are done Handles three types of resources File system –Drivers to interface with different FS –FS supported : UNIX, HPSS, ADS, UniTree, gridFTP (to be released) DB large objects DB tables –Access DB tables (query, insert, …)

SRB Features -Authentication Support 2 authentication schemes –Encrypt1 (SDSC) – No plain text password over the net –GSI (Globus) –Wayne will give details

Performance Enhancement Parallel I/O –For transferring large files –Uses multi-threads for data transfer and disk I/O –Interface with HPSS’s mover protocol for parallel I/O –Parallel third party transfer for copy and replicate –One hop data transfer between client and data resource Bulk Operation –Uploading and downloading large number of small files –Multi-threads –Bulk registration – 500 files in one call –3-10 times speedup

SRB server1 SRB agent SRB server2 Sput – serial mode MCAT Sput SRB agent srbObjCreate srbObjWrite 1.Logical-to-Physical mapping 2. Identification of Replicas 3.Access & Audit Control Peer-to-peer Request Server(s) Spawning Data Transfer R

SRB server1 SRB agent SRB server2 Parallel mode Data Transfer – Client Initiated MCAT Sput -M SRB agent srbObjPut 1.Logical-to-Physical mapping 2. Identification of Replicas 3.Access & Audit Control Return socket addr., port and cookie Connect to server Data transfer R 5 6

Performance Enhancement (cont1) Container – –physical grouping of small files –for tape I/O or archival resources –Easy to use, transparent to users

Data Replication A SRB file can have multiple replica Replica can be stored in different resources Sls –l mfile –fedsrbbrick8 0 demoResc % mfile –fedsrbbrick8 1 demoResc % mfile Commands that uses replica –Sreplicate – replicate a file to the specified resource –Sbackupsrb – backup a file to the specified resource –SsyncD – Synchronize the replica of a file

PhyMove –move SRB files to another resource Move files to another resource without making another replica Normally used by admin to move files around Bulk phyMove – large number of small files Parallel I/O – large files Container – move files into container Heavily used by the BBSRC project for distributed archive. –Files uploaded to local server –Files eventually moved to a central archival resource by admin

Performance Enhancement (cont2) Use of checksum –a MCAT metadata associated with a file –Checksum routines is part of server and client codes –For verification and synchronization of data –Built into most data handling utilities Sput, Sget, Srsync, Schksum

Metadata in SRB SRB System Metadata Free-form Metadata (User-defined) –Attribute-Value-Unit Triplets… Extensible Schema Metadata –User Defined –Tables integrated into MCAT Core Schema External Database Metadata operations –Metadata Insertion through User Interfaces –Bulk Metadata Insertion –Template based Metadata Extraction –Query Metadata through well defined Interfaces

SRB Proxy operation Perform operations on server on behalf of user –Operation where data is located –File format conversion, md5 checksum, subsetting and filtering, etc Two types of proxy operations –Proxy commands Server fork and exec executable/script on server Pipe output back to client –Proxy functions Functions built into server Well defined framework for writing proxy functions

HDF5-SRB Model Data flow Client API srbObjRequest(void *obj, int objID) Server API srbObjProcess(void *obj, int objID) 1. packMsg() 2. unpackMsg() 3. H5Obj::op() 4. Access file 5. packMsg() 6. unpackMsg() SRB Server HDF5 Library HDF5 file

Zone Federation Federation of multiple MCATs –MCAT ZONE defines a federation of SRB resources controlled by a single MCAT Each Zone has full control of its own administrative domain Each Zone can operate entirely independently from other zone. Data and Resource sharing across ZONES –Use storage resources in foreign zones –Share data across zones –Copy data across zones

Peer to peer Federated MCAT Zone MCAT1 MCAT2 MCAT3 Server1.1 Server1.2 Server2.1 Server2.2 Server3.1

SRB Client Implementations A set of Basic APIs –Over 160 APIs –Used by all clients to make request to servers Scommands –Unix like command line utilities for UNIX and Window platforms –Over 60 - Sls, Scp, Sput, Sget …

SRB Client Implementations (cont) inQ – Window GUI browser Jargon – Java SRB client classes –Pure Java implementation mySRB – Web based GUI –run using web browser Java Admin Tool –GUI for User and Resource management Matrix – Web service for SRB work flow

inQ Windows GUI

MySRB – Web Based SRB Interface SRB Browser Advanced Metadata manipulation

SRB Usage Model Various Usage models Specific Usages –SLAC’s Babar experiment –UK eScience BBSRC –BIRN

SRB Configuration – Peer-to-peer Data Grid Resource server Resource server Resource server Resource server Data sharing, no central resourcet Projects – NARA, BIRN

SRB Configuration - Exploding Star Source Server Satellite server Satellite server Satellite server Satellite server Satellite server Data source – physics experiment Projects – Babar, kek

SRB Configuration - Imploding Star Central Cache Server Satellite source server Satellite source server Satellite source server Satellite source server Satellite source server Archival Storage Model Projects – UK eScience – BBSRC Central Archival server

Peer to peer Federation of MCAT Zone MCAT1 MCAT2 MCAT3 Server1.1 Server1.2 Server2.1 Server2.2 Server3.1

Summary of the Babar Project Preproduction evaluation – 2003 –Highlight of Wilco Kroeger’s (SLAC) talk at IEEE 2003 –Title - “Distributing Babar Data using SRB” BaBar Computing resources are geographically distributed: 5 Tier-A center GridKA (D), IN2P3 (F), INFN-Padova (I), RAL (UK), SLAC (USA) Data have to be replicated to the Tier-A sites. Number of files is 1M. Size 100’s TB

Babar Preproduction – SRB Usage Allows transparent access to files. –Don’t need to know host or storage medium (disk,tape). Accessing files/collections by attributes. –Find files that were produced at a certain time or site. –Find collections from a particular run period. Preproduction test – 2 weeks of MCAT and file transfer tests

Babar Production Update Transferred ~70 Tb and 140K files Peak rate ~2 Tb/day. Average rate – 1 Tb/day Downtime encountered – hardware problem –DB updates Plan to federate SLAC and In2p3 Zones – –In2p3 picks up some of the load Thanks to Wilko Kroeger (SLAC) and Jean- Yves Nief (In2p3) for the info

UK eScience BBSRC Archival of Biological Data from 16 sites to a central resource Data ingested into local resources Admin uses bulk Sphymove to move data from local resources to a central cache Moves data into containers Replicates containers to cache resource at RAL Replicates containers to ADS archival at RAL Removes cache copies

UK eScience BBSRC Develop some software on their own –User interface using Jargon GUI Users not exposed to all SRB functionalities –Request tracker – track data movement after ingestion Status –Project started at beginning of this year –Just done with pilot program using SRB3.2 –Upgrading to 3.3 for production

Biomedical Informatics Research Network (BIRN) Major collaboration with SDSC, several of the projects’ Co-Investigators and Co-PIs are at SDSC.. SRB provides the ability to transparently share data across remote sites.

The BIRN SRB Data Grid

The BIRN Data Grid

SRB in BIRN BIRN Toolkit Mediator Viewing/Visualization Queries/ResultsApplications Data Management File System MCAT HPSS Data Model Data Access Data Grid Computational Grid Collaboration NMI Grid Management Globus GridPort Scheduler Distributed Resources Database SRB Database