CERN SRM Development Benjamin Coutourier Shaun de Witt CHEP06 - Mumbai.

Slides:



Advertisements
Similar presentations
30-31 Jan 2003J G Jensen, RAL/WP5 Storage Elephant Grid Access to Mass Storage.
Advertisements

HEPiX GFAL and LCG data management Jean-Philippe Baud CERN/IT/GD.
Jens G Jensen CCLRC/RAL hepsysman 2005Storage Middleware SRM 2.1 issues hepsysman Oxford 5 Dec 2005.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Acknowledgments Byron Bush, Scott S. Hilpert and Lee, JeongKyu
EGEE is a project funded by the European Union under contract IST Using SRM: DPM and dCache G.Donvito,V.Spinoso INFN Bari
Storage: Futures Flavia Donno CERN/IT WLCG Grid Deployment Board, CERN 8 October 2008.
CASTOR SRM v1.1 experience Presentation at SRM meeting 01/09/2004, Berkeley Olof Bärring, CERN-IT.
– n° 1 StoRM latest performance test results Alberto Forti Otranto, Jun
A. Sim, CRD, L B N L GIN-Data : SRM Island Inter-Op Testing With SRM-TESTER Alex Sim, Vijaya Natarajan Computational Research Division Lawrence Berkeley.
Maarten Litmaath (CERN), EGEE User Forum, CERN, 2006/03/02 (v3) Use of the SRM interface Use case What is the SRM? –Who develops it? –Is it a standard?
Distributed File Systems
CERN, 29 August 2006 Status Report Riccardo Zappi INFN-CNAF, Bologna.
A. Sim, CRD, L B N L 1 Oct. 23, 2008 BeStMan Extra Slides.
Data management in grid. Comparative analysis of storage systems in WLCG.
Status report on SRM v2.2 implementations: results of first stress tests 2 th July 2007 Flavia Donno CERN, IT/GD.
SRM 2.2: status of the implementations and GSSD 6 th March 2007 Flavia Donno, Maarten Litmaath INFN and IT/GD, CERN.
CERN IT Department CH-1211 Geneva 23 Switzerland t Storageware Flavia Donno CERN WLCG Collaboration Workshop CERN, November 2008.
Module 13 Implementing Business Continuity. Module Overview Protecting and Recovering Content Working with Backup and Restore for Disaster Recovery Implementing.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Δ Storage Middleware GridPP10 What’s new since GridPP9? CERN, June 2004.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Andrew C. Smith – Storage Resource Managers – 10/05/05 Functionality and Integration Storage Resource Managers.
1 Meeting Location: LBNL Sept 18, 2003 The functionality of a Replica Registration Service Attendees Michael Haddox-Schatz, JLAB Ann Chervenak, USC/ISI.
SRM workshop – September’05 1 SRM: Expt Reqts Nick Brook Revisit LCG baseline services working group Priorities & timescales Use case (from LHCb)
Light weight Disk Pool Manager experience and future plans Jean-Philippe Baud, IT-GD, CERN September 2005.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
New stager commands Details and anatomy CASTOR external operation meeting CERN - Geneva 14/06/2005 Sebastien Ponce, CERN-IT.
Server-side Programming The combination of –HTML –JavaScript –DOM is sometimes referred to as Dynamic HTML (DHTML) Web pages that include scripting are.
SRM Monitoring 12 th April 2007 Mirco Ciriello INFN-Pisa.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Pre-GDB Storage Classes summary of discussions Flavia Donno Pre-GDB.
WLCG Grid Deployment Board, CERN 11 June 2008 Storage Update Flavia Donno CERN/IT.
SRM & SE Jens G Jensen WP5 ATF, December Collaborators Rutherford Appleton (ATLAS datastore) CERN (CASTOR) Fermilab Jefferson Lab Lawrence Berkeley.
1 LHCb File Transfer framework N. Brook, Ph. Charpentier, A.Tsaregorodtsev LCG Storage Management Workshop, 6 April 2005, CERN.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/03/08 An update on SRM Mumbai & subsequent discussion summary –
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Management cluster summary Krzysztof Nienartowicz JRA1 All Hands meeting, Helsinki.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/06/07 SRM v2.2 working group update Results of the May workshop at FNAL
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
CSI 3125, Preliminaries, page 1 SERVLET. CSI 3125, Preliminaries, page 2 SERVLET A servlet is a server-side software program, written in Java code, that.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
CERN IT Department CH-1211 Genève 23 Switzerland t HEPiX Conference, ASGC, Taiwan, Oct 20-24, 2008 The CASTOR SRM2 Interface Status and plans.
Distributed Logging Facility Castor External Operation Workshop, CERN, November 14th 2006 Dennis Waldron CERN / IT.
INFSO-RI Enabling Grids for E-sciencE SRMv2.2 in DPM Sophie Lemaitre Jean-Philippe.
Author - Title- Date - n° 1 Partner Logo WP5 Status John Gordon Budapest September 2002.
CASTOR project status CASTOR project status CERNIT-PDP/DM October 1999.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Overview of DMLite Ricardo Rocha ( on behalf of the LCGDM team.
WLCG Grid Deployment Board CERN, 14 May 2008 Storage Update Flavia Donno CERN/IT.
SRM-2 Road Map and CASTOR Certification Shaun de Witt 3/3/08.
Handling of T1D0 in CCRC’08 Tier-0 data handling Tier-1 data handling Experiment data handling Reprocessing Recalling files from tape Tier-0 data handling,
1 Xrootd-SRM Andy Hanushevsky, SLAC Alex Romosan, LBNL August, 2006.
SRM-iRODS Interface Development WeiLong UENG Academia Sinica Grid Computing 1.
Grid Deployment Board 5 December 2007 GSSD Status Report Flavia Donno CERN/IT-GD.
1 SRM v2.2 Discussion of key concepts, methods and behaviour F. Donno CERN 11 February 2008.
CASTOR in SC Operational aspects Vladimír Bahyl CERN IT-FIO 3 2.
SRM 2.2: experiment requirements, status and deployment plans 6 th March 2007 Flavia Donno, INFN and IT/GD, CERN.
Arie Shoshani – Dec 2002 Arie Shoshani meeting held at CERN December, 2002 Comments and Observations SRM + GLUE.
Bologna, March 30, 2006 Riccardo Zappi / Luca Magnoni INFN-CNAF, Bologna.
CERN IT Department CH-1211 Genève 23 Switzerland t Towards end-to-end debugging for data transfers Gavin McCance Javier Conejero Banon Sophie.
CERN IT Department CH-1211 Genève 23 Switzerland t DPM status and plans David Smith CERN, IT-DM-SGT Pre-GDB, Grid Storage Services 11 November.
SRM 2.2 Shaun de Witt Introduction Design Design Dependencies Dependencies Set-up Set-up Requirements Requirements Support Support.
Jean-Philippe Baud, IT-GD, CERN November 2007
Status of the SRM 2.2 MoU extension
SRM v2.2 / v3 meeting report SRM v2.2 meeting Aug. 29
StoRM Architecture and Daemons
SRM Developers' Response to Enhancement Requests
Database Architectures and the Web
The INFN Tier-1 Storage Implementation
Data Management cluster summary
INFNGRID Workshop – Bari, Italy, October 2004
Presentation transcript:

CERN SRM Development Benjamin Coutourier Shaun de Witt CHEP06 - Mumbai

Background Original version based on SRM 1.1 Specification implemented by CERN Latest version based on SRM Specification Collaborative Effort CERN (CH) RAL (UK) Based on modified WSDL ( wg/srm.v2.1.1.modified.wsdl) wg/srm.v2.1.1.modified.wsdl

Tools Based on modified WSDL ( Selected gsoap-2.7.2

Tools cgsi-soap plugin Oracle (10.2.1) umbrello ( g++ (3.2.3) valgrind

Design Objectives Low latency Short requests handled synchronously Longer requests (involving CASTOR stager) mostly handled asynchronously Multi-threading architecture Robustness Asynchronous requests stored in database

Design Objectives Interoperability Actually a common theme with all SRMs Using common WSDL Tested CASTOR SRM with DCACHE clients and DCACHE SRM with CASTOR clients Robustness Load testing submitting many requests near simultaneously – using Tier1 machines

Design Database SRM Server SRM Daemon CASTOR Stager CASTOR Nameserver SRM CASTOR Client Clients

Design Significant reuse of CASTOR code dlf threadpools database services IObject model

Server Design Thread pool default 10 threads but can be overridden Currently no maximum, but it should probably exist Soap backlog default 40 messages, but can be overridden

Daemon Design Four dedicated threads pool of threads for PUT requests pool of threads for GET requests single thread for COPY request single thread for SRM Garbage collection Selection from database triggered by database entry (TBC).

Data Flow Summary Directory Functions client – server – nameserver PrepareToXXX, Copy, putDone client – server – daemon – stager Other Data Transfer client - server Space Management client - server

Development Issues gsoap Steep learning curve default namespace issues sometimes generated ns1__, sometimes ns2__ We explicitly use srm__ API changes between minor releases using same wsdl Meaning the generated API’s.

Development Issues Umbrello Not as robust or well documented as similar commercial tools Spent several days recovering from undocumented problems. ORACLE Need matching versions of client and sever libraries (not v9 clients and v10 servers anyway

Interoperability issues SRM Specs do not state when/where to use status codes For a request like srmRm with multiple files If any file succeeds, we return SUCCESS If all files fail, we return FAILURE Each file that is successful, we return DONE Each file that fails we return FAILURE

Interoperability Issues Explanation in return status CASTOR SRM returns empty string DPM SRM returns NULL Type Promotion Castor only supports Permanent file types If client requests volatile or durable – SRM returns SUCCESS Return PERMANENT is file structure

Status By end of January All methods implemented except Permission functions Full regression test suite available Still to do Permission functions VOMS integration Complete memory leak checking Thread Tuning/Signal handling/documentation

Status Few issues with interface to CASTOR still need investigating. Some methods only log first DLF call Some APIs which accept multiple files only return a single result.

CASTOR specific Only permanent files supported Space reservation is notional Handled entirely within SRM with no reference to CASTOR CASTOR storage considered semi-infinite srmLs limits number of returns Configurable limit Set to 2048 currently

CASTOR specific Suspend/Resume not supported Dynamic space compacting not supported Pin lifetimes are advisory Used in weighting CASTOR garbage collection policy Pins are applied once files are staged putDone issued or file staged.

Castor Specific Non-static TURL Need to call status to get new TURL srmRmdir does not support recursion