J Jensen / WP5 /RAL UCL 4/5 March 2004 GridPP / DataGrid wrap-up Mass Storage Management J Jensen

Slides:



Advertisements
Similar presentations
30-31 Jan 2003J G Jensen, RAL/WP5 Storage Elephant Grid Access to Mass Storage.
Advertisements

Andrew McNab - Manchester HEP - 24 May 2001 WorkGroup H: Software Support Both middleware and application support Installation tools and expertise Communication.
Andrew McNab - Manchester HEP - 22 April 2002 EU DataGrid Testbed EU DataGrid Software releases Testbed 1 Job Lifecycle Authorisation at your site More.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Andrew McNab - Manchester HEP - 2 May 2002 Testbed and Authorisation EU DataGrid Testbed 1 Job Lifecycle Software releases Authorisation at your site Grid/Web.
A conceptual model of grid resources and services Authors: Sergio Andreozzi Massimo Sgaravatto Cristina Vistoli Presenter: Sergio Andreozzi INFN-CNAF Bologna.
Andrew McNab - Manchester HEP - 6 November Old version of website was maintained from Unix command line => needed (gsi)ssh access.
DataGrid is a project funded by the European Union CHEP 2003 – March 2003 – Title – n° 1 Grid Data Management in Action Experience in Running and.
GridPP meeting Feb 03 R. Hughes-Jones Manchester WP7 Networking Richard Hughes-Jones.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
GridPP9 – 5 February 2004 – Data Management DataGrid is a project funded by the European Union GridPP is funded by PPARC WP2+5: Data and Storage Management.
CMS Report – GridPP Collaboration Meeting VI Peter Hobson, Brunel University30/1/2003 CMS Status and Plans Progress towards GridPP milestones Workload.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
DataGrid is a project funded by the European Commission under contract IST GridPP-2 Middleware 4 th -5 th Mar 2004 Information and Monitoring.
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL U.S. ATLAS Physics and Computing Advisory Panel Review Argonne National Laboratory Oct 30, 2001.
Data Management The GSM-WG Perspective. Background SRM is the Storage Resource Manager A Control protocol for Mass Storage Systems Standard protocol:
Δ Storage Middleware GridPP10 What’s new since GridPP9? CERN, June 2004.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Author - Title- Date - n° 1 Partner Logo WP5 Summary Paris John Gordon WP5 6th March 2002.
INFSO-RI Enabling Grids for E-sciencE SA1 and gLite: Test, Certification and Pre-production Nick Thackray SA1, CERN.
3-Jul-02D.P.Kelsey, Security1 Security meetings Report to EDG PTB 3 Jul 2002 David Kelsey CLRC/RAL, UK
GLite – An Outsider’s View Stephen Burke RAL. January 31 st 2005gLite overview Introduction A personal view of the current situation –Asked to be provocative!
GridPP Building a UK Computing Grid for Particle Physics Professor Steve Lloyd, Queen Mary, University of London Chair of the GridPP Collaboration Board.
Light weight Disk Pool Manager experience and future plans Jean-Philippe Baud, IT-GD, CERN September 2005.
LCG EGEE is a project funded by the European Union under contract IST LCG PEB, 7 th June 2004 Prototype Middleware Status Update Frédéric Hemmer.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
11/5/2001WP5 UKHEPGRID1 WP5 Mass Storage UK HEPGrid UCL 11th May Tim Folkes, RAL
SRM & SE Jens G Jensen WP5 ATF, December Collaborators Rutherford Appleton (ATLAS datastore) CERN (CASTOR) Fermilab Jefferson Lab Lawrence Berkeley.
Presenter Name Facility Name UK Testbed Status and EDG Testbed Two. Steve Traylen GridPP 7, Oxford.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Andrew McNabSecurity Middleware, GridPP8, 23 Sept 2003Slide 1 Security Middleware Andrew McNab High Energy Physics University of Manchester.
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
2-Sep-02Steve Traylen, RAL WP6 Test Bed Report1 RAL and UK WP6 Test Bed Report Steve Traylen, WP6
Andrew McNab - Manchester HEP - 17 September 2002 UK Testbed Deployment Aim of this talk is to the answer the questions: –“How much of the Testbed has.
WP3 Information and Monitoring Rob Byrom / WP3
10 May 2001WP6 Testbed Meeting1 WP5 - Mass Storage Management Jean-Philippe Baud PDP/IT/CERN.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
Author - Title- Date - n° 1 Partner Logo WP5 Status John Gordon Budapest September 2002.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL DOE/NSF Review of US LHC Software and Computing Fermilab Nov 29, 2001.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
J Jensen/J Gordon RAL Storage Storage at RAL Service Challenge Meeting 27 Jan 2005.
GridPP2 Data Management work area J Jensen / RAL GridPP2 Data Management Work Area – Part 2 Mass storage & local storage mgmt J Jensen
Bob Jones – Project Architecture - 1 March n° 1 Project Architecture, Middleware and Delivery Schedule Bob Jones Technical Coordinator, WP12, CERN.
18/12/03PPD Christmas Lectures 2003 Grid in the Department A Guide for the Uninvolved PPD Computing Group Christmas Lecture 2003 Chris Brew.
Massimo Sgaravatto INFN Padova
The EDG Testbed Deployment Details
Gavin McCance University of Glasgow GridPP2 Workshop, UCL
Classic Storage Element
StoRM: a SRM solution for disk based storage systems
Moving the LHCb Monte Carlo production system to the GRID
EO Applications Parallel Session
Comparison of LCG-2 and gLite v1.0
John Gordon EDG Conference Barcelona, May 2003
Fabric and Storage Management
Testbed Software Test Plan Status
Grid Portal Services IeSE (the Integrated e-Science Environment)
WP7 objectives, achievements and plans
DCache things Paul Millar … on behalf of the dCache team.
Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF
Data Management cluster summary
A conceptual model of grid resources and services
Report on GLUE activities 5th EU-DataGRID Conference
Status and plans for bookkeeping system and production tools
gLite The EGEE Middleware Distribution
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

J Jensen / WP5 /RAL UCL 4/5 March 2004 GridPP / DataGrid wrap-up Mass Storage Management J Jensen

WP5 Mass Storage Management - n° 2 Objectives u Develop Uniform interface to mass storage u Interface with Replica Management u Users shouldn’t need to know what the storage system is u Publish information

WP5 Mass Storage Management - n° 3 Objectives – uniform interface u Control interface n Developed own interface, later deployed as web service n Now moving toward SRM 1 for int’l compatibility u Data Transfer interface n Globus GridFTP required u Information interface n Publish to MDS – then to R-GMA via GIN

WP5 Mass Storage Management - n° 4 Achievements – Storage Element u Flexible architecture n Cope with changing requirements n Pluggable features such as access control n Easy to extend u Security n Secure interfaces n File level access control (not in EDG 2.1 though) u Currently supports CASTOR, HPSS, ADS, as well as disk = SE “Classic” = Not in EDG 2.1 += EDG SE Information Producers GridFTP RFIO NFS SRM Core SE Core Mass Storage Interfaces Data Transfer InformationControl

WP5 Mass Storage Management - n° 5 Storage Element team u Most developments done by 3-4 people at RAL n O Synge, R Tam, G Johnson, T Shah, J Jensen – code (>) n J Gordon, J Jensen – WP mgmt u Contributions from Manchester n A McNab et al – GSI, GACL, pool accounts u Contributions from Liverpool n M George, A Washbrook – testing, ACL handling u Contributions from Edinburgh n A Earl – testing, metadata stuff u Other contributions n ADS team at RAL n WP2, WP3,…

WP5 Mass Storage Management - n° 6 Storage Element u SE’s performance is acceptable n Performance dominated by data transfer times s E.g. 0.7 second per file for small files via GridFTP n Performance dominated by mass storage access s 10 minutes to stage in file from ADS s 30 minutes to stage in file from CASTOR n Basic core performance – 0.3 seconds per command

WP5 Mass Storage Management - n° 7 Storage Element u Scalability n Scalability an issue, particularly for EO with many small files n Release 2.1 : files ok, files not n Limits reached in underlying file system n Being addressed in new metadata implementation u Install and configure n Manual n LCFG-ng n Difficult with heterogonous storage interface machines

WP5 Mass Storage Management - n° 8 Achievements – SE deployment EDG SEs as of 17 Feb 2004 Note Taiwan ! Data from R-GMA (WP3) and mapcenter (WP7) Many sites have more than one SE – a few sites have only Classic SE London alone has three sites: IC, UCL, QMUL

WP5 Mass Storage Management - n° 9 Int’l Collaborations n SRM s Collaboration between Fermilab, Jefferson Lab, Lawrence Berkeley, RAL, CERN s Contributed to the design of the SRM version 2 protocol n GLUE s Contributed to the design of GLUE storage schema n GGF s Tracked developments in appropriate working groups s SRM not currently part of GGF – may be from ‘10 n Dissemination s Talks at conferences and in working groups, publications,…

WP5 Mass Storage Management - n° 10 Achievements beyond release 2.1 u Access Control Lists (ACL) n Based on GACL n Fine-grained: Access based on user, file, and operation n Files can share ACLs n Work required to make more usable and user- friendly

WP5 Mass Storage Management - n° 11 Lessons learned u Choice of architecture was definitely right n Architecture has successfully coped with changing requirements u Look for opportunities for component reuse n Used web services deployment and security components provided by WP2 n Deployed and developed further information producers supplied by WP3 n Almost all parts of the Data Transfer components developed externally

WP5 Mass Storage Management - n° 12 Lessons learned u Prototype implementations live longer than expected n SE’s metadata system was implemented as prototype n Scalability issues discovered on application testbed

WP5 Mass Storage Management - n° 13 Lessons learned u Inter-WP integration requires a lot of effort ! n At times, nearly 100% of WP5 devoted to ITeam work and site installation support n Storage interface machines are heterogeneous s More installation support was required u Need to agree standard protocols n Standards must be open and well-defined

WP5 Mass Storage Management - n° 14 Exploitation u Used in final EU review middleware demo to access mass storage u Used successfully on EDG testbeds by all EDG applications WPs u The SE provides the Grid interface to ADS at RAL n This is important because ADS is being used by a large variety of scientific applications groups n To be used on LCG testbed Really Soon Now™

WP5 Mass Storage Management - n° 15 Exploitation u “Atlas Data Challenge 1.5” n SE is currently used by Atlas to transfer data between ADS at RAL and CASTOR at CERN n About 1500 files; 2 TB in total n Files are copied by EDG RM and registered in an RC at NIKHEF n This work is being done by Atlas outside the EDG testbeds n Keep running into problems: s UKHEP root expiry s SEs at CERN being “recycled” by ARDA s Limits found in parallel transfer in GridFTP