GFAL 2.0 Devresse Adrien CERN lcgutil team lcgutil-support@cern.ch.

Slides:



Advertisements
Similar presentations
HEPiX GFAL and LCG data management Jean-Philippe Baud CERN/IT/GD.
Advertisements

Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Workflows over Grid-based Web services General framework and a practical case in structural biology gLite 3.0 Data Management David García Aristegui Grid.
Alejandro Alvarez Ayllon on behalf of the DM Clients developers GFAL/LCGUTIL to GFAL2/GFAL2 migration.
EGEE is a project funded by the European Union under contract IST Using SRM: DPM and dCache G.Donvito,V.Spinoso INFN Bari
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
Data Management The GSM-WG Perspective. Background SRM is the Storage Resource Manager A Control protocol for Mass Storage Systems Standard protocol:
Xrootd, XrootdFS and BeStMan Wei Yang US ATALS Tier 3 meeting, ANL 1.
The LCG File Catalog (LFC) Jean-Philippe Baud – Sophie Lemaitre IT-GD, CERN May 2005.
New DM Clients and retirement plans for old ones Alejandro Alvarez Ayllon on behalf of the DM Clients developers IT/SDC 11/12/2013.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
 CASTORFS web page - CASTOR web site - FUSE web site -
INFSO-RI Enabling Grids for E-sciencE gLite Data Management and Interoperability Peter Kunszt (JRA1 DM Cluster) 2 nd EGEE Conference,
Introduction to HDFS Prasanth Kothuri, CERN 2 What’s HDFS HDFS is a distributed file system that is fault tolerant, scalable and extremely easy to expand.
WebFTS File Transfer Web Interface for FTS3 Andrea Manzi On behalf of the FTS team Workshop on Cloud Services for File Synchronisation and Sharing.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT DPM / LFC and FTS news Ricardo Rocha ( on behalf of the IT/GT/DMS.
SEE-GRID-SCI Storage Element Installation and Configuration Branimir Ackovic Institute of Physics Serbia The SEE-GRID-SCI.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
CERN IT Department CH-1211 Geneva 23 Switzerland GT HTTP solutions for data access, transfer, federation Fabrizio Furano (presenter) on.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
Andrea Manzi CERN On behalf of the DPM team HEPiX Fall 2014 Workshop DPM performance tuning hints for HTTP/WebDAV and Xrootd 1 16/10/2014.
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Overview of DMLite Ricardo Rocha ( on behalf of the LCGDM team.
1 Xrootd-SRM Andy Hanushevsky, SLAC Alex Romosan, LBNL August, 2006.
DMLite GridFTP frontend Andrey Kiryanov IT/SDC 13/12/2013.
EMI is partially funded by the European Commission under Grant Agreement RI Roadmap & Future Work Ricardo Rocha ( on behalf of the DPM team )
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
1 DIRAC Data Management Components A.Tsaregorodtsev, CPPM, Marseille DIRAC review panel meeting, 15 November 2005, CERN.
Andrea Manzi CERN EGI Conference on Challenges and Solutions for Big Data Processing on cloud 24/09/2014 Storage Management Overview 1 24/09/2014.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) LFC Installation and Configuration Dong Xu IHEP,
DPM: Future Proof Storage Ricardo Rocha ( on behalf of the DPM team ) EMI INFSO-RI
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Standard Protocols in DPM Ricardo Rocha.
EMI is partially funded by the European Commission under Grant Agreement RI DPM in EMI-II HTTP and NFS interfaces Oliver Keeble On behalf of DPM.
EMI is partially funded by the European Commission under Grant Agreement RI Future Proof Storage with DPM Oliver Keeble (on behalf of the CERN IT-GT-DMS.
Riccardo Zappi INFN-CNAF SRM Breakout session. February 28, 2012 Ingredients 1. Basic ingredients (Fabric & Conn. level) 2. (Grid) Middleware ingredients.
Martedi 8 novembre 2005 Consorzio COMETA “Progetto PI2S2” FESR Data Management System Annamaria Muoio -- INFN Catania PI2S2 First Tutorial -- Messina,
EGEE Data Management Services
a brief summary for users
Jean-Philippe Baud, IT-GD, CERN November 2007
Remotely access grid files with Parrot
GFAL Grid File Access Library
GFAL Grid File Access Library
GFAL: Grid File Access Library
gLite Basic APIs Christos Filippidis
Dynamic Storage Federation based on open protocols
Ricardo Rocha ( on behalf of the DPM team )
DPM Installation Configuration
StoRM: a SRM solution for disk based storage systems
Vincenzo Spinoso EGI.eu/INFN
Status of the SRM 2.2 MoU extension
Future of WAN Access in ATLAS
LFC Installation and Configuration
gLite Data management system overview
Gfal/lcg-util -> Gfal2/gfal2-util
Introduction to Data Management in EGI
Taming the protocol zoo
SRM2 Migration Strategy
Hands-On Session: Data Management
EGI UMD Storage Software Repository (Mostly former EMI Software)
Data Management cluster summary
Data Management Ouafa Bentaleb CERIST, Algeria
Data services in gLite “s” gLite and LCG.
Architecture of the gLite Data Management System
gLite Data and Metadata Management
INFNGRID Workshop – Bari, Italy, October 2004
Data Management system in gLite middleware
Presentation transcript:

GFAL 2.0 Devresse Adrien CERN lcgutil team lcgutil-support@cern.ch

What is GFAL 2.0 ? → One and only one Toolkit for all the Grid and Cloud data operations : → Client data access : → support for all common protocols → support for all common storage systems → Data management → One API for all data management → Data transfers → 3rd party transfer support CERN, IT-GT-DMS

Goal of GFAL 2.0 ? → Hide the Grid complexity from the client side. → Simplify the data access world → One API. → Reduce the number of dependencies. → Plugin system → Makes application independent of the technology / protocols. CERN, IT-GT-DMS

The complexity of a random read file operation Client LFC get file replicas list SRM Server DPM Head Node Disk node SRM GET rfio open rfio lseek rfio read lfc_stat GFAL 2.0 gfal2_open gfal2_pread SRM surl → turl CERN, IT-GT-DMS

An old-style Grid application : Grid Application X Core logic File access layer dCache, DPM, Castor lib Globus framework Xrootd framework SRM client Http lib CERN, IT-GT-DMS

A GFAL 2.0 Grid application : Grid Application X Core logic GFAL 2.0 API CERN, IT-GT-DMS

GFAL 2.0 architecture : Extensible... API POSIX File API Transfer Plugin layer Plugin LFC Plugin SRM Plugin RFIO Plugin Dcap Plugin Http/dav Plugin xrootd Plugin S3 ( ? ) experience catalogue plugin (???)  Any Cloud ( Http/Dav ) DPM Castor Dropbox Extensible... EOS dCache

What GFAL 2.0 can do ? In any protocol : Meta-data operations : → stat, rm, mkdir, → mv, rmdir, etc... → list directory, xattr, etc... Remote I/O in any protocol → open/read/write/close → pread/pwrite → etc.. In any protocol : → SRM → GRIDFTP → RFIO-Castor → RFIO-DPM → LFN → GUID → DCAP / GSIDCAP → XROOTD → HTTP(S) * → WebDav(s) * → local file CERN, IT-GT-DMS

Feature : Protocol generic filecopy Third party transfer copy : gfalt_copyfile(context, NULL, « srm://monfichier », « srm://madest/higgs ») gfalt_copyfile(context, NULL, « srm://monfichier », « gsiftp://madest/chuck_norris ») Automatic protocol resolution : gfalt_copyfile(context, NULL, « guid:1233 », «root://xrootd_dpm/curiosity») gfalt_copyfile(context, NULL, « file:///tmp/super_file», «gsidcap://42») CERN, IT-GT-DMS

Really protocol generic : Take a, b in srm, guid, lfn, , rfio, dcap , gsidcap, gsiftp, file, xrootd, (http, dav ) gfalt_copyfile(context, NULL, « a://src/file », «b://dst/file») FINISHED ! CERN, IT-GT-DMS

Speed Up x2-x4 for short transfers Feature : Transparent parallel operations Destination exist ? File copy operation : SRM GET SRM GET Destination exist ? SRM PUT SRM PUT transfer transfer PUTDONE PUTDONE Speed Up x2-x4 for short transfers Old tools... GFAL 2.0

thread-safe session reuse Feature : Transparent thread-safe session reuse Pool of Connections SRM, GSIFTP, HTTP, Etc,.. Transfer 1 Create session End Transfer 2 End One server connection

Feature : extended attributes GFAL 2.0 supports POSIX extended attributes ( getxattr, setxattr, etc...) → Clean way to uses protocol specific functionalities. → Simple get/list/set API → Easy to use.

NO ENVIRONMENT VAR ! Environment Variable User

Feature : Works out of the box : NO ENV VAR ! → Every parameter is in a configuration file : → /etc/gfal2.d/*.conf → No environment variable needed. → Tune it once, re-use everywhere. → Still compatible with classical environment variable of GFAL 1.0, for an easy transition.

Does not make the coffee Missing Feature : Does not make the coffee Coming soon …. ( RFC-2324 compliant ! )

How to use GFAL 2.0 ? : → libgfal2 : → gfal2-python : → a C Library → a set of independent plugins → gfal2-python : → simple and pythonic python bindings → gfalFS fuse module for gfal 2 → allows to mount any grid endpoint in a local folder → gfal-tools : experimental command line tools → gfal-ls, gfal-copy, gfal-mkdir, gfal-cat, …. CERN, IT-GT-DMS

Already in production state ! How to get GFAL 2.0 ? : Already in production state ! → Part of EMI 2 release: → EMI 2 repository : www.eu-emi.eu → Packaged on EPEL : → sudo yum install gfal2-all gfal2-doc gfalFS → is going to be packaged on Debian ( thx Mathias ) → Source available, designed to be portable. https://svnweb.cern.ch/trac/lcgutil/wiki/gfal2 CERN, IT-GT-DMS

Support of GFAL 2.0 : → Part of EMI will be supported after the end of EMI → GFAL 2.0 is the core of FTS 3.0 will be supported in any case. → Long term successor to : lcg clients : lcg-cp, lcg-cr, lcg util python gfal 1.0, etc... → Development Open to everybody CERN, IT-GT-DMS

From gfal1.0 to GFAL 2.0 : → GFAL 2.0 does a lot more than GFAL 1.0 : → a lot of Major changes : Not 100% backward compatible. gfal 1.0 + lcg-util (lcg-cp, .. ) + is-interface + gridftp-ifce + …. CERN, IT-GT-DMS

From gfal1.0 to GFAL 2.0 : But, We provides facilities for the transition The complete POSIX API is backward compatible We provide examples for common operations Doxygen documentation of the API. Any question is Welcome : lcgutil-support@cern.ch CERN, IT-GT-DMS

Lcg-util command line tools successor ? gfal-tools ( ? ) : Lcg-util command line tools successor ? →Users wants command line tools → lcg-util (lcg-cp, lcg-cr ) are heavily used. → lcg-util suffers from his age... → outdated features → hard to maintain → lots of dependencies → → a GFAL 2.0 based -tools ?

gfal-tools : → Experimental tools for lcg-util replacement on the long term. → We need feedback and informations from the experiments : → Which part of lcg-util is needed : → lcg-cp, lcg-ls, lcg-cr, ( and ? ) → What can be improved  in their point of view ?: → It's time for suggestion ! → How can we help to a smooth transition ?

Gfal-tools : Current status → A simple set of generic command line file utils  based on GFAL 2: → gfal-copy → gfal-ls → gfal-rm → gfal-cat → gfal-save → etc.... → Prototype already available for comments ... → Still under development ! → It's the right time for comments...

https://svnweb.cern.ch/trac/lcgutil/wiki/gfal2 lcgutil-support@cern.ch