Grid Data Management Assaf Gottlieb Tel-Aviv University assafgot tau.ac.il EGEE is a project funded by the European Union under contract IST-2003-508833.

Slides:



Advertisements
Similar presentations
Workflows over Grid-based Web services General framework and a practical case in structural biology gLite 3.0 Data Management Hands-on David García Aristegui.
Advertisements

Workflows over Grid-based Web services General framework and a practical case in structural biology gLite 3.0 Data Management David García Aristegui Grid.
INFSO-RI Enabling Grids for E-sciencE Data Management System Jean Salzemann CNRS/IN2P3 ACGRID School, Hanoi (Vietnam) November 6th,
EGEE is a project funded by the European Union under contract IST Grid Data Management Hands-on Simone Campana LCG Experiment Integration and.
Grid Data Management Assaf Gottlieb - Israeli Grid NA3 Team EGEE is a project funded by the European Union under contract IST EGEE tutorial,
EGEE is a project funded by the European Union under contract IST Data Services Valeria Ardizzone EGEE NA4 Generic Applications INFN Catania.
EGEE is a project funded by the European Union under contract IST Data Services Simone Campana LCG Experiment Integration and Support CERN-IT.
The LCG File Catalog (LFC) Jean-Philippe Baud – Sophie Lemaitre IT-GD, CERN May 2005.
Ninth EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
БАЗОВЫЕ СРЕДСТВА РАБОТЫ С ФАЙЛАМИ В GRID. The gLite3 Architecture Security Service: Grid Security Infrastructure (GSI) Secure Sockets Layer (SSL) communication.
EGEE-II INFSO-RI Enabling Grids for E-sciencE gLite Data Management System Yaodong Cheng CC-IHEP, Chinese Academy.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
LFC tutorial Jean-Philippe Baud, IT-GT, CERN July 2010.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Grid Services/SRB/SRM & Practical Hai-Ning Wu Academia Sinica Grid Computing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE gLite Demo Yaodong Cheng CC-IHEP, Chinese Academy of Sciences The.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Data Management Hands-on Claudio Cherubino.
The LCG File Catalog (LFC) Jean-Philippe Baud – Sophie Lemaitre IT-GD, CERN May 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware Data Management in gLite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Nov. 18, EGEE and gLite are registered trademarks gLite Middleware Usage Dusan.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
Jan 31, 2006 SEE-GRID Nis Training Session Hands-on V: Standard Grid Usage Dušan Vudragović SCL and ATLAS group Institute of Physics, Belgrade.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Hands-on on data management Tony Calanducci.
E-science grid facility for Europe and Latin America Data Management Services E2GRIS1 Rafael Silva – UFCG (Brazil) Universidade Federal.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
EGEE is a project funded by the European Union under contract IST Grid Data Management Roberto Barbera Univ. Of Catania and INFN
Managing Data DIRAC Project. Outline  Data management components  Storage Elements  File Catalogs  DIRAC conventions for user data  Data operation.
SEE-GRID-SCI Storage Element Installation and Configuration Branimir Ackovic Institute of Physics Serbia The SEE-GRID-SCI.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
SEE-GRID-SCI Hands-On Session: Using Grid Vladimir Slavnic Institute of Physics, Belgrade Serbia The SEE-GRID-SCI initiative.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Data Management System Giuseppe Andronico.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Data management in LCG and EGEE David Smith.
Data Management The European DataGrid Project Team
Further aspects of EGEE middleware components INFN, Catania EGEE is funded by the European Union under contract IST
Data Management The European DataGrid Project Team
Recovery of Lost Files Jiří Chudoba Institute of Physics, Prague.
EGEE is a project funded by the European Union under contract IST Grid Data Management Simone Campana LCG Experiment Integration and Support.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data management in EGEE.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Data Management Hands-on Juan Eduardo Murrieta.
12th EELA Tutorial for Users and Managers E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
1 DIRAC Data Management Components A.Tsaregorodtsev, CPPM, Marseille DIRAC review panel meeting, 15 November 2005, CERN.
INFSO-RI Enabling Grids for E-sciencE Data Management + Practical Ruediger Berlich / Forschungszentrum Karlsruhe Mike Mineter /
Istituto Nazionale di Astrofisica Information Technology Unit INAF-SI Job with data management Giuliano Taffoni.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra gLite 1.4 Data Management System Salvatore Scifo, Riccardo Bruno Test.
INFSO-RI Enabling Grids for E-sciencE University of Coimbra Data Management System gLite – LCG – FiReMan Salvatore Scifo INFN Catania.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Algiers, EUMED/Epikh Application Porting Tutorial, 2010/07/04.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) LFC Installation and Configuration Dong Xu IHEP,
GRID commands lines Original presentation from David Bouvet CC/IN2P3/CNRS.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Data Management Maha Metawei
INFSO-RI Enabling Grids for E-sciencE Practicals on LFC and gLite DMS Tony Calanducci Emidio Giorgio INFN Retreat between GILDA.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America LFC Server Installation and Configuration.
Martedi 8 novembre 2005 Consorzio COMETA “Progetto PI2S2” FESR Data Management System Annamaria Muoio -- INFN Catania PI2S2 First Tutorial -- Messina,
GFAL Grid File Access Library
gLite Basic APIs Christos Filippidis
Java API del Logical File Catalog (LFC)
gLite Data management system overview
Hands-On Session: Data Management
ESRIN Grid Workshop Tutorial
Data Management in Release 2
Riccardo Bruno, Salvatore Scifo gLite - Tutorial Catania, dd.mm.yyyy
Data Management Ouafa Bentaleb CERIST, Algeria
Data services in gLite “s” gLite and LCG.
Architecture of the gLite Data Management System
gLite Data and Metadata Management
Data Management system in gLite middleware
Presentation transcript:

Grid Data Management Assaf Gottlieb Tel-Aviv University assafgot tau.ac.il EGEE is a project funded by the European Union under contract IST

EGEE tutorial, Outline  Introduction  Grid Data Management Services  File catalogues  Data Management commands  Hands on

EGEE tutorial, Introduction  The Input / Output Sandbox is limited to small files (< 10 MB)  Large files are stored in permanent resources called SE = Storage Elements.  SE are present at almost every site together with the computing resources  No periodical deletion! User responsible for his files.

EGEE tutorial, Grid Data Management Services Grid Data Management Services enable users to:  move files in and out of the Grid  Replicate files on different SE’s  Locate files on various SE’s Data Management means movement and replication of files on grid elements

EGEE tutorial,  Data transfer is done by a number of protocols (gsiftp, rfio, file, etc`)  Usage of a central File catalogue By using high level data management tools which enable transparency of the transport layer details (protocols), storage location and the internal structure of the SE’s The SE is a “black box” Grid Data Management Services – cont’d

EGEE tutorial, File Catalogs  How do I keep track of all of the files I have on the Grid ?  Even if I remember all the lfn’s of my files, what about someone else's files ?  How does the Grid keep track of lfn-guid-surl associations ?  Well… for that we have a FILE CATALOG

EGEE tutorial, File Catalogs SE gLite UI SE

EGEE tutorial, Logical File Name 1 Logical File Name 2 Logical File Name n GUID Physical File SURL n Physical File SURL 1 File Catalogs – cont’d RMC = Replica Metadata Catalog LRC = Local Replica Catalog

EGEE tutorial, File Catalogs – cont’d GUID Xxxxxx-xxxx-xxx-xxx- System Metadata “size” => “cksum_type” => “MD5” “cksum” => “yy-yy-yy” Symlink /grid/dteam/mydir/mylink Replica srm://host.example.com/foo/bar host.example.com Replica srm://host.example.com/foo/bar host.example.com Replica srm://host.example.com/foo/bar host.example.com Replica srm://host.example.com/foo/bar host.example.com Symlink /grid/dteam/mydir/mylink Symlink /grid/dteam/mydir/mylink LFN /grid/dteam/dir1/dir2/file1.root User Metadata User Defined Metadata  The LFN acts as a main key in the database. It has:  Symbolic links to it (additional LFNs)  Unique Identifier (GUID)  System metadata  Information on replicas

EGEE tutorial,  Logical File Name (LFN)  An alias created by the user to refer to some file  A LFN is of the form: lfn:/grid/ / /  Example: lfn:/grid/gilda/importantResults/Test1240.dat  Globally Unique Identifier (GUID)  A file can always be identified by its GUID (based on UUID)  A GUID is of the form: guid:  All replicas of a file will share the same GUID  Example: guid:f81d4fae-7dec-11d0-a765-00a0c91e6bf6 both lfn’s and guid’s refer to files (not replicas) Files : name conventions

EGEE tutorial, Replicas : name conventions  Storage URL (SURL)  (AKA: Physical/Storage File Name (PFN/SFN))  Used by the LRC to find where the replica is physically stored  A SURL is of the form: sfn:// / /  Example: sfn://tbed1.cern.ch/flatfiles/SE00/gilda/project1/testSUTL.dat  Transport URL (TURL)  Temporary locator of a physical replica including the access protocol understood by a SE  A TURL is of the form: :// / /  Example: gsiftp://tbed1.cern.ch/gilda/project1/testTURL.dat provide info about the physical location of the replica

EGEE tutorial, Data Management commands  lcg-cp Copies a Grid file to a local destination  lcg-cr Copies a file to a SE and registers the file in the LRC  lcg-del Deletes one file (either one replica or all replicas)  lcg-rep Copies a file from SE to SE and registers it in the LRC

EGEE tutorial, Data Management commands – cont’d  lcg-lg Gets the guid for a given lfn or surl  lcg-aa Adds an alias in RMC for a given guid  lcg-la Lists the aliases for a given LFN, GUID or SURL  lcg-gt Gets the turl for a given surl and transfer protocol

EGEE tutorial, Data Management commands – cont’d  lcg-lr Lists the replicas for a given lfn, guid or surl  lcg-ra Removes an alias in RMC for a given guid  lcg-rf Registers a SE file in the LRC (optionally in the RMC)  lcg-uf Un-registers a file residing on an SE from the LRC

EGEE tutorial, File catalog commands  lfc-ls List file/directory entries in a directory.  lfc-mkdir Create directory.  lfc-rename Rename a file/directory.  lfc-rm Remove an empty directory.  lfc-chmodChange access mode of a file/directory  lfc-chownChange owner and group of a file/directory

EGEE tutorial,