EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks The EGEE project and the gLite middleware.

Presentation transcript:

The EGEE project and the gLite middleware
Gergely Sipos
EGEE User Training and Induction Activity deputy manager
MTA SZTAKI Computer and Automation Research Institute, Hungarian Academy of Sciences
EGEE-II INFSO-RI. Enabling Grids for E-sciencE. EGEE and gLite are registered trademarks.

Enabling Grids for E-sciencE EGEE-II INFSO-RI CoreGRID Summer School 2007 – Budapest, 3-7 September

Contents
- What is EGEE?
  - Goals
  - Status
  - Activities
- EGEE's Grid middleware: gLite 3
  - Security
  - Job management
  - Information system
  - File catalogue
- Overview of the afternoon hands-on

Introduction
The EGEE project:
- 4-year project, funded by the EU (EGEE, EGEE-II)
- Seamless Grid infrastructure for e-Science, available to scientists 24 hours a day
EGEE: 1 April 2004 – 31 March
- partners in 27 countries, federated in regional Grids
We are now in the 2nd phase of the project
EGEE-II: 1 April 2006 – 31 March
- partners in 32 countries, grouped into 13 federations
EGEE-III proposal is under preparation
- Transition to a self-sustainable European Grid service infrastructure

EGEE goals: international e-infrastructure
- Build, deploy and operate a consistent, robust, large-scale production grid service that
  - links with and builds on national, regional and international initiatives
- Improve and maintain the middleware in order to deliver a reliable service to users
- Attract new users from research and industry and ensure training and support for them
[Diagram] Labels: Network infrastructure & Resource centres; Operations, Support and training; Collaboration; International Grid

Production Service
Real Time Monitor
- Java tool
- Displays jobs running (submitted through brokers)
- Shows jobs moving around a world map in real time, along with changes in status
(Snapshot: 16 January 2007)

EGEE-II
From April 2006, natural continuation of EGEE
- Expanded consortium
- Emphasis on providing an infrastructure
  - increased support for applications
  - interoperation with other infrastructures
  - more involvement from industry
Activity types
- SA: service activities (establishing operations)
- NA: network activities (supporting users and applications)
- JRA: joint research activities (e.g. hardening middleware)

EGEE Infrastructure
The world's largest multi-disciplinary Grid
- > 200 sites in 40 countries
- ~ CPUs
- ~ 5 PB storage
- 98k jobs/day
- > 200 Virtual Organizations
[Map] Legend: Country participating in EGEE

Who is using EGEE?
> 200 VOs from several scientific domains
- Astronomy & Astrophysics
- Civil Protection
- Computational Chemistry
- Comp. Fluid Dynamics
- Computer Science/Tools
- Condensed Matter Physics
- Earth Sciences
- Fusion
- High Energy Physics
- Life Sciences
Further applications under evaluation
98k jobs/day
Applications have moved from testing to routine and daily usage, with ~80-90% efficiency

Virtual organisations and grids
What is a Virtual Organisation?
- People in different organisations seeking to cooperate and share resources across their organisational boundaries
- E.g. a research collaboration
Each grid is an infrastructure enabling one or more virtual organisations to share and access resources
Each resource is exposed to the grid through an abstraction that masks heterogeneity, e.g.
- Multiple diverse computational platforms
- Multiple data resources
Resources are usually owned by VO members. Negotiations lead to VOs sharing resources

EGEE Infrastructure
Production service (production VOs)
- Based on gLite 3.0 middleware
- New resource centres and new applications are encouraged to participate
- Stable, well-supported infrastructure, running only well-tested and reliable middleware
Pre-production service
- Runs in parallel with the production service (restricted number of sites)
- First deployment of new versions of the middleware
- Applications test-bed
GILDA training testbed (to be used in the afternoon)
- Complete suite of Grid elements and applications: testbed, CA certification, VOs, monitoring
- Everyone can register and use GILDA for training and testing

Basic services of the EGEE grid
[Diagram] Components: User Interface, Resource Broker, Information System, File and Replica Catalog, Authorization Service (VO Management Service), and Site X with a Computing Element and a Storage Element. Arrow labels: create credential, submit job, retrieve output, query, publish state.

Who provides the resources?!

Service                              Provider               Note
User interface                       User / institute / VO  Computer with client software
Resource Broker                      VOs                    EGEE does not fund RBs
Information System                   Grid operations        EGEE funded effort
File and replica catalog             VOs                    EGEE does not fund catalogs
Computing Element (cluster)          VOs                    EGEE does not fund CEs
Storage Element                      VOs                    EGEE does not fund SEs
VO or application specific services  User / institute / VO  To extend the capabilities of the core infrastructure

Scalability requires that VOs provide resources to match average need.

gLite security: the user's viewpoint
User's checklist
- User obtains a grid certificate from a recognized Certificate Authority
- User finds and joins a VO
- User gets an account on the UI machine (and in a Grid portal)
- User connects to the UI by ssh and generates a proxy (or logs in to a portal service and generates a proxy)
VO administrator's checklist
- Receive the user's join request
- Add the user's identity and role to the VOMS DB
- Create an account on the UI
[Diagram] Labels: UI, CA, VO mgr, Annually, VO database, Grid Security Infrastructure, VOMS, Once, VO services: broker, CE, SE, …

Security: viewpoint of the grid
[Diagram] Components: User Interface, Broker, Computing Element, Storage Element, VOMS server, VO database. Labels: User's public key, User's private key, Generate, Proxy, Proxy + VO role.

Generating proxy: user command

    giorgio]$ ls -l .globus
    total 8
    -rw-r--r-- 1 giorgio users 1613 Jun 7 15:16 usercert.pem
    -r giorgio users 1914 Jun 7 15:16 userkey.pem
    giorgio]$ voms-proxy-init --voms gilda
    Cannot find file or dir: /home/giorgio/.glite/vomses
    Your identity: /C=IT/O=GILDA/OU=Personal Certificate/L=INFN/CN=Emidio
    Enter GRID pass phrase: ***********
    Creating temporary proxy Done
    Contacting voms.ct.infn.it:15001 [/C=IT/O=GILDA/OU=Host/L=INFN "gilda" Done
    Creating proxy Done
    Your proxy is valid until Fri Jul 14 00:34:

Specifying a job: Job Description Language

Example JDL file:

    Executable = "gridTest";                          <- What program to start?
    StdError = "stderr.log";                          <- Which file receives the standard error stream?
    StdOutput = "stdout.log";                         <- Which file receives the standard output stream?
    InputSandbox = {"/home/sipos/test/gridTest"};     <- What file(s) to copy from the UI?
    OutputSandbox = {"stderr.log", "stdout.log"};     <- What file(s) to copy back to the UI?

Submit the job to the grid via the resource broker (RB):

    edg-job-submit my.jdl

- Execute this on the UI
- Returns a job-id used to monitor the job and retrieve its output
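
Because a JDL file is plain text, a description like the one above can also be generated from a script. The following is a minimal Python sketch, not part of the original slides: `render_jdl` is a hypothetical helper (not a gLite API), and it assumes that string values are written in double quotes and lists in braces.

```python
def render_jdl(attributes):
    """Render a dict of job attributes as JDL text (illustrative helper only)."""
    def fmt(value):
        # Lists become {"a", "b"}; everything else becomes a quoted string.
        if isinstance(value, (list, tuple)):
            return "{" + ", ".join(fmt(v) for v in value) + "}"
        return '"{}"'.format(value)

    return "\n".join("{} = {};".format(key, fmt(val)) for key, val in attributes.items())


# Attribute names and values are the ones shown on the slide.
job = {
    "Executable": "gridTest",
    "StdError": "stderr.log",
    "StdOutput": "stdout.log",
    "InputSandbox": ["/home/sipos/test/gridTest"],
    "OutputSandbox": ["stderr.log", "stdout.log"],
}

jdl_text = render_jdl(job)
print(jdl_text)
```

Writing `jdl_text` to a file such as `my.jdl` would give the input for `edg-job-submit my.jdl` on the UI.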

Current production middleware

    Executable = gridTest;
    StdError = stderr.log;
    StdOutput = stdout.log;
    InputSandbox = {/home/sipos/test/gridTest};
    OutputSandbox = {stderr.log, stdout.log};

[Diagram] Components: User interface, ComputingElement. Arrow labels: edg-job-submit… with Input sandbox (gridTest); edg-job-get-output with Output sandbox (stderr.log, stdout.log).
- A worker node is allocated by the local jobmanager (e.g. Condor, LSF, …)
- STD out and err. streams are redirected into files (STD input stream is read from file)

Job submission with dynamic resource discovery

Example JDL file:

    Executable = "gridTest";
    StdError = "stderr.log";
    StdOutput = "stdout.log";
    InputSandbox = {"/home/sipos/test/gridTest"};
    OutputSandbox = {"stderr.log", "stdout.log"};
    Requirements = other.Architecture=="INTEL" && \
                   other.Memory>=256;
    Rank = other.Memory;

Submit the job to the grid via the resource broker (RB):

    edg-job-submit my.jdl

- Execute this on the UI
- Returns a job-id used to monitor the job and retrieve its output
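
The effect of `Requirements` and `Rank` can be pictured as a filter followed by a sort. The Python sketch below is only an illustration of that idea, not the broker's actual implementation; the resource records and the names `ce-a` … `ce-d` are invented for the example.

```python
# Hypothetical resource records, standing in for what the information system publishes.
computing_elements = [
    {"name": "ce-a", "Architecture": "INTEL", "Memory": 512},
    {"name": "ce-b", "Architecture": "INTEL", "Memory": 128},
    {"name": "ce-c", "Architecture": "SPARC", "Memory": 2048},
    {"name": "ce-d", "Architecture": "INTEL", "Memory": 1024},
]


def requirements(other):
    # Mirrors: Requirements = other.Architecture==INTEL && other.Memory>=256;
    return other["Architecture"] == "INTEL" and other["Memory"] >= 256


def rank(other):
    # Mirrors: Rank = other.Memory;  (a higher value is preferred)
    return other["Memory"]


def match(resources):
    """Keep resources that satisfy Requirements, ordered by Rank, best first."""
    candidates = [r for r in resources if requirements(r)]
    return sorted(candidates, key=rank, reverse=True)


matches = match(computing_elements)
print([ce["name"] for ce in matches])
```

In this toy data set `ce-b` is dropped for having too little memory and `ce-c` for its architecture; of the two remaining resources, the one with more memory comes first.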

Information system in action
[Diagram] Components: User interface, ResourceBroker, InformationService, ComputingElement, StorageElement. Arrow labels: Input sandbox; Input sandbox + Broker Info; Output sandbox; Publish SE & CE info.

Information Service
A user or a service can query:
- the BDII (usual mode)
- LDAP servers on each site

Information system user query: lcg-infosites
The lcg-infosites command can be used as an easy way to retrieve information on Grid resources for most use cases.
USAGE (execute on the UI machine):

    lcg-infosites --vo <vo name> options -v <verbose level> --is <BDII to query>

Job submission with dynamic data discovery

Example JDL file:

    Executable = "gridTest";
    StdError = "stderr.log";
    StdOutput = "stdout.log";
    InputSandbox = {"/home/sipos/test/gridTest"};
    OutputSandbox = {"stderr.log", "stdout.log"};
    Requirements = other.Architecture=="INTEL" && \
                   other.Memory>=256;
    Rank = other.Memory;
    InputData = "lfn:/grid/gilda/sipos/inputs/testbed019";

Submit the job to the grid via the resource broker (RB):

    edg-job-submit my.jdl

- Execute this on the UI
- Returns a job-id used to monitor the job and retrieve its output

File catalog in action
[Diagram] Components: User interface, ResourceBroker, InformationService, LCG File Catalogue (LFC), ComputingElement, StorageElement. Arrow labels: Data location; Input sandbox; Input sandbox + Broker Info; Output sandbox; Publish SE & CE info.

LCG File Catalogue (LFC)
- Catalog of grid directories
- Virtual file system: each entry in the directory may be stored on different SEs
- User-friendly file hierarchy and file names
- Replicas: the same content stored on two SEs
- System provides job with closest replica

Logical directory: lfn:/grid/gilda/sipos/inputs/ with entries input1, input2, input3

Physical locations shown in the diagram:
- Storage Element 1: sfn://grid005.iucc.ac.il/storage/gilda/generated/ /fileb233d43f-5bc6-4ede-a5fe-611d48be2ba5
- Storage Element 2: srm://aliserv6.ct.infn.it/dpm/ct.infn.it/home/gilda/generated/ /filea21ab3e2-8ff6-4a44-82a7-f2
- Storage Element 3: sfn://trigriden01.unime.it/flatfiles/SE00/gilda/generated/ /filec79a9e3c a2a5-235f
- Storage Element 4: sfn://grid005.iucc.ac.it/flatfiles/SE00/gilda/generated/ /filec79a9e3c a2a5-235f

Resolving logical file name
Two copies of the same file are available on 2 SEs
- Logical file name: lfn:/grid/gilda/sipos/inputs/testbed019
[Diagram] Components: User interface, LCG File Catalogue (LFC), Storage Element 1, Storage Element 2. Labels: Myfile.dat, File_on_se1, File_on_se2.
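
Resolving a logical file name amounts to a lookup from one name to a list of replicas. The Python sketch below uses a plain dictionary as a stand-in for the LFC; it does not call any LFC API, and the host names `se1.example.org` and `se2.example.org` are placeholders invented for the example.

```python
from urllib.parse import urlparse

# Toy stand-in for the catalogue: one logical file name mapped to its replicas.
# The replica URLs are made-up placeholders, not real storage locations.
catalogue = {
    "lfn:/grid/gilda/sipos/inputs/testbed019": [
        "srm://se1.example.org/gilda/File_on_se1",
        "srm://se2.example.org/gilda/File_on_se2",
    ],
}


def list_replicas(lfn):
    """Return all physical replicas registered for a logical file name."""
    if lfn not in catalogue:
        raise KeyError("no such logical file: " + lfn)
    return list(catalogue[lfn])


def storage_elements(lfn):
    """Return the host name of the storage element behind each replica."""
    return [urlparse(url).hostname for url in list_replicas(lfn)]


print(storage_elements("lfn:/grid/gilda/sipos/inputs/testbed019"))
```

The user only ever handles the `lfn:` name; which storage element actually serves the file is decided after the lookup.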

Job submission with dynamic data discovery

Example JDL file:

    Executable = "gridTest";
    StdError = "stderr.log";
    StdOutput = "stdout.log";
    InputSandbox = {"/home/sipos/test/gridTest"};
    OutputSandbox = {"stderr.log", "stdout.log"};
    Requirements = other.Architecture=="INTEL" && \
                   other.Memory>=256;
    Rank = other.Memory;
    InputData = "lfn:/grid/gilda/sipos/inputs/testbed019";

Submit the job to the grid via the resource broker (RB):

    edg-job-submit my.jdl

- Execute this on the UI
- Returns a job-id used to monitor the job and retrieve its output

Notes on InputData:
- lfn: logical file name
- The RB uses the File Catalog to find the file location
- The file itself is NOT transferred by the broker! Your binary must transfer input/output grid files!
- The broker puts your job on a CE close to the SE where the file is stored
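
The data-aware placement described above can be sketched as a join between two tables: where the replicas are, and which SE each CE is close to. This Python fragment is a simplified illustration under invented data; the host names and the `close_se` mapping are assumptions, and the real broker's logic is not shown in the slides.

```python
# Hypothetical data: SEs holding a replica of each logical file ...
replica_locations = {
    "lfn:/grid/gilda/sipos/inputs/testbed019": {"se1.example.org", "se2.example.org"},
}

# ... and the SE that each CE is considered "close" to.
close_se = {
    "ce-a.example.org": "se3.example.org",
    "ce-b.example.org": "se2.example.org",
    "ce-c.example.org": "se1.example.org",
}


def ces_near_data(lfn):
    """Return the CEs whose close SE holds a replica of the given logical file."""
    ses = replica_locations.get(lfn, set())
    return sorted(ce for ce, se in close_se.items() if se in ses)


print(ces_near_data("lfn:/grid/gilda/sipos/inputs/testbed019"))
```

Only the job is steered towards the data here. As the slide stresses, the file is still not copied by the broker, so the job's own binary has to fetch it.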

Working with grid files
Use lfc-* commands to interact with the catalogue
- To create a catalogue directory
- To list files
- Used by the user, the user's binary and by lcg-utils (see below)
Use lcg-* commands to couple catalogue operations with file management
- Keeps SEs and catalogue in step!
- Create file on storage and register it in catalogue
- Replicate files between storages
- Download file and delete from catalog
[Diagram] Components: User interface, LCG File Catalogue (LFC), Storage Element 1, Storage Element 2. Arrow labels: lfc-*, lcg-*.

High(er) level user environments
- To hide the complexity of the grid middleware
- To provide application-specific user interfaces
- To provide easier access mechanisms (e.g. Web portal)
- To keep the user focused on the real problem
RESPECT: Recommended External Software Packages for Egee CommuniTies
Current RESPECT tools:
- GANGA
- GridWay
- P-GRADE Portal
[Diagram] Label: Wednesday
Grid software menu:
- Basic gLite services: security, job submission, info.
- Higher-level gLite services (brokering, …)
- Application toolkits, standards
- Application

Summary: EGEE and gLite
EU-funded Grid project: EGEE; EGEE-II
- Build, deploy and operate a consistent, robust, large-scale production grid service
- Improve and maintain the middleware in order to deliver a reliable service to users
- Attract new users from research and industry and ensure training and support for them
gLite middleware: a bag of services that address key distributed-system issues
- Security, Data management, Job management, Monitoring
Basic components
- User interface: access service for end users
- Workload Management System: resource broker
- Computing Element: computing service, a job queue
- Storage Element: file storage
- File catalog: file registry, virtual file system
- Information System: database of SEs and CEs
All built on the Grid Security Infrastructure (GSI)
High-level community or application-specific tools

Further information, references
- EGEE
- gLite middleware
- gLite manuals, documentation (gLite user guide)
- Recommended External Software Packages for Egee CommuniTies

Afternoon practical 15:45-18:00
Command reminder

Security (Practical 1)
    voms-proxy-init --voms gilda     generates proxy
    voms-proxy-destroy               deletes proxy
Job management (Practical 2 & 4)
    edg-job-submit job.jdl           returns a jobID
    edg-job-status jobID             returns job status
    edg-job-get-output jobID         downloads OutputSandbox
Information system (Practical 3)
    lcg-infosites …                  queries BDII server
File catalogue management (no practical today)
    lfc-*                            manipulate grid directories
    lcg-*                            manipulate SEs and grid directory entries

Thank you! Questions?