Presentation is loading. Please wait.

Presentation is loading. Please wait.

EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org Features and Future Frédéric Hemmer - CERN Deputy Head of IT Department.

Similar presentations


Presentation on theme: "EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org Features and Future Frédéric Hemmer - CERN Deputy Head of IT Department."— Presentation transcript:

1 EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org Features and Future Frédéric Hemmer - CERN Deputy Head of IT Department BEGrid seminar Brussels, October 27, 2006

2 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 2 Outline Overview of EGEE EGEE gLite Middleware –Foundation services –High Level services examples –Software process –Short Term plans Software Process & ETICS

3 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 3 The EGEE project EGEE –Started in April 2004 –Now in 2 nd phase with 91 partners in 32 countries Objectives –Large-scale, production-quality grid infrastructure for e-Science –Attracting new resources and users from industry as well as science –Maintain and further improve gLite Grid middleware

4 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 4 Applications on EGEE Many applications from a growing numbers of domains –Astrophysics  MAGIC, Planck –Computational Chemistry –Earth Sciences  Earth Observation, Solid Earth Physics, Hydrology, Climate –Financial Simulation  E-GRID –Fusion –Geophysics  EGEODE –High Energy Physics  4 LHC experiments (ALICE, ATLAS, CMS, LHCb)  BaBar, CDF, DØ, ZEUS –Life Sciences  Bioinformatics (Drug Discovery, GPS@, Xmipp_MLrefine, etc.)  Medical imaging (GATE, CDSS, gPTM3D, SiMRI 3D, etc.) –Multimedia –Material Sciences > 165 Virtual Organizations (VO) User Forum Book of abstracts: http://doc.cern.ch/archive/electronic/egee/tr/egee-tr-2006-005.pdfhttp://doc.cern.ch/archive/electronic/egee/tr/egee-tr-2006-005.pdf App deployment plan https://edms.cern.ch/document/722131/2https://edms.cern.ch/document/722131/2 Presentations, posters and demos at EGEE06: http://www.eu-egee.org/egee06http://www.eu-egee.org/egee06 Applications have moved from testing to routine and daily usage ~80-90% efficiency

5 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 5 EGEE: > 180 sites, 40 countries > 24,000 processors, ~ 5 PB storage EGEE Grid Sites : Q1 2006 sites CPU EGEE: Steady growth over the lifetime of the project

6 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 6 EGEE – What do we deliver? Infrastructure operation –Currently includes ~200 sites across 40 countries –Continuous monitoring of grid services & automated site configuration/management http://gridportal.hep.ph.ic.ac.uk/rtm/launch_frame.html Middleware –Production quality middleware distributed under business friendly open source licence User Support - Managed process from first contact through to production usage –Training –Expertise in grid-enabling applications –Online helpdesk –Networking events (User Forum, Conferences etc.) Interoperability –Expanding geographical reach and interoperability with collaborating e-infrastructures

7 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 7 Middleware Layers Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are helping the users building their computing infrastructure but should not be mandatory Foundation Grid Middleware is deployed on the EGEE infrastructure –Must be complete and robust –Should allow interoperation with other major grid infrastructures –Should not assume the use of Higher-Level Grid Services Foundation Grid Middleware Security model and Infrastructure Computing (CE) and Storage Elements (SE) Accounting Information and Monitoring Higher-Level Grid Services Workload Management Replica Management Visualization Workflow Grid Economies... Applications

8 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 8 The gLite Middleware Approach Exploit experience and existing components from VDT (Condor, Globus), EDG/LCG, and others –gLite is a distribution that combines components from many different providers! Develop, Test, Certify & Distribute a generic middleware stack useful to EGEE (and other) applications –Pluggable components –Follow SOA approach, WS-I compliant where possible Focus is on re-engineering and hardening Business friendly open source license –Plan to switch to Apache-2

9 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 9 Workload ManagementData Management Security Information & Monitoring Access gLite Grid Middleware Services API Computing Element Workload Management Metadata Catalog Storage Element Data Movement File & Replica Catalog Authorization Authentication Information & Monitoring Application Monitoring Auditing Job Provenance Package Manager CLI Accounting Site Proxy Overview paper http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdfhttp://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdf

10 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 10 Grid Foundation: Computing Element The CE accepts batch jobs (and job control requests) through a gatekeeper, performs AAA, passes them to a LRMS, monitors the their execution and return results to the submitter Three flavours available now:  LCG-CE (GT2 GRAM)  in production now but will be phased-out by the end of the year  gLite-CE (GSI-enabled Condor-C)  already deployed but still needs thorough testing and tuning.  CREAM (WS-I based interface)  Contribution to the OGF-BES group for a standard WS-I based CE interface BLAH is the interface to the local resource manager (via plug-ins) –CREAM and gLite-CE –Information pass-through: pass parameters to the LRMS to help job scheduling WMS, Clients LRMS WN bdII R-GMA CEMon Computing Element glexec + LCAS/ LCMAPS BLAH Grid Site Information System

11 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 11 Grid Foundation: Storage Element Site File Name (SFN): identifies a Storage Element and the logical name of the file inside it Physical File Name (PFN): argument of file open Storage Resource Manager (SRM) –hides the storage system implementation (disk or active tape) –checks the access rights to the storage system and the files –translates SFNs to PFNs –disk-based: DPM, dCache; tape-based: Castor, dCache File I/O: posix-like access from local nodes or the grid  GFAL

12 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 12 Example: The Disk Pool Manager Light-weight disk-based Storage Element –Easy to install, configure, manage and to join or remove resources –Integrated security (authentication/authorization) based on VOMS groups and roles  All control and I/O services have security built-in: GSI or Kerberos 5 –SRMv1 and SRMv2.1 interfaces. SRMv2.2 being added now Grid ClientData ServerSRM ServerName ServerDisk Pool Manager Disk SystemGridftp ClientRFIO ClientSRM ClientNS DatabaseDPM Database DPM DaemonNS DaemonRFIO Daemon Gridftp Server RFIO Client Request Daemon SRM Daemon

13 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 13 Grid Foundation: Accounting Resource usage by VO, group or single user –Resource metering: sensors running on resources to determine usage –Pricing policies: associate a cost to resource usage  if enabled allowed market- based resource brokering –privacy: access to accounting data granted only to authorized people (user, provider, VO manager) Basic functionality in APEL, full functionality in DGAS

14 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 14 High Level Services: Job Information Logging and Bookkeeping service –Tracks jobs during their lifetime (in terms of events) Job Provenance stores long term job information –Supports job rerun

15 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 15 High Level Services: Workload Management Resource brokering, workflow management, I/O data management  Web Service interface: WMProxy –Task Queue: keep non matched jobs –Information SuperMarket: optimized cache of information system –Match Maker: assigns jobs to resources according to user requirements –Job submission & monitoring  Condor-G  Condor-C  ICE (to CREAM) –External interactions:  Information System  Data Catalogs  Logging&Bookkeeping  Policy Management system (G-PBox)

16 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 16 High Level Services : FTS Reliable and manageable File Transfer System for VOs Transfers are treated as jobs –May be split onto multiple “channels” –Channels are point-to-point or “catch-all” (only one end fixed). More flexible channel definitions on the way... New features that will be available in production soon: –Cleaner error reporting and service monitoring interfaces –Proxy renewal and delegation –SRMv2.2 support Longer term development: –Optimized SRM interaction  split preparation from transfer –Better service management controls –Notification of finished jobs –Pre-staging tape support –Catalog & VO plug-ins framework  Allow catalog registration as part of transfer workflow

17 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 17 High Level Services: EDS Encrypted Data Storage –encrypt and decrypt data on-the-fly –Key-store: Hydra  N instances: at least M (<N) need to be available for decryption fault tolerance and security Demonstrated with the SRM- DICOM demo at EGEE Pisa conference (Oct’05) Will be GFAL Will be LFC Will be DPM (now d-Cache)

18 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 18 Main focus for the developers Give support on the production infrastructure (GGUS, 2 nd line support) Fix defects found on the production software Support SL(C)4 and 64bit architectures (x86-64 first) Participate to Task Forces together with applications and site experts and improve scalability Improve robustness and usability (efficiency, error reporting,...) Address requests for functionality improvements from users, site administrators, etc... (through the Technical Coordination Group) Improve adherence to international standards and interoperability with other infrastructures Deploy and expose to users new components on the preview test- bed Interoperability with Shibboleth Work plans available at: https://twiki.cern.ch/twiki/bin/view/EGEE/EGEEgLiteWorkPlans https://twiki.cern.ch/twiki/bin/view/EGEE/EGEEgLiteWorkPlans

19 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 19 Highlights: Shibboleth Shibboleth –Federation of campus infrastructures –Developed by Internet2 –Allows Single Sign On for web-based resources –Based on SAML (Security Assertion Markup Language ) –Manages an Authentication and Authorization Infrastructure (AAI) based on Shibboleth with about 160’000 users of the Swiss higher education sector  Activity started in 2002; in production since last summer  about 12’000 use SWITCHaai on a regular basis Interoperability with gLite –Specific for EGEE-2 infrastructure  NO replacement for X.509, VOMS,... –Home institution of the user is the Identity Provider –Attributes both from home institution and the VO

20 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 20 WMS Performance Results ~20000 jobs submitted –3 parallel UIs –33 Computing Elements –200 jobs/collection  Bulk submission Performances –~ 2.5 h to submit all jobs  0.5 seconds/job –~ 17 hours to transfer all jobs to a CE  3 seconds/job  26000 jobs/day Job failures –Negligible fraction of failures due to the gLite WMS  Either application errors or site problems Failure reasonJob fraction (%) Application error28 Remote batch system3.9 CRL expired3.3 Worker Node problem1.1 Gatekeeper down0.2 By A.Sciabà

21 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 21 Certification Functional Tests Testbed Deployment gLite Software Process Development Software Error Fixing Integration Deployment Packages Integration Tests Installation Guide, Release Notes, etc Pre-Production Scalability Tests Pre-Production Deployment Fail Pass Production Infrastructure Release Problem Serious problem Directives

22 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 22 gLite Software Process Technical Coordination Group (TCG) –Gathers & prioritizes user requirements from HEP, Biomed, (industry), sites –gLite development is client-driven! Software from EGEE-JRA1 and other projects –JRA1 preview test-bed (currently being set up)  early exposure to users of “uncertified” components SA3 Integration Team –Ensures components are deployable and work –Deployment Modules implemented high-level gLite node types  (WMS, CE, R-GMA Server, VOMS Server, FTS, etc) –Build system now spun off into the ETICS project SA3 Certification Team –Dedicated test-bed; test release candidates and patches –Develop test suites SA1 Pre-Production System –Scale tests by users

23 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 23 ETICS Build/Test Artefacts Web Application Report DB Project DB NMI Scheduler Clients Web Service NMI Client Via browser Via command- Line tools WNs ETICS Infrastructure

24 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 24 Summary EGEE is a global effort, and the largest multi-science Grid infrastructure worldwide gLite 3.0 is an important milestone in EGEE program –New components from gLite 1.X developed in the first phase of EGEE are being deployed for the first time on the Production Infrastructure  Addressing application ad operations requirements in terms of functionality and scalability  New build and integration environment from ETICS  Controlled software process and certification  Development is application driven (TCG) Collaboration with other projects for interoperability and definition/adoption of international standards

25 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 25 www.glite.org www.eu-egee.org

26 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 26 Grids in Europe Large European investment in developing Grid technology Sample of National Grid projects: –Austrian Grid Initiative –Belgium: BEgrid –DutchGrid –France: Grid’5000 –Germany: D-Grid; Unicore –Greece: HellasGrid –Grid Ireland –Italy: INFNGrid; GRID.IT –NorduGrid –Portuguese Grid –Swiss Grid –UK e-Science: National Grid Service; OMII; GridPP Multi-national, multi-science Grid infrastructures are a priority of the EC: –DEISA, EGEE plus several supporting projects

27 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 27 Evolution European e-Infrastructure Testbeds Utility Service Routine Usage National Global

28 Enabling Grids for E-sciencE EGEE-II INFSO-RI-031688 BEGrid Seminar, Brussels - October 27, 2006 28 Why Sustainability? Scientific applications start to depend on Grid infrastructures –e.g. EGEE supports well over 100 VOs, running over 50.000 jobs/day –Require long-term support New scientific collaborations have been formed thanks to the Grid infrastructure –E.g. WISDOM (http://wisdom.healthgrid.org)http://wisdom.healthgrid.org Business and Industry are getting very interested but need a long term perspective –E.g. over 20 companies were present at the Business Track during the EGEE’06 conference, September, 2006 >50k jobs/day Virtual Organizations Jan. ’06 Sep. ’06


Download ppt "EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org Features and Future Frédéric Hemmer - CERN Deputy Head of IT Department."

Similar presentations


Ads by Google