DataGrid is a project funded by the European Commission under contract IST-2000-25182 3 rd EU Review – 19-20/02/2004 The EU DataGrid Project Three years.

Slides:



Advertisements
Similar presentations
EU DataGrid progress Fabrizio Gagliardi EDG Project Leader
Advertisements

An open source approach for grids Bob Jones CERN EU DataGrid Project Deputy Project Leader EU EGEE Designated Technical Director
Partner Logo UK GridPP Testbed Rollout John Gordon GridPP 3rd Collaboration Meeting Cambridge 15th February 2002.
SWITCH Visit to NeSC Malcolm Atkinson Director 5 th October 2004.
Andrew McNab - Manchester HEP - 22 April 2002 EU DataGrid Testbed EU DataGrid Software releases Testbed 1 Job Lifecycle Authorisation at your site More.
An overview of the EGEE project Bob Jones EGEE Technical Director DTI International Technology Service-GlobalWatch Mission CERN – June 2004.
February 12, 2002 DØRACE 1 Grid activities in NIKHEF Willem van Leeuwen.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
LCSC October The EGEE project: building a grid infrastructure for Europe Bob Jones EGEE Technical Director 4 th Annual Workshop on Linux.
EGEE is proposed as a project funded by the European Union under contract IST The EGEE International Grid Infrastructure and the Digital Divide.
The CrossGrid project Juha Alatalo Timo Koivusalo.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 DataGrid Quality Assurance On behalf.
08/11/908 WP2 e-NMR Grid deployment and operations Technical Review in Brussels, 8 th of December 2008 Marco Verlato.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
GridPP9 – 5 February 2004 – Data Management DataGrid is a project funded by the European Union GridPP is funded by PPARC WP2+5: Data and Storage Management.
DataGrid is a project funded by the European Commission under contract IST Status and Prospective of EU Data Grid Project Alessandra Fanfani.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
EGEE is a project funded by the European Union under contract IST JRA1 Testing Activity: Status and Plans Leanne Guy EGEE Middleware Testing.
Andrew McNab - Manchester HEP - 5 July 2001 WP6/Testbed Status Status by partner –CNRS, Czech R., INFN, NIKHEF, NorduGrid, LIP, Russia, UK Security Integration.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
LCG and HEPiX Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002.
CERN – Roberta Faggian Marque, Jan Fiete Grosse-Oetringhaus GRACE General Meeting, September 2004, Brussels 1 D6.1 Integration with the European DataGrid.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
3 June 2004GridPP10Slide 1 GridPP Dissemination Sarah Pearce Dissemination Officer
WP8 Status – Stephen Burke – 30th January 2003 WP8 Status Stephen Burke (RAL) (with thanks to Frank Harris)
DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 WP1 activity, achievements and plans.
General Status of the Project Fabrizio Gagliardi Project Leader.
DataTAG Research and Technological Development for a Transatlantic Grid Abstract Several major international Grid development projects are underway at.
The European DataGrid Project Team The EU DataGrid.
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
INFSO-RI Enabling Grids for E-sciencE SA1 and gLite: Test, Certification and Pre-production Nick Thackray SA1, CERN.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
GridPP Presentation to AstroGrid 13 December 2001 Steve Lloyd Queen Mary University of London.
EGEE is a project funded by the European Union under contract IST Middleware Planning for LCG/EGEE Bob Jones EGEE Technical Director e-Science.
DataGRID WPMM, Geneve, 17th June 2002 Testbed Software Test Group work status for 1.2 release Andrea Formica on behalf of Test Group.
JRA Execution Plan 13 January JRA1 Execution Plan Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as a project funded by the European.
DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 DataGrid Project Status Fabrizio Gagliardi.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
LCG EGEE is a project funded by the European Union under contract IST LCG PEB, 7 th June 2004 Prototype Middleware Status Update Frédéric Hemmer.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
DataGrid is a project funded by the European Commission under contract IST F. Harris DataGrid EU Review 19 Feb 2004 WP8 HEP Applications Final.
EGEE MiddlewareLCG Internal review18 November EGEE Middleware Activities Overview Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as.
Status Organization Overview of Program of Work Education, Training It’s the People who make it happen & make it Work.
1CHEP2000 February 2000F. Gagliardi EU HEP GRID Project Fabrizio Gagliardi
EC Review – 01/03/2002 – WP9 – Earth Observation Applications – n° 1 WP9 Earth Observation Applications 1st Annual Review Report to the EU ESA, KNMI, IPSL,
DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 DataGrid Project Status Fabrizio Gagliardi.
EGEE Project Review Fabrizio Gagliardi EDG-7 30 September 2003 EGEE is proposed as a project funded by the European Union under contract IST
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Robin McConnell Activity Manager UEDIN (NeSC)
INFSO-RI Enabling Grids for E-sciencE EGEE general project update Fotis Karayannis EGEE South East Europe Project Management Board.
14 June 2001LHCb workshop at Bologna1 LHCb and Datagrid - Status and Planning F Harris(Oxford)
J Jensen / WP5 /RAL UCL 4/5 March 2004 GridPP / DataGrid wrap-up Mass Storage Management J Jensen
18/12/03PPD Christmas Lectures 2003 Grid in the Department A Guide for the Uninvolved PPD Computing Group Christmas Lecture 2003 Chris Brew.
Bob Jones EGEE Technical Director
Regional Operations Centres Core infrastructure Centres
EGEE Middleware Activities Overview
JRA3 Introduction Åke Edlund EGEE Security Head
EO Applications Parallel Session
Ian Bird GDB Meeting CERN 9 September 2003
Grid related projects CERN openlab LCG EDG F.Fluckiger
Long-term Grid Sustainability
EGEE support for HEP and other applications
UK GridPP Tier-1/A Centre at CLRC
Connecting the European Grid Infrastructure to Research Communities
High Energy Physics Computing Coordination in Pakistan
Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002
Grid activities in NIKHEF
gLite The EGEE Middleware Distribution
Presentation transcript:

DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 The EU DataGrid Project Three years of research and development in Grid technologies DataGrid Technical Coordinator

n° 2 Outline u DataGrid at a glance u A chronological overview u DataGrid assets u Lessons learned u Summary

n° 3 Software > 65 use cases 7 major software releases (> 60 in total) > 1,000K lines of code People 500 registered users 12 Virtual Organisations 21 Certificate Authorities >600 people trained 456 man-years of effort 170 years funded DataGrid at a glance Application Testbed ~20 regular sites > 60,000 jobs submitted ( since 09/04, release 2.0 ) Peak >1000 CPUs 6 Mass Storage Systems Scientific Applications 5 Earth Obs institutes 10 bio-informatics apps 6 HEP experiments

n° 4 Chronological overview u Project started on Jan 1 st 2001 u Early distributed testbed based on Globus 1 u CA infrastructure established u Development of higher level Grid middleware started n Workload management (“Broker”) n Data management (GDMP, edg-replica-manager, SE) n Information Services (R-GMA) n Fabric management (adopt LCFG) TB0 TB1 TB2 TB3 Jan 2001 Mar Globus 1 EDG 1 EDG 1.4EDG Assessment R&D

n° 5 Chronological overview u Decided to base development on GT 2 n Delayed rollout of TB1 (EDG v1.0) u TB1 deployed on 5 sites n CERN, NIKHEF, RAL, IN2P3, CNAF u Application evaluation started n 1 st HEP job run on TB1 on December 11 th, 2001 TB0 TB1 TB2 TB3 Jan 2001 Mar Globus 1 EDG 1 EDG 1.4EDG Assessment R&D CERN launched LCG project in September 2001

n° 6 Chronological overview u 1 st EU review successfully passed on March 1 st 2002 u Evaluation by end users revealed the need to focus on stability rather than new functionality n Project retreat in August resulted in re-focus on quality u Open Source license established in June 2002 n Served as model for globus and CrossGrid license u Start of tutorial program in July 2002 (GGF5) n Developed into a road-show with hands-on sessions; more than 600 people trained in over 25 events TB0 TB1 TB2 TB3 Jan 2001 Mar Globus 1 EDG 1 EDG 1.4EDG Assessment R&D Focus on quality

n° 7 Chronological overview u EDG technologies widely recognized: n Many sites joined testbed (up to 20) n Software used and evaluated by other projects (e.g. CrossGrid, LCG) n Collaboration with sister projects demonstrated at IST and SC u Testbed 2 (End 2002, release 1.4.x) n One of the largest Grid testbeds worldwide n Allowed first production tests by applications: s HEP monte-carlo simulation s EO grid portal developed s Many bio informatics applications TB0 TB1 TB2 TB3 Jan 2001 Mar Globus 1 EDG 1 EDG 1.4EDG Assessment R&D Focus on quality

n° 8 Evaluation of Release 1.4 (Dec 02/Jan 03) u Large increase in users u Many sites interested in joining u Pushing real jobs through system u Stability and scalability not yet satisfactory u Release 2.0 addresses the problems revealed CEs 1MB1MB 1TB1TB TOTAL: >1.5 TB 100 GB 1 TB 19 G B HEP Simulation Disk Usage (CERN) 200 GB CPU Usage CEs SEs Nb. of evts

n° 9 Chronological overview u Successfully passed 2 nd annual EU review on February 4-5 u Shortcomings identified in application tests adressed: n WMS re-factored n RLS introduced n Data management re-factored n R-GMA introduced u Testbed 3 (release 2.x) n Advanced functionality, better scalability and reliability n 2.0 released end of August n 2.1 released in November TB0 TB1 TB2 TB3 Jan 2001 Mar Globus 1 EDG 1 EDG 1.4EDG 2.0/ Assessment R&D Focus on quality Stabilization Completion of technical work n Storage Element (SE) introduced n VOMS based security n Fabric monitoring n Upgrade underlying software (move to VDT managed releases of Globus and CondorG

n° 10 Chronological overview u LCG deployed many components of EDG 2.0 in their LCG-1 service (started summer 2003) and subsequently EDG 2.1 components for LCG-2 (early 2004) u Many other Grid projects started to use EDG software in 2003: n Grace, grid.it, DutchGrid, UK e-Science programme, CERN’s openlab, etc. TB0 TB1 TB2 TB3 Jan 2001 Mar Globus 1 EDG 1 EDG 1.4EDG 2.0/ Assessment R&D Focus on quality Stabilization Completion of technical work

n° 11 DataGrid assets u Large scale testbed continuously available throughout the project duration n Have gone further than any other project in providing a continuous, large-scale grid facility u CA Infrastructure (21 CAs worldwide) u Innovative middleware n Resource Broker n Replica Location Service and layered data management tools (Replica Manager & Optimizer) n R-GMA Information and Monitoring System n Automated configuration and installation tools n Access to diverse mass storage systems (StorageElement) n VOMS security model u Distributed team of people across Europe that can work together effectively to produce concrete results u Application groups are an integral part of the project contributing to all aspects of the work

n° 12 Main lessons learned u Applications need to be involved in all phases of the project n Grid middleware is relatively new and, despite all efforts, not yet “shrink-wrap” quality – requires skilled people to be used efficiently n Middleware prototypes need to be available for application testing early s Caveat: prototypes tend to stay longer than expected – more advanced software might be delayed. u Cross-WP activities are essential and need to be coordinated n Application working group, architecture task force, integration team, security group, tutorial team, quality group. u A sequence of (distributed) testbeds is needed n Developers need their own distributed testbed to test bleeding edge software n Managed integration/certification/application testbeds – eventually production infrastructure u Site certification and validation needs to be automated and run regularly n Misconfigured sites may cause many failures u Security needs to be an integrated part from the very beginning n Adding security to existing systems is hard u Prompt hiring and retention of Personnel is critical

n° 13 Summary u DataGrid as Grid Technology Innovator n High level middleware developed in many areas (workload and data mgmt, information services, fabric mgmt) u DataGrid as Technology Provider n Software taken up by many other Grid projects (LCG, Grace, CrossGrid, grid.it, DutchGrid, UK e-science, openlab, …) n Extensive training in more than 25 tutorials held in US, Europe, AP n Substantial contributions to standardization bodies like GGF u DataGrid as Demonstrator n Successful evaluation of Grid technologies as production platform by High Energy Physics, Earth Observation, and Bioinformatics applications. This paved the way towards u Grid as next generation production infrastructure 