EGEE-III Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Andreas Gellrich, DESY The Grid – The Future of Scientific Computing


1 The Grid – The Future of Scientific Computing
Andreas Gellrich, DESY
IT Training, DESY, Hamburg, 28.10.2008
EGEE-III: Enabling Grids for E-sciencE, www.eu-egee.org
EGEE and gLite are registered trademarks

2 Enabling Grids for E-sciencE, EGEE-III. Andreas Gellrich, DESY: Introduction to Grids and the EGEE project
Collaborating e-Infrastructures
Potential for linking ~80 countries by 2008

3 The need for Grid in HEP
CERN: the world's largest particle physics laboratory
Particle physics requires special tools to create and study new particles: accelerators and detectors
[Figure labels: Mont Blanc (4810 m), Downtown Geneva]
Large Hadron Collider (LHC):
–One of the most powerful instruments ever built to investigate matter
–4 experiments: ALICE, ATLAS, CMS, LHCb
–27 km circumference tunnel
–Due to start up mid 2008

4 Highlights of EGEE-II
>200 VOs from several scientific domains
–Astronomy & Astrophysics
–Civil Protection
–Computational Chemistry
–Comp. Fluid Dynamics
–Computer Science/Tools
–Condensed Matter Physics
–Earth Sciences
–Fusion
–High Energy Physics
–Life Sciences
Further applications under evaluation
98k jobs/day
Applications have moved from testing to routine and daily usage
~80-90% efficiency

5 Biomedical applications
Biomedicine is also a pilot application area
More than 20 applications deployed and being ported
Three sub domains
–Medical image processing
–Biomedicine
–Drug discovery
Use Grid as platform for collaboration (don’t need same massive processing power or storage as HEP)

6 Applications Example
Grid-enabled drug discovery process for neglected diseases
–In silico docking
  ▪ compute probability that potential drugs dock with target protein
–To speed up and reduce cost to develop new drugs
WISDOM (World-wide In Silico Docking On Malaria)
–First biomedical data challenge
–46 million ligands docked in 6 weeks
–1 TB of data produced
–1000 computers in 15 countries
  ▪ Equivalent to 80 CPU years
Second data challenge on Avian flu in April 2006
–300,000 possible drug components tested
–8 different targets
–2000 computers used for 4 weeks
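The WISDOM figures on this slide can be cross-checked with a little arithmetic. The sketch below is only a rough illustration: it assumes each "computer" counts as one CPU and that a year has 365.25 days, neither of which the slide states, and the function names are made up for this example.

```python
# Rough consistency check of the WISDOM numbers quoted on the slide.
# Assumptions (not from the slide): one CPU per computer, 365.25 days per year.

DAYS_PER_YEAR = 365.25
SECONDS_PER_DAY = 86_400


def machine_years(computers: int, weeks: float) -> float:
    """Wall-clock capacity if every computer ran for the whole period."""
    return computers * weeks * 7 / DAYS_PER_YEAR


def implied_utilisation(cpu_years: float, computers: int, weeks: float) -> float:
    """Fraction of the available capacity that the quoted CPU years represent."""
    return cpu_years / machine_years(computers, weeks)


def cpu_seconds_per_item(cpu_years: float, items: int) -> float:
    """Average CPU time spent per docked ligand."""
    return cpu_years * DAYS_PER_YEAR * SECONDS_PER_DAY / items


# WISDOM: 1000 computers, 6 weeks, 80 CPU years, 46 million ligands
print(f"capacity: {machine_years(1000, 6):.0f} machine-years")
print(f"implied utilisation: {implied_utilisation(80, 1000, 6):.0%}")
print(f"CPU time per docking: {cpu_seconds_per_item(80, 46_000_000):.0f} s")

# Avian flu challenge: 2000 computers for 4 weeks
print(f"avian flu capacity: {machine_years(2000, 4):.0f} machine-years")
```

Under these assumptions the quoted 80 CPU years correspond to roughly two thirds of the nominal capacity of 1000 machines over 6 weeks, which is at least plausible for a shared infrastructure.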

7 Astroparticle physics
Major Atmospheric Gamma Imaging Cherenkov telescope (MAGIC)
–Origin of VHE γ-rays (30 GeV – TeV)
  ▪ Active Galactic Nuclei (AGN)
  ▪ Supernova Remnants
  ▪ Unidentified EGRET sources
  ▪ Gamma Ray Bursts
–Huge hadronic background
  ▪ MC simulations
  ▪ to simulate the background of one night, 70 CPUs (P4 2GHz) need to run for 19200 days
–Observation data are big too!
MAGIC Grid
–Use three national Grid centres as backbone
–All are members of EGEE
Work to build a second telescope is currently in progress
→ Towards a virtual observatory for VHE γ-rays

8 Astroparticle physics
PLANCK satellite mission
–Measure cosmic microwave background (CMB)
  ▪ At even higher resolution than previous missions
–Launch in 2008; duration >1 year
Application
–N simulations of the whole Planck/LFI mission
  ▪ different cosmological and instrumental parameters
–Full sky map for frequencies 30 up to 850 GHz (two complete sky surveys)
–22 channels for LFI, 48 for HFI
> 12 times faster!!! but ~5% failure rate

9 Computational Chemistry
GEMS (Grid Enabled Molecular Simulator) application
–Calculation and fitting of electronic energies of atomic and molecular aggregates (using high level ab initio methods)
–The use of statistical kinetics and dynamics to study chemical processes
Virtual Monitors
–Angular distributions
–Vibrational distributions
–Rotational distributions
–Many body systems
End-User applications
–Nanotubes
–Life sciences
–Statistical Thermodynamics
–Molecular Virtual Reality
[Figure panels: Angular distribution, Rotational distribution, Vibrational distribution, Many body system]

10 Fusion
Large Nuclear Fusion installations
–E.g. International Thermonuclear Experimental Reactor (ITER)
–Distributed data storage and handling needed
–Computing power needed for
  ▪ Making decisions in real time
  ▪ Solving kinetic transport → particle orbits
  ▪ Stellarator optimization → magnetic field to contain the plasma

11 Earth Science Applications
Community
–Many small groups that aggregate for projects (and separate afterwards)
The Earth
–Complex system
–Independent domains with interfaces
  ▪ Solid Earth – Ocean – Atmosphere
–Physics, chemistry and/or biology
Applications
–Earth observation by satellite
–Seismology
–Hydrology
–Climate
–Geosciences
–Pollution
–Meteorology, Space Weather
–Mars Atmosphere
–Database Collection

12 Earthquake analysis
Seismic software application determines: Epicentre, magnitude, mechanism
→ May make it possible to predict future earthquakes
→ Assess potential impact on specific regions
Analysis of Indonesian earthquake (28 March 2005)
–Data from French seismic sensor network GEOSCOPE transmitted to IPGP within 12 hours after the earthquake
–Solution found within 30 hours after earthquake occurred
  ▪ 10 times faster on the Grid than on local computers
–Results
  ▪ Not an aftershock of December 2004 earthquake
  ▪ Different location (different part of fault line further south)
  ▪ Different mechanism
Rapid analysis of earthquakes is important for relief efforts

13 Industrial applications
EGEODE
–Industrial application from Compagnie Générale de Géophysique running on EGEE infrastructure
  ▪ Seismic processing platform
  ▪ Based on industrial application Geocluster© used at CGG
  ▪ Being ported to EGEE for Industry and Academia
OpenPlast project
–French R&D programme to develop and deploy Grid platform for plastic industry (SMEs)
–Based on experience from EGEE (supported by CS)
–Next: Interoperability with other Grids

14 HEP
High Energy Physics is a pilot application domain for EGEE
–Large datasets
–Large computing requirements
→ Major need for Grid technology to support distributed communities
Support for LHC experiments through LHC Computing Grid (LCG)
–ATLAS, CMS, LHCb, ALICE
Also support for other major international HEP experiments
–BaBar (US)
–CDF (US)
–DØ (US)
–H1 and ZEUS (Germany)

15 LCG Tier Model
[Diagram: tiered computing model; the labels below are what survives of it]
Tier labels: Tier 0, Tier 1, Tier 2, Tier 3
Nodes: Online System; Offline Farm (~20 TIPS); CERN Computer Centre (>20 TIPS); GridKa Regional Centre; US Regional Centre; French Regional Centre; Italian Regional Centre; Tier2 Centre (~1 TIPS, shown twice); DESY (~1 TIPS, Tier 2); Institute (~0.25 TIPS); Physics data cache; Workstations
Link rates: ~PBytes/sec; ~100 MBytes/sec; ~Gbits/sec or Air Freight; ~Gbits/sec; 100 - 1000 Mbits/sec
One bunch crossing per 25 ns
100 triggers per second
Each event is ~1 Mbyte
Physicists work on analysis “channels”
Each institute has ~10 physicists working on one or more channels
Data for these channels should be cached by the institute server
1 TIPS = 25,000 SpecInt95
PC (1999) = ~15 SpecInt95
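To get a feel for the units on this slide, the conversions can be written out as a small sketch. The constants are the ones quoted above; the function names are invented for illustration, and "1 Mbyte" is simply treated as one unit so that the output is in MBytes/sec.

```python
# Unit conversions for the tier model slide (illustrative helper names).

SPECINT95_PER_TIPS = 25_000      # slide: 1 TIPS = 25,000 SpecInt95
SPECINT95_PER_PC_1999 = 15       # slide: PC (1999) = ~15 SpecInt95


def pcs_equivalent(tips: float) -> float:
    """Number of 1999-era PCs matching a given capacity in TIPS."""
    return tips * SPECINT95_PER_TIPS / SPECINT95_PER_PC_1999


def crossing_rate_hz(spacing_ns: float) -> float:
    """Bunch crossings per second for a given spacing in nanoseconds."""
    return 1e9 / spacing_ns


def trigger_output_mbytes_per_sec(triggers_per_sec: float, event_mbytes: float) -> float:
    """Data rate after the trigger, in MBytes/sec."""
    return triggers_per_sec * event_mbytes


print(f"1 TIPS    ~ {pcs_equivalent(1):.0f} PCs (1999)")
print(f"0.25 TIPS ~ {pcs_equivalent(0.25):.0f} PCs (1999)")
print(f"20 TIPS   ~ {pcs_equivalent(20):.0f} PCs (1999)")
print(f"crossing rate: {crossing_rate_hz(25):.0f} per second")
print(f"trigger output: {trigger_output_mbytes_per_sec(100, 1):.0f} MBytes/sec")
```

The 25 ns spacing works out to 40 million crossings per second, and 100 triggers per second at ~1 Mbyte each gives the ~100 MBytes/sec link rate that appears among the diagram labels.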

16 LHC Data
40 million collisions per second
After filtering, 100 collisions of interest per second
A Megabyte of data for each collision = recording rate of 0.1 Gigabytes/sec
10^10 collisions recorded each year
  ▪ When LHC starts operation: will generate ~15 Petabytes/year of data*
*corresponding to more than 20 million CDs!
[Figure: height comparison. Mt. Blanc (4.8 km); Concorde (15 km); CD stack with 1 year LHC data (~20 km); Balloon (30 km)]
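The chain of numbers on this slide can be reproduced in a few lines. This is a back-of-the-envelope sketch, assuming decimal units (1 MB = 10^6 bytes, 1 PB = 10^15 bytes) and a 700 MB CD; neither assumption is stated on the slide, and the function names are illustrative.

```python
# Back-of-the-envelope check of the LHC data figures.
# Assumptions (not from the slide): decimal units, 700 MB per CD.

COLLISIONS_PER_SEC = 40_000_000   # slide: 40 million collisions per second
RECORDED_PER_SEC = 100            # slide: 100 collisions of interest per second
EVENT_SIZE_BYTES = 1_000_000      # slide: a Megabyte per collision
EVENTS_PER_YEAR = 10**10          # slide: 10^10 collisions recorded each year


def filter_rejection_factor() -> float:
    """How many collisions are discarded for each one kept."""
    return COLLISIONS_PER_SEC / RECORDED_PER_SEC


def recording_rate_gb_per_sec() -> float:
    """Recording rate after filtering, in Gigabytes/sec."""
    return RECORDED_PER_SEC * EVENT_SIZE_BYTES / 1e9


def raw_volume_pb_per_year() -> float:
    """Volume of recorded collisions alone, in Petabytes/year."""
    return EVENTS_PER_YEAR * EVENT_SIZE_BYTES / 1e15


def cds_needed(volume_pb: float, cd_capacity_bytes: float = 700e6) -> float:
    """Number of CDs needed to hold a given volume."""
    return volume_pb * 1e15 / cd_capacity_bytes


print(f"rejection factor: {filter_rejection_factor():,.0f}")
print(f"recording rate: {recording_rate_gb_per_sec()} GB/s")
print(f"recorded collisions: {raw_volume_pb_per_year()} PB/year")
print(f"CDs for 15 PB: {cds_needed(15) / 1e6:.1f} million")
```

Note that 10^10 recorded collisions at one Megabyte each account for 10 Petabytes/year; the slide's ~15 Petabytes/year figure is larger, and the sketch does not attempt to explain the difference. The CD count for 15 Petabytes does come out above 20 million, in line with the footnote.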

17 LHC Computing Grid
Aim: to develop, build and maintain a distributed computing environment for the storage and analysis of data from the four LHC experiments
  ▪ Ensure the computing service
  ▪ … and common application libraries and tools
“Tier” infrastructure with Tier-0 at CERN, 11 Tier-1 centres and more than 100 Tier-2, and Tier-3 centres
Phase I – 2002-05 – Development & planning
Phase II – 2006-2008 – Deployment & commissioning of the initial services
→ LCG is not a development project – it relies on EGEE (and other Grid projects) for Grid middleware development, application support, Grid operation and deployment

18 HEP success stories
Fundamental activity in preparation of LHC start up
–Physics
–Computing systems
Examples:
–LHCb: ~700 CPU years in 2005 on the EGEE infrastructure
–ATLAS: over 20,000 jobs per day
  ▪ Comprehensive analysis: see S. Campana et al., “Analysis of the ATLAS Rome Production experience on the EGEE Computing Grid”, e-Science 2005, Melbourne, Australia
–A lot of activity in all involved applications (including as usual a lot of activity within non-LHC experiments like BaBar, CDF and D0)
[Figure panels: ATLAS, LHCb]

19 Monitoring

