Presentation is loading. Please wait.

Presentation is loading. Please wait.

Grids: Why and How (you might use them) J. Templon, NIKHEF VLV T Workshop NIKHEF 06 October 2003.

Similar presentations


Presentation on theme: "Grids: Why and How (you might use them) J. Templon, NIKHEF VLV T Workshop NIKHEF 06 October 2003."— Presentation transcript:

1 Grids: Why and How (you might use them) J. Templon, NIKHEF VLV T Workshop NIKHEF 06 October 2003

2 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 2 Information I intend to transfer u Why are Grids interesting? Grids are solutions so I will spend some time talking about the problem and show how Grids are relevant. Solutions should solve a problem. u What are Computational Grids? u How are we (high-energy physicists) using Grids? What tools are available? u How might they be of interest to Neutrino Telescopists?

3 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 3 Our Problem  Place event info on 3D map  Trace trajectories through hits  Assign type to each track  Find particles you want  Needle in a haystack!  This is “relatively easy” case

4 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 4 More complex example

5 interactive physics analysis batch physics analysis batch physics analysis detector event summary data raw data event reprocessing event reprocessing event simulation event simulation analysis objects (extracted by physics topic) Data Handling and Computation for Physics Analysis event filter (selection & reconstruction) event filter (selection & reconstruction) processed data

6 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 6 Computational Aspects u To reconstruct and analyze 1 event takes about 90 seconds u Most collisions recorded are not what we’re looking for – maybe want as few as one out of a million. But we have to check them all! u Analysis program needs lots of calibration; determined from inspecting results of first pass. u  Each event will be analyzed several times!

7 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 7 online system multi-level trigger filter out background reduce data volume level 1 - special hardware 40 MHz (40 TB/sec) level 2 - embedded processors level 3 - PCs 75 KHz (75 GB/sec) 5 KHz (5 GB/sec) 100 Hz (100 MB/sec) data recording & offline analysis One of the four LHC detectors

8 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 8 Computational Implications (2) u 90 seconds per event to reconstruct and analyze u 100 incoming events per second u To keep up, need either: n A computer that is nine thousand times faster, or n nine thousand computers working together u Moore’s Law: wait 20 years and computers will be 9000 times faster (we need them in 2006!) u Four LHC experiments plus extra work: need >50k computers u Grids: make large numbers of computers work together

9 So What are Grids Anyway??

10 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 10 A bunch of computers is not a Grid u HEP has experience with a couple thousand computers in one place Putting them all in one spot leads to traffic jams CERN can’t pay for it all Someone else controls your resources Can you use them for other (non-CERN) work? BUT

11 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 11 Distribute computers like users u Most of computer power not at CERN n need to move users’ jobs to available CPU n Better if jobs are sent to CPUs “close” to data they consume u Need computing resource management n How to connect users with available power? u Need data storage management n How to distribute? n What about copies? (Lots of people want access to same data) u Need authorization & authentication for access to resources!

12 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 12 Grids are Information Information System Information System A-Grid B-Grid

13 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 13 What does the Grid do for you? u You submit your work, and the Grid n Finds convenient places for it to be run n Organises efficient access to your data s Caching, migration, replication n Deals with authentication to the different sites that you will be using n Interfaces to local site resource allocation mechanisms, policies n Runs your jobs n Monitors progress and recovers from problems n Tells you when your work is complete u If your task allows, Grid can also decompose your work into convenient execution units based on available resources, data distribution

14 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 14 How it works User Interface User submits job description Workload ManagerInformation System Where are resources to do this job? Data Management Service Publish Resource and Service Information Locate Copies of Requested Data

15 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 15 What’s There Now? u Job Submission n Marriage of Globus, Condor-G, EDG Workload Manager n Latest version reliable at 99% level u Information System n New System (R-GMA) – very good information model, implementation still evolving n Old System (MDS) – poor information model, poor architecture, but hacks allow 99% “uptime” u Data Management n Replica Location Service – convenient and powerful system for locating and manipulating distributed data; mostly still user-driven (no heuristics) u Data Storage n “Bare gridFTP server” – reliable but mostly suited to disk-only mass store n SRM – no mature implementations

16 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 16 VLV T Reconstruction Model Mediterranean 10 Gb/s L1 Trigger Raw Data Cache > 1 TB > 1000 CPUs  Distributed Event Database?  Auto Distributed Files?  Single Mass Store + “Thermal Grid”? StreamService 1 Mb/s This needs work!! 2 Gbit/s is not a problem but you want many x 80 Gbit/s! Dual 1TB Circular Buffers? All connections through single pipe probably bad. Dedicated line to better-connected “redistribution center”? Grid useful here – get a lot but only when you need it! Grid data model applicable, but maybe not computational model …

17 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 17 Networking Likely OK, Surely Welcome! u History of Internet Land Speed Record n 930 Mbit/sec (NIKHEF/California) 1 year ago n 2.2 Gbit/sec (CERN) six months ago n 5 Gbit/sec today (tests last week from NIKHEF) u This rapid advance results in network people looking for groups who can fill the pipes!!

18 Jeff Templon – NIKHEF VLV T Workshop, Amsterdam, 2003.10.06 - 18 Conclusions u Grids are working now for workload management in HEP u Data model of VLV T matches Grid data model quite well u Gamma-Ray burst scenario matches Grid computational paradigm well u Network demands appear feasible and will be welcomed u Sounds like a lot of fun!


Download ppt "Grids: Why and How (you might use them) J. Templon, NIKHEF VLV T Workshop NIKHEF 06 October 2003."

Similar presentations


Ads by Google