
1 Wouter Verkerke, NIKHEF Status of Software & Computing Wouter Verkerke

2 Wouter Verkerke, NIKHEF Software context: CSC notes
ATLAS is currently producing new samples for CSC notes
– Last major physics production round was for the 2005 Rome workshop (simulation in release 9, reconstruction in release 10)
Plan for CSC notes
– Event generation in 12.0.3 and 12.0.4
– Simulation in release 12.0.31
– Reconstruction in release 12.0.4 → 12.0.5
– Some samples need new reconstruction in release 13 (e.g. H → γγ needs improved egamma reco with brem recovery that will only be available in release 13)
Major new things
– New physics samples
– Increased level of realism in simulation
– Improved reconstruction and full trigger information for trigger-aware analysis

3 Wouter Verkerke, NIKHEF Physics Coordination 17/01/07 – Status of software
Validation of 12.0.31 for simulation production completed
– Bulk simulation underway
– Initial problems reading 12.0.4 EvGen data in 12.0.31 (effectively a forward-compatibility requirement). Needed because 12.0.4 contains relevant updates of MC generators (e.g. fix in AlpGen MLM matching)
– Fixed in 12.0.31(8), which can read both 12.0.31 EvGen and 12.0.4 EvGen data
– New: another fix, 12.0.31(9), was required as 12.0.31(8) did not work on all OSG sites
Processing of the 12.0.4 validation sample for reconstruction did not go very smoothly
– Release 12.0.4(1) built around the time of the last PhysCoord meeting
– Very large number of tags went in last minute to make reco and trigger work
– Managed to get patch 12.0.4(2) with fixes into production just before the holiday shutdown
– Release 12.0.4(2) is the first patch in which code was patched in addition to scripts. Used this mechanism to fix (among others) a Muon software problem

4 Wouter Verkerke, NIKHEF Physics Coordination 17/01/07 – Status of software
– Note that the strategy of code patching has important implications: official production output will differ from private running over the 12.0.4 kit or 12.0.4 AFS release if the patch is not explicitly applied. This presents issues for people doing private production. Will not go into details now because the point has become moot for the moment
– Production team managed to run part of the validation sample over the holiday, but the success rate was low (30%)
Main problems in 12.0.4(2) production
– ESDs were too large because, among smaller problems, too many (trigger) track collections were written out. This was not intentional and caused the ESD size to inflate to 2.6 MB/evt on average. This causes reco production jobs (at 1000 events each) to crash because the ESD output files exceed the 2 GB limit. Problem sidestepped by lowering the event count from 1000 to 250 (see the arithmetic sketch below)
– Also a large number of (masked = ignored) ERROR messages in jobs (mostly from the muon code)
– But despite the low success rate, managed to have a first look at the pilot sample for validation
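
The file-size problem above is simple arithmetic; a minimal back-of-the-envelope sketch in Python is shown here. The 2.6 MB/evt figure and the 2 GB limit are taken from the slide; everything else is illustrative, not ATLAS production code.

import math

# Numbers quoted on the slide
ESD_SIZE_PER_EVENT_MB = 2.6        # average ESD size per event in 12.0.4(2)
FILE_LIMIT_MB = 2 * 1024           # 2 GB output-file limit

# A 1000-event reco job writes ~2600 MB of ESD, exceeding the limit
size_1000_events = 1000 * ESD_SIZE_PER_EVENT_MB

# Maximum events that fit under the limit at the inflated ESD size
max_events = math.floor(FILE_LIMIT_MB / ESD_SIZE_PER_EVENT_MB)

print(f"1000-event job writes ~{size_1000_events:.0f} MB (limit {FILE_LIMIT_MB} MB)")
print(f"At 2.6 MB/evt at most ~{max_events} events fit under the limit,")
print("hence the job size was dropped to 250 (and later 500) events per job.")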

5 Wouter Verkerke, NIKHEF Physics Coordination 17/01/07 – Status of software
Validation of the 12.0.4(2) sample revealed a couple of problems
– Most notably, it turned out that the fix for MuonBoy applied in the code patch had not worked
  ⇒ Issue traced back to the use of static libraries
  ⇒ Thus it was decided to abandon 12.0.4(3) in favor of a cold build of release 12.0.5
– CBNT(AA) not produced in 12.0.4(2). This was traced back to an issue in the job transformation scripts and was fixed for the next release
– Note that old-style CBNT has been abandoned in favor of the new Athena-Aware CBNT starting with 12.0.4
– Also quite a few issues with trigger code were identified (collections not filled, clashes between trigger slices when run all together)
– Otherwise most things looked OK in validation (e.g. Jet/ETmiss, EM clusters, Muon efficiencies, various trigger slices with post-12.0.4 fixes)

6 Wouter Verkerke, NIKHEF Physics Coordination 17/01/07 – Software status
Now building release 12.0.5
– Built in phases; started yesterday (17/1/07), will complete by end of week
– Again quite a bit of new code (>50 collected tags, all corresponding to bug reports, with a stringent acceptance policy)
– As soon as the build is complete, will deploy on the grid, resubmit the validation sample and restart the validation cycle
– Note that the reco job size has been permanently lowered from 1000 events to 500 events due to the increased ESD size
– Best case scenario (i.e. zero problems in 12.0.5(1)): ready for bulk submission around Feb 1st
Latest news:
– Build of the 12.0.5(1) kit delayed until today due to the collection of tags to fix essential problems in the trigger, and last night's power outage
– Updated best case scenario: bulk submission around Feb 6th

7 Wouter Verkerke, NIKHEF Computing context – Exercising distributed computing
We mostly analyze data now by retrieving it from either Castor or via DQ2 and running over it locally (sketched below)
– Copy AOD dataset with dq2_get to local disk
– Make ntuple of AOD dataset on local machine
– Analyze ntuples on local machine
This model does not scale to very large numbers of events (e.g. 1M events to be analyzed); ATLAS Computing does not support this mode of analysis for large numbers of events
– Need to exercise the distributed computing model
– Make ntuples of AOD dataset on the GRID
– Analyze ntuples on local machine
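
As a minimal sketch of the first step of this local workflow: the dataset name below is the example used on slide 24, and the exact dq2_get options (which were version- and site-dependent) are deliberately omitted; this is an illustration, not a recommended script.

import subprocess

# Hypothetical local-analysis workflow: copy an AOD dataset to local disk with
# dq2_get, then run a user ntuple maker over the downloaded files.
dataset = "csc11.005300.PythiaH130zz4l.recon.AOD.v11004103"

# Step 1: copy the AOD dataset to the current directory
# (dq2_get options omitted; they depended on the DQ2 client version and site)
subprocess.run(["dq2_get", dataset], check=True)

# Steps 2-3: make and analyze ntuples locally, e.g. by running Athena with a
# user jobOptions file over the downloaded files (not shown here).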

8 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – Facilities at CERN
Tier-0:
– Prompt first-pass processing on express/calibration & physics streams with old calibrations – calibration, monitoring
– Calibration tasks on prompt data
– 24-48 hours later, process full physics data streams with reasonable calibrations
⇒ Implies large data movement from T0 → T1s
CERN Analysis Facility:
– Access to ESD and RAW/calibration data on demand
– Essential for early calibration
– Detector optimisation/algorithmic development

9 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – Facilities Away from CERN
Tier-1:
– Reprocess 1-2 months after arrival with better calibrations
– Reprocess all resident RAW at year end with improved calibration and software
⇒ Implies large data movement T1 ↔ T1 and T1 → T2
~30 Tier 2 centers distributed worldwide:
– Monte Carlo simulation, producing ESD, AOD ⇒ Tier 1 centers
– On-demand user physics analysis of shared datasets
– Limited access to ESD and RAW data sets
– Simulation ⇒ implies ESD, AOD → Tier 1 centers
Tier 3 centers distributed worldwide:
– Physics analysis
– Data private and local – summary datasets

10 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – Headline Comments (2)
The Tier 1s and Tier 2s are collective – if the data is on disk, you (T2) or your group (T1) can run on it
For any substantial data access, jobs go to the data
– Users currently think the data goes to the job! This cannot be sustained
– When data is found to be needed later, anything above ~10 GB needs to be requested, not grabbed
– ESD can be delivered in a few hours
– RAW on tape may take ~a week
Data for Tier 3s should be pulled from Tier 2s using ATLAS tools
– Tier 3s need to ensure adequate networking

11 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – Analysis computing model
Analysis model broken into two components
– @ Tier 1: scheduled central production of augmented AOD, tuples & TAG collections from ESD
  ⇒ Derived files moved to other T1s and to T2s
– @ Tier 2: on-demand user analysis of augmented AOD streams, tuples, new selections etc., and individual user simulation and CPU-bound tasks matching the official MC production
  ⇒ Modest job traffic between T2s
  ⇒ Tier 2 files are not private, but may be for small sub-groups in physics/detector groups
  ⇒ Limited individual space, copy to Tier 3s

12 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – Group Analysis
Group analysis will produce
– Deep copies of subsets
– Dataset definitions
– TAG selections
Characterised by access to full ESD and perhaps RAW
– This is resource intensive
– Must be a scheduled activity
– Can back-navigate from AOD to ESD at the same site
– Can harvest small samples of ESD (and some RAW) to be sent to Tier 2s
– Must be agreed by physics and detector groups
Big Trains etc.
– Efficiency and scheduling gains if analyses are blocked into a 'big train'; some form of co-ordination is needed
– Idea has been around for a while, already used in e.g. heavy ions
– Each wagon (group) has a wagon master (production manager)
– Must ensure it will not derail the train
– Train must run often enough (every ~2 weeks?)
– Trains can also harvest ESD and RAW samples for Tier 2s (but we should try to anticipate and place these subsets)

13 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – T1 Data
Tier 1 cloud (10 sites of very different size) contains:
– 10% of RAW on disk, the rest on tape
– 2 full copies of the current ESD on disk and 1 copy of the previous one
– A full AOD/TAG set at each Tier 1
– A full set of group DPD
Access is scheduled, through groups

14 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – On-demand Analysis
Restricted to Tier 2s and CAF
– Can specialise some Tier 2s for some groups
– ALL Tier 2s are for ATLAS-wide usage
Most ATLAS Tier 2 data should be 'placed', with lifetime ~ months
– The job must go to the data
– This means Tier 2 bandwidth is vastly lower than if you pull data to the job
– Back navigation requires AOD/ESD to be co-located
Role- and group-based quotas are essential
– Quotas to be determined per group, not per user
Data selection
– Over small samples with the Tier-2 file-based TAG and the AMI dataset selector
– TAG queries over larger samples by batch job to the database TAG at Tier-1s/large Tier 2s
What data?
– Group-derived formats
– Subsets of ESD and RAW, pre-selected or selected via a Big Train run by the working group
– No back-navigation between sites; formats should be co-located

15 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – T2 Disk
Tier 2 cloud (~30 sites of very, very different size) contains:
Some ESD and RAW
– In 2007: 10% of RAW and 30% of ESD in the Tier 2 cloud
– In 2008: 30% of RAW and 150% of ESD in the Tier 2 cloud
– In 2009 and after: 10% of RAW and 30% of ESD in the Tier 2 cloud
Additional access to ESD and RAW in the CAF
– 1/18 of RAW and 10% of ESD
10 copies of the full AOD on disk
A full set of official group DPD
Lots of small group DPD and user data
Access is 'on demand'

16 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – Optimised Access
RAW, ESD and AOD will be streamed to optimise access
The selection and direct access to individual events is via a TAG database
– TAG is a keyed list of variables per event
– Overhead of file opens is acceptable in many scenarios
– Works very well with pre-streamed data
Two roles
– Direct access to an event in a file via a pointer
– Data-collection definition function
Two formats, file and database
– Now believe large queries require the full database
  Multi-TB relational database
  Restricts it to Tier 1s and large Tier 2s/CAF
– File-based TAG allows direct access to events in files (pointers)
  Ordinary Tier 2s hold the file-based primary TAG corresponding to locally-held datasets
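
To make the two TAG roles concrete, here is a purely schematic Python sketch (not ATLAS TAG/POOL code): a keyed list of per-event variables, each carrying a pointer back to its event (here just a file name and entry number), so a selection over the TAG both defines an event collection and gives direct access to the selected events. All names and numbers are invented for illustration.

from dataclasses import dataclass

@dataclass
class TagRecord:
    run: int
    event: int
    n_electrons: int
    missing_et: float      # GeV
    file_ref: str          # pointer: file holding the event
    entry: int             # pointer: entry number inside that file

tag_table = [
    TagRecord(5200, 1, 2, 87.3, "AOD._0001.pool.root", 17),
    TagRecord(5200, 2, 0, 12.1, "AOD._0001.pool.root", 18),
    TagRecord(5200, 3, 1, 105.6, "AOD._0002.pool.root", 4),
]

# Role 1: "data collection definition" - select events without opening AOD files
selection = [t for t in tag_table if t.n_electrons >= 1 and t.missing_et > 50.0]

# Role 2: "direct access via pointer" - only the referenced files/entries are read
for t in selection:
    print(f"read entry {t.entry} from {t.file_ref} (run {t.run}, event {t.event})")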

17 Wouter Verkerke, NIKHEF R.Jones PC Dec 2006 – Closing Comment
We desperately need to exercise the analysis model
– With real distributed analysis
– With streamed ('impure') datasets
– With the data realistically placed (several copies available, not being pulled locally)

18 Wouter Verkerke, NIKHEF Exercising Computing Model at NIKHEF
Two major aspects we need to learn
– Sending jobs to the data
– Working with streamed data
Sending jobs to data the official way: Ganga
– Ganga is a job-submission frontend to the grid
– Replaces low-level tools like edg-job-submit
– Ganga knows where the data is ⇒ jobs are automatically sent to the data
AOD replication tests to all T1 sites are about to start
– Need to learn Ganga soon
– Tutorials exist on the wiki
– Propose to have a collective session at NIKHEF soon

19 Wouter Verkerke, NIKHEF Ganga basics
Ganga is an easy-to-use frontend for job definition and management
– Allows simple switching between testing on a local batch system and large-scale data processing on distributed resources (Grid)
– Developed in the context of ATLAS and LHCb
For ATLAS, it has built-in support for applications based on the Athena framework, for JobTransforms, and for the DQ2 data-management system
– Component architecture readily allows extension
– Implemented in Python
Strong development team, meaning strong user support
– F.Brochu (Cambridge), U.Egede (Imperial), J.Elmsheuser (München), K.Harrison (Cambridge), H.C.Lee (ASCC), D.Liko (CERN), A.Maier (CERN), J.T.Moscicki (CERN), A.Muraru (Bucharest), V.Romanovsky (IHEP), A.Soroko (Oxford), C.L.Tan (Birmingham)
– Contributions past and present from many others

20 Wouter Verkerke, NIKHEF Ganga job abstraction
A job in Ganga is constructed from a set of building blocks, not all required for every job:
– Application – what to run
– Backend – where to run
– Input Dataset – data read by the application
– Output Dataset – data written by the application
– Splitter – rule for dividing the job into subjobs
– Merger – rule for combining outputs
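
A minimal sketch of how these building blocks come together in the Ganga Python interface, using the Job, Executable, LSF and LCG components that also appear on slide 23. The ArgSplitter and TextMerger components used for the splitter and merger slots are assumptions not shown in these slides, and the whole snippet assumes an interactive Ganga session where these classes are predefined.

# Sketch of the Ganga job-abstraction blocks (assumed Ganga session)
j = Job()

j.application = Executable()        # what to run
j.application.exe = "echo"
j.backend = LSF()                   # where to run (LCG() would target the grid)

# Splitter: rule for dividing into subjobs (here: one subjob per argument set;
# ArgSplitter is an assumed generic component name)
j.splitter = ArgSplitter(args=[["sample_%d" % i] for i in range(3)])

# Merger: rule for combining subjob outputs (TextMerger assumed; here it would
# concatenate the stdout of the subjobs)
j.merger = TextMerger(files=["stdout"])

# Input/output datasets are experiment-specific (e.g. DQ2 datasets for ATLAS)
# and are omitted in this generic sketch.
j.submit()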

21 Wouter Verkerke, NIKHEF Applications and Backends
Running of a particular Application on a given Backend is enabled by implementing an appropriate adapter component, or Runtime Handler
– Can often use the same Runtime Handler for several Backends: less coding (illustrated schematically below)
The slide shows a matrix of Applications vs. Backends, with each combination marked as implemented or work in progress:
– Applications (experiment neutral): Executable
– Applications (ATLAS): Athena (Simulation/Digitisation/Reconstruction/Analysis), AthenaMC (Production)
– Applications (LHCb): Gauss/Boole/Brunel/DaVinci (Simulation/Digitisation/Reconstruction/Analysis)
– Backends: Local, LSF, PBS, OSG, NorduGrid, PANDA (US-ATLAS WMS), LHCb WMS
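
The Runtime Handler idea can be illustrated with a toy sketch; this is not Ganga's actual internal API, only a schematic of why one adapter shared between several similar backends means less coding.

class EchoApp:
    """Toy 'application': just knows what it should run."""
    def __init__(self, message):
        self.message = message

class BatchRuntimeHandler:
    """One adapter reused for several batch-like backends (LSF, PBS, ...)."""
    def prepare(self, app, backend_name):
        # Translate the application into a backend-specific submission script
        return f"# submit to: {backend_name}\necho {app.message}\n"

# The same handler instance serves two backends
handlers = {"LSF": BatchRuntimeHandler(), "PBS": BatchRuntimeHandler()}

app = EchoApp("hello from a subjob")
for backend in ("LSF", "PBS"):
    print(handlers[backend].prepare(app, backend))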

22 Wouter Verkerke, NIKHEF Ganga: how the pieces fit together
Ganga has built-in support for ATLAS and LHCb; the component architecture allows customisation for other user groups
The original diagram shows how the pieces fit together:
– Applications: ATLAS applications, LHCb applications, other applications
– User interface for job definition and management, with a Ganga monitoring loop
– Ganga job archives: local repository and remote repository
– Processing systems (backends): local batch systems, distributed (Grid) systems, experiment-specific workload-management systems
– Tools for data management: file catalogues, metadata catalogues, data storage and retrieval

23 Wouter Verkerke, NIKHEF Ganga & Athena
Ganga has a Python frontend to manage job submission, organization, etc.:

j1 = Job( backend = LSF() )   # Create a new job for LSF
a1 = Executable()             # Create Executable application
j1.application = a1           # Set value for job's application
j1.backend = LCG()            # Change job's backend to LCG
export( j1, "myJob.py" )      # Write job to specified file
load( "myJob.py" )            # Load job(s) from specified file
j2 = j1.copy()                # Create j2 as a copy of job j1
jobs                          # List jobs
j.submit()                    # Submit the job
j.kill()                      # Kill the job (if running)
j.remove()                    # Kill the job and delete associated files
!ls j.outputdir               # List files in job's output directory
t = JobTemplate()             # Create template
templates                     # List templates
j3 = Job( templates[ i ] )    # Create job from template
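
Putting these GPI commands together for an ATLAS analysis job, a minimal sketch might look as follows. The Athena, DQ2Dataset, ATLASOutputDataset and AthenaSplitterJob class names are assumptions about the Ganga ATLAS plugins of that time and are not shown on these slides; the dataset, output file and split count are taken from the command-line example on the next slide.

# Sketch of an Athena analysis job on the grid via the Ganga GPI
# (assumes an interactive Ganga session with the ATLAS plugins loaded;
# class names and attributes below are assumptions, not taken from the slides)
j = Job()

j.application = Athena()                        # Athena-based analysis
j.application.option_file = ["AnalysisSkeleton_jobOptions.py"]
j.application.prepare()                         # package the user area

j.inputdata = DQ2Dataset()                      # input located through DQ2
j.inputdata.dataset = "csc11.005300.PythiaH130zz4l.recon.AOD.v11004103"

j.outputdata = ATLASOutputDataset()             # collect the ntuple output
j.outputdata.outputdata = ["AnalysisSkeleton.aan.root"]

j.splitter = AthenaSplitterJob()                # split into subjobs
j.splitter.numsubjobs = 2

j.backend = LCG()                               # run on the LCG grid
j.submit()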

24 Wouter Verkerke, NIKHEF Ganga & Athena
Also accessible from the command line. Once it is all set up, submitting a standard Athena job on the grid is supposed to be this simple:

ganga athena --inDS csc11.005300.PythiaH130zz4l.recon.AOD.v11004103 --outputdata AnalysisSkeleton.aan.root --split 2 --lcg ../share/AnalysisSkeleton_jobOptions.py

Most recent web tutorial: https://twiki.cern.ch/twiki/bin/view/Atlas/GangaTutorial427

