Rod Walker IC 13th March 2002 SAM-Grid Middleware  SAM.  JIM.  RunJob.  Conclusions. - Rod Walker,ICL.

Slides:



Advertisements
Similar presentations
GridPP July 2003Stefan StonjekSlide 1 SAM middleware components Stefan Stonjek University of Oxford 7 th GridPP Meeting 02 nd July 2003 Oxford.
Advertisements

Physics with SAM-Grid Stefan Stonjek University of Oxford 6 th GridPP Meeting 30 th January 2003 Coseners House.
SAM-Grid Status Core SAM development SAM-Grid architecture Progress Future work.
NorduGrid Grid Manager developed at NorduGrid project.
A Computation Management Agent for Multi-Institutional Grids
WP 1 Grid Workload Management Massimo Sgaravatto INFN Padova.
CMS HLT production using Grid tools Flavia Donno (INFN Pisa) Claudio Grandi (INFN Bologna) Ivano Lippi (INFN Padova) Francesco Prelz (INFN Milano) Andrea.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Meta-Computing at DØ Igor Terekhov, for the DØ Experiment Fermilab, Computing Division, PPDG ACAT 2002 Moscow, Russia June 28, 2002.
The Sam-Grid project Gabriele Garzoglio ODS, Computing Division, Fermilab PPDG, DOE SciDAC ACAT 2002, Moscow, Russia June 26, 2002.
Workload Management Massimo Sgaravatto INFN Padova.
JIM Deployment for the CDF Experiment M. Burgon-Lyon 1, A. Baranowski 2, V. Bartsch 3,S. Belforte 4, G. Garzoglio 2, R. Herber 2, R. Illingworth 2, R.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Grid Job, Information and Data Management for the Run II Experiments at FNAL Igor Terekhov et al (see next slide) FNAL/CD/CCF, D0, CDF, Condor team, UTA,
SAMGrid – A fully functional computing grid based on standard technologies Igor Terekhov for the JIM team FNAL/CD/CCF.
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
HEP Experiment Integration within GriPhyN/PPDG/iVDGL Rick Cavanaugh University of Florida DataTAG/WP4 Meeting 23 May, 2002.
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
3 Sept 2001F HARRIS CHEP, Beijing 1 Moving the LHCb Monte Carlo production system to the GRID D.Galli,U.Marconi,V.Vagnoni INFN Bologna N Brook Bristol.
Grid Job and Information Management (JIM) for D0 and CDF Gabriele Garzoglio for the JIM Team.
Building a distributed software environment for CDF within the ESLEA framework V. Bartsch, M. Lancaster University College London.
11 March 2004Getting Ready for the Grid SAM: Tevatron Experiments Using the Grid CDF and D0 Need the Grid –Requirements, the CAF and SAM –Grid from the.
CDF Grid Status Stefan Stonjek 05-Jul th GridPP meeting / Durham.
Deploying and Operating the SAM-Grid: lesson learned Gabriele Garzoglio for the SAM-Grid Team Sep 28, 2004.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
CHEP 2003Stefan Stonjek1 Physics with SAM-Grid Stefan Stonjek University of Oxford CHEP th March 2003 San Diego.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
1 DIRAC – LHCb MC production system A.Tsaregorodtsev, CPPM, Marseille For the LHCb Data Management team CHEP, La Jolla 25 March 2003.
SAMGrid as a Stakeholder of FermiGrid Valeria Bartsch Computing Division Fermilab.
SAM and D0 Grid Computing Igor Terekhov, FNAL/CD.
Grid Workload Management Massimo Sgaravatto INFN Padova.
- Distributed Analysis (07may02 - USA Grid SW BNL) Distributed Processing Craig E. Tull HCG/NERSC/LBNL (US) ATLAS Grid Software.
Instrumentation of the SAM-Grid Gabriele Garzoglio CSC 426 Research Proposal.
GridPP18 Glasgow Mar 07 DØ – SAMGrid Where’ve we come from, and where are we going? Evolution of a ‘long’ established plan Gavin Davies Imperial College.
Andrew McNabETF Firewall Meeting, NeSC, 5 Nov 2002Slide 1 Firewall issues for Globus 2 and EDG Andrew McNab High Energy Physics University of Manchester.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
The SAM-Grid and the use of Condor-G as a grid job management middleware Gabriele Garzoglio for the SAM-Grid Team Fermilab, Computing Division.
22 nd September 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Dzero MC production on LCG How to live in two worlds (SAM and LCG)
16 September GridPP 5 th Collaboration Meeting D0&CDF SAM and The Grid Act I: Grid, Sam and Run II Rick St. Denis – Glasgow University Act II: Sam4CDF.
4 March 2004GridPP 9th Collaboration Meeting SAMGrid:JIM and CDF Development CDF Accepts the Need for the Grid –Requirements How to Meet the Need –Status.
1 DØ Grid PP Plans – SAM, Grid, Ceiling Wax and Things Iain Bertram Lancaster University Monday 5 November 2001.
The Experiments – progress and status Roger Barlow GridPP7 Oxford 2 nd July 2003.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Data reprocessing for DZero on the SAM-Grid Gabriele Garzoglio for the SAM-Grid Team Fermilab, Computing Division.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
GridPP11 Liverpool Sept04 SAMGrid GridPP11 Liverpool Sept 2004 Gavin Davies Imperial College London.
19 February 2004SAMGrid Project Review SAMGrid: Future Plans CDF Accepts the Need for the Grid –Requirements D0 Relies on the Grid –Requirements How to.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
VO Privilege Activity. The VO Privilege Project develops and implements fine-grained authorization to grid- enabled resources and services Started Spring.
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
Andrew McNab - Manchester HEP - 17 September 2002 UK Testbed Deployment Aim of this talk is to the answer the questions: –“How much of the Testbed has.
UTA MC Production Farm & Grid Computing Activities Jae Yu UT Arlington DØRACE Workshop Feb. 12, 2002 UTA DØMC Farm MCFARM Job control and packaging software.
Interactive Data Analysis on the “Grid” Tech-X/SLAC/PPDG:CS-11 Balamurali Ananthan David Alexander
Eileen Berman. Condor in the Fermilab Grid FacilitiesApril 30, 2008  Fermi National Accelerator Laboratory is a high energy physics laboratory outside.
Grid Job, Information and Data Management for the Run II Experiments at FNAL Igor Terekhov et al FNAL/CD/CCF, D0, CDF, Condor team.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
April 25, 2006Parag Mhashilkar, Fermilab1 Resource Selection in OSG & SAM-On-The-Fly Parag Mhashilkar Fermi National Accelerator Laboratory Condor Week.
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
DØ Grid Computing Gavin Davies, Frédéric Villeneuve-Séguier Imperial College London On behalf of the DØ Collaboration and the SAMGrid team The 2007 Europhysics.
ICL Grid Status D0 Phase IV+ –Code development of fibre track tool for level 3 trigger. Ray Beuselinck –SAM Consuming site EU Datagrid –Unofficial testbed.
IC Status – 29/4/02 Condor BS adaptor –New interface handles JDF`s (condor,fbs,condorG) Multi-process consumers. –In progress (crashes smaster on jobSubmitted())
BOSS: the CMS interface for job summission, monitoring and bookkeeping
BOSS: the CMS interface for job summission, monitoring and bookkeeping
Wide Area Workload Management Work Package DATAGRID project
The DZero/PPDG D0/PPDG mission is to enable fully distributed computing for the experiment, by enhancing SAM as the distributed data handling system of.
Presentation transcript:

Rod Walker IC 13th March 2002 SAM-Grid Middleware  SAM.  JIM.  RunJob.  Conclusions. - Rod Walker,ICL.

Rod Walker IC 13th March 2002

SAM stands for “Sequential Access to Data via Metadata”. Sequential access within files – order of files isn’t important, e.g. HEP data. History of SAM Project started in 1997 by FNAL Computing Division(not just physicists). Meant for FNAL experiments, and recently taken up by CDF. So far ~20 FTE years – a lot of effort. State of the art in Data Management No-one else has tried to deliver TB’s of user selected data on demand.

Rod Walker IC 13th March 2002 Global file routing Many remote stations want files –SAM allowed free-for-all to gridftp server. –MSS access only from FNAL site, cache on private network,... Needed control and routing Solution: All sites can route files, eg. –Get fnal files from fnal-router –route=fnal.gov::nijmegen and nijmegen station has route=fnal.gov::fnal-router Janet - Geant – Esnet – FNAL, 155Mbit bottleneck. Janet - Geant – Surfnet – FNAL, Gbit(?)

Rod Walker IC 13th March 2002 SAM Status Middleware Development Global routing. Diverse deployments, e.g. private network, firewall, shared vs local disk cache. CDF deployment – GridPP Bug fixes. GridFTP and Authentication – GridPP Outlook Decreasing development. FNAL CD support for RunII

Rod Walker IC 13th March 2002

JIM history Purpose: to build on SAM’s data handling, to create a real grid. Job definition & management Information & Monitoring Novel concepts Already have DH system. ups/upd packaging and deployment. rpm functionality plus multi-platform, tailoring. little dependence on native installation, e.g.python v2.1f hugely simplified deployment. Use Condor as resource broker.

Rod Walker IC 13th March 2002 JIM components User Interface Job Definition language based on classadds RB reduced to making MMS ranking function Static & dynamic constraints:os,code version,freecpu,… Plus external function to query DH system. Collaboration with Wisconsin. Choose gatekeeper, use external function, separate submission server from negotiator.

Rod Walker IC 13th March 2002

JIM components Information & Monitoring. Currently: grid sensors > ldap > MDS > PHP Developing: grid sensors > xml > native Db > PHP, other. Reliability, flexibility, persistency. Same model works for grid system book-keeping and user level monitoring.

Rod Walker IC 13th March 2002 Information Flow User Interfac e Condor-G Information And Monitoring Gatekeeper Batch Syestem Grid Sensors Compute Resource GRAM Condor Negotiator Condor Collector Condor Grid Manager External Code Execution Site Parser JDL ClassAd Cin Cout User Interfac e Parser Condor Schedd Condor Schedd Condor Schedd Condor Collector Condor Collector Grid Sensors Condor Negotiator Condor Negotiator External Code Condor Grid Manager Condor Grid Manager Gatekeeper Batch Syestem Compute Resource

Rod Walker IC 13th March 2002 RunJob Vital tool for d0 MC productions on farms. Chains, steers and parallelizes d0 executables. Creates metadata. Use SAM to store to MSS. Now interfaced to SAM for input, and can handle real data and any d0 executables. Will be used for skimming, re-processing datasets, and user analysis. Fully automate monitoring, checking and storage. Work underway by UK.

Rod Walker IC 13th March 2002 RunJob status Maintenance & development of RunJob, and interface to SAM-Grid entirely by UK. CMS using branch of RunJob for production. Dave Evans and Greg Graham collaborating on merging branches. Goal: Single package with EDG and SAM-Grid interfaces. Runjob “server” or job-manager.

Rod Walker IC 13th March 2002 SAM-Grid Logistics Site Resource Selector Info Collector Info Gatherer Match Making User Interface Submission Global Job Queue Grid Client Submission User Interface Global DH Services SAM Naming Server SAM Log Server Resource Optimizer SAM DB Server RCMetaData Catalog Bookkeeping Service SAM Stager(s) SAM Station (+other servs) Data Handling Worker Nodes Grid Gateway Local Job Handler (CAF,RunJob,Vanilla,...) JIM Advertise Local Job Handling Cluster AAA Dist.FS Info Manager XML DB server Site Conf. Glob/Loc JID map... Info Providers MDS MSS Cache Site Web Serv Grid Monitoring User Tools

Rod Walker IC 13th March 2002 Conclusions o Core SAM supported by FNAL CD o Operational support via software shifts. o UK currently contributes 2 experts on shift. o JIM post-development support, o bug fixing, deployment issues (like SAM). o will need software support shifts. o RunJob is and will be UK supported. o Expanding functionality – analysis,reprocessing. o Increasing deployment – d0 sites, CMS. o On target for end-March deliverable, and production Grid in April.

Rod Walker IC 13th March 2002 JIM V1: Package dependencies jim_broker_client xml_meta_configurator sam_common jim_info_providers jim_broker orbacus sam_config globus jim_www server_run jim_advertise galax samgrid jim_client jim_jobmanagersjim_sandbox