INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org Scenarios for Integrating Data and Job Scheduling Peter Kunszt On behalf of the JRA1-DM Cluster,

Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE
Scenarios for Integrating Data and Job Scheduling
Peter Kunszt, on behalf of the JRA1-DM Cluster, CERN
JRA1 All Hands Meeting, Brno, Czech Republic, June 20-22, 2005

How to spend these 45 minutes
Short presentation
  – Background information on existing services and capabilities
  – Motivation for DM/WMS integration
  – Some options for how to actually do it
The rest of the time may be spent discussing the way forward and deciding our focus for the near future and mid-term

Data Movement and Scheduling
gLite DM Cluster:
Dedicated high-level component: Data Scheduler
  – Not in the plan for Release 2!!!
  – Some FTS bugs can only be addressed by this component
  – Risk: once people understand what it's about, it may creep back into the plan (wouldn't be the first time)
Low-level site-based components: FPS (File Placement Service) and FTS (File Transfer Service)
Others in EGEE: Condor Stork, Globus Reliable File Transfer (RFT)
Externals: P2P systems (BitTorrent and friends)

Why do we have our own at all?
None of the available services has fulfilled our requirements, especially for
  – Security
  – Channel management
  – Extensibility

FTS Capabilities
Transfer a file or a list of files, given by their SURLs or TURLs (srm or gsiftp protocol), between two sites
Manage the whole transfer as a single job
Run the job through a set of states
Apply site policies (extension)
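The idea of one job that covers a list of files and runs through a set of states can be sketched as a small state machine. This is an illustration only: the state names, the `TransferJob` class, and the `copy` callback are assumptions made for the example, not the actual FTS states or API.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, List, Tuple


class State(Enum):
    # Hypothetical state names; the slides do not list the real FTS states.
    SUBMITTED = "Submitted"
    ACTIVE = "Active"
    DONE = "Done"
    FAILED = "Failed"


@dataclass
class TransferJob:
    """A list of (source, destination) URL pairs managed as one job."""
    files: List[Tuple[str, str]]
    state: State = State.SUBMITTED
    history: List[State] = field(default_factory=lambda: [State.SUBMITTED])

    def _move_to(self, new_state: State) -> None:
        # Record every transition so the job's progress can be inspected.
        self.state = new_state
        self.history.append(new_state)

    def run(self, copy: Callable[[str, str], bool]) -> State:
        """Copy every file; the job is DONE only if all copies succeed."""
        self._move_to(State.ACTIVE)
        results = [copy(src, dst) for src, dst in self.files]
        self._move_to(State.DONE if all(results) else State.FAILED)
        return self.state
```

A caller would pass in whatever performs the real copy (for example a wrapper around an srm or gsiftp client) as `copy`; the sketch only models the bookkeeping around it.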

FPS Capabilities
The FPS is an FTS with some extra configuration
Also accepts jobs specified with LFNs and GUIDs
Resolves LFNs and GUIDs through a catalog
Registers the replica at the end of a successful transfer
Biggest usage difference between FTS and FPS today (also a matter of configuration):
  – FTS transfers the files using the user proxy
  – FPS transfers the files using the service proxy (should be dual proxy!!), since access is enforced through the catalogs
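The extra steps the FPS adds around a plain transfer (resolve the logical name, transfer, register the new replica) can be sketched as follows. Everything here is hypothetical: the catalog is modelled as an in-memory dictionary, and `place_file`, the `transfer` callback, and the destination URL layout are assumptions for illustration, not the gLite catalog or FPS interface.

```python
from typing import Callable, Dict, List, Optional


def place_file(
    lfn: str,
    dest_site: str,
    catalog: Dict[str, List[str]],
    transfer: Callable[[str, str], bool],
) -> Optional[str]:
    """Resolve an LFN, copy one replica to dest_site, register the new copy.

    `catalog` maps an LFN to its known replica URLs (assumed layout).
    Returns the new replica URL, or None if the transfer failed.
    """
    replicas = catalog.get(lfn)
    if not replicas:
        raise KeyError(f"no replica registered for {lfn}")
    source = replicas[0]  # simplest choice: take the first known replica
    # Assumed destination naming scheme, for illustration only.
    dest = f"srm://{dest_site}/{lfn.lstrip('/')}"
    if not transfer(source, dest):
        return None  # nothing is registered after a failed transfer
    replicas.append(dest)  # register only once the copy has succeeded
    return dest
```

Registering the replica only after a successful copy mirrors the order given on the slide; a failed transfer leaves the catalog untouched.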

Stork Capabilities
Performs managed transfers
  – No multi-file transfer yet, but promised
Job described as a ClassAd
Integrated with DAGs
Protocol translation possible (in-memory modules; each transfer is a running process at the Stork node)
Security model needs more work (promised)
No channel management, poor extensibility (cataloguing); discussions are ongoing
Usage:
  – Beneath FTS: use Stork instead of srm/gsi copy
  – Over FTS: write a Stork plugin that submits to the FPS to do the transfer. This would take care of a lot of issues, except for security.

Globus RFT Capabilities
Provides managed transfers between two servers
The job is managed: retries are possible and server deaths are taken care of
Can support splicing and multi-target transfers
No channel concept; channels would need to be set up implicitly (i.e. servers per channel)
Callbacks for security hooks exist
Name resolution hooks may exist; this needs to be looked at in detail
Usage of RFT:
  – Beneath FTS: trivial, just submit to RFT instead of calling globus-url-copy
  – Over FTS: not possible

What do we want?
Seamless integration of data and CPU jobs from the user's point of view
Re-use of the infrastructure where possible
  – Especially the common security infrastructure for proxy management
JDL for transfer jobs
Policies for data placement through WMS
Mixed DAGs: a transfer may be a DAG node
Transfer job optimization (policy-based?)

How? Options
We resurrect the Data Scheduler
  – Implements the JDL interface
  – Extensible
  – WMS just hands over transfer jobs to the proper DS
  – DAG integration in DAGMan is a question mark: can you do callouts?
WMS does it all
  – Manages the transfer jobs it gets through the JDL/DAG and translates them into the proper FTS/FPS.submit() calls
  – Either WMS monitors FTS/FPS, or FTS puts state into L&B
  – Or other notification mechanisms
Stork/Condor does it all
  – WMS just submits the whole DAG as is to Condor
  – Stork works its magic and might call the FPS/FTS to actually perform the transfers
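What all three options have in common is a DAG in which some nodes are transfers and some are compute jobs, with each node handed to the right service. A minimal sketch of that dispatch, under stated assumptions: `run_mixed_dag` and the two submit callbacks are hypothetical names, the node kinds are plain strings, and nodes are submitted one after another in dependency order without waiting for completion, which a real scheduler would have to do.

```python
from graphlib import TopologicalSorter
from typing import Callable, Dict, List, Set


def run_mixed_dag(
    kinds: Dict[str, str],
    deps: Dict[str, Set[str]],
    submit_transfer: Callable[[str], None],
    submit_compute: Callable[[str], None],
) -> List[str]:
    """Dispatch each DAG node in dependency order; return the order used.

    `kinds` maps a node name to "transfer" or "compute".
    `deps` maps a node name to the set of nodes it depends on.
    """
    order = list(TopologicalSorter(deps).static_order())
    for node in order:
        kind = kinds[node]
        if kind == "transfer":
            submit_transfer(node)  # e.g. hand over to a transfer service
        elif kind == "compute":
            submit_compute(node)   # e.g. hand over to the workload manager
        else:
            raise ValueError(f"unknown node kind {kind!r} for {node}")
    return order
```

For a stage-in, analyse, stage-out chain, the two transfer nodes go to `submit_transfer` and the analysis node to `submit_compute`. Which component owns this loop (a Data Scheduler, the WMS, or Stork/Condor) is exactly the question the options above leave open.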

Discussion
More options?
Every option has pros and cons
We should decide today which path to go down.