BaBar MC Production on the Canadian Grid using a Web Services Approach

Presentation transcript:

Slide 1: BaBar MC Production on the Canadian Grid using a Web Services Approach
Ashok Agarwal, Ron Desmarais, Ian Gable, Sergey Popov, Sydney Schaffer, Cameron Sobie, Randall Sobie, Tristan Sulivan, Daniel Vanderster
University of Victoria

Slide 2: Overview of GridX1 – A GT2 Grid
– Uses Canadian resources: calliope, mercury, mcgill
– Clusters: standard Globus Toolkit 2 (GT2)
– Resource management: Condor-G-based metascheduler (MS)
– Condor brokering
– MyProxy credential repository
– Central monitoring and accounting with a web GUI

Slide 3: Limitations of GridX1
The limitations are due to numerous service-specific protocols, e.g. GRAM, MyProxy, Condor.
1. Difficult to extend: adding a new service involves modifying a protocol or developing a new one.
2. Compatibility issues: protocols lack standardization, and backwards compatibility is not perfect (protocols are modified between releases).
3. Firewall problems: each service uses its own TCP port, so many ports must be opened by each institution, which may conflict with local policies.
4. Security vulnerabilities: the GRAM job service runs as root, which could lead to a compromised resource. Access is often limited to trusted hosts, limiting the usefulness of the service.
Solution: Web Services Resource Framework (WSRF), Globus Toolkit v.4

Slide 4: Globus Toolkit v.4 Advantages
WSRF solves the 4 key problems with GridX1:
1. Easy to extend: one common protocol (SOAP); easy to develop new WSRF services.
2. Seamless upgrade support: changes to service interfaces are described in WSDL.
3. Reduced firewall problems: fewer ports (the service container); non-privileged ports.
4. Good security: the service container runs as a non-privileged user.

Slide 5: WSRF-Based Grid
– Consists of multiple metaschedulers
– Central resource registry to store the resource attributes, RFT and LRMS
– Having multiple metaschedulers and registries gives the grid high scalability and reliability
Figure: Proposed WSRF-based grid

Slide 6: Important Features
– Condor-G used as the metascheduler
– Automatic registration of resource ClassAds with the central registry
– Automatic extraction of ClassAds from the registry to the metascheduler for matchmaking
– Incorporation of input/output file staging
– Job submission using WS-GRAM or condor_submit
– Web-based monitoring
Figure: Metascheduler Service
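The matchmaking step listed above can be illustrated with a toy sketch. It is not Condor's matchmaking algorithm: the site names, the attribute names (FreeCpus, HasBaBarSoftware) and the helper functions are all hypothetical, and the values are made up. The sketch only shows the general idea of filtering resource ads by a job's requirements and then ordering the survivors by a rank expression.

```python
# Toy matchmaking sketch. All names and values below are hypothetical.

def matches(job, resource):
    """Return True when the resource satisfies every requirement of the job."""
    return all(check(resource) for check in job["requirements"])


def rank_resources(job, resources):
    """Keep only matching resources and sort them by the job's rank, best first."""
    candidates = [r for r in resources if matches(job, r)]
    return sorted(candidates, key=job["rank"], reverse=True)


# Made-up resource ads, standing in for ClassAds pulled from a registry.
resources = [
    {"Name": "site-a", "FreeCpus": 8, "HasBaBarSoftware": True},
    {"Name": "site-b", "FreeCpus": 0, "HasBaBarSoftware": True},
    {"Name": "site-c", "FreeCpus": 20, "HasBaBarSoftware": True},
]

# A job that needs at least one free CPU and the experiment software,
# and prefers the resource with the most free CPUs.
job = {
    "requirements": [
        lambda r: r["FreeCpus"] > 0,
        lambda r: r["HasBaBarSoftware"],
    ],
    "rank": lambda r: r["FreeCpus"],
}

best = rank_resources(job, resources)
best_names = [r["Name"] for r in best]
```

With these made-up ads, site-b is filtered out because it has no free CPUs, and the remaining sites are ordered by free CPU count.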

Slide 7: Publishing Tool
– An information provider script runs on every Grid resource and generates the resource ClassAd in the GLUE 1.2 schema
– Inserts the resource information, in the form of XML-formatted Condor ClassAds, into the local WS-MDS
– Publishes the resource information into the WS-MDS of the central registry
Figure: ClassAd Publishing Tool
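As a rough illustration of the first step above, the following sketch serializes a few resource attributes as an XML ClassAd. It is an assumption-laden example, not the information provider script from the talk: the element layout (classads, c, a, and typed value elements) is modeled on Condor's XML ClassAd style, the attribute names are taken as representative GLUE-style names, the host name and numbers are made up, and the insertion into WS-MDS is not shown.

```python
import xml.etree.ElementTree as ET


def resource_to_xml(attrs):
    """Serialize a flat dict of resource attributes as an XML ClassAd-style document.

    The element names are an assumption modeled on Condor's XML ClassAd style.
    """
    root = ET.Element("classads")
    ad = ET.SubElement(root, "c")
    for name, value in attrs.items():
        attr = ET.SubElement(ad, "a", {"n": name})
        # bool must be tested before int because bool is a subclass of int.
        if isinstance(value, bool):
            ET.SubElement(attr, "b", {"v": "t" if value else "f"})
        elif isinstance(value, int):
            ET.SubElement(attr, "i").text = str(value)
        elif isinstance(value, float):
            ET.SubElement(attr, "r").text = repr(value)
        else:
            ET.SubElement(attr, "s").text = str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical resource description; the values are made up for illustration.
example = {
    "GlueCEUniqueID": "pbs.example.org",
    "GlueCEInfoTotalCPUs": 64,
    "GlueCEStateFreeCPUs": 12,
}
xml_text = resource_to_xml(example)
```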

Slide 8: ClassAd Extractor
– Java application
– Runs periodically on the metascheduler
– Extracts compute resource ClassAds in the GLUE 1.2 schema
– Converts the XML data into Condor ClassAds for each resource
– Publishes these ClassAds to the Condor collector for job matchmaking
Figure: MdsClassAdExtractor Tool
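The conversion step above can be sketched as follows. The tool described in the talk is a Java application; this Python sketch is only an illustration of turning XML into "Name = value" ClassAd text. The XML layout, the sample attribute names and the sample values are assumptions, and both the query against the registry and the publishing to the Condor collector are omitted.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML input, standing in for data retrieved from the registry.
SAMPLE_XML = """<classads><c>
  <a n="GlueCEUniqueID"><s>pbs.example.org</s></a>
  <a n="GlueCEStateFreeCPUs"><i>12</i></a>
</c></classads>"""


def xml_to_classads(xml_text):
    """Convert each <c> element into a block of 'Name = value' ClassAd lines."""
    ads = []
    for ad in ET.fromstring(xml_text).iter("c"):
        lines = []
        for attr in ad.findall("a"):
            value = attr[0]  # the single typed value element
            if value.tag == "s":
                # Strings are quoted; backslashes and quotes are escaped.
                text = (value.text or "").replace("\\", "\\\\").replace('"', '\\"')
                rendered = f'"{text}"'
            elif value.tag == "b":
                rendered = "TRUE" if value.get("v") == "t" else "FALSE"
            else:
                # Integers, reals and expressions are emitted as written.
                rendered = (value.text or "").strip()
            lines.append(f'{attr.get("n")} = {rendered}')
        ads.append("\n".join(lines))
    return ads


classads = xml_to_classads(SAMPLE_XML)
```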

Slide 9: BaBar MC Grid Requirements
Metascheduler head node
– Install Condor-G and GT4
– Install BaBar software
– Set up metascheduler
– Set up ClassAd extraction tool
Grid resources head node
– Set up Portable Batch System (PBS) to act as the local resource management system
– Install BaBar software
– Set up classad.pm to advertise the local resource information
– Set up ClassAd publishing tool

Slide 10: BaBar MC Production Setup Using Resource Broker

Slide 11: Performance Test
% Eff = CPU Time / Wall Time
Comparison of CPU time and percent efficiency (% Eff)
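The efficiency definition above can be written as a small helper. The function name is hypothetical, and the factor of 100 is an assumption based on the quantity being labeled a percentage; the slide itself gives only the ratio.

```python
def percent_efficiency(cpu_seconds, wall_seconds):
    """Return CPU time as a percentage of wall-clock time."""
    if wall_seconds <= 0:
        raise ValueError("wall time must be positive")
    return 100.0 * cpu_seconds / wall_seconds
```

For example, a job that used 90 seconds of CPU time over 100 seconds of wall time would come out at 90 percent.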

Slide 12: JSAM: Job Submission Tool

Slide 13: GridX1 Condor-G Monitoring

Slide 14: WSRF-Based Grid Monitoring

Slide 15: Conclusion
With WSRF (GT4), we have developed:
– A metascheduling service using Condor-G
– A resource information provider
– An automatic ClassAd extraction tool
– A job submission client tool
Execution of BaBar jobs is successful on the GT4 grid.
Web-based monitoring is useful for providing the status of grid resources and jobs.
Monitoring is based on condor_history. Work is in progress to improve monitoring using condor_quill.
Production will start soon on this WSRF-based grid.