The Prototype Laurence Field IT/SDC 11 November 2014.

Slides:



Advertisements
Similar presentations
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
Advertisements

1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
Extending ForeFront beyond the limit TMGUAG ISAIAG AG Security Suite.
Volunteer Computing Laurence Field IT/SDC 21 November 2014.
XSEDE 13 July 24, Galaxy Team: PSC Team:
P-GRADE and WS-PGRADE portals supporting desktop grids and clouds Peter Kacsuk MTA SZTAKI
SWIM WEB PORTAL by Dipti Aswath SWIM Meeting ORNL Oct 15-17, 2007.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
Introduction to EMF Server Communication and Cases Beta Testing November 4, 2009.
Web-Enabling the Warehouse Chapter 16. Benefits of Web-Enabling a Data Warehouse Better-informed decision making Lower costs of deployment and management.
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
Catania Science Gateway Framework Motivations, architecture, features Catania, 09/06/2014Riccardo Rotondo
Pilots 2.0: DIRAC pilots for all the skies Federico Stagni, A.McNab, C.Luzzi, A.Tsaregorodtsev On behalf of the DIRAC consortium and the LHCb collaboration.
Apache Airavata GSOC Knowledge and Expertise Computational Resources Scientific Instruments Algorithms and Models Archived Data and Metadata Advanced.
MCSE Guide to Microsoft Exchange Server 2003 Administration Chapter Four Configuring Outlook and Outlook Web Access.
C Copyright © 2009, Oracle. All rights reserved. Appendix C: Service-Oriented Architectures.
1 port BOSS on Wenjing Wu (IHEP-CC)
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
The EDGeS project receives Community research funding 1 Specific security needs of Desktop Grids Desktop Grids Desktop Grids EDGeS project EDGeS project.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
SUSE Linux Enterprise Desktop Administration Chapter 12 Administer Printing.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
The Data Bridge Laurence Field IT/SDC 6 March 2015.
The EDGI project receives Community research funding 1 EDGI Brings Desktop Grids To Distributed Computing Interoperability Etienne URBAH
1 Overview of the Application Hosting Environment Stefan Zasada University College London.
Introduction to the BOINC software David P. Anderson Space Sciences Laboratory University of California, Berkeley.
Wenjing Wu Andrej Filipčič David Cameron Eric Lancon Claire Adam Bourdarios & others.
David Cameron Claire Adam Bourdarios Andrej Filipcic Eric Lancon Wenjing Wu ATLAS Computing Jamboree, 3 December 2014 Volunteer Computing.
Neil Witheridge APAN29 Sydney February 2010 ARCS Authorisation Services Neil Witheridge Manager, ARCS Authorisation Services APAN29, Sydney, February 2010.
Cloud Status Laurence Field IT/SDC 09/09/2014. Cloud Date Title 2 SaaS PaaS IaaS VMs on demand.
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Tarball server (for Condor installation) Site Headnode Worker Nodes Schedd glidein - special purpose Condor pool master DB Panda Server Pilot Factory -
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
Extending ISA/IAG beyond the limit. AGAT Security suite - introduction AGAT Security suite is a set of unique components that allow extending ISA / IAG.
1 The Adoption of Cloud Technology within the LHC Experiments Laurence Field IT/SDC 17/10/2014.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
HammerCloud Functional tests Valentina Mancinelli IT/SDC 28/2/2014.
Transformation System report Luisa Arrabito 1, Federico Stagni 2 1) LUPM CNRS/IN2P3, France 2) CERN 5 th DIRAC User Workshop 27 th – 29 th May 2015, Ferrara.
1 BOINC + CernVM Ben Segal / CERN (describing the work of many people at CERN and elsewhere ) Pre-GDB on Volunteer Computing CERN, November 11, 2014.
The Gateway Computational Web Portal Marlon Pierce Indiana University March 15, 2002.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
IT-SDC : Support for Distributed Computing Dynamic Federations: scalable, high performance Grid/Cloud storage federations Fabrizio Furano - Oliver Keeble.
MGRID Architecture Andy Adamson Center for Information Technology Integration University of Michigan, USA.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
1 Cloud Services Requirements and Challenges of Large International User Groups Laurence Field IT/SDC 2/12/2014.
10 March Andrey Grid Tools Working Prototype of Distributed Computing Infrastructure for Physics Analysis SUNY.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
Overview of the global architecture Giacinto DONVITO INFN-Bari.
1 Globe adapted from wikipedia/commons/f/fa/ Globe.svg IDGF-SP International Desktop Grid Federation - Support Project SZTAKI.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
Volunteer Clouds for the LHC experiments H. Riahi – 12/11/15 EGI User Forum Laurence Field Hassen Riahi CERN IT-SDC.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
Dynamic Federation of Grid and Cloud Storage Fabrizio Furano, Oliver Keeble, Laurence Field Speaker: Fabrizio Furano.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
The LGI Pilot job portal EGI Technical Forum 20 September 2011 Jan Just Keijser Willem van Engen Mark Somers.
Dynamic Storage Federation based on open protocols
Data Bridge Solving diverse data access in scientific applications
Dynafed, DPM and EGI DPM workshop 2016 Speaker: Fabrizio Furano
External Focus Dr Ivan D Reid Brunel University London 02/09/2016 Ivan D Reid.
StratusLab Final Periodic Review
StratusLab Final Periodic Review
AWS DevOps Engineer - Professional dumps.html Exam Code Exam Name.
2018 Amazon AWS DevOps Engineer Professional Dumps - DumpsProfessor
Get Amazon AWS-DevOps-Engineer-Professional Exam Real Questions - Amazon AWS-DevOps-Engineer-Professional Dumps Realexamdumps.com
Ivan Reid (Brunel University London/CMS)
Exploit the massive Volunteer Computing resource for HEP computation
Presentation transcript:

The Prototype Laurence Field IT/SDC 11 November 2014

Motivation  Summer Student Project  Investigate the feasibility  Develop a prototype  Profited from a clustering of expertise in SDC  CRAB3  Async stage-out  Messaging  FTS  And prior experience  Test4Theory Date Title 2

Test4Theory Model  Avoid restarting the VM for every job  Reduces CVMFS related network traffic  Separate VM management and job management  Inline with the cloud model  Can reuse cloud related tooling  CoPilot support challenges  Aging codebase  Dependencies not available in the standard repositories  New standardized components available for some functions Date Title 3 VM BOINC Server Volunteer Agent Job Wrapper Co Pilot Job Agent Storage Agent Job Description Data I/O

Async stage-out Initial Architecture 4 CRAB3 Messaging Service Messaging Service Data Bridge VM Volunteer Agent Job Wrapper Job Description FTS Grid Pull Push TaskWorker Plugin TaskWorker Plugin Data I/O

TaskWorker Plugin  Task Worker  Obtains a task request  Fetches work from the job queue  Generates the job specifications  Submits them to the remote job manager  HTCondor  Handler  Responsible for the actual workflow execution  New task => handleNewTask handler  Implements the operations called Actions  e.g. JobSpec creation and submission  May be backend specific  Specific Actions are required to support a new backend  The required actions need to be associate to the appropriate handlers  Work will be assigned to the right handler which implements the appropriate actions 5

Apache Plugin  How to authenticate BOINC users?  In the VM, credential provided via /dev/fd0  BOINC_USERNAME  BOINC_AUTHENTICATOR  Identity Provider (IDP)  BOINC Project DB  MySQL  User Table  mod_auth_mysql  Maps username/password to DB table  AuthMysqlUserTable user  AuthMySQLNameField name  AuthMySQLPasswordField authenticator  Enables reuse of apache-based HTTP technology 6

Message Queue  Messaging service does not support BOINC authentication  Not clear if it is possible or worthwhile to provide functionality  Standard apache Web server approach  mod_auth_mysql to validate BOINC user’s credential  mod_auth_ssl to validate CRAB3 server's x509 credential  Two simple cgi scripts  put-job.cgi  get-job.cgi  Simple file-based queue  python-dirq  Job descriptions from CRAB3  Supports arbitrary file types  Garbage in, Garbage out  Extensible 7 Web Server dirq put Get

The Data Bridge  Spans authentication domains  BOINC user’s credential  Grid x509 credentials  Scalable data I/O  With sandboxing capabilities  Data Isolation  Simple apache-based prototype  Supports HTTP PUT/GET  mod_auth_mysql to validate BOINC user’s credential  mod_auth_ssl to validate CRAB3 server's x509 credential  HTTP Federation  Possibility to reuse standard DM tools 8

Dynamic HTTP Federations  The Dynafed system implements federated storage over HTTP  In testing in LHCb and Canada (Atlas)  Federates WebDAV or S3 enabled storage systems  Apache front end  Can be used as a data bridge  S3 storage backend(s)  Acts as a security gateway, authenticating clients either via X509 or BOINC auth  Clients then redirected directly to the storage  Great scalability potential  Global system, smart replica selection (availability, proximity)  9

Apache The Data Bridge 10 ssl FTS S3 mysql CRAB3 BOINC User BOINC User PUT/GET HTTP redirect & sign PUT/GET Grid DynaFed

VM and Job Agent  CernVM3  Contextualized using CernVM Online  CVMFS configuration  Mount cms and grid  Add BOINC user  Credentials read from /dev/fd0  name, authenticator  CMSJobAgent.py workflow  Gets new job description  Downloads required input files  Runs job  Uploads output files  CMSJobAgent.sh  Checks if CMSJobAgent.py is running  If not starts it  cms-agent cron  Runs CMSJobAgent.sh every minute 11

Async stage-out Current Architecture 12 CRAB3 Message Queue Data Bridge VM Volunteer Agent Job Wrapper FTS Grid GET PUT TaskWorker Plugin TaskWorker Plugin

Extending the platform  The Data Bridge as a common component for BOINC projects  For VM-based approaches requiring external job injection  Data movement external to BOINC  Can support high data I/O requirements  BOINC & DataBridge Recipe  PUTjob description  PUT data  Create Job Agent  GET job description  GET input data  Provided by the job description  Run Job  PUT output  Read data from output bucket  Similar to HLT  Fitter bad data etc.  Could be used in the storage-less IaaS providers Date Title 13

Summary  Advanced prototype for  Following the proven Test4Theory model  Developed the concept of the Data Bridge  Reused HTTP federation component for S3  Added BOINC authentication  Added a simple message delivery function  For the job description  Provide an image along with a job agent  Extended CRAB3 to support his approach  Using a Task Work plugin  Towards a platform for volunteer computing Date Title 14

Acknowledgements  Hendrik Borras  Daniele Spiga  Hassen Riahi  Adrien Devres  Fabrizio Furano  Oliver Keeble Date Title 15