Volunteer Computing Laurence Field IT/SDC 21 November 2014.

Slides:



Advertisements
Similar presentations
1 G2 and ActiveSheets Paul Roe QUT Yes Australia!
Advertisements

1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
SWIM WEB PORTAL by Dipti Aswath SWIM Meeting ORNL Oct 15-17, 2007.
How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.
The Prototype Laurence Field IT/SDC 11 November 2014.
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
The Client/Server Database Environment
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
Pilots 2.0: DIRAC pilots for all the skies Federico Stagni, A.McNab, C.Luzzi, A.Tsaregorodtsev On behalf of the DIRAC consortium and the LHCb collaboration.
Public-resource computing for CEPC Simulation Wenxiao Kan Computing Center/Institute of High Physics Energy Chinese Academic of Science CEPC2014 Scientific.
Ian Fisk and Maria Girone Improvements in the CMS Computing System from Run2 CHEP 2015 Ian Fisk and Maria Girone For CMS Collaboration.
1 port BOSS on Wenjing Wu (IHEP-CC)
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
The EDGeS project receives Community research funding 1 Specific security needs of Desktop Grids Desktop Grids Desktop Grids EDGeS project EDGeS project.
jpasswd A common password change client for Unix and NT Marty Wise Jefferson Lab October, 2000.
CERN IT Department CH-1211 Geneva 23 Switzerland t The Experiment Dashboard ISGC th April 2008 Pablo Saiz, Julia Andreeva, Benjamin.
SUSE Linux Enterprise Desktop Administration Chapter 12 Administer Printing.
The Data Bridge Laurence Field IT/SDC 6 March 2015.
Online Translation Service Capstone Design Eunyoung Ku Jason Roberts Jennifer Pitts Gregory Woodburn Kim Tran.
Volunteer Computing 2 Overview Volunteer Computing BOINC Volunteer Computing For HEP Virtualization Volunteer Towards a Common Platform.
1 Overview of the Application Hosting Environment Stefan Zasada University College London.
Wenjing Wu Andrej Filipčič David Cameron Eric Lancon Claire Adam Bourdarios & others.
Wenjing Wu Computer Center, Institute of High Energy Physics Chinese Academy of Sciences, Beijing BOINC workshop 2013.
David Cameron Claire Adam Bourdarios Andrej Filipcic Eric Lancon Wenjing Wu ATLAS Computing Jamboree, 3 December 2014 Volunteer Computing.
David Cameron Riccardo Bianchi Claire Adam Bourdarios Andrej Filipcic Eric Lançon Efrat Tal Hod Wenjing Wu on behalf of the ATLAS Collaboration CHEP 15,
Neil Witheridge APAN29 Sydney February 2010 ARCS Authorisation Services Neil Witheridge Manager, ARCS Authorisation Services APAN29, Sydney, February 2010.
Cloud Status Laurence Field IT/SDC 09/09/2014. Cloud Date Title 2 SaaS PaaS IaaS VMs on demand.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
Extending ISA/IAG beyond the limit. AGAT Security suite - introduction AGAT Security suite is a set of unique components that allow extending ISA / IAG.
1 The Adoption of Cloud Technology within the LHC Experiments Laurence Field IT/SDC 17/10/2014.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
WebFTS File Transfer Web Interface for FTS3 Andrea Manzi On behalf of the FTS team Workshop on Cloud Services for File Synchronisation and Sharing.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
WEB SERVER SOFTWARE FEATURE SETS
1 Volunteer Computing at CERN past, present and future Ben Segal / CERN (describing the work of many people at CERN and elsewhere ) White Area lecture.
1 BOINC + CernVM Ben Segal / CERN (describing the work of many people at CERN and elsewhere ) Pre-GDB on Volunteer Computing CERN, November 11, 2014.
IT-SDC : Support for Distributed Computing Dynamic Federations: scalable, high performance Grid/Cloud storage federations Fabrizio Furano - Oliver Keeble.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
1 Cloud Services Requirements and Challenges of Large International User Groups Laurence Field IT/SDC 2/12/2014.
– Past, Present, Future Volunteer Computing at CERN Helge Meinhard, Nils Høimyr / CERN for the CERN BOINC service team H. Meinhard et al. - Volunteer.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
IT-SDC : Support for Distributed Computing Dynafed FTS3 Human Brain Project use cases Fabrizio Furano Alejandro Alvarez.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
1 Globe adapted from wikipedia/commons/f/fa/ Globe.svg IDGF-SP International Desktop Grid Federation - Support Project SZTAKI.
Core and Framework DIRAC Workshop October Marseille.
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
ARC-CE: updates and plans Oxana Smirnova, NeIC/Lund University 1 July 2014 Grid 2014, Dubna using input from: D. Cameron, A. Filipčič, J. Kerr Nilsen,
Volunteer Clouds for the LHC experiments H. Riahi – 12/11/15 EGI User Forum Laurence Field Hassen Riahi CERN IT-SDC.
Dynamic Federation of Grid and Cloud Storage Fabrizio Furano, Oliver Keeble, Laurence Field Speaker: Fabrizio Furano.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
The LGI Pilot job portal EGI Technical Forum 20 September 2011 Jan Just Keijser Willem van Engen Mark Somers.
GPIR GridPort Information Repository
The Client/Server Database Environment
Dynafed, DPM and EGI DPM workshop 2016 Speaker: Fabrizio Furano
External Focus Dr Ivan D Reid Brunel University London 02/09/2016 Ivan D Reid.
StratusLab Final Periodic Review
StratusLab Final Periodic Review
DCache things Paul Millar … on behalf of the dCache team.
AWS DevOps Engineer - Professional dumps.html Exam Code Exam Name.
2018 Amazon AWS DevOps Engineer Professional Dumps - DumpsProfessor
Get Amazon AWS-DevOps-Engineer-Professional Exam Real Questions - Amazon AWS-DevOps-Engineer-Professional Dumps Realexamdumps.com
Dynamic DNS support for EGI Federated cloud
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
Ivan Reid (Brunel University London/CMS)
Online Translation Service Capstone Design
Exploit the massive Volunteer Computing resource for HEP computation
Presentation transcript:

Volunteer Computing Laurence Field IT/SDC 21 November 2014

Overview Of Existing Activities   The umbrella for our volunteer computing actives  SIXTRACK  Original classic BOINC project  Accelerator beam simulations  10 years old  11K users, 18K hosts with recent credit  Test4Theory  Pioneered the VM approach  Runs HEP simulations  In production since 2011  2K users, 3K hosts with recent credit 2

Overview Of Existing Activities   Prototype running in Beijing Jan 2014  Moved activity to CERN in July 2014  Open but not advertised  1K users, 1.2K hosts with recent credit (2K running jobs)   LHCb’s prototype  Since June 2013  CVMFS and Security Issues  Internal testing/use   Started August 2014  Advanced prototype 3

Test4Theory Model  Avoid restarting the VM for every job  Reduces CVMFS related network traffic  Separate VM management and job management  Inline with the cloud model  Can reuse cloud related tooling  CoPilot support challenges  Aging codebase  Dependencies not available in the standard repositories  New standardized components available for some functions 4 VM BOINC Server Volunteer Agent Job Wrapper Co Pilot Job Agent Storage Agent Job Description Data I/O

The Challenge 5 CRAB3 VM Volunteer Agent Job Wrapper

Async stage-out The Initial Architecture 6 CRAB3 Messaging Service Messaging Service Data Bridge VM Volunteer Agent Job Wrapper Job Description FTS Grid Pull Push TaskWorker Plugin TaskWorker Plugin Data I/O

Authentication  How to authenticate BOINC users?  In the VM, credential provided via /dev/fd0  BOINC_ID  BOINC_AUTHENTICATOR  Identity Provider (IDP)  BOINC Project DB  MySQL  User Table  mod_auth_mysql  Maps username/password to DB table  AuthMysqlUserTable user  AuthMySQLNameField id  AuthMySQLPasswordField authenticator  Enables reuse of apache-based HTTP technology 7

Message Queue  Messaging service does not support BOINC authentication  Not clear if it is possible or worthwhile to provide functionality  Standard apache Web server approach  mod_auth_mysql to validate BOINC user’s credential  mod_auth_ssl to validate CRAB3 server's x509 credential  Two simple cgi scripts  put-job.cgi  get-job.cgi  Simple file-based queue  python-dirq  Job descriptions from CRAB3  Supports arbitrary file types  Garbage in, Garbage out  Extensible 8 Web Server dirq put Get

The Data Bridge  Spans authentication domains  BOINC user’s credential  Grid x509 credentials  Scalable data I/O  With sandboxing capabilities  Data Isolation  Simple apache-based prototype  Supports HTTP PUT/GET  mod_auth_mysql to validate BOINC user’s credential  mod_auth_ssl to validate CRAB3 server's x509 credential  HTTP Federation  Possibility to reuse standard DM tools 9

Dynamic HTTP Federations  The Dynafed implements federated storage over HTTP  In testing in LHCb and Canada (ATLAS)  Federates WebDAV or S3 enabled storage systems  Apache front end  Can be used as a data bridge  S3 storage backend(s)  Acts as a security gateway between X509 or BOINC Auth  Clients then redirected directly to the storage  Great scalability potential  Global system, smart replica selection (availability, proximity)  10

Apache The Data Bridge 11 ssl FTS S3 mysql CRAB3 BOINC User BOINC User PUT/GET HTTP redirect & sign PUT/GET Grid DynaFed

VM and Job Agent  CernVM3  Contextualized using CernVM Online  CVMFS configuration  Mount cms and grid  Add BOINC user  Credentials read from /dev/fd0  id, authenticator  CMSJobAgent.py workflow  Gets new job description  Downloads required input files  Runs job  Uploads output files  CMSJobAgent.sh  Checks if CMSJobAgent.py is running  If not starts it  cms-agent cron  Runs CMSJobAgent.sh every minute 12

Async stage-out Final Architecture 13 CRAB3 Message Queue Data Bridge VM Volunteer Agent Job Wrapper FTS Grid GET PUT TaskWorker Plugin TaskWorker Plugin

 Build upon and the data bridge  Require:  Stage input data to the data bridge  if needed for simulation jobs  Job Agent  Method to get job description  GET input from the data bridge  Run job  PUT output on the data bridge  Read data from the output bucket  Similar to HLT  Fitter bad data etc. 14

Summary  Many ongoing volunteer computing activities  Most following the proven Test4Theory model  Data Bridge concept recently developed  Reuses HTTP federation component for S3  Added BOINC authentication  Added a simple message delivery function  For the job description  Working towards a common platform  Only require a custom job agent  Job description  Data stage-in, stage-out 15

Links   