Www.egi.eu EGI-InSPIRE RI-261323 EGI-InSPIRE www.egi.eu EGI-InSPIRE RI-261323 Global Accounting in the Grid and Cloud John Gordon, STFC HEPiX, Beijing.

Slides:



Advertisements
Similar presentations
Accounting Update Dave Kant Grid Deployment Board Nov 2007.
Advertisements

Storage Accounting John Gordon, STFC GDB June 2012.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GPGPU Accounting John Gordon STFC 09/04/2013 EGI CF – Accounting and Billing1.
Summary of Accounting Discussion at the GDB in Bologna Dave Kant CCLRC, e-Science Centre.
Monitoring the Grid at local, national, and Global levels Pete Gronbech GridPP Project Manager ACAT - Brunel Sept 2011.
Multicore Accounting John Gordon, STFC-RAL WLCG MB, July 2015.
GDB March User-Level, VOMS Groups and Roles Dave Kant CCLRC, e-Science Centre.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
Storage Accounting John Gordon, STFC GDB March 2013.
EMI INFSO-RI Accounting John Gordon (STFC) APEL PT Leader.
XROOTD AND FEDERATED STORAGE MONITORING CURRENT STATUS AND ISSUES A.Petrosyan, D.Oleynik, J.Andreeva Creating federated data stores for the LHC CC-IN2P3,
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
Accounting Update John Gordon and Stuart Pullinger January 2014 GDB.
Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
APEL Cloud Accounting Status and Plans APEL Team John Gordon.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Accounting Old and New Requirements John Gordon Revised 22/3/12.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
Accounting For Multicore Jobs John Gordon, STFC, UK Scientific Computing Department, APEL Team MB 17 th March 2015.
LCG WLCG Accounting: Update, Issues, and Plans John Gordon RAL Management Board, 19 December 2006.
LCG Accounting Update John Gordon, CCLRC-RAL WLCG Workshop, CERN 24/1/2007 LCG.
LCG User Level Accounting John Gordon CCLRC-RAL LCG Grid Deployment Board October 2006.
Accounting in LCG/EGEE Can We Gauge Grid Usage via RBs? Dave Kant CCLRC, e-Science Centre.
LCG Accounting/Reporting John Gordon, STFC MB November 9 th 2011.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
Accounting in LCG Dave Kant CCLRC, e-Science Centre.
The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI-InSPIRE APEL for Accounting John Gordon, Stuart Pullinger STFC.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Accounting Requirements Stuart Pullinger STFC 09/04/2013 EGI CF – Accounting.
Accounting John Gordon WLC Workshop 2016, Lisbon.
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
APEL Architecture Alison Packer. Overview Grid jobs accounting tool APEL Client software - installed in sites (CEs, gLite- APEL node) APEL Server accepts.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Storage Accounting John Gordon, STFC OMB August 2013.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
E-science grid facility for Europe and Latin America Updates on Information System Annamaria Muoio - INFN Tutorials for trainers 01/07/2008.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Accounting Requirements Stuart Pullinger STFC 09/04/2013 EGI CF – Accounting.
LCG Accounting Update John Gordon, CCLRC-RAL 10/1/2007.
John Gordon EMI TF and EGI CF March 2012 Accounting Workshop.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Accounting Update John Gordon. Outline Multicore CPU Accounting Developments Cloud Accounting Storage Accounting Miscellaneous.
Storage Accounting John Gordon STFC GDB, Lyon 6 th April2011 GDB January 2012.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI A pan-European Research Infrastructure supporting the digital European Research.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI John Gordon EGI Virtualisation and Cloud Workshop Amsterdam 13 th May 2011.
DGAS Accounting – toward national grid infrastructures HPDC workshop on Monitoring, Logging and Accounting, (MLA) in production Grids 10/06/2009, Munich.
Implementation of GLUE 2.0 support in the EMI Data Area Elisabetta Ronchieri on behalf of JRA1’s GLUE 2.0 Working Group INFN-CNAF 13 April 2011, EGI User.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI John Gordon EGI Virtualisation and Cloud Workshop Amsterdam 12 th May 2011.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI APEL Regional Accounting Alison Packer (STFC) Iván Díaz Álvarez (CESGA) APEL.
Accounting Review Summary and action list from the (pre)GDB Julia Andreeva CERN-IT WLCG MB 19th April
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Accounting task force T. Ferrari/EGI.eu Accounting task force/TCB meeting.
HTCondor Accounting Update
HTCondor Accounting Update
GridPP37, Ambleside Adrian Coveney (STFC)
John Gordon STFC OMB 26 July 2011
The New APEL Client Will Rogers, STFC.
WLCG Resources Reporting
Benchmarking Changes and Accounting
Raw Wallclock in APEL John Gordon, STFC-RAL
JRA1.4 - Account for different resource types
A Messaging Infrastructure for WLCG
APEL Storage Accounting
Cristina del Cano Novales STFC - RAL
John Gordon (STFC) APEL PT Leader
New Types of Accounting Beyond CPU
DGAS Today and tomorrow
GLOBUS ACCOUNTING USING GRID-SAFE - DEMO
Accounting Repository
APEL as a Global Accounting Repository
User Accounting Integration Spreading the Net.
Presentation transcript:

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Global Accounting in the Grid and Cloud John Gordon, STFC HEPiX, Beijing 18 th October 2012

EGI-InSPIRE RI Outline History Yesterday Today Tomorrow New types of Accounting Record Future

EGI-InSPIRE RI Overview The APEL accounting system has been gathering cpu accounting records for the LHC experiments from around the world since 2004 It now contains data from 2x10**9 jobs from 350 sites in 50 countries. Work is under way to extend accounting to storage and cloud

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI History

EGI-InSPIRE RI History LCG->EGEE->WLCG->EGI+EMI Including along the way: –Gratia (OpenScience Grid) –DGAS (Italy) –SGAS (NorduGrid)

EGI-InSPIRE RI LCG In the beginning, there was Les Robertson. –Trying to form a Grid for LHC Computing from the plethora of national and international grid projects which had sprung up around the world The LHC Tier model* predated Grids but Grids offered technology on which to build the LHC distributed computing model(s). Getting working middleware was a problem but not the only one. There was also a need for a number of other operational services required for a working Grid. Les persuaded Tier1s to take responsibility for defining and developing various missing operational components –Karlsruhe – helpdesk –Lyon – operations portal –CERN – monitoring –RAL – accounting * MONARCH Report

EGI-InSPIRE RI The Start of APEL Dave Kant designed and wrote parsers for a few batch systems to gather usage data from the batch logs and the user grid identity information from Globus GRAM and then the LCG-CE. The APEL publishers combined this into a usage record for each job and sent it via R- GMA to RAL where they were processed, summarised, and visualised So the first sites started to publish

EGI-InSPIRE RI By December 2004, 15 CEs at 13 sites 15-Dec-04 Dave Kant -8

EGI-InSPIRE RI EGEE APEL accounting became a core service in EGEE ( ) and the client was rolled out to more sites CESGA took over running and developing the portal

EGI-InSPIRE RI Others Driven by WLCG, data from other accounting systems were incorporated, –DGAS – Italy –Gratia – Open Science Grid –SGAS – Nordugrid And EGEE extended beyond Europe to Asia Pacific, India,

EGI-InSPIRE RI WLCG EGEE pushed all sites to publish but functionality was driven by WLCG. The central APEL repository stores data by site The Portal pulls in topology to drive reports Tier1 Reports Tier2 federations formally defined Comparison with Pledges. (Tier2 Report)

EGI-InSPIRE RI APEL APEL collects cpu usage data from 274 sites by the APEL client software and from a further 90 where data is collected by other software and published to the central APEL repository. It is thus a single worldwide point of reference for all accounting data for a range of VOs(LHC and other international VOs as well as regional and national VOs). These non-APEL sites include alternative middleware stacks(ARC) within EGI and e-infrastructures within EGI (Italy, NorduGrid) and outside (Open Science Grid in US). The data from systems/services not using the APEL Client were doing direct database inserts. >2x10**9 jobs. 61% ATLAS Reached 73Mjobs/month in M HS06Years almost 50% in the last 12 months.

EGI-InSPIRE RI

EGI-InSPIRE RI

EGI-InSPIRE RI

EGI-InSPIRE RI Visualisation The Accounting Portal at CESGA Pulls data from APEL, applies various topologies (T1, T2, countries, NGIs) Allows dynamic queries at any point in a tree showing one variable as a function of two dimensions –Variables: njobs, cpu, wallclock, normcpu, normwall, cpueff –Dimensions: month, site/region, VO,

EGI-InSPIRE RI

EGI-InSPIRE RI

EGI-InSPIRE RI

EGI-InSPIRE RI

EGI-InSPIRE RI

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Internals

EGI-InSPIRE RI APEL – 2011 APEL client BROKER ActiveMQ EGI Accounting Portal CONSUMER accounting.grid- support.ac.uk External clients External clients MySQL Summaries created Summaries created

EGI-InSPIRE RI APEL Deployment Stage 1 APEL client BROKER ActiveMQ CONSUMER MySQL accounting.grid- support.ac.uk External clients External clients MySQL EGI Accounting Portal External clients External clients MySQL CPU JobRecords CPU Summaries MySQL CPU JobRecords CPU Summaries EGI Message Brokers Record loader Summaries created Receiving SSM Sending SSM

EGI-InSPIRE RI APEL Deployment Stage 2 APEL client BROKER ActiveMQ CONSUMER MySQL EGI Accounting Portal External clients External clients EGI Message Brokers accounting.grid- support.ac.uk MySQL CPU JobRecords CPU Summaries MySQL CPU JobRecords CPU Summaries Record loader DBunloader Converted JobRecords Receiving SSM Sending SSM New APEL client

EGI-InSPIRE RI APEL – 2013 New APEL client EGI Accounting Portal accounting.grid- support.ac.uk External clients External clients MySQL CPU JobRecords CPU Summaries MySQL CPU JobRecords CPU Summaries EGI Message Brokers Receiving SSM Sending SSM Record loader DBunloader New APEL client nagios pub/sync

EGI-InSPIRE RI What is SSM? Secure Stomp Messenger Simple program to send and receive messages Independent of message content Uses STOMP (and EGI brokers) ~1k lines of python code SSM Status Deployed in production for ~ 10 months No operational difficulties – (except for an LDAP query bug) Robust, fast Handles load easily Allows separation of messaging and server: – filesystem buffer – but no message checking

EGI-InSPIRE RI Accounting and SSM SSM can transport any data We use it for accounting Production: –APEL message format Testing: –EMI Compute Accounting Record (CAR) –EMI Storage Accounting Record (StAR)

EGI-InSPIRE RI SSM 2.0 SSM 1 works well But we are changing it Why? It’s a bit over- complicated –Difficult to develop against –Could be simpler Interoperability issues: – Crypto not well defined – Synchronous messaging – Unnecessary (?) message sequence EMI has different accounting systems which need to interoperate One python program is not good enough.

EGI-InSPIRE RI EMI Messaging Protocol for Accounting (EMPA) More logical, simpler Use persistent queues –Not replies Use SSL –Instead of encryption –(encryption still an option) Use the infrastructure

EGI-InSPIRE RI New Publishers A number of other accounting clients and systems are in the process of publishing to APEL All they need to do is extract the relevant jobs from their database, (combine into a summary), transform to the UR format and write to a local directory. SSM does the rest, shipping the files to APEL.

EGI-InSPIRE RI Publishers Emi-apel (bulk of EGI sites) DGAS (Italy) Open Science Grid (USA, others) SGAS (NorduGrid, Switzerland, Finland) Individual Sites (CERN, NIKHEF, CC-IN2P3) Unicore (Poland, Germany) Globus/IGE (Germany, ??) ARC/JURA (Hungary and other ARC) PRACE (many but only selected VOs) EDGI – Desktop Grids MAPPER - Multiscale Simulation (PRACE+EGI)

EGI-InSPIRE RI Extend EGI Infrastructure APEL Repository SSM/ActiveMQ Messaging APEL DGAS ARC/SGAS OSG/Gratia Sites MAP PER, EDGI

EGI-InSPIRE RI New Regionalised Structure Central APEL NGI APEL Emi-apel ARCCE NGI SGAS Regionalised APEL Server, collects job records from NGI sites Send summaries on to central APEL Visualisation Portal Portal OtherRepo

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI New Accounting Records

EGI-InSPIRE RI Readiness to Receive New Records The APEL repository is in the middle of a migration but the capability to receive usage records over SSM is already in production for both Job Records and Summary Job Records. It is being used by Open Science Grid and CERN, with a number of other publishers in migration. Consumers are in place on the test service for CAR and StAR. Adding others is a lightweight process. Much simpler than defining the record schema.

EGI-InSPIRE RI APEL – 2013 New APEL client EGI Accounting Portal accounting.grid- support.ac.uk External clients External clients MySQL CPU JobRecords CPU Summaries StorageRecords StorageSummaries MySQL CPU JobRecords CPU Summaries StorageRecords StorageSummaries EGI Message Brokers Receiving SSM Sending SSM Record loader DBunloader New APEL client Storage clients Storage clients Storage loader

EGI-InSPIRE RI CAR CAR is an EMI revision of the OGF UR v1.0 rationalising some issues and including some common extensions deployed in a number of implementations. Not UR 2.0 which is a bigger revision started by OGF UR-WG XML document CAR v1.0 was agreed but now some issues have arisen and CAR v1.0.1 is being finalised APEL new schema incorporates CAR v1.0

EGI-InSPIRE RI Storage StAR (Storage Accounting Record) is a version of the OGF UR 1.0 adapted for accounting of storage utilisation Developed by EMI, submitted as a public document to OGF, revised in the light of public comments. –Added Sitename, allocated space, and revised time definitions. Status – being implemented by EMI storage providers (dCache, StoRM, DPM) in EMI-3

EGI-InSPIRE RI Storage Solutions dCache (EMI) StoRM (EMI) DPM (EMI) Castor EOS BestMan Gratia (already collecting) xrootd hadoop Gstat Cloud Storage (Unicore) (ARC) iRods ????? The middleware in scope for EGI and WLCG Anything that collects storage information could also publish usage records eg Julia Andreeva talk at CHEP

EGI-InSPIRE RI Cloud The EGI Federated Cloud Task Force has adopted a VM UR based on OGF UR –Many values like memory and network were defined in the UR but seldom if ever implemented. Sensors to collect the information and construct usage records were written. –OpenNebula and OpenStack so far. SSM is used to send to RAL.

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Future

EGI-InSPIRE RI APEL continues to collect more cpu accounting data for a widening ranges of infrastructures, projects, and VOs A more distributed architecture should reduce bottlenecks and give more control to the NGIs New types of accounting record will be added (Storage, Cloud,...)

EGI-InSPIRE RI

EGI-InSPIRE RI

EGI-InSPIRE RI