Www.bsc.es Daniele Lezzi Execution of scientific workflows on federated multi-cloud infrastructures IBERGrid Madrid, 20 September 2013.

Slides:



Advertisements
Similar presentations
Barcelona Supercomputing Center. The BSC-CNS objectives: R&D in Computer Sciences, Life Sciences and Earth Sciences. Supercomputing support to external.
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Contrail and Federated Identity Management
SLA-Oriented Resource Provisioning for Cloud Computing
1 EGI Federated Clouds Task Force HEPiX Spring 2012 Workshop Matteo Turilli
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
Architecture overview 6/03/12 F. Desprez - ISC Cloud Context : Development of a toolbox for deploying application services providers with a hierarchical.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
Raffaele Di Fazio Connecting to the Clouds Cloud Brokers and OCCI.
BESIII distributed computing and VMDIRAC
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
TRACEREP: GATEWAY FOR SHARING AND COLLECTING TRACES IN HPC SYSTEMS Iván Pérez Enrique Vallejo José Luis Bosque University of Cantabria TraceRep IWSG'15.
This project is partially funded by European Commission under the 7th Framework Programme - Grant agreement no ECO 2 Clouds team Barbara Pernici,
INFSO-RI Module 01 ETICS Overview Alberto Di Meglio.
European Grid Initiative Federated Cloud update Peter solagna Pre-GDB Workshop 10/11/
Introduction 1-1 Introduction to Virtual Machines From “Virtual Machines” Smith and Nair Chapter 1.
BLU-ICE and the Distributed Control System Constraints for Software Development Strategies Timothy M. McPhillips Stanford Synchrotron Radiation Laboratory.
INFSO-RI Module 01 ETICS Overview Etics Online Tutorial Marian ŻUREK Baltic Grid II Summer School Vilnius, 2-3 July 2009.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
ServiceSs, a new programming model for the Cloud Daniele Lezzi, Rosa M. Badia, Jorge Ejarque, Raul Sirvent, Enric Tejedor Grid Computing and Clusters Group.
ProActive components and legacy code Matthieu MOREL.
Federating PL-Grid Computational Resources with the Atmosphere Cloud Platform Piotr Nowakowski, Marek Kasztelnik, Tomasz Bartyński, Tomasz Gubała, Daniel.
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
European Grid Initiative Data Services and Solutions Part 2: Data in the cloud Enol Fernández Data Services.
1 FedCloud Task Force Demo EGI CF2012 – Munich 28/29 March Matteo Turilli
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Running Big Data on the EGI Federated Cloud Javier Lopez Cacheiro, Álvaro.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI VM Management Chair: Alexander Papaspyrou 2/25/
Microsoft Cloud Computing. Topics to be covered 1.Environmental Features of windows azure 2.What is Cloud Computing 3.Roles in Cloud Computing 4.Benefits.
OpenNebula: Experience at SZTAKI Peter Kacsuk, Sandor Acs, Mark Gergely, Jozsef Kovacs MTA SZTAKI EGI CF Helsinki.
EGI Technical Forum Madrid COMPSs in the EGI Federated Cloud Daniele Lezzi – BSC EGI Technical Forum Madrid.
The EUBrazilOpenBio-BioVeL Use Case in EGI Daniele Lezzi, Barcelona Supercomputing Center EGI-TF September 2013.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
INFN OCCI implementation on Grid Infrastructure Michele Orrù INFN-CNAF OGF27, 13/10/ M.Orrù (INFN-CNAF) INFN OCCI implementation on Grid Infrastructure.
Cloud interoperability and elasticity with COMPSs Federated Cloud F2F Jan , Amsterdam Daniele Lezzi – Barcelona Supercomputing Center.
StratusLab is co-funded by the European Community’s Seventh Framework Programme (Capacities) Grant Agreement INFSO-RI Demonstration StratusLab First.
Instituto de Biocomputación y Física de Sistemas Complejos Cloud resources and BIFI activities in JRA2 Reunión JRU Española.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
Open Data and Cloud Computing e-Infrastructure for Biodiversity Daniele Lezzi Barcelona Supercomputing Center International Workshop on Science Gateways.
EGI Technical Forum Madrid The EUBrazilOpenBio-BioVeL Use Case in EGI Daniele Lezzi – BSC EGI Technical Forum Madrid.
Federated Cloud: Computing UPVLC-I3M Effort allocated: 6 pms. Proposal: Integration of an Infrastructure Broker with self-configuration and auto-scaling.
European Grid Initiative The EGI Federated Cloud as Educational and Training Infrastructure for Data Science Tiziana Ferrari/ EGI.eu.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI A pan-European Research Infrastructure supporting the digital European Research.
Multi-community e-Science service connecting grids & clouds R. Graciani 1, V. Méndez 2, T. Fifield 3, A. Tsaregordtsev 4 1 University of Barcelona 2 University.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Federated Cloud Update.
LOFAR - Calibration, Analysis and Modelling of Radio-Astronomy Data EGI Conference May 2015, Lisbon Daniele Lezzi – Barcelona Supercomputing.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Support to user communities in EGI with COMPSs Federated Cloud F2F Jan , Amsterdam Daniele Lezzi – Barcelona Supercomputing Center.
1 EGI Federated Cloud Architecture Matteo Turilli Senior Research Associate, OeRC, University of Oxford Chair – EGI Federated Clouds Task Force
The EGI Federated Cloud
Ecological Niche Modelling in the EGI Cloud Federation
Federated Cloud Computing
FedCloud Blueprint Update
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Grid Computing.
Brief introduction to the project
Collaborative Offloading for Distributed Mobile-Cloud Apps
Sky Computing on FutureGrid and Grid’5000
Specialized Cloud Mechanisms
Module 01 ETICS Overview ETICS Online Tutorials
ELIXIR Competence Center
Sky Computing on FutureGrid and Grid’5000
Presentation transcript:

Daniele Lezzi Execution of scientific workflows on federated multi-cloud infrastructures IBERGrid Madrid, 20 September 2013

2 Outline Introduction –Objectives –The COMP Superscalar framework COMPSs Interoperability in the EGI Federated Cloud –The EGI Cloud federation model –COMPSs integration with EGI FedCloud Evaluation –The Modeller service –Testbed Results –Single request scenario –Multiple requests scenario Conclusions and Future Work

Introduction

4 Objectives 1.Enhance the COMP Superscalar programming framework to interoperate with the EGI FedCloud. 2.Optimize the Modeller biodiversity service in EUBrazilOpenBio. 3.Evaluate the service performance on a federated cloud environment. 4

5 StarSs CellSs SMPSs GPUSs GridSs ClearSpeedSs ClusterSs OmpSs ClusterSs Cluster Programmability/Portability – Incremental parallelization/restructure. – Focus in the problem, not in the hardware. – Top/down programming. – Supported languages: Java, C and Python. – “Same” source code runs on “any” machine (on Java) Optimized task implementations Performance (Intelligent Runtime) Asynchronous (data-flow) execution and locality awareness. Automatically extracts and exploits parallelism. Malleable, matches computations to specific resources on each type of target platform. The COMPSs Framework

6 The COMPSs Framework (Programming Model) Resource 2 1. Identify tasks main program { } 2. Select tasks taskA(...); taskB(...); task selection interface { } taskA taskB Task Unit of parallelism...‏ taskA Asynchrony taskB Resource 1 Resource N 6

77 The COMPSs Framework (Interoperability)

COMPSs Interoperability in the EGI FedCloud

9 Standards and validation: emerging standards for the interfaces and images (OCCI, CDMI, OVF). Resource integration: Cloud Computing to be integrated into the existing production infrastructure. Heterogeneous implementation: no mandate on the cloud technology. Provider agnosticism: the only condition to federate resources is to expose the chosen interfaces and services. The EGI Federation Model

10 COMPSs integration with EGI FedCloud COMPSs Application: implementation of the application logic, where some tasks will be instrumented by the COMPSs runtime and executed remotely on EGI FedCloud resources. Cloud Connectors: implements a common interface allowing the resource management on an specific provider. OCCI Connector: translates COMPSs resource management calls to OCCI operations. Configuration: each provider’s available templates and images are set up on COMPSs configuration.

11 Retrieve AC With VOMS Proxy Cert. OCCI Account Synchronization COMPSs integration with EGI FedCloud Configure of COMPSs runtime (available cloud providers, endpoints, etc.) Configure of the OCCI connector (VM templates, VOMS credentials). Generate of the VOMS proxy certificate. Execute the COMPSs application. During the execution, some tasks could have hardware constraints (CPU, Mem, Disk, …) OCCI Connector: maps each task requirements to a suitable template of the available cloud providers, starting new VMs if needed.

12 COMPSs integration with EGI FedCloud A COMPSs VM is available with the required software in the VM repository. The EGI Marketplace contains the list of providers offering this VM.

Evaluation

14 Ecological niche: “Set of ecological requirements for a species to survive and maintain viable populations over the time.” (Grinnel, 1917) Species occurrence points Environmental variables Modelling algorithm Projected niche model The Modeller Service (Ecological Niche Modelling)

15 The Modeller Service (operations) –Convert: Converts multi-job request in a single requests. –Model: models a specie distribution for a given request. –Test: Checks the accuracy of the distribution. –Project: Project each distribution over geographical layers in raster format. –Translate: Translates the raster projection into an image.

16 The Modeller Service (evaluated scenarios) Single multi-job request (workflow) Multiple requests Single request scenario: Issues a multi-job request (6 species and 2 modelling algorithms) producing 12 distribution models. It exploits different FedCloud templates. Multiple requests scenario: tests the global service performance for a given workload pattern (Gaussian random) by issuing many requests with low complexity (3 tasks). ENM Service (OMWS2)

17 The Testbed –Client: Issues requests to the Modeller Service. –WS Server: Hosts the service logic and the COMPSs runtime. –EGI FedCloud: Composed of two providers, both providing rOCCI server and OpenNebula cloud middleware. –CESNET (Czech Rep.): Provide the service with XLarge VMs (4 cores and 15 GB of mem). –CESGA (Spain): Provide the service with Large VMs (2 cores and 8 GB of mem).

Results

19 Results: Single request scenario System load vs. available virtual resources

20 Results: Multiple request scenario 1.Workload Distribution: JMeter generates a random Gaussian workload. Generate 30 requests randomly distributed on a 0 – 2 minute frames. Each request issues low computation time requests. 2.Test phase: Test duration = 5 hours. Test starts with pre-created VMs :  1 Large (CESGA)  2 XLarge (CESNET) 3.Analysis of service key performance indicators (KPI): Response Time Dynamic Resource Consumption Throughput Evaluation of global service performance: …

21 Results: Multiple request scenario Workload Distribution:

22 Results: Multiple request scenario Response time vs. Resource Consumption:

23 Results: Multiple request scenario System throughput:

Conclusions and Future Work

25 COMPSs, programming framework for optimization of complex workflows on different infrastructures: Grid Clusters Clouds Extensions of COMPSs for the interoperability with the EGI FedCloud: Dynamic VM multi-provider management through the rOCCI connector. Support to x509 security. The results demonstrate that the runtime is able to: Serve multiple requests properly managing the pool of resources. Keep the performance independently on the workload. Optimize the resources consumption. Conclusions

26 Improve the connector to support contextualization features. Enhance resource management to support multi-core tasks. Future Work

Thank you!

28 Results: Multiple request scenario