Atmosphere: A Platform for Development, Execution and Sharing of Applications in Federated Clouds Marian Bubak Piotr Nowakowski, Marek Kasztelnik, Tomasz.

Slides:



Advertisements
Similar presentations
Polska Infrastruktura Informatycznego Wspomagania Nauki w Europejskiej Przestrzeni Badawczej Institute of Computer Science AGH ACC Cyfronet AGH The PL-Grid.
Advertisements

UrbanFlood Towards a framework for creation, deployment and reliable operation of distributed, time-critical applications Marian Bubak and Marek Kasztelnik.
High Performance Computing Course Notes Grid Computing.
Cloud Management Mechanisms
Polish Infrastructure for Supporting Computational Science in the European Research Space GridSpace Based Virtual Laboratory for PL-Grid Users Maciej Malawski,
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space User Oriented Provisioning of Secure Virtualized.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Towards auto-scaling in Atmosphere cloud platform Tomasz Bartyński 1, Marek Kasztelnik 1, Bartosz Wilk 1, Marian Bubak 1,2 AGH University of Science and.
Seamless Medical Image Processing on the Grid on the Example of Segmentation and Partition of the Airspaces Andrzej Rutkowski 1, Michał Chlebiej 1, Marcelina.
Distributed Cloud Environment for PL-Grid Applications Piotr Nowakowski, Tomasz Bartyński, Tomasz Gubała, Daniel Harężlak, Marek Kasztelnik, J. Meizner,
Summer School on Grid and Cloud Workflows and Gateways, Budapest, 1-6 July July 2013 Enabling building and execution of VPH applications on federated.
CIRRUS Workshop, Vienna, Austria119 Nov 2013 Security in the Cloud Platform for VPH Applications Marian Bubak Department of Computer Science and Cyfronet,
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
CGW 2003 Institute of Computer Science AGH Proposal of Adaptation of Legacy C/C++ Software to Grid Services Bartosz Baliś, Marian Bubak, Michał Węgiel,
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
SimCity Building Blocks at the DICE team
Software Architecture
Experience with the OpenStack Cloud for VPH Applications Jan Meizner 1, Maciej Malawski 1,2, Piotr Nowakowski 1, Paweł Suder 1, Marian Bubak 1,2 AGH University.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
EC-project number: Universal Grid Client: Grid Operation Invoker Tomasz Bartyński 1, Marian Bubak 1,2 Tomasz Gubała 1,3, Maciej Malawski 1,2 1 Academic.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Lightweight construction of rich scientific applications Daniel Harężlak(1), Marek Kasztelnik(1), Maciej Pawlik(1), Bartosz Wilk(1) and Marian Bubak(1,
Federating PL-Grid Computational Resources with the Atmosphere Cloud Platform Piotr Nowakowski, Marek Kasztelnik, Tomasz Bartyński, Tomasz Gubała, Daniel.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Chapter 8 – Cloud Computing
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
Web Technologies Lecture 13 Introduction to cloud computing.
Lecture III: Challenges for software engineering with the cloud CS 4593 Cloud-Oriented Big Data and Software Engineering.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
ALL INFORMATION PRESENTED AS WELL AS ALL SESSIONS ARE MICROSOFT CONFIDENTIAL AND UNDER YOUR NON-DISCLOSURE AGREEMENT (NDA) AND\OR TECHNOLOGY PREVIEW.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
Introduction to the VPH-Share Atmosphere Cloud Platform Piotr Nowakowski, Tomasz Bartyński, Marian Bubak, Tomasz Gubała, Daniel Harężlak, Marek Kasztelnik,
Development, Execution and Sharing of VPH Applications in the Cloud with the Atmosphere Platform Piotr Nowakowski, Tomasz Bartyński, Marian Bubak, Tomasz.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
Support for Taverna Workflows in VPH-Share Cloud Platform Marek Kasztelnik 1, Marian Bubak 2,1, Maciej Malawski 2,1, Piotr Nowakowski 1, Ernesto Coto 3,
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
PLG-Data and rimrock Services as Building
Piotr Bała, Marcin Radecki, Krzysztof Benedyczak
Accessing the VI-SEEM infrastructure
Department of Computer Science AGH
Demo of the Model Execution Environment WP2 Infrastructure Platform
Demo of the Model Execution Environment WP2 Infrastructure Platform
Model Execution Environment Current status of the WP2 Infrastructure Platform Marian Bubak1, Daniel Harężlak1, Marek Kasztelnik1 , Piotr Nowakowski1, Steven.
From VPH-Share to PL-Grid: Atmosphere as an Advanced Frontend
Dag Toppe Larsen UiB/CERN CERN,
Dag Toppe Larsen UiB/CERN CERN,
Model Execution Environment for Investigation of Heart Valve Diseases
StratusLab Final Periodic Review
StratusLab Final Periodic Review
DI4R Conference, September, 28-30, 2016, Krakow
DIRAC services.
Platform as a Service.
PROCESS - H2020 Project Work Package WP6 JRA3
Infrastructure for Personalised Medicine: It’s MEE that Matters!
Final Review 27th March Final Review 27th March 2019.
H2020 EU PROJECT | Topic SC1-DTH | GA:
H2020 EU PROJECT | Topic SC1-DTH | GA:
Presentation transcript:

Atmosphere: A Platform for Development, Execution and Sharing of Applications in Federated Clouds Marian Bubak Piotr Nowakowski, Marek Kasztelnik, Tomasz Bartyński, Tomasz Gubała, Daniel Harężlak, Maciej Malawski, Jan Meizner, Bartosz Wilk ACC Cyfronet AGH Krakow, POLAND EGI Community Forum 2015, November 9 – 14, 2015, Bari, Italy

Outline Context: the PLGrid infrastructure Purpose of the cloud platform VPH-Share legacy Integration with the PLGrid infrastructure Atmosphere interfaces Multitenancy support Example: HyperFlow Summary: Features of the Atmosphere

PL-Grid Project Publication of the book presenting the scientific and technical achievements of the Polish NGI in the Springer Publisher, in March 2012: „Building a National Distributed e- Infrastructure – PL-Grid” In Lecture Notes in Computer Science, Vol. 7136, subseries: Information Systems and Applications Content: 26 articles describing the experience and the scientific results obtained by the PL-Grid project partners as well as the outcome of research and development activities carried out within the Project. First working NGI in Europe in the framework of EGI.eu (since March 31, 2010) Number of users (March 2012): 900+ Number of jobs per month:750, ,500,000 Resources available: Computing power: ca. 230 TFlops Storage: ca TBytes High level of availiability and realibility of the resources Facilitating effective use of these resources by providing: innovative grid services and end-user tools like Efficient Resource Allocation, Experimental Workbench and Grid Middleware Scientific Software Packages User support: helpdesk system, broad training offer Various, well-performed dissemination activities, carried out at national and international levels, which contributed to increasing of awareness and knowledge about the Project and the grid technology in Poland.

New domain-specific services for 13 identified scientific domains Extension of the resources available in the PL-Grid Infrastructure by ca. 500 TFlops of computing power and ca. 4.4 PBytes of storage capacity Design and start-up of support for new domain grids Deployment of Quality of Service system for users by introducing SLA agreement Deployment of new infrastructure services Deployment of Cloud infrastructure for users Broad consultancy, training and dissemination offer Publication of the book presenting the scientific and technical achievements of PLGrid Plus in the Springer Publisher, in September 2014: „eScience on Distributed Computing Infrastructure” In Lecture Notes in Computer Science, Vol. 8500, subseries: Information Systems and Applications PLGrid Plus Project Content: 36 articles describing the experience and the scientific results obtained by the PLGrid Plus project partners as well as the outcome of research and development activities carried out within the Project. Effort of 147 authors, 76 reviewers and editors team in Cyfronet

PLGrid Core Project Competence Centre in the Field of Distributed Computing Grid Infrastructures Budget: total ,16 PLN, including funding from the EC : ,99 PLN Duration: – Project Coordinator: Academic Computer Centre CYFRONET AGH The main objective of the project is to support the development of ACC Cyfronet AGH as a specialized competence centre in the field of distributed computing infrastructures, with particular emphasis on grid technologies, cloud computing and infrastructures supporting computations on big data.

PLGrid Core Services Basic infrastructure services Uniform access to distributed data PaaS Cloud for scientists Applications maintenance environment of MapReduce type End-user services Technologies and environments implementing the Open Science paradigm Computing environment for interactive processing of scientific data Platform for development and execution of large-scale applications organized in a workflow Automatic selection of scientific literature Environment supporting data farming mass computations

Install/configure each application service (which we call a Cloud Service or an Atomic Service) once – then use them multiple times in different workflows; Direct access to raw virtual machines is provided for developers, with multitudes of operating systems to choose from (IaaS solution); Install whatever you want (root access to cloud Virtual Machines); The cloud platform takes over management and instantiation of Cloud Services; Many instances of Cloud Services can be spawned simultaneously Purpose of the cloud platform Developer Application Install any scientific application in the cloud End user Access available applications and data in a secure manner Administrator Cloud infrastructure for e-science Manage cloud computing and storage resources Managed application

VPH-Share Computational Cloud: developers’ view MySpine v22 (cloud service)ViroLab DRS (cloud service)GIMIAS (cloud service)Ubuntu Linux (OS template)MS Windows (OS template) VPH-Share Computational Cloud Service instance Ubuntu Linux (OS template) Hardware: 4 cores, 8 GB RAM, 30 GB HDD 1.The developer, selects and instantiates a service. This can be an existing application (e.g. MySpine) or a „raw” OS image (e.g. MS Windows Server) Service instance OpenLab v1 (development) Hardware: 4 cores, 8 GB RAM, 30 GB HDD 2. The VPH-Share cloud platform creates a virtual machine which hosts an instance of the selected template and provides the required quantity of hardware resources. 2’. The developer may securely log into the machine and install any software of his/her choosing – in this case, we’re installing the OpenLabyrinth virtual patient management application. OpenLab v1 (cloud service) 3. Once the developer is satisfied with the application, he/she may instruct the VPH- Share cloud platform to package and upload it as a new service to the cloud. OpenLab v1 (cloud service) 4. The new service is automatically registered in VPH- Share and can be shared with platform users (including other developers, who may wish to continue development work on the application). Developer

VPH-Share Computational Cloud: end users’ view MySpine v22 (cloud service)ViroLab DRS (cloud service)GIMIAS (cloud service) VPH-Share Computational Cloud 1.The scientist browses the available services and requests access to a given service. Only published services (not OS templates) are visible to non-developers. Service instance OpenLab v1 (cloud service) Hardware: as required by the application 2. The service’s owner (typically its developer) approves the sharing request, following which the scientist may spawn instances of the service in the cloud infrastructure 2’. The cloud platform creates an instance of the service with the appropriate hardware allocation (as set by the developer) and provides the user with access information. OpenLab v1 (cloud service) 3. The user is free to interact with the service. Once the instance is no longer needed, it can be shut down in order to conserve computational resources. Scientist OpenLab v1 (cloud service) Authorized users: Developer

VPH-Share: cloud site integration VPH-Share services host portal.vph-share.eu WP2 Core Services Host vph.cyfronet.pl Atmosphere-VPH Secure RESTful API (Cloud Facade) Atmosphere core (internal dependency) VPH-Share cloud site at CYF Worker nodes Template repository VPH-Share cloud site at UNIVIE Worker nodes Template repository MS Azure VPH-Share cloud account Worker nodes Template repository RackSpace VPH-Share cloud account Worker nodes Template repository Google Compute VPH-Share cloud account Worker nodes Template repository Amazon EC2 VPH-Share site Worker nodes Template repository Amazon EC2 CISTIB site (DARE only) Worker nodes Template repository

DataFluo Listener RabbitMQ DataFluo Server CS RabbitMQ Worker CS RabbitMQ Worker CS Cloud Facade Atmosphere Management Service (Launches server and automatically scales workers) Atmosphere Scientist Launcher script Secure API Problem: Cardiovascular sensitivity study: 164 input parameters (e.g. vessel diameter and length) First analysis: 1,494,000 Monte Carlo runs (expected execution time on a PC: 14,525 hours) Second Analysis: 5,000 runs per model parameter for each patient dataset; requires another 830,000 Monte Carlo runs per patient dataset for a total of four additional patient datasets – this results in 32,280 hours of calculation time on one personal computer. Total: 50,000 hours of calculation time on a single PC. Solution: Scale the application with cloud resources. VPH-Share implementation: Scalable workflow deployed entirely using VPH-Share tools and services. Consists of a RabbitMQ server and a number of clients processing computational tasks in parallel, each registered as a service. The server and client services are launched by a script which communicates directly withe the Cloud Facade API. VPH-Share example: cardivascular sensititity study

VPH-Share Scientific achievements Methods for integration of heterogeneous public cloud infrastructures into a unified computational environment Cost optimization of application deployment in a heterogeneous cloud environment (choosing optimal resources given the available funds) Federated data storage mechanisms with extensions for public storage services Dealing with resource demand spikes (optimization of platform middleware and user interfaces) An innovative approach to data-centric computing on distributed resources

VPH-Share cloud platform in numbers Five cloud IaaS technologies (OpenStack, EC2, MS Azure, RackSpace, Google Compute) Seven distinct cloud sites registered with the VPH-Share infrastructure Over 250 Atomic Services available Approximately 100 service instances operating on a daily basis Over 1 million files transferred 3 PhD theses, over 30 papers and dozens of presentations.

Integration with PL-Grid authentication mechanisms Atmosphere has been integrated with PL-Grid authentication and authorization mechanisms. In order to make use of platform features, a valid PL-Grid login must be supplied. The Atmosphere client will keep track of the user session and delegate client credentials along with every request submitted to the core application, thereby providing single sign-on capabilities for the whole platform. If required, client proxy certificates may be further delegated to virtual machines running client services so that these VM instances may also run jobs in the PL-Grid infrastructure on behalf of the user who launched them.

Cloud platform interfaces All operations on cloud hardware are abstracted by the Atmosphere platform which exposes a RESTful API. For end users, a set of GUIs provides a user-friendly work environment. The API can also be directly invoked by external services (Atmosphere relies on the well-known OpenID authentication standard with PL-Grid acting as its identity provider). Application -- or -- Workflow environment End user A full range of user-friendly GUIs is provided to enable service creation, instantiation and access. A comprehensive online user guide is also available. Atmosphere Registry (AIR) Atmosphere Ruby on Rails controller layer (core Atmosphere logic) Cloud sites The GUIs work by invoking a secure RESTful API which is exposed by the Atmosphere host. We refer to this API as the Cloud Facade. Any operation which can be performed using the GUI may also be invoked programmatically by tools acting on behalf of the platform user – this includes standalone applications and workflow management environments.

Administrative interfaces A comprehensive suite of UIs is provided for platform administrators. Administrators may register new compute sites for integration with the PL- Grid cloud infrastructure, review the composition and activity of individual teams, launch/terminate instances, obtain historical data regarding resource usage, register/deregister templates, set flavor pricing and perform all other actions required for proper cloud infrastructure maintenance.

Multitenancy support Atmosphere Cloud Site (OpenStack) Tenant 1 plgg-team1 Service templates Tenant 2 plgg-team2 Service templates Controller layer Cloud site, tenant and template management interfaces PL-Grid plgg-team1plgg-team2 Atmosphere now supports multitenancy in cloud environments. A cloud site can be partitioned into multiple tenants, each of which comprises a set of service templates and computing resources for a distinct group of users. A new tenant is automatically created for each PL-Grid team whose users request access to the cloud infrastructure. Tenants provide separation between user groups, allowing them to share resources belonging to a common cloud site. When launching service instances each user may specify which context (represented by team names) the given instance will be run in. Atmosphere respect the user’s choice by initializing the instance within a specific tenant.

Example - workflow deployed in the cloud with the HyperFlow The user uses hflowc on the UI host The workflow is launched in the cloud (Cloud 2.0) User certificates are automatically injected into user VMs Data is transferred by GridFTP to the user’s UI account

End-user tutorials and documentation Platform tutorials and user guides are available to all registered PL-Grid users. The guides ( explain how to apply for the Cloud 2.0 service, how to access cloud resources, create/instantiate services, perform development work and save new service templates. The documentation also provides optimal usage tips and explains common pitfalls associated with interaction with virtualized cloud resources.

Summary: Features of Atmosphere Layer of abstraction over cloud-based virtual machines A way to port scientific applications to the cloud Automatic selection of the best resources to deploy applications Automatic load balancing to scale up applications as needed VM image synchronization for hybrid clouds Billing model for commercial clouds Support for commercial clouds: Amazon EC2, Google Compute, RackSpace, MS Azure etc. Support for various application classes: Linux-based SOAP/REST services Web applications rich GUI clients running under MS Windows Integration with the PL-Grid ecosystem: authentication, authorization, sharing co-tenancy, billing and accounting support

For further information… A more detailed introduction to the Atmosphere cloud platform (including user manuals) can be found at The PL-Grid team responsible for development and maintenance of the cloud platform is plgg-cloud You’re also welcome to visit the DIstributed Computing Environments (DICE) team homepage at and our brand new GitHub site at