Challenges and approaches for providing a pipeline as a service

Slides:



Advertisements
Similar presentations
Fabric Management at CERN BT July 16 th 2002 CERN.ch.
Advertisements

Canada-EU Future Internet Workshop Waterloo, Canada March 24th, 2011 Ignacio M. Llorente DSA-Research.org Distributed Systems Architecture Research Group.
GENI Experiment Control Using Gush Jeannie Albrecht and Amin Vahdat Williams College and UC San Diego.
Cloud SUT proposal OSGcloud group. Objective To fill in the Research the group about the thinking within the OSG working group To solicit new ideas/proposals.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
© 2009 IBM Corporation ® IBM Software Group Introduction to Cloud Computing Vivek C Agarwal IBM India Software Labs.
Plan Introduction What is Cloud Computing?
Deploying Moodle with Red Hat Enterprise Virtualization Brian McSpadden Director of Network Operations Remote-Learner.net.
Computer communication
Web Based Applications
Introduction To Windows Azure Cloud
Distributed Systems 1 CS- 492 Distributed system & Parallel Processing Sunday: 2/4/1435 (8 – 11 ) Lecture (1) Introduction to distributed system and models.
POSTSHARP TECHNOLOGIES Better software through simpler code.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker Platform integration into WS-PGRADE/gUSE Zoltán Farkas MTA.
Vic Liu Liang Xia Zu Qiang Speaker: Vic Liu China Mobile Network as a Service Architecture draft-liu-nvo3-naas-arch-01.
European Life Sciences Infrastructure for Biological Information META-pipe WP6 Kick-off Lars Ailo Bongo, ELIXIR-NO.
VMs in Azure Breeze Lab Manager 19 November 2012.
GLIDEINWMS - PARAG MHASHILKAR Department Meeting, August 07, 2013.
The Road to Continuous Delivery at Perforce Jonathan Thorpe Technical Marketing Manager Perforce Laurette Cisneros Build & Release Engineering Manager.
Master thesis Analysis and implementation of monitoring systems of active network equipment. Scientific advisor: Univ. Prof., Dr. Hab., Pavel TOPALA Master.
© 2012 IBM Corporation IBM Security Systems 1 © 2012 IBM Corporation Cloud Security: Who do you trust? Martin Borrett Director of the IBM Institute for.
Web Technologies Lecture 13 Introduction to cloud computing.
 Each program needs separation.  Better computer security by not installing multiple programs on one system.  Most out-of-the-box software assumes.
Lars Ailo Bongo NBS meeting Tromsø, Jan 23, 2016 NeLS Norwegian e-Infrastructure for Life Sciences Overview and recent developments
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
Commvault and Nutanix October Changing IT landscape Today’s Challenges Datacenter Complexity Building for Scale Managing disparate solutions.
Cloud readiness assessment
CLOUD ARCHITECTURE Many organizations and researchers have defined the architecture for cloud computing. Basically the whole system can be divided into.
Introduction to comp. and prog. CS 101 G 964
Chapter 6: Securing the Cloud
Computer Basics Recap and Virtual Machines
Programmable Hardware: Hardware or Software?
MMG: from proof-of-concept to production services at scale (part II)
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
ATLAS Cloud Operations
WP6: Marine metagenomics
Our cloud usage - and not
Network Configurations
Kay Ousterhout, Christopher Canel, Sylvia Ratnasamy, Scott Shenker
Tools and Services Workshop Overview of Atmosphere
ELIXIR activities in Norway (and Europe)
Operating System Structure
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
VMware és KVM környezetek változtatás nélkül a felhőben
Street Cleanliness Assessment System for Smart City using Mobile and Cloud Bharat Bhushan, Kavin Pradeep Sriram Kumar, Mithra Desinguraj, Sonal Gupta Project.
Software Defined Networking (SDN)
5 MAJOR BENEFITS OF CLOUD TESTING. Cloud testing is a mode of testing web applications which use cloud computing and infrastructure. It includes both.
Ticketing Systems with RT
Is your deployment in pants-down mode?
Getting Started.
Getting Started.
The Next Generation Cyber Security in the 4th Industrial Revolution
Operating System Review
Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.
BusinessObjects IN Cloud ……InfoSol’s story
Lessons being learnt from moving a legacy app to the cloud
Presented by Bogdan Stanca-Kaposta (Spirent)
Draft Proposal for an Eclipse Mobile Development Suite Architecture
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Features - Benefits Major Release January 2019
Traditional Virtualized Infrastructure
MMG: from proof-of-concept to production services at scale
Distributing META-pipe on ELIXIR compute resources
Summary Inf-2202 Concurrent and Data-Intensive Programming Fall 2016
OpenStack Summit Berlin – November 14, 2018
Client/Server Computing and Web Technologies
Productive + Hybrid + Intelligent + Trusted
SQL Server on Containers
Presentation transcript:

Challenges and approaches for providing a pipeline as a service Lars Ailo Bongo (ELIXIR-NO)

META-pipe: marine metagenomics analysis pipeline 2 as-a-service interfaces: Galaxy interface for Norwegian users (since 2015) Web app interface (to be released in 2017) Compute intensive Small team Academic setting

META-pipe backend architecture

Challenges What should be exposed to users? Issue: Trade-off: flexibility vs responsibility as-a-VM = more flexible, but users must install, allocate resources, handle failures… as-a-service = less flexible, but service provider takes more responsibility Issue: What to do when the analysis fails? Who should do it?

Challenges - examples Number of pipeline parameters More => better analysis results Fewer => easier to predict resource needs Resource allocation By user => can use national resources By service provider => easier to maintain service Failure handling By user => difficult to detect, understand and resolve By service provider => more complicated backend

META-pipe: Galaxy pipeline Standardized Galaxy interface: + flexible, extensible, powerful - big and complicated software stack Norwegian research infrastructure resources are pre-allocated + Not visible for users and easy to administer - Supercomputers not ideal for workload, industry and non-Norwegian users need to pay Failure handling is mostly left to users (red boxes) + Simpler backend - Difficult to understand failures, wasted resources due to restarts, more support work

META-pipe: Web app Web app interface: Bring your own cloud resources + Easier resource allocation, simpler software stack - Just META-pipe, limited configuration possibilities Bring your own cloud resources + Can use cheapest (/free) resources - Failure handling and support on external resources Failure handling is mostly hidden for users + Easy to understand job status - Backend must group together jobs, restart parts of failed jobs, and may need manual intervention

Summary Our goal: provide a service where resource allocation and failure handling is hidden from user Our approach: Backend designed to hide failures Abstract, layer, and simplify where possible Our challenges: Resources outside of our control (failure detection, handling, and support) Job structure (mix of high-memory and compute intensive) 3rd party tools (software quality issues, security issues)

Acknowledgments META-pipe team: ELIXIR-NO ELIXIR-FI and ELIXIR-CZ Nils P. Willassen, Lars Ailo Bongo, Erik Hjerde, Espen M. Robertsen, Inge Alexander Raknes, Aleksandr Agafonov, Terje Klemetsen, Giacomo Tartari … ELIXIR-NO NeLS ELIXIR-FI and ELIXIR-CZ AAI, cloud setup EXCELERATE WP6