Pro’s and Con’s of Cloud Computing for Scientific Data – the (on-going) experience of D4Science Andrea Manieri Engineering Ingegneria Informatica s.p.a.

Slides:



Advertisements
Similar presentations
Canada-EU Future Internet Workshop Waterloo, Canada March 24th, 2011 Ignacio M. Llorente DSA-Research.org Distributed Systems Architecture Research Group.
Advertisements

D4Science: a Data Infrastructure Ecosystem for Science DL.org Autumn School – Athens, 3-8 October 2010 Leonardo Candela 6 th October 2010.
By Adam Balla & Wachiu Siu
Open Workshop on e-Infrastructures, Helsinki October 4 – 5, 2006 Roadmap Parallel Session on last chapter of e-IRG Roadmap: Crossing the Boundaries of.
D4Science Project (DILIGENT For Science) Donatella Castelli CNR-ISTI DRIVER Summit January 2008 Gottingen (Germany)
Final Review Meeting 16 th March 2010 Brussels (Belgium) D4Science Production Infrastructure Contract n°: RI Pedro Andrade (CERN)
Cloud Computing Will Crowley Monica Lopez Jaimie Morrison.
Presented by Sujit Tilak. Evolution of Client/Server Architecture Clients & Server on different computer systems Local Area Network for Server and Client.
BI in the Cloud – Sky is the limit Vishal Agrawal Product Technical Architect Infosys Tech Ltd Anand Govindarajan Principal Technology Architect Infosys.
SaaS, PaaS & TaaS By: Raza Usmani
Duncan Fraiser, Adam Gambrell, Lisa Schalk, Emily Williams
Clouds on IT horizon Faculty of Maritime Studies University of Rijeka Sanja Mohorovičić INFuture 2009, Zagreb, 5 November 2009.
Introduction to Cloud Computing
3 Cloud Computing.
A.V. Bogdanov Private cloud vs personal supercomputer.
1/8 Enhancing Grid Infrastructures with Virtualization and Cloud Technologies Ignacio M. Llorente Business Workshop EGEE’09 September 21st, 2009 Distributed.
Software to Data model Lenos Vacanas, Stelios Sotiriadis, Euripides Petrakis Technical University of Crete (TUC), Greece Workshop.
Cloud Computing Saneel Bidaye uni-slb2181. What is Cloud Computing? Cloud Computing refers to both the applications delivered as services over the Internet.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
DuraCloud Managing durable data in the cloud Michele Kimpton, Director DuraSpace.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Procurement Innovation for Cloud Services in Europe CERN – 14 May 2014 Bob Jones (CERN) This document produced by Members of the Helix Nebula consortium.
Interoperability in the Cloud By Alex Espinoza
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Climate Sciences: Use Case and Vision Summary Philip Kershaw CEDA, RAL Space, STFC.
An emerging computing paradigm where data and services reside in massively scalable data centers and can be ubiquitously accessed from any connected devices.
Virtual Research Environments: e-Infrastructures beyond Digital Libraries Pasquale Pagano CNR-ISTI RCDL08 Conference Information.
2009 Federal IT Summit Cloud Computing Breakout October 28, 2009.
What is the cloud ? IT as a service Cloud allows access to services without user technical knowledge or control of supporting infrastructure Best described.
1 NETE4631 Course Wrap-up and Benefits, Challenges, Risks Lecture Notes #15.
A trust and Contract Management framework enabling secure collaborative business processing in on-demand created, self- managed, scalable, and highly dynamic.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse Technical Director CERN.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EMI INFSO-RI JRA1.5-Infra-Cloud Computing Task Force Shahbaz Memon (JUELICH)
Service Level Agreements for D- Grid Introducing Commercial Use Cases to an Academic e-Science Infrastructure Philipp Wieder,
1 European e-Infrastructure experiences gained and way ahead OGF 20 / EGEE User’s Forum 9 th May 2007 Mário Campolargo European Commission - DG INFSO Head.
1 NETE4631 Network Information Systems : Introduction to Cloud Computing Lecture Notes #2.
Centre d’Excellence en Technologies de l’Information et de la Communication Evolution dans la gestion d’infrastructure de type Cloud (SDI)
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Helix Nebula The Science Cloud CERN – 13 June 2014 Alberto Di MEGLIO on behalf of Bob Jones (CERN) This document produced by Members of the Helix Nebula.
Chapter 8 – Cloud Computing
CLOUD COMPUTING RICH SANGPROM. What is cloud computing? “Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a.
Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on Virtual Research Environments Pedro Andrade (CERN)
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
AFACT Cloud Computing WG Zon-yin Shae Institute for Information Industry Bangkok, Thailand, Nov. 26, 2014.
On the D4Science Approach Toward AquaMaps Richness Maps Generation Pasquale Pagano - CNR-ISTI Pedro Andrade.
D4Science and ETICS Building and Testing gCube and gCore Pedro Andrade CERN EGEE’08 Conference 25 September 2008 Istanbul (Turkey)
European Middleware Initiative (EMI) Alberto Di Meglio (CERN) Project Director.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Cloud-based e-science drivers for ESAs Sentinel Collaborative Ground Segment Kostas Koumandaros Greek Research & Technology Network Open Science retreat.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
RANDY MODOWSKI COSC Cloud Computing. Road Map What is Cloud Computing? History of “The Cloud” Cloud Milestones How Cloud Computing is being used.
Cloud Powered Rural Telecenters – A Model for Sustainable Telecenters Osman Ghazali, Baharudin Osman, Azizah Ahmad, Azizi Abas, Abdul Razak Rahmat, Mohamed.
Discussion Context NIST Cloud definition and extension to address network and infrastructure issues Discussion of the ISPD-RG Infrastructure definition.
Pedro Andrade > IT-GD > D4Science Pedro Andrade CERN European Organization for Nuclear Research GD Group Meeting 27 October 2007 CERN (Switzerland)
4a. Aula 2o. Período de Livro texto Copyright © 2012, Elsevier Inc. All rights reserved March 5, 2012 Prof. Kai Hwang, USC Cloud Roles in.
Virtual Research Environments as-a-Service Donatella Castelli CNR-ISTI EGI Conference 2016, 6-8 April.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
EGI-InSPIRE EGI-InSPIRE RI The European Grid Infrastructure Steven Newhouse Director, EGI.eu Project Director, EGI-InSPIRE 29/06/2016CoreGrid.
1 Tutorial Outline 30’ From Content Management Systems to VREs 50’ Creating a VRE 80 Using a VRE 20’ Conclusions.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
Jordi Farres HMA-WG Meeting ESRIN, 23 Jan 2013
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
Chapter 21: Cloud Computing and Related Security Issues
Introduction to Cloud Computing
Chapter 22: Cloud Computing Technology and Security
3 Cloud Computing.
Emerging technologies-
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
Presentation transcript:

Pro’s and Con’s of Cloud Computing for Scientific Data – the (on-going) experience of D4Science Andrea Manieri Engineering Ingegneria Informatica s.p.a. 5° EGEE User Forum April 2010 Uppsala (Sweden)

2 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden)  Cloud Computing: mith or reality?  gCube evolution meet Cloud technologies  Scientific Data infrastructures and Cloud  Towards sustainable Scientific Data Infrastructures

3 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) What analysts say  N.Carr - IT doesn’t matters, 2003  IT as a commodity - Not marketing, business!  J.Gray – Distributed Computing Economy, 2003  Data near the compution – a economy thread-off analysis  N.Carr – from EDISON to GOOGLE, rewiring the world, 2007  The world wide computer and the implication on economy and society  Advices and check-lists  Garnter, IDC, Forrester-research  Security, License schema, ROI

4 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Data in the Cloud

5 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) What experts say  Old technology with new behaviors  The illusion of infinite computing resources available on demand,  The elimination of an up-front commitment by Cloud users  The ability to pay for use of computing resources on a short-term basis as needed and release them as needed, (Berkeley Univ. Above the clouds, Jan 2009)  Clouds at the baseline of the Future Internet  Provided the following gaps are covered: Manageability and Self*; Data management, Security and privacy; federation and interoperability; virtualisation Elasticity and Adaptability; APIs, programming models and remote control (The future of clouds – report, Jan 2010)  Lights and shadows  Economic benefits are evident  To be careful with security, privacy, and SLA (ENISA Security and Cloud WG Nov 2009)

6 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) What vendors say

7 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) What Community does OCCI wg within OGF OASIS wg on Identity over the Cloud SimpleCloudApi, promoted by Zend tech, with IBM,MS et al Cloud Security Alliance Open Cloud Manifesto

8 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) The Cloud Concepts Source: The Future of Cloud report

9 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) From a testbed to a production ecosystem DiligentD4ScienceD4Science II Oct.’04Nov.’07Jan.’08Dec.’09Oct.’09Sept.’11 Testbed Empower the grid middleware to: > manage data and metadata as primary resources > virtualise the VO environment Production Stabilize gCube by supporting two large user communities: > FARM > EM Production Promote interoperability across e-Infrastructures by empowering large user communities Prototype => gCube 0.9 Software Framework => gCube 1.6 (stable and open source) => d4science e- Infrastructure Open Platform => gCube 2.0 (feature reach and interop.) => d4science ecosystem

10 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) From a testbed to a production ecosystem functionality gLite gCube DiligentD4ScienceD4Science II Oct.’04Nov.’07Jan.’08Dec.’09Oct.’09Sept.’11

11 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Sharing Controlled sharing of  data resources  service resources  computing & storage resourses D4Science e-Infrastructure EGEE e-Infrastructure Courtesy – Donatella Castelli

12 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Current D4Science data  metadata  environmental reports  satellite images  earth observation products  raw data  fact sheets  time series graphs  country maps  species distribution maps  etc.

13 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Data related services Generic services for:  search  annotation  process management  visualization  …… D4Science e-Infrastructure  time series management  report generation  shared workspace  …… Courtesy – Donatella Castelli

14 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) D4Science e-Infrastructure On-demand VREs VRE Courtesy – Donatella Castelli

15 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Infrastructure administration services Infrastructure administration services:  resource registration and management  user roles management  infrastructure monitoring  …… D4Science e-Infrastructure Courtesy – Donatella Castelli

16 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) D4Science User Communities & VREs  Environmental Monitoring Community  Global Chlorophyll Monitoring (GVM) VRE  Global Vegetation Monitoring (GCM) VRE  Fisheries and Aquaculture Resource Management Community  Fisheries Country Profiles Production System (FCPPS) VRE  Integrated Capture Information System (ICIS) VRE  AquaMaps VRE Courtesy – Donatella Castelli

17 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) D4Science –II Knowledge infrastructure ecosystem D4Science KnowledgeEcosystem  Interoperability - extend D4Science by introducing mechanisms for facilitating:  the use of data resources managed by different data infrastructures  the programmatic exploitation of the D4Science capabilities  Interoperability - extend D4Science by introducing mechanisms for facilitating:  the use of data resources managed by different data infrastructures  the programmatic exploitation of the D4Science capabilities Courtesy – Donatella Castelli

18 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) gCube: enhanced Grid platform Source - P.Pagano, F.Simeoni

19 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) The structure of the study  Phase 1: Market Analysis and positioning.  Expected results:  short introduction to cloud market, different options and commercial opportunities  gCube positioning with respect the market  Phase 2: gCube installation and performance analysis.  Expected results:  list of possible improvements in the system  list of bottlenecks and constraints on the conceptualisation and architecture  list of performance indicators and actual capabilities of the sw  report on costs for installing and operating a gCube service  Phase 3: Analysis of gCube architecture with respect cloud technology.  Expected results:  report on how gCube may exploit a IaaS  report on how gCube may exploit a PaaS.

20 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) GCube Architecture

21 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) gCube highlights  Consolidation of physical (computational) resources by adopting virtualisation  gCube infrastructure model compatible with IaaS models  Deployment model similar to Hybrid cloud  potential outsourcing to public clouds but still within and towards a federated provision  gCube as PaaS for vertical market  e.g. Salesforce or Google Apps  gCube development model analogous to PaaS models compile- time compliance enables runtime management but limitedly to general-purpose services (vs web apps)  Cloud-level transparencies over grid-style infrastructure

22 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Developing Scientific Data applications  Exploring alternative paths: VENUS-C  A project under negotiation  A testbed for Scientific Clouds  Focus on federation of clouds and DCIs at platform level  Exploit Clouds to respond to pick of computational needs.  Resources at reasonable costs  Integration of on-demand computing resources with the data access and integration capability offered by gCube - increasing the value and usability of such data.  Explore issues related to data movement and limitations on Computing far from data repositories.

23 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Sustaining the SDI technologies  Scientific Data Infrastructures may benefit from clouds  outsourcing the storage and preservation of data  Investing only on data and paying on the basis of access needs and performance requested  Bringing data near the computation  Some open issues more relaxed but are equally urgent  Security, privacy and compliance  Vendor lock-in  Data cost curve may bring a reduce in overall investments

24 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Toward a Scientific Cloud for Data  Parade (the Partnership for Accessing data in Europe)  Caspar (FP6 Project) Usage of Cloud in long term preservation  Vision on technologies for Cloud Storage  promoting open data format for cloudswww.dataliberation.org  The right time for Cloud4Science - Storing, Preserving, Accessing and Discovering data for e-Science

25 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) Conclusions  Cloud – a successful marketing concept that boosted an innovative use of existing technologies  D4Science Ecosystem, is providing PaaS-like capabilities to build Virtual Research Environments and will explore  how to evolve gCube to meet technological challenges imposed by Cloud offers  costs and benefits of porting a Scientific Data Application on top of (federated Cloud Platform provided by VENUS-C)  Why not a “Cloud4Science initiative”?

26 Pro's and Con's of Cloud Computing for Scientific Data - the experience of D4Science 13 April 2010, Uppsala (Sweden) This is not the end…  What will happen when an entry-level server (now bi/quad processors dual cores) will be 64 cores (or more) on a single processor?  What will happen to your (our) solutions?  Will Virtualisation technology still the solution?  Will we need for new programming languages? And translators for existing languages? A long term roadmap for e-Infrastructures and related technologies (middlewares and services) should start to take into account multicore evolution.