The OpenCirrus™ Project: A global Testbed for Cloud Computing R&D (www.kit.edu)

Presentation transcript:

KIT – The cooperation of Forschungszentrum Karlsruhe GmbH and Universität Karlsruhe (TH)
The OpenCirrus™ Project: A global Testbed for Cloud Computing R&D
Marcel Kunze
Steinbuch Centre for Computing (SCC)
Karlsruhe Institute of Technology (KIT), Germany

KIT – The cooperation of Forschungszentrum Karlsruhe GmbH and Universität Karlsruhe (TH) | Marcel Kunze | OpenCirrus, NeSC Edinburgh | March

Karlsruhe Institute of Technology (KIT)
- Cooperation between the Karlsruhe research centre and Karlsruhe university
- Largest scientific center in Germany
- scientists, students
- Annual budget: > 500 million euro
- R&D focus: energy research and nano-technology

Agenda
- What is cloud computing?
- OpenCirrus™ project
- Programming the cloud
- HPC and big data
- Summary

Cloud Computing: A possible Definition
A computing cloud is a set of network-enabled, on-demand IT services, scalable and QoS-guaranteed, that can be accessed in a simple and pervasive way.

Cloud lives in Web 2.0
- Everything as a Service (XaaS)
  - AaaS: Application as a Service
  - PaaS: Platform as a Service
  - SaaS: Software as a Service
  - DaaS: Data as a Service
  - IaaS: Infrastructure as a Service
  - HaaS: Hardware as a Service
- Industry is pretty much engaged
- Various commercial offerings exist

Commercial Cloud Offerings (Small Excerpt)
Problem: Commercial offerings are proprietary and usually not open for cloud systems research and development

Cloud Systems Research
- Simple, transparent, controllable cloud computing infrastructure
  - What types of interfaces are appropriate for clouds?
  - How should cloud networks be constructed/managed?
  - How are security concerns addressed in the cloud?
  - How are various workloads most efficiently transferred?
  - What types of applications can run in clouds?
  - What types of service level agreements are appropriate/possible?
- Research requirements
  - Perform experiments also on a low system level
  - Flexible cloud computing framework
  - Compare different methodologies and implementations

Cloud Computing: A new Hype following Grid
- Cloud computing R&D: OpenCirrus™ project

Clouds vs. Grids: A Comparison

Aspect | Cloud Computing | Grid Computing
Objective | Provide desired computing platform via network-enabled services | Resource sharing, job execution
Infrastructure | One or few data centers, heterogeneous/homogeneous resources under central control; industry and business | Geographically distributed, heterogeneous resources, no central control, VO; research and academic organizations
Middleware | Proprietary, several reference implementations exist (e.g. Amazon) | Well developed, maintained and documented
Application | Suited for generic applications | Special application domains like High Energy Physics
User interface | Easy to use/deploy, no complex user interface required | Difficult use and deployment; need new user interface, e.g., commands, APIs, SDKs, services …
Business model | Commercial: pay-as-you-go | Publicly funded: use for free
Operational model | Industrialization of IT, fully automated services | Mostly manufacture, handcrafted services
QoS | Possible | Little support
On-demand provisioning | Yes | No


OpenCirrus Cloud Computing Research Testbed
- An open, internet-scale global testbed for cloud computing research
  - Data center management & cloud services
  - Systems level research
  - Application level research
- Structure: a loose federation
  - Sponsors: HP Labs, Intel Research, Yahoo!
  - Partners: UIUC, Singapore IDA, KIT, NSF
  - Members: System and application development
- Great opportunity for cloud R&D

Where are the OpenCirrus sites?
- Six sites initially
  - Sites distributed world-wide: HP Research, Yahoo!, UIUC, Intel Research Pittsburgh, KIT, Singapore IDA
  - processor cores per site
- New CMU site coming in 2009
Map labels: HP, Yahoo (sf), UIUC, Intel (pgh), KIT (de), IDA (sg), CMU (coming in 09)

Cloud Architecture
(Figure. Source: S. Tai)

OpenCirrus™ Blueprint
Figure labels:
- IT infrastructure layer (Physical Resource Sets)
- Virtual Resource Sets (Eucalyptus)
- Cloud infrastructure services
- Cloud application services

Physical Resource Sets (PRS)
- PRS service goals
  - Provide mini-datacenters to researchers
  - Isolate experiments from each other
  - Stable base for other research
- PRS service approach
  - Allocate sets of physical co-located nodes, isolated inside VLANs
  - Leverage existing software (e.g. Utah Emulab, HP OpsWare)
  - Start simple, add features as we go
  - Base to implement virtual resource sets
- Hardware as a Service (HaaS)
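
The PRS approach (co-located physical nodes, one isolated VLAN per experiment) can be illustrated with a minimal sketch. This is not the Emulab or OpsWare interface; the names `Rack`, `ResourceSet`, `PrsAllocator`, and the starting VLAN id are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Rack:
    """A group of co-located physical nodes (hypothetical model)."""
    name: str
    free_nodes: list


@dataclass
class ResourceSet:
    """A set of nodes handed to one experiment, isolated in its own VLAN."""
    owner: str
    vlan_id: int
    nodes: list


class PrsAllocator:
    """Toy allocator: every request gets nodes from one rack and a fresh VLAN."""

    def __init__(self, racks, first_vlan=100):  # first_vlan is an assumption
        self.racks = racks
        self.next_vlan = first_vlan
        self.allocations = []

    def allocate(self, owner, count):
        # Co-location: only accept a rack that can serve the whole request.
        for rack in self.racks:
            if len(rack.free_nodes) >= count:
                nodes = [rack.free_nodes.pop(0) for _ in range(count)]
                resource_set = ResourceSet(owner, self.next_vlan, nodes)
                self.next_vlan += 1  # isolation: VLAN ids are never shared
                self.allocations.append(resource_set)
                return resource_set
        raise RuntimeError("no single rack has enough free nodes")


racks = [Rack("r1", ["r1n1", "r1n2"]), Rack("r2", ["r2n1", "r2n2", "r2n3"])]
allocator = PrsAllocator(racks)
exp_a = allocator.allocate("experiment-a", 2)
exp_b = allocator.allocate("experiment-b", 3)
print(exp_a)
print(exp_b)
```

Each experiment ends up with disjoint nodes and a distinct VLAN id, which is the isolation property the slide asks for; a real service would also have to configure switches and reclaim nodes on release.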

Virtual Resource Sets (VRS)
- Basic idea: Abstract from physical resources by introducing a virtualization layer
- Concept applies to all IT aspects: CPU, storage, networks and applications, …
- Main advantages
  - Implement IT services exactly fitting customers' varying needs
  - Deploy IT services on demand
  - Automated resource management
  - Easily guarantee service levels
  - Live migration of services
  - Reduce both CapEx and OpEx
- Infrastructure as a Service (IaaS)
  - Implement compute and storage services
  - De-facto standard: Amazon Web Services interface

Amazon Web Services

Eucalyptus: A potential VRS layer
Figure labels:
- Client-side API Translator
- Cloud Controller
- Cluster Controller
- Node Controller
- Amazon EC2 and S3 Interface
- Database
(Source: R. Wolski)

Programming the Cloud: Hadoop
- An open-source Apache Software Foundation project sponsored by Yahoo!
- Intent is to reproduce the proprietary software infrastructure developed by Google
- Provides a parallel programming model (MapReduce), a distributed file system, and a parallel database

The MapReduce Programming Model
- Map computation across many objects
  - Extract a set of key-value pairs from, e.g., Web pages
- Reduce results in many different ways
  - Combine each value with other values that share the same key
- System deals with issues of resource allocation & reliability
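
The map and reduce steps can be sketched in a few lines of plain Python. This is a single-process illustration of the programming model, not Hadoop's API; the function names and the two sample pages are made up for the example, and word counting stands in for "extract key-value pairs from Web pages".

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Map: emit one (word, 1) pair for every word in a document."""
    for word in text.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group emitted values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values that share the same key."""
    return key, sum(values)


def run_job(documents):
    """Run map over every document, then reduce each key group."""
    pairs = []
    for doc_id, text in documents.items():
        pairs.extend(map_phase(doc_id, text))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


pages = {"page1": "cloud grid cloud", "page2": "grid cloud"}
counts = run_job(pages)
print(counts)
```

In a real deployment the map calls run in parallel on many nodes and the framework handles the grouping, scheduling, and retries; only `map_phase` and `reduce_phase` would be written by the user.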

How is OpenCirrus different from other testbeds?
- OpenCirrus™ supports both system- and app-level research
  - n/a at Google/IBM and EC2/S3
- OpenCirrus™ researchers will have complete access to the underlying hardware and software platform.
- OpenCirrus™ allows Intel platform features that support cloud computing (e.g. DCMI, NM) to be exposed and exploited.
Figure: software stacks of a Google/IBM cluster (virtual machines, Hadoop, Map-Reduce apps) and an Open Cirrus cluster (virtual or physical machines, cluster mgmt software, Hadoop, cloud apps and services, Map-Reduce apps), with layers marked as "cannot be modified by users" or "can be modified by users".

How do users get access to OpenCirrus sites?
- Project PIs apply to each site separately.
- Contact names, addresses, and web links for applications to each site will be available on the OpenCirrus™ Web site (which goes live Q1)
  - Each OpenCirrus™ site decides which users and projects get access to its site.
- Planning to have a global sign-on for all sites
  - Users will be able to log in to each OpenCirrus™ site for which they are authorized using the same login and password.

Who can use the OpenCirrus Resources?
- Three different types of users can use OpenCirrus™ sites:
  (a) Individual PIs from academic research groups
  (b) Industry researchers from the OpenCirrus™ partners
  (c) Industry researchers who have a customer relationship with the OpenCirrus™ partners
- What is the expected mix of these groups?
  - The majority of users will be (a) academic researchers and (b) researchers who work for the OpenCirrus™ partners.
  - There will be a few carefully chosen users who are (c) industry researchers with a customer relationship with an OpenCirrus™ partner

What kinds of research projects are OpenCirrus sites looking for?
- Open Cirrus™ is seeking research in the following areas (different centers will weight these differently):
  - Datacenter federation
  - Datacenter management
  - Web services
  - Data-intensive applications and systems
  - Hadoop map-reduce applications
- The following kinds of projects are not of primary interest:
  - Traditional HPC application development.
  - Production applications that just need lots of cycles.
  - Closed source system development.

Potential Fields of Cloud System Development (1)
- Virtual organizations and social networks
  - Science is team work, clouds are rather for individuals right now
- Integration of cloud services
  - Standardization of APIs and protocols
  - Hyperclouds may integrate services of various providers (Stratosphere?)
- Management of service quality
  - Negotiation and monitoring of SLAs
  - How does this work for Web service mashups?
- Privacy, data protection and security
  - Importance of AAA and encryption
  - e.g. use of Trusted Platform Module (TPM)

Cloud Security: A possible Solution
(Figure. Source: IBM)

Potential Fields of Cloud System Development (2)
- New infrastructure services
  - HPCaaS: High Performance Computing as a Service
  - LSDFaaS: Large Scale Data Facility as a Service
  - GenomeDBaaS: Genome Database as a Service
- How does this relate to Grid computing?

HPC vs. HTC vs. MTC (Many Task Computing)
(Figure with regions labelled HPC, HTC, MTC. Source: I. Foster)

The Grid and Cloud Space
Figure labels: UNICORE, Traditional, Cloud / Web 2.0, gLite

Extension of the Cloud Space to all Areas
- LSDFaaS: Large Scale Data Facility as a Service
- HPCaaS: High Performance Computing as a Service

HPCaaS: High Performance Computing as a Service
- Interesting fields for R&D in Open Cirrus™
  - Flexible platform services for HPC customers
  - Development of MPI services for clouds
  - Development of scheduling services for clouds
  - Management of software licenses
  - Integration of Grid resources: Grid as a Service (GaaS)

LSDFaaS: Large Scale Data Facility as a Service
- Actual projects at KIT in this field:
  - Data storage for LHC computing
  - Data storage for ITER (EUFORIA)
  - Project ANKA (synchrotron radiation source)
  - Activities in materials research
  - Long-term data filing due to legal requirements
  - Development of big data services

Big Data
- Interesting applications are data hungry
- The data grows over time
- The data is immobile
  - 100 1Gbps ~= 10 days
- Compute comes to the data
- Big Data clusters are the new libraries
- The value of a cluster is its data
(J. Campbell, et al., Intel Research Pittsburgh, 2007)
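
The "data is immobile" figure can be checked with a short calculation. The unit of the data volume is missing from the transcript; the sketch below assumes it refers to roughly 100 TB (decimal, 10^12 bytes each) moved over a fully used 1 Gbps link, which is an assumption and not stated in the source. The function name is illustrative.

```python
SECONDS_PER_DAY = 86_400


def transfer_days(size_bytes, link_bits_per_second):
    """Days needed to move size_bytes over a link at full, sustained rate."""
    seconds = size_bytes * 8 / link_bits_per_second
    return seconds / SECONDS_PER_DAY


# Assumed inputs: 100 TB of data, 1 Gbps link, no protocol overhead.
days = transfer_days(100 * 10**12, 10**9)
print(f"{days:.1f} days")
```

Under these assumptions the transfer takes a little over nine days, in line with the slide's "~= 10 days" once protocol overhead is allowed for, and it illustrates why the slide argues for moving compute to the data.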

Tashi High-Level Design
Figure labels: Cluster Manager, Scheduler, Storage Service, Virtualization Service, Node, Node
- Cluster nodes are assumed to be commodity machines
- Services are instantiated through virtual machines
- Data location information is exposed to scheduler and services
- CM maintains databases and routes messages; decision logic is limited
- Most decisions happen in the scheduler; manages compute/storage in concert
- The storage service aggregates the capacity of the commodity nodes to house Big Data repositories.
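
To illustrate why exposing data location to the scheduler matters, here is a minimal sketch of a locality-aware placement decision. It is not Tashi's code or API; `place_vm`, the block and node names, and the tie-breaking rule (alphabetical node order) are all hypothetical.

```python
def place_vm(block_locations, free_slots, needed_blocks):
    """Pick the node with a free VM slot that stores the most needed blocks.

    block_locations: dict mapping block id -> set of nodes holding a replica
    free_slots:      dict mapping node name -> number of free VM slots
    needed_blocks:   block ids the new VM will read
    Returns (node, local_block_count), or (None, -1) if no slot is free.
    """
    best_node, best_local = None, -1
    for node, slots in sorted(free_slots.items()):
        if slots <= 0:
            continue  # node is full, cannot host another VM
        local = sum(1 for block in needed_blocks
                    if node in block_locations.get(block, ()))
        if local > best_local:
            best_node, best_local = node, local
    return best_node, best_local


block_locations = {"b1": {"n1", "n2"}, "b2": {"n2"}, "b3": {"n3"}}
needed = ["b1", "b2", "b3"]

# n2 holds two of the three blocks, so it is preferred while it has a slot.
print(place_vm(block_locations, {"n1": 1, "n2": 1, "n3": 2}, needed))
# Once n2 is full, the scheduler falls back to the next-best node.
print(place_vm(block_locations, {"n1": 1, "n2": 0, "n3": 2}, needed))
```

A production scheduler would weigh further factors the slides mention, such as power and fault tolerance; the sketch only shows the compute-follows-data idea.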

Tashi Software Architecture
Figure labels:
- Cluster Manager (CM): VM instance DB, Node Manager DB, Resource DB
- Node Manager (NM) on each compute node: Resource Controller Plugins (VMM, DFS, power, etc.), Sensor Plugins, nmd, iptables/vlan
- System software: VMM, VM, DFS
- Client, Client API, CM-NM API
- Scheduling Agent, Data Location Service, DFS Metadata Server
- Resource Telemetry Service, Pub/Sub System
- Site Specific Plugin(s), centralized cluster administration
- Legend: Tashi component, non-Tashi component

Tashi is both…
- An open source software project
  - The implementation is intended to become worthy of production use.
  - Alpha deployment running on OpenCirrus™ cluster at Intel Research Pittsburgh since October
- An open research project
  - Key question: How should compute, storage, and power be managed in a Big Data cluster to optimize for performance, energy, and fault-tolerance?
- Initial sponsors include:
  - Intel Research Pittsburgh
  - Carnegie Mellon University
  - Yahoo!

The Way to Cloud Nirvana
- The roadmap for cloud services
  - Leads to dynamic data centers
  - Ranges from infrastructure services to dynamic applications
  - Complements traditional IT services in the medium term
(Source: rpath)

Summary
- Cloud computing is the next big thing
  - Flexible and elastic resource provisioning
  - Economy of scale makes it attractive
  - Move from manufacture towards industrialization of IT (Everything as a Service)
- OpenCirrus™ offers interesting R&D opportunities
  - Cloud systems development
  - Cloud application development
  - Accepting research proposals soon
  - OpenCirrus™ workshop at HP Palo Alto on June 8/9

Karlsruhe Institute of Technology
Steinbuch Centre for Computing (SCC)
Thank you for your attention.