SARA Reken- en Netwerkdiensten Experiences running a HPC Cloud Ron Trompert on behalf of the SARA Cloud team.

Slides:



Advertisements
Similar presentations
Cloud computing is used to describe a variety of computing concepts that involve a large number of computers connected through a real-time communication.
Advertisements

Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
University of Notre Dame
Profit from the cloud TM Parallels Dynamic Infrastructure AndOpenStack.
Cloud Computing Imranul Hoque. Today’s Cloud Computing.
StratusLab is co-funded by the European Community’s Seventh Framework Programme (Capacities) Grant Agreement INSFO-RI Ioannis Konstantinou Greek.
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
1 Week #1 Objectives Review clients, servers, and Windows network models Differentiate among the editions of Server 2008 Discuss the new Windows Server.
1 Week #1 Objectives Review clients, servers, and Windows network models Differentiate among the editions of Server 2008 Discuss the new Windows Server.
BETA!BETA! Building a secure private cloud on Microsoft technologies Private cloud security concerns Security & compliance in a Microsoft private cloud.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
DESIGN CONSIDERATIONS OF A GEOGRAPHICALLY DISTRIBUTED IAAS CLOUD ARCHITECTURE CS 595 LECTURE 10 3/20/2015.
Introduction to XTMv WatchGuard Training.
Virtualization and the Cloud
Introduction to DoC Private Cloud
Virtual Machines. Virtualization Virtualization deals with “extending or replacing an existing interface so as to mimic the behavior of another system”
European Organization for Nuclear Research Virtualization Review and Discussion Omer Khalid 17 th June 2010.
Virtualization for Cloud Computing
Container-based OS Virtualization A Scalable, High-performance Alternative to Hypervisors Stephen Soltesz, Herbert Pötzl, Marc Fiuczynski, Andy Bavier.
Copyright Tim Antonowicz, This work is the intellectual property of the author. Permission is granted for this material to be shared for non- commercial,
SUNY IT Master's Project Using Open Source Virtualization Technology In Computer Education By: Ronny L. Bull Advised By: Geethapriya Thamilarasu, Ph.D.
Cisco and OpenStack Lew Tucker VP/CTO Cloud Computing Cisco Systems,
System Center 2012 Setup The components of system center App Controller Data Protection Manager Operations Manager Orchestrator Service.
Tanenbaum 8.3 See references
Copyright © 2010 Platform Computing Corporation. All Rights Reserved.1 The CERN Cloud Computing Project William Lu, Ph.D. Platform Computing.
Yury Kissin Infrastructure Consultant Storage improvements Dynamic Memory Hyper-V Replica VM Mobility New and Improved Networking Capabilities.
CS 149: Operating Systems April 21 Class Meeting
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
Microkernels, virtualization, exokernels Tutorial 1 – CSC469.
Enabling Technologies for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D FIS Distinguished Professor of Computer Science School of.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 2.
Ceph Storage in OpenStack Part 2 openstack-ch,
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
| nectar.org.au NECTAR TRAINING Module 5 The Research Cloud Lifecycle.
1 Evolution of OSG to support virtualization and multi-core applications (Perspective of a Condor Guy) Dan Bradley University of Wisconsin Workshop on.
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
DCIM211. By 2015, 35% of enterprise IT expenditures for most organizations will be managed outside the IT department’s budget IT budget.
Sandor Acs 05/07/
RAL PPD Computing A tier 2, a tier 3 and a load of other stuff Rob Harper, June 2011.
Virtualization for the LHCb Online system CHEP Taipei Dedicato a Zio Renato Enrico Bonaccorsi, (CERN)
Windows Azure Virtual Machines Anton Boyko. A Continuous Offering From Private to Public Cloud.
Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing,
Virtual cloud R 陳昌毅 R 顏昭恩 R 黃伯淳 2010/06/03.
Virtualization One computer can do the job of multiple computers, by sharing the resources of a single computer across multiple environments. Turning hardware.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI VM Management Chair: Alexander Papaspyrou 2/25/
Grid testing using virtual machines Stephen Childs*, Brian Coghlan, David O'Callaghan, Geoff Quigley, John Walsh Department of Computer Science Trinity.
OpenNebula: Experience at SZTAKI Peter Kacsuk, Sandor Acs, Mark Gergely, Jozsef Kovacs MTA SZTAKI EGI CF Helsinki.
Moonshot-enabled Federated Access to Cloud Infrastructure Terena Networking Conference, Reykjavik. May 2012 David Orrell, Eduserv.
Øg fleksibiliteten i din infrastruktur 32 virtual processors per VM 1 TB virtual machine memory New 64TB VHDX format Native 4k disk support Hyper-V.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
Chapter 11 – Cloud Application Development. Contents Motivation. Connecting clients to instances through firewalls. Cloud Computing: Theory and Practice.
Authentication, Authorization, and Contextualization in FermiCloud S. Timm, D. Yocum, F. Lowe, K. Chadwick, G. Garzoglio, D. Strain, D. Dykstra, T. Hesselroth.
Instituto de Biocomputación y Física de Sistemas Complejos Cloud resources and BIFI activities in JRA2 Reunión JRU Española.
Open Source Virtualization Andrey Meganov RHCA, RHCX Consultant / VDEL
NIIF Cloud Infrastructure and Services EGI Technical Forum September 20, 2011 Lyon, France Ivan Marton.
Virtualization for Cloud Computing
Dag Toppe Larsen UiB/CERN CERN,
StratusLab Roadmap C. Loomis (CNRS/LAL) EGI TCB (Amsterdam)
Dag Toppe Larsen UiB/CERN CERN,
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Enrico Bonaccorsi, (CERN) Loic Brarda, (CERN) Gary Moine, (CERN)
OS Virtualization.
Managing Clouds with VMM
Virtualization Meetup Discussion
Public vs Private Cloud Usage Costs:
Using and Building Infrastructure Clouds for Science
OpenStack for the Enterprise
06 | SQL Server and the Cloud
Presentation transcript:

SARA Reken- en Netwerkdiensten Experiences running a HPC Cloud Ron Trompert on behalf of the SARA Cloud team

SARA Reken- en Netwerkdiensten HPC Cloud - What Service Models at SARA:  Cloud Infrastructure as a Service (IaaS)  Cloud Software as a Service (SaaS)  Currently experimenting with portals (R portal)  Cloud Platform as a service  Not (yet) SARA: Allow users to  freely instantiate a personal environment  leap from laptop (small scale) to HPC (large scale)

SARA Reken- en Netwerkdiensten HPC Cloud - Why World  better utilization for infrastructure  "Green IT" (power off under-utilization)  easy management SARA  free OS & software environment  locked software can be used  rapid availability  HPC cloud for academic world Massive interest and multiple early adopters prove the need for an academic HPC Cloud environment.  Early cloud P.O.C. running “production”

SARA Reken- en Netwerkdiensten HPC Cloud – How Evolving thanks to continued support from BiG-Grid: Claudia  ~1 year POC/Beta running 2010/2011  limited infrastructure:  CPU (16*8) 128, MEM (16*24GB) 384GB  1Gbps IO, local disk storage  single VM limit Calligo  HPC infrastructure:  CPU (19*32) 608 (Intel Intel Xeon-E7 "Westmere-EX")  MEM (19*256GB) 4.864GB = 4,75TB  10GE, 1-hop, non-blocking interconnect  400TB shared storage (ISCSI,NFS,CIFS,CDMI...)  virtual clusters

SARA Reken- en Netwerkdiensten HPC Cloud – How Technology Challenges (1/2): Hypervisor  VM-Ware: data-center centric  Citrix (Xen): para-virtualisation  KVM: full virtualization, standard Linux kernel based, no commercial backing Networking  Virtual switches: slow  Para-virtual drivers: limited to 1Gbs (kvm/virtio)  SR-IOV: bright new future

SARA Reken- en Netwerkdiensten HPC Cloud – How Technology Challenges (2/2): Storage  Concurrent use of different protocols:  Local Disk -> fast but no file locking  ISCSI -> fast but no file locking  SSH -> slower, secure, local account required  NFS(v4) -> slower, shared, host-based, load?  CIFS -> slow, shared, user based  WebDav/CDMI -> slow, user based, clients? Compute  all mainstream CPUs support Virtualization  support for network IOMMU/VTd and SR-IOV is scarce

SARA Reken- en Netwerkdiensten HPC Cloud – Trust Security is of major importance  cloud user confidence  infrastructure provider confidence Protect  the outside from the cloud users  the cloud users from the outside  the cloud users from each other Not possible to protect the cloud user from himself  user has full access/control/responsibility  ex. virus research must be possible

SARA Reken- en Netwerkdiensten HPC Cloud – Trust Firewall  fine-grained access rules (“closed port” policy)  By default everything is closed, non-standard ports can be opened by the user Scanning of new virtual templates  catches initial problems, but once the VM is live... Port scanning  catches well-known problems State-full Package Inspection  random sample based

SARA Reken- en NetwerkdienstenPhilips site visit | 2 February 2012 HPC Cloud – Calligo Virtual Machine Disk Image Private network Network filter ISO Image CPU Memory Public network Template = User Defined = Predefined

SARA Reken- en Netwerkdiensten HPC Cloud – Calligo Admin PortalCLI [EC2] [OCCI] User Portal [XML-RPC] ONED mm_sched Libvirt KVM-Qemu OpenNebula Linux virsh

SARA Reken- en NetwerkdienstenPhilips site visit | 2 February 2012 HPC Cloud – Calligo Template Virtual Machine network(s) Disk Image CPU/Memory Copy of Disk Image Virtual Machine Copy of Disk Image Virtual Machine Copy of Disk Image Virtual Machine Copy of Disk Image Non Persistent Disk Image

SARA Reken- en NetwerkdienstenPhilips site visit | 2 February 2012 HPC Cloud – Calligo Template network(s) Disk Image CPU/Memory Virtual Machine Persistent Disk Image

SARA Reken- en Netwerkdiensten HPC Cloud – Calligo

SARA Reken- en Netwerkdiensten HPC Cloud – Calligo

SARA Reken- en Netwerkdiensten HPC Cloud – Calligo Node eth1 DRAC CONS-NW (static) eth3eth2 bond0 DATA-NW br0 (DHCP) br1br2 eth0 MGT-NW br100 UserA VM01 ( VLAN_ID=100 ) eth0eth1 br3 bond0.100 br56 bond0.56 br101 bond0.101 UserA VM02 ( VLAN_ID=100 ) UserB VM01 ( VLAN_ID=101 ) eth0 eth1 br102

SARA Reken- en Netwerkdiensten HPC Cloud – Accounting Take logging of VMs starting and stopping Take periodic snapshots to see which VMs are running

SARA Reken- en Netwerkdiensten HPC Cloud – Users Can ask fore core hours Each core hours is accompanied by 8 GB of memory This means When you need two cores for your VM you get 16 GB of memory When you need 32GB of memory you get 4 cores Maps nicely on the deployed hardware We did this just to start out with. Maybe we get better ideas on this later on. No overcommitting

SARA Reken- en Netwerkdiensten HPC Cloud – Users Started in January this year with the production infrastructure Now 37 users Applications: Galaxy, DNA sequencing, CLARIN, EUDAT, eSiBayes, Historic Map Collections, UrbanFlood, Transcriptomics,..… Now out of every 3 grant requests to the Dutch NGI (BiGGrid) 2 of them are Cloud requests Our users have been happy campers so far

SARA Reken- en Netwerkdiensten HPC Cloud – Experiences Had to do some some tweaking ourselves. This is the motivation for using an open source product like OpenNebula. Examples are accounting, firewalling With SR-IOV we got 8-9Gbps between VMs support for network IOMMU/VTd and SR-IOV is scarce

SARA Reken- en Netwerkdiensten HPC Cloud – Future OVF, OCCI Uploading own images Access to HSM environment, dCache grid storage Concurrent storage protocols  ISCSI  SSH  CIFS  WebDav/CDMI

SARA Reken- en Netwerkdiensten HPC Cloud - Summary SARA HPC CLOUD FREEDOM OF CHOICE Would you like to know more? Questions?