Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015.

Slides:



Advertisements
Similar presentations
1 Copyright © 2012 Oracle and/or its affiliates. All rights reserved. Convergence of HPC, Databases, and Analytics Tirthankar Lahiri Senior Director, Oracle.
Advertisements

Issues of HPC software From the experience of TH-1A Lu Yutong NUDT.
Profit from the cloud TM Parallels Dynamic Infrastructure AndOpenStack.
Keith Wiles DPACC vNF Overview and Proposed methods Keith Wiles – v0.5.
W4118 Operating Systems OS Overview Junfeng Yang.
Figure 1.1 Interaction between applications and the operating system.
Computer Science Storage Systems and Sensor Storage Research Overview.
1 What is an operating system? CSC330Patricia Van Hise.
Computing Platform Benchmark By Boonyarit Changaival King Mongkut’s University of Technology Thonburi (KMUTT)
Copyright Arshi Khan1 System Programming Instructor Arshi Khan.
Operating Systems.
RECONFIGURABLE DATA PROCESSING FOR CLOUDS Parth Shah Kanika Chawla Sowmith Boyanpalli 1.
COMPUTER SYSTEMS An Integrated Approach to Architecture and Operating Systems Chapter 14 Epilogue: A Look Back at the Journey ©Copyright 2008 Umakishore.
Deploying Moodle with Red Hat Enterprise Virtualization Brian McSpadden Director of Network Operations Remote-Learner.net.
1 Some Context for This Session…  Performance historically a concern for virtualized applications  By 2009, VMware (through vSphere) and hardware vendors.
Tanenbaum 8.3 See references
Virtualized FPGA accelerators in Cloud Computing Systems
Copyright © 2010 Platform Computing Corporation. All Rights Reserved.1 The CERN Cloud Computing Project William Lu, Ph.D. Platform Computing.
Shilpa Seth.  Centralized System Centralized System  Client Server System Client Server System  Parallel System Parallel System.
Data Center Infrastructure
Cloud Computing Saneel Bidaye uni-slb2181. What is Cloud Computing? Cloud Computing refers to both the applications delivered as services over the Internet.
Dual Stack Virtualization: Consolidating HPC and commodity workloads in the cloud Brian Kocoloski, Jiannan Ouyang, Jack Lange University of Pittsburgh.
Microkernels, virtualization, exokernels Tutorial 1 – CSC469.
UNIX System Administration OS Kernal Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept Kernel or MicroKernel Concept: An OS architecture-design.
Virtual Machine Course Rofideh Hadighi University of Science and Technology of Mazandaran, 31 Dec 2009.
Niko Neufeld, CERN/PH-Department
Roger Jones, Lancaster University1 Experiment Requirements from Evolving Architectures RWL Jones, Lancaster University Ambleside 26 August 2010.
Improving Network I/O Virtualization for Cloud Computing.
Virtualization: Not Just For Servers Hollis Blanchard PowerPC kernel hacker.
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
The ALICE DAQ: Current Status and Future Challenges P. VANDE VYVRE CERN-EP/AID.
ALICE Upgrade for Run3: Computing HL-LHC Trigger, Online and Offline Computing Working Group Topical Workshop Sep 5 th 2014.
© 2010 Cisco and/or its affiliates. All rights reserved. 1 Managing Microsoft Applications with Cisco UCS Manager & PowerTool.
Headline in Arial Bold 30pt HPC User Forum, April 2008 John Hesterberg HPC OS Directions and Requirements.
 Virtual machine systems: simulators for multiple copies of a machine on itself.  Virtual machine (VM): the simulated machine.  Virtual machine monitor.
Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.
Intel Research & Development ETA: Experience with an IA processor as a Packet Processing Engine HP Labs Computer Systems Colloquium August 2003 Greg Regnier.
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
High Speed Detectors at Diamond Nick Rees. A few words about HDF5 PSI and Dectris held a workshop in May 2012 which identified issues with HDF5: –HDF5.
Partitioned Multistack Evironments for Exascale Systems Jack Lange Assistant Professor University of Pittsburgh.
Full and Para Virtualization
Operating-System Structures
Predrag Buncic Future IT challenges for ALICE Technical Workshop November 6, 2015.
Predrag Buncic ALICE Status Report LHCC Referee Meeting CERN
Niko Neufeld, CERN. Trigger-free read-out – every bunch-crossing! 40 MHz of events to be acquired, built and processed in software 40 Tbit/s aggregated.
Security Architecture and Design Chapter 4 Part 2 Pages 319 to 357.
Pierre VANDE VYVRE ALICE Online upgrade October 03, 2012 Offline Meeting, CERN.
CS4315A. Berrached:CMS:UHD1 Introduction to Operating Systems Chapter 1.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Predrag Buncic (CERN/PH-SFT) Software Packaging: Can Virtualization help?
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Background Computer System Architectures Computer System Software.
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Introduction to Operating Systems Concepts
Virtualization.
Introduction to Operating Systems
LHC experiments Requirements and Concepts ALICE
High-performance tracing of many-core systems with LTTng
Grid Computing.
Dagmar Adamova (NPI AS CR Prague/Rez) and Maarten Litmaath (CERN)
ALICE Computing Model in Run3
ALICE Computing Upgrade Predrag Buncic
1. 2 VIRTUAL MACHINES By: Satya Prasanna Mallick Reg.No
Xen Summit Spring 2007 Platform Virtualization with XenEnterprise
OS Virtualization.
Chapter 17: Database System Architectures
LAB 01 Installation of VIRTUAL MACHINE and LINUX
Lecture Topics: 11/1 General Operating System Concepts Processes
Presentation transcript:

Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015

Cisco Confidential 2 © Cisco and/or its affiliates. All rights reserved. The end of CPU scaling Future computing challenges Power efficiency Performance == parallelism

Cisco Confidential 3 © Cisco and/or its affiliates. All rights reserved. Paradox of the computing industry System software has not evolved at the same pace as HW Multiple applications Massively multicore CPU Multiple applications Single core CPU Time-sharing OS Spatial computing platform FPGA GPU CPU

Cisco Confidential 4 © Cisco and/or its affiliates. All rights reserved.  Flexible, multi-OS, application isolation / security  Optimizations, IO handling, expensive license fees Server Virtualization is at a generational shift Hardware Hypervisor Virtual machine App Virtual machine App Operating System  No hypervisor overhead, performance, fine grained resource control  Linux ABI, security and app isolation APIs and interfaces for containers management Hypervisors are still good, but have pitfalls Application containers are the future of virtualization Hardware Operating System Container App Container App Operating System

Cisco Confidential 5 © Cisco and/or its affiliates. All rights reserved. OS constraints are hard to overcome with current design Containers are built on Operating system Memory scalability  CPU cache too small  Cache misses due to scattered data in the memory L1 cache 4 cycles L2 cache 12 cycles L3 cache 30 cycles DRAM 300 cycles CPU CPU Cores scalability  Many task on one core  one task on many cores  Monolithic architecture based on time sharing Number of cores Packet scalability  Kernel stack too complicated  User mode networking stack like PF_RING/DPDK write your own driver Number of cores

Cisco Confidential 6 © Cisco and/or its affiliates. All rights reserved. Solving the problem at the lowest level of abstraction Advantages of OS control/data plane separation o Greater scale-up bundled with smarter NIC processing o Reduce CPU kernel overhead o Higher network throughput by using multiple cores Hardware App Hardware Ctrl Plane Data plane Next-Gen Computing: Redesign of Operating System for energy efficient and linear scalable computing platform New architectural concept, greatly enhancing application performance in modern datacenters and fundamentally addressing the following challenges: o Application’s parallelization driven by rapidly growing number of CPU cores per socket o Efficient use of heterogeneous resources o Data center wide system consistency

Cisco Confidential 7 © Cisco and/or its affiliates. All rights reserved.

Cisco Confidential 8 © Cisco and/or its affiliates. All rights reserved. CERN Challenge: The Alice computing requirements From Detector Readout to Analysis: What is the “optimal” computing architecture ? Detector upgrade for Run 3 (2020) 100 increase in event rate 1TB/s raw data rate

Cisco Confidential 9 © Cisco and/or its affiliates. All rights reserved. O2 facility: Highly specialized heterogeneous computing platform  463 FPGAs Detector readout and fast cluster finder  100’000 CPU cores To compress 1.1 TB/s data stream by overall factor 14  5000 GPUs To speed up the reconstruction  50 PB of disk  Considerable computing capacity that will be used for Online and Offline tasks

Cisco Confidential 10 © Cisco and/or its affiliates. All rights reserved. DPCS: Openlab project investigating applicability of modern OS concepts in the ALICE O2 environment  Data Plane OS concept  I/O Virtualization  Multicore scaling  Heterogeneous compute Data Plane Computing System

Thank you.

Cisco Confidential 12 © Cisco and/or its affiliates. All rights reserved. Data flow in O2 facility Asynchronous event reconstruction A few hours after data taking Data of all interaction - continuous mode Computing farm Online data volume reduction by factor 14 (20 for the TPC data) Data storage for 1 year of compressed data (60 PB) Compressed data Reconstructed events Compressed data Detectors electronics 1.1 TB/s 90 GB/s ~ 8000 optical links Read-out farm: 250 servers with FPGA acceleration Processing farm: 1500 servers with GPU acceleration Storage system 68 storage units with 34 data servers