1 Efficient Mixed-Platform Clouds Phillip B. Gibbons, Intel Labs Michael Kaminsky, Michael Kozuch, Padmanabhan Pillai (Intel Labs) Gregory Ganger, David.

Slides:



Advertisements
Similar presentations
FAWN: Fast Array of Wimpy Nodes A technical paper presentation in fulfillment of the requirements of CIS 570 – Advanced Computer Systems – Fall 2013 Scott.
Advertisements

Virtual Machine Usage in Cloud Computing for Amazon EE126: Computer Engineering Connor Cunningham Tufts University 12/1/14 “Virtual Machine Usage in Cloud.
Cloud Computing to Satisfy Peak Capacity Needs Case Study.
Charles Reiss *, Alexey Tumanov †, Gregory R. Ganger †, Randy H. Katz *, Michael A. Kozuch ‡ * UC Berkeley† CMU‡ Intel Labs.
Energy-efficient Cluster Computing with FAWN: Workloads and Implications Vijay Vasudevan, David Andersen, Michael Kaminsky*, Lawrence Tan, Jason Franklin,
FAWN: A Fast Array of Wimpy Nodes Presented by: Aditi Bose & Hyma Chilukuri.
FAWN: A Fast Array of Wimpy Nodes Presented by: Clint Sbisa & Irene Haque.
Computer Science Storage Systems and Sensor Storage Research Overview.
Virtualization and the Cloud
© 2009 IBM Corporation ® IBM Software Group Introduction to Cloud Computing Vivek C Agarwal IBM India Software Labs.
AN INTRODUCTION TO CLOUD COMPUTING Web, as a Platform…
Topics Problem Statement Define the problem Significance in context of the course Key Concepts Cloud Computing Spatial Cloud Computing Major Contributions.
Wally Kowal, President and Founder Canadian Cloud Computing Inc.
Be Smart, Use PwrSmart What Is The Cloud?. Where Did The Cloud Come From? We get the term “Cloud” from the early days of the internet where we drew a.
M.A.Doman Model for enabling the delivery of computing as a SERVICE.
MATE-EC2: A Middleware for Processing Data with Amazon Web Services Tekin Bicer David Chiu* and Gagan Agrawal Department of Compute Science and Engineering.
Cloud computing Tahani aljehani.
1: IntroductionData Management & Engineering1 Course Overview: CS 395T Semantic Web, Ontologies and Cloud Databases Daniel P. Miranker Objectives: Get.
EA and IT Infrastructure - 1© Minder Chen, Stages in IT Infrastructure Evolution Mainframe/Mini Computers Personal Computer Client/Sever Computing.
Storage: Scaling Out > Scaling Up? Ankit Singla Chi-Yao Hong.
Effectively and Securely Using the Cloud Computing Paradigm.
Introduction to Cloud Computing
Ch 4. The Evolution of Analytic Scalability
Opensource for Cloud Deployments – Risk – Reward – Reality
Application-driven Energy-efficient Architecture Explorations for Big Data Authors: Xiaoyan Gu Rui Hou Ke Zhang Lixin Zhang Weiping Wang (Institute of.
Carnegie Mellon 1 Intro to some Advanced Topics / : Introduction to Computer Systems 27 th Lecture, Dec. 6, 2011 Instructors: Dave O’Hallaron,
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
RAMCloud: A Low-Latency Datacenter Storage System Ankita Kejriwal Stanford University (Joint work with Diego Ongaro, Ryan Stutsman, Steve Rumble, Mendel.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Introduction to Hadoop and HDFS
Extreme scale parallel and distributed systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at petaflops Pushing toward.
Amazon Web Services BY, RAJESH KANDEPU. Introduction  Amazon Web Services is a collection of remote computing services that together make up a cloud.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Challenges towards Elastic Power Management in Internet Data Center.
Presented by: Mostafa Magdi. Contents Introduction. Cloud Computing Definition. Cloud Computing Characteristics. Cloud Computing Key features. Cost Virtualization.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
Looking Ahead: A New PSU Research Cloud Architecture Chuck Gilbert - Systems Architect and Systems Team Lead Research CI Coordinating Committee Meeting.
Server Virtualization
Amar Phanishayee,LawrenceTan,Vijay Vasudevan
Implications of Emerging Hardware Tom Wenisch (University of Michigan) Nikos Hardavellas (Northwestern University) Sangyeun Cho (University of Pittsburgh)
Paperless Timesheet Management Project Anant Pednekar.
PDAC-10 Middleware Solutions for Data- Intensive (Scientific) Computing on Clouds Gagan Agrawal Ohio State University (Joint Work with Tekin Bicer, David.
CLOUD COMPUTING RICH SANGPROM. What is cloud computing? “Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a.
A Scalable Two-Phase Top-Down Specialization Approach for Data Anonymization Using MapReduce on Cloud.
GOOGLE APP ENGINE By Muktadiur Rahman. Contents  Cloud Computing  What is App Engine  Why App Engine  Development with App Engine  Quote & Pricing.
Web Technologies Lecture 13 Introduction to cloud computing.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-2.
Maintain Business Performance EFFICENT, ELASTIC and SIMPLE Infrastructure Sharon Azulai, Plexistor CEO | Jan 20, 2016.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Spiros Papadimitriou Google Research Project re:Cycle Recycling CPU Cycles Stavros Harizopoulos HP Labs.
Cloud Computing from a Developer’s Perspective Shlomo Swidler CTO & Founder mydrifts.com 25 January 2009.
From infrastructure to applications Where cloud computing is at and where it’s headed.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Introduction to Cloud Technology
DevOps Cloud Native Microservices
Organizations Are Embracing New Opportunities
C Loomis (CNRS/LAL) and V. Floros (GRNET)
Server Innovation Accelerates IT Transformation
Be Fast, Cheap and in Control
Ch 4. The Evolution of Analytic Scalability
Parallel Analytic Systems
AWS Cloud Computing Masaki.
Cloud Computing: Concepts
The Performance of Big Data Workloads in Cloud Datacenters
Virtualization Dr. S. R. Ahmed.
Presentation transcript:

1 Efficient Mixed-Platform Clouds Phillip B. Gibbons, Intel Labs Michael Kaminsky, Michael Kozuch, Padmanabhan Pillai (Intel Labs) Gregory Ganger, David Andersen, Garth Gibson (Carnegie Mellon) NSF Workshop on Sustainable Energy Efficient Data Management May 2, 2011

2 Cloud Computing & Homogeneity In near future, significant fraction of all data analysis and data storage will occur in the cloud Traditional data center goal: Homogeneity + Reduce administration costs: maintenance, diagnosis, repair + Ease of load balancing Ideal: single Server Architecture tailored to the workload CPU Mem Disk CPU Mem Disk CPU Mem Disk CPU Mem Disk CPU Mem Disk CPU …

3 Homogeneity: Challenges No single workload: Mix of customer workloads –Computation-heavy apps (powerful CPUs, little I/O BW) –Random I/O apps (I/O latency bound) –Streaming apps (I/O BW bound, little memory) –Memory-bound apps –Apps exploiting hardware assists such as GPUs Common denominator Server Architecture falls short –E.g., Two orders of magnitude loss in energy efficiency (see example on next slide)

4 FAWN: Fast Array of Wimpy Nodes For key-value stores, FAWN provides 120X more queries per Joule than traditional server FAWN great for some workloads, terrible for others Homogeneity

5 New Goal: Specialization Specialization is fundamental to efficiency –No single platform best for all application types –e.g., huge efficiency gains in FAWN –Called division of labor in sociology (see also, bees) Cloud computing must embrace specialization –and consequent heterogeneity and change-over-time Specialization is fundamental to sustainable energy-efficient data management

6 Efficient Mixed-Platform Clouds Cloud in 2020 will need… Infrastructure purposely composed of many platform types, some general-purpose and some specialized to particularly important application types Infrastructure embraces heterogeneity by design Nimble incorporation of new technologies is enabled by explicitly aiming for heterogeneity –E.g., solid state RAM and accelerators

7 Efficient Mixed-Platform Cloud Research Agenda Develop specializations motivated by important application types Algorithms/frameworks for exploiting specializations Making applications able to work on varied platforms –And automatically mapping them to best platform, accounting for where the data is Explore disruptive impact of new technologies –integration into systems, exploitation by applications Data management in mixed-platform cloud Our progress to date on specializations: See FAWN [SOSP’09], Hi-Spade [Sigmod’10,Sigmod’11], PCM-DB [CIDR’11] projects

8 Coming Soon: Intel Science and Technology Center on Cloud Computing (ISTC-CC) Pending approvals, legal agreements, etc $2.5M / year for 3-5 years Homed at Carnegie Mellon 4 Intel researchers Research Agenda

9 Back Up Slides

10 Defining Cloud Computing… Easy to get mired in defining cloud computing –we really want to avoid doing so (again ) NIST ended up with a 2-page definition –here’s their 15 th version, for reference: Is it … –Amazon Web Services (EC2, S3, etc.) ? –Google Apps + Chrome ? –Private clouds based on VMware/Eucalyptus/etc ? –Hadoop / MapReduce ? –NoSQL DBs (Cassandra, etc.) ? All are examples of broad collection of trends

11 Cloud in 2020? Huge range of uses, exploiting … –shared, managed resources –needs to be massive scale, efficient, automated, trustworthy –availability of interesting data –needs to support BIG DATA, sensor data, mining of both –convenient on-demand access from anywhere –needs to be elastic, easy-to-use, location-independent