1 CS 294-42: Cloud Computing, Systems, Networking, and Frameworks Fall 2011 (MW 1:00-2:30, 293 Cory Hall) Ion Stoica (http://www.cs.berkeley.edu/~istoica/classes/cs294/11/)

Slides:



Advertisements
Similar presentations
Ali Ghodsi UC Berkeley & KTH & SICS
Advertisements

TOPIC LEARNING BTEC Level 3 Unit 28 Websites L01- All students will understand the web architecture and components which allow the internet and websites.
Chapter 4 Infrastructure as a Service (IaaS)
THE DATACENTER NEEDS AN OPERATING SYSTEM MATEI ZAHARIA, BENJAMIN HINDMAN, ANDY KONWINSKI, ALI GHODSI, ANTHONY JOSEPH, RANDY KATZ, SCOTT SHENKER, ION STOICA.
Introduction CSCI 444/544 Operating Systems Fall 2008.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
Berkeley Data Analytics Stack (BDAS) Overview Ion Stoica UC Berkeley UC BERKELEY.
Virtual Synchrony Jared Cantwell. Review Multicast Causal and total ordering Consistent Cuts Synchronized clocks Impossibility of consensus Distributed.
1 From Internet Data Centers to Data Centers in the Cloud Data Centers Evolution − Internet Data Center − Enterprise Data Centers −Web 2.0 Mega Data Centers.
UNCW UNCW SIGGRAPH 2002 Topic #3: Continuous Media in Wired and Wireless Environments Ronald J. Vetter Department of Computer Science University of North.
Mesos A Platform for Fine-Grained Resource Sharing in Data Centers Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy.
Criticisms of I3 Jack Lange. General Issues ► Design ► Performance ► Practicality.
CS 268: Future Internet Architectures Ion Stoica May 1, 2006.
CS 268: Future Internet Architectures Ion Stoica May 6, 2003.
Energy Saving Software based on Cloud Computing for Adjustable Processing Environments (ESSCCAPE) The Green Cloud.
Cloud Computing Concept&nature Cloud computing refers to the applications delivered as services over the Internet and the hardware, and systems software.
The Future of Internet Research Scott Shenker (on behalf of many networking collaborators)
CS : Creating the Grid OS—A Computer Science Approach to Energy Problems David E. Culler, Randy H. Katz University of California, Berkeley August.
Copyright © Clifford Neuman - UNIVERSITY OF SOUTHERN CALIFORNIA - INFORMATION SCIENCES INSTITUTE Distributed Systems Shahab Baqai Lahore University.
The Future of the Internet Jennifer Rexford ’91 Computer Science Department Princeton University
Damian Gordon.  When we hook up computers together using data communication facilities, we call this a computer network.
AMPCamp Introduction to Berkeley Data Analytics Systems (BDAS)
IBM Haifa Research 1 The Cloud Trade Off IBM Haifa Research Storage Systems.
Clouds on IT horizon Faculty of Maritime Studies University of Rijeka Sanja Mohorovičić INFuture 2009, Zagreb, 5 November 2009.
CLOUD COMPUTING & COST MANAGEMENT S. Gurubalasubramaniyan, MSc IT, MTech Presented by.
Introduction. Readings r Van Steen and Tanenbaum: 5.1 r Coulouris: 10.3.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Hosted on the Powerful Microsoft Azure Platform, Advent Countdown Lets Companies Run Reliable and Scalable Holiday Marketing Campaigns MICROSOFT AZURE.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Lecture 8: Design of Parallel Programs Part III Lecturer: Simon Winberg.
Lecture 3: Sun: 16/4/1435 Distributed Computing Technologies and Middleware Lecturer/ Kawther Abas CS- 492 : Distributed system.
◦ What is an Operating System? What is an Operating System? ◦ Operating System Objectives Operating System Objectives ◦ Services Provided by the Operating.
LOGO Service and network administration Storage Virtualization.
Cloud Computing Dave Elliman 11/10/2015G53ELC 1. Source: NY Times (6/14/2006) The datacenter is the computer!
1 CS 294: Big Data System Research: Trends and Challenges Fall 2015 (MW 9:30-11:00, 310 Soda Hall) Ion Stoica and Ali Ghodsi (
© Sogeti Sogeti-Microsoft Alliance Cloud Computing Barriers and Benefits.
CDN: Content Distribution Networks  References:  CS613 textbook, “Computer Networking – A Top-Down Approach”, 6 th edition. Chapter  The text.
Agenda Motion Imagery Challenges Overview of our Cloud Activities -Big Data -Large Data Implementation Lessons Learned Summary.
Yury Ivanou Nataliya Khomich. The population of Belarus is about 10 million people.
1 CS6320 – SW Engineering of Web- Based Systems L. Grewe.
Systems and Networking Challenges in Cloud Computing: Toward Software-Defined Clouds Aditya Akella TA: Aaron Gember Fall
Securely Synchronize and Share Enterprise Files across Desktops, Web, and Mobile with EasiShare on the Powerful Microsoft Azure Cloud Platform MICROSOFT.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
Distributed systems [Fall 2015] G Lec 1: Course Introduction.
MidVision Enables Clients to Rent IBM WebSphere for Development, Test, and Peak Production Workloads in the Cloud on Microsoft Azure MICROSOFT AZURE ISV.
Information-Centric Networks02c-1 Week 2 / Paper 3 Future Internet architecture: clean-slate versus evolutionary research –Jennifer Rexford, Constantine.
Information-Centric Networks Section # 2.3: Internet Evolution Instructor: George Xylomenos Department: Informatics.
Web Technologies Lecture 13 Introduction to cloud computing.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
RANDY MODOWSKI COSC Cloud Computing. Road Map What is Cloud Computing? History of “The Cloud” Cloud Milestones How Cloud Computing is being used.
4a. Aula 2o. Período de Livro texto Copyright © 2012, Elsevier Inc. All rights reserved March 5, 2012 Prof. Kai Hwang, USC Cloud Roles in.
GameChanger’s Rate Quote Issue Solution is Deployed to Microsoft Azure for a Fast, Flexible Direct to Consumer Insurance Sales Solution MICROSOFT AZURE.
Trends In Distributed File Systems Professor: Dr. Zhang CSc 8320 Advanced Operating Systems Presented by: Tu Tran Section 5.3.
CIS 700-5: The Design and Implementation of Cloud Networks
Organizations Are Embracing New Opportunities
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Give Your Data the Edge A Scalable Data Delivery Platform
Give Your Data the Edge A Scalable Data Delivery Platform
Vidcoding Introduces Scalable Video and TV Encoding in the Cloud at an Affordable Price by Utilizing the Processing Power of Azure Batch MICROSOFT AZURE.
CS 268: Computer Networking
Consistency in Distributed Systems
ECE 671 – Lecture 16 Content Distribution Networks
Quasardb Is a Fast, Reliable, and Highly Scalable Application Database, Built on Microsoft Azure and Designed Not to Buckle Under Demand MICROSOFT AZURE.
Internet Protocols IP: Internet Protocol
EE 122: Lecture 22 (Overlay Networks)
CS 239 – Big Data Systems Fall 2018
Presentation transcript:

1 CS : Cloud Computing, Systems, Networking, and Frameworks Fall 2011 (MW 1:00-2:30, 293 Cory Hall) Ion Stoica (

2 Cloud Computing in Numbers Datacenter instance: Cost in billion range > 100,000 servers Number of servers (estimates*) Google: ~1 mil servers Microsoft, Yahoo!, IBM, HP: several 100,000s each Amazon, Ebay, GoDaddy, Facebook, Akamai: > 50,000 (*

3 Why does Cloud Computing Matter? Fundamental change The way applications are written and deployed Internet traffic: Internet becomes last-hop between hosts and datacenter Economics: pay as you go Opportunity to rethink: Large scale distributed systems Network architectures Tradeoffs in computer systems

4 How is Cloud Computing Different? How is different from distributed systems? How is different from parallel systems? Axis: Environment constraints Scale Type of failures Application requirements …

5 Academia Challenges Rapid evolving field Need to be in the avantgarde of understanding challenges and trends Unfortunately, academia trails industry Very hard to achieve the scale and generate the workload that reveal main challenges How to address above challenges?

Rising to the Challenge A thriving open source ecosystem Hadoop/HDF, Hive, Pig, … Distributions: Cloudera, Hortonworks Many users: Facebook, Twitter, Yahoo!, … Strong collaboration between Berkeley and industry Google, Microsoft, Facebook, Twitter, Yahoo!, … Academia enables experimentation Hard to explore alternatives in industry: usually pick a design and go with it for years 6

Research Trust Multiprogramming for datacenters: multiple frameworks sharing same datacenter Why? Improve utilization, performance Sharing data Enable applications using different frameworks Multiple frameworks sharing same datacenter Current project: Mesos, Orchestra, Memento, … Future: towards an OS for clusters 7

Research Trust (con’d) Real-time, interactive processing of big data Why? Need to process huge volumes of unstructure data Batch processing doesn’t allow interactive data processing Current projects: Spark, Memento Future: trade between response time and accuracy 8

9 Grading Project: 60% Class presentations: 40% Around 2 papers per student See Randy’s guidelines for leading discussion on papers dingPapers.pdf dingPapers.pdf

Administrative Information Class website: Office Hours (Soda 465d): Monday, 3-4pm Wednesday, 11-noon Create a (anonymized) blog account for paper reviews if you don’t have one yet (e.g., Sent me an by Wednesday, August 31, with your blog url Preferred for the class list 10

11 Papers Is the problem real? What is the solution’s main idea (nugget)? Why is solution different from previous work? Are system assumptions different? Is workload different? Is problem new? Does the paper (or do you) identify any fundamental/hard trade-offs?

12 Papers (cont’d) Do you think the work will be influential in 10 years? Why or why not? Predicting the future always hard, but worth a try Look at past examples for inspiration

13 Streaming Over TCP Countless papers: Why cannot be done… New protocols to do it… Today Virtually all streaming over TCP Trend to stream over HTTP!

14 Why did it Succeed?

15 Multicast Countless papers: Why world will come to a standstill without multicast… New protocols to do it… Today Multicast is used only in enterprise settings at best Overlay multicast widely used in the Internet CDN based, e.g., WorldCup, March Madness, Iinagurations,... P2P, mostly popular outside US (e.g., China)

16 Why Did it Fail?

17 Consistency Everywhere Many papers & systems: Group synchronous communication Causally ordered message delivery … Today: Almost never used in WANs, and rarely used in LANs

18 Why Did it Fail?

19 Shared Memory Countless papers: How shared memory simplifies programming parallel computers Many, many systems proposed and build Today: Message passing (MPI) took over as the de facto standard for writing parallel applications

20 Why Did it Fail?

21 Network Computer Big in 90s Promoted by an alliance of Sun, Oracle, Acorn Promise: many of advantages of cloud computing Easy to manage Application sharing … Failed miserably

22 Why Did it Fail?

Coming Back: ChromeOS Will it succeed this time? 23

24 What are Hard/Fundamental Tradeoffs? Brewer’s CAP conjecture: “Consistency, Availability, Partition-tolerance”, you can have only two in a distributed system In a in-order, reliable communication protocol cannot minimize overhead and latency simultaneously Hard to simultaneously maximize evolvability and performance