Cloud Computing Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering University of Washington August 2012.

Slides:



Advertisements
Similar presentations
Let's say we want to access domain - reliablescribe.com First we need to buy a computer We need to subscribe to an Internet Service Provider (ISP) The.
Advertisements

Ed Duguid with subject: MACE Cloud
Chapter 4 Infrastructure as a Service (IaaS)
CSE 451: Operating Systems Spring 2013 Module 24 Distributed Systems Ed Lazowska Allen Center 570 © 2013 Gribble, Lazowska,
Amazon RDS (MySQL and Oracle) and SQL Azure Emil Tabakov Telerik Software Academy academy.telerik.com.
Novell Server Linux vs. windows server 2008 By: Gabe Miller.
Cassandra Database Project Alireza Haghdoost, Jake Moroshek Computer Science and Engineering University of Minnesota-Twin Cities Nov. 17, 2011 News Presentation:
2/25/2004 The Google Cluster Architecture February 25, 2004.
Cloud Computing Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering University of Washington August 2010.
1 School of Computing Science Simon Fraser University CMPT 300: Operating Systems I Dr. Mohamed Hefeeda.
The Internet Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering University of Washington August 2010.
CSE 490h, Autumn 2008 Scalable Systems: Design, Implementation and Use of Large Scale Clusters Instructors: Ed Lazowska & Aaron Kimball TA: Slava Chernyak.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
AN INTRODUCTION TO CLOUD COMPUTING Web, as a Platform…
DISTRIBUTED COMPUTING
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
VMware vCenter Server Module 4.
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
Cloud Computing – The Cloud Dr. Jie Liu. Definition  Cloud computing is Web-based processing, whereby shared resources, software, and information are.
Windows Server MIS 424 Professor Sandvig. Overview Role of servers Performance Requirements Server Hardware Software Windows Server IIS.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Effectively Explaining the Cloud to Your Colleagues.
Server Hardware Chapter 22 Release 22/10/2010Jetking Infotrain Ltd.

GIS and Cloud Computing. Flickr  Upload and manage your photos online  Share your photos with your family and friends  Post your photos everywhere.
Redundant Array of Inexpensive Disks aka Redundant Array of Independent Disks (RAID) Modified from CCT slides.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
CH2 System models.
Webscale Computing Mike Culver Amazon Web Services.
The Magic of the Cloud: Supercomputers for Everyone, Everywhere Prof. Eric A. Brewer UC Berkeley.
Presented by: Mostafa Magdi. Contents Introduction. Cloud Computing Definition. Cloud Computing Characteristics. Cloud Computing Key features. Cost Virtualization.
(C) 2008 Clusterpoint(C) 2008 ClusterPoint Ltd. Empowering You to Manage and Drive Down Database Costs April 17, 2009 Gints Ernestsons, CEO © 2009 Clusterpoint.
 Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over a network (typically the Internet). 
CSE 451: Operating Systems Spring 2013 Module 26 Cloud Computing Ed Lazowska Allen Center 570 © 2013 Gribble, Lazowska, Levy,
Hosted SharePoint. Part 3/3: Office Live as a WSS solution Speaker Name Microsoft Corporation Hosted.
Cloud Computing Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering University of Washington.
CSE 451: Operating Systems Autumn 2010 Module 25 Cloud Computing Ed Lazowska Allen Center 570.
1 MSRBot Web Crawler Dennis Fetterly Microsoft Research Silicon Valley Lab © Microsoft Corporation.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
Web Technologies Lecture 13 Introduction to cloud computing.
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
Data Centers and Cloud Computing 1. 2 Data Centers 3.
The Google Cluster Architecture Written By: Luiz André Barroso Jeffrey Dean Urs Hölzle Presented By: Omkar Kasinadhuni Simerjeet Kaur.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
 What is Cluster Computing  Benefits of Cluster Computing  Types of Cluster’s  Cluster Components  Cluster Applications  Performance Impacts and.
Sql Server Architecture for World Domination Tristan Wilson.
Brian Varela Group F February 16, 2010 What is the difference between a client and a server?
Lecture 1 Book: Hadoop in Action by Chuck Lam Online course – “Cloud Computing Concepts” lecture notes by Indranil Gupta.
Clouding with Microsoft Azure
Use of Cloud Computing for Implementation of e-Governance Services
3.1 Types of Servers.
Advanced Topics in Concurrency and Reactive Programming: Case Study – Google Cluster Majeed Kassis.
Web Services application that operates over a network
3.1 Types of Servers.
CLUSTER COMPUTING Presented By, Navaneeth.C.Mouly 1AY05IS037
3.1 Types of Servers.
Chapter 6 Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism Topic 11 Amazon Web Services Prof. Zhang Gang
Cloud Computing Ed Lazowska August 2011 Bill & Melinda Gates Chair in
CSE 451: Operating Systems Autumn 2009 Module 23 Distributed Systems
Design Unit 26 Design a small or home office network
CSE 451: Operating Systems Autumn 2010 Module 23 Distributed Systems
Building a Database on S3
CSE 451: Operating Systems Spring 2005 Module 20 Distributed Systems
CS246 Search Engine Scale.
CS246: Search-Engine Scale
Database System Architectures
CSE 451: Operating Systems Winter 2007 Module 21 Distributed Systems
The Internet and Electronic mail
CSE 451: Operating Systems Autumn Module 24 Virtual Machine Monitors
Presentation transcript:

Cloud Computing Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering University of Washington August 2012

6/24/2016 Office applications Databases and storage Math and science Web browser Personal computing

6/24/2016 Office applications Math and science Web browser Databases and storage Cloud accessed through the browser

With the cloud provider ’ s domain name …

Or with your own …

6/24/2016 Math and science Office applications Web browser Databases and storage Why not office applications too?

6/24/2016 © 2009 Gribble, Lazowska, Levy, Zahorjan Web browser Math and science Office applications Databases and storage Why not everything else?

Consider … z Sharing is easy z Someone else does backup z Someone else handles software updates z There ’ s 7x24x365 operations support, auxiliary power, redundant network connections, geographical diversity z Scalability – both up and down – is instantaneous z Many fewer demands on the local operating system and computer z Incredibly disruptive to existing business models!

Amazon Elastic Compute Cloud (EC2) z $0.68 per hour for y 8 cores of 4 GHz 64-bit Intel Xeon or AMD Opteron y 7 GB memory y 1.69 TB scratch storage z Need it 24x7 for a year? y $3900 z $0.085 per hour for y 1 core of 1.2 GHz 32-bit Intel or AMD (1/20 th the above) y 1.7 GB memory y 160 GB scratch storage z Need it 24x7 for a year? y $490

z This includes y Purchase + replacement y Housing y Power y Operation y Reliability y Security y Instantaneous expansion and contraction z 1000 processors for 1 day costs the same as 1 processor for 1000 days!

Velcro

z A datacenter has containers z A container has 1, ,000 servers z A server has two processors, 2 disks, tons of memory, battery backup z Processors are chosen for power efficiency, not performance

Slide courtesy of Werner Vogels

z Many hundreds of machines are involved in a single Google search request (remember, the web is 400+TB) y There are multiple clusters (of thousands of computers each) all over the world y DNS routes your search to a nearby cluster Isn’t this just timesharing?

y A cluster consists of Google Web Servers, Index Servers, Doc Servers, and various other servers (ads, spell checking, etc.) y These are cheap standalone computers, rack-mounted, connected by commodity networking gear

y Within the cluster, load-balancing routes your search to a lightly-loaded Google Web Server (GWS), which will coordinate the search and response y The index is partitioned into “ shards. ” Each shard indexes a subset of the docs (web pages). Each shard is replicated, and can be searched by multiple computers – “ index servers ” y The GWS routes your search to one index server associated with each shard, through another load-balancer y When the dust has settled, the result is an ID for every doc satisfying your search, rank-ordered by relevance

y The docs, too, are partitioned into “ shards ” – the partitioning is a hash on the doc ID. Each shard contains the full text of a subset of the docs. Each shard can be searched by multiple computers – “ doc servers ” y The GWS sends appropriate doc IDs to one doc server associated with each relevant shard y When the dust has settled, the result is a URL, a title, and a summary for every relevant doc

y Meanwhile, the ad server has done its thing, the spell checker has done its thing, etc. y The GWS builds an HTTP response to your search and ships it off z Many hundreds of computers have enabled you to search 400+TB of web in ~100 ms.

z Enormous volumes of data z Extreme parallelism z The cheapest imaginable components y Failures occur all the time y You couldn ’ t afford to prevent this in hardware z Software makes it y Fault-Tolerant y Highly Available y Recoverable y Consistent y Scalable y Predictable y Secure