Hwajung Lee ITEC452 Distributed Computing Lecture 1 Introduction to Distributed Systems.

Slides:



Advertisements
Similar presentations
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts Amherst Operating Systems CMPSCI 377 Lecture.
Advertisements

1 Distributed Computing Algorithms CSCI Distributed Computing: everything not centralized many processors.
What is cloud computing? Why is this different? Jimmy Lin The iSchool University of Maryland Monday, March 30, 2009 This work is licensed under a Creative.
Based on last years lecture notes, used by Juha Takkinen.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Cloud Computing Lecture #1 Parallel and Distributed Computing Jimmy Lin The iSchool University of Maryland Monday, January 28, 2008 This work is licensed.
Cloud Computing Lecture #1 What is Cloud Computing? (and an intro to parallel/distributed processing) Jimmy Lin The iSchool University of Maryland Wednesday,
DISTRIBUTED COMPUTING
Dr. Kalpakis CMSC621 Advanced Operating Systems Introduction.
Massively Distributed Database Systems Spring 2014 Ki-Joune Li Pusan National University.
WEB SCIENCE: DISTRIBUTED APPROACHES. Mobile Computing Laptops Smartphones Tablets And now… Smartwatches Fitness trackers Google Glass (are you a glasshole?)
Word Wide Cache Distributed Caching for the Distributed Enterprise.
Cloud Computing Part #2 Materials adopted from the slides by Jimmy Lin, The iSchool, University of Maryland Zigmunds Buliņš, Mg. sc. ing 1.
Lecture 6: Introduction to Distributed Computing.
Distributed systems [Fall 2014] G Lec 1: Course Introduction.
What is a Distributed System? n From various textbooks: l “A distributed system is a collection of independent computers that appear to the users of the.
August 15, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 12: Multiprocessors: Non-Uniform Memory Access * Jeremy R. Johnson.
1 COMPSCI 110 Operating Systems Who - Introductions How - Policies and Administrative Details Why - Objectives and Expectations What - Our Topic: Operating.
Introduction. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
Distributed Systems and Algorithms Sukumar Ghosh University of Iowa Spring 2011.
Kyung Hee University 1/41 Introduction Chapter 1.
Intro to Parallel and Distributed Processing Some material adapted from slides by Christophe Bisciglia, Aaron Kimball, & Sierra Michels-Slettvet, Google.
Cloud Computing Lecture #1 What is Cloud Computing? (and an intro to parallel/distributed processing) Jimmy Lin The iSchool University of Maryland Modified.
Hwajung Lee. Maximilien Brice, © CERN  Fact: Processor population is exploding. Technology has dramatically reduced the price of processors.  Geographic.
Problem-solving on large-scale clusters: theory and applications Lecture 4: GFS & Course Wrap-up.
Distributed systems [Fall 2015] G Lec 1: Course Introduction.
Distributed Computing Systems CSCI 4780/6780. Scalability ConceptExample Centralized servicesA single server for all users Centralized dataA single on-line.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 1.
What is a distributed system? A network of processes. The nodes are processes, and the edges are communication channels.
Background Computer System Architectures Computer System Software.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
Intro to Parallel and Distributed Processing Some material adapted from slides by Christophe Bisciglia, Aaron Kimball, & Sierra Michels-Slettvet, Google.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
Distributed and Parallel Processing George Wells.
Lecture 3 – MapReduce: Implementation CSE 490h – Introduction to Distributed Computing, Spring 2009 Except as otherwise noted, the content of this presentation.
Chapter 1 Characterization of Distributed Systems
COMPSCI 110 Operating Systems
CS/CE/TE 6378 Advanced Operating Systems
Distributed Operating Systems Spring 2004
Designing the Physical Architecture
Chilimbi, et al. (2014) Microsoft Research
Network Operating Systems (NOS)
Distributed Operating Systems
Definition of Distributed System
Lecture 1: Network Operating Systems (NOS)
Spatial Analysis With Big Data
University of Technology
Networks Intro to Digital Technology
Lecture 7: Introduction to Distributed Computing.
Operating Systems : Overview
Parallel and Multiprocessor Architectures – Shared Memory
Data Structures and Algorithms in Parallel Computing
Towards Next Generation Panel at SAINT 2002
CLUSTER COMPUTING.
Architectures of distributed systems Fundamental Models
Architectures of distributed systems Fundamental Models
COMP28112 Distributed Computing
COMP28112 Distributed Computing
COMP28112 Distributed Computing
COMP28112 Distributed Computing
Distributed Computing:
Outline Review of Classical Operating Systems - continued
Operating Systems : Overview
Operating Systems : Overview
Operating Systems : Overview
Architectures of distributed systems Fundamental Models
Distributed Systems (15-440)
COMP28112 Distributed Computing
CS 5620 Distributed Systems and Algorithms
Distributed Systems and Algorithms
Presentation transcript:

Hwajung Lee ITEC452 Distributed Computing Lecture 1 Introduction to Distributed Systems

Maximilien Brice, © CERN

Maximilien Brice, © CERN

Maximilien Brice, © CERN

Why distributed systems ? Fact: Processor population is exploding. Technology has dramatically reduced the price of processors. Geographic distribution of processes Resource sharing as used in P2P networks Computation speed up (as in a grid) Fault tolerance

What is a distributed system? (1) A network of processes/resources. The nodes are processes/resources, and the edges are communication channels.

What is a distributed system? (2) The logical distribution of functional capabilities Multiple processes Interprocess communication Disjoint address space Collective goal

What is a distributed system? (2) Collective Goal? Don’t hold your breath: Biocomputing Nanocomputing Quantum computing … It all boils down to… Divide-and-conquer Throwing more hardware at the problem Reference: Lecture Note by Jimmy Lin, Univ. of Maryland

What is a distributed system? (2) Divide and Conquer “Work” Partition w1 w2 w3 “worker” “worker” “worker” r1 r2 r3 Combine “Result” Reference: Lecture Note by Jimmy Lin, Univ. of Maryland

What is a distributed system? (2) Different Workers Different threads in the same core Different cores in the same CPU Different CPUs in a multi-processor system Different machines in a distributed system Reference: Lecture Note by Jimmy Lin, Univ. of Maryland

What is a distributed system? (2) Choices, Choices, Choices Commodity vs. “exotic” hardware Number of machines vs. processor vs. cores Bandwidth of memory vs. disk vs. network Different programming models Reference: Lecture Note by Jimmy Lin, Univ. of Maryland

A classification Client-server model Peer-to-peer model

Parallel vs. Distributed In both parallel and distributed systems, the events are partially ordered. In parallel systems, the primarily issue is speed-up In distributed systems the primary issues are fault-tolerance and availability of services

Important services Internet banking Web search Net meeting Distance education Internet auction Google earth Google sky And so on…

Examples Large networks are very commonplace these days. Think of the world wide web. Other examples are: Ubiquitous Computing Cloud computing Grid computing, Grid computing networks Ex. Computational grids (OSG, Teragrid, SETI@home) Sensor networks Network of mobile robots And so on… Google processes 20 PB a day (2008) Kazaa (formally, KaZaA) BitTorrent

Sensor Network Checking the structural integrity

Mobile robots

Cloud Computing Image Source: www.vemurivenkatrao.com/nature/cloud/

Important issues Knowledge is local Clocks are not synchronized No shared address space Topology and routing Scalability Fault tolerance

Some common sub-problems Leader election Mutual exclusion Time synchronization Distributed snapshot Replica management

Objective of the course Understand how a system works, why it fails, and how we can guarantee our design. We are not discussing implementation or programming, although they are also important issues

Implementation Practical distributed systems must have a real network as its backbone. However, such systems can be simulated on a shared-memory multiprocessor, or even on a uniprocessor. (How will you do it?)