Presentation is loading. Please wait.

Presentation is loading. Please wait.

Univ. of TehranDistributed Operating Systems1 Advanced Operating Systems University of Tehran Dept. of EE and Computer Engineering By: Dr. Nasser Yazdani.

Similar presentations


Presentation on theme: "Univ. of TehranDistributed Operating Systems1 Advanced Operating Systems University of Tehran Dept. of EE and Computer Engineering By: Dr. Nasser Yazdani."— Presentation transcript:

1 Univ. of TehranDistributed Operating Systems1 Advanced Operating Systems University of Tehran Dept. of EE and Computer Engineering By: Dr. Nasser Yazdani Distributed Systems (introduction) Lecture 8: Distributed Systems (introduction)

2 Univ. of TehranDistributed Operating Systems2 Covered topic Distributed Systems, Why? And how. References Chapter 1 of the text book

3 Univ. of TehranDistributed Operating Systems3 Outline Why Distributed systems Challenges. Communication Distributed Operating systems Architectural models

4 Distributed System? (examples) The InternetGnutella peer to peer system Food Web of Little Rock Lake, WI A Sensor Network

5 Univ. of TehranDistributed Operating Systems5 Problems? Bigger Problems like weather forecast, Economic modeling, Scientific problems, etc. Faster machines? It is getting harder to extract the performance modern applications require out of a single processor machine Some application are inherently distributed, sensor networks, etc. A lot of Data to store in one place More efficient use of resources, sharing resources Solution: Distributed computing

6 Univ. of TehranDistributed Operating Systems6 Distributed systems Definitions A collection of autonomous computers linked by a network, with software designed to produce an integrated computing facility A distributed system is a collection of independent computers that appear to users as a single computer A system in which hardware or software components located at networked computers communicate and coordinate their actions only by passing messages Examples World Wide Web Automatic Teller Machines Cell Phones

7 A working definition A distributed system is a collection of entities, each of which is autonomous, programmable, asynchronous and failure-prone, and which communicate through an unreliable communication medium. Our interest in distributed systems involves algorithmics, design and implementation, maintenance, study

8 Univ. of TehranDistributed Operating Systems8 Advantages ItemDescription EconomicsMicroprocessors offer a better price/performance than mainframes SpeedA distributed system may have more total computing power than a mainframe Inherent Distribution Some applications involve spatially separated machines. ReliabilityIf one machine crashes, the system as a whole can still survive Incremental Growth Computing power can be added in small increments

9 Univ. of TehranDistributed Operating Systems9 Disadvantages ItemDescription SoftwareLittle software, OSs, etc., exist at present for distributed systems NetworkingThe network can saturate or cause other problems. SecurityEasy access also applies to secret data, privacy!

10 A range of challenges Failures (of nodes or Network) Asynchrony Scalability Security

11 Univ. of TehranDistributed Operating Systems11 Consequences Concurrency Concurrency is the norm instead of the exception Synchronization is critical No Global Clock There is a limit as to how accurate a global clock can be. Contrary to parallel systems. Independent Failures The more stuff you add the more likely something will break Single system view says independent failures should not affect users

12 Univ. of TehranDistributed Operating Systems12 Communication Issues Building a system out of interconnected computers requires that some major issues be addressed Independent failure Unreliable communication Insecure Communication Costly Communication

13 Univ. of TehranDistributed Operating Systems13 Distributed Operating Systems A distributed operating system supports the encapsulation and protection of resources inside servers; and it supports mechanisms required to access these resources, including naming, communication and scheduling The software for multiple CPU systems can be divided into three rough classes Network operating systems (file servers) Distributed Operating Systems Shared Memory Multiprocessors

14 Univ. of TehranDistributed Operating Systems14 Parallel Computing A large collection of processing elements that can communicate and cooperate to solve large problems quickly A form of information processing which uses concurrent events during execution In other words, both the language and the hardware support concurrency

15 Univ. of TehranDistributed Operating Systems15 Parallel Architectures Unlike traditional von Neumann machines, there is no single standard architecture used on parallel machines In fact dozens of different parallel architectures have been built and are being used Several people have tried to classify the different types of parallel machines The taxonomy proposed by Flynn is the most commonly used

16 Univ. of TehranDistributed Operating Systems16 Ex. Building a mail server mail arrives from outside world store it until... user reads/deletes/saves it Solution: Solution: One server w/ disk to store mail-boxes Problems: Performance: Stable performance under high load consistent w.r.t. client-side copies concurrent mail arrival, deletion crash recovery (crash while updating mail-box) availability

17 Univ. of TehranDistributed Operating Systems17 Other problems? Not necessarily plenty of bandwidth Not necessarily low latency Significant variance in latency and bandwidth Frequent and unpredictable partial failure of channel Lost messages, &c What else has changed? We don't have hardware support for synchronization/atomicity among hosts We don't have a global timer or clock Frequent and unpredictable failure of some CPUs, I/O devices, &c Snoopy caches are not practical, because broadcasting is too expensive.

18 Univ. of TehranDistributed Operating Systems18 Challenges There are a number of challenges found in building distributed systems Heterogeneity Openness Security Scalability Failure Handling Concurrency Transparency

19 Univ. of TehranDistributed Operating Systems19 Heterogeneity Applies to Networks Computer Hardware Operating Systems Programming Languages Implementations Middleware applies to a software layer that helps to handle heterogeneity

20 Univ. of TehranDistributed Operating Systems20 Openness The characteristic that a system can be extended in various ways Hardware extensions Software extensions Historically, computer systems were largely closed UNIX broke the mold for OS IBM PC broke the mold for hardware

21 Univ. of TehranDistributed Operating Systems21 Security Security is a huge issue in computing in general, but even more so in distributed computing Communication Distributed Resources Infrastructure Attacks

22 Univ. of TehranDistributed Operating Systems22 Scalability Distributed systems operate at many different scales Two workstations and a file server Department computers… Often the more important question is not can you scale, but can you scale well Consider the Internet

23 Univ. of TehranDistributed Operating Systems23 Failure Handling What happens when a fault occurs? Detect Mask Tolerate Fault tolerant design is based on two approaches Hardware redundancy Software recovery

24 Univ. of TehranDistributed Operating Systems24 Hardware Redundancy Two computers are employed for a single application, one acting as a standby Very costly, but often very effective, solution Redundancy can be planned at a finer grain Individual servers can be replicated Redundant hardware can be used for non- critical activities when no faults are present

25 Univ. of TehranDistributed Operating Systems25 Software Redundancy Software must be designed so that the state of permanent data can be recovered or “rolled back” when a fault is detected Transaction processing

26 Univ. of TehranDistributed Operating Systems26 Concurrency Concurrency in a distributed system does not necessarily mean concurrency within a single program Many users invoke similar commands Many different server processes may be running Synchronization, of course, is a problem

27 Univ. of TehranDistributed Operating Systems27 Transparency TransparencyDescription Access Hide differences in data representation and how a resource is accessed Location Hide where a resource is located Migration Hide that a resource may move to another location Relocation Hide that a resource may be moved to another location while in use Replication Hide that a resource may have several copies. Concurrency Hide that a resource may be shared by several competitive users Failure Hide the failure and recovery of a resource Persistence Hide whether a (software) resource is in memory or on disk

28 Univ. of TehranDistributed Operating Systems28 Scalability Problems Examples of scalability limitations. ConceptExample Centralized services A single server for all users Centralized data A single on-line telephone book Centralized algorithms Doing routing based on complete information

29 Univ. of TehranDistributed Operating Systems29 Scaling Techniques (1) 1.4 The difference between letting: a server or a client check forms as they are being filled

30 Univ. of TehranDistributed Operating Systems30 Scaling Techniques (2) 1.5 An example of dividing the DNS name space into zones.

31 Univ. of TehranDistributed Operating Systems31 Hardware Models 1.6 Different basic organizations and memories in distributed computer systems

32 Univ. of TehranDistributed Operating Systems32 Multiprocessors (1) A bus-based multiprocessor. 1.7

33 Univ. of TehranDistributed Operating Systems33 Multiprocessors (2) a) A crossbar switch b) An omega switching network 1.8

34 Univ. of TehranDistributed Operating Systems34 Homogeneous Multicomputer Systems a) Grid b) Hypercube 1-9

35 Univ. of TehranDistributed Operating Systems35 Software Models An overview between DOS (Distributed Operating Systems) NOS (Network Operating Systems) Middleware SystemDescriptionMain Goal DOS Tightly-coupled operating system for multi- processors and homogeneous multicomputers Hide and manage hardware resources NOS Loosely-coupled operating system for heterogeneous multicomputers (LAN and WAN) Offer local services to remote clients Middleware Additional layer atop of NOS implementing general-purpose services Provide distribution transparency

36 Univ. of TehranDistributed Operating Systems36 Uniprocessor Operating Systems Separating applications from operating system code through a microkernel. 1.11

37 Univ. of TehranDistributed Operating Systems37 Multicomputer Operating Systems (1) 1.14

38 Univ. of TehranDistributed Operating Systems38 Multicomputer Operating Systems (2) Alternatives for blocking and buffering in message passing. 1.15

39 Univ. of TehranDistributed Operating Systems39 Distributed Shared Memory Systems (1) a) Pages of address space distributed among four machines b) Situation after CPU 1 references page 10 c) Situation if page 10 is read only and replication is used

40 Univ. of TehranDistributed Operating Systems40 Distributed Shared Memory Systems (2) False sharing of a page between two independent processes. 1.18

41 Univ. of TehranDistributed Operating Systems41 Network Operating System (1) General structure of a network operating system. 1-19

42 Univ. of TehranDistributed Operating Systems42 Positioning Middleware General structure of a distributed system as middleware. 1-22

43 Univ. of TehranDistributed Operating Systems43 Software Layers Applications, services Computer and network hardware Platform Operating system Middleware

44 Univ. of TehranDistributed Operating Systems44 Middleware What does it do? Provides an API for the application Hides the underlying heterogeneity Examples Sun RPC, ISIS CORBA RMI DCOM

45 Univ. of TehranDistributed Operating Systems45 Middleware and Openness In an open middleware-based distributed system, the protocols used by each middleware layer should be the same, as well as the interfaces they offer to applications. 1.23

46 Univ. of TehranDistributed Operating Systems46 Comparison between Systems Item Distributed OS Network OS Middleware -based OS Multiproc.Multicomp. Degree of transparency Very HighHighLowHigh Same OS on all nodes Yes No Number of copies of OS 1NNN Basis for communication Shared memory MessagesFiles Model specific Resource management Global, central Global, distributed Per node Scalability NoModeratelyYesVaries Openness Closed Open

47 Univ. of TehranDistributed Operating Systems47 Next Lecture DS Architecture References Chapter 2 of the book The Anatomy of the Grid Chord: A Scalable Peer to peer Lookup Service for Internet Applications.


Download ppt "Univ. of TehranDistributed Operating Systems1 Advanced Operating Systems University of Tehran Dept. of EE and Computer Engineering By: Dr. Nasser Yazdani."

Similar presentations


Ads by Google