Transparency in Distributed Operating Systems Vijay Akkineni.

Slides:



Advertisements
Similar presentations
Distributed Computing Systems Overview of Distributed Systems Andrew Tanenbaum and Marten van Steen, Distributed Systems – Principles and Paradigms, Prentice.
Advertisements

Distributed Processing, Client/Server and Clusters
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Database Architectures and the Web
Distributed Systems basics
Distributed Systems 1 Topics  What is a Distributed System?  Why Distributed Systems?  Examples of Distributed Systems  Distributed System Requirements.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Definition of a Distributed System (1) A distributed system is: A collection of independent computers that appears to its users as a single coherent system.
Chapter 9 Designing Systems for Diverse Environments.
Distributed Processing, Client/Server, and Clusters
Distributed components
© Chinese University, CSE Dept. Distributed Systems / Distributed Systems Topic 1: Characterization and Design Goals Dr. Michael R. Lyu Computer.
City University London
REK’s adaptation of Prof. Claypool’s adaptation of
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
Overview Distributed vs. decentralized Why distributed databases
Introducing … Distributed Systems.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
DISTRIBUTED COMPUTING
Massively Distributed Database Systems Spring 2014 Ki-Joune Li Pusan National University.
Chapter 3 Database Architectures and the Web Pearson Education © 2009.
Ch 4. The Evolution of Analytic Scalability
Computer System Architectures Computer System Software
Database Architectures and the Web Session 5
Introduction to DISTRIBUTED SYSTEMS Tran, Van Hoai Department of Systems & Networking Faculty of Computer Science & Engineering HCMC University of Technology.
1 Distributed Operating Systems and Process Scheduling Brett O’Neill CSE 8343 – Group A6.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Session-8 Data Management for Decision Support
Distributed File System By Manshu Zhang. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
Distributed Systems COEN 317 Introduction Chapter 1,2,3.
Types of Operating Systems
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
SUMAN K REDDY BURJUKINDI. Evolution of Modern Operating Systems 1 st Generation: Centralized Operating System 2 nd Generation: Network Operating System.
OS2- Sem ; R. Jalili Introduction Chapter 1.
Kyung Hee University 1/41 Introduction Chapter 1.
DISTRIBUTED COMPUTING Introduction Dr. Yingwu Zhu.
Distributed Computing Systems CSCI 4780/6780. Distributed System A distributed system is: A collection of independent computers that appears to its users.
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
A Summary of the Distributed System Concepts and Architectures Gayathri V.R. Kunapuli
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
Distributed Databases
CORBA1 Distributed Software Systems Any software system can be physically distributed By distributed coupling we get the following:  Improved performance.
Shuman Guo CSc 8320 Advanced Operating Systems
By, Naga Manojna Chintapalli. CHAPTER 2.2 TRANSPARENCY.
Definition of a Distributed System (1) A distributed system is: A collection of independent computers that appears to its users as a single coherent system.
Types of Operating Systems 1 Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015.
Distributed Systems Definition.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
© Chinese University, CSE Dept. Distributed Systems / Distributed Systems Topic 1: Characterization of Distributed & Mobile Systems Dr. Michael R.
Distributed Computing Systems CSCI 6900/4900. Review Distributed system –A collection of independent computers that appears to its users as a single coherent.
Transparency Wang, Yang edu.
1 Distributed Processing Chapter 1 : Introduction.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
Distributed Computing Systems CSCI 6900/4900. Review Definition & characteristics of distributed systems Distributed system organization Design goals.
TEXT: Distributed Operating systems A. S. Tanenbaum Papers oriented on: 1.OS Structures 2.Shared Memory Systems 3.Advanced Topics in Communications 4.Distributed.
Primitive Concepts of Distributed Systems Chapter 1.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
Introduction to Distributed Systems. Distributed System Definitions:  “A distributed system is a collection of independent computers that appear to the.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Chapter 1 Characterization of Distributed Systems
Introduction to Distributed Platforms
Introduction to Distributed Systems
Database Architectures and the Web
Distributed System Structures 16: Distributed Structures
Introduction To Distributed Systems
Distributed Systems and Concurrency: Distributed Systems
Presentation transcript:

Transparency in Distributed Operating Systems Vijay Akkineni

Operating Systems Generations Centralized Operating systems Network Operating Systems Distributed Operating Systems Cooperative Autonomous Systems Cloud Computing

Partitioning of COS ApplicationsFinanceWord processingWeb Application SubsytemsProgramming EnvrionmentDatabase Systems UtilitiesCompilerCommand Interpreter library System ServicesFile SystemMemory ManagerScehduler KernelCPU Multiplexing, Interrupt Handling, Device Drivers, Synchronization primitives, Interprocess communication

Network Operating System Peer to Peer communications. Seven layer OSI architecture. Examples Remote Login, File Transfer, Messaging, Network Browsing, Remote Execution.

Distributed Operating System Loosely coupled systems. Sharing or resources and coordination of resources. Transparency – Key difference between NOS and DOS. Distributed resources and activities are to be managed and controlled.

Distributed Operating Systems

Cooperative Autonomous Systems Characterized by Service Integration. Middleware – Cobra, JMS, RMI.

Transparency Hide irrelevant system dependent details from the users Higher Implementation Complexities Single System Image Minimal Knowledge

Location Transparency User has no awareness of object locations,objects are mapped and referred to by logical names. WebServices UDDI, Federated Services - SOA

Access Transparency Ability to access local and remote system objects in uniform way. The physical separation of system objects is concealed from the user. Examples – Accessing a file from the local file system and from a cloud drive.

Migration Transparency Logical Resources and Physical processes migrated by the system, from one location to another in an attempt to maximize efficiency, reliability, availability or security should be automatically controlled by the system Example – Application Servers using JNDI

Replication Transparency Exhibit consistency of multiple instances of files and data. System elements are copied to remote points in the system in an effort to possibly increase efficiencies through better proximity or provide increased reliability through duplication. Examples – Google's Big Table, HDFS.

Concurrency Transparency Sharing of Objects without interference. Similar to Time sharing. An important challenge when designing distributed systems is how to deal with concurrent accesses. Example – An important design goal for distributed database. Transactional Integrity and ACID properties during multiple transactions happening concurrently.

Failure Transparency Failure Transparency tries to mask failures so that they are not seen or noticed by the users. It is difficult to identify between a resource that has failed and a resource which is performing badly (slowly). Consider opening a webpage - is it dead or painfully slow, how long should the browser wait? Examples - Map Reduce Frameworks, DFS Replication on Data Nodes.

Performance Transparency Attempt to achieve a consistent and predictable performance level even with changes to system structure or load distribution. When parts of the system experience significant delay or load imbalance, the system is responsible for the automatic, rapid, and accurate detection and orchestration of a remedy. Examples - Load balancing, Speculative execution in Map Reduce.

Size/Scale Transparency A system's geographic reach, number of nodes, level of node capability, or any changes therein should exists without any required user knowledge or interaction. Research Area - Currently there is lot of ongoing research on running Map Reduce job across the data centers. Partition compute jobs based on geographical locality.

Revision transparency System occasionally have need for system-software version changes and changes to internal implementation of system infrastructure. Examples - Linux Kernel Upgrades and how it should not effect the existing software applications on the OS.

Revision transparency System occasionally have need for system-software version changes and changes to internal implementation of system infrastructure. Examples - Linux Kernel Upgrades and how it should not effect the existing software applications on the OS.

Parallelism Transparency The most difficult aspect of transparency,”Holy Grail” of distributed system designers. Systems parallel execution of processes throughout the system should occur without any required user knowledge. Examples – Parallel Algorithms on multicore processors and Map Reduce tasks on multiple systems.

Transparency Summarized Access Hide differences in data representation and how a resource is accessed LocationHide where a resource is located MigrationHide that a resource may move to another location Relocation Hide that a resource may be moved to another location while in use Replication Hide that a resource may be shared by several competitive users Concurrency Hide that a resource may be shared by several competitive users FailureHide the failure and recovery of a resource PersistenceHide whether a (software) resource is in memory or on disk

Major Research Areas AreasTransparencies Communication Networks Synchronization Distributed Algorithms Interaction and Control Transparency Process Scheduling Deadlock Handling Load balancing Performance Transparency Resource Scheduling File Sharing Concurrency Control Resource Transparency Failure Handling Configuration Redundancy Failure Transparency

Towards Cloud OS Permit autonomous management of its resources on behalf of its users and applications. Cloud OS operations must continue despite loss of nodes, entire clusters, and network partitioning. The Cloud OS must be operating system and architecture agnostic. The Cloud must support multiple types of applications, including legacy. Cloud OS management system must be decentralized, scalable, have little overhead per user and per machine and be cost effective

Logical Model of Cloud OS

Cloud Middleware Heterogeneous Nature of Cloud hindering adoption of Cloud Technologies. IBM is researching into Altocumulus middleware, which offers a uniform API for using Amazon EC2, Eucalyptus, Google AppEngine, and IBM HiPODS Cloud, aiming to provide an API which is Cloud agnostic.

References “HP performance-optimized datacenter (POD).” Data Sheet, “Amazon EC2.” [Online] Apache hadoop Map Reduce