Advanced Operating Systems, Chapter 11: Distributed File Systems


Advanced Operating Systems, Chapter 11: Distributed File Systems. 11.6 Consistency and Replication. Presented by: Harini Penta

Consistency and replication: client-side caching, server-side replication, replication in peer-to-peer file systems, and file replication in Grid systems.

Client-Side Caching Figure 11-21. Client-side caching in NFS.

Client-side caching in NFS: Different strategies exist to handle the consistency of cached data, cached attributes, and so on. In NFSv4, a server may delegate some of its rights to a client when a file is opened. An important consequence of delegating a file to a client is that the server needs to be able to recall the delegation. Attribute values of the same file cached by two different clients may differ unless the clients keep these attributes mutually consistent. To mitigate the effects of such inconsistencies, NFS uses leases on cached attributes, file handles, and directories: once a lease expires, the cached entry must be revalidated with the server before it is used again.
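The lease idea can be illustrated with a minimal sketch. This is not NFS client code: the class name `LeasedAttrCache`, the `fetch_attrs` callable, and the 3-second lease are all assumptions made for illustration. It only shows that a cached attribute is served locally while its lease is valid and is fetched again once the lease has expired.

```python
import time


class LeasedAttrCache:
    """Client-side attribute cache whose entries expire after a lease period.

    Hypothetical sketch: fetch_attrs stands in for a round trip to the server.
    """

    def __init__(self, fetch_attrs, lease_seconds=3.0, clock=time.monotonic):
        self._fetch = fetch_attrs    # assumed callable: path -> attribute dict
        self._lease = lease_seconds  # illustrative value, not an NFS default
        self._clock = clock          # injectable clock, which simplifies testing
        self._entries = {}           # path -> (attrs, expiry_time)

    def getattr(self, path):
        now = self._clock()
        entry = self._entries.get(path)
        if entry is not None and now < entry[1]:
            return entry[0]          # lease still valid: serve the cached copy
        attrs = self._fetch(path)    # missing or expired: revalidate at the server
        self._entries[path] = (attrs, now + self._lease)
        return attrs
```

A shorter lease narrows the window in which two clients can see different attribute values, at the cost of more revalidation traffic.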

Client-Side Caching Figure 11-22. Using the NFSv4 callback mechanism to recall file delegation.

Client-side caching in Coda: Unlike in many other distributed file systems, cache coherence in Coda is maintained by means of callbacks. The interesting aspect of this scheme is that as long as a client knows it has an outstanding callback promise at the server, it can safely access the file locally.

Client-side caching for portable devices: In most cases, an explicit upload/download model is used for maintaining files on portable storage devices. If the file is locally available, it can be returned to the client; otherwise, a data transfer will need to take place. However, for many portable storage devices, we can expect that the user will use special programs to pre-install files on the device.
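The callback scheme in Coda can be sketched as follows. The classes `CodaServer` and `CodaClient` and their methods are hypothetical names chosen for this example, not Coda's actual interfaces. The sketch assumes that the server records a callback promise when a client fetches a file and breaks the promises of all other clients when the file is updated.

```python
class CodaServer:
    """Toy server that tracks callback promises per file (illustrative only)."""

    def __init__(self):
        self.files = {}     # file name -> contents
        self.promises = {}  # file name -> set of clients holding a promise

    def fetch(self, client, name):
        # Handing out a copy also records a callback promise for the client.
        self.promises.setdefault(name, set()).add(client)
        return self.files[name]

    def store(self, writer, name, data):
        self.files[name] = data
        # Break the promise of every other client that cached the file.
        for client in self.promises.pop(name, set()):
            if client is not writer:
                client.callback_break(name)
        self.promises[name] = {writer}


class CodaClient:
    def __init__(self, server):
        self.server = server
        self.cache = {}     # file name -> cached contents
        self.valid = set()  # files with an outstanding callback promise

    def open(self, name):
        if name not in self.valid:  # no promise: the copy may be stale
            self.cache[name] = self.server.fetch(self, name)
            self.valid.add(name)
        return self.cache[name]     # promise outstanding: local access is safe

    def write(self, name, data):
        self.cache[name] = data
        self.valid.add(name)
        self.server.store(self, name, data)

    def callback_break(self, name):
        self.valid.discard(name)
```

In this sketch a second `open` by the same client costs no server traffic until some other client writes the file.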

Client-Side Caching in Coda Figure 11-23. The use of local copies when opening a session in Coda.

Server replication in Coda: Coda allows file servers to be replicated. The unit of replication is a collection of files called a volume. The collection of Coda servers that have a copy of a volume is known as that volume's Volume Storage Group (VSG). A client's Accessible Volume Storage Group (AVSG) for a volume consists of those servers in that volume's VSG that the client can contact at the moment.
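Put in set terms, the AVSG is the part of the VSG that the client can currently reach. The helper below is a minimal sketch under that reading; the function name and the server labels are assumptions for illustration.

```python
def accessible_vsg(vsg, reachable):
    """Return the client's AVSG: servers in the VSG that it can contact now."""
    return frozenset(vsg) & frozenset(reachable)


# Example: a volume replicated on three servers, seen by two partitioned clients.
vsg = {"S1", "S2", "S3"}
avsg_a = accessible_vsg(vsg, {"S1", "S2"})  # client A reaches S1 and S2
avsg_b = accessible_vsg(vsg, {"S3"})        # client B reaches only S3
```

An empty result means the client cannot contact any server holding the volume.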

Server Replication

Server-side replication: In particular, a server $S_i$ in a VSG maintains a Coda version vector $CVV_i(f)$ for each file $f$ contained in that VSG. An update of $f$ at server $S_i$ will lead to an increment of $CVV_i(f)[i]$. With three servers, $CVV_i(f)$ is initially equal to $[1,1,1]$ for each server $S_i$. After updating $f$, client A multicasts $f$ to each server in its AVSG, that is, $S_1$ and $S_2$. Both servers will then record that their respective copy has been updated, but not that of $S_3$: $CVV_1(f) = CVV_2(f) = [2,2,1]$. Meanwhile, client B will be allowed to open a session in which it receives a copy of $f$ from server $S_3$, and subsequently update $f$ as well: $CVV_3(f) = [1,1,2]$. Neither vector dominates the other, so the two updates are in conflict and must be reconciled.
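The version-vector example can be checked with a short sketch. The function names are assumptions, and `apply_update` encodes one reading of the slide's numbers: every server in the writer's AVSG increments the entries of all AVSG members. `compare` is the standard element-wise comparison of version vectors.

```python
def apply_update(vector, avsg_indices):
    """Return the vector after an update seen by the servers in avsg_indices."""
    return [v + 1 if i in avsg_indices else v for i, v in enumerate(vector)]


def compare(v1, v2):
    """Classify two version vectors of equal length."""
    le = all(a <= b for a, b in zip(v1, v2))
    ge = all(a >= b for a, b in zip(v1, v2))
    if le and ge:
        return "equal"
    if le:
        return "dominated"  # v1 is an older version than v2
    if ge:
        return "dominates"  # v1 is a newer version than v2
    return "conflict"       # concurrent updates: neither vector dominates


initial = [1, 1, 1]
cvv1 = apply_update(initial, {0, 1})  # client A updates via S1 and S2
cvv3 = apply_update(initial, {2})     # client B updates via S3
print(cvv1, cvv3, compare(cvv1, cvv3))
```

Indices 0, 1, and 2 stand for S1, S2, and S3. When the partition heals, the "conflict" result is the signal that the copies cannot be merged by simply picking the newer one.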

Replication in peer-to-peer file systems. Unstructured peer-to-peer systems: Regardless of how broadcasting is limited, it should be clear that if files are replicated, searching becomes easier and faster. The more popular a file is, the more replicas are created and distributed across the overlay. Replication in unstructured peer-to-peer systems happens naturally when users download files from others and subsequently make them available to the community. Controlling these networks is very difficult in practice, except when parts are controlled by a single organization.

Structured peer-to-peer systems: Given the efficiency of lookup operations in structured peer-to-peer systems, replication is primarily deployed to balance the load between the nodes. One commonly applied method is to simply replicate a file along the path that a query has followed from source to destination. More specifically, when a query from node P to Q is routed through node R, R will check whether any of its files should be offloaded to P. If P can accept file f from R, each node visited on the route from P to R will install a pointer for f to P, indicating that a replica of f can be found at P.
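The pointer installation step can be sketched as below. All names are hypothetical, and two points are assumptions rather than part of the slide: the load test in `should_offload` (a simple ratio with an arbitrary factor) and the choice to install a pointer at R itself as well as at the intermediate nodes.

```python
def should_offload(load_r, load_p, factor=2.0):
    """Assumed criterion: R offloads when it is much busier than P."""
    return load_r > factor * load_p


def install_replica(route, f, replicas, pointers):
    """Copy f to P (route[0]) and leave pointers along the route up to R."""
    p = route[0]
    replicas.setdefault(p, set()).add(f)
    for node in route[1:]:                # nodes visited between P and R, plus R
        pointers.setdefault(node, {})[f] = p


def resolve(node, f, replicas, pointers):
    """Return a node known (at `node`) to hold f, or None if unknown."""
    if f in replicas.get(node, set()):
        return node                       # a local copy is available
    return pointers.get(node, {}).get(f)  # otherwise follow a pointer, if any


replicas = {"R": {"f"}}
pointers = {}
route = ["P", "N1", "R"]                  # query from P routed via N1 to R
if should_offload(load_r=50, load_p=10):
    install_replica(route, "f", replicas, pointers)
```

After this, a later query for f that reaches N1 can be redirected to P instead of adding to the load on R.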

Structured Peer-to-Peer Systems Figure 11-25. Balancing load in a peer-to-peer system by replication.

File replication in Grid systems: Naturally, performance plays a crucial role in this area, as many Grid applications are highly compute-intensive. A key observation is that in many Grid applications data are read-only. Replication in Grid systems therefore mainly revolves around the problem of locating the best sources to copy data from. This problem can be solved by special replica location services.
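A replica location service can be pictured as a catalog that maps a logical file name to the sites holding a copy, combined with some way of ranking those sites. The sketch below is a toy model under that assumption; the class, the site names, and the cost function are invented for illustration and do not correspond to any particular Grid middleware API.

```python
class ReplicaLocationService:
    """Toy catalog: logical file name -> sites that hold a replica."""

    def __init__(self):
        self._catalog = {}

    def register(self, logical_name, site):
        self._catalog.setdefault(logical_name, set()).add(site)

    def locate(self, logical_name):
        # Sorted so that the result is deterministic.
        return sorted(self._catalog.get(logical_name, ()))


def best_source(rls, logical_name, cost):
    """Pick the replica site with the lowest cost, or None if none exists."""
    sites = rls.locate(logical_name)
    if not sites:
        return None
    return min(sites, key=cost)


rls = ReplicaLocationService()
rls.register("dataset-001", "site-a")
rls.register("dataset-001", "site-b")
latency_ms = {"site-a": 120, "site-b": 35}  # assumed measurements
source = best_source(rls, "dataset-001", lambda s: latency_ms[s])
```

Because the data are read-only, the catalog only has to track where copies are, not which copy is current.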

Thank You