CSE 486/586 Distributed Systems Time and Synchronization

Slides:



Advertisements
Similar presentations
Computer Science 425 Distributed Systems CS 425 / ECE 428  2013, I. Gupta, K. Nahrtstedt, S. Mitra, N. Vaidya, M. T. Harandi, J. Hou.
Advertisements

CSE 486/586, Spring 2013 CSE 486/586 Distributed Systems Time and Synchronization Steve Ko Computer Sciences and Engineering University at Buffalo.
Computer Science 425 Distributed Systems CS 425 / CSE 424 / ECE 428 Fall 2011 August 30, 2011 Lecture 3 Time and Synchronization Reading: Sections
Distributed Systems Fall 2010 Time and synchronization.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Time and Synchronization Steve Ko Computer Sciences and Engineering University at Buffalo.
Teaching material based on Distributed Systems: Concepts and Design, Edition 3, Addison-Wesley Copyright © George Coulouris, Jean Dollimore, Tim.
© Chinese University, CSE Dept. Distributed Systems / Distributed Systems Topic 9: Time, Coordination and Replication Dr. Michael R. Lyu Computer.
Distributed Systems CS Synchronization – Part II Lecture 8, Sep 28, 2011 Majd F. Sakr, Vinay Kolar, Mohammad Hammoud.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Logical Time Steve Ko Computer Sciences and Engineering University at Buffalo.
Lecture 9: Time & Clocks CDK4: Sections 11.1 – 11.4 CDK5: Sections 14.1 – 14.4 TVS: Sections 6.1 – 6.2 Topics: Synchronization Logical time (Lamport) Vector.
Lecture 2-1 CS 425/ECE 428 Distributed Systems Lecture 2 Time & Synchronization Reading: Klara Nahrstedt.
CSE 486/586, Spring 2013 CSE 486/586 Distributed Systems Logical Time Steve Ko Computer Sciences and Engineering University at Buffalo.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Mutual Exclusion Steve Ko Computer Sciences and Engineering University at Buffalo.
Time and Coordination March 13, Time and Coordination What is time? :-)  Issue: How do you coordinate distributed computers if there is no global.
CSE 486/586 CSE 486/586 Distributed Systems Logical Time Steve Ko Computer Sciences and Engineering University at Buffalo.
CSE 486/586, Spring 2013 CSE 486/586 Distributed Systems Global States Steve Ko Computer Sciences and Engineering University at Buffalo.
Lecture 9: Time and clocks (Chap 11) Haibin Zhu, PhD. Assistant Professor Department of Computer Science Nipissing University © 2002.
Time This powerpoint presentation has been adapted from: 1) sApr20.ppt.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Mutual Exclusion & Leader Election Steve Ko Computer Sciences and Engineering University.
Lecture 5-1 Computer Science 425 Distributed Systems CS 425 / ECE 428 Fall 2013 Indranil Gupta (Indy) September 10, 2013 Lecture 5 Time and Synchronization.
Time and global states Chapter 11. Outline Introduction Clocks, events and process states Synchronizing physical clocks Logical time and logical clocks.
Distributed Systems Topic 5: Time, Coordination and Agreement
CSE 486/586 CSE 486/586 Distributed Systems Global States Steve Ko Computer Sciences and Engineering University at Buffalo.
Distributed Systems Lecture 5 Time and synchronization 1.
CSE 486/586, Spring 2014 CSE 486/586 Distributed Systems Logical Time Steve Ko Computer Sciences and Engineering University at Buffalo.
Indranil Gupta (Indy) Topic: Time and Ordering Lecture A: Introduction and Basics Cloud Computing concepts All slides © IG.
CSE 486/586 CSE 486/586 Distributed Systems Time and Synchronization Steve Ko Computer Sciences and Engineering University at Buffalo.
Distributed Web Systems Time and Global State Lecturer Department University.
Topic 7: Time and Global State Dr. Ayman Srour Faculty of Applied Engineering and Urban Planning University of Palestine.
Proof of liveness: an example
CSE 486/586 Distributed Systems Gossiping
CSE 486/586 Distributed Systems Failure Detectors
CSE 486/586 Distributed Systems Leader Election
Distributed Computing
Distributed Systems CS
Time and Synchronization
Time and Clock Primary standard = rotation of earth
CSE 486/586 Distributed Systems Logical Time
Time and Global States Ali Fanian Isfahan University of Technology
CSE 486/586 Distributed Systems Global States
Lecture 5 Time and synchronization
CSE 486/586 Distributed Systems Time and Synchronization
CSE 486/586 Distributed Systems Failure Detectors
Time and Synchronization
CSE 486/586 Distributed Systems Failure Detectors
CSE 486/586 Distributed Systems Logical Time
Time and Clock.
Logical time (Lamport)
湖南大学-信息科学与工程学院-计算机与科学系
Distributed Systems CS
Lecture 12: Time and Ordering
Distributed Systems CS
Time and Clock.
CSE 486/586 Distributed Systems Leader Election
CSE 486/586 Distributed Systems Mutual Exclusion
Distributed Systems CS
Physical clock synchronization
CDK: Sections 11.1 – 11.4 TVS: Sections 6.1 – 6.2
Logical time (Lamport)
Logical time (Lamport)
CSE 486/586 Distributed Systems Global States
CSE 486/586 Distributed Systems Failure Detectors
CSE 486/586 Distributed Systems Reliable Multicast --- 2
CSE 486/586 Distributed Systems Logical Time
CSE 486/586 Distributed Systems Mutual Exclusion
CSE 486/586 Distributed Systems Byzantine Fault Tolerance
Logical time (Lamport)
CSE 486/586 Distributed Systems Leader Election
Last Class: Naming Name distribution: use hierarchies DNS
Outline Theoretical Foundations
Presentation transcript:

CSE 486/586 Distributed Systems Time and Synchronization Steve Ko Computer Sciences and Engineering University at Buffalo

Last Time Models of Distributed Systems Failure detectors---why? Synchronous systems Asynchronous systems Failure detectors---why? Because things do fail. Failure detectors---what? Properties: completeness & accuracy Cannot have a perfect failure detector Metrics: bandwidth, detection time, scale, accuracy Failure detectors---how? Two processes: Heartbeating and Ping Multiple processes: Centralized, ring, all-to-all

Today’s Question The topic of time Why? What? How? Today and next time Need to know when things happen One of the fundamental challenges What? Ideally, we’d like to know when exactly something happened. How? Let’s see!

Today’s Question Servers in the cloud need to timestamp events Server A and server B in the cloud can have different clock values. The cloud has server A and server B that service customers. You try to purchase an airline ticket online via the cloud. It’s the last airline ticket available on that flight. Server A timestamps your attempt at 9h:15m:32.45s. Server B timestamps someone else’s attempt at 9h:20m:22.76s. Who should get the ticket? What if Server A’s clock was > 10 minutes ahead of server B’s clock? Behind? How would you know what the difference was at those times?

Physical Clocks & Synchronization Some definitions: Clock Skew versus Drift Clock Skew = Relative Difference in clock values of two processes Clock Drift = Relative Difference in clock frequencies (rates) of two processes A non-zero clock drift will cause skew to continuously increase. Real-life examples Ever had “make: warning: Clock skew detected. Your build may be incomplete.”? It’s reported that in the worst case, there’s 1 sec/day drift in modern HW. Almost all physical clocks experience this.

Synchronizing Physical Clocks Ci(t): the reading of the software clock at process i when the real time is t. External synchronization: For a synchronization bound D>0, and for source S of UTC time, for i=1,2,...,N and for all real times t. Clocks Ci are accurate to within the bound D. Internal synchronization: For a synchronization bound D>0, for i, j=1,2,...,N and for all real times t. Clocks Ci agree within the bound D. External synchronization with D  Internal synchronization with 2D Internal synchronization with D  External synchronization with ??

Clock Synchronization Using a Time Server Client: “What time is it?” Server: “It’s t.” Any difficulty? m r t p Time server,S Any problem?

Cristian’s Algorithm Uses a time server to synchronize clocks Mainly designed for LAN Time server keeps the reference time (say UTC) A client asks the time server for time, the server responds with its current time T, and the client uses the received value T to set its clock But network round-trip time introduces an error. So what do we need to do? Estimate one-way delay

Cristian’s Algorithm Let RTT = response-received-time – request-sent-time (measurable at client) Also, suppose we know The minimum value min of the client-server one-way transmission time [Depends on what?] (to simplify our discussion, let’s say there’s a single min) That the server timestamped the message at the last possible instant before sending it back Ideally, the client should set its time to: T + (one-way latency from the server to the client)

Cristian’s Algorithm But we don’t know the one-way latency from the server to the client. When the client receives the time (T) from the server, T can be in a range of possible values. Consider two extreme cases.

Cristian’s Algorithm Case 1 Case2 T min RTT Response received Server sends response. T min RTT Response received Request sent Server sends response. T min RTT Response received Request sent

Cristian’s Algorithm Server time T could be in the following range. When the client receives the time (T) from the server, the actual time that the client should set could be between [T + min, T + RTT - min] T min min RTT Response received Request sent

Cristian’s Algorithm (From the previous slide), the accuracy is: +-(RTT/2 – min) Cristian’s algorithm A client asks its time server. The time server sends its time T. The client estimates the one-way delay and sets its time. It uses T + RTT/2 Want to improve accuracy? Take multiple readings and use the minimum RTT  tighter bound For unusually long RTTs, ignore them and repeat the request  removing outliers

CSE 486/586 Administrivia Please start PA2-A. Grades will go to UBlearns. Will post grades for PA1 probably early next week. Please use Piazza; all announcements will go there.

The Network Time Protocol (NTP) Uses a network of time servers to synchronize all processes on a network. Designed for the Internet Why not Christian’s algo.? Time servers are connected by a synchronization subnet tree. The root is in touch with UTC. Each node synchronizes its children nodes. Why? Primary server, direct sync. 1 Secondry servers, sync’ed by the primary server 2 2 2 Strata 3, sync’ed by the secondary servers 3 3 3 3 3 3

Messages Exchanged Between a Pair of NTP Peers (“Connected Servers”) Each message bears timestamps of recent message events: the local time when the previous NTP message was sent and received, and the local time when the current message was transmitted. T i i-1 -2 - 3 Server B Server A Time m m'

The Protocol Compute round-trip delay: (Ti – Ti-3) – (Ti-1 – Ti-2) Take the half of the round-trip delay as the one-way estimate: ((Ti – Ti-3) – (Ti-1 – Ti-2))/2 T i i-1 -2 - 3 Server Client Time m m'

The Protocol Compute offset: Ti-1 + (one-way estimate) - Ti = ((Ti-2 – Ti-3) + (Ti-1 – Ti))/2 Get this offset with not just one server, but multiple servers. Do some statistical analysis, remove outliers, and apply a data filtering algorithm. T i i-1 -2 - 3 Server Client Time m m'

Theoretical Base for NTP oi: estimate of the actual offset between the two clocks di: estimate of the bounds of oi ; total transmission times for m and m’; di=t+t’ T i i-1 -2 - 3 Server B Server A Time m m' (with delay t) (with delay t’)

Theoretical Base for NTP -2 - 3 Server B Server A Time m m' (with delay t) (with delay t’)

Then a Breakthrough… We cannot sync multiple clocks perfectly. Thus, if we want to order events happened at different processes (remember the ticket reservation example?), we cannot rely on physical clocks. Then came logical time. First proposed by Leslie Lamport in the 70’s Based on causality of events Defined relative time, not absolute time Critical observation: time (ordering) only matters if two or more processes interact, i.e., send/receive messages.

Events Occurring at Three Processes

Summary Time synchronization important for distributed systems Cristian’s algorithm NTP Relative order of events enough for practical purposes Lamport’s logical clocks Next: continue on logical clocks

Acknowledgements These slides contain material developed and copyrighted by Indranil Gupta at UIUC.