Distributed Systems. Day 7: Time and Snapshots [Part 1]

Size: px
Start display at page:

Download "Distributed Systems. Day 7: Time and Snapshots [Part 1]"

Transcription

1

2 Distributed Systems Day 7: Time and Snapshots [Part 1]

3 Delete Professor Theo!!!! 10am Delete Professor Theo!!!! Server 1 I hate Professor Theo!! But Why? 4pm I hate Professor Theo!! 4:01pm But Why? I hate Professor Theo!! But Why? Server 2 Delete Professor Theo!!!!

4 10am John had an accident!!! He s well now J Server 1 John had an accident!!! Status Comment: Good News!! 4pm 5pm He s well now J Status Comment: Good News!! John had an accident!!! He s well now J Server 2 Status Comment: Good News!!

5 Server 1 Server 2 Server 3

6 Agenda Time Synchronization (NTP, PTP) Challenges and Limitations Distributed Systems Notions Total ordering, partial ordering Logical Time Logical Clocks Vector Clocks

7 How do Clocks works?

8 Clocks and Clock Skew Time is a function of a crystal s oscillations Several oscillations == one second Crystal does not always oscillate at right frequency. So time becomes faster or slower Terminology: describing differences in clocks Skew: difference between two clocks! Drift: counting at different frequency than intended 10 oscillations == 1 sec System Quartz Crystal oscillate Clock 1 Clock 2 10:00am 10:05am Skew = 10am 10:05am

9 Synchronizing Clocks

10 PTP: Precision Time Protocol What time is it? Time 10 am! One client synchronizes with an authoritative clock Through exchange of network messages Problems: Delay between messages Delay in processing Server may be wrong!!!

11 How does your phone get its Time?

12 Clock Synchronization Challenges One client synchronizes with an authoritative clock Through exchange of network messages Problems: Delay between messages Delay in processing Server may be wrong!!! Network Transfer time Network Transfer time Time??? Time: 10 am! 9:30am! 9:50am! 10:00am! 10:20am!

13 Clock Synchronization Common protocols: NTP, PTP Cheap and inaccurate across long distances More accurate protocols: GPS, atomic clocks Requires expensive hardware Signaling issues when indoors Do we really need time precision? What do we care about?

14 Clock Synchronization Common protocols: NTP, PTP Cheap and inaccurate across long distances More accurate protocols: GPS Expensive and hard to retrofit at global scale Signaling issues when indoors Do we really need time precision? What do we care about?

15 Ordering Total ordering Each server can order events (in same order) Possible with a global entity (e.g. global clock) Partial ordering:!(total ordering) Some ordering that s not total ordered

16 Client sends to server 1 and server 2 Server 1 Server 2 Total ordered Server 1 Server 2 Total ordered Server 1 Server 2 Total ordered Server 1 Server 2 Not Total ordered Recall ``total ordering events processed in the same order in all servers.

17 Linearizability: The Golden Standard. Total order: all servers process message in same order Linearizability: special case of total order Order is based on real time Client sends to server 1 and server 2 Server 1 Server 2 Total ordered Server 1 Server 2 Total ordered

18 10am John had an accident!!! All servers process in same order Different servers Can process in different orders but events from clients in same order 4pm He s well now J Cut myself! Headed to the Dr. Status Comment: Bummer!!!! Cut myself! Headed to the Dr. Status Comment: Bummer!!!! Cut myself! Headed to the Dr. Status Comment: Bummer!!!! Cut myself! Headed to the Dr. I m fixed!!!! I m fixed!!!! I m fixed!!!! I m fixed!!!! Status Comment: Bummer!!!! Status Comment: Good News!! Status Comment: Good News!! Status Comment: Good News!! Status Comment: Good News!! 11am 5pm Status Comment: Hope He s Okay!!!! Status Comment: Good News!! Server 1 Server 2 Server 1 Server 2 Total Order: order based on global time FIFO Order: order based on client time

19 How can we implement total/fifo ordering? System Assumptions No server (node) failure. Network is reliable: messages are delivered eventually Messages are delivered in-order

20 Implementing Total Ordering Key insight: need a third party to assign IDs Step 2: ID_i++ Global Sequencer Process Pseudocode Before sending message, m, ask for an ID Include ID with messages Send to all Servers!!! Step1: GetID() Step 2: sendmsg (ID_i) Server 1 Global Sequencer Pseudocode Start ID =0 On receive ID request, Increment ID (so, ID++) Assign ID to Process clients Step3: sendmsg (m, ID_i) Server Pseudocode On receive message place in queue Order Queue by ID process Queue in order of ID clients Server 2

21 Implementing FIFO Ordering Local sequencer: Process locally maintain ID ID is monotonic Client Pseudocode Increment ID Assign ID to message Send message to all servers ID_A++; sendmsg (m, ID_A) Server 1 Server Pseudocode Maintain a queue for each client Order message by ID -- process in order This is FIFO Order ID_B++; clients clients sendmsg (m, ID_B) Client maintain different IDs: local and monotonic Server 2

22 How can we implement total/fifo ordering? Current: System Assumptions No server (node) failure. Network is reliable: Messages are delivered eventually New: System Assumptions Synchronous failures. Network is unreliable: Messages can be lost Messages can be duplicated

23 Do We Always Need Total Ordering? For some applications: yes!!! My back account For others: ``causal ordering Dependency between events are preserved A à B: on all servers A is processed before B. New Post: 1 Comment on Post Like post New Post: 2 Comment on Post Like post Make file1 Write to file1 Make file2 Delete file2 Add $100 Add $100 How do I capture and enforce causal ordering??? New Post: 3 Comment on Post Like post

24 Logical Clocks Each process maintains a ID i.e., Clock P1 A C E G Initially set to 0 Rules for updating Clock Each event increments the ID Event: Send msg, recvmsg, or run function Include ID in every message On Receiving a message: ID = max (my_id, ID-in-msg) ++ Only way to exchange information is through message exchange IDs are only exchanged through message exchange P2 B A C E G D F B D F If x->z, then clock(z) > clock(x)

25 Ordering with Logical Clocks Which ordering do you get? Total? FIFO? Causal? P1 A C m1 E G m2 Is B before or after A? Is C before or after D? Is F before or after E? If e->e, then clock(e ) > clock(e) Logical clocks do help with inferring ordering: can confirm P2 B A 1 C 2 E 3 G 5 P1 D F B 1 D 3 F 4 P2

26 Vector Clocks [0,0] Each process maintains an array of ID Array is of size N (N= # of processes) All entries initialized to 0 Array is called a Vector clock (VC) P1 [0,0] P2 A B C m1 D E F G m2 Size (VC i ) == N VC i [i] Number of events at P i VC i [h] = K Process P i is aware of the first k events at P h If VC x <= VC z then x->z (x happens before z) A [?,? ] C [?,? ] E [?,? ] G [?,? ] B [?,? ] D [?,? ] F [?,? ]

27 Vector Clocks [0,0] Each process maintains an array of ID Array is of size N (N= # of processes) All entries initialized to 0 Rules for updating Vector Clock for process Z Each event increments VC z [z] So each event increments processes clock in vector Event: Send msg, recv msg, or run function Include ENTIRE vector in every message On Receiving a message from process X This message will have VC x [] X s vector clock Z updates its clock for each entry: VC z [K] = max (VC z [K], VC x [k]) ++ Only way to exchange information is through message exchange IDs are only exchanged through message exchange P1 [0,0] P2 A B A [?,? ] C [?,? ] E [?,? ] G [?,? ] C m1 D E G m2 F B [?,? ] D [?,? ] F [?,? ] If VC x <= VC z then x->z (x happens before z)

28 Ordering with Vector Clocks Which ordering do you get? Total? FIFO? Causal? P1 A C m1 E G m2 Is B before or after A? Is C before or after D? Is F before or after E? P2 B A [ 1, 0 ] C [ 2, 0 ] E [ 3, 0 ] G [ 4, 3 ] D F B [ 0, 1 ] D [ 2, 2 ] F [ 2, 3 ] If VClock(e ) x <= z >= then VClock(e) x->z (x happens then e->e before z) Vector clocks are enough for FIFO and Causal ordering P1 P2

29 Terminology Review Total order: All servers process events in same order Linearizability: total order according to global time Requires: magical global time or a global mutex FIFO order: all servers process events in order generated by client More practical: you can trust client clock to be monotonic Logical clock: counters Vector clock: a vector of logical clocks

30 Logical Time Vector clocks provider better ordering than logical clocks HOWEVER!! Expensive scales with # of processes Network message: include vector clocks Processes: must maintain vector clocks Total ordering is a close second but too costly Achievable with global sequencer Ideal is Linearizability but impractical FIFO ordering: easy but less meaningful Causal ordering: goal for most distributed systems Achievable with vector clocks

Distributed Systems. Day 9: Replication [Part 1]

Distributed Systems. Day 9: Replication [Part 1] Distributed Systems Day 9: Replication [Part 1] Hash table k 0 v 0 k 1 v 1 k 2 v 2 k 3 v 3 ll Facebook Data Does your client know about all of F s servers? Security issues? Performance issues? How do clients

More information

CSE 5306 Distributed Systems

CSE 5306 Distributed Systems CSE 5306 Distributed Systems Synchronization Jia Rao http://ranger.uta.edu/~jrao/ 1 Synchronization An important issue in distributed system is how process cooperate and synchronize with one another Cooperation

More information

Arranging lunch value of preserving the causal order. a: how about lunch? meet at 12? a: <receives b then c>: which is ok?

Arranging lunch value of preserving the causal order. a: how about lunch? meet at 12? a: <receives b then c>: which is ok? Lamport Clocks: First, questions about project 1: due date for the design document is Thursday. Can be less than a page what we re after is for you to tell us what you are planning to do, so that we can

More information

Lecture 10: Clocks and Time

Lecture 10: Clocks and Time 06-06798 Distributed Systems Lecture 10: Clocks and Time Distributed Systems 1 Time service Overview requirements and problems sources of time Clock synchronisation algorithms clock skew & drift Cristian

More information

TIME AND SYNCHRONIZATION. I. Physical Clock Synchronization: Motivation and Challenges

TIME AND SYNCHRONIZATION. I. Physical Clock Synchronization: Motivation and Challenges TIME AND SYNCHRONIZATION In previous lectures, we discussed some important concepts and goals of distributed. One important concept is remote procedure calls, where we saw how failures creep up into semantics

More information

Lecture 12: Time Distributed Systems

Lecture 12: Time Distributed Systems Lecture 12: Time Distributed Systems Behzad Bordbar School of Computer Science, University of Birmingham, UK Lecture 12 1 Overview Time service requirements and problems sources of time Clock synchronisation

More information

Chapter 6 Synchronization (1)

Chapter 6 Synchronization (1) DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN Chapter 6 Synchronization (1) With material from Ken Birman Tanenbaum & Van Steen, Distributed Systems:

More information

UNIT IV 1. What is meant by hardware and software clock? Clock devices can be programmed to generate interrupts at regular intervals in orders that, for example, time slicing can be implemented.the operating

More information

殷亚凤. Synchronization. Distributed Systems [6]

殷亚凤. Synchronization. Distributed Systems [6] Synchronization Distributed Systems [6] 殷亚凤 Email: yafeng@nju.edu.cn Homepage: http://cs.nju.edu.cn/yafeng/ Room 301, Building of Computer Science and Technology Review Protocols Remote Procedure Call

More information

Clock and Time. THOAI NAM Faculty of Information Technology HCMC University of Technology

Clock and Time. THOAI NAM Faculty of Information Technology HCMC University of Technology Clock and Time THOAI NAM Faculty of Information Technology HCMC University of Technology Using some slides of Prashant Shenoy, UMass Computer Science Chapter 3: Clock and Time Time ordering and clock synchronization

More information

CSE 5306 Distributed Systems. Synchronization

CSE 5306 Distributed Systems. Synchronization CSE 5306 Distributed Systems Synchronization 1 Synchronization An important issue in distributed system is how processes cooperate and synchronize with one another Cooperation is partially supported by

More information

Clock Synchronization. Synchronization. Clock Synchronization Algorithms. Physical Clock Synchronization. Tanenbaum Chapter 6 plus additional papers

Clock Synchronization. Synchronization. Clock Synchronization Algorithms. Physical Clock Synchronization. Tanenbaum Chapter 6 plus additional papers Clock Synchronization Synchronization Tanenbaum Chapter 6 plus additional papers Fig 6-1. In a distributed system, each machine has its own clock. When this is the case, an event that occurred after another

More information

Time Synchronization and Logical Clocks

Time Synchronization and Logical Clocks Time Synchronization and Logical Clocks CS 240: Computing Systems and Concurrency Lecture 5 Marco Canini Credits: Michael Freedman and Kyle Jamieson developed much of the original material. Today 1. The

More information

Distributed Coordination 1/39

Distributed Coordination 1/39 Distributed Coordination 1/39 Overview Synchronization of distributed processes requires new concepts in addition to synchronization of processes in single multi-core systems. Topics: Notion of global

More information

Time Synchronization and Logical Clocks

Time Synchronization and Logical Clocks Time Synchronization and Logical Clocks CS 240: Computing Systems and Concurrency Lecture 5 Mootaz Elnozahy Today 1. The need for time synchronization 2. Wall clock time synchronization 3. Logical Time

More information

Last Class: Naming. Today: Classical Problems in Distributed Systems. Naming. Time ordering and clock synchronization (today)

Last Class: Naming. Today: Classical Problems in Distributed Systems. Naming. Time ordering and clock synchronization (today) Last Class: Naming Naming Distributed naming DNS LDAP Lecture 12, page 1 Today: Classical Problems in Distributed Systems Time ordering and clock synchronization (today) Next few classes: Leader election

More information

Distributed Systems COMP 212. Lecture 17 Othon Michail

Distributed Systems COMP 212. Lecture 17 Othon Michail Distributed Systems COMP 212 Lecture 17 Othon Michail Synchronisation 2/29 What Can Go Wrong Updating a replicated database: Customer (update 1) adds 100 to an account, bank employee (update 2) adds 1%

More information

Time in Distributed Systems

Time in Distributed Systems Time Slides are a variant of slides of a set by Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved. 0-13- 239227-5 Time in Distributed

More information

Beyond TrueTime: Using AugmentedTime for Improving Spanner

Beyond TrueTime: Using AugmentedTime for Improving Spanner Beyond TrueTime: Using AugmentedTime for Improving Spanner Murat Demirbas University at Buffalo, SUNY demirbas@cse.buffalo.edu Sandeep Kulkarni Michigan State University, sandeep@cse.msu.edu Spanner [1]

More information

TIME ATTRIBUTION 11/4/2018. George Porter Nov 6 and 8, 2018

TIME ATTRIBUTION 11/4/2018. George Porter Nov 6 and 8, 2018 TIME George Porter Nov 6 and 8, 2018 ATTRIBUTION These slides are released under an Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) Creative Commons license These slides incorporate

More information

Synchronization. Distributed Systems IT332

Synchronization. Distributed Systems IT332 Synchronization Distributed Systems IT332 2 Outline Clock synchronization Logical clocks Election algorithms Mutual exclusion Transactions 3 Hardware/Software Clocks Physical clocks in computers are realized

More information

Synchronisation in. Distributed Systems. Co-operation and Co-ordination in. Distributed Systems. Kinds of Synchronisation. Clock Synchronization

Synchronisation in. Distributed Systems. Co-operation and Co-ordination in. Distributed Systems. Kinds of Synchronisation. Clock Synchronization Co-operation and Co-ordination in Distributed ystems Naming for searching communication partners Communication Mechanisms for the communication process ynchronisation in Distributed ystems But... Not enough

More information

Clock-Synchronisation

Clock-Synchronisation Chapter 2.7 Clock-Synchronisation 1 Content Introduction Physical Clocks - How to measure time? - Synchronisation - Cristian s Algorithm - Berkeley Algorithm - NTP / SNTP - PTP IEEE 1588 Logical Clocks

More information

Distributed Systems. Day 13: Distributed Transaction. To Be or Not to Be Distributed.. Transactions

Distributed Systems. Day 13: Distributed Transaction. To Be or Not to Be Distributed.. Transactions Distributed Systems Day 13: Distributed Transaction To Be or Not to Be Distributed.. Transactions Summary Background on Transactions ACID Semantics Distribute Transactions Terminology: Transaction manager,,

More information

Distributed Systems. Time, clocks, and Ordering of events. Rik Sarkar. University of Edinburgh Fall 2014

Distributed Systems. Time, clocks, and Ordering of events. Rik Sarkar. University of Edinburgh Fall 2014 Distributed Systems Time, clocks, and Ordering of events Rik Sarkar University of Edinburgh Fall 2014 Notes Last lecture: MST, MIS Ref: Lynch, distributed algorithms Today: Time, clocks, NTP Ref: CDK Causality,

More information

DISTRIBUTED COMPUTER SYSTEMS

DISTRIBUTED COMPUTER SYSTEMS DISTRIBUTED COMPUTER SYSTEMS CONSISTENCY AND REPLICATION CONSISTENCY MODELS Dr. Jack Lange Computer Science Department University of Pittsburgh Fall 2015 Consistency Models Background Replication Motivation

More information

Time. COS 418: Distributed Systems Lecture 3. Wyatt Lloyd

Time. COS 418: Distributed Systems Lecture 3. Wyatt Lloyd Time COS 418: Distributed Systems Lecture 3 Wyatt Lloyd Today 1. The need for time synchronization 2. Wall clock time synchronization 3. Logical Time: Lamport Clocks 2 A distributed edit-compile workflow

More information

Distributed Systems. replication Johan Montelius ID2201. Distributed Systems ID2201

Distributed Systems. replication Johan Montelius ID2201. Distributed Systems ID2201 Distributed Systems ID2201 replication Johan Montelius 1 The problem The problem we have: servers might be unavailable The solution: keep duplicates at different servers 2 Building a fault-tolerant service

More information

Multicast Communication. Brian Nielsen

Multicast Communication. Brian Nielsen Multicast Communication Brian Nielsen bnielsen@cs.aau.dk Communication modes in DS Uni-cast Messages are sent from exactly one process to one process Broad-cast Messages are sent from exactly one process

More information

Advanced Topics in Distributed Systems. Dr. Ayman A. Abdel-Hamid. Computer Science Department Virginia Tech

Advanced Topics in Distributed Systems. Dr. Ayman A. Abdel-Hamid. Computer Science Department Virginia Tech Advanced Topics in Distributed Systems Dr. Ayman A. Abdel-Hamid Computer Science Department Virginia Tech Synchronization (Based on Ch6 in Distributed Systems: Principles and Paradigms, 2/E) Synchronization

More information

Distributed Systems. Clock Synchronization: Physical Clocks. Paul Krzyzanowski

Distributed Systems. Clock Synchronization: Physical Clocks. Paul Krzyzanowski Distributed Systems Clock Synchronization: Physical Clocks Paul Krzyzanowski pxk@cs.rutgers.edu Except as otherwise noted, the content of this presentation is licensed under the Creative Commons Attribution

More information

Time in Distributed Systems

Time in Distributed Systems in Distributed Systems There is no common universal time (Einstein) but the speed of light is constant for all observers irrespective of their velocity event e2 at earth time t2 ---- large distances ----

More information

Lecture 6: Logical Time

Lecture 6: Logical Time Lecture 6: Logical Time 1. Question from reviews a. 2. Key problem: how do you keep track of the order of events. a. Examples: did a file get deleted before or after I ran that program? b. Did this computers

More information

Distributed Systems. 05. Clock Synchronization. Paul Krzyzanowski. Rutgers University. Fall 2017

Distributed Systems. 05. Clock Synchronization. Paul Krzyzanowski. Rutgers University. Fall 2017 Distributed Systems 05. Clock Synchronization Paul Krzyzanowski Rutgers University Fall 2017 2014-2017 Paul Krzyzanowski 1 Synchronization Synchronization covers interactions among distributed processes

More information

Causal Consistency and Two-Phase Commit

Causal Consistency and Two-Phase Commit Causal Consistency and Two-Phase Commit CS 240: Computing Systems and Concurrency Lecture 16 Marco Canini Credits: Michael Freedman and Kyle Jamieson developed much of the original material. Consistency

More information

Distributed Systems. Pre-Exam 1 Review. Paul Krzyzanowski. Rutgers University. Fall 2015

Distributed Systems. Pre-Exam 1 Review. Paul Krzyzanowski. Rutgers University. Fall 2015 Distributed Systems Pre-Exam 1 Review Paul Krzyzanowski Rutgers University Fall 2015 October 2, 2015 CS 417 - Paul Krzyzanowski 1 Selected Questions From Past Exams October 2, 2015 CS 417 - Paul Krzyzanowski

More information

Michel Raynal. Distributed Algorithms for Message-Passing Systems

Michel Raynal. Distributed Algorithms for Message-Passing Systems Michel Raynal Distributed Algorithms for Message-Passing Systems Contents Part I Distributed Graph Algorithms 1 Basic Definitions and Network Traversal Algorithms... 3 1.1 DistributedAlgorithms... 3 1.1.1

More information

Distributed Systems. coordination Johan Montelius ID2201. Distributed Systems ID2201

Distributed Systems. coordination Johan Montelius ID2201. Distributed Systems ID2201 Distributed Systems ID2201 coordination Johan Montelius 1 Coordination Coordinating several threads in one node is a problem, coordination in a network is of course worse: failure of nodes and networks

More information

Accurate Clock Models for Simulating Wireless Sensor Networks

Accurate Clock Models for Simulating Wireless Sensor Networks Accurate Clock Models for Simulating Wireless Sensor Networks F. Ferrari, A. Meier, L. Thiele TIK Institute ETH Zurich 1 Motivations: Hardware Clocks of Sensor Nodes Digital clocks A counter counts time

More information

2. Time and Global States Page 1. University of Freiburg, Germany Department of Computer Science. Distributed Systems

2. Time and Global States Page 1. University of Freiburg, Germany Department of Computer Science. Distributed Systems 2. Time and Global States Page 1 University of Freiburg, Germany Department of Computer Science Distributed Systems Chapter 3 Time and Global States Christian Schindelhauer 12. May 2014 2. Time and Global

More information

Distributed Systems. Day 2: Principles Jan 29, 2019

Distributed Systems. Day 2: Principles Jan 29, 2019 Distributed Systems Day 2: Principles Jan 29, 2019 Agenda Administrative Information System Model Failures and Fault-Detection Communication Partition/Replication Administrivia Ice-cream social: Thursdays

More information

CSE 486/586 Distributed Systems Reliable Multicast --- 1

CSE 486/586 Distributed Systems Reliable Multicast --- 1 Distributed Systems Reliable Multicast --- 1 Steve Ko Computer Sciences and Engineering University at Buffalo Last Time Global states A union of all process states Consistent global state vs. inconsistent

More information

MiFID II and beyond. In depth session on a slightly different approach to compliance validation. George Nowicki, TP ICAP ITSF 2017

MiFID II and beyond. In depth session on a slightly different approach to compliance validation. George Nowicki, TP ICAP ITSF 2017 MiFID II and beyond. In depth session on a slightly different approach to compliance validation. George Nowicki, TP ICAP ITSF 2017 MiFID II clock sync Global traceability of financial events 100 [us] macro

More information

Spanner: Google s Globally-Distributed Database. Wilson Hsieh representing a host of authors OSDI 2012

Spanner: Google s Globally-Distributed Database. Wilson Hsieh representing a host of authors OSDI 2012 Spanner: Google s Globally-Distributed Database Wilson Hsieh representing a host of authors OSDI 2012 What is Spanner? Distributed multiversion database General-purpose transactions (ACID) SQL query language

More information

Distributed Systems Exam 1 Review Paul Krzyzanowski. Rutgers University. Fall 2016

Distributed Systems Exam 1 Review Paul Krzyzanowski. Rutgers University. Fall 2016 Distributed Systems 2015 Exam 1 Review Paul Krzyzanowski Rutgers University Fall 2016 1 Question 1 Why did the use of reference counting for remote objects prove to be impractical? Explain. It s not fault

More information

Lecture 7: Logical Time

Lecture 7: Logical Time Lecture 7: Logical Time 1. Question from reviews a. In protocol, is in- order delivery assumption reasonable? i. TCP provides it b. In protocol, need all participants to be present i. Is this a reasonable

More information

Distributed Computing. CS439: Principles of Computer Systems November 19, 2018

Distributed Computing. CS439: Principles of Computer Systems November 19, 2018 Distributed Computing CS439: Principles of Computer Systems November 19, 2018 Bringing It All Together We ve been studying how an OS manages a single CPU system As part of that, it will communicate with

More information

CS505: Distributed Systems

CS505: Distributed Systems Cristina Nita-Rotaru CS505: Distributed Systems Protocols. Slides prepared based on material by Prof. Ken Birman at Cornell University, available at http://www.cs.cornell.edu/ken/book/ Required reading

More information

CS Final Review

CS Final Review CS 2510 - Final Review Topics Clock Synchroniza>on Physical Clocks Clock Synchroniza>on Algorithms Network Time Protocol Berkeley Algorithm Clock Synchroniza>on in Wireless Networks Logical Clocks Lamport

More information

Implementing caches. Example. Client. N. America. Client System + Caches. Asia. Client. Africa. Client. Client. Client. Client. Client.

Implementing caches. Example. Client. N. America. Client System + Caches. Asia. Client. Africa. Client. Client. Client. Client. Client. N. America Example Implementing caches Doug Woos Asia System + Caches Africa put (k2, g(get(k1)) put (k2, g(get(k1)) What if clients use a sharded key-value store to coordinate their output? Or CPUs use

More information

Chapter 5: Distributed Systems: Fault Tolerance. Fall 2013 Jussi Kangasharju

Chapter 5: Distributed Systems: Fault Tolerance. Fall 2013 Jussi Kangasharju Chapter 5: Distributed Systems: Fault Tolerance Fall 2013 Jussi Kangasharju Chapter Outline n Fault tolerance n Process resilience n Reliable group communication n Distributed commit n Recovery 2 Basic

More information

The Timed Asynchronous Distributed System Model By Flaviu Cristian and Christof Fetzer

The Timed Asynchronous Distributed System Model By Flaviu Cristian and Christof Fetzer The Timed Asynchronous Distributed System Model By Flaviu Cristian and Christof Fetzer - proposes a formal definition for the timed asynchronous distributed system model - presents measurements of process

More information

Distributed Computing. CS439: Principles of Computer Systems November 20, 2017

Distributed Computing. CS439: Principles of Computer Systems November 20, 2017 Distributed Computing CS439: Principles of Computer Systems November 20, 2017 Last Time Network Programming: Sockets End point of communication Identified by (IP address : port number) pair Client-Side

More information

殷亚凤. Consistency and Replication. Distributed Systems [7]

殷亚凤. Consistency and Replication. Distributed Systems [7] Consistency and Replication Distributed Systems [7] 殷亚凤 Email: yafeng@nju.edu.cn Homepage: http://cs.nju.edu.cn/yafeng/ Room 301, Building of Computer Science and Technology Review Clock synchronization

More information

DISTRIBUTED MUTEX. EE324 Lecture 11

DISTRIBUTED MUTEX. EE324 Lecture 11 DISTRIBUTED MUTEX EE324 Lecture 11 Vector Clocks Vector clocks overcome the shortcoming of Lamport logical clocks L(e) < L(e ) does not imply e happened before e Goal Want ordering that matches causality

More information

Network File System (NFS)

Network File System (NFS) Network File System (NFS) Nima Honarmand User A Typical Storage Stack (Linux) Kernel VFS (Virtual File System) ext4 btrfs fat32 nfs Page Cache Block Device Layer Network IO Scheduler Disk Driver Disk NFS

More information

Verteilte Systeme (Distributed Systems)

Verteilte Systeme (Distributed Systems) Verteilte Systeme (Distributed Systems) Karl M. Göschka Karl.Goeschka@tuwien.ac.at http://www.infosys.tuwien.ac.at/teaching/courses/ VerteilteSysteme/ Lecture 6: Clocks and Agreement Synchronization of

More information

Distributed Snapshots 10/5/18

Distributed Snapshots 10/5/18 Distributed Snapshots 10/5/18 A note on channels and goroutines... Using channels is easy, debugging them is hard Bullet-proof way: Keep track of how many things go in and go out Always ask yourself: is

More information

Globally Synchronized time via Datacenter Networks

Globally Synchronized time via Datacenter Networks 1 Globally Synchronized time via Datacenter Networks Ki Suh Lee Cornell University Joint work with Han Wang, Vishal Shrivastav and Hakim Weatherspoon 2 Synchronized Clocks Fundamental for network and distributed

More information

Versioning, Consistency, and Agreement

Versioning, Consistency, and Agreement Lee Lorenz, Brent Sheppard Jenkins, if I want another yes man, I ll build one! Versioning, Consistency, and Agreement COS 461: Computer Networks Spring 2010 (MW 3:00 4:20 in CS105) Mike Freedman hip://www.cs.princeton.edu/courses/archive/spring10/cos461/

More information

Process groups and message ordering

Process groups and message ordering Process groups and message ordering If processes belong to groups, certain algorithms can be used that depend on group properties membership create ( name ), kill ( name ) join ( name, process ), leave

More information

MINIMIZING TRANSACTION LATENCY IN GEO-REPLICATED DATA STORES

MINIMIZING TRANSACTION LATENCY IN GEO-REPLICATED DATA STORES MINIMIZING TRANSACTION LATENCY IN GEO-REPLICATED DATA STORES Divy Agrawal Department of Computer Science University of California at Santa Barbara Joint work with: Amr El Abbadi, Hatem Mahmoud, Faisal

More information

Replication in Distributed Systems

Replication in Distributed Systems Replication in Distributed Systems Replication Basics Multiple copies of data kept in different nodes A set of replicas holding copies of a data Nodes can be physically very close or distributed all over

More information

Distributed Systems Synchronization

Distributed Systems Synchronization Distributed Systems Synchronization Gianpaolo Cugola Dipartimento di Elettronica e Informazione Politecnico, Italy cugola@elet.polimi.it http://home.dei.polimi.it/cugola http://corsi.dei.polimi.it/distsys

More information

Formal Development of Fault Tolerant Transactions for a Replicated Database using Ordered Broadcasts

Formal Development of Fault Tolerant Transactions for a Replicated Database using Ordered Broadcasts Formal Development of Fault Tolerant Transactions for a Replicated Database using Ordered Broadcasts Divakar Yadav and Michael Butler Dependable Systems and Software Engineering School of Electronics and

More information

Introduction to Real-Time Communications. Real-Time and Embedded Systems (M) Lecture 15

Introduction to Real-Time Communications. Real-Time and Embedded Systems (M) Lecture 15 Introduction to Real-Time Communications Real-Time and Embedded Systems (M) Lecture 15 Lecture Outline Modelling real-time communications Traffic and network models Properties of networks Throughput, delay

More information

Distributed Systems Question Bank UNIT 1 Chapter 1 1. Define distributed systems. What are the significant issues of the distributed systems?

Distributed Systems Question Bank UNIT 1 Chapter 1 1. Define distributed systems. What are the significant issues of the distributed systems? UNIT 1 Chapter 1 1. Define distributed systems. What are the significant issues of the distributed systems? 2. What are different application domains of distributed systems? Explain. 3. Discuss the different

More information

The OSI Reference Model. EECC694 - Shaaban

The OSI Reference Model. EECC694 - Shaaban The OSI Reference Model #1 lec #11 Spring2000 4-18-2000 The Transport Layer Provides reliable end-to-end service to processes in the application layer: Connection-oriented or connection-less services.

More information

L12: I/O Systems. Name: ID:

L12: I/O Systems. Name: ID: L12: I/O Systems Name: ID: Synchronous and Asynchronous Buses Synchronous Bus (e.g., processor-memory buses) Includes a clock in the control lines and has a fixed protocol for communication that is relative

More information

OUTLINE. Introduction Clock synchronization Logical clocks Global state Mutual exclusion Election algorithms Deadlocks in distributed systems

OUTLINE. Introduction Clock synchronization Logical clocks Global state Mutual exclusion Election algorithms Deadlocks in distributed systems Chapter 5 Synchronization OUTLINE Introduction Clock synchronization Logical clocks Global state Mutual exclusion Election algorithms Deadlocks in distributed systems Concurrent Processes Cooperating processes

More information

Time and distributed systems. Just use time stamps? Correct consistency model? Replication and Consistency

Time and distributed systems. Just use time stamps? Correct consistency model? Replication and Consistency Correct consistency model? Replication and Consistency B COS 58: dvanced Computer Systems Lecture 3 Let s say and B send an op. ll readers see B? ll readers see B? Michael Freedman Some see B and others

More information

MODELS OF DISTRIBUTED SYSTEMS

MODELS OF DISTRIBUTED SYSTEMS Distributed Systems Fö 2/3-1 Distributed Systems Fö 2/3-2 MODELS OF DISTRIBUTED SYSTEMS Basic Elements 1. Architectural Models 2. Interaction Models Resources in a distributed system are shared between

More information

Consistency and Replication. Some slides are from Prof. Jalal Y. Kawash at Univ. of Calgary

Consistency and Replication. Some slides are from Prof. Jalal Y. Kawash at Univ. of Calgary Consistency and Replication Some slides are from Prof. Jalal Y. Kawash at Univ. of Calgary Reasons for Replication Reliability/Availability : Mask failures Mask corrupted data Performance: Scalability

More information

Distributed Systems. Lec 12: Consistency Models Sequential, Causal, and Eventual Consistency. Slide acks: Jinyang Li

Distributed Systems. Lec 12: Consistency Models Sequential, Causal, and Eventual Consistency. Slide acks: Jinyang Li Distributed Systems Lec 12: Consistency Models Sequential, Causal, and Eventual Consistency Slide acks: Jinyang Li (http://www.news.cs.nyu.edu/~jinyang/fa10/notes/ds-eventual.ppt) 1 Consistency (Reminder)

More information

Synchronization Part 1. REK s adaptation of Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 5

Synchronization Part 1. REK s adaptation of Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 5 Synchronization Part 1 REK s adaptation of Claypool s adaptation of Tanenbaum s Distributed Systems Chapter 5 1 Outline! Clock Synchronization! Clock Synchronization Algorithms! Logical Clocks! Election

More information

(Refer Slide Time: 02.06)

(Refer Slide Time: 02.06) Data Structures and Algorithms Dr. Naveen Garg Department of Computer Science and Engineering Indian Institute of Technology, Delhi Lecture 27 Depth First Search (DFS) Today we are going to be talking

More information

Recall our 2PC commit problem. Recall our 2PC commit problem. Doing failover correctly isn t easy. Consensus I. FLP Impossibility, Paxos

Recall our 2PC commit problem. Recall our 2PC commit problem. Doing failover correctly isn t easy. Consensus I. FLP Impossibility, Paxos Consensus I Recall our 2PC commit problem FLP Impossibility, Paxos Client C 1 C à TC: go! COS 418: Distributed Systems Lecture 7 Michael Freedman Bank A B 2 TC à A, B: prepare! 3 A, B à P: yes or no 4

More information

Parallel and Distributed Systems. Programming Models. Why Parallel or Distributed Computing? What is a parallel computer?

Parallel and Distributed Systems. Programming Models. Why Parallel or Distributed Computing? What is a parallel computer? Parallel and Distributed Systems Instructor: Sandhya Dwarkadas Department of Computer Science University of Rochester What is a parallel computer? A collection of processing elements that communicate and

More information

PART B UNIT II COMMUNICATION IN DISTRIBUTED SYSTEM PART A

PART B UNIT II COMMUNICATION IN DISTRIBUTED SYSTEM PART A CS6601 DISTRIBUTED SYSTEMS QUESTION BANK UNIT 1 INTRODUCTION 1. What is a distributed system? 2. Mention few examples of distributed systems. 3. Mention the trends in distributed systems. 4. What are backbones

More information

Distributed Systems - I

Distributed Systems - I CSE 421/521 - Operating Systems Fall 2011 Lecture - XXIII Distributed Systems - I Tevfik Koşar University at Buffalo November 22 nd, 2011 1 Motivation Distributed system is collection of loosely coupled

More information

CSE 380 Computer Operating Systems

CSE 380 Computer Operating Systems CSE 380 Computer Operating Systems Instructor: Insup Lee University of Pennsylvania Fall 2003 Lecture Note: Distributed Systems 1 Introduction to Distributed Systems Why do we develop distributed systems?

More information

Failure Models. Fault Tolerance. Failure Masking by Redundancy. Agreement in Faulty Systems

Failure Models. Fault Tolerance. Failure Masking by Redundancy. Agreement in Faulty Systems Fault Tolerance Fault cause of an error that might lead to failure; could be transient, intermittent, or permanent Fault tolerance a system can provide its services even in the presence of faults Requirements

More information

Data-Intensive Distributed Computing

Data-Intensive Distributed Computing Data-Intensive Distributed Computing CS 451/651 (Fall 2018) Part 7: Mutable State (2/2) November 13, 2018 Jimmy Lin David R. Cheriton School of Computer Science University of Waterloo These slides are

More information

Precision Time Protocol, and Sub-Microsecond Synchronization

Precision Time Protocol, and Sub-Microsecond Synchronization Linux Foundation End User Summit May 1, 2012 Precision Time Protocol, and Sub-Microsecond Synchronization Mike Kravetz IBM Linux Technology Center kravetz@us.ibm.com 2009 IBM Corporation Agenda Background/History

More information

When Do We Want Accurate TimeStamps?

When Do We Want Accurate TimeStamps? When Do We Want Accurate TimeStamps? Contents Overview... 2 Background... 2 But Why?... 3 Correlating PCAPs with Log Files... 4 Correlating Multiple PCAPs... 4 Captured at Win7 PC... 5 Captured at Win10

More information

Eventual Consistency 1

Eventual Consistency 1 Eventual Consistency 1 Readings Werner Vogels ACM Queue paper http://queue.acm.org/detail.cfm?id=1466448 Dynamo paper http://www.allthingsdistributed.com/files/ amazon-dynamo-sosp2007.pdf Apache Cassandra

More information

Distributed Algorithms Reliable Broadcast

Distributed Algorithms Reliable Broadcast Distributed Algorithms Reliable Broadcast Alberto Montresor University of Trento, Italy 2016/04/26 This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Contents

More information

CS 5523 Operating Systems: Midterm II - reivew Instructor: Dr. Tongping Liu Department Computer Science The University of Texas at San Antonio

CS 5523 Operating Systems: Midterm II - reivew Instructor: Dr. Tongping Liu Department Computer Science The University of Texas at San Antonio CS 5523 Operating Systems: Midterm II - reivew Instructor: Dr. Tongping Liu Department Computer Science The University of Texas at San Antonio Fall 2017 1 Outline Inter-Process Communication (20) Threads

More information

Homework 2 COP The total number of paths required to reach the global state is 20 edges.

Homework 2 COP The total number of paths required to reach the global state is 20 edges. Homework 2 COP 5611 Problem 1: 1.a Global state lattice 1. The total number of paths required to reach the global state is 20 edges. 2. In the global lattice each and every edge (downwards) leads to a

More information

G Bayou: A Weakly Connected Replicated Storage System. Robert Grimm New York University

G Bayou: A Weakly Connected Replicated Storage System. Robert Grimm New York University G22.3250-001 Bayou: A Weakly Connected Replicated Storage System Robert Grimm New York University Altogether Now: The Three Questions! What is the problem?! What is new or different?! What are the contributions

More information

Coordination 2. Today. How can processes agree on an action or a value? l Group communication l Basic, reliable and l ordered multicast

Coordination 2. Today. How can processes agree on an action or a value? l Group communication l Basic, reliable and l ordered multicast Coordination 2 Today l Group communication l Basic, reliable and l ordered multicast How can processes agree on an action or a value? Modes of communication Unicast 1ç è 1 Point to point Anycast 1è

More information

DRAFT. Dual Time Scale in Factory & Energy Automation. White Paper about Industrial Time Synchronization. (IEEE 802.

DRAFT. Dual Time Scale in Factory & Energy Automation. White Paper about Industrial Time Synchronization. (IEEE 802. SIEMENS AG 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 DRAFT Dual Time Scale in Factory & Energy Automation White Paper about Industrial

More information

IEEE 1588 PTP clock synchronization over a WAN backbone

IEEE 1588 PTP clock synchronization over a WAN backbone Whitepaper IEEE 1588 PTP clock synchronization over a WAN backbone A field study comparing PTP clock synchronization accuracy against GPS external time reference in a live production WAN environment Contents

More information

Data synchronization Conflict resolution strategies

Data synchronization Conflict resolution strategies Data synchronization Conflict resolution strategies Waldemar Korłub Department of Computer Architecture Faculty of Electronics, Telecommunications and Informatics Gdansk University of Technology November

More information

18 Distributed Time. Distributed Embedded Systems Philip Koopman Nov. 9, Copyright , Philip Koopman

18 Distributed Time. Distributed Embedded Systems Philip Koopman Nov. 9, Copyright , Philip Koopman 18 Distributed Time Distributed Embedded Systems Philip Koopman Nov. 9, 2015 Copyright 2000-2015, Philip Koopman Preview Distributed time When things happen; chain of causal events Relating messages to

More information

MODELS OF DISTRIBUTED SYSTEMS

MODELS OF DISTRIBUTED SYSTEMS Distributed Systems Fö 2/3-1 Distributed Systems Fö 2/3-2 MODELS OF DISTRIBUTED SYSTEMS Basic Elements 1. Architectural Models 2. Interaction Models Resources in a distributed system are shared between

More information

CIS Operating Systems Synchronization based on Busy Waiting. Professor Qiang Zeng Spring 2018

CIS Operating Systems Synchronization based on Busy Waiting. Professor Qiang Zeng Spring 2018 CIS 3207 - Operating Systems Synchronization based on Busy Waiting Professor Qiang Zeng Spring 2018 Previous class IPC for passing data Pipe FIFO Message Queue Shared Memory Compare these IPCs for data

More information

Designing and Evaluating a Distributed Computing Language Runtime. Christopher Meiklejohn Université catholique de Louvain, Belgium

Designing and Evaluating a Distributed Computing Language Runtime. Christopher Meiklejohn Université catholique de Louvain, Belgium Designing and Evaluating a Distributed Computing Language Runtime Christopher Meiklejohn (@cmeik) Université catholique de Louvain, Belgium R A R B R A set() R B R A set() set(2) 2 R B set(3) 3 set() set(2)

More information

CS 31: Introduction to Computer Systems : Threads & Synchronization April 16-18, 2019

CS 31: Introduction to Computer Systems : Threads & Synchronization April 16-18, 2019 CS 31: Introduction to Computer Systems 22-23: Threads & Synchronization April 16-18, 2019 Making Programs Run Faster We all like how fast computers are In the old days (1980 s - 2005): Algorithm too slow?

More information

EE178 Spring 2018 Lecture Module 4. Eric Crabill

EE178 Spring 2018 Lecture Module 4. Eric Crabill EE178 Spring 2018 Lecture Module 4 Eric Crabill Goals Implementation tradeoffs Design variables: throughput, latency, area Pipelining for throughput Retiming for throughput and latency Interleaving for

More information