7 Fault Tolerant Distributed Transactions Commit protocols

Violet Quinn

1 7 Fault Tolerant Distributed Transactions: Commit protocols
7.1 Subtransactions and distribution
7.2 Fault tolerance and commit processing
7.3 Requirements
7.4 One phase commit
7.5 Two phase commit
Based on Weikum / Vossen; Valduriez / Özsu; Garcia-Molina; Reuter / Gray

2 7.1 Transactions and Subtransactions
Transactions may be nested, as opposed to flat.
Different semantic models of nested transactions: closed vs. open.
[Figure: nested transaction tree t2, example by Weikum/Vossen, with the operations Withdraw(x, 1000), Deposit(y, 1000), Append(h, ...); Search, Fetch, Modify, Store on records; and page-level reads r(...) and writes w(...)]
HS-2010 HS / 08-TA-2PC- 2

3 Closed Nested Transactions
Let T be a parent transaction, Ci a child TA of T, Cij a child TA of Ci, and so on recursively.
- Commit rule: Ci, Cij, ... are finally committed if all ancestors, including T, commit.
- Abort rule: if some Ci aborts, all its children abort. Caveat: the parent does not need to abort if a child aborts.
- Visibility rule: if Ci commits (locally!), its data are visible to the parent, but not to siblings.
The TA outcome is basically controlled by T.
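
The commit and abort rules above can be sketched as a small state model. This is only an illustration under stated assumptions: the class `ClosedNestedTx`, its state names and its methods are hypothetical, and the visibility rule is not modeled.

```python
class ClosedNestedTx:
    """Hypothetical node in a closed nested transaction tree."""

    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.children = []
        # active | locally_committed | committed | aborted
        self.state = "active"

    def begin_child(self, name):
        child = ClosedNestedTx(name, parent=self)
        self.children.append(child)
        return child

    def commit(self):
        if self.state != "active":
            raise RuntimeError(f"{self.name} is {self.state}, cannot commit")
        if any(c.state == "active" for c in self.children):
            raise RuntimeError(f"{self.name} still has active children")
        if self.parent is None:
            self._finalize()  # the root T controls the final outcome
        else:
            # A child commit is only local; its fate depends on the ancestors.
            self.state = "locally_committed"

    def _finalize(self):
        # Commit rule: locally committed descendants become final
        # once all ancestors up to the root have committed.
        self.state = "committed"
        for child in self.children:
            if child.state == "locally_committed":
                child._finalize()

    def abort(self):
        if self.state == "committed":
            raise RuntimeError(f"{self.name} is already finally committed")
        # Abort rule: the abort cascades to all children, never to the parent.
        self.state = "aborted"
        for child in self.children:
            if child.state != "aborted":
                child.abort()
```

In this sketch a child that aborts leaves its parent `active`, while a parent abort also undoes children that had already committed locally.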

4 Open Nested Transactions
The closed TA model is too restrictive; compare federations of autonomous systems.
Open nested transactions: subtransactions may commit independently and release resources.
Needed: a different undo mechanism
- Compensation TA for undoing effects (if possible)
- Forward recovery using savepoints
More flexibility, less integrity.
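
A minimal sketch of undo by compensation, assuming every subtransaction comes with a compensating action that semantically reverses it. The class `OpenNestedTx` and the helper `transfer_then_abort` are hypothetical names; the account values are made up for illustration.

```python
class OpenNestedTx:
    """Hypothetical open nested TA: each step commits independently."""

    def __init__(self):
        self._compensations = []

    def run_step(self, action, compensation):
        action()  # the subtransaction commits at once and releases resources
        self._compensations.append(compensation)

    def abort(self):
        # Undo by compensation: run the compensating TAs in reverse order.
        while self._compensations:
            self._compensations.pop()()


def transfer_then_abort():
    accounts = {"x": 5000, "y": 0}

    def change(account, amount):
        accounts[account] += amount

    tx = OpenNestedTx()
    tx.run_step(lambda: change("x", -1000), lambda: change("x", +1000))
    tx.run_step(lambda: change("y", +1000), lambda: change("y", -1000))
    after_steps = dict(accounts)  # other TAs could already see this state
    tx.abort()
    return after_steps, dict(accounts)
```

Between the steps and the abort, the intermediate state is visible to others, which is the "less integrity" side of the trade-off.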

5 7.2 Fault tolerance and transactions
Primary problems of TAs related to reliability: atomicity, durability.
Well-known solution in centralized DBS: save state information in a safe place.
State information to be saved depends on
- failure model
- system aspects (e.g. buffer management)
Before image / after image / WAL is safe.
Allows to reconstruct the state of
- committed TAs, effects not yet stored in DB
- aborted TAs, effects partially in DB
- running TAs, effects partially in DB -> abort
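
The three recovery cases can be sketched with a log of before and after images. This is a simplified illustration, not a full recovery algorithm: the record layout and the function `recover` are assumptions, and the sketch presumes strict schedules (no TA overwrites uncommitted data of another TA).

```python
def recover(db, log):
    """Rebuild a consistent state from the on-disk db and the log.

    Assumed record layouts (hypothetical):
      ("write", ta, key, before_image, after_image)
      ("commit", ta) / ("abort", ta)
    """
    committed = {rec[1] for rec in log if rec[0] == "commit"}

    # Undo pass, backwards: aborted and still-running TAs lose their effects.
    for rec in reversed(log):
        if rec[0] == "write" and rec[1] not in committed:
            _, _, key, before, _ = rec
            db[key] = before

    # Redo pass, forwards: committed effects not yet in the DB are re-applied.
    for rec in log:
        if rec[0] == "write" and rec[1] in committed:
            _, _, key, _, after = rec
            db[key] = after
    return db
```

For example, with T1 committed but not yet flushed, T2 still running at the crash, and T3 aborted with its write partially on disk:

```python
disk = {"a": 10, "b": 99, "c": 8}
log = [
    ("write", "T1", "a", 10, 20), ("commit", "T1"),
    ("write", "T2", "b", 5, 99),
    ("write", "T3", "c", 7, 8), ("abort", "T3"),
]
print(recover(disk, log))
```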

6 Architectural model (centralized)
System model: components of TA control
[Figure: local DBMS with lock manager and LOG]
Basic principles for commit processing:
- write-ahead log
- commit rule

7 Failures
Failures in a distributed system: partial failure makes it hard!
[Figure: sites S0, S1, S2 with the messages "withdraw x from account a" and "add x to account b"]
Has x already been added to b when S2 collapsed?
Avoid both: adding twice and a lost add ('exactly once' semantics).
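
One common way to avoid the "add twice" half of the problem is to make the add idempotent, so that the sender can safely retry after a suspected failure. The sketch below assumes each request carries a unique `request_id`; the class `Account` and that parameter are hypothetical. In a real system the set of applied request ids would have to be stored durably and atomically together with the balance.

```python
class Account:
    """Hypothetical account at S2 that applies each add request at most once."""

    def __init__(self, balance=0):
        self.balance = balance
        self._applied = set()  # ids of requests already applied

    def add(self, request_id, amount):
        if request_id in self._applied:
            return False  # duplicate after a retry: ignore, do not add twice
        self.balance += amount
        self._applied.add(request_id)
        return True
```

Retrying until an acknowledgement arrives then covers the "lost add" half, since a repeated request no longer changes the balance.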

8 Failures
Did S1 commit its subtransaction? I.e., did it receive the "commit" from the TA coordinator before the net or S1 collapsed?
[Figure: sites S0, S1, S2 exchanging "commit" messages; local TA]
Wanted: partial execution of one logical operation at different sites!

9 Types of failures
Transaction failures
- Transaction aborts (unilaterally or due to deadlock)
- Avg. 3% of transactions abort abnormally
System (site) failures
- Failure of processor, main memory, power supply, ...
- Main memory contents are lost, but secondary storage contents are safe
- Partial vs. total failure
Communication failures
- Lost / undeliverable messages
- Network partitioning

10 Failure Model
More failure types
- Multiple failures
- Malevolent failures
- Detectable failures
Failure model
- Fail-stop nodes (recoverable system failures)
- Network: in-order msgs, no spontaneous msgs, timeouts; net partitions may occur; no persistent msgs; msgs delivered eventually (makes life easier)
[Figure: node states running -> halted -> recovery -> running]

11 Distributed Commit
[Figure: transaction T with a commit coordinator; actions a1, a2; a3; a4, a5]
How to guarantee "all or nothing"?
The decision on "commit" or "abort" must be unanimous.

12 Distributed Commit
"No-failure" mode
- Wait for the "ack" of all actions (nodes)
- Send "commit" to all participating nodes
Next to trivial, like many algorithms without resilience.
Participant (resource manager) states: Working, Prepared, Committed, Aborted

13 Distributed Commit - Issues
Problems
- A transaction operates on multiple servers (resource managers)
- A distributed system may fail partially (server crashes, network failures), which creates the danger of inconsistent decisions
- Global commit needs unanimous agreement of all participants (agents)
Atomic commit problem: find a protocol which ensures a unanimous decision even in case of failures.

14 7.3 Requirements for Atomic Commit
AC1 All participants finally come to the same decision (uniform agreement)
AC2 A "commit" decision can only be reached if all local decisions were "commit" (uniform validity)
AC3 A participant cannot reverse its decision after deciding (stability)
AC4 If there is no failure and all local decisions were "commit", then the overall decision is "commit" (non-triviality)
AC5 All correct participants reach a decision (non-blocking)
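
To make the requirements concrete, the following sketch checks AC1, AC2, AC4 and AC5 against the final state of a single protocol run. The function `check_atomic_commit` and its argument layout are hypothetical. AC3 (stability) is left out because it concerns the history of decisions, which a final snapshot cannot show.

```python
def check_atomic_commit(votes, decisions, crashed=frozenset()):
    """Check one finished run against AC1, AC2, AC4, AC5.

    votes:     participant -> "commit" | "abort"   (local decision)
    decisions: participant -> "commit" | "abort" | None (None = undecided)
    crashed:   participants that failed during the run
    """
    decided = {d for d in decisions.values() if d is not None}
    all_yes = all(v == "commit" for v in votes.values())
    return {
        # AC1: nobody decides differently from anybody else.
        "AC1": len(decided) <= 1,
        # AC2: "commit" is only reachable if every local decision was "commit".
        "AC2": "commit" not in decided or all_yes,
        # AC4: no failure and all local decisions "commit" -> outcome "commit".
        "AC4": bool(crashed) or not all_yes or decided == {"commit"},
        # AC5: every correct (non-crashed) participant has decided.
        "AC5": all(decisions.get(p) is not None
                   for p in votes if p not in crashed),
    }
```

A run in which a surviving participant stays undecided after another one crashed satisfies AC1 and AC2 but violates AC5, which is exactly the blocking case.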

15 AC: Discussion
In all distributed systems:
- Safety conditions: "nothing bad happens"
- Liveness conditions: "something happens"
AC1-AC3: safety (unanimous, stable)
AC4, AC5: liveness
- AC4: a trivial solution of AC would be that all participants always abort
- AC5: something will happen
AC1-AC5: Non-Blocking Atomic Commit (NB-AC) problem

16 Blocking
What does blocking mean?
In case of a failure, a blocking protocol prevents the other participants from taking the final decision on the fate of the transaction.
This is a bad situation, since the resources of all participants stay blocked until recovery from the failure.

17 NB-AC
In an asynchronous* distributed system, there is no protocol which solves NB-AC.
Idea of proof:
[Figure: participant states Working, Prepared, Committed, Aborted, with the current state of participant p marked]
There is no way to decide between C (Committed) and A (Aborted) without information about the fate of the TA (no independent recovery).
* Asynchronous means: msg delay and process speed are unbounded.

18 Relaxation of AC requirements
AC4 is too strong: "No failure -> all decide commit"
AC4': "No participant is suspected to have failed -> every participant reaches a commit decision." (Non-Blocking Weak Atomic Commit, NB-WAC)
"Suspected to fail" means there are failure detectors, e.g. timeouts, which detect crashes / communication failures but may be wrong.
The NB-WAC protocol is based on a consensus protocol, Paxos (see below).
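
A timeout-based failure detector of the kind mentioned above could look roughly like this. The class `TimeoutFailureDetector` and its heartbeat interface are assumptions made for illustration; time is passed in explicitly so that the behavior is deterministic.

```python
class TimeoutFailureDetector:
    """Hypothetical detector: suspects nodes that have been silent too long."""

    def __init__(self, timeout):
        self.timeout = timeout
        self.last_heard = {}  # node -> time of the last message received

    def heartbeat(self, node, now):
        self.last_heard[node] = now

    def suspected(self, now):
        # A node is only *suspected*: it may be slow, not crashed.
        return {node for node, t in self.last_heard.items()
                if now - t > self.timeout}
```

If a heartbeat from a suspected node arrives later, the suspicion turns out to have been wrong, which is why AC4' speaks of suspicion rather than of failure.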

19 7.4 One phase commit
Example: calendar application
Application protocol: agreement on the date / time of some event, e.g.:
"... everyone happy with the suggested date?
if one participant votes no, the coordinator makes a new suggestion,
else commit (1-phase)"
Agreement between the nodes is reached in the processing phase, not during commit.

20 1PC: participant protocol
One-Phase-Commit Participant FSA
[Figure: finite state automaton with the states init, prepared, committed, aborted; transitions labeled exec_read, exec_update / ack, exec_update / neg_ack, commit / ack, abort / ack]
Every update is acknowledged; the participant gives up its veto right for the whole TA -> one commit phase.
Slide: J. Bross

21 Notation
Finite state automaton, different for
- participants
- coordinator
State transitions are labeled by msg received / msg sent
- transition fct δ: inputs × states -> states
- output fct λ: inputs × states -> outputs
Any statechart type is ok.
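
As an illustration of this notation, δ and λ can be stored together in one table keyed by (state, input). The table below is a guessed reconstruction of a 1PC participant using the state and message names of the FSA slide; the exact transitions, the omission of exec_read, and the split of exec_update into the inputs `exec_update_ok` and `exec_update_fail` are assumptions, not the original diagram.

```python
# (state, msg received) -> (next state, msg sent), i.e. delta and lambda combined.
TRANSITIONS = {
    ("init", "exec_update_ok"): ("prepared", "ack"),
    ("init", "exec_update_fail"): ("aborted", "neg_ack"),
    ("init", "abort"): ("aborted", "ack"),
    ("prepared", "exec_update_ok"): ("prepared", "ack"),
    ("prepared", "exec_update_fail"): ("aborted", "neg_ack"),
    ("prepared", "commit"): ("committed", "ack"),
    ("prepared", "abort"): ("aborted", "ack"),
}


def run_participant(inputs, state="init"):
    """Feed a sequence of received messages through the automaton."""
    outputs = []
    for msg in inputs:
        # An input that is undefined in the current state raises KeyError.
        state, out = TRANSITIONS[(state, msg)]
        outputs.append(out)
    return state, outputs
```

Note that `commit` is always answered with `ack`: after acknowledging every update, the participant has no way left to veto.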

22 Characteristics of 1PC
Blocking? Yes! When?
Two types of blocking:
- participant failure
- coordinator failure (more serious, why?)
Window of uncertainty in the failure-free case?
Number of messages for commit / abort? Suppose n participants.

23 More involved task
- n participants, each having a variable x_i
- Clients send increments ("+j") to each of them
- No individual ack of an increment operation (but of the msg received)
---- end of operation phase
Condition for successful operation: all increments successful (no overflow, or alike)
If not successful: participants reset x_i
The commit coordinator has to decide! Commit phase?
1PC is not sufficient to come to a unanimous result! Why?
[Figure: timeline with work phase followed by commit phase]
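
One way to read this task: since individual increments are not acknowledged, only each participant knows whether its own increments succeeded, so the coordinator has to ask before it can decide. The failure-free sketch below adds such a voting round. The class `Counter`, the function `decide`, the overflow limit and the reset to the initial value 0 are all hypothetical choices made for illustration.

```python
class Counter:
    """Hypothetical participant holding one variable x_i."""

    def __init__(self, limit):
        self.limit = limit
        self.x = 0
        self.ok = True  # local outcome, unknown to the coordinator

    def increment(self, j):
        # No success/failure ack is sent back for the operation itself.
        if self.x + j > self.limit:
            self.ok = False  # overflow: remember locally that the TA failed
        else:
            self.x += j

    def vote(self):
        return self.ok

    def finish(self, commit):
        if not commit:
            self.x = 0  # reset x_i (here: to its assumed initial value)


def decide(participants):
    # Phase 1: collect the local outcomes. This round is what 1PC lacks.
    commit = all(p.vote() for p in participants)
    # Phase 2: distribute the unanimous decision.
    for p in participants:
        p.finish(commit)
    return commit
```

Without the first round the coordinator would have to send "commit" blindly, and a participant that saw an overflow could not veto.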

Fault tolerance. Distributed Commit Protocols. ACP requirements. Comparison of protocols. 7.4 Two phase commit. Roadmap Distributed Commit Protocols We know: There is no distributed Atomic Commit Protocol (ACP) in an asynchronous system with properties: Uniform agreement, uniform validity, stability (AC1-3) Non-triviality,