h p:// Authors: Tomáš Skopal, Irena Holubová Lecturer: Mar n Svoboda, mar

Similar documents
doc. RNDr. Tomáš Skopal, Ph.D.

Database transactions

Intro to Transactions

Overview. Introduction to Transaction Management ACID. Transactions

CSE 444: Database Internals. Lectures Transactions

Database Systems. Announcement

Intro to Transaction Management

Overview of Transaction Management

Transaction Management and Concurrency Control. Chapter 16, 17

Goal of Concurrency Control. Concurrency Control. Example. Solution 1. Solution 2. Solution 3

L i (A) = transaction T i acquires lock for element A. U i (A) = transaction T i releases lock for element A

CSE 444: Database Internals. Lectures 13 Transaction Schedules

Transaction Management: Concurrency Control

Introduction to Data Management CSE 344

What are Transactions? Transaction Management: Introduction (Chap. 16) Major Example: the web app. Concurrent Execution. Web app in execution (CS636)

CSE 344 MARCH 5 TH TRANSACTIONS

Transactions. Kathleen Durant PhD Northeastern University CS3200 Lesson 9

Introduction to Data Management CSE 414

CSE 344 MARCH 25 TH ISOLATION

Transaction Processing: Concurrency Control ACID. Transaction in SQL. CPS 216 Advanced Database Systems. (Implicit beginning of transaction)

Transaction Management: Introduction (Chap. 16)

Transaction Management & Concurrency Control. CS 377: Database Systems

Implementing Isolation

CSE 344 MARCH 9 TH TRANSACTIONS

Database Systems CSE 414

Announcements. Transaction. Motivating Example. Motivating Example. Transactions. CSE 444: Database Internals

Review. Review. Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications. Lecture #21: Concurrency Control (R&G ch.

CSC 261/461 Database Systems Lecture 21 and 22. Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101

References. Transaction Management. Database Administration and Tuning 2012/2013. Chpt 14 Silberchatz Chpt 16 Raghu

Database Systems CSE 414

Transaction Processing: Concurrency Control. Announcements (April 26) Transactions. CPS 216 Advanced Database Systems

Announcements. Motivating Example. Transaction ROLLBACK. Motivating Example. CSE 444: Database Internals. Lab 2 extended until Monday

Introduction to Data Management CSE 344

Introduction to Data Management CSE 344

Database Management Systems CSEP 544. Lecture 9: Transactions and Recovery

TRANSACTION MANAGEMENT

Introduction to Data Management CSE 344

CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #17: Transac0ons 2: 2PL and Deadlocks

Conflict Equivalent. Conflict Serializability. Example 1. Precedence Graph Test Every conflict serializable schedule is serializable

CSC 261/461 Database Systems Lecture 24

Introduction to Data Management CSE 344

Lecture 21. Lecture 21: Concurrency & Locking

CSE 190D Database System Implementation

Motivating Example. Motivating Example. Transaction ROLLBACK. Transactions. CSE 444: Database Internals

CSE 344 MARCH 21 ST TRANSACTIONS

Concurrency control 12/1/17

Lock Granularity and Consistency Levels (Lecture 7, cs262a) Ali Ghodsi and Ion Stoica, UC Berkeley February 7, 2018

Information Systems (Informationssysteme)

Introduction TRANSACTIONS & CONCURRENCY CONTROL. Transactions. Concurrency

Concurrency Control & Recovery

Lectures 8 & 9. Lectures 7 & 8: Transactions

Introduction to Data Management. Lecture #24 (Transactions)

CMPT 354: Database System I. Lecture 11. Transaction Management

Transactions and Concurrency Control

Database System Concepts

Page 1. Goals of Today s Lecture" Two Key Questions" Goals of Transaction Scheduling"

Chapter 22. Transaction Management

COURSE 1. Database Management Systems

Transactions. Transaction. Execution of a user program in a DBMS.

Page 1. Goals of Todayʼs Lecture" Two Key Questions" Goals of Transaction Scheduling"

Databases - Transactions

Concurrency Control! Snapshot isolation" q How to ensure serializability and recoverability? " q Lock-Based Protocols" q Other Protocols"

5/17/17. Announcements. Review: Transactions. Outline. Review: TXNs in SQL. Review: ACID. Database Systems CSE 414.

Transaction Processing. Introduction to Databases CompSci 316 Fall 2018

DATABASE DESIGN I - 1DL300

Database Systems CSE 414

Transaction Management Overview

TRANSACTION PROCESSING CONCEPTS

Transactions. ACID Properties of Transactions. Atomicity - all or nothing property - Fully performed or not at all

Concurrency Control. Chapter 17. Comp 521 Files and Databases Fall

Slides Courtesy of R. Ramakrishnan and J. Gehrke 2. v Concurrent execution of queries for improved performance.

Page 1. Goals of Today s Lecture. The ACID properties of Transactions. Transactions

Concurrency Control & Recovery

Database design and implementation CMPSCI 645. Lectures 18: Transactions and Concurrency

Transaction Management Overview. Transactions. Concurrency in a DBMS. Chapter 16

Concurrency Control. R &G - Chapter 19

DB2 Lecture 10 Concurrency Control

Databases - Transactions II. (GF Royle, N Spadaccini ) Databases - Transactions II 1 / 22

CHAPTER 3 RECOVERY & CONCURRENCY ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

CS322: Database Systems Transactions

6.830 Lecture Transactions October 23, 2017

Transactions and Concurrency Control. Dr. Philip Cannata

Concurrency Control. Chapter 17. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1

Database Applications (15-415)

Page 1. CS194-3/CS16x Introduction to Systems. Lecture 8. Database concurrency control, Serializability, conflict serializability, 2PL and strict 2PL

Concurrency Control 1

In This Lecture. Transactions and Recovery. Transactions. Transactions. Isolation and Durability. Atomicity and Consistency. Transactions Recovery

Transactions. Silberschatz, Korth and Sudarshan

Phantom Problem. Phantom Problem. Phantom Problem. Phantom Problem R1(X1),R1(X2),W2(X3),R1(X1),R1(X2),R1(X3) R1(X1),R1(X2),W2(X3),R1(X1),R1(X2),R1(X3)

Database Management System

Database Tuning and Physical Design: Execution of Transactions

CHAPTER: TRANSACTIONS

Transactions. 1. Transactions. Goals for this lecture. Today s Lecture

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

UNIT-IV TRANSACTION PROCESSING CONCEPTS

Lecture 7: Transactions Concurrency Control

Concurrency Control. Chapter 17. Comp 521 Files and Databases Spring

Concurrency Control. [R&G] Chapter 17 CS432 1

Introduction to Data Management CSE 344

Database Management Systems 2010/11

Transcription:

B0B36DBS, BD6B36DBS: Database Systems h p://www.ksi.m.cuni.cz/~svoboda/courses/172-b0b36dbs/ Lecture 9 Database Transac ons Authors: Tomáš Skopal, Irena Holubová Lecturer: Mar n Svoboda, mar n.svoboda@fel.cvut.cz 20. 2. 2018 Czech Technical University in Prague, Faculty of Electrical Engineering

Today s lecture outline motivation and the ACID properties schedules ( interleaved transaction execution) serializability conflicts (non)recoverable schedule locking protocols 2PL, strict 2PL, conservative 2PL deadlock and prevention phantom alternative protocols B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 2

Motivation problem: we need to execute complex database operations e.g., stored procedures, triggers, etc. in a multi-user and parallel environment database transaction sequence of actions on database objects (+ others like arithmetic, etc.) example: Let us have a bank database with table Accounts and the following transaction to transfer the money (pseudocode): transaction PaymentOrder(amount, fromacc, toacc) { 1. SELECT Balance INTO X FROM Accounts WHERE accnr = fromacc; 2. if (X < amount) AbortTransaction( Not enough money! ); 3. UPDATE Accounts SET Balance = Balance amount WHERE accnr = fromacc; 4. UPDATE Accounts SET Balance = Balance + amount WHERE accnr = toacc; 5. CommitTransaction; } B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 3

Transaction management in DBMS application launches transactions transaction manager executes transactions scheduler dynamically schedules the parallel transaction execution, producing a schedule (history) data manager executes partial operation of transactions DBMS application transaction manager transaction scheduler data manager database B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 4

Transaction management in DBMS transaction termination successful terminated by command in the transaction code the performed actions are confirmed unsuccessful transaction is cancelled 1. termination by the transaction code ABORT (or ROLLBACK) command user can be notified 2. system abort DBMS aborts the transaction some integrity constraint is violated user is notified by transaction scheduler (e.g., a deadlock occurs) user is not notified 3. system failure HW failure, power loss transaction must be restarted main objectives of transaction management enforcement of ACID properties maximal performance (throughput) parallel/concurrent execution of transactions B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 5

ACID desired properties of transaction management Atomicity partial execution is not allowed (all or nothing) prevents from incorrect transaction termination (or failure) = consistency at the DBMS level? Consistency any transaction will bring the database from one consistent (valid) state to another = consistency at application level Isolation transactions executed in parallel do not see effects of each other unless committed parallel/concurrent execution is necessary to achieve high throughput Durability once a transaction has been committed, it will remain so, even in the event of power loss, crashes, or errors logging necessary (log/journal maintained) B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 6

Transaction an executed transaction is a sequence of actions T = <A T1, A T2,..., or ABORT> basic database actions (operations) for now consider a static database (no inserts/deletes, just updates), let A be a database object (table, row, attribute in row) we omit other actions such as control construct (if, for), etc. READ(A) reads A from database WRITE(A) writes A to database confirms executed actions as valid, terminates transaction ABORT cancels executed actions, terminates transaction (with error) SQL commands SELECT, INSERT, UPDATE, could be viewed as transactions implemented using the basic actions (in SQL command ROLLBACK is used instead of abort) Example: Subtract 5 from A (some attribute), such that A>0. T = <READ(A), // action 1 if (A 5) then ABORT else WRITE(A 5), // action 2 > // action 3 or T = <READ(A), // action 1 if (A 5) then ABORT // action 2 else... > B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 7

Transaction programs vs. schedules database program design-time (not running) piece of code (that will be executed as a transaction) i.e., nonlinear branching, loops, jumps schedule (history) is a sorted list of actions coming from several transactions (i.e., transactions as interleaved) runtime history of already concurrently executed actions of several transactions i.e., linear sequence of primitive operations, w/o control constructs T1 T2 T3 READ(A) WRITE(A) READ(B) WRITE(A) READ(C) ABORT B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 8

Serial schedules specific schedule, where all actions of a transaction are coupled together no action interleaving given a set S of transactions, we can obtain S! serial schedules from the definition of ACID properties, all the schedules are equivalent it does not matter if one transaction is executed before or after another one if it matters, they are not independent and so they should be merged into single transactions example: T1 T2 T3 READ(A) WRITE(A) ABORT WRITE(A) READ(B) READ(C) B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 9

Why to interleave transactions? every schedule leads to interleaved sequential execution of transactions (there is no parallel execution of database operations) simplified model justified by single storage device Question: So why to interleave transactions when the number of steps is the same as in a serial schedule? two reasons parallel execution of non-database operations with database operations response proportional to transaction complexity (e.g., OldestEmployee vs. ComputeTaxes) example T1 T2 T3 READ(A) print( Start T2 ) A := 5 A := A +1 WRITE(A) print(a) READ(B) print( Written ) WRITE(A) X := B; READ(C) print( error ) X := X + C ABORT B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 10

Serializability a schedule is serializable if its execution leads to consistent database state, i.e., if the schedule is equivalent to any serial schedule for now we consider only committed transactions and a static database note that non-database operations are not considered so that consistency cannot be provided for non-database state (e.g., print on console) it does not matter which serial schedule is equivalent (independent transactions) strong property secures the Isolation and Consistency in ACID view serializability extends serializability by including aborted transactions and dynamic database however, testing is NP-complete, so it is not used in practice instead, conflict serializability + other techniques are used B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 11

Dangers caused by interleaving to achieve serializability (i.e., consistency and isolation), the action of interleaving cannot be arbitrary there exist 3 types of local dependencies in the schedule, so-called conflict pairs four possibilities of reading/writing the same resource in schedule read-read ok, by reading the transactions do not affect each other write-read (WR) T1 writes, then T2 reads reading uncommitted data read-write (RW) T1 reads, then T2 writes unrepeatable reading write-write (WW) T1 writes, then T2 writes overwrite of uncommitted data T1 Read(A) Read(A) Write(A) WR RW WW T2 Write(A) B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 12

Conflicts (WR) reading uncommitted data (write-read conflict) transaction T2 reads A that was earlier updated by transaction T1, but T1 did not commit so far, i.e., T2 reads potentially inconsistent data so-called dirty read Example: T1 transfers 1000 USD from account A to account B (A = 12000, B = 10000) T2 adds 1% per account T1 T2 R(A) // A = 12000 A := A 1000 // database is now inconsistent account B still contains the old balance R(A) // uncommitted data is read R(B) A := 1.01*A B := 1.01*B W(B) R(B) // B = 10100 B := B + 1000 W(B) // inconsistent database, A = 11110, B = 11100 B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 13

Conflicts (RW) unrepeatable read (read-write conflict) transaction T2 writes A that was read earlier by T1 that didn t finish yet T1 cannot repeat the reading of A (A now contains another value) so-called unrepeatable read Example: T1 transfers 1000 USD from account A to account B (A = 12000, B = 10000) T2 adds 1% per account T1 T2 R(A) // A = 12000 R(A) R(B) A := 1.01*A B := 1.01*B // update of A W(B) // database now contains A = 12120 R(B) A := A 1000 B := B + 1000 W(B) // inconsistent database, A = 11000, B = 11100 B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 14

Conflicts (WW) overwrite of uncommitted data (write-write conflict) transaction T2 overwrites A that was earlier written by T1 that still runs loss of update (original value of A is lost) so-called blind write (update of unread data) Example: Set the same price to all DVDs. (let s have two instances of this transaction, one setting price to 10 USD, second 15 USD) T1 DVD2 := 10 W(DVD2) T2 DVD1 := 15 W(DVD1) DVD2 := 15 W(DVD2) // overwrite of uncommitted data DVD1 := 10 W(DVD1) // inconsistent database, DVD1 = 10, DVD2 = 15 B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 15

Conflict serializability two schedules are conflict equivalent if they share the set of conflict pairs a schedule is conflict serializable if it is conflict-equivalent to some serial schedule, i.e., there are no real conflicts more restrictive than serializability (defined only by consistency preservation) conflict serializability alone does not consider: cancelled transactions ABORT/ROLLBACK, so the schedule could be unrecoverable dynamic database (inserting / deleting database objects) so-called phantom may occur hence, conflict serializability is not sufficient condition to provide ACID (view serializability is ultimate condition) Example: schedule, that is serializable (serial schedule <T1, T2, T3>), but is not conflict serializable (writes in T1 and T2 are in wrong order) T1 T2 T3 R(A) B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 16

Detection of conflict serializability precedence graph (also serializability graph) on a schedule nodes T i are committed transactions edges represent RW, WR, WW conflicts in the schedule schedule is conflict serializable if its precedence graph is acyclic Example: not conflict serializable T1 T2 T3 R(A) T1 WW, RW T3 RW WW WW T2 Example: conflict serializable T1 T2 T3 R(A) R(A) T1 WW, RW T3 RW RW T2 B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 17

Unrecoverable schedule at this moment we extend the transaction model by ABORT which brings another danger unrecoverable schedule one transaction aborts so that undos of every write must be done, however, this cannot be done for already committed transactions that read changes caused by the aborted transaction durability property of ACID in recoverable schedule a transaction T is committed after all other transactions that affected T commit (i.e., they changed data later read by T) if reading changed data is allowed only for committed transactions, we also avoid cascade aborts of transactions Example: T1 transfers 1000 USD from A to B, T2 adds annual interests T1 T2 R(A) A := A - 1000 ABORT commited, cannot be undone! cascade aborts R(A) A := A *1.01 R(B) B := B *1.01 W(B) B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 18

Protocols for concurrent transaction scheduling transaction scheduler works under some protocol that allows to guarantee the ACID properties and maximal throughput pessimistic control (highly concurrent workloads) locking protocols time stamps optimistic control (not very concurrent workloads) why protocol? the scheduler cannot create the entire schedule beforehand scheduling is performed in local time context dynamic transaction execution, branching parts in code DB client(s) T1 T2 T3 R(A) R(C) R(A) W(B) W(C) R(B) R(C) Schedule T1 T2 T3 R(A) R(C) R(A) W(C) R(B) W(B) R(C) B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 19

Locking protocols locking of database entities can be used to control the order of reads and writes and so to secure the conflict serializability exclusive locks X(A) locks A so that reads and writes of A are allowed only to the lock owner/creator can be granted to just one transaction shared locks S(A) only reads of A are allowed can be granted to (shared by) multiple transactions unlocking by U(A) if a lock that is not available is required for a transaction, the transaction execution is suspended and waits for releasing the lock in the schedule, the lock request is denoted, followed by empty rows of waiting the un/locking code is added by the transaction scheduler i.e., operation on locks appear just in the schedules, not in the original transaction code B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 20

Example: schedule with locking T1 X(C) R(C) W(C) S(A) R(A) U(C) U(A) T2 X(A) U(A) S(B) R(B) X(C) S(A) W(C) R(A) U(B) U(C) U(A) X(B) W(B) U(B) S(A) T1 X(C) T2 X(A) S(B) S(A) X(B) X(C) order of actions in schedule B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 21

Two-phase locking protocol (2PL) 2PL protocol applies two rules for building the schedule: 1) if a transaction wants to read (write) an entity A, it must first acquire a shared (exclusive) lock on A 2) transaction cannot requests a lock, if it already released one (regardless of the locked entity) Two obvious phases locking and unlocking Example: 2PL adjustment of the second transaction in the previous schedule X(A) X(C) X(B) /ABORT B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 22

Properties of 2PL the 2PL restriction of schedule ensures that the precedence graph is acyclic, i.e., the schedule is conflict serializable 2PL does not guarantee recoverable schedules Example: 2PL-compliant schedule, but not recoverable, if T1 aborts T1 X(A) R(A) U(A) ABORT / T2 X(A) R(A) A := A *1.01 X(B) U(A) R(B) B := B *1.01 W(B) U(B) T1 RW, WR, WW T2 B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 23

Strict 2PL Strict 2PL protocol makes the second rule of 2PL stronger, so that both rules become: 1) if a transaction wants to read (write) an entity A, it must first acquire a shared (exclusive) lock on A 2) all locks are released at the transaction termination Example: strict 2PL adjustment of second transaction in the previous example X(A) X(B) X(C) /ABORT Insertions of U(A) are not needed (implicit at the time of /ABORT). B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 24

Properties of strict 2PL the 2PL restriction of schedule ensures that the precedence graph is acyclic, i.e., the schedule is conflict serializable moreover, strict 2PL ensures schedule recoverability avoids cascade aborts Example: schedule built using strict 2PL T1 S(A) R(A) X(C) R(C) W(C) ABORT / T2 S(A) R(A) X(B) R(B) W(B) T1 T2 B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 25

Deadlock during transaction execution it may happen that transaction T 1 requests a lock that was already granted to T 2, but T 2 cannot release it because it waits for another lock kept by T 1 could be generalized to multiple transactions, T1 waits for T 2, T 2 waits for T 3,..., T n waits for T 1 strict 2PL cannot prevent from deadlock (not speaking about the weaker protocols) Example: T1 T2 T3 T4 S(A) R(A) X(B) W(B) S(B) waiting for a lock S(C) R(C) X(C) waiting for a lock X(B) X(A) waiting for a lock waiting for a lock all transactions wait for a lock no one can release a lock scheduler cannot schedule nor execute transactions deadlock B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 26

Deadlock detection deadlock can be detected by repeated checking the waits-for graph waits-for graph is a dynamic graph that captures the waiting of transactions for locks nodes are active transactions an edge denotes waiting of transaction for lock kept by another transaction a cycle in the graph = deadlock Example: waits-for graph for the previous example (a) T3 requests X(A) (b) T3 does not request X(A) T1 T2 T1 T2 T4 T3 T4 T3 B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 27

Deadlock resolution and prevention deadlocks are usually not very frequent, so the resolution could be simple abort of the waiting transaction and its restart (user will not notice) testing waits-for graph if a deadlock occurs, abort and restart a transaction in the cycle such transaction is aborted, that holds the smallest number of locks performed the least amount of work is far from completion an aborted transaction is not aborted again (if another deadlock occurs) deadlocks could be prevented prioritizing each transaction has a priority (e.g., time stamp); if T1 requests a lock kept by T2, the lock manager chooses between two strategies wait-die if T1 has higher priority, it can wait, if not, it is aborted and restarted wound-wait if T1 has higher priority, T2 is aborted, otherwise T1 waits B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 28

Coffman Conditions Deadlocks can arise if all of the following conditions hold simultaneously in a system Mutual exclusion resources can be held in a non-shareable mode Resource holding (hold and wait) additional resources may be requested even when already some resources are held No preemption resources can be released only voluntarily Circular wait transactions can request and wait for resources in cycles Unfulfillment of any of these conditions is enough to prevent deadlocks from occurring B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 29

Phantom now consider dynamic database allowing inserts and deletes if one transaction works with some set of data entities, while another transaction changes this set (inserts or deletes), it could lead to inconsistent database (inserializable schedule) Why? T1 locks all entities that at the given moment are relevant e.g., fulfill some WHERE condition of a SELECT command during execution of T1 a new transaction T2 could logically extend the set of entities i.e., at that moment the number of locks defined by WHERE would be larger so that some entities are locked and some are not applied also to strict 2PL B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 30

Example phantom T1: find the oldest male and female employees (SELECT * FROM Employees...) + INSERT INTO Statistics... T2: insert new employee Phill and delete employee Eve (employee replacement) (INSERT INTO Employees..., DELETE FROM Employees...) Initial state of the database: {[Peter, 52, m], [John, 46, m], [Eve, 55, f], [Dana, 30, f]} T1 lock men, i.e., S(Peter) S(John) M = max{r(peter), R(John)} T2 Insert(Phill, 72, m) X(Eve) Delete(Eve) phantom a new male employee can be inserted, although all men should be locked lock women, i.e., S(Dana) F = max{r(dana)} Insert(M, F) // result is inserted into table Statistics Although the schedule is strict 2PL compliant, the result [Peter, Dana] is not correct as it does not follow the serial schedule T1, T2, resulting in [Peter, Eve], nor T2, T1, resulting [Phill, Dana]. B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 31

Phantom prevention if there do not exist indexes, everything relevant must be locked e.g., entire table or even multiple tables must be locked if there exist indexes (e.g., B + -trees) on the entities defined by the lock condition, it is possible to watch for phantom at the index level index locking external attempt for the set modification is identified by the index locks updated as an index usually maintains just one attribute, its applicability is limited generalization of index locking is predicate locking, when the locks are requested for the logical sets, not particular data instances however, this is hard to implement and so not used much in practice B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 32

Optimistic (not locking) protocols if concurrently executed transactions are not often in conflict (not competing for resources), the locking overhead is unnecessarily large 3-phase optimistic protocol 1. Read: transaction reads data from database but writes into its private local data space 2. Validation: if the transaction wants to commit, it forwards the private data space to the transaction manager (i.e., request on database update) the transaction manager decides if the update is in conflict with another transaction if there is a conflict, the transaction is aborted and restarted if not, the last phase takes place: 3. Write: the private data space is copied into the database B0B36DBS, BD6B36DBS: Database Systems Lecture 9: Database Transactions 24. 4. 2018 33