H: Transactions, Concurrency, Recovery
COMP9315 24T1 ♢ Lectures Part H ♢ [0/92]
A transaction (tx) is ...
- a single application-level operation
- performed by a computation involving multiple DB operations
A transaction effects a state change on the DB
COMP9315 24T1 ♢ Lectures Part H ♢ [1/92]
❖ Transaction Processing (cont)
Transaction states:
- COMMIT ⇒ all changes preserved
- ABORT ⇒ database unchanged
COMP9315 24T1 ♢ Lectures Part H ♢ [2/92]
❖ Transaction Processing (cont)
Concurrent transactions are
- desirable, for improved performance (throughput)
- problematic, because of potential unwanted interactions
To ensure problem-free concurrent transactions:
- Atomic ... whole effect of tx, or nothing
- Consistent ... individual tx's are "correct" (wrt application)
- Isolated ... each tx behaves as if no concurrency
- Durable ... effects of committed tx's persist
COMP9315 24T1 ♢ Lectures Part H ♢ [3/92]
❖ Transaction Processing (cont)
Transaction processing:
- the study of techniques for realising ACID properties
Consistency is the property:
- a tx is correct with respect to its own specification
- a tx performs a mapping that maintains all DB constraints
Ensuring this must be left to application programmers.
Our discussion focusses on:
Atomicity, Durability, Isolation
COMP9315 24T1 ♢ Lectures Part H ♢ [4/92]
❖ Transaction Processing (cont)
Atomicity is handled by the commit and abort mechanisms
- commit ends tx and ensures all changes are saved/persisted
- abort ends tx and undoes changes "already made"
Durability is handled by implementing stable storage, via
- redundancy, to deal with hardware failures
- logging/checkpoint mechanisms, to recover state
Isolation is handled by concurrency control mechanisms
- possibilities: lock-based, timestamp-based, check-based
- various levels of isolation are possible (e.g. serializable)
COMP9315 24T1 ♢ Lectures Part H ♢ [5/92]
❖ Transaction Processing (cont)
Where transaction processing fits in the DBMS:
COMP9315 24T1 ♢ Lectures Part H ♢ [6/92]
❖ Transaction Terminology
To describe transaction effects, we consider:
- READ ... transfer data from "disk" to memory
- WRITE ... transfer data from memory to "disk"
- ABORT ... terminate the transaction, unsuccessfully
- COMMIT ... terminate the transaction, successfully
Relationship between the above operations and SQL:
- SELECT produces READ operations on the database
- UPDATE and DELETE produce READ then WRITE operations
- INSERT produces WRITE operations
COMP9315 24T1 ♢ Lectures Part H ♢ [7/92]
❖ Transaction Terminology (cont)
More on transactions and SQL:
- BEGIN starts a transaction
  - the begin keyword in PLpgSQL is not the same thing
- COMMIT commits and ends the current transaction
  - some DBMSs (e.g. PostgreSQL) also provide END as a synonym
  - the end keyword in PLpgSQL is not the same thing
- ROLLBACK aborts the current transaction, undoing any changes
  - some DBMSs (e.g. PostgreSQL) also provide ABORT as a synonym
In PostgreSQL, tx's cannot be defined inside functions (e.g. PLpgSQL)
COMP9315 24T1 ♢ Lectures Part H ♢ [8/92]
❖ Transaction Terminology (cont)
The READ, WRITE, ABORT, COMMIT operations:
- occur in the context of some transaction T
- involve manipulation of data items X, Y, ... (READ and WRITE)
The operations are typically denoted as:
RT(X) ... read item X in transaction T
WT(X) ... write item X in transaction T
AT    ... abort transaction T
CT    ... commit transaction T
COMP9315 24T1 ♢ Lectures Part H ♢ [9/92]
A schedule gives the sequence of operations from ≥ 1 tx
Serial schedule for a set of tx's T1 .. Tn
- all operations of Ti complete before Ti+1 begins
E.g. RT1(A) WT1(A) RT2(B) RT2(A) WT3(C) WT3(B)
Concurrent schedule for a set of tx's T1 .. Tn
- operations from individual Ti's are interleaved
E.g. RT1(A) RT2(B) WT1(A) WT3(C) WT3(B) RT2(A)
COMP9315 24T1 ♢ Lectures Part H ♢ [10/92]
Serial schedules guarantee database consistency
- each Ti commits before Ti+1
- prior to Ti database is consistent
- after Ti database is consistent (assuming Ti is correct)
- before Ti+1 database is consistent ...
Concurrent schedules interleave tx operations arbitrarily
- and may produce a database that is not consistent, even after all of the transactions have committed successfully
COMP9315 24T1 ♢ Lectures Part H ♢ [11/92]
What problems can occur with (uncontrolled) concurrent tx's?
The set of phenomena can be characterised broadly under:
- dirty read:
reading a data item written by a concurrent uncommitted tx
- nonrepeatable read:
re-reading a data item that has since been changed by another concurrent tx
- phantom read:
re-scanning a result set and finding it changed by another tx
COMP9315 24T1 ♢ Lectures Part H ♢ [12/92]
If a concurrent schedule on a set of tx's TT ...
- produces the same effect as a serial schedule on TT
- then we say that the schedule is serializable
Primary goal of isolation mechanisms (see later) is
- arrange execution of individual operations in tx's in TT
- to ensure that a serializable schedule is produced
Serializability is one property of a schedule, focusing on isolation
Other properties of schedules focus on recovering from failures
COMP9315 24T1 ♢ Lectures Part H ♢ [13/92]
So far, we have implicitly assumed that all transactions commit.
Additional problems can arise when transactions abort.
Consider the following schedule where transaction T1 fails:
T1: R(X) W(X) A
T2: R(X) W(X) C
Abort will roll back the changes to X, but ...
Consider three places where the rollback might occur:
T1: R(X) W(X) A [1] [2] [3]
T2: R(X) W(X) C
COMP9315 24T1 ♢ Lectures Part H ♢ [14/92]
❖ Transaction Failure (cont)
Abort / rollback scenarios:
T1: R(X) W(X) A [1] [2] [3]
T2: R(X) W(X) C
Case [1] is ok
- all effects of T1 vanish; final effect is simply from T2
Case [2] is problematic
- some of T1's effects persist, even though T1 aborted
Case [3] is also problematic
- T2's effects are lost, even though T2 committed
COMP9315 24T1 ♢ Lectures Part H ♢ [15/92]
Consider the serializable schedule:
T1: R(X) W(Y) C
T2: W(X) A
(where the final value of Y is dependent on the X value)
Notes:
- the final value of X is valid (change from T2 rolled back)
- T1 reads/uses an X value that is eventually rolled-back
- even though T2 is correctly aborted, it has produced an effect
Produces an invalid database state, even though serializable.
COMP9315 24T1 ♢ Lectures Part H ♢ [16/92]
Recoverable schedules avoid these kinds of problems.
For a schedule to be recoverable, we require additional constraints
- all tx's Ti that wrote values used by Tj must have committed before Tj commits
- and this property must hold for all transactions Tj
Note that recoverability does not prevent "dirty reads".
To make schedules recoverable in the presence of dirty reads and aborts, we may need to abort multiple transactions.
COMP9315 24T1 ♢ Lectures Part H ♢ [17/92]
Recall the earlier non-recoverable schedule:
T1: R(X) W(Y) C
T2: W(X) A
To make it recoverable requires:
- delaying T1's commit until T2 commits
- if T2 aborts, cannot allow T1 to commit
T1: R(X) W(Y) ... C? A!
T2: W(X) A
Known as cascading aborts (or cascading rollback).
COMP9315 24T1 ♢ Lectures Part H ♢ [18/92]
❖ Cascading Aborts (cont)
Example: T3 aborts, causing T2 to abort, causing T1 to abort
T1: R(Y) W(Z) A
T2: R(X) W(Y) A
T3: W(X) A
Even though T1 has no direct connection with T3 (i.e. no shared data).
This kind of problem ...
- can potentially affect very many concurrent transactions
- could have a significant impact on system throughput
COMP9315 24T1 ♢ Lectures Part H ♢ [19/92]
❖ Cascading Aborts (cont)
Cascading aborts can be avoided if tx's can only read values written by committed tx's.
Effectively: eliminate the possibility of reading dirty data.
Downside: reduces opportunity for concurrency.
These are called ACR (avoid cascading rollback) schedules.
All ACR schedules are also recoverable.
COMP9315 24T1 ♢ Lectures Part H ♢ [20/92]
Strict schedules also eliminate the chance of writing dirty data.
A schedule is strict if
- no tx can read values written by another uncommitted tx (ACR)
- no tx can write a data item written by another uncommitted tx
Strict schedules simplify the task of rolling back after aborts.
COMP9315 24T1 ♢ Lectures Part H ♢ [21/92]
Example: non-strict schedule
T1: W(X) A
T2: W(X) A
Problems with handling rollback after aborts:
- when T1 aborts, don't roll back (need to retain the value written by T2)
- when T2 aborts, need to roll back to pre-T1 (not just pre-T2)
COMP9315 24T1 ♢ Lectures Part H ♢ [22/92]
Relationship between various classes of schedules:
Schedules ought to be serializable and strict.
But more serializable/strict ⇒ less concurrency.
DBMSs allow users to trade off "safety" against performance.
COMP9315 24T1 ♢ Lectures Part H ♢ [23/92]
Simplest form of isolation: serial execution (T1 ; T2 ; T3 ; ...)
Problem: serial execution yields poor throughput.
Concurrency control schemes (CCSs) aim for "safe" concurrency
Abstract view of DBMS concurrency mechanisms:
COMP9315 24T1 ♢ Lectures Part H ♢ [24/92]
Consider two schedules S1 and S2 produced by
- executing the same set of transactions T1..Tn concurrently
- but with a non-serial interleaving of R/W operations
S1 and S2 are equivalent if StateAfter(S1) = StateAfter(S2)
- i.e. the final state yielded by S1 is the same as the final state yielded by S2
S is a serializable schedule (for a set of concurrent tx's T1..Tn) if
- S is equivalent to some serial schedule Ss of T1..Tn
Under these circumstances, consistency is guaranteed
(assuming no aborted transactions and no system failures)
COMP9315 24T1 ♢ Lectures Part H ♢ [25/92]
Two formulations of serializability:
- conflict serializability
  - i.e. conflicting R/W operations occur in the "right order"
  - checked via a precedence graph; look for absence of cycles
- view serializability
  - i.e. read operations see the correct version of data
  - checked via VS conditions on likely equivalent schedules
View serializability is strictly weaker than conflict serializability.
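To make the precedence-graph test concrete, here is a minimal sketch in C (not from the course materials): an edge Ti → Tj is recorded whenever an operation of Ti conflicts with a later operation of Tj, and the schedule is conflict-serializable iff the graph is acyclic.

#include <stdbool.h>
#include <stdio.h>

#define MAXTX 10

/* edge[i][j] is true if Ti must precede Tj
   (Ti has an op conflicting with a later op of Tj) */
static bool edge[MAXTX][MAXTX];
static int color[MAXTX];   /* 0 = unvisited, 1 = in progress, 2 = done */

static bool hasCycleFrom(int i, int n) {
    color[i] = 1;
    for (int j = 0; j < n; j++) {
        if (!edge[i][j]) continue;
        if (color[j] == 1) return true;            /* back edge => cycle */
        if (color[j] == 0 && hasCycleFrom(j, n)) return true;
    }
    color[i] = 2;
    return false;
}

/* a schedule is conflict-serializable iff its precedence graph is acyclic */
static bool conflictSerializable(int n) {
    for (int i = 0; i < n; i++) color[i] = 0;
    for (int i = 0; i < n; i++)
        if (color[i] == 0 && hasCycleFrom(i, n)) return false;
    return true;
}

int main(void) {
    /* schedule R1(X) W2(X) R2(Y) W1(Y): T1->T2 on X, T2->T1 on Y => cycle */
    edge[0][1] = true;
    edge[1][0] = true;
    printf("conflict-serializable? %s\n",
           conflictSerializable(2) ? "yes" : "no");
    return 0;
}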
COMP9315 24T1 ♢ Lectures Part H ♢ [26/92]
❖ Transaction Isolation Levels
SQL programmers' concurrency control mechanism ...
set transaction
    [ read only | read write ]
    [ isolation level
        { read uncommitted | read committed | repeatable read | serializable } ]
Applies to current tx only; affects how scheduler treats this tx.
COMP9315 24T1 ♢ Lectures Part H ♢ [27/92]
❖ Transaction Isolation Levels (cont)
Implication of transaction isolation levels:
Isolation Level    Dirty Read     Nonrepeatable Read   Phantom Read
Read Uncommitted   Possible       Possible             Possible
Read Committed     Not Possible   Possible             Possible
Repeatable Read    Not Possible   Not Possible         Possible
Serializable       Not Possible   Not Possible         Not Possible
COMP9315 24T1 ♢ Lectures Part H ♢ [28/92]
❖ Transaction Isolation Levels (cont)
For transaction isolation, PostgreSQL
- provides syntax for all four levels
- treats read uncommitted as read committed
- repeatable read behaves like serializable
- default level is read committed
Note: cannot implement read uncommitted because of MVCC
For more details, see PostgreSQL Documentation section 13.2
- extensive discussion of the semantics of UPDATE, INSERT, DELETE
COMP9315 24T1 ♢ Lectures Part H ♢ [29/92]
❖ Transaction Isolation Levels (cont)
A PostgreSQL tx consists of a sequence of SQL statements:
BEGIN S1; S2; ... Sn; COMMIT;
Isolation levels affect view of DB provided to each Si:
- in read committed ...
- each Si sees snapshot of DB at start of Si
- in repeatable read and serializable ...
- each Si sees snapshot of DB at start of tx
- serializable checks for extra conditions
Transactions fail if the system detects violation of isolation level.
COMP9315 24T1 ♢ Lectures Part H ♢ [30/92]
Approaches to concurrency control:
- Lock-based
- Synchronise tx execution via locks on relevant part of DB.
- Version-based (multi-version concurrency control)
- Allow multiple consistent versions of the data to exist.
Each tx has access only to version existing at start of tx.
- Validation-based (optimistic concurrency control)
- Execute all tx's; check for validity problems on commit.
- Timestamp-based
- Organise tx execution via timestamps assigned to actions.
COMP9315 24T1 ♢ Lectures Part H ♢ [31/92]
❖ Lock-based Concurrency Control
Locks introduce additional mechanisms in the DBMS:
The Lock Manager manages the locks requested by the scheduler
COMP9315 24T1 ♢ Lectures Part H ♢ [32/92]
❖ Lock-based Concurrency Control (cont)
Lock table entries contain:
- object being locked (DB, table, tuple, field)
- type of lock: read/shared, write/exclusive
- FIFO queue of tx's requesting this lock
- count of tx's currently holding lock (max 1 for write locks)
Lock and unlock operations must be atomic.
Lock upgrade:
- if a tx holds a read lock, and it is the only tx holding that lock
- then the lock can be converted into a write lock
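As an illustration only (the field names are assumptions, not PostgreSQL's actual lock manager), a lock table entry and its compatibility check might look like:

#include <stdbool.h>
#include <stdio.h>

typedef enum { READ_LOCK, WRITE_LOCK } LockMode;

typedef struct {
    int      objectId;       /* object being locked (DB, table, tuple, field) */
    LockMode mode;           /* read/shared or write/exclusive */
    int      holders;        /* # tx's currently holding (max 1 for write) */
    int      waitQueue[16];  /* FIFO queue of tx's requesting this lock */
    int      queueLen;
} LockEntry;

/* may a new request of the given mode be granted immediately? */
static bool compatible(const LockEntry *e, LockMode requested) {
    if (e->holders == 0) return true;                       /* lock is free */
    return e->mode == READ_LOCK && requested == READ_LOCK;  /* shared readers */
}

int main(void) {
    LockEntry e = { .objectId = 1, .mode = READ_LOCK, .holders = 2 };
    printf("read request:  %s\n", compatible(&e, READ_LOCK)  ? "granted" : "blocked");
    printf("write request: %s\n", compatible(&e, WRITE_LOCK) ? "granted" : "blocked");
    return 0;
}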
COMP9315 24T1 ♢ Lectures Part H ♢ [33/92]
❖ Lock-based Concurrency Control (cont)
Synchronise access to shared data items via following rules:
- before reading X, get read (shared) lock on X
- before writing X, get write (exclusive) lock on X
- a tx attempting to get a read lock on X is blocked if another tx already has a write lock on X
- a tx attempting to get a write lock on X is blocked if another tx has any kind of lock on X
These rules alone do not guarantee serializability.
COMP9315 24T1 ♢ Lectures Part H ♢ [34/92]
❖ Lock-based Concurrency Control (cont)
Consider the following schedule, using locks:
T1(a): Lr(Y) R(Y)
T2(a): Lr(X) R(X) U(X)
T1(b): U(Y) Lw(X) W(X) U(X)
T2(b): Lw(Y)....W(Y) U(Y)
(where Lr = read-lock, Lw = write-lock, U = unlock)
Locks correctly ensure controlled access to X and Y.
Despite this, the schedule is not serializable. (Ex: prove this)
COMP9315 24T1 ♢ Lectures Part H ♢ [35/92]
To guarantee serializability, we require an additional constraint:
- in every tx, all lock requests precede all unlock requests
Each transaction is then structured as:
- growing phase where locks are acquired
- action phase where "real work" is done
- shrinking phase where locks are released
Clearly, this reduces potential concurrency ...
COMP9315 24T1 ♢ Lectures Part H ♢ [36/92]
Appropriate locking can guarantee correctness.
However, it also introduces potential undesirable effects:
- Deadlock
- No transactions can proceed; each waiting on lock held by another.
- Starvation
- One transaction is permanently "frozen out" of access to data.
- Reduced performance
- Locking introduces delays while waiting for locks to be released.
COMP9315 24T1 ♢ Lectures Part H ♢ [37/92]
Deadlock occurs when two transactions are each waiting for a lock on an item held by the other.
Example:
T1: Lw(A) R(A) Lw(B) ......
T2: Lw(B) R(B) Lw(A) .....
How to deal with deadlock?
- prevent it happening in the first place
- let it happen, detect it, recover from it
COMP9315 24T1 ♢ Lectures Part H ♢ [38/92]
Handling deadlock involves forcing a transaction to "back off"
- select process to roll back
- choose on basis of how far tx has progressed, # locks held, ...
- roll back the selected process
- how far does it need to be rolled back?
- worst-case scenario: abort one transaction, then retry
- prevent starvation
- need methods to ensure that same tx isn't always chosen
COMP9315 24T1 ♢ Lectures Part H ♢ [39/92]
Methods for managing deadlock
- timeout : set max time limit for each tx
- waits-for graph : records that Tj is waiting on a lock held by Tk
  - prevent deadlock by checking for a new cycle ⇒ abort some tx in the cycle
  - detect deadlock by periodic check for cycles ⇒ abort some tx in the cycle
- timestamps : use tx start times as basis for priority
- scenario: Tj tries to get lock held by Tk ...
- wait-die: if Tj < Tk, then Tj waits; if Tj > Tk, then Tj rolls back
- wound-wait: if Tj < Tk, then Tk rolls back; if Tj > Tk, then Tj waits
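The two policies fit in a few lines of C; this sketch assumes each tx carries its start timestamp, with a smaller timestamp meaning an older, higher-priority tx:

#include <stdio.h>

typedef enum { WAIT, ROLLBACK_SELF, WOUND_HOLDER } Action;

/* Tj (start-time tsJ) requests a lock held by Tk (start-time tsK) */
static Action waitDie(long tsJ, long tsK) {
    return tsJ < tsK ? WAIT : ROLLBACK_SELF;   /* older waits, younger dies */
}

static Action woundWait(long tsJ, long tsK) {
    return tsJ < tsK ? WOUND_HOLDER : WAIT;    /* older wounds, younger waits */
}

int main(void) {
    /* an older tx (ts=1) requests a lock held by a younger tx (ts=2) */
    printf("wait-die:   %s\n", waitDie(1, 2)   == WAIT ? "wait" : "rollback self");
    printf("wound-wait: %s\n", woundWait(1, 2) == WOUND_HOLDER ? "wound holder" : "wait");
    return 0;
}

Both are deadlock-free because waiting is only ever permitted in one timestamp direction.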
COMP9315 24T1 ♢ Lectures Part H ♢ [40/92]
Properties of deadlock handling methods:
- both wait-die and wound-wait are fair
- wait-die tends to
- roll back tx's that have done little work
- but rolls back tx's more often
- wound-wait tends to
- roll back tx's that may have done significant work
- but rolls back tx's less often
- timestamps are easier to implement than a waits-for graph
- a waits-for graph minimises rollbacks caused by deadlock
COMP9315 24T1 ♢ Lectures Part H ♢ [41/92]
❖ Optimistic Concurrency Control
Locking is a pessimistic approach to concurrency control:
- limit concurrency to ensure that conflicts don't occur
Costs: lock management, deadlock handling, contention.
In scenarios where there are far more reads than writes ...
- don't lock (allow arbitrary interleaving of operations)
- check just before commit that no conflicts occurred
- if problems, roll back conflicting transactions
Optimistic concurrency control (OCC) is a strategy to realise this.
COMP9315 24T1 ♢ Lectures Part H ♢ [42/92]
❖ Optimistic Concurrency Control (cont)
Under OCC, transactions have three distinct phases:
- Reading: read from database, modify local copies of data
- Validation: check for conflicts in updates
- Writing: commit local copies of data to database
Timestamps are recorded at points S, V, F:
COMP9315 24T1 ♢ Lectures Part H ♢ [43/92]
❖ Optimistic Concurrency Control (cont)
Data structures needed for validation:
- S ... set of txs that are reading data and computing results
- V ... set of txs that have reached validation (not yet committed)
- F ... set of txs that have finished (committed data to storage)
- for each Ti, timestamps for when it reached S, V, F
- RS(Ti) set of all data items read by Ti
- WS(Ti) set of all data items to be written by Ti
Use the V timestamps as the ordering for transactions
- assume a serial tx order based on the ordering of the V(Ti)'s
COMP9315 24T1 ♢ Lectures Part H ♢ [44/92]
❖ Optimistic Concurrency Control (cont)
Two-transaction example:
- allow transactions T1 and T2 to run without any locking
- check that objects used by T2 are not being changed by T1
- if they are, we need to roll back T2 and retry
Case 0: serial execution ... no problem
COMP9315 24T1 ♢ Lectures Part H ♢ [45/92]
❖ Optimistic Concurrency Control (cont)
Case 1: reading overlaps validation/writing
- T2 starts while T1 is validating/writing
- if some X being read by T2 is in WS(T1)
- then T2 may not have read the updated version of X
- so, T2 must start again
COMP9315 24T1 ♢ Lectures Part H ♢ [46/92]
❖ Optimistic Concurrency Control (cont)
Case 2: reading/validation overlaps validation/writing
- T2 starts validating while T1 is validating/writing
- if some X being written by T2 is in WS(T1)
- then T2 may end up overwriting T1's update
- so, T2 must start again
COMP9315 24T1 ♢ Lectures Part H ♢ [47/92]
❖ Optimistic Concurrency Control (cont)
Validation check for transaction T
- for all transactions Ti ≠ T
  - if T∈S & Ti∈F, then ok
  - if T∉V & V(Ti) < S(T) < F(Ti), then check WS(Ti) ∩ RS(T) is empty
  - if T∈V & V(Ti) < V(T) < F(Ti), then check WS(Ti) ∩ WS(T) is empty
If this check fails for any Ti, then T is rolled back.
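A sketch of this validation test in C, mirroring the three conditions above; the bitmap representation of the read/write sets and the field names are assumptions for illustration:

#include <stdbool.h>
#include <stdio.h>

typedef struct Tx {
    long S, V, F;               /* timestamps; meaningful once phase reached */
    bool inS, inV, inF;         /* which phase set the tx belongs to */
    unsigned long readSet;      /* RS: bitmap of data items read */
    unsigned long writeSet;     /* WS: bitmap of data items to be written */
} Tx;

static bool intersects(unsigned long a, unsigned long b) {
    return (a & b) != 0;
}

/* returns true if T passes validation against all other tx's */
static bool validate(const Tx *T, const Tx others[], int n) {
    for (int i = 0; i < n; i++) {
        const Tx *Ti = &others[i];
        if (Ti == T) continue;
        if (T->inS && Ti->inF) continue;        /* Ti already finished: ok */
        if (!T->inV && Ti->V < T->S && T->S < Ti->F
            && intersects(Ti->writeSet, T->readSet))
            return false;                       /* T may have missed Ti's write */
        if (T->inV && Ti->V < T->V && T->V < Ti->F
            && intersects(Ti->writeSet, T->writeSet))
            return false;                       /* T may overwrite Ti's write */
    }
    return true;                                /* otherwise T may commit */
}

int main(void) {
    Tx T  = { .S = 1, .V = 5, .inV = true, .readSet = 0x1, .writeSet = 0x2 };
    Tx Ti = { .V = 2, .F = 6, .writeSet = 0x2 };   /* overlapping write set */
    Tx others[] = { Ti };
    printf("T validates? %s\n", validate(&T, others, 1) ? "yes" : "no");  /* no */
    return 0;
}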
COMP9315 24T1 ♢ Lectures Part H ♢ [48/92]
❖ Optimistic Concurrency Control (cont)
OCC prevents: T reading dirty data, T overwriting Ti's changes
Problems with OCC:
- increased roll backs**
- tendency to roll back "complete" tx's
- cost to maintain S,V,F sets
** "Roll back" is relatively cheap
- changes to data are purely local before Writing phase
- no requirement for logging info or undo/redo (see later)
COMP9315 24T1 ♢ Lectures Part H ♢ [49/92]
❖ Multi-version Concurrency Control
Multi-version concurrency control (MVCC) aims to
- retain benefits of locking, while getting more concurrency
- by providing multiple (consistent) versions of data items
Achieves this by
- readers access an "appropriate" version of each data item
- writers make new versions of the data items they modify
Main difference between MVCC and standard locking:
- read locks do not conflict with write locks ⇒ reading never blocks writing, writing never blocks reading
COMP9315 24T1 ♢ Lectures Part H ♢ [50/92]
❖ Multi-version Concurrency Control (cont)
WTS = timestamp of tx that wrote this data item
Chained tuple versions: tupoldest → tupolder → tupnewest
When a reader Ti is accessing the database
- ignore any data item D created after Ti started
- checked by: min(WTS(D)) > TS(Ti)
- use only newest version V accessible to Ti
- determined by: max(WTS(V)) < TS(Ti)
COMP9315 24T1 ♢ Lectures Part H ♢ [51/92]
❖ Multi-version Concurrency Control (cont)
When a writer Ti attempts to change a data item
- find newest version V satisfying WTS(V) < TS(Ti)
- if no later versions exist, create new version of data item
- if there are later versions, then abort Ti
Some MVCC versions also maintain RTS (TS of last reader)
- don't allow Ti to write D if RTS(D) > TS(Ti)
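A minimal sketch of these reader/writer rules over a chain of tuple versions (illustrative field names, not PostgreSQL's representation):

#include <stddef.h>
#include <stdbool.h>
#include <stdio.h>

typedef struct Version {
    long wts;                  /* TS of the tx that wrote this version */
    int  value;
    struct Version *newer;     /* chain: oldest -> ... -> newest */
} Version;

/* reader: use the newest version written before Ti started */
static Version *readVersion(Version *oldest, long tsTi) {
    Version *best = NULL;
    for (Version *v = oldest; v != NULL; v = v->newer)
        if (v->wts < tsTi && (best == NULL || v->wts > best->wts))
            best = v;
    return best;               /* NULL => item created after Ti started */
}

/* writer: true if Ti may create a new version (no later version exists) */
static bool mayWrite(Version *oldest, long tsTi) {
    for (Version *v = oldest; v != NULL; v = v->newer)
        if (v->wts > tsTi) return false;      /* later version => abort Ti */
    return true;
}

int main(void) {
    Version v1 = { 5, 10, NULL };
    Version v0 = { 1, 8, &v1 };                         /* chain: v0 -> v1 */
    Version *r = readVersion(&v0, 3);                   /* reader started at ts 3 */
    printf("reader sees value %d\n", r ? r->value : -1);        /* 8 */
    printf("writer (ts 3) may write? %s\n", mayWrite(&v0, 3) ? "yes" : "no");  /* no */
    return 0;
}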
COMP9315 24T1 ♢ Lectures Part H ♢ [52/92]
❖ Multi-version Concurrency Control (cont)
Advantage of MVCC
- locking needed for serializability considerably reduced
Disadvantages of MVCC
- visibility-check overhead (on every tuple read/write)
- reading an item V causes an update of RTS(V) (if used)
- storage overhead for extra versions of data items
- overhead in removing out-of-date versions of data items
Despite apparent disadvantages, MVCC is very effective.
COMP9315 24T1 ♢ Lectures Part H ♢ [53/92]
❖ Multi-version Concurrency Control (cont)
Removing old versions:
- Vj and Vk are versions of same item
- WTS(Vj) and WTS(Vk) precede TS(Ti) for all Ti
- remove version with smaller WTS(Vx) value
When to make this check?
- every time a new version of a data item is added?
- periodically, with fast access to blocks of data
PostgreSQL uses the latter (vacuum).
COMP9315 24T1 ♢ Lectures Part H ♢ [54/92]
❖ Concurrency Control in PostgreSQL
PostgreSQL uses two styles of concurrency control:
- multi-version concurrency control (MVCC)
  (used in implementing SQL DML statements, e.g. select)
- two-phase locking (2PL)
  (used in implementing SQL DDL statements, e.g. create table)
From the SQL (PLpgSQL) level:
- can let the lock/MVCC system handle concurrency
- can handle it explicitly via LOCK statements
COMP9315 24T1 ♢ Lectures Part H ♢ [55/92]
❖ Concurrency Control in PostgreSQL (cont)
PostgreSQL provides read committed and serializable isolation levels.
Using the serializable isolation level, a select:
- sees only data committed before the transaction began
- never sees changes made by concurrent transactions
Using the serializable isolation level, an update fails:
- if it tries to modify an "active" data item
(active = affected by some other tx, either committed or uncommitted)
The transaction containing the update must then roll back and re-start.
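In practice the client drives this retry. A hedged sketch using libpq (the accounts table and connection settings are placeholders for this sketch; serialization failures are reported with SQLSTATE 40001):

#include <stdio.h>
#include <string.h>
#include <libpq-fe.h>

int main(void) {
    /* connection parameters taken from the usual PG* environment variables */
    PGconn *conn = PQconnectdb("");
    if (PQstatus(conn) != CONNECTION_OK) {
        fprintf(stderr, "%s", PQerrorMessage(conn));
        return 1;
    }
    for (;;) {
        PQclear(PQexec(conn, "BEGIN ISOLATION LEVEL SERIALIZABLE"));
        /* 'accounts' is a placeholder table for this sketch */
        PGresult *res = PQexec(conn,
            "UPDATE accounts SET bal = bal - 1 WHERE id = 42");
        if (PQresultStatus(res) == PGRES_COMMAND_OK) {
            PQclear(res);
            PQclear(PQexec(conn, "COMMIT"));
            break;                              /* success */
        }
        const char *sqlstate = PQresultErrorField(res, PG_DIAG_SQLSTATE);
        int retry = (sqlstate && strcmp(sqlstate, "40001") == 0);
        PQclear(res);
        PQclear(PQexec(conn, "ROLLBACK"));
        if (!retry) { fprintf(stderr, "update failed\n"); break; }
        /* serialization failure: loop and re-run the whole tx */
    }
    PQfinish(conn);
    return 0;
}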
COMP9315 24T1 ♢ Lectures Part H ♢ [56/92]
❖ Concurrency Control in PostgreSQL (cont)
Implementing MVCC in PostgreSQL requires:
- a log file to maintain current status of each Ti
- in every tuple:
  - xmin ... ID of the tx that created the tuple
  - xmax ... ID of the tx that replaced/deleted the tuple (if any)
  - xnew ... link to newer versions of the tuple (if any)
- for each transaction Ti :
- a transaction ID (timestamp)
- SnapshotData: list of active tx's when Ti started
COMP9315 24T1 ♢ Lectures Part H ♢ [57/92]
❖ Concurrency Control in PostgreSQL (cont)
Rules for a tuple to be visible to Ti :
- the xmin (creation transaction) value must
  - be committed in the log file
  - have started before Ti's start time
  - not be active at Ti's start time
- the xmax (delete/replace transaction) value must
  - be blank or refer to an aborted tx, or
  - have started after Ti's start time, or
  - have been active at SnapshotData time
For details, see: utils/time/tqual.c
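A much-simplified sketch of such a visibility test; the tx-log lookup and the collapsing of "active at start time" into a start-time comparison are assumptions for illustration (see tqual.c for the real logic):

#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

typedef unsigned int TxId;

/* tiny stand-in for the tx log + SnapshotData (assumed for this sketch) */
typedef struct { TxId id; long start; bool committed; } TxInfo;

static TxInfo txLog[] = {
    { 100, 1, true  },        /* tuple's creator: committed, started at t=1 */
    { 200, 5, false },        /* tuple's deleter: still active */
};

static TxInfo *lookup(TxId x) {
    for (size_t i = 0; i < sizeof txLog / sizeof txLog[0]; i++)
        if (txLog[i].id == x) return &txLog[i];
    return NULL;
}

typedef struct { TxId xmin, xmax; } HeapTuple;   /* xmax == 0 => not deleted */

static bool visible(HeapTuple t, long myStart) {
    TxInfo *c = lookup(t.xmin);
    if (c == NULL || !c->committed || c->start >= myStart)
        return false;                    /* creator must commit & precede me */
    if (t.xmax == 0) return true;        /* never deleted/replaced */
    TxInfo *d = lookup(t.xmax);
    if (d == NULL || !d->committed) return true;   /* deleter aborted/active */
    return d->start >= myStart;          /* deleter started after me => visible */
}

int main(void) {
    HeapTuple t = { 100, 200 };
    printf("visible? %s\n", visible(t, 10) ? "yes" : "no");   /* yes */
    return 0;
}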
COMP9315 24T1 ♢ Lectures Part H ♢ [58/92]
❖ Concurrency Control in PostgreSQL (cont)
Tx's always see a consistent version of the database.
But may not see the "current" version of the database.
E.g. T1 does select, then concurrent T2 deletes some of T1's selected tuples
This is OK unless tx's communicate outside the database system.
E.g. T1 counts tuples; T2 deletes then counts; then counts are compared
Use locks if application needs every tx to see same current version
- LOCK TABLE locks an entire table
- SELECT FOR UPDATE locks only the selected rows
COMP9315 24T1 ♢ Lectures Part H ♢ [59/92]
Reminder: Transactions are atomic
- if a tx commits, all of its changes persist in DB
- if a tx aborts, none of its changes occur in DB
Transaction effects are durable
- if a tx commits, its effects persist (even in the event of subsequent (catastrophic) system failures)
Implementation of atomicity/durability is intertwined.
COMP9315 24T1 ♢ Lectures Part H ♢ [60/92]
What kinds of "system failures" do we need to deal with?
- single-bit inversion during transfer mem-to-disk
- decay of storage medium on disk (some data changed)
- failure of entire disk device (data no longer accessible)
- failure of DBMS processes (e.g. postgres crashes)
- operating system crash; power failure to computer room
- complete destruction of computer system running DBMS
The last requires off-site backup; all others should be locally recoverable.
COMP9315 24T1 ♢ Lectures Part H ♢ [61/92]
Consider the following scenario:
Desired behaviour after system restart:
- all effects of T1, T2 persist
- as if T3, T4 were aborted (no effects remain)
COMP9315 24T1 ♢ Lectures Part H ♢ [62/92]
Durability begins with a stable disk storage subsystem
- i.e. putPage() and getPage() always work as expected
We can prevent/minimise loss/corruption of data due to:
- mem/disk transfer corruption ⇒ parity checking
- sector failure ⇒ mark "bad" blocks
- disk failure ⇒ RAID (levels 4,5,6)
- destruction of computer system ⇒ off-site backups
COMP9315 24T1 ♢ Lectures Part H ♢ [63/92]
❖ Dealing with Transactions
The remaining "failure modes" that we need to consider:
- failure of DBMS processes or operating system
- failure of transactions (ABORT)
Standard technique for managing these:
- keep a log of changes made to database
- use this log to restore state in case of failures
COMP9315 24T1 ♢ Lectures Part H ♢ [64/92]
❖ Architecture for Atomicity/Durability
How does a DBMS provide for atomicity/durability?
COMP9315 24T1 ♢ Lectures Part H ♢ [65/92]
❖ Execution of Transactions
Transactions deal with three address spaces:
- stored data on the disk (representing global DB state)
- data in memory buffers (where held for sharing by tx's)
- data in their own local variables (where manipulated)
Each of these may hold a different "version" of a DB object.
PostgreSQL processes make heavy use of shared buffer pool
⇒ transactions do not deal with much local data.
COMP9315 24T1 ♢ Lectures Part H ♢ [66/92]
❖ Execution of Transactions (cont)
Operations available for data transfer:
- INPUT(X) ... read page containing X into a buffer
- READ(X,v) ... copy value of X from buffer to local var v
- WRITE(X,v) ... copy value of local var v to X in buffer
- OUTPUT(X) ... write buffer containing X to disk
READ/WRITE are issued by the transaction.
INPUT/OUTPUT are issued by the buffer manager (and log manager).
INPUT/OUTPUT correspond to getPage()/putPage() mentioned above.
COMP9315 24T1 ♢ Lectures Part H ♢ [67/92]
❖ Execution of Transactions (cont)
Example of transaction execution:
BEGIN
READ(A,v); v = v*2; WRITE(A,v);
READ(B,v); v = v+1; WRITE(B,v);
COMMIT
READ accesses the buffer manager and may cause INPUT.
COMMIT needs to ensure that buffer contents go to disk.
COMP9315 24T1 ♢ Lectures Part H ♢ [68/92]
❖ Execution of Transactions (cont)
States as the transaction executes:
t Action v Buf(A) Buf(B) Disk(A) Disk(B)
-----------------------------------------------------
(0) BEGIN . . . 8 5
(1) READ(A,v) 8 8 . 8 5
(2) v = v*2 16 8 . 8 5
(3) WRITE(A,v) 16 16 . 8 5
(4) READ(B,v) 5 16 5 8 5
(5) v = v+1 6 16 5 8 5
(6) WRITE(B,v) 6 16 6 8 5
(7) OUTPUT(A) 6 16 6 16 5
(8) OUTPUT(B) 6 16 6 16 6
After tx completes, we must have either
Disk(A)=8, Disk(B)=5 or Disk(A)=16, Disk(B)=6
If system crashes before (8), may need to undo disk changes.
If system crashes after (8), may need to redo disk changes.
COMP9315 24T1 ♢ Lectures Part H ♢ [69/92]
❖ Transactions and Buffer Pool
Two issues arise w.r.t. buffers:
- forcing ... OUTPUT buffer on each WRITE
- ensures durability; disk always consistent with buffer pool
- poor performance; defeats purpose of having buffer pool
- stealing ... replace buffers of uncommitted tx's
- if we don't, poor throughput (tx's blocked on buffers)
- if we do, seems to cause atomicity problems?
Ideally, we want stealing and not forcing.
COMP9315 24T1 ♢ Lectures Part H ♢ [70/92]
❖ Transactions and Buffer Pool (cont)
Handling stealing:
- transaction T loads page P and makes changes
- T2 needs a buffer, and P is the "victim"
- P is output to disk (it's dirty) and replaced
- if T aborts, some of its changes are already "committed"
- must log values changed by T in P at "steal-time"
- use these to UNDO changes in case of failure of T
COMP9315 24T1 ♢ Lectures Part H ♢ [71/92]
❖ Transactions and Buffer Pool (cont)
Handling no forcing:
- transaction T makes changes & commits, then system crashes
- but what if modified page P has not yet been output?
- must log values changed by T in P as soon as they change
- use these to support REDO to restore changes
Above scenario may be a problem, even if we are forcing
- e.g. system crashes immediately after requesting a WRITE()
COMP9315 24T1 ♢ Lectures Part H ♢ [72/92]
❖ Transactions (reminder)
A transaction is
- a single application-level operation
- implemented via multiple database operations
Transactions deal with three levels of storage
- data on disk, data in memory buffers, data in local variables
Transactions ...
- COMMIT : all changes are placed in persistent storage
- ABORT : none of the changes appear in persistent storage
COMP9315 24T1 ♢ Lectures Part H ♢ [73/92]
Three "styles" of logging
- undo ... removes changes by any uncommitted tx's
- redo ... repeats changes by any committed tx's
- undo/redo ... combines aspects of both
All approaches require:
- a sequential file of log records
- log records describe a change to a data item
- log records are written first
- actual changes to data are written later
Known as write-ahead logging (PostgreSQL uses WAL)
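For the recovery sketches that follow, one might represent these records along the following lines (an assumption for illustration, not PostgreSQL's WAL format):

/* one log record; kind selects which fields are meaningful */
typedef enum { LOG_START, LOG_COMMIT, LOG_ABORT, LOG_UPDATE } LogKind;

typedef struct {
    LogKind kind;
    int     tx;       /* transaction id T */
    int     item;     /* data item X (LOG_UPDATE only) */
    int     val;      /* old value v for undo logging; redo logging stores
                         the new value instead, undo/redo stores both */
} LogRec;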
COMP9315 24T1 ♢ Lectures Part H ♢ [74/92]
Simple form of logging which ensures atomicity.
Log file consists of a sequence of small records:
- <START T> ... transaction T begins
- <COMMIT T> ... transaction T completes successfully
- <ABORT T> ... transaction T fails (no changes)
- <T,X,v> ... transaction T changed the value of X from v
Notes:
- we refer to <T,X,v> generically as <UPDATE> log records
- an update log entry is created for each WRITE (not OUTPUT)
- an update log entry contains the old value (the new value is not recorded)
COMP9315 24T1 ♢ Lectures Part H ♢ [75/92]
Data must be written to disk in the following order:
- <START> transaction log record
- <UPDATE> log records indicating changes
- the changed data elements themselves
- <COMMIT> log record
Note: it is sufficient to have <T,X,v> output before X, for each X
COMP9315 24T1 ♢ Lectures Part H ♢ [76/92]
For the example transaction, we would get:
t Action v B(A) B(B) D(A) D(B) Log
--------------------------------------------------------
(0) BEGIN . . . 8 5 <START T>
(1) READ(A,v) 8 8 . 8 5
(2) v = v*2 16 8 . 8 5
(3) WRITE(A,v) 16 16 . 8 5 <T,A,8>
(4) READ(B,v) 5 16 5 8 5
(5) v = v+1 6 16 5 8 5
(6) WRITE(B,v) 6 16 6 8 5 <T,B,5>
(7) FlushLog
(8) StartCommit
(9) OUTPUT(A) 6 16 6 16 5
(10) OUTPUT(B) 6 16 6 16 6
(11) EndCommit <COMMIT T>
(12) FlushLog
Note that T is not regarded as committed until (12) completes.
COMP9315 24T1 ♢ Lectures Part H ♢ [77/92]
Simplified view of recovery using UNDO logging:
- scan backwards through log
  - if <COMMIT T>, mark T as committed
  - if <T,X,v> and T not committed, set X to v on disk
  - if <START T> and T not committed, add <ABORT T> to log
Assumes we scan entire log; use checkpoints to limit scan.
COMP9315 24T1 ♢ Lectures Part H ♢ [78/92]
Algorithmic view of recovery using UNDO logging:
committedTrans = abortedTrans = startedTrans = {}
for each log record from most recent to oldest {
   switch (log record) {
   <COMMIT T> : add T to committedTrans
   <ABORT T>  : add T to abortedTrans
   <START T>  : add T to startedTrans
   <T,X,v>    : if (T in committedTrans)
                   ignore
                else
                   { WRITE(X,v); OUTPUT(X) }
} }
for each T in startedTrans {
   if (T in committedTrans) ignore
   else if (T in abortedTrans) ignore
   else write <ABORT T> to log
}
flush log
COMP9315 24T1 ♢ Lectures Part H ♢ [79/92]
Simple view of recovery implies reading entire log file.
Since log file grows without bound, this is infeasible.
Eventually we can delete "old" section of log.
- i.e. where all prior transactions have completed
This point is called a checkpoint.
- all of log prior to checkpoint can be ignored for recovery
As described, we need to wait for all active tx's to complete
- known as quiescent checkpointing
- tx's taking a long time to complete block the system
COMP9315 24T1 ♢ Lectures Part H ♢ [80/92]
Problem: many concurrent/overlapping transactions.
How to know that all have finished?
- periodically, write log record <CHKPT (T1,..,Tk)>
  (contains references to all active transactions ⇒ active tx table)
- continue normal processing (e.g. new tx's can start)
- when all of T1,..,Tk have completed, write log record <END CHKPT> and flush log
Note: tx manager maintains chkpt and active tx information
These kinds of checkpoints are often written as <START CHKPT ...>
COMP9315 24T1 ♢ Lectures Part H ♢ [81/92]
Recovery: scan backwards through log file processing as before.
Determining where to stop depends on ...
- whether we meet <END CHKPT> or <CHKPT...> first
If we encounter <END CHKPT> first:
- we know that all incomplete tx's come after the previous <CHKPT...>
- thus, can stop the backward scan when we reach <CHKPT...>
If we encounter <CHKPT (T1,..,Tk)> first:
- the crash occurred during the checkpoint period
- any of T1,..,Tk that committed before the crash are ok
- for uncommitted tx's, need to continue the backward scan
COMP9315 24T1 ♢ Lectures Part H ♢ [82/92]
Problem with UNDO logging:
- all changed data must be output to disk before committing
- conflicts with optimal use of the buffer pool
Alternative approach is redo logging:
- allow changes to remain only in buffers after commit
- write records to indicate what changes are "pending"
- after a crash, can apply changes during recovery
COMP9315 24T1 ♢ Lectures Part H ♢ [83/92]
Requirement for redo logging: write-ahead rule.
Data must be written to disk as follows:
- write <START T> log record
- write <UPDATE...> log records indicating changes
- then write <COMMIT> log record (and flush log)
- then OUTPUT the changed data elements themselves
Note that update log records now contain <T,X,v'>, where v' is the new value for X.
COMP9315 24T1 ♢ Lectures Part H ♢ [84/92]
For the example transaction, we would get:
t Action v B(A) B(B) D(A) D(B) Log
--------------------------------------------------------
(0) BEGIN . . . 8 5 <START T>
(1) READ(A,v) 8 8 . 8 5
(2) v = v*2 16 8 . 8 5
(3) WRITE(A,v) 16 16 . 8 5 <T,A,16>
(4) READ(B,v) 5 16 5 8 5
(5) v = v+1 6 16 5 8 5
(6) WRITE(B,v) 6 16 6 8 5 <T,B,6>
(7) COMMIT <COMMIT T>
(8) FlushLog
(9) OUTPUT(A) 6 16 6 16 5
(10) OUTPUT(B) 6 16 6 16 6
Note that T is regarded as committed as soon as (8) completes.
COMP9315 24T1 ♢ Lectures Part H ♢ [85/92]
Simplified view of recovery using REDO logging:
- identify all committed tx's (backwards scan)
- scan forwards through log
  - if <T,X,v> and T is committed, set X to v on disk
  - if <START T> and T not committed, add <ABORT T> to log
Use checkpoints (like UNDO log checkpoints) to limit scan.
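Mirroring the algorithmic view given earlier for UNDO logging, a minimal runnable sketch of REDO recovery over an in-memory log (the record layout from the earlier LogRec sketch, and the array standing in for the disk, are assumptions):

#include <stdbool.h>
#include <stdio.h>

typedef enum { LOG_START, LOG_COMMIT, LOG_ABORT, LOG_UPDATE } LogKind;
typedef struct { LogKind kind; int tx; int item; int newVal; } LogRec;

#define NITEMS 4
static int disk[NITEMS];          /* simulated disk state */
static bool committed[16];        /* committed[T] after pass 1 */

static void redoRecover(LogRec log[], int n) {
    /* pass 1 (backwards): identify all committed tx's */
    for (int i = n - 1; i >= 0; i--)
        if (log[i].kind == LOG_COMMIT) committed[log[i].tx] = true;
    /* pass 2 (forwards): re-apply updates of committed tx's */
    for (int i = 0; i < n; i++)
        if (log[i].kind == LOG_UPDATE && committed[log[i].tx])
            disk[log[i].item] = log[i].newVal;   /* set X to v' on disk */
    /* a full version would also append <ABORT T> for uncommitted tx's */
}

int main(void) {
    disk[0] = 8; disk[1] = 5;
    LogRec log[] = {
        { LOG_START,  1, 0, 0 },
        { LOG_UPDATE, 1, 0, 16 },    /* <T1,A,16> */
        { LOG_UPDATE, 1, 1, 6 },     /* <T1,B,6>  */
        { LOG_COMMIT, 1, 0, 0 },
        { LOG_START,  2, 0, 0 },
        { LOG_UPDATE, 2, 0, 99 },    /* T2 never committed: not redone */
    };
    redoRecover(log, sizeof log / sizeof log[0]);
    printf("A=%d B=%d\n", disk[0], disk[1]);     /* A=16 B=6 */
    return 0;
}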
COMP9315 24T1 ♢ Lectures Part H ♢ [86/92]
UNDO logging and REDO logging are incompatible in
- the order of outputting <COMMIT T> and changed data
- how data in buffers is handled during checkpoints
Undo/Redo logging combines aspects of both
- requires a new kind of update log record: <T,X,v,v'> gives both the old and new values for X
- removes incompatibilities between output orders
As for previous cases, requires write-ahead of log records.
Undo/redo logging is common in practice (e.g. the ARIES algorithm).
COMP9315 24T1 ♢ Lectures Part H ♢ [87/92]
❖ Undo/Redo Logging (cont)
For the example transaction, we might get:
t Action v B(A) B(B) D(A) D(B) Log
--------------------------------------------------------
(0) BEGIN . . . 8 5 <START T>
(1) READ(A,v) 8 8 . 8 5
(2) v = v*2 16 8 . 8 5
(3) WRITE(A,v) 16 16 . 8 5 <T,A,8,16>
(4) READ(B,v) 5 16 5 8 5
(5) v = v+1 6 16 5 8 5
(6) WRITE(B,v) 6 16 6 8 5 <T,B,5,6>
(7) FlushLog
(8) StartCommit
(9) OUTPUT(A) 6 16 6 16 5
(10) <COMMIT T>
(11) OUTPUT(B) 6 16 6 16 6
Note that T is regarded as committed as soon as (10) completes.
COMP9315 24T1 ♢ Lectures Part H ♢ [88/92]
❖ Undo/Redo Logging (cont)
Simplified view of recovery using UNDO/REDO logging:
- scan log to determine committed/uncommitted tx's
- for each uncommitted tx T, add <ABORT T> to log
- scan backwards through log
  - if <T,X,v,w> and T is not committed, set X to v on disk
- scan forwards through log
  - if <T,X,v,w> and T is committed, set X to w on disk
Note: undo/redo logging requires dirty buffers to be flushed at <CHKPT...>
COMP9315 24T1 ♢ Lectures Part H ♢ [89/92]
❖ Undo/Redo Logging (cont)
The above description simplifies details of undo/redo logging.
ARIES is a complete algorithm for undo/redo logging.
Differences to what we have described:
- log records contain a sequence number (LSN)
- LSNs are used in tx and buffer managers, and stored in data pages
- an additional log record marks <END> (of commit or abort)
- <CHKPT> contains only a timestamp
- <END CHKPT..> contains tx and dirty-page info
COMP9315 24T1 ♢ Lectures Part H ♢ [90/92]
PostgreSQL uses write-ahead undo/redo style logging.
It also uses multi-version concurrency control, which
- tags each record with a tx and update timestamp
MVCC simplifies some aspects of undo/redo, e.g.
- some info required by logging is already held in each tuple
- no need to undo effects of aborted tx's; use older version
Recall: WAL entries in postgresql.conf
COMP9315 24T1 ♢ Lectures Part H ♢ [91/92]
❖ Recovery in PostgreSQL (cont)
Transaction/logging code is distributed throughout the backend.
Core transaction code is in src/backend/access/transam.
Transaction/logging data is written to files in PGDATA/pg_xlog
- a number of very large files containing log records
- old files are removed once all txs noted there are completed
- new files added when existing files reach their capacity (16MB)
- number of tx log files varies depending on tx activity
COMP9315 24T1 ♢ Lectures Part H ♢ [92/92]
Produced: 30 Apr 2024