Distributed Transaction Processing in the Escada Protocol

Size: px
Start display at page:

Download "Distributed Transaction Processing in the Escada Protocol"

Transcription

1 Distributed Transaction Processing in the Escada Protocol Alfrânio T. Correia Júnior Grupo de Sistemas Distribuídos Departamento de Informática Universidade do Minho Distributed Transaction Processing in theescada Protocol p.1/33

2 Scenario Dependable Databases are core components of our Information Society. Software based replication is an attractive solution to assure dependability. Problems arise when attempting to preserve the following: Strong consistency (1SR). Updates are carried through any replica. Distributed Transaction Processing in theescada Protocol p.2/33

3 Scenario (Problems) Traditional database replication protocols do not scale up well due to: The high number of messages exchanged among the replicas. The deadlock rate proportional to n 3, where n is the number of replicas, which is impractical. Distributed Transaction Processing in theescada Protocol p.3/33

4 Scenario (Earlier Solutions) Lazy Replication relaxes the consistency criteria. Master/Slave chooses the replica that can receive the updates. Distributed Transaction Processing in theescada Protocol p.4/33

5 Escada Approach Pattern on the DBSM Replication based on group communication Update-everywhere and deferred updates Atomic broadcast to propagate the transaction s processing data Total order is combined with a conflict detection process to assure 1SR. Distributed Transaction Processing in theescada Protocol p.5/33

6 Protocol Overview S1 S2 T1 02 T Local Execution 02 - Atomic Multicast ( RS, WS, WV ) 03 - Certification Termination Protocol Distributed Transaction Processing in theescada Protocol p.6/33

7 Protocol Overview A, B, C, D, E A, B, C, D, E S1 S2 T1 S3 A, B, C, D, E T1 = { A, B, C } T2 = { C, D, E } T2 Distributed Transaction Processing in theescada Protocol p.7/33

8 Escada Approach Target: Large scale distributed systems. Provides partial replication (Partial DBSM) Reduces resource consumption exploiting application s locality. Distributed Transaction Processing in theescada Protocol p.8/33

9 Motivation How the Escada can be augmented to provide partial replication? Distributed Transaction Processing in theescada Protocol p.9/33

10 Main Contributions Distributed Transaction Processing Semantic Caching Extensions to the PostgreSQL Evaluation Process (TPC-C and TPC-W) Distributed Transaction Processing in theescada Protocol p.10/33

11 Contribution Distributed Transaction Processing Distributed Transaction Processing in theescada Protocol p.11/33

12 Partial DBSM Execution: A site that handles a transaction may not be able to locally complete its execution. It is possible that no single site can do it. Termination Protocol: The distributed execution fragments the knowledge about conflicts. How to decide if a transaction can commit or not? Distributed Transaction Processing in theescada Protocol p.12/33

13 Execution Rewrites the queries mapping the original relations to the actual fragments. Sites are contacted according to the replicated fragments. Distributed Transaction Processing in theescada Protocol p.13/33

14 Termination Protocol The updated information is propagated to the pertinent replicas. What can we say about the read and write sets? They could be sent to all the replicas allowing a deterministic certification (non-voting). They could be sent just to the pertinent replicas requiring a voting certification. Distributed Transaction Processing in theescada Protocol p.14/33

15 Non-Voting A, B, C, D, E A, B, C, D, E S1 S2 T1 S3 A, B, C, D, E T1 = { A, B, C } T2 = { C, D, E } T2 Distributed Transaction Processing in theescada Protocol p.15/33

16 Voting A B S1 S2 T1 C,D,E S3 T1 = { A, B, C } T2 = { C, D, E } T2 Distributed Transaction Processing in theescada Protocol p.16/33

17 Experiments Latency Latency (ms) DBSM - WAN PDBSM - WAN PDBSMRAC - WAN DDE - WAN Ideal Locking Clients Distributed Transaction Processing in theescada Protocol p.17/33

18 Contribution Semantic Caching Distributed Transaction Processing in theescada Protocol p.18/33

19 Semantic Caching Aims at reducing communication among distributed replicas. Exploits the broadcast to build the refresh mechanisms. Reduces the management overhead when compared to tuple and page-based solutions. Roughly, it caches the result sets and identifies them based on predicates. Distributed Transaction Processing in theescada Protocol p.19/33

20 Semantic Caching The satisfiability background is behind this approach. Two predicates S and F are satisfiable if S F is not a contradiction. The main idea: "upon receiving a query is possible to use a previous request?". Distributed Transaction Processing in theescada Protocol p.20/33

21 Semantic Caching We use an important class of queries, called SPJ queries. However, other queries could be cached. Distributed Transaction Processing in theescada Protocol p.21/33

22 Experiments Time Elapsed (ms) Profits 5 Clients 10 Clients 15 Clients 20 Clients Cache Entries (x100) Distributed Transaction Processing in theescada Protocol p.22/33

23 Contribution PostgreSQL Distributed Transaction Processing in theescada Protocol p.23/33

24 PostgreSQL We identified that the 1SR correctness criteria suggested by the DBSM is not achievable. We present an informal algorithm about how to exactly extract the read sets. Finally, we show how to extend the "rule mechanisms" of the PostgreSQL. Distributed Transaction Processing in theescada Protocol p.24/33

25 Contribution Evaluation Process Distributed Transaction Processing in theescada Protocol p.25/33

26 Evaluation Process We use a simulation tool that allows the combination of real and simulated code. The replication protocols, our main goal, are real implementations. This approach permits us to focus on our goal while at the same allows: Variations on the network architecture, the workload, concurrency control policies. Distributed Transaction Processing in theescada Protocol p.26/33

27 Evaluation Process TPC-W mimics an Internet commerce application. TPC-C mimics a wholesale supplier (OLTP). Distributed Transaction Processing in theescada Protocol p.27/33

28 DBSM Latency DBSM & Centralized Latency (ms) DBSM - LAN 1 CPU 3 CPUs Clients Distributed Transaction Processing in theescada Protocol p.28/33

29 Partial DBSM Latency Latency (ms) DBSM - WAN PDBSM - WAN PDBSMRAC - WAN DDE - WAN Ideal Locking Clients Distributed Transaction Processing in theescada Protocol p.29/33

30 Partial DBSM Kbytes/s Network Bandwidth DBSM - WAN PDBSM - WAN Clients Distributed Transaction Processing in theescada Protocol p.30/33

31 Conclusion Database replication based on group communication is a feasible solution. The non-voting solution scales up better. The semantic caching reduces communication. Partial replication reduces resource consumption. Distributed Transaction Processing in theescada Protocol p.31/33

32 Future Work Improve our simulation tool: user friendly. Develop a version of our semantic caching to mobile elements. Revisit the Epsilon Serializability to Improve DBSM performance. Distributed Transaction Processing in theescada Protocol p.32/33

33 The End Unfortunately, I don t know how to the things that I need to do. However, as a good start point I certainly know how I must not. Sometimes, this knowledge avoids worthless efforts. Sometimes, we simply need to go into the wrong direction to choose another path. Distributed Transaction Processing in theescada Protocol p.33/33

WICE - A Pragmatic Protocol for Database Replication in Interconnected Clusters

WICE - A Pragmatic Protocol for Database Replication in Interconnected Clusters WICE - A Pragmatic Protocol for Database Replication in Interconnected Clusters Jon Grov 1 Luís Soares 2 Alfrânio Correia Jr. 2 José Pereira 2 Rui Oliveira 2 Fernando Pedone 3 1 University of Oslo, Norway

More information

Development and evaluation of database replication in ESCADA

Development and evaluation of database replication in ESCADA Development and evaluation of database replication in ESCADA A. Sousa L. Soares A. Correia Jr. F. Moura R. Oliveira Universidade do Minho 1 Introduction Software based replication is a highly competitive

More information

Databases. Laboratorio de sistemas distribuidos. Universidad Politécnica de Madrid (UPM)

Databases. Laboratorio de sistemas distribuidos. Universidad Politécnica de Madrid (UPM) Databases Laboratorio de sistemas distribuidos Universidad Politécnica de Madrid (UPM) http://lsd.ls.fi.upm.es/lsd/lsd.htm Nuevas tendencias en sistemas distribuidos 2 Summary Transactions. Isolation.

More information

Evaluating Certification Protocols in the Partial Database State Machine

Evaluating Certification Protocols in the Partial Database State Machine Evaluating Certification Protocols in the Partial Database State Machine A. Sousa, A. Correia Jr., F. Moura, J. Pereira, R. Oliveira Abstract Partial replication is an alluring technique to ensure the

More information

Chapter 11 - Data Replication Middleware

Chapter 11 - Data Replication Middleware Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 11 - Data Replication Middleware Motivation Replication: controlled

More information

CSE 530A ACID. Washington University Fall 2013

CSE 530A ACID. Washington University Fall 2013 CSE 530A ACID Washington University Fall 2013 Concurrency Enterprise-scale DBMSs are designed to host multiple databases and handle multiple concurrent connections Transactions are designed to enable Data

More information

Optimistic Preventive Replication in a Database Cluster

Optimistic Preventive Replication in a Database Cluster Optimistic Preventive Replication in a Database Cluster Cédric Coulon, Esther Pacitti, Patrick Valduriez INRIA and LINA, University of Nantes France {Cedric.Coulon, Esther.Pacitti}@lina.univ-nantes.fr,

More information

Distributed Databases

Distributed Databases Distributed Databases Chapter 21, Part B Database Management Systems, 2 nd Edition. R. Ramakrishnan and Johannes Gehrke 1 Introduction Data is stored at several sites, each managed by a DBMS that can run

More information

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Data Modeling and Databases Ch 14: Data Replication Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Database Replication What is database replication The advantages of

More information

Lazy Database Replication with Freshness Guarantees

Lazy Database Replication with Freshness Guarantees Lazy Database Replication with Freshness Guarantees Khuzaima Daudjee School of Computer Science University of Waterloo Waterloo, Ontario, Canada kdaudjee@uwaterloo.ca Kenneth Salem School of Computer Science

More information

Transaction Management using Causal Snapshot Isolation in Partially Replicated Databases. Technical Report

Transaction Management using Causal Snapshot Isolation in Partially Replicated Databases. Technical Report Transaction Management using Causal Snapshot Isolation in Partially Replicated Databases Technical Report Department of Computer Science and Engineering University of Minnesota 4-192 Keller Hall 200 Union

More information

Distributed Databases

Distributed Databases Distributed Databases Chapter 22, Part B Database Management Systems, 2 nd Edition. R. Ramakrishnan and Johannes Gehrke 1 Introduction Data is stored at several sites, each managed by a DBMS that can run

More information

Group-based Replication of On-line Transaction Processing Servers

Group-based Replication of On-line Transaction Processing Servers Group-based Replication of On-line Transaction Processing Servers A. Correia Jr. A. Sousa L. Soares J. Pereira F. Moura R. Oliveira Computer Science and Technology Center Computer Science Department University

More information

Transaction Management using Causal Snapshot Isolation in Partially Replicated Databases

Transaction Management using Causal Snapshot Isolation in Partially Replicated Databases Transaction Management using Causal Snapshot Isolation in Partially Replicated Databases ABSTRACT We present here a transaction management protocol using snapshot isolation in partially replicated multi-version

More information

Correctness Criteria Beyond Serializability

Correctness Criteria Beyond Serializability Correctness Criteria Beyond Serializability Mourad Ouzzani Cyber Center, Purdue University http://www.cs.purdue.edu/homes/mourad/ Brahim Medjahed Department of Computer & Information Science, The University

More information

Replication in Distributed Systems

Replication in Distributed Systems Replication in Distributed Systems Replication Basics Multiple copies of data kept in different nodes A set of replicas holding copies of a data Nodes can be physically very close or distributed all over

More information

Database Replication in Tashkent. CSEP 545 Transaction Processing Sameh Elnikety

Database Replication in Tashkent. CSEP 545 Transaction Processing Sameh Elnikety Database Replication in Tashkent CSEP 545 Transaction Processing Sameh Elnikety Replication for Performance Expensive Limited scalability DB Replication is Challenging Single database system Large, persistent

More information

BRANCH:IT FINAL YEAR SEVENTH SEM SUBJECT: MOBILE COMPUTING UNIT-IV: MOBILE DATA MANAGEMENT

BRANCH:IT FINAL YEAR SEVENTH SEM SUBJECT: MOBILE COMPUTING UNIT-IV: MOBILE DATA MANAGEMENT - 1 Mobile Data Management: Mobile Transactions - Reporting and Co Transactions Kangaroo Transaction Model - Clustering Model Isolation only transaction 2 Tier Transaction Model Semantic based nomadic

More information

Database Management Systems

Database Management Systems Database Management Systems Distributed Databases Doug Shook What does it mean to be distributed? Multiple nodes connected by a network Data on the nodes is logically related The nodes do not need to be

More information

On the Study of Different Approaches to Database Replication in the Cloud

On the Study of Different Approaches to Database Replication in the Cloud Melissa Santana et al.: On the Study of Different Approaches to Database Replication in the Cloud TR ITI-SIDI-2012/011 On the Study of Different Approaches to Database Replication in the Cloud Melissa

More information

Distributed Databases

Distributed Databases Distributed Databases Chapter 22.6-22.14 Comp 521 Files and Databases Spring 2010 1 Final Exam When Monday, May 3, at 4pm Where, here FB007 What Open book, open notes, no computer 48-50 multiple choice

More information

Basic vs. Reliable Multicast

Basic vs. Reliable Multicast Basic vs. Reliable Multicast Basic multicast does not consider process crashes. Reliable multicast does. So far, we considered the basic versions of ordered multicasts. What about the reliable versions?

More information

Middleware based Data Replication providing Snapshot Isolation

Middleware based Data Replication providing Snapshot Isolation Middleware based Data Replication providing Snapshot Isolation Yi Lin McGill Univ. Montreal ylin30@cs.mcgill.ca Bettina Kemme McGill Univ. Montreal kemme@cs.mcgill.ca Marta Patiño-Martínez Univ. Politecnica

More information

Database Architectures

Database Architectures Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 11/15/12 Agenda Check-in Centralized and Client-Server Models Parallelism Distributed Databases Homework 6 Check-in

More information

CSE 190D Database System Implementation

CSE 190D Database System Implementation CSE 190D Database System Implementation Arun Kumar Topic 6: Transaction Management Chapter 16 of Cow Book Slide ACKs: Jignesh Patel 1 Transaction Management Motivation and Basics The ACID Properties Transaction

More information

SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING

SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING SCHISM: A WORKLOAD-DRIVEN APPROACH TO DATABASE REPLICATION AND PARTITIONING ZEYNEP KORKMAZ CS742 - PARALLEL AND DISTRIBUTED DATABASE SYSTEMS UNIVERSITY OF WATERLOO OUTLINE. Background 2. What is Schism?

More information

NFSv4 as the Building Block for Fault Tolerant Applications

NFSv4 as the Building Block for Fault Tolerant Applications NFSv4 as the Building Block for Fault Tolerant Applications Alexandros Batsakis Overview Goal: To provide support for recoverability and application fault tolerance through the NFSv4 file system Motivation:

More information

Curriculum 2013 Knowledge Units Pertaining to PDC

Curriculum 2013 Knowledge Units Pertaining to PDC Curriculum 2013 Knowledge Units Pertaining to C KA KU Tier Level NumC Learning Outcome Assembly level machine Describe how an instruction is executed in a classical von Neumann machine, with organization

More information

Consistency in Distributed Systems

Consistency in Distributed Systems Consistency in Distributed Systems Recall the fundamental DS properties DS may be large in scale and widely distributed 1. concurrent execution of components 2. independent failure modes 3. transmission

More information

CONSISTENCY IN DISTRIBUTED SYSTEMS

CONSISTENCY IN DISTRIBUTED SYSTEMS CONSISTENCY IN DISTRIBUTED SYSTEMS 35 Introduction to Consistency Need replication in DS to enhance reliability/performance. In Multicasting Example, major problems to keep replicas consistent. Must ensure

More information

Concurrency Control In Distributed Main Memory Database Systems. Justin A. DeBrabant

Concurrency Control In Distributed Main Memory Database Systems. Justin A. DeBrabant In Distributed Main Memory Database Systems Justin A. DeBrabant debrabant@cs.brown.edu Concurrency control Goal: maintain consistent state of data ensure query results are correct The Gold Standard: ACID

More information

Cache Coherence in Distributed and Replicated Transactional Memory Systems. Technical Report RT/4/2009

Cache Coherence in Distributed and Replicated Transactional Memory Systems. Technical Report RT/4/2009 Technical Report RT/4/2009 Cache Coherence in Distributed and Replicated Transactional Memory Systems Maria Couceiro INESC-ID/IST maria.couceiro@ist.utl.pt Luis Rodrigues INESC-ID/IST ler@ist.utl.pt Jan

More information

Course Content. Parallel & Distributed Databases. Objectives of Lecture 12 Parallel and Distributed Databases

Course Content. Parallel & Distributed Databases. Objectives of Lecture 12 Parallel and Distributed Databases Database Management Systems Winter 2003 CMPUT 391: Dr. Osmar R. Zaïane University of Alberta Chapter 22 of Textbook Course Content Introduction Database Design Theory Query Processing and Optimisation

More information

Important Lessons. Today's Lecture. Two Views of Distributed Systems

Important Lessons. Today's Lecture. Two Views of Distributed Systems Important Lessons Replication good for performance/ reliability Key challenge keeping replicas up-to-date Wide range of consistency models Will see more next lecture Range of correctness properties L-10

More information

Chapter 20: Database System Architectures

Chapter 20: Database System Architectures Chapter 20: Database System Architectures Chapter 20: Database System Architectures Centralized and Client-Server Systems Server System Architectures Parallel Systems Distributed Systems Network Types

More information

Revisiting 1-Copy Equivalence in Clustered Databases

Revisiting 1-Copy Equivalence in Clustered Databases Revisiting 1-Copy Equivalence in Clustered Databases Rui Oliveira rco@di.uminho.pt José Pereira jop@di.uminho.pt Afrânio Correia Jr. acj@di.uminho.pt Edward Archibald Emic Networks ed@emicnetworks.com

More information

DISTRIBUTED COMPUTER SYSTEMS

DISTRIBUTED COMPUTER SYSTEMS DISTRIBUTED COMPUTER SYSTEMS CONSISTENCY AND REPLICATION CONSISTENCY MODELS Dr. Jack Lange Computer Science Department University of Pittsburgh Fall 2015 Consistency Models Background Replication Motivation

More information

Advanced Databases Lecture 17- Distributed Databases (continued)

Advanced Databases Lecture 17- Distributed Databases (continued) Advanced Databases Lecture 17- Distributed Databases (continued) Masood Niazi Torshiz Islamic Azad University- Mashhad Branch www.mniazi.ir Alternative Models of Transaction Processing Notion of a single

More information

Low Overhead Concurrency Control for Partitioned Main Memory Databases

Low Overhead Concurrency Control for Partitioned Main Memory Databases Low Overhead Concurrency Control for Partitioned Main Memory Databases Evan Jones, Daniel Abadi, Samuel Madden, June 2010, SIGMOD CS 848 May, 2016 Michael Abebe Background Motivations Database partitioning

More information

Transaction Management Exercises KEY

Transaction Management Exercises KEY Transaction Management Exercises KEY I/O and CPU activities can be and are overlapped to minimize (disk and processor) idle time and to maximize throughput (units of work per time unit). This motivates

More information

It also performs many parallelization operations like, data loading and query processing.

It also performs many parallelization operations like, data loading and query processing. Introduction to Parallel Databases Companies need to handle huge amount of data with high data transfer rate. The client server and centralized system is not much efficient. The need to improve the efficiency

More information

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 14 Distributed Transactions

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 14 Distributed Transactions CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 14 Distributed Transactions Transactions Main issues: Concurrency control Recovery from failures 2 Distributed Transactions

More information

Distributed Shared Memory

Distributed Shared Memory Distributed Shared Memory History, fundamentals and a few examples Coming up The Purpose of DSM Research Distributed Shared Memory Models Distributed Shared Memory Timeline Three example DSM Systems The

More information

Information Systems (Informationssysteme)

Information Systems (Informationssysteme) Information Systems (Informationssysteme) Jens Teubner, TU Dortmund jens.teubner@cs.tu-dortmund.de Summer 2016 c Jens Teubner Information Systems Summer 2016 1 Part VIII Transaction Management c Jens Teubner

More information

Chapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems!

Chapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Chapter 18: Database System Architectures! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems! Network Types 18.1 Centralized Systems! Run on a single computer system and

More information

Access Path Selection in Main-Memory Optimized Data Systems

Access Path Selection in Main-Memory Optimized Data Systems Access Path Selection in Main-Memory Optimized Data Systems Should I Scan or Should I Probe? Manos Athanassoulis Harvard University Talk at CS265, February 16 th, 2018 1 Access Path Selection SELECT x

More information

Low Overhead Concurrency Control for Partitioned Main Memory Databases. Evan P. C. Jones Daniel J. Abadi Samuel Madden"

Low Overhead Concurrency Control for Partitioned Main Memory Databases. Evan P. C. Jones Daniel J. Abadi Samuel Madden Low Overhead Concurrency Control for Partitioned Main Memory Databases Evan P. C. Jones Daniel J. Abadi Samuel Madden" Banks" Payment Processing" Airline Reservations" E-Commerce" Web 2.0" Problem:" Millions

More information

Document downloaded from: This paper must be cited as:

Document downloaded from:  This paper must be cited as: Document downloaded from: http://hdl.handle.net/10251/35220 This paper must be cited as: Bernabe Gisbert, JM.; Muñoz Escoí, FD. (2012). Supporting multiple isolation levels in replicated environments.

More information

Correctness Criteria Beyond Serializability

Correctness Criteria Beyond Serializability Comp. by: CsenthilkumaranGalleys0000875825 Date:3/11/08 Time:13:18:10 Stage:First Proof C Correctness Criteria Beyond Serializability MOURAD OUZZANI 1,BRAHIM MEDJAHED 2,AHMED Au1 K. ELMAGARMID 3 1 Cyber

More information

The Replication Technology in E-learning Systems

The Replication Technology in E-learning Systems Available online at www.sciencedirect.com Procedia - Social and Behavioral Sciences 28 (2011) 231 235 WCETR 2011 The Replication Technology in E-learning Systems Iacob (Ciobanu) Nicoleta Magdalena a *

More information

Postgres Plus and JBoss

Postgres Plus and JBoss Postgres Plus and JBoss A New Division of Labor for New Enterprise Applications An EnterpriseDB White Paper for DBAs, Application Developers, and Enterprise Architects October 2008 Postgres Plus and JBoss:

More information

Transaction Management: Concurrency Control

Transaction Management: Concurrency Control Transaction Management: Concurrency Control Yanlei Diao Slides Courtesy of R. Ramakrishnan and J. Gehrke DBMS Architecture Query Parser Query Rewriter Query Optimizer Query Executor Lock Manager Concurrency

More information

Consistency & Replication

Consistency & Replication Objectives Consistency & Replication Instructor: Dr. Tongping Liu To understand replication and related issues in distributed systems" To learn about how to keep multiple replicas consistent with each

More information

Database Architectures

Database Architectures Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL

More information

Column Stores vs. Row Stores How Different Are They Really?

Column Stores vs. Row Stores How Different Are They Really? Column Stores vs. Row Stores How Different Are They Really? Daniel J. Abadi (Yale) Samuel R. Madden (MIT) Nabil Hachem (AvantGarde) Presented By : Kanika Nagpal OUTLINE Introduction Motivation Background

More information

Relaxed Memory-Consistency Models

Relaxed Memory-Consistency Models Relaxed Memory-Consistency Models [ 9.1] In Lecture 13, we saw a number of relaxed memoryconsistency models. In this lecture, we will cover some of them in more detail. Why isn t sequential consistency

More information

Using Optimistic Atomic Broadcast in Transaction Processing Systems

Using Optimistic Atomic Broadcast in Transaction Processing Systems Using Optimistic Atomic Broadcast in Transaction Processing Systems Bettina Kemme Fernando Pedone Gustavo Alonso André Schiper Matthias Wiesmann School of Computer Science McGill University Montreal, Canada,

More information

The GlobData Fault-Tolerant Replicated Distributed Object Database

The GlobData Fault-Tolerant Replicated Distributed Object Database The GlobData Fault-Tolerant Replicated Distributed Object Database Luís Rodrigues Hugo Miranda Ricardo Almeida João Martins Pedro Vicente Universidade de Lisboa Faculdade de Ciências Departamento de Informática

More information

Testing the Dependability and Performance of Group Communication Based Database Replication Protocols

Testing the Dependability and Performance of Group Communication Based Database Replication Protocols Testing the Dependability and Performance of Group Communication Based Database Replication Protocols A. Sousa J. Pereira L. Soares A. Correia Jr. L. Rocha R. Oliveira F. Moura University of Minho Abstract

More information

Database Replication: A Tutorial

Database Replication: A Tutorial Chapter 12 Database Replication: A Tutorial Bettina Kemme, Ricardo Jiménez-Peris, Marta Patiño-Martínez, and Gustavo Alonso Abstract This chapter provides an in-depth introduction to database replication,

More information

Distributed Transaction Management

Distributed Transaction Management Distributed Transaction Management Material from: Principles of Distributed Database Systems Özsu, M. Tamer, Valduriez, Patrick, 3rd ed. 2011 + Presented by C. Roncancio Distributed DBMS M. T. Özsu & P.

More information

ATOMIC COMMITMENT Or: How to Implement Distributed Transactions in Sharded Databases

ATOMIC COMMITMENT Or: How to Implement Distributed Transactions in Sharded Databases ATOMIC COMMITMENT Or: How to Implement Distributed Transactions in Sharded Databases We talked about transactions and how to implement them in a single-node database. We ll now start looking into how to

More information

Distributed Database Systems

Distributed Database Systems Distributed Database Systems Vera Goebel Department of Informatics University of Oslo Fall 2013 1 Contents Review: Layered DBMS Architecture Distributed DBMS Architectures DDBMS Taxonomy Client/Server

More information

Chapter 4: Distributed Systems: Replication and Consistency. Fall 2013 Jussi Kangasharju

Chapter 4: Distributed Systems: Replication and Consistency. Fall 2013 Jussi Kangasharju Chapter 4: Distributed Systems: Replication and Consistency Fall 2013 Jussi Kangasharju Chapter Outline n Replication n Consistency models n Distribution protocols n Consistency protocols 2 Data Replication

More information

Distributed Database Systems

Distributed Database Systems Distributed Database Systems Vera Goebel Department of Informatics University of Oslo Fall 2016 1 Contents Review: Layered DBMS Architecture Distributed DBMS Architectures DDBMS Taxonomy Client/Server

More information

Consistency and Replication 1/65

Consistency and Replication 1/65 Consistency and Replication 1/65 Replicas and Consistency??? Tatiana Maslany in the show Orphan Black: The story of a group of clones that discover each other and the secret organization Dyad, which was

More information

Replication and Consistency. Fall 2010 Jussi Kangasharju

Replication and Consistency. Fall 2010 Jussi Kangasharju Replication and Consistency Fall 2010 Jussi Kangasharju Chapter Outline Replication Consistency models Distribution protocols Consistency protocols 2 Data Replication user B user C user A object object

More information

On Transaction Liveness in Replicated Databases

On Transaction Liveness in Replicated Databases On Transaction Liveness in Replicated Databases Fernando Pedone Rachid Guerraoui Ecole Polytechnique Fédérale de Lausanne Département d Informatique CH-1015, Switzerland Abstract This paper makes a first

More information

CSE 344 Final Review. August 16 th

CSE 344 Final Review. August 16 th CSE 344 Final Review August 16 th Final In class on Friday One sheet of notes, front and back cost formulas also provided Practice exam on web site Good luck! Primary Topics Parallel DBs parallel join

More information

A Concurrency Control for Transactional Mobile Agents

A Concurrency Control for Transactional Mobile Agents A Concurrency Control for Transactional Mobile Agents Jeong-Joon Yoo and Dong-Ik Lee Department of Information and Communications, Kwang-Ju Institute of Science and Technology (K-JIST) Puk-Gu Oryong-Dong

More information

Lecture 23 Database System Architectures

Lecture 23 Database System Architectures CMSC 461, Database Management Systems Spring 2018 Lecture 23 Database System Architectures These slides are based on Database System Concepts 6 th edition book (whereas some quotes and figures are used

More information

Goal of Concurrency Control. Concurrency Control. Example. Solution 1. Solution 2. Solution 3

Goal of Concurrency Control. Concurrency Control. Example. Solution 1. Solution 2. Solution 3 Goal of Concurrency Control Concurrency Control Transactions should be executed so that it is as though they executed in some serial order Also called Isolation or Serializability Weaker variants also

More information

Distributed Data Analytics Transactions

Distributed Data Analytics Transactions Distributed Data Analytics G-3.1.09, Campus III Hasso Plattner Institut must ensure that interactions succeed consistently An OLTP Topic Motivation Most database interactions consist of multiple, coherent

More information

Cost Reduction of Replicated Data in Distributed Database System

Cost Reduction of Replicated Data in Distributed Database System Cost Reduction of Replicated Data in Distributed Database System 1 Divya Bhaskar, 2 Meenu Department of computer science and engineering Madan Mohan Malviya University of Technology Gorakhpur 273010, India

More information

CPS 512 midterm exam #1, 10/7/2016

CPS 512 midterm exam #1, 10/7/2016 CPS 512 midterm exam #1, 10/7/2016 Your name please: NetID: Answer all questions. Please attempt to confine your answers to the boxes provided. If you don t know the answer to a question, then just say

More information

X X C 1. Recap. CSE 486/586 Distributed Systems Gossiping. Eager vs. Lazy Replication. Recall: Passive Replication. Fault-Tolerance and Scalability

X X C 1. Recap. CSE 486/586 Distributed Systems Gossiping. Eager vs. Lazy Replication. Recall: Passive Replication. Fault-Tolerance and Scalability Recap Distributed Systems Gossiping Steve Ko Computer Sciences and Engineering University at Buffalo Consistency models Linearizability Sequential consistency Causal consistency Eventual consistency Depending

More information

Migrating Oracle Databases To Cassandra

Migrating Oracle Databases To Cassandra BY UMAIR MANSOOB Why Cassandra Lower Cost of ownership makes it #1 choice for Big Data OLTP Applications. Unlike Oracle, Cassandra can store structured, semi-structured, and unstructured data. Cassandra

More information

Replica Placement. Replica Placement

Replica Placement. Replica Placement Replica Placement Model: We consider objects (and don t worry whether they contain just data or code, or both) Distinguish different processes: A process is capable of hosting a replica of an object or

More information

Consistency and Replication 1/62

Consistency and Replication 1/62 Consistency and Replication 1/62 Replicas and Consistency??? Tatiana Maslany in the show Orphan Black: The story of a group of clones that discover each other and the secret organization Dyad, which was

More information

Distributed Systems LEEC (2006/07 2º Sem.)

Distributed Systems LEEC (2006/07 2º Sem.) Distributed Systems LEEC (2006/07 2º Sem.) Introduction João Paulo Carvalho Universidade Técnica de Lisboa / Instituto Superior Técnico Outline Definition of a Distributed System Goals Connecting Users

More information

! Design constraints. " Component failures are the norm. " Files are huge by traditional standards. ! POSIX-like

! Design constraints.  Component failures are the norm.  Files are huge by traditional standards. ! POSIX-like Cloud background Google File System! Warehouse scale systems " 10K-100K nodes " 50MW (1 MW = 1,000 houses) " Power efficient! Located near cheap power! Passive cooling! Power Usage Effectiveness = Total

More information

L i (A) = transaction T i acquires lock for element A. U i (A) = transaction T i releases lock for element A

L i (A) = transaction T i acquires lock for element A. U i (A) = transaction T i releases lock for element A Lock-Based Scheduler Introduction to Data Management CSE 344 Lecture 20: Transactions Simple idea: Each element has a unique lock Each transaction must first acquire the lock before reading/writing that

More information

Databases - Transactions

Databases - Transactions Databases - Transactions Gordon Royle School of Mathematics & Statistics University of Western Australia Gordon Royle (UWA) Transactions 1 / 34 ACID ACID is the one acronym universally associated with

More information

Enhancing Edge Computing with Database Replication

Enhancing Edge Computing with Database Replication Enhancing Edge Computing with Database Replication Yi Lin Bettina Kemme Marta Patiño-Martínez + Ricardo Jiménez-Peris + McGill University, School of Computer Science, Canada + Facultad de Informatica,

More information

Distributed Systems. Day 13: Distributed Transaction. To Be or Not to Be Distributed.. Transactions

Distributed Systems. Day 13: Distributed Transaction. To Be or Not to Be Distributed.. Transactions Distributed Systems Day 13: Distributed Transaction To Be or Not to Be Distributed.. Transactions Summary Background on Transactions ACID Semantics Distribute Transactions Terminology: Transaction manager,,

More information

Distributed Data Management

Distributed Data Management Lecture Data Management Chapter 1: Erik Buchmann buchmann@ipd.uka.de IPD, Forschungsbereich Systeme der Informationsverwaltung of this Chapter are databases and distributed data management not completely

More information

Google File System. Arun Sundaram Operating Systems

Google File System. Arun Sundaram Operating Systems Arun Sundaram Operating Systems 1 Assumptions GFS built with commodity hardware GFS stores a modest number of large files A few million files, each typically 100MB or larger (Multi-GB files are common)

More information

Introduction to Database Replication

Introduction to Database Replication Introduction to Database Replication What is database replication The advantages of database replication A taxonomy of replication strategies: Synchronous Asynchronous Update everywhere Primary copy Discussion

More information

Computing Parable. The Archery Teacher. Courtesy: S. Keshav, U. Waterloo. Computer Science. Lecture 16, page 1

Computing Parable. The Archery Teacher. Courtesy: S. Keshav, U. Waterloo. Computer Science. Lecture 16, page 1 Computing Parable The Archery Teacher Courtesy: S. Keshav, U. Waterloo Lecture 16, page 1 Consistency and Replication Today: Consistency models Data-centric consistency models Client-centric consistency

More information

CS352 Lecture - The Transaction Concept

CS352 Lecture - The Transaction Concept CS352 Lecture - The Transaction Concept Last Revised 11/7/06 Objectives: 1. To introduce the notion of a transaction and the ACID properties of a transaction 2. To introduce the notion of the state of

More information

Distributed System. Gang Wu. Spring,2018

Distributed System. Gang Wu. Spring,2018 Distributed System Gang Wu Spring,2018 Lecture4:Failure& Fault-tolerant Failure is the defining difference between distributed and local programming, so you have to design distributed systems with the

More information

STEPS Towards Cache-Resident Transaction Processing

STEPS Towards Cache-Resident Transaction Processing STEPS Towards Cache-Resident Transaction Processing Stavros Harizopoulos joint work with Anastassia Ailamaki VLDB 2004 Carnegie ellon CPI OLTP workloads on modern CPUs 6 4 2 L2-I stalls L2-D stalls L1-I

More information

Mutual consistency, what for? Replication. Data replication, consistency models & protocols. Difficulties. Execution Model.

Mutual consistency, what for? Replication. Data replication, consistency models & protocols. Difficulties. Execution Model. Replication Data replication, consistency models & protocols C. L. Roncancio - S. Drapeau Grenoble INP Ensimag / LIG - Obeo 1 Data and process Focus on data: several physical of one logical object What

More information

Technical Report TR Design and Evaluation of a Transaction Model with Multiple Consistency Levels for Replicated Data

Technical Report TR Design and Evaluation of a Transaction Model with Multiple Consistency Levels for Replicated Data Technical Report Department of Computer Science and Engineering University of Minnesota 4-192 Keller Hall 200 Union Street SE Minneapolis, MN 55455-0159 USA TR 15-009 Design and Evaluation of a Transaction

More information

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22

More information

Distributed Versioning: Consistent Replication for Scaling Back-end Databases of Dynamic Content Web Sites

Distributed Versioning: Consistent Replication for Scaling Back-end Databases of Dynamic Content Web Sites Distributed Versioning: Consistent Replication for Scaling Back-end Databases of Dynamic Content Web Sites Cristiana Amza, Alan L. Cox and Willy Zwaenepoel Department of Computer Science, Rice University,

More information

management systems Elena Baralis, Silvia Chiusano Politecnico di Torino Pag. 1 Distributed architectures Distributed Database Management Systems

management systems Elena Baralis, Silvia Chiusano Politecnico di Torino Pag. 1 Distributed architectures Distributed Database Management Systems atabase Management Systems istributed database istributed architectures atabase Management Systems istributed atabase Management Systems ata and computation are distributed over different machines ifferent

More information

Consistency and Replication. Why replicate?

Consistency and Replication. Why replicate? Consistency and Replication Today: Consistency models Data-centric consistency models Client-centric consistency models Lecture 15, page 1 Why replicate? Data replication versus compute replication Data

More information

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15 Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2014/15 Lecture X: Parallel Databases Topics Motivation and Goals Architectures Data placement Query processing Load balancing

More information

Tradeoff Evaluation. Comparison between CB-R and CB-A as they only differ for this aspect.

Tradeoff Evaluation. Comparison between CB-R and CB-A as they only differ for this aspect. Tradeoff Evaluation Comparison between C2PL and CB-A, as both: Allow intertransaction caching Don t use propagation Synchronously activate consistency actions Tradeoff Evaluation Comparison between CB-R

More information