Some Models of a Distributed Database Management System with Data Replication

Similar documents
SIMULATION STUDIES OF THE IMPLEMENTATION OF CENTRALIZED TWO-PHASE LOCKING IN DDBMS

SOME GPSS OPPORTUNITIES FOR MODELING OF TIMESTAMP ORDERING IN DDBMS AND SIMULATION INVESTIGATIONS

Recoverable Timestamping Approach For Concurrency Control In Distributed Database

Concurrency Control in Distributed Database System

REALIZATION OF OPEN ADDRESSING HASH TABLE IN THE CHAINED ALLOCATED MEMORY. Valentina Dyankova and Rositza Hristova

Transaction Processing: Concurrency Control ACID. Transaction in SQL. CPS 216 Advanced Database Systems. (Implicit beginning of transaction)

Analysis of Replication Control Protocols

Chapter 9: Concurrency Control

A New Binary Vote Assignment on Grid Algorithm to Manage Replication in Distributed Database Environment

Performance of Distributed Optimistic Concurrency Control in Real-Time databases

Consistency in Distributed Systems

Distributed Transaction Management

Distributed Database Systems

A Prudent-Precedence Concurrency Control Protocol for High Data Contention Main Memory Databases

Distributed Database Systems

Transaction Management: Concurrency Control

Performance Modeling of an Enhanced Optimistic Locking Architecture for Concurrency Control in a Distributed Database System

International Journal of Advanced Research in Computer Science and Software Engineering

CIRS: A State-Conscious Concurrency Control Protocol for Replicated Real-Time Databases

Low Overhead Concurrency Control for Partitioned Main Memory Databases

Correctness Criteria Beyond Serializability

Transactions with Replicated Data. Distributed Software Systems

Three Read Priority Locking for Concurrency Control in Distributed Databases

A Read-Write-Validate Approach to Optimistic Concurrency Control for Energy Efficiency of Resource-Constrained Systems

Cover sheet for Assignment 3

ISSN: Monica Gahlyan et al, International Journal of Computer Science & Communication Networks,Vol 3(3),

The Replication Technology in E-learning Systems

Optimistic Concurrency Control based on Cache Coherency in Distributed Database Systems

Queen s University Faculty of Arts and Science School of Computing CISC 432* / 836* Advanced Database Systems

Code. BIS43 Lessons per week: L 2 hours; Labs 2 hour

An Aggressive Concurrency Control Protocol for Main Memory Databases

Teaching Scheme Business Information Technology/Software Engineering Management Advanced Databases

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Introducing Network Delays in a Distributed Real- Time Transaction Processing System

PETRI NET MODELLING OF CONCURRENCY CONTROL IN DISTRIBUTED DATABASE SYSTEM

Implementation and Evaluations of Recovery Method for Batch Update

Distributed Databases Systems

CS54200: Distributed Database Systems

Goals for Today. CS 133: Databases. Final Exam: Logistics. Why Use a DBMS? Brief overview of course. Course evaluations

An Optimal Locking Scheme in Object-Oriented Database Systems

Data analysis and design Unit number: 23 Level: 5 Credit value: 15 Guided learning hours: 60 Unit reference number: H/601/1991.

Integrity in Distributed Databases

An Extended Three Phase Commit Protocol for Concurrency Control in Distributed Systems

Transaction Processing: Concurrency Control. Announcements (April 26) Transactions. CPS 216 Advanced Database Systems

STUDY OF THE DEVELOPMENT OF THE STRUCTURE OF THE NETWORK OF SOFIA SUBWAY

Analysis of Transaction and Concurrency Mechanism in Two Way Waiting Algorithm for different Databases

Discovering the Association Rules in OLAP Data Cube with Daily Downloads of Folklore Materials *

Concurrency Control. Transaction Management. Lost Update Problem. Need for Concurrency Control. Concurrency control

Concurrency Control In Distributed Main Memory Database Systems. Justin A. DeBrabant

Real-time Optimistic Concurrency Control based on Transaction Finish Degree

CS Reading Packet: "Transaction management, part 2"

230 Chapter 17. (c) Otherwise, T writes O and WTS(O) is set to TS(T).

T ransaction Management 4/23/2018 1

Transaction Processing In Replicated Data in the DDBMS

Chapter 11 - Data Replication Middleware

Topics in Reliable Distributed Systems

RESEARCH OF PROCESSES OF IDENTIFICATION, AUTHENTICATION AND AUTHORIZATION

Design and Implementation of a Two-Phase Commit Protocol Simulator

Advances in Data Management Distributed and Heterogeneous Databases - 2

Specific Objectives Contents Teaching Hours 4 the basic concepts 1.1 Concepts of Relational Databases

Cost Reduction of Replicated Data in Distributed Database System

Mobile and Heterogeneous databases

Hybrid Concurrency Control Method in Firm Real-Time Databases

THE MATHEMATICAL MODEL OF AN OPERATOR IN A HUMAN MACHINE SYSTEMS. PROBLEMS AND SOLUTIONS

Transactional Agents for Pervasive Computing

Control. CS432: Distributed Systems Spring 2017

A Review on Fragmentation Techniques in Distributed Database

ON TWO ADAPTIVE SYSTEMS FOR DOCUMENT MANAGEMENT * Vanyo G. Peychev, Ivo I. Damyanov

GUJARAT TECHNOLOGICAL UNIVERSITY

Commit Protocols and their Issues in Distributed Databases

ON NEW STRATEGY FOR PRIORITISING THE SELECTED FLOW IN QUEUING SYSTEM

PROCEEDINGS. of the Union of Scientists - Ruse. Book 5 Mathematics, Informatics and Physics. Volume 11, 2014 RUSE

Mobile and Heterogeneous databases Distributed Database System Transaction Management. A.R. Hurson Computer Science Missouri Science & Technology

Correctness Criteria Beyond Serializability

Priority assignment in real-time active databases 1

10. Replication. CSEP 545 Transaction Processing Philip A. Bernstein. Copyright 2003 Philip A. Bernstein. Outline

Distributed Database Management System UNIT-2. Concurrency Control. Transaction ACID rules. MCA 325, Distributed DBMS And Object Oriented Databases

Concurrency Control And Recovery In Database Systems By Philip Bernstein;Vassos Hadzilacos;Nathan Goodman READ ONLINE

Data Replication Model For Remote Procedure Call Transactions

Transaction Management. Pearson Education Limited 1995, 2005

IMPORTANT: Circle the last two letters of your class account:

Mobile Agent Model for Transaction Processing in Distributed Database Systems

Application of PHP and MySQL for search and retrieval Web services in Web information systems

Curriculum 2013 Knowledge Units Pertaining to PDC

Concurrency Control 9-1

Concurrency Control. Chapter 17. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1

«, 68, 55, 23. (, -, ).,,.,,. (workcamps).,. :.. 2

SERVICE COMPOSITIONS IN PROBLEMS OF URBAN PLANNING Roman K. Fedorov (1), Alexander S. Shumilov (2)

Goal of Concurrency Control. Concurrency Control. Example. Solution 1. Solution 2. Solution 3

Multi-User-Synchronization

Proposal for Business Transaction Protocol Version 1.0

Transactions. Transaction. Execution of a user program in a DBMS.

DISTRIBUTED DATABASES

NON-PROFIT ORGANIZATION CHARITY FUND

Synchronization Part 2. REK s adaptation of Claypool s adaptation oftanenbaum s Distributed Systems Chapter 5 and Silberschatz Chapter 17

Efficient Priority Assignment Policies for Distributed Real-Time Database Systems

An Evaluation of Distributed Concurrency Control. Harding, Aken, Pavlo and Stonebraker Presented by: Thamir Qadah For CS590-BDS

Phantom Problem. Phantom Problem. Phantom Problem. Phantom Problem R1(X1),R1(X2),W2(X3),R1(X1),R1(X2),R1(X3) R1(X1),R1(X2),W2(X3),R1(X1),R1(X2),R1(X3)

A Novel Method for the Comparison of Graphical Data Models

Impact of Mobility on Concurrent Transactions Mixture

Transcription:

Some Models of a Distributed Database Management System with Data Replication Svetlana Vasileva, Petar Milev, Borislav Stoyanov Abstract: The article suggests a formal model of Distributed Database Management Systems (DDBMS) and an approach for Distributed Database Systems (DDB) modeling. It is suggested to use general purpose system simulation (GPSS) for simulation modeling. Queuing systems models modelling the execution of two-phase locking (2PL) in DDB with Data replication are suggested: centralized 2PL, primary copy 2PL, distributed 2PL and voting 2PL. Results from the simulation of the distributed 2PL are presented. Key words: Distributed Database, Transactions, Simulation Model, Concurrency Control algorithms, Data Replication INTRODUCTION One of the basic problems in Distributed Database Management Systems (DDBMS) is the concurrency control of concurrent executed transactions. There are 3 basic methods for transaction concurrency control (CC): Locking (two phase locking - 2PL), Timestamp ordering and Optimistic strategies (Certification approach and others) [3], [6] and [8]. Since in the last two methods in case of conflicts transactions rollback, and in locking the possible conflicts are preliminary excluded, this work suggests a model of DDBMS functioning according to the 2PL. The current investigation of distributed CC is focused on the four types of 2PL for Distributed Database Systems (DDB) with data replication: centralized 2PL, primary copy 2PL, distributed 2PL and majority copies 2PL (voting 2PL). These four protocols are described in [3], [8] and others, and the researched algorithms developed on their basis are described in details in [2], [10] and [11]. Detailed simulation models for the 2PL algorithms for DDB with Data replication are not found in [5], [6], [7], [9] and other resources we have studied. Almost all detailed algorithms for simulation of CC performance, which we have found model after Timestamp ordering protocols [9] or the protocols using optimistic strategies. This necessitates models of 2PL in DDB to be developed and investigated. 1. MODEL OF THE STRUCTURE OF THE DDBMS As a base of the formal model we use the one suggested in [7]. We give the schemes of the two basic models of DDBMS. The modules which participate in the transaction processing are depicted with circles and the necessary data with rectangles. Fig. 1 shows the model of the structure of DDBMS in which CC is fulfilled in a centralized 2PL. In order to be clear, we do not depict the rest of components of the site S 0 (TM 0, DM 0, TC 0, LDB 0 and SC 0 ), which contain the only Lock manager LM 0 and the only Lock table LT 0. Fig. 2 gives the model of the structure of the DDBMS, in which CC is executed on distributed 2PL. Unmentioned signs: S Transaction source, TC Transaction coordinator (it also includes the component for data communication), SC - System Catalogue. The scheme of the model structures in fig. 1 and fig. 2 are specific development of the same structure models for the two basic types 2PL in DDBMS with data replication: centralized 2PL and distributed 2PL. 2. BASIC PERFORMANCE SCHEMES OF 2PL IN DISTRIBUTED DATABASES Fig. 3 and fig. 4 demonstrate an example of a performance of 2PL in DDB with 5 nodes (S 1 S 5 ) for a global transaction T 2 n (updating data element x j which has copies in local databases (LDB) of S 1 and S 5 ) initiated by site S 2. The transactions use the Strict 2PL where data elements are locked before they are accessed and locks are released after the transactions commit or abort. - II.12-1 -

Figure 1 Distributed DBMS Model Structure in case of Centralized 2PL Figure 2 Distributed DBMS Model Structure in case of Distributed 2PL In centralized 2PL the Lock manager and the Lock table, in which the transactions mark the necessary data elements with the corresponding locks are placed on one (central) site only, in fig.3 S 0. In the primary copy 2PL certain centralization kept, because for the locking of a data element a certain site is responsible and Lock table is divided among the sites, therefore fig. 3 illustrates also a Primary copy 2PL in the example of global transaction T 2 quoted above. (In general, the site, in which is the LM managing the locking of the primary copy, of the data element x j does not contain this copy). In distributed 2PL the Lock manager is distributed on the DDB sites and the Lock table of the site contains information for all the elements, which copies are stored in the LDB of the site. This distributedness is kept also in voting 2PL with the single difference that it is enough for the transaction to receive the blockings of more than half of the copies of the data element in order to continue its execution. 3. MODELING OF THE PROCESS FOR TRANSACTION SERVICE IN DISTRIBUTED DATABASE SYSTEMS The preliminary analysis of the service transaction process in DDB systems with data replication, showed the complexity and the peculiarities of the process. It is possible to present the structure of such a process by means of queuing models which are characterized with the following: - The input flow of the global transactions in the DDBMS sites is Poisson λ parameter the length of the interval between the input time has an exponential distribution; - The service transactions continuity has an exponential distribution with a parameter µ; - The service devices (CC components) are situated in sequence in parallel channels with parallel queues; - The order of service is First Input First Serviced ; - II.12-2 -

Figure 3 Schema of Centralized 2Pl and Primary Copy 2PL Figure 4 Schema of Distributed 2Pl and Voting 2PL The incoming transaction does not leave the system before it is served. The scientific analysis of the situation in DDB systems showed that the whole system can be presented as a combination of subsystems, functioning in parallel independently of one another. Such a system can be qualified as Queuing system with infinite multitude of serving devices and it is presented symbolically. It is difficult to summarize these assumptions for the cases with partially changed conditions. For example, if each transaction in the system receives the necessary locks (in 2PL), and also the elements processed by the transactions are distributed exponentially, it is clear that there would not be obscurities in the model interpretations. It this case the transactions enter the sites exponentially. The same are sent via parallel channels, they are written in particular incoming and outputting buffers, reaching the sites with the necessary data and are submitted to the receiver (leave the system). This allows us to assume that the service process can be modeled by Queuing system with parallel channels. The Mathematic model M/M/ can be used for a number of important operation characteristics as [4]: - Transaction service time in queuing system; - number of serviced transactions; - full channel load; - the length of the queues and others. We assume that M/M/ model is not a precise copy of the real and complex transaction service process in DDBMS. Therefore simulation approach is used. By the general purpose system simulation GPSS World the model and algorithms for service on transaction in the system are described. The global system catalogue (system_catalog) and data elements of the DDB (elementi), LDB of each site (Site_elementi) are modeled with 2-dimensional arrays. Initially the simplest type of transactions is generated transactions that process (read and/or update) one data element. To syntesize the schemes of the models presented below in Fig.5-7, we used the schemes for CC described in [5]. The basic scheme of Logical queuing model has been used as a basis for the models developed here. - II.12-3 -

Figure 5 presents the queuing model of service of the concurrent transactions on the centralized 2PL. Some symbols: S i generator of transactions, with intervals of coming in which are distributed according to an exponential law; QLM 0 queue of the waiting for processing transactions; LM 0 Lock manager device. If the necessary lockings are granted the transaction is split and replicated by the corresponding Transaction coordinator TC i and is executed in corresponding nodes DM j, after that they are reunited in the transaction manager device; the busy locked are released block LT 0 and leave the system. Figure 5 Simulation model of centralized 2PL Figure 6 presents the queuing model of service of the concurrent transactions on the primary copy 2PL: S i Generator of transactions, with input intervals exponentially distributed; TC i Transaction coordinator, it splits the transaction, defines the necessary primary copies and points the transaction to the corresponding lock managers LM j ; QLM j queue of the transaction waiting for LM j processing; If the necessary lockings are granted the transaction is replicated and is executed in corresponding nodes DM j, after that they are reunited in the transaction manager TM i block; the busy locked are released in corresponding blocks LT j and the transaction leaves the system. Figure 6 Simulation model of primary copy 2PL - II.12-4 -

Figure 7 presents the queuing model of service of the concurrent transactions on the distributed 2PL: S i Generator of transactions, with input intervals exponentially distributed; TC i Transaction coordinator, it splits and replicates the transaction defines the necessary executor nodes DM j and points the transaction to the corresponding lock managers LM j ; QLM j queue of the transaction waiting for LM j processing; If the necessary lockings are granted the transaction is executed in corresponding data managers DM j, after that they are reunited in the transaction manager TM i block; the busy locked are released in corresponding blocks LT j and the transaction leaves the system. Figure 7 Simulation model of distributed 2PL 4. SIMULATION MODEL OF DISTRIBUTED 2PL On the basis of the suggested distributed 2PL model on fig. 6a GPSS model [1] is developed which compares the system throughput with 2 and 3 copies of data items. For a unit model time we choose 1 ms. The diagram in fig. 8 shows the received simulation results. T h r o u g h p u t 30 20 10 0 0 1 2 3 4 5 ThinkTime 2 Copies 3 Copies Figure 8 Throughput diagram with 2 и 3 replicas The throughput diagram of DDBMS which works on Distributed 2PL shows that when the data elements have two copies each, the throughput capability for a short response time is about 25% higher than the case with three copies. If the response time is longer, the throughput capability in the case with two copies is close to that when we have three copies. This is due to the longer time necessary for global transactions processing and - II.12-5 -

subtransactions transferring and unification. CONCLUSIONS AND FUTURE WORK 1. A formal DDBMS model and execution block schemes of 2PL in DDB are suggested; 2. An approach for formal model simulation with GPSS in definite limited conditions is developed. 3. Modeling centralized 2PL, and primary copy 2PL, and more complex performance regimes in DDB is forthcoming as well as their simulation, and processing the results from the simulation performance, and the verification of the models. REFERENCES [1] Алтаев, А. Имитационное моделирование на языке GPSS. http://www.ict.edu.ru, 2002. [2] Василева, С. Алгоритми на работа на възлите по централизирания протокол за двуфазна блокировка в разпределени системи за управление на бази от данни. НАУЧНИ ТРУДОВЕ НА на РУСЕНСКИ УНИВЕРСИТЕТ А. Кънчев 2006, т. 45, с. 51-56. [3] Гарсиа-Молина, Г., Д. Ульман, Д. Уидом. Базы данных. M., Уильямс, 2003 [4] Клейнрок, Л. Теория массового обслуживания. М., Машиностроение, 1979. [5] Agrawal, R., J. M. Carey, M. Livny. Concurrency Control Performance Modeling: Alternatives and Implications. ACM Transactions on Database Systems, Vol. 12, No. 4, 1987, p.609-654. [6] Bernstein, Ph., N. Goodman. Concurrency Control in Distributed Database Systems. Computing Surveys, Vol. 13, No. 2, 1981, p.185-221. [7] Carey, M., M.Livny. Distributed Concurrency Control Performance: A Study of Algorithms, Distribution and Replication. Proceedings of the 14 th VLDB Conference, Los Angeles, 1988, p. 13-25. [8] Connolly, T., C. Begg. Database Systems. Addison Wesley, 2002. [9] Miller, J., N. Griffeth. Performance Modeling of Database and Simulation Protocols: Design Choices for Query Driven Simulation. http://chief.cs.uga.edu/~jam/papers/prot.ps. [10] Vasileva, S. Algorithms of work of nodes on the protocol for 2PL with primary copies in distributed database management systems. //Journal of the Technical university at Plovdiv Fundamental sciences and applications : Anniversary scientific conference 2006, vol. 13(1), 2006, pp. 99-105. (In Bulgaria) [11] Vasileva, S. Algorithms of work of nodes on the Distributed protocol for 2PL in Distributed Database Management Systems // 9th International conference of Tensor society on Differential geometry, Informatics and their applications, Sapporo, Japan, 2006. ABOUT THE AUTHORS Assist. Prof. Svetlana Vasileva, PhD student, College - Dobrich, University of Shumen, Phone: +359 58 603 248, Е-mail: svetlanaeli@dobrich.net Assoc. Prof. Petar Milev, PhD, Department of Mathematics and Informatics, University of Shumen, Phone: +359 54 830 340, Е-mail: milev@fmi.shu-bg.net. Assist. Prof., Borislav Stoyanov, PhD, Faculty of Computer System and Techologies, University of Shumen, Bulgaria, Phone: +359 89 9257770, Е-mail: bpstoyanov@yahoo.com. - II.12-6 -