Distributed Data Management
|
|
- Lizbeth Flynn
- 5 years ago
- Views:
Transcription
1 Lecture Data Management Chapter 1: Erik Buchmann IPD, Forschungsbereich Systeme der Informationsverwaltung
2 of this Chapter are databases and distributed data management not completely different concepts, even contradictory? what is distributed? classification. transactions.. Query processing. Outline of this course. Erik Buchmann DDM: 2
3 IPD, Forschungsbereich Systeme der Informationsverwaltung
4 Situation without Databases (1) Access to data stored in files. Respective functionality is part of applications. Book New Entrant Book File Lending Reader File Reminder Lending File Erik Buchmann DDM: 4
5 Situation without Databases (2) Redundancy. Book New Entrant Lending Reminder Book File Book File Reader File Lending File Reader File Lending File Erik Buchmann DDM: 5
6 Situation without Databases (3) Other challenges: concurrency transactional guarantees (atomarity, consistency, isolation, durability) physical / logical data representation independence data privacy, data security no standard approach for the management of huge amounts of data Erik Buchmann DDM: 6
7 From Files to Databases Access to data stored in files. Respective functionality is part of applications. (Taking physical issues into account, concurrency, access control, consistency). Databases factor out this functionality. Book New Entrant Lending Reminder Book New Entrant Lending Reminder Book File Reader File Lending File Erik Buchmann DDM: 7
8 (Centralized) Databases Data integration, all applications use the same database Physical/logical data independence Efficiency, databases handle large volumes of data Integrity constraints, consistency even if many parallel users execute different transactions on the same data Declarative query languages, SQL Automatic query optimization When considering all these benefits, why bothering with distributed data management? Erik Buchmann DDM: 8
9 Why Data Management, After All? Data might be distributed to minimize communication costs Data might be distributed to equalize the workload among multiple nodes Data might be kept at the site of the creator in order to allow cheap updates Data might be replicated at multiple sites to improve availability, throughput and response times Some scenarios are distributed by nature, e.g., the IT infrastructure of a global company Erik Buchmann DDM: 9
10 Example: A Large Retailer 1 central enterprise resource planning system 100 stores In each store 1 manufacturing information system 3 points of sale Different data, queries, updates on each node Different hardware used Erik Buchmann DDM: 10
11 Databases vs. Data Management Databases and distributed data management approaches that seem to exclude each other. Databases: Application does not do the data management itself any more, data management is centralized. Data Management: Many different nodes manage different data at different locations. Erik Buchmann DDM: 11
12 Databases (1) Several databases, together with coordination layer. App. 1 App. 2 App. 3 Coordination Layer Again, one single point of access transparent to application Erik Buchmann DDM: 12
13 Databases (2) Do we end up with the deficiencies of the situation without databases again? has control over redundant data. Technology discussed here avoids inconsistencies. Generic functionality remains to be factored out. is transparent (at least, this is objective). User/application programmer has the illusion that he deals with a centralized DB. this objective is not always realistic. we will learn in the next chapters when compromises are unavoidable Erik Buchmann DDM: 13
14 IPD, Forschungsbereich Systeme der Informationsverwaltung
15 Systems, Computing system (of computers): Set of autonomous processors which are connected by a network, and which cooperate in fulfilling the tasks assigned to them. What is distributed? application logic, function, data, control. Erik Buchmann DDM: 15
16 Classification of Systems (1) Degree of coupling ratio of volume of data exchanged and extent of local processing; typical situations: communication via computer network weak coupling, shared components (main memory, secondary storage) strong coupling. of the connection alternatives: point-to-point, shared connection (bus). Erik Buchmann DDM: 16
17 Classification of Systems (2) Degree of independence of the components frequency of information exchange continuously or only at the beginning and the end (task, result), Synchronization of components synchronous asynchronous. Erik Buchmann DDM: 17
18 Why in the First Place? (1) Bottom up. Top Down. Erik Buchmann DDM: 18
19 Why in the First Place? (2) Bottom up: Corresponds to structure of organization, many modern information systems are distributed by nature (multimedia applications, ERPs, web-based information systems and kiosks), Erik Buchmann DDM: 19
20 Why in the First Place? (3) Top down: higher reliability, no single point of failure. Better performance and lower response times, data locality, divide&conquer for complex problems more computing power becomes available, software development less complex and therefore cheaper. Erik Buchmann DDM: 20
21 Performance Impact of distribution on performance: Less CPU- and I/O-contention. Data Locality Locality reduces network delays, less communication overhead, for physical reasons, this reduction has natural bounds in WANs. Inter- and intra-query parallelism. Systems that do caching: Intra-query parallelism may even yield superlinear speedup. Erik Buchmann DDM: 21
22 Read-Only Performance vs. Update Performance Mirroring the entire database only works in read-only environments, Approaches in database products: Multiplexing of the database, i.e., production database and query database. better query performance (if query is not part of a transaction that also contains updates) Time multiplexing: Batching of updates, only queries; vice versa later on. Erik Buchmann DDM: 22
23 Fragmentation vs. Replication NYC Employees, London Employees, NYC Projects NYC Employees, London Employees, NYC Projects, London Projects New York City Tokio Tokio Employees, Tokio Projects, London Projects Hong Kong London HK Employees, HK Projects Example of distributed application. Erik Buchmann DDM: 23
24 Example points out necessity of different kinds of transparency (see following slides): data independence, network transparency, replication transparency, fragmentation transparency. Erik Buchmann DDM: 24
25 Data Independence Data independence: Immunity of application against changes of data organization, Type of transparency that also exists in non-distributed context. Data Definition: Schema description, Physical data description. Correspondingly, two kinds of data independence: Logical data independence (e.g., attribute is added), physical data independence (e.g., index is changed). Erik Buchmann DDM: 25
26 Physical Data Independence Illustration NAME FIRSTNAME STREET AGE Böhm Klemens Nordstrasse 28 Buchmann Erik Breiter Weg 26 Duckstein Ralf Goethestrasse 25 Saake Gunter Waldweg 43 FIRSTNAME NAME STREET AGE Erik Buchmann Breiter Weg 26 Gunter Saake Waldweg 43 Klemens Böhm Nordstrasse 28 Ralf Duckstein Goethestrasse 25 Query, e.g., select NAME from PERSON where FIRSTNAME == 'Ralf' works for any representation. Erik Buchmann DDM: 26
27 Logical Data Independence Illustration Underlying relation: ESD(Employee, Salary, Department) create view highincomeemp as select Employee, Salary from ESD where Salary > 20 View can be used like normal relation, e.g.: select * from highincomeemp where Salary < 50 insert into highincomeemp values ('Klemens', 35000) Modifications of base relation might not affect view. Erik Buchmann DDM: 27
28 Network Existence of network (at least certain details on a technical level) should be hidden from application programmer. Two complementary variants: Location transparency: Command independent of location of the data and the system where command is executed. Naming transparency: Each object has unique name. (Otherwise: Application must insert location name as part of the object name.) Popular examples of location/naming transparency? Erik Buchmann DDM: 28
29 Network for Flexibility Requirements regarding performance become higher add new node, instead of replacement of entire system. Scale-out. Analogously, node may want to leave the (distributed) system. We could call it number-of-nodes transparency. Erik Buchmann DDM: 29
30 Replication In principle, replication is advantageous: Higher locality of reference, reliability and availability. Replication transparency: User does not see that replicas exist. (Thus, replication transparency is stronger than network transparency.) Important concern of this course: Show that mechanisms for replication transparency are very elaborate (in the presence of updates and failures), Present mechanisms for replication transparency. Erik Buchmann DDM: 30
31 Fragmentation Global query fragment queries NYC Employees, London Employees, NYC Projects New York City NYC Employees, London Employees, NYC Projects, London Projects London Tokio Tokio Employees, Tokio Projects, London Projects Hong Kong HK Employees, HK Projects Erik Buchmann DDM: 31
32 Transactions (1) Important topic of the course: distributed transactions. Transaction: Sequence of operations s.t. the system gives certain guarantees for its execution. Transaction transition from one consistent database state to another one. Terminology: Atomicity (failure atomicity) Consistency Isolation (concurrency transparency) Durability Erik Buchmann DDM: 32
33 Atomicity, Isolation Transactional guarantees in particular, atomicity and isolation. Atomicity Example, bank scenario : Bank Person Balance Sparkasse Klemens 5000 Deutsche Bank Gunter 200 Which distribution/replication scenarios are possible? Money transfer two elementary operations. debit(klemens, 500), credit(gunter, 500). Isolation can be explained with this example, too Erik Buchmann DDM: 33
34 Transactions (2) Several aspects: Failure atomicity (cf. previous example). transactions for replicated databases one objective of distributed data management: higher reliability. failure other parts of the distributed database shall remain accessible, operation goes on. Assumption: site failures and communication failures are always possible. Erik Buchmann DDM: 34
35 IPD, Forschungsbereich Systeme der Informationsverwaltung
36 What is a (1)? database = collection of several databases with logical relationships between them, distributed over a network of computers. = software administering the distributed database and hiding distribution from the users. we are not talking about simple shared resources scenarios Erik Buchmann DDM: 36
37 What is a (2)? Not a distributed DB: Collection of files on different nodes of a network Several databases on the same machine on multi-processor One centralized database in a network Reason: no common structure among the files; no common interface for applications Network is not the only common component. No message exchange necessary. Same database problems as before. Erik Buchmann DDM: 37
38 Federated vs. Federated a number of loosely coupled, indepentent different database schemas, query languages, transaction models, programming interfaces wrapper needed, virtual integration (Integrated) tightly coupled with logical relationships between them uniform view on all resources no wrapper, physical integration Erik Buchmann DDM: 38
39 Advantages of Advantages, as compared to centralized. local autonomy local control degree of autonomy not as high as with federated s higher performance smaller number of transactions in local, but: sometimes data from different nodes. higher reliability and availability failures do not affect the entire system extensibility new nodes cost effectiveness smaller computers, sharing of resources Erik Buchmann DDM: 39
40 Disadvantages (1) Design of a distributed is more complex than the one of a centralized. Replication of data objects Choice of copy to be read, Guaranteeing that update takes place on each copy. Site- and/or communication failure Effects of updates must go to all nodes in time. Synchronization of transactions more difficult in presence of several sites, as compared to centralized case. Erik Buchmann DDM: 40
41 Disadvantages (2) Complexity all intricacies known from centralized plus additional problems, see rest of this course. Costs not only hardware; but also software, communication, staff Control is decentralized coordination? Data security more components that require protection, and network in addition. Erik Buchmann DDM: 41
42 Query Processing (1) Query Processing not the same as read operations. What differences in the distributed case? Replication, fragmentation etc. query processing must be extended Such extensions are either straightforward, are interesting, e.g., semi-joins, but impact tends to be low. r A r C,r D r A,r B r C,r D Erik Buchmann DDM: 42
43 Query Processing (2) Slightly different perspective: sources are around, e.g., in enterprises. Queries over several/many of such sources. Solutions: Federated database systems, mediator-based architectures. Scenario behind more recent developments: Sources are highly distributed, e.g., WWW, queries over several/many such sources. r A r C,r D r A,r B r C,r D Erik Buchmann DDM: 43
44 Query Processing (3) Which changes? Availability of sources not clear, may change at any time. In general, semantics of data rather unclear to users. Cost issues frequencies of data values are less obvious; construction of histograms not feasible in general. Topics in this course: Continuous QP, Online QP. r A r C,r D r A,r B r C,r D Erik Buchmann DDM: 44
45 of the Course IPD, Forschungsbereich Systeme der Informationsverwaltung
46 of this Course (1) motivation, architecture of distributed s, brief intro to concurrency control, transactions atomic commit protocols, 2PC vs. 3PC; optimizations, e.g., presumed abort; transactions in federated databases, tickets, Replication transactional guarantees in distributed systems with replication, replication in presence of site failures and communication failures, lazy replication, epidemic protocols for update propagation, Erik Buchmann DDM: 46
47 of this Course (2) Caching semantic caching, cache consistency and cache coherency, distributed caching, prefetching, Query processing in distributed environments (with a focus on open, unreliable settings such as the Internet) continuous query processing, first-few queries and user-adaptive query evaluation. Data management in sensor networks query processing in unreliable scenarios where many independent nodes process sensor data; optimization of communication processes Erik Buchmann DDM: 47
Distributed Data Management
Lecture Distributed Data Management Chapter 7: Lazy Replication Erik Buchmann buchmann@ipd.uka.de IPD, Forschungsbereich Systeme der Informationsverwaltung Synchronous vs. Asynchronous Updates Synchronous
More informationDistributed Databases Systems
Distributed Databases Systems Lecture No. 01 Distributed Database Systems Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro
More informationDistributed KIDS Labs 1
Distributed Databases @ KIDS Labs 1 Distributed Database System A distributed database system consists of loosely coupled sites that share no physical component Appears to user as a single system Database
More informationChapter 4: Distributed Transactions (First Part) IPD, Forschungsbereich Systeme der Informationsverwaltung
Chapter 4: Distributed Transactions (First Part) IPD, Forschungsbereich e der Informationsverwaltung 1 Distributed Transactions (1) US Customers Transfer USD 500,-- from Klemens account to Jim s account.
More informationIntroduction to the Course
Outline Introduction to the Course Introduction Distributed DBMS Architecture Distributed Database Design Query Processing Transaction Management Issues in Distributed Databases and an Example Objectives
More informationOutline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems
Distributed Systems Outline Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems What Is A Distributed System? A collection of independent computers that appears
More informationData Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 14: Data Replication Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Database Replication What is database replication The advantages of
More informationConception of Information Systems Lecture 1: Basics
Conception of Information Systems Lecture 1: Basics 8 March 2005 http://lsirwww.epfl.ch/courses/cis/2005ss/ 2004-2005, Karl Aberer & J.P. Martin-Flatin 1 Information System: Definition Webopedia: An information
More informationmanagement systems Elena Baralis, Silvia Chiusano Politecnico di Torino Pag. 1 Distributed architectures Distributed Database Management Systems
atabase Management Systems istributed database istributed architectures atabase Management Systems istributed atabase Management Systems ata and computation are distributed over different machines ifferent
More informationDistributed Systems. Characteristics of Distributed Systems. Lecture Notes 1 Basic Concepts. Operating Systems. Anand Tripathi
1 Lecture Notes 1 Basic Concepts Anand Tripathi CSci 8980 Operating Systems Anand Tripathi CSci 8980 1 Distributed Systems A set of computers (hosts or nodes) connected through a communication network.
More informationDistributed Systems. Characteristics of Distributed Systems. Characteristics of Distributed Systems. Goals in Distributed System Designs
1 Anand Tripathi CSci 8980 Operating Systems Lecture Notes 1 Basic Concepts Distributed Systems A set of computers (hosts or nodes) connected through a communication network. Nodes may have different speeds
More informationDistributed Information Processing
Distributed Information Processing 1 st Lecture Eom, Hyeonsang ( 엄현상 ) Department of Computer Science & Engineering Seoul National University Copyrights 2017 Eom, Hyeonsang All Rights Reserved Outline
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL
More informationMigrating Oracle Databases To Cassandra
BY UMAIR MANSOOB Why Cassandra Lower Cost of ownership makes it #1 choice for Big Data OLTP Applications. Unlike Oracle, Cassandra can store structured, semi-structured, and unstructured data. Cassandra
More informationCSE544 Database Architecture
CSE544 Database Architecture Tuesday, February 1 st, 2011 Slides courtesy of Magda Balazinska 1 Where We Are What we have already seen Overview of the relational model Motivation and where model came from
More informationIt also performs many parallelization operations like, data loading and query processing.
Introduction to Parallel Databases Companies need to handle huge amount of data with high data transfer rate. The client server and centralized system is not much efficient. The need to improve the efficiency
More informationDISTRIBUTED DATABASES CS561-SPRING 2012 WPI, MOHAMED ELTABAKH
DISTRIBUTED DATABASES CS561-SPRING 2012 WPI, MOHAMED ELTABAKH 1 RECAP: PARALLEL DATABASES Three possible architectures Shared-memory Shared-disk Shared-nothing (the most common one) Parallel algorithms
More informationDistributed Database Management Systems. Data and computation are distributed over different machines Different levels of complexity
atabase Management Systems istributed database atabase Management Systems istributed atabase Management Systems B M G 1 istributed architectures ata and computation are distributed over different machines
More informationDistributed Systems Principles and Paradigms
Distributed Systems Principles and Paradigms Chapter 01 (version September 5, 2007) Maarten van Steen Vrije Universiteit Amsterdam, Faculty of Science Dept. Mathematics and Computer Science Room R4.20.
More informationCA464 Distributed Programming
1 / 25 CA464 Distributed Programming Lecturer: Martin Crane Office: L2.51 Phone: 8974 Email: martin.crane@computing.dcu.ie WWW: http://www.computing.dcu.ie/ mcrane Course Page: "/CA464NewUpdate Textbook
More informationLecture 1: January 22
CMPSCI 677 Distributed and Operating Systems Spring 2018 Lecture 1: January 22 Lecturer: Prashant Shenoy Scribe: Bin Wang 1.1 Introduction to the course The lecture started by outlining the administrative
More informationDISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 1. Introduction
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN Chapter 1 Introduction Modified by: Dr. Ramzi Saifan Definition of a Distributed System (1) A distributed
More informationLecture 1: January 23
CMPSCI 677 Distributed and Operating Systems Spring 2019 Lecture 1: January 23 Lecturer: Prashant Shenoy Scribe: Jonathan Westin (2019), Bin Wang (2018) 1.1 Introduction to the course The lecture started
More informationDistributed Systems Principles and Paradigms. Chapter 01: Introduction
Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, steen@cs.vu.nl Chapter 01: Introduction Version: October 25, 2009 2 / 26 Contents Chapter
More informationDistributed Systems Principles and Paradigms. Chapter 01: Introduction. Contents. Distributed System: Definition.
Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer Science Room R4.20, steen@cs.vu.nl Chapter 01: Version: February 21, 2011 1 / 26 Contents Chapter 01: 02: Architectures
More informationDatabase Management System. Fundamental Database Concepts
Database Management System Fundamental Database Concepts CONTENTS Basics of DBMS Purpose of DBMS Applications of DBMS Views of Data Instances and Schema Data Models Database Languages Responsibility of
More informationMulti-CPU and distributed systems. Database system architectures. Lecture topics: monolithic systems client server systems parallel database servers
Database system architectures Multi- and distributed systems Lecture topics: monolithic systems client server systems parallel database servers References: text rd edition, chapter : sections,, 6, 8 text
More informationChapter 18: Parallel Databases
Chapter 18: Parallel Databases Introduction Parallel machines are becoming quite common and affordable Prices of microprocessors, memory and disks have dropped sharply Recent desktop computers feature
More informationDatabases. Jörg Endrullis. VU University Amsterdam
Databases Jörg Endrullis VU University Amsterdam Databases A database (DB) is a collection of data with a certain logical structure a specific semantics a specific group of users Databases A database (DB)
More informationOutline. Database Management Systems (DBMS) Database Management and Organization. IT420: Database Management and Organization
Outline IT420: Database Management and Organization Dr. Crăiniceanu Capt. Balazs www.cs.usna.edu/~adina/teaching/it420/spring2007 Class Survey Why Databases (DB)? A Problem DB Benefits In This Class? Admin
More informationCSE 444: Database Internals. Lecture 22 Distributed Query Processing and Optimization
CSE 444: Database Internals Lecture 22 Distributed Query Processing and Optimization CSE 444 - Spring 2014 1 Readings Main textbook: Sections 20.3 and 20.4 Other textbook: Database management systems.
More informationCS4513 Distributed Computer Systems
Outline CS4513 Distributed Computer Systems Overview Goals Software Client Server Introduction (Ch 1: 1.1-1.2, 1.4-1.5) The Rise of Distributed Systems Computer hardware prices falling, power increasing
More informationCSE 544: Principles of Database Systems
CSE 544: Principles of Database Systems Anatomy of a DBMS, Parallel Databases 1 Announcements Lecture on Thursday, May 2nd: Moved to 9am-10:30am, CSE 403 Paper reviews: Anatomy paper was due yesterday;
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 11/15/12 Agenda Check-in Centralized and Client-Server Models Parallelism Distributed Databases Homework 6 Check-in
More informationIntroduction to Distributed Systems
Introduction to Distributed Systems Distributed Systems L-A Sistemi Distribuiti L-A Andrea Omicini andrea.omicini@unibo.it Ingegneria Due Alma Mater Studiorum Università di Bologna a Cesena Academic Year
More informationCSE 530A ACID. Washington University Fall 2013
CSE 530A ACID Washington University Fall 2013 Concurrency Enterprise-scale DBMSs are designed to host multiple databases and handle multiple concurrent connections Transactions are designed to enable Data
More informationDistributed Databases
Distributed Databases Chapter 22.6-22.14 Comp 521 Files and Databases Spring 2010 1 Final Exam When Monday, May 3, at 4pm Where, here FB007 What Open book, open notes, no computer 48-50 multiple choice
More informationChapter 11 - Data Replication Middleware
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 11 - Data Replication Middleware Motivation Replication: controlled
More informationOutline. Distributed Computing Systems. The Rise of Distributed Systems. Depiction of a Distributed System 4/15/2014
Outline Distributed Computing Systems Overview of Distributed Systems Overview Goals Software Client Server Andrew Tanenbaum and Marten van Steen, Distributed Systems Principles and Paradigms, Prentice
More informationLecture 23 Database System Architectures
CMSC 461, Database Management Systems Spring 2018 Lecture 23 Database System Architectures These slides are based on Database System Concepts 6 th edition book (whereas some quotes and figures are used
More informationReplication in Distributed Systems
Replication in Distributed Systems Replication Basics Multiple copies of data kept in different nodes A set of replicas holding copies of a data Nodes can be physically very close or distributed all over
More informationIntegrity in Distributed Databases
Integrity in Distributed Databases Andreas Farella Free University of Bozen-Bolzano Table of Contents 1 Introduction................................................... 3 2 Different aspects of integrity.....................................
More informationSCALABLE CONSISTENCY AND TRANSACTION MODELS
Data Management in the Cloud SCALABLE CONSISTENCY AND TRANSACTION MODELS 69 Brewer s Conjecture Three properties that are desirable and expected from realworld shared-data systems C: data consistency A:
More informationIntroduction to Distributed Systems
Introduction to Distributed Systems Distributed Systems Sistemi Distribuiti Andrea Omicini andrea.omicini@unibo.it Ingegneria Due Alma Mater Studiorum Università di Bologna a Cesena Academic Year 2011/2012
More informationCISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL
CISC 7610 Lecture 5 Distributed multimedia databases Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL Motivation YouTube receives 400 hours of video per minute That is 200M hours
More informationAdvanced Databases Lecture 17- Distributed Databases (continued)
Advanced Databases Lecture 17- Distributed Databases (continued) Masood Niazi Torshiz Islamic Azad University- Mashhad Branch www.mniazi.ir Alternative Models of Transaction Processing Notion of a single
More informationDistributed Databases
Distributed Databases Chapter 22, Part B Database Management Systems, 2 nd Edition. R. Ramakrishnan and Johannes Gehrke 1 Introduction Data is stored at several sites, each managed by a DBMS that can run
More informationIntroduction to Databases, Fall 2005 IT University of Copenhagen. Lecture 10: Transaction processing. November 14, Lecturer: Rasmus Pagh
Introduction to Databases, Fall 2005 IT University of Copenhagen Lecture 10: Transaction processing November 14, 2005 Lecturer: Rasmus Pagh Today s lecture Part I: Transaction processing Serializability
More informationDistributed Transaction Management 2003
Distributed Transaction Management 2003 Jyrki Nummenmaa http://www.cs.uta.fi/~dtm jyrki@cs.uta.fi General information We will view this from the course web page. Motivation We will pick up some motivating
More informationConsistency and Replication (part b)
Consistency and Replication (part b) EECS 591 Farnam Jahanian University of Michigan Tanenbaum Chapter 6.1-6.5 Eventual Consistency A very weak consistency model characterized by the lack of simultaneous
More informationIntro to Transactions
Reading Material CompSci 516 Database Systems Lecture 14 Intro to Transactions [RG] Chapter 16.1-16.3, 16.4.1 17.1-17.4 17.5.1, 17.5.3 Instructor: Sudeepa Roy Acknowledgement: The following slides have
More informationIntroduction to Data Management. Lecture #1 (Course Trailer ) Instructor: Chen Li
Introduction to Data Management Lecture #1 (Course Trailer ) Instructor: Chen Li 1 Today s Topics v Welcome to one of my biggest classes ever! v Read (and live by) the course wiki page: http://www.ics.uci.edu/~cs122a/
More informationMobile and Heterogeneous databases Security. A.R. Hurson Computer Science Missouri Science & Technology
Mobile and Heterogeneous databases Security A.R. Hurson Computer Science Missouri Science & Technology 1 Note, this unit will be covered in two lectures. In case you finish it earlier, then you have the
More informationChapter Outline. Chapter 2 Distributed Information Systems Architecture. Distributed transactions (quick refresh) Layers of an information system
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 2 Distributed Information Systems Architecture Chapter Outline
More informationEssay Question: Explain 4 different means by which constrains are represented in the Conceptual Data Model (CDM).
Question 1 Essay Question: Explain 4 different means by which constrains are represented in the Conceptual Data Model (CDM). By specifying participation conditions By specifying the degree of relationship
More informationChapter 2 Distributed Information Systems Architecture
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 2 Distributed Information Systems Architecture Chapter Outline
More informationDistributed Systems. Lecture 4 Othon Michail COMP 212 1/27
Distributed Systems COMP 212 Lecture 4 Othon Michail 1/27 What is a Distributed System? A distributed system is: A collection of independent computers that appears to its users as a single coherent system
More informationDistributed Systems LEEC (2006/07 2º Sem.)
Distributed Systems LEEC (2006/07 2º Sem.) Introduction João Paulo Carvalho Universidade Técnica de Lisboa / Instituto Superior Técnico Outline Definition of a Distributed System Goals Connecting Users
More informationIntroduction: Database Concepts Slides by: Ms. Shree Jaswal
Introduction: Database Concepts Slides by: Ms. Shree Jaswal Topics: Introduction Characteristics of databases File system V/s Database system Users of a Database system Data Models, Schemas, and Instances
More informationCurrent Topics in OS Research. So, what s hot?
Current Topics in OS Research COMP7840 OSDI Current OS Research 0 So, what s hot? Operating systems have been around for a long time in many forms for different types of devices It is normally general
More informationChapter 1: Distributed Systems: What is a distributed system? Fall 2013
Chapter 1: Distributed Systems: What is a distributed system? Fall 2013 Course Goals and Content n Distributed systems and their: n Basic concepts n Main issues, problems, and solutions n Structured and
More informationDistributed File Systems Issues. NFS (Network File System) AFS: Namespace. The Andrew File System (AFS) Operating Systems 11/19/2012 CSC 256/456 1
Distributed File Systems Issues NFS (Network File System) Naming and transparency (location transparency versus location independence) Host:local-name Attach remote directories (mount) Single global name
More informationCS352 Lecture: Database System Architectures last revised 11/22/06
CS352 Lecture: Database System Architectures last revised 11/22/06 I. Introduction - ------------ A. Most large databases require support for accesing the database by multiple users, often at multiple
More informationData-Centric Consistency Models. The general organization of a logical data store, physically distributed and replicated across multiple processes.
Data-Centric Consistency Models The general organization of a logical data store, physically distributed and replicated across multiple processes. Consistency models The scenario we will be studying: Some
More informationChapter 18: Parallel Databases Chapter 19: Distributed Databases ETC.
Chapter 18: Parallel Databases Chapter 19: Distributed Databases ETC. Introduction Parallel machines are becoming quite common and affordable Prices of microprocessors, memory and disks have dropped sharply
More informationCOMP Instructor: Dimitris Papadias WWW page:
COMP 5311 Instructor: Dimitris Papadias WWW page: http://www.cse.ust.hk/~dimitris/5311/5311.html Textbook Database System Concepts, A. Silberschatz, H. Korth, and S. Sudarshan. Reference Database Management
More informationGUJARAT TECHNOLOGICAL UNIVERSITY
Type of course: Elective SUBJECT NAME: Distributed DBMS SUBJECT CODE: 21714 B.E. 7 th SEMESTER Prerequisite: Database Management Systems & Networking Rationale: Students are familiar with Centralized DBMS.
More informationLecture 9: MIMD Architectures
Lecture 9: MIMD Architectures Introduction and classification Symmetric multiprocessors NUMA architecture Clusters Zebo Peng, IDA, LiTH 1 Introduction A set of general purpose processors is connected together.
More informationDatabase System Concepts
s Design Chapter 1: Introduction Departamento de Engenharia Informática Instituto Superior Técnico 1 st Semester 2009/2010 Slides (fortemente) baseados nos slides oficiais do livro c Silberschatz, Korth
More informationOutline. q Database integration & querying. q Peer-to-Peer data management q Stream data management q MapReduce-based distributed data management
Outline n Introduction & architectural issues n Data distribution n Distributed query processing n Distributed query optimization n Distributed transactions & concurrency control n Distributed reliability
More informationDatabase Management Systems Introduction to DBMS
Database Management Systems Introduction to DBMS D B M G 1 Introduction to DBMS Data Base Management System (DBMS) A software package designed to store and manage databases We are interested in internal
More informationIntroduction to Data Management. Lecture #1 (Course Trailer )
Introduction to Data Management Lecture #1 (Course Trailer ) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Topics v Welcome to one
More informationCMPT 354 Database Systems I. Spring 2012 Instructor: Hassan Khosravi
CMPT 354 Database Systems I Spring 2012 Instructor: Hassan Khosravi Textbook First Course in Database Systems, 3 rd Edition. Jeffry Ullman and Jennifer Widom Other text books Ramakrishnan SILBERSCHATZ
More informationDistributed Databases
Distributed Databases Chapter 21, Part B Database Management Systems, 2 nd Edition. R. Ramakrishnan and Johannes Gehrke 1 Introduction Data is stored at several sites, each managed by a DBMS that can run
More informationBasic vs. Reliable Multicast
Basic vs. Reliable Multicast Basic multicast does not consider process crashes. Reliable multicast does. So far, we considered the basic versions of ordered multicasts. What about the reliable versions?
More informationDistributed Systems. Overview. Distributed Systems September A distributed system is a piece of software that ensures that:
Distributed Systems Overview Distributed Systems September 2002 1 Distributed System: Definition A distributed system is a piece of software that ensures that: A collection of independent computers that
More informationIntroduction to Database Systems. Motivation. Werner Nutt
Introduction to Database Systems Motivation Werner Nutt 1 Databases Are Everywhere Database = a large (?) collection of related data Classically, a DB models a real-world organisation (e.g., enterprise,
More informationWhat are Transactions? Transaction Management: Introduction (Chap. 16) Major Example: the web app. Concurrent Execution. Web app in execution (CS636)
What are Transactions? Transaction Management: Introduction (Chap. 16) CS634 Class 14, Mar. 23, 2016 So far, we looked at individual queries; in practice, a task consists of a sequence of actions E.g.,
More informationFIT: A Distributed Database Performance Tradeoff. Faleiro and Abadi CS590-BDS Thamir Qadah
FIT: A Distributed Database Performance Tradeoff Faleiro and Abadi CS590-BDS Thamir Qadah Desirable features in Distributed Databases Impossible to achieve Fairness Isolation Throughput It is impossible
More information02 - Distributed Systems
02 - Distributed Systems Definition Coulouris 1 (Dis)advantages Coulouris 2 Challenges Saltzer_84.pdf Models Physical Architectural Fundamental 2/58 Definition Distributed Systems Distributed System is
More informationChapter 6 Architectural Design
Chapter 6 Architectural Design Chapter 6 Architectural Design Slide 1 Topics covered The WHAT and WHY of architectural design Architectural design decisions Architectural views/perspectives Architectural
More informationCourse Logistics & Chapter 1 Introduction
CMSC 461, Database Management Systems Spring 2018 Course Logistics & Chapter 1 Introduction These slides are based on Database System Concepts book th edition, and the 2009 CMSC 461 slides by Dr. Kalpakis
More informationDistributed Systems. Chapter 1: Introduction
Distributed Systems (3rd Edition) Chapter 1: Introduction Version: February 25, 2017 2/56 Introduction: What is a distributed system? Distributed System Definition A distributed system is a collection
More informationUser Perspective. Module III: System Perspective. Module III: Topics Covered. Module III Overview of Storage Structures, QP, and TM
Module III Overview of Storage Structures, QP, and TM Sharma Chakravarthy UT Arlington sharma@cse.uta.edu http://www2.uta.edu/sharma base Management Systems: Sharma Chakravarthy Module I Requirements analysis
More informationDatacenter replication solution with quasardb
Datacenter replication solution with quasardb Technical positioning paper April 2017 Release v1.3 www.quasardb.net Contact: sales@quasardb.net Quasardb A datacenter survival guide quasardb INTRODUCTION
More informationThe C4 Collector. Or: the Application memory wall will remain until compaction is solved. Gil Tene Balaji Iyengar Michael Wolf
The C4 Collector Or: the Application memory wall will remain until compaction is solved Gil Tene Balaji Iyengar Michael Wolf High Level Agenda 1. The Application Memory Wall 2. Generational collection
More informationEventual Consistency Today: Limitations, Extensions and Beyond
Eventual Consistency Today: Limitations, Extensions and Beyond Peter Bailis and Ali Ghodsi, UC Berkeley - Nomchin Banga Outline Eventual Consistency: History and Concepts How eventual is eventual consistency?
More informationDBMS (FYCS) Unit - 1. A database management system stores data in such a way that it becomes easier to retrieve, manipulate, and produce information.
Prof- Neeta Bonde DBMS (FYCS) Unit - 1 DBMS: - Database is a collection of related data and data is a collection of facts and figures that can be processed to produce information. Mostly data represents
More informationUnit 5: Distributed, Real-Time, and Multimedia Systems
Unit 5: Distributed, Real-Time, and Multimedia Systems Unit Overview Unit 5 provides an extension to the core topics of operating systems. It introduces distributed systems and special-purpose operating
More informationChapter 19: Distributed Databases
Chapter 19: Distributed Databases Chapter 19: Distributed Databases Heterogeneous and Homogeneous Databases Distributed Data Storage Distributed Transactions Commit Protocols Concurrency Control in Distributed
More information! Parallel machines are becoming quite common and affordable. ! Databases are growing increasingly large
Chapter 20: Parallel Databases Introduction! Introduction! I/O Parallelism! Interquery Parallelism! Intraquery Parallelism! Intraoperation Parallelism! Interoperation Parallelism! Design of Parallel Systems!
More informationChapter 20: Parallel Databases
Chapter 20: Parallel Databases! Introduction! I/O Parallelism! Interquery Parallelism! Intraquery Parallelism! Intraoperation Parallelism! Interoperation Parallelism! Design of Parallel Systems 20.1 Introduction!
More informationChapter 20: Parallel Databases. Introduction
Chapter 20: Parallel Databases! Introduction! I/O Parallelism! Interquery Parallelism! Intraquery Parallelism! Intraoperation Parallelism! Interoperation Parallelism! Design of Parallel Systems 20.1 Introduction!
More informationDistributed Operating Systems Spring Prashant Shenoy UMass Computer Science.
Distributed Operating Systems Spring 2008 Prashant Shenoy UMass Computer Science http://lass.cs.umass.edu/~shenoy/courses/677 Lecture 1, page 1 Course Syllabus CMPSCI 677: Distributed Operating Systems
More informationDistributed Data Analytics Transactions
Distributed Data Analytics G-3.1.09, Campus III Hasso Plattner Institut must ensure that interactions succeed consistently An OLTP Topic Motivation Most database interactions consist of multiple, coherent
More informationTransaction Management: Introduction (Chap. 16)
Transaction Management: Introduction (Chap. 16) CS634 Class 14 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke What are Transactions? So far, we looked at individual queries;
More informationAdministration Naive DBMS CMPT 454 Topics. John Edgar 2
Administration Naive DBMS CMPT 454 Topics John Edgar 2 http://www.cs.sfu.ca/coursecentral/454/johnwill/ John Edgar 4 Assignments 25% Midterm exam in class 20% Final exam 55% John Edgar 5 A database stores
More information1/9/13. + The Transaction Concept. Transaction Processing. Multiple online users: Gives rise to the concurrency problem.
+ Transaction Processing Enterprise Scale Data Management Divy Agrawal Department of Computer Science University of California at Santa Barbara + The Transaction Concept Multiple online users: Gives rise
More informationWide Area Query Systems The Hydra of Databases
Wide Area Query Systems The Hydra of Databases Stonebraker et al. 96 Gribble et al. 02 Zachary G. Ives University of Pennsylvania January 21, 2003 CIS 650 Data Sharing and the Web The Vision A World Wide
More informationCourse Introduction & Foundational Concepts
Course Introduction & Foundational Concepts CPS 352: Database Systems Simon Miner Gordon College Last Revised: 8/30/12 Agenda Introductions Course Syllabus Databases Why What Terminology and Concepts Design
More information