Just-Right Consistency. Centralised data store. trois bases. As available as possible As consistent as necessary Correct by design
|
|
- Kerry Boone
- 5 years ago
- Views:
Transcription
1 Just-Right Consistency As available as possible As consistent as necessary Correct by design Marc Shapiro, UMC-LI6 & Inria Annette Bieniusa, U. Kaiserslautern Nuno reguiça, U. Nova Lisboa Christopher Meiklejohn, Northwestern U. Valter Balegas, U. Nova Lisboa trois bases Centralised data store Clients read and write primary copy [ 3 4
2 Geo-replicated DB Network artition + fault-tolerance + low latency read Updates? Read available replica Updates? 5 6 Eventual Consistency Strong Consistency Google Spanner Asynch. updates Available under artition Concurrency anomalies Hell for developers! 7 Synch. updates Consistent under artition Simple programming model Slow, expensive Not available under partition 8
3 Available + correct? Just-Right Consistency Insight: Maintain invariants reserve sequential patterns Merge concurrent updates Synchronise only when strictly necessary for application can't synch, can send correct! best possible availability and performance 9 Maintain (possibly unknown) application invariants Baseline: 1-copy, one op. at a time Invariant-maintaining patterns Ordered updates causal consistency Grouped updates atomic recondition-before-update Stable concurrent OK Not stable concurrency control Acompatible CAsensitive 10 FMK Fælles Medicinkort FMK Fælles Medicinkort atient create ( ) R X atient: Mr harmacy: Causatin: boxes
4 Invariant-maintaining patterns ordered updates atient grouped updates Geo-distributed FMK 1 create ( ) R X atient: Mr harmacy: Causatin: boxes Geo-distr.: invariants? EC does not maintain! precondition C is overkill! precondition 1 ordered updates create ( ) R X atient grouped updates atient: Mr harmacy: Causatin: boxes 1 Operation model generator e g effector e Replicated data: read one, update all Operation: generator = pure, read-only effector = lambda, update-only deferred-update transaction effector effector 15 16
5 Data model: registers? add-med(1) cnt 1 cnt cnt = 0 cnt = A data model: CRDTs add-med(1) cnt += 1 cnt += cnt = 0 cnt = 3 cnt = 0 cnt cnt 1 cnt = 1 cnt = 0 cnt += cnt += 1 cnt = 3 add-med() add-med() Concurrent, asynchronous updates Standard register model: assignments C A concurrent updates merged CRDT: register, counter, set, map, sequence Extends sequential type Encapsulates convergent merge Conflict-free replicated data types Sequential: register, set, map, graph CRDTs Backwards-compatible in sequential execution Concurrency semantics Local reads, mergeable writes Guaranteed convergence 17 Concurrent, asynchronous updates Standard register model: assignments C A concurrent updates merged CRDT: register, counter, set, map, sequence Extends sequential type Encapsulates convergent merge Ordered-updates pattern ordered updates 1 create ( ) R X atient 18 atient: Mr harmacy: Causatin: boxes
6 with CC animation Ordered: A-Compatible create-p before add-pp Ordered-updates invariant pattern: atient record points to valid prescription x valid x points to y y valid admin-login-enabled nondefault-password Make RHS true; then LHS true Transmit in the right order! A-compatible: Causal Consistency with animations 1=RHS! =LHS! Grouped: A-Compat. Atomic update 1! >1 Grouped-update pattern create ( ) R X atient grouped updates atient: Mr harmacy: Causatin: boxes 1 recondition-beforeupdate pattern atient create-p updates doctor, patient & cy record Transmit grouped updates together write-atomic + Read from common set of txns snapshot property = All-or-Nothing (A of ACID) A-compatible patient!! Snapshot patient!? patient?? precondition create ( ) R X atient: Mr harmacy: Causatin: boxes 1 3 4
7 -B-U is CA-sensitive pp(, 1) cnt 1 process-p (, nb) { if cnt nb // precondition at source cnt = nb // at every replica } // recond-before-update pp(, 1) cnt 1 cnt += 3 add-med (, 3) cnt += 3 = process-p (, nb) { if cnt nb // precondition at source cnt = nb // at every replica } // recondition stable w.r.t. concurrent add-med Concurrency OK! recond-before-update pp(, 1) cnt 1 process-p (, nb) { if cnt nb // precondition at source cnt = nb // at every replica } // recond-before-update pp(, 1) cnt 1 cnt = pp(, ) cnt 1 cnt = CISE Static Analysis process-p (, nb) { if cnt nb // precondition at source cnt = nb // at every replica } // recondition not stable w.r.t. concurrent process-p Forbid concurrency? Synchro, C. Or remove invariant? A, degraded semantics 6 7 8
8 CISE analysis JRC: Summary Application = operations + invariants 1. Does each individual op maintain invariant?. Do concurrent updates u v converge? 3. Is precondition of u stable w.r.t. concurrent v? roof: Invariant 3: Change specification (invariant) or Synchronise Designer decision, per pair (u, v) Ex: medication count= inc inc, inc dec, dec dec Tailor consistency to (possibly unknown) application invariants Baseline: 1-copy, one op. at a time Three invariant-maintaining patterns Ordered updates causal consistency Joint updates atomic recondition check Stable concurrent OK Not stable concurrency control 9 30 AntidoteDB.eu AntidoteDB.org CRDT data model Register, counter, set, map, sequence Extend sequential semantics A compatible Transactional Causal Consistency (TCC) Strongest A-compatible model Joint Updates / Transactional artial Order / Causal Consistency Open source, well engineered Community of users Rich interface with concurrent abstract data types (CRDTs) Geo-replication Low latency, Available under artition Erlang / Riak Core Growing community Active development Secondary Indexes, configurable back-ends and protocols, partial replication, clients in many languages, SQL interface, etc. 31 3
9 Sharded, parallel Architecture Copenhagen Total order ClockSI Get started with.org Causal order A: 10 B: 15 C: Aarhus Antidote and Just-Right Consistency JRC methodology for provably ensuring As Available as ossible, Consistent Enough TCC ordered, grouped updates recond-before-update: Bounded Ctr, CISE AntidoteDB: CRDTs.e Causal Consistency Transactions Bounded Counter Acompatible mostly A 33 C when necessary CISE verification & co-design (+ related tools) 35 More work to do Invariant-maintenance patterns: ordered updates, grouped updates, precond-before-update Complete? More evidence needed Scale to the edge: causal consistency expensive Active research area artial replication Avoidance through static analysis? Transaction semantics write-atomic easy read-atomic, causal: costly 34 36
10 Creative Commons Attribution-ShareAlike 4.0 Intl. License You are free to: Share copy and redistribute the material in any medium or format Adapt remix, transform, and build upon the material for any purpose, even commercially, under the following terms: Attribution You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. ShareAlike If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original. 37
(It s the invariants, stupid)
Consistency in three dimensions (It s the invariants, stupid) Marc Shapiro Cloud to the edge Social, web, e-commerce: shared mutable data Scalability replication consistency issues Inria & Sorbonne-Universités
More informationStrong Eventual Consistency and CRDTs
Strong Eventual Consistency and CRDTs Marc Shapiro, INRIA & LIP6 Nuno Preguiça, U. Nova de Lisboa Carlos Baquero, U. Minho Marek Zawirski, INRIA & UPMC Large-scale replicated data structures Large, dynamic
More informationThis talk is about. Data consistency in 3D. Geo-replicated database q: Queue c: Counter { q c } Shared database q: Queue c: Counter { q c }
Data consistency in 3D (It s the invariants, stupid) Marc Shapiro Masoud Saieda Ardekani Gustavo Petri This talk is about Understanding consistency Primitive consistency mechanisms How primitives compose
More informationSWIFTCLOUD: GEO-REPLICATION RIGHT TO THE EDGE
SWIFTCLOUD: GEO-REPLICATION RIGHT TO THE EDGE Annette Bieniusa T.U. Kaiserslautern Carlos Baquero HSALab, U. Minho Marc Shapiro, Marek Zawirski INRIA, LIP6 Nuno Preguiça, Sérgio Duarte, Valter Balegas
More informationStrong Eventual Consistency and CRDTs
Strong Eventual Consistency and CRDTs Marc Shapiro, INRIA & LIP6 Nuno Preguiça, U. Nova de Lisboa Carlos Baquero, U. Minho Marek Zawirski, INRIA & UPMC Large-scale replicated data structures Large, dynamic
More informationConflict-free Replicated Data Types (CRDTs)
Conflict-free Replicated Data Types (CRDTs) for collaborative environments Marc Shapiro, INRIA & LIP6 Nuno Preguiça, U. Nova de Lisboa Carlos Baquero, U. Minho Marek Zawirski, INRIA & UPMC Conflict-free
More informationEventual Consistency Today: Limitations, Extensions and Beyond
Eventual Consistency Today: Limitations, Extensions and Beyond Peter Bailis and Ali Ghodsi, UC Berkeley - Nomchin Banga Outline Eventual Consistency: History and Concepts How eventual is eventual consistency?
More informationLarge-Scale Key-Value Stores Eventual Consistency Marco Serafini
Large-Scale Key-Value Stores Eventual Consistency Marco Serafini COMPSCI 590S Lecture 13 Goals of Key-Value Stores Export simple API put(key, value) get(key) Simpler and faster than a DBMS Less complexity,
More informationCloud-backed applications. Write Fast, Read in the Past: Causal Consistency for Client- Side Applications. Requirements & challenges
Write Fast, Read in the Past: ausal onsistenc for lient- Side Applications loud-backed applications 1. Request 3. Repl 2. Process request & store update 4. Transmit update 50~200ms databas e app server
More informationGenie. Distributed Systems Synthesis and Verification. Marc Rosen. EN : Advanced Distributed Systems and Networks May 1, 2017
Genie Distributed Systems Synthesis and Verification Marc Rosen EN.600.667: Advanced Distributed Systems and Networks May 1, 2017 1 / 35 Outline Introduction Problem Statement Prior Art Demo How does it
More informationWhy distributed databases suck, and what to do about it. Do you want a database that goes down or one that serves wrong data?"
Why distributed databases suck, and what to do about it - Regaining consistency Do you want a database that goes down or one that serves wrong data?" 1 About the speaker NoSQL team lead at Trifork, Aarhus,
More informationBuilding Consistent Transactions with Inconsistent Replication
DB Reading Group Fall 2015 slides by Dana Van Aken Building Consistent Transactions with Inconsistent Replication Irene Zhang, Naveen Kr. Sharma, Adriana Szekeres, Arvind Krishnamurthy, Dan R. K. Ports
More informationExtending Eventually Consistent Cloud Databases for Enforcing Numeric Invariants
Extending Eventually Consistent Cloud Databases for Enforcing Numeric Invariants Valter Balegas, Diogo Serra, Sérgio Duarte Carla Ferreira, Rodrigo Rodrigues, Nuno Preguiça NOVA LINCS/FCT/Universidade
More informationDISTRIBUTED EVENTUALLY CONSISTENT COMPUTATIONS
LASP 2 DISTRIBUTED EVENTUALLY CONSISTENT COMPUTATIONS 3 EN TAL AV CHRISTOPHER MEIKLEJOHN 4 RESEARCH WITH: PETER VAN ROY (UCL) 5 MOTIVATION 6 SYNCHRONIZATION IS EXPENSIVE 7 SYNCHRONIZATION IS SOMETIMES
More informationSelf-healing Data Step by Step
Self-healing Data Step by Step Uwe Friedrichsen (codecentric AG) NoSQL matters Cologne, 29. April 2014 @ufried Uwe Friedrichsen uwe.friedrichsen@codecentric.de http://slideshare.net/ufried http://ufried.tumblr.com
More informationCoordination-Free Computations. Christopher Meiklejohn
Coordination-Free Computations Christopher Meiklejohn LASP DISTRIBUTED, EVENTUALLY CONSISTENT COMPUTATIONS CHRISTOPHER MEIKLEJOHN (BASHO TECHNOLOGIES, INC.) PETER VAN ROY (UNIVERSITÉ CATHOLIQUE DE LOUVAIN)
More informationConflict-Free Replicated Data Types (basic entry)
Conflict-Free Replicated Data Types (basic entry) Marc Shapiro Sorbonne-Universités-UPMC-LIP6 & Inria Paris http://lip6.fr/marc.shapiro/ 16 May 2016 1 Synonyms Conflict-Free Replicated Data Types (CRDTs).
More informationJust-Right Consistency: reconciling availability and safety
Just-Right Consistency: reconciling availability and safety Marc Shapiro, Annette Bieniusa, Nuno Preguiça, Valter Balegas, Christopher Meiklejohn To cite this version: Marc Shapiro, Annette Bieniusa, Nuno
More informationAutomating the Choice of Consistency Levels in Replicated Systems
Automating the Choice of Consistency Levels in Replicated Systems Cheng Li, João Leitão, Allen Clement Nuno Preguiça, Rodrigo Rodrigues, Viktor Vafeiadis Max Planck Institute for Software Systems (MPI-SWS),
More informationEventual Consistency Today: Limitations, Extensions and Beyond
Eventual Consistency Today: Limitations, Extensions and Beyond Peter Bailis and Ali Ghodsi, UC Berkeley Presenter: Yifei Teng Part of slides are cited from Nomchin Banga Road Map Eventual Consistency:
More informationRiak. Distributed, replicated, highly available
INTRO TO RIAK Riak Overview Riak Distributed Riak Distributed, replicated, highly available Riak Distributed, highly available, eventually consistent Riak Distributed, highly available, eventually consistent,
More informationMegastore: Providing Scalable, Highly Available Storage for Interactive Services & Spanner: Google s Globally- Distributed Database.
Megastore: Providing Scalable, Highly Available Storage for Interactive Services & Spanner: Google s Globally- Distributed Database. Presented by Kewei Li The Problem db nosql complex legacy tuning expensive
More informationUsing Erlang, Riak and the ORSWOT CRDT at bet365 for Scalability and Performance. Michael Owen Research and Development Engineer
1 Using Erlang, Riak and the ORSWOT CRDT at bet365 for Scalability and Performance Michael Owen Research and Development Engineer 2 Background 3 bet365 in stats Founded in 2000 Located in Stoke-on-Trent
More informationBuilding Consistent Transactions with Inconsistent Replication
Building Consistent Transactions with Inconsistent Replication Irene Zhang, Naveen Kr. Sharma, Adriana Szekeres, Arvind Krishnamurthy, Dan R. K. Ports University of Washington Distributed storage systems
More informationA Framework for Transactional Consistency Models with Atomic Visibility
A Framework for Transactional Consistency Models with Atomic Visibility Andrea Cerone, Giovanni Bernardi, Alexey Gotsman IMDEA Software Institute, Madrid, Spain CONCUR - Madrid, September 1st 2015 Data
More informationReplication in Distributed Systems
Replication in Distributed Systems Replication Basics Multiple copies of data kept in different nodes A set of replicas holding copies of a data Nodes can be physically very close or distributed all over
More informationIPA: Invariant-Preserving Applications for Weakly Consistent Replicated Databases
IPA: Invariant-Preserving Applications for Weakly Consistent Replicated Databases Valter Balegas NOVA LINCS, FCT, Universidade NOVA de Lisboa v.sousa@campus.fct.unl.pt Rodrigo Rodrigues INESC-ID, Instituto
More informationTARDiS: A branch and merge approach to weak consistency
TARDiS: A branch and merge approach to weak consistency By: Natacha Crooks, Youer Pu, Nancy Estrada, Trinabh Gupta, Lorenzo Alvis, Allen Clement Presented by: Samodya Abeysiriwardane TARDiS Transactional
More informationDatabases. Laboratorio de sistemas distribuidos. Universidad Politécnica de Madrid (UPM)
Databases Laboratorio de sistemas distribuidos Universidad Politécnica de Madrid (UPM) http://lsd.ls.fi.upm.es/lsd/lsd.htm Nuevas tendencias en sistemas distribuidos 2 Summary Transactions. Isolation.
More informationJanus. Consolidating Concurrency Control and Consensus for Commits under Conflicts. Shuai Mu, Lamont Nelson, Wyatt Lloyd, Jinyang Li
Janus Consolidating Concurrency Control and Consensus for Commits under Conflicts Shuai Mu, Lamont Nelson, Wyatt Lloyd, Jinyang Li New York University, University of Southern California State of the Art
More informationMutual consistency, what for? Replication. Data replication, consistency models & protocols. Difficulties. Execution Model.
Replication Data replication, consistency models & protocols C. L. Roncancio - S. Drapeau Grenoble INP Ensimag / LIG - Obeo 1 Data and process Focus on data: several physical of one logical object What
More informationTowards a Proof Framework for Information Systems with Weak Consistency
Towards a Proof Framework for Information Systems with Weak Consistency Peter Zeller and Arnd Poetzsch-Heffter University of Kaiserslautern, Germany {p zeller,poetzsch@cs.uni-kl.de Abstract. Weakly consistent
More informationLasp: A Language for Distributed, Coordination-Free Programming
Lasp: A Language for Distributed, Coordination-Free Programming Christopher Meiklejohn Basho Technologies, Inc. cmeiklejohn@basho.com Peter Van Roy Université catholique de Louvain peter.vanroy@uclouvain.be
More informationCMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS
Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22
More informationTAPIR. By Irene Zhang, Naveen Sharma, Adriana Szekeres, Arvind Krishnamurthy, and Dan Ports Presented by Todd Charlton
TAPIR By Irene Zhang, Naveen Sharma, Adriana Szekeres, Arvind Krishnamurthy, and Dan Ports Presented by Todd Charlton Outline Problem Space Inconsistent Replication TAPIR Evaluation Conclusion Problem
More informationDatabase Availability and Integrity in NoSQL. Fahri Firdausillah [M ]
Database Availability and Integrity in NoSQL Fahri Firdausillah [M031010012] What is NoSQL Stands for Not Only SQL Mostly addressing some of the points: nonrelational, distributed, horizontal scalable,
More informationThe CISE Tool: Proving Weakly-Consistent Applications Correct
The CISE Tool: Proving Weakly-Consistent Applications Correct Mahsa Najafzadeh Sorbonne-Universités-UPMC & Inria, Paris, France Alexey Gotsman IMDEA Software Institute, Spain Hongseok Yang University of
More informationFrom strong to eventual consistency: getting it right. Nuno Preguiça, U. Nova de Lisboa Marc Shapiro, Inria & UPMC-LIP6
From strong to eventual consistency: getting it right Nuno Preguiça, U. Nova de Lisboa Marc Shapiro, Inria & UPMC-LIP6 Conflict-free Replicated Data Types ı Marc Shapiro 1,5, Nuno Preguiça 2,1, Carlos
More informationEECS 498 Introduction to Distributed Systems
EECS 498 Introduction to Distributed Systems Fall 2017 Harsha V. Madhyastha Replicated State Machines Logical clocks Primary/ Backup Paxos? 0 1 (N-1)/2 No. of tolerable failures October 11, 2017 EECS 498
More informationDistributed transactional reads: the strong, the quick, the fresh & the impossible
Distributed transactional reads: the strong, the quick, the fresh & the impossible Alejandro Z. Tomsic Sorbonne Université, Inria, LIP6, Paris Manuel Bravo IMDEA Software Institute, Madrid Marc Shapiro
More informationDistributed Systems. Day 13: Distributed Transaction. To Be or Not to Be Distributed.. Transactions
Distributed Systems Day 13: Distributed Transaction To Be or Not to Be Distributed.. Transactions Summary Background on Transactions ACID Semantics Distribute Transactions Terminology: Transaction manager,,
More informationCISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL
CISC 7610 Lecture 5 Distributed multimedia databases Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL Motivation YouTube receives 400 hours of video per minute That is 200M hours
More information10. Replication. CSEP 545 Transaction Processing Philip A. Bernstein. Copyright 2003 Philip A. Bernstein. Outline
10. Replication CSEP 545 Transaction Processing Philip A. Bernstein Copyright 2003 Philip A. Bernstein 1 Outline 1. Introduction 2. Primary-Copy Replication 3. Multi-Master Replication 4. Other Approaches
More informationDistributed Systems. Characteristics of Distributed Systems. Lecture Notes 1 Basic Concepts. Operating Systems. Anand Tripathi
1 Lecture Notes 1 Basic Concepts Anand Tripathi CSci 8980 Operating Systems Anand Tripathi CSci 8980 1 Distributed Systems A set of computers (hosts or nodes) connected through a communication network.
More informationDistributed Systems. Characteristics of Distributed Systems. Characteristics of Distributed Systems. Goals in Distributed System Designs
1 Anand Tripathi CSci 8980 Operating Systems Lecture Notes 1 Basic Concepts Distributed Systems A set of computers (hosts or nodes) connected through a communication network. Nodes may have different speeds
More informationREPLICATED STATE MACHINES
5/6/208 REPLICATED STATE MACHINES George Porter May 6, 208 ATTRIBUTION These slides are released under an Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) Creative Commons license These
More informationMahsa NAJAFZADEH. The Analysis and Co-design of Weakly-Consistent Applications
THÈSE DE DOCTORAT DE l UNIVERSITÉ PIERRE ET MARIE CURIE Spécialité Informatique École doctorale Informatique, Télécommunications et Électronique (Paris) Présentée par Mahsa NAJAFZADEH Pour obtenir le grade
More informationMigrating Oracle Databases To Cassandra
BY UMAIR MANSOOB Why Cassandra Lower Cost of ownership makes it #1 choice for Big Data OLTP Applications. Unlike Oracle, Cassandra can store structured, semi-structured, and unstructured data. Cassandra
More informationCS6450: Distributed Systems Lecture 11. Ryan Stutsman
Strong Consistency CS6450: Distributed Systems Lecture 11 Ryan Stutsman Material taken/derived from Princeton COS-418 materials created by Michael Freedman and Kyle Jamieson at Princeton University. Licensed
More informationDistributed Systems. 19. Spanner. Paul Krzyzanowski. Rutgers University. Fall 2017
Distributed Systems 19. Spanner Paul Krzyzanowski Rutgers University Fall 2017 November 20, 2017 2014-2017 Paul Krzyzanowski 1 Spanner (Google s successor to Bigtable sort of) 2 Spanner Take Bigtable and
More informationCSE 124: REPLICATED STATE MACHINES. George Porter November 8 and 10, 2017
CSE 24: REPLICATED STATE MACHINES George Porter November 8 and 0, 207 ATTRIBUTION These slides are released under an Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) Creative Commons
More informationSpanner : Google's Globally-Distributed Database. James Sedgwick and Kayhan Dursun
Spanner : Google's Globally-Distributed Database James Sedgwick and Kayhan Dursun Spanner - A multi-version, globally-distributed, synchronously-replicated database - First system to - Distribute data
More informationExtreme Computing. NoSQL.
Extreme Computing NoSQL PREVIOUSLY: BATCH Query most/all data Results Eventually NOW: ON DEMAND Single Data Points Latency Matters One problem, three ideas We want to keep track of mutable state in a scalable
More informationSDPaxos: Building Efficient Semi-Decentralized Geo-replicated State Machines
SDPaxos: Building Efficient Semi-Decentralized Geo-replicated State Machines Hanyu Zhao *, Quanlu Zhang, Zhi Yang *, Ming Wu, Yafei Dai * * Peking University Microsoft Research Replication for Fault Tolerance
More informationIntroduction to Distributed Data Systems
Introduction to Distributed Data Systems Serge Abiteboul Ioana Manolescu Philippe Rigaux Marie-Christine Rousset Pierre Senellart Web Data Management and Distribution http://webdam.inria.fr/textbook January
More informationEventually Consistent HTTP with Statebox and Riak
Eventually Consistent HTTP with Statebox and Riak Author: Bob Ippolito (@etrepum) Date: November 2011 Venue: QCon San Francisco 2011 1/62 Introduction This talk isn't really about web. It's about how we
More informationSwiftCloud: Fault-Tolerant Geo-Replication Integrated all the Way to the Client Machine
SwiftCloud: Fault-Tolerant Geo-Replication Integrated all the Way to the Client Machine Marek Zawirski, Annette Bieniusa, Valter Balegas, Sérgio Duarte, Carlos Baquero, Marc Shapiro, Nuno Preguiça To cite
More informationJargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems
Jargons, Concepts, Scope and Systems Key Value Stores, Document Stores, Extensible Record Stores Overview of different scalable relational systems Examples of different Data stores Predictions, Comparisons
More informationConflict-free Replicated Data Types in Practice
Conflict-free Replicated Data Types in Practice Georges Younes Vitor Enes Wednesday 11 th January, 2017 HASLab/INESC TEC & University of Minho InfoBlender Motivation Background Background: CAP Theorem
More informationMINIMIZING TRANSACTION LATENCY IN GEO-REPLICATED DATA STORES
MINIMIZING TRANSACTION LATENCY IN GEO-REPLICATED DATA STORES Divy Agrawal Department of Computer Science University of California at Santa Barbara Joint work with: Amr El Abbadi, Hatem Mahmoud, Faisal
More informationNewSQL Databases. The reference Big Data stack
Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica NewSQL Databases Corso di Sistemi e Architetture per Big Data A.A. 2017/18 Valeria Cardellini The reference
More informationCockroachDB on DC/OS. Ben Darnell, CTO, Cockroach Labs
CockroachDB on DC/OS Ben Darnell, CTO, Cockroach Labs Agenda A cloud-native database CockroachDB on DC/OS Why CockroachDB Demo! Cloud-Native Database What is Cloud-Native? Horizontally scalable Individual
More informationReplication and Consistency
Replication and Consistency Today l Replication l Consistency models l Consistency protocols The value of replication For reliability and availability Avoid problems with disconnection, data corruption,
More informationSelective Hearing: An Approach to Distributed, Eventually Consistent Edge Computation
Selective Hearing: An Approach to Distributed, Eventually Consistent Edge Computation Christopher Meiklejohn Basho Technologies, Inc. Bellevue, WA Email: cmeiklejohn@basho.com Peter Van Roy Université
More informationWhat Came First? The Ordering of Events in
What Came First? The Ordering of Events in Systems @kavya719 kavya the design of concurrent systems Slack architecture on AWS systems with multiple independent actors. threads in a multithreaded program.
More informationUsing MVCC for Clustered Databases
Using MVCC for Clustered Databases structure introduction, scope and terms life-cycle of a transaction in Postgres-R write scalability tests results and their analysis 2 focus: cluster high availability,
More informationModern Database Concepts
Modern Database Concepts Basic Principles Doc. RNDr. Irena Holubova, Ph.D. holubova@ksi.mff.cuni.cz NoSQL Overview Main objective: to implement a distributed state Different objects stored on different
More informationarxiv: v1 [cs.dc] 16 May 2018
Conflict-free Replicated Data Types arxiv:1805.06358v1 [cs.dc] 16 May 2018 Definition Nuno Preguiça DI, FCT, Universidade NOVA de Lisboa and NOVA LINCS, Portugal Carlos Baquero HASLab / INESC TEC & Universidade
More informationCSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL
CSE 544 Principles of Database Management Systems Magdalena Balazinska Winter 2015 Lecture 14 NoSQL References Scalable SQL and NoSQL Data Stores, Rick Cattell, SIGMOD Record, December 2010 (Vol. 39, No.
More informationImportant Lessons. Today's Lecture. Two Views of Distributed Systems
Important Lessons Replication good for performance/ reliability Key challenge keeping replicas up-to-date Wide range of consistency models Will see more next lecture Range of correctness properties L-10
More informationSpanner: Google's Globally-Distributed Database. Presented by Maciej Swiech
Spanner: Google's Globally-Distributed Database Presented by Maciej Swiech What is Spanner? "...Google's scalable, multi-version, globallydistributed, and synchronously replicated database." What is Spanner?
More informationConsistency in Distributed Systems
Consistency in Distributed Systems Recall the fundamental DS properties DS may be large in scale and widely distributed 1. concurrent execution of components 2. independent failure modes 3. transmission
More informationSpanner: Google s Globally-Distributed Database. Wilson Hsieh representing a host of authors OSDI 2012
Spanner: Google s Globally-Distributed Database Wilson Hsieh representing a host of authors OSDI 2012 What is Spanner? Distributed multiversion database General-purpose transactions (ACID) SQL query language
More informationGlobalFS: A Strongly Consistent Multi-Site Filesystem
GlobalFS: A Strongly Consistent Multi-Site Filesystem Leandro Pacheco Raluca Halalai Valerio Schiavoni Fernando Pedone Etienne Rivière Pascal Felber RainbowFS Workshop May 3rd, 2017 Distributed applications
More informationCS6450: Distributed Systems Lecture 15. Ryan Stutsman
Strong Consistency CS6450: Distributed Systems Lecture 15 Ryan Stutsman Material taken/derived from Princeton COS-418 materials created by Michael Freedman and Kyle Jamieson at Princeton University. Licensed
More informationMDCC MULTI DATA CENTER CONSISTENCY. amplab. Tim Kraska, Gene Pang, Michael Franklin, Samuel Madden, Alan Fekete
MDCC MULTI DATA CENTER CONSISTENCY Tim Kraska, Gene Pang, Michael Franklin, Samuel Madden, Alan Fekete gpang@cs.berkeley.edu amplab MOTIVATION 2 3 June 2, 200: Rackspace power outage of approximately 0
More informationSQL, NoSQL, MongoDB. CSE-291 (Cloud Computing) Fall 2016 Gregory Kesden
SQL, NoSQL, MongoDB CSE-291 (Cloud Computing) Fall 2016 Gregory Kesden SQL Databases Really better called Relational Databases Key construct is the Relation, a.k.a. the table Rows represent records Columns
More informationDistributed Systems. Lec 12: Consistency Models Sequential, Causal, and Eventual Consistency. Slide acks: Jinyang Li
Distributed Systems Lec 12: Consistency Models Sequential, Causal, and Eventual Consistency Slide acks: Jinyang Li (http://www.news.cs.nyu.edu/~jinyang/fa10/notes/ds-eventual.ppt) 1 Consistency (Reminder)
More informationDistributed Transactions
Distributed Transactions CS6450: Distributed Systems Lecture 17 Ryan Stutsman Material taken/derived from Princeton COS-418 materials created by Michael Freedman and Kyle Jamieson at Princeton University.
More informationLarge-Scale Geo-Replicated Conflict-free Replicated Data Types
Large-Scale Geo-Replicated Conflict-free Replicated Data Types Carlos Bartolomeu carlos.bartolomeu@tecnico.ulisboa.pt Instituto Superior Técnico (Advisor: Professor Luís Rodrigues) Abstract. Conflict-free
More informationNoSQL systems: sharding, replication and consistency. Riccardo Torlone Università Roma Tre
NoSQL systems: sharding, replication and consistency Riccardo Torlone Università Roma Tre Data distribution NoSQL systems: data distributed over large clusters Aggregate is a natural unit to use for data
More informationBloom: Big Systems, Small Programs. Neil Conway UC Berkeley
Bloom: Big Systems, Small Programs Neil Conway UC Berkeley Distributed Computing Programming Languages Data prefetching Register allocation Loop unrolling Function inlining Optimization Global coordination,
More informationA Modular Design for Geo-Distributed Querying
A Modular Design for Geo-Distributed Querying Dimitrios Vasilas, Marc Shapiro, Bradley King To cite this version: Dimitrios Vasilas, Marc Shapiro, Bradley King. A Modular Design for Geo-Distributed Querying:
More informationData Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich
Data Modeling and Databases Ch 14: Data Replication Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Database Replication What is database replication The advantages of
More information10. Replication. Motivation
10. Replication Page 1 10. Replication Motivation Reliable and high-performance computation on a single instance of a data object is prone to failure. Replicate data to overcome single points of failure
More informationCo-Design and Verification of an Available File System
Co-Design and Verification of an Available File System Mahsa Najafzadeh 1, Marc Shapiro 2, and Patrick Eugster 1,3 1 Purdue University, West Lafayette, USA 2 INRIA-LIP6, Paris, France 3 Darmstadt University,
More informationDistributed Systems. replication Johan Montelius ID2201. Distributed Systems ID2201
Distributed Systems ID2201 replication Johan Montelius 1 The problem The problem we have: servers might be unavailable The solution: keep duplicates at different servers 2 Building a fault-tolerant service
More informationImproving the Correct Eventual Consistency Tool
arxiv:1807.06431v1 [cs.dc] 17 Jul 2018 Improving the Correct Eventual Consistency Tool Sreeja Nair, Marc Shapiro RESEARCH REPORT N 9191 July 2018 Project-Teams DELYS ISSN 0249-6399 ISRN INRIA/RR--9191--FR+ENG
More informationLarge-Scale Data Engineering
Large-Scale Data Engineering nosql: BASE vs ACID THE NEED FOR SOMETHING DIFFERENT One problem, three ideas We want to keep track of mutable state in a scalable manner Assumptions: State organized in terms
More informationATOMIC COMMITMENT Or: How to Implement Distributed Transactions in Sharded Databases
ATOMIC COMMITMENT Or: How to Implement Distributed Transactions in Sharded Databases We talked about transactions and how to implement them in a single-node database. We ll now start looking into how to
More informationCAP Theorem. March 26, Thanks to Arvind K., Dong W., and Mihir N. for slides.
C A CAP Theorem P March 26, 2018 Thanks to Arvind K., Dong W., and Mihir N. for slides. CAP Theorem It is impossible for a web service to provide these three guarantees at the same time (pick 2 of 3):
More informationCSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 14 Distributed Transactions
CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 14 Distributed Transactions Transactions Main issues: Concurrency control Recovery from failures 2 Distributed Transactions
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL
More informationMulti-version concurrency control
MVCC and Distributed Txns (Spanner) 2P & CC = strict serialization Provides semantics as if only one transaction was running on DB at time, in serial order + Real-time guarantees CS 518: Advanced Computer
More informationData Storage Revolution
Data Storage Revolution Relational Databases Object Storage (put/get) Dynamo PNUTS CouchDB MemcacheDB Cassandra Speed Scalability Availability Throughput No Complexity Eventual Consistency Write Request
More informationSCALABLE CONSISTENCY AND TRANSACTION MODELS
Data Management in the Cloud SCALABLE CONSISTENCY AND TRANSACTION MODELS 69 Brewer s Conjecture Three properties that are desirable and expected from realworld shared-data systems C: data consistency A:
More informationWrite Fast, Read in the Past: Causal Consistency for Client-side Applications
Write Fast, Read in the Past: Causal Consistency for Client-side Applications Marek Zawirski, Nuno Preguiça, Sérgio Duarte, Annette Bieniusa, Valter Balegas, Marc Shapiro To cite this version: Marek Zawirski,
More informationThere is a tempta7on to say it is really used, it must be good
Notes from reviews Dynamo Evalua7on doesn t cover all design goals (e.g. incremental scalability, heterogeneity) Is it research? Complexity? How general? Dynamo Mo7va7on Normal database not the right fit
More informationDon t Give Up on Serializability Just Yet. Neha Narula
Don t Give Up on Serializability Just Yet Neha Narula Don t Give Up on Serializability Just Yet A journey into serializable systems Neha Narula MIT CSAIL GOTO Chicago May 2015 2 @neha PhD candidate at
More informationAbstract Specifications for Weakly Consistent Data
Abstract Specifications for Weakly Consistent Data Sebastian Burckhardt Microsoft Research Redmond, WA, USA Jonathan Protzenko Microsoft Research Redmond, WA, USA Abstract Weak consistency can improve
More information