Reliable Crash Detection and Failover with Orchestrator

Size: px
Start display at page:

Download "Reliable Crash Detection and Failover with Orchestrator"

Transcription

1 1 Reliable Crash Detection and Failover with Orchestrator Shlomi Noach, PerconaLive 2016 " How people build software

2 Agenda Orchestrator Topologies, crash scenarios Crash detection methods Promotion complexity Limbo states, split brain Flapping & acknowledgement Visibility & control Configuration vs. State based analysis & recovery State of the orchestra 2

3 Orchestrator MySQL replication topology manager github.com/outbrain/orchestrator Free & open source 3

4 4

5 Simple replication What could possibly go wrong? 5

6 Crash detection 6

7 Observe/monitor " How do you observe your database availability? 7

8 Monitor master only " Common: ping, check :3306, issue SELECT 1 8

9 Monitor master only And if response is bad? " - is this a false positive? - try again - and again? - How many times until you re sure? How much time have you lost? 9

10 Orchestrator s observation $ $ $ Continuously probes your MySQL servers - Figuring out who replicates from who - Building the topology tree - Understands replication rules - At time of crash, knows what set setup should have been $ 10

11 Observe entire topology Holistic approach, used by Orchestrator " MySQL monitoring calls for MySQL specific solution - Monitor master and replicas (issue queries) - Check replicas status - Make an analysis based on result from all servers involved. 11

12 Multi layered/multi DC replication " How do you check an intermediate master (IM) availability? 12

13 Multi layered/multi DC replication Holistic approach, used by Orchestrator " Monitoring the IM and its replicas give the bigger picture - you may actually not care about the IM s availability as long as its replicas are happy 13

14 Dead intermediate master Orchestrator s analysis IM unreachable, its replicas are reachable, and are all in agreement their master is unreachable. 14

15 Dead master Orchestrator s analysis Master unreachable, its replicas are, and are all in agreement their master is unreachable. 15

16 Dead master & some replicas Orchestrator s analysis Master unreachable, some of its replicas are, and are all in agreement their master is unreachable. Other replicas are unreachable. 16

17 Locked master Orchestrator s analysis (pending) Master is reachable, but does not execute writes. - all replicas are in agreement that master is reachable - no replica is making progress can be handled as a failed master case 17

18 Recovery & promotion constraints You ve made the decision to promote a new master Which one? Are all options valid? Is the current state what you think the current state is? 18

19 Promotion constraints most up to date less up to date You wish to promote the most up to date replica, otherwise you give up on any replica that is more advanced delayed 24 hours 19

20 Promotion constraints log_slave_updates log_slave_updates You must not promote a replica that has no binary logs, or without log_slave_updates No binary logs 20

21 Promotion constraints DC1 DC1 DC1 You prefer to promote a replica from same DC as failed master DC2 21

22 Promotion constraints SBR SBR SBR You must not promote Row Based Replication server on top of Statement Based Replication RBR 22

23 Promotion constraints Promoting 5.7 means losing 5.6 (replication not forward compatible) So Perhaps worth losing the 5.7 server?

24 Promotion constraints But if most of your servers are 5.7, and 5.7 turns to be most up to date, better promote 5.7 and drop the 5.6 Orchestrator handles this logic and prioritizes promotion candidates by overall count and state of replicas

25 Promotion constraints, real life most up to date, DC2 DC1 less up to date, DC1 Orchestrator can promote one, non-ideal replica, have the rest of the replicas converge, and then refactor again, promoting an ideal server no binary logs, DC1 25

26 Ways to avoid promotion constraints mess Make sure first replication tier is consistent, Have variety on 2nd tier

27 Ways to avoid promotion constraints mess 5.6, semi-sync Use semi-sync on designated servers. They will be most up-to-date upon failure

28 Ways to avoid promotion constraints mess %%%% %%%% %%%% 5.6 %%%% %%%% %%%% Solve the problem by aligning relay logs on the replicas upon master failure. %%%% %%%% %%%% 5.7 That s what MHA does Work In Progress: Orchestrator to support this Will require passwordless SSH 28

29 Limbos Master failed; one replica lost along. Recovery went well. What happens when master is back alive? 29

30 Limbos OHAI VIP is mine Also, good for traffic What will promoted master say? What will lost replica say? What will lost master say? Give me traffic 30

31 Solving limbos Master_host: //old.master.com Read only Orchestrator forcibly breaks replication on lost replica RESET SLAVE ALL or forced detach master on promoted replica read_only=1 on old master, if possible iptables on old master Can t find coordinates 31

32 DC split brain You re dead DC2 I can t hear you You re dead " They re " DC1 They re dead dead 32

33 33

34 Flapping & rolling failovers The master is diagnosed as being dead A new master is promoted Turns out some app client is killing it Rolling failover What does happen to a dead master that comes back alive? 34

35 Flapping & rolling failovers Orchestrator sets a minimal interval between two automated failovers First one is automated; an immediate one following gets blocked A human acknowledging the first failover implicitly resets. Good to go for next automated failover. And a human can always command a failover. 35

36 Flapping & rolling failovers Orchestrator marks a failed master as downtimed Even if said server is back in the game (human intervention), this particular server will not be failed over in the duration of the downtime. A human can terminate the downtime 36

37 Recap: how orchestrator performs master failover Detection: everyone agrees the master is dead Is this incident muted? Has this cluster just recently recovered from another failure without ack? 37

38 Recap: how orchestrator performs master failover Pick most up to date replica which will also make for least lost servers (the two are not necessarily the same) most up to date 38

39 Recap: how orchestrator performs master failover Refactor topology Oh wait, actually, now that everything s connected, is there a better server to promote? Go for it, refactor again Mark old master as downtimed Detach promoted master from old master 39

40 Recap: how orchestrator performs master failover Invoke external hooks Orchestrator does not use nor imply a specific service discovery technique Your own app/scripts to change VIP/ CNAME/Zk entries/proxy/whatever 40

41 Visibility & control Flapping and rolling failovers are avoided by having memory of past/recent events Orchestrator audits: Detection Recoveries Refactoring operations (alas without context) Owners, reasons, internal operations To audit table; to orchestrator log; to syslog Audit log available via API 41

42 Visibility & control Control via: Web interface Web API Command line interface Hubot.orc sup > No incidents which require a failover to report..orc recover failed.server.com.orc ack failed-cluster.orc relocate this.replica below that.one.orc graceful-takeover my-cluster 42

43 Configuration vs. State based recoveries In configuration based recoveries: You designate specific roles to specific servers i.e. this server will have to be promoted or these are the relevant servers, these are not $ You must then match your operations to those dictated rules. Any change you make (provision, deprovision, relocate, ) must be reflected in configuration Implies chef/puppet deploy; reload of services 43

44 Configuration vs. State based recoveries In state based recoveries: You trust the tooling to make the best of a situation % Basically do whatever a human would do You still want to have roles for your servers chef/puppet may still be involved But those can be added/removed dynamically, and the tooling adapts to change of state 44

45 Orchestrator s detection reliability There is no n-nines number Orchestrator has proven to be very accurate, in production environments Depending on both orchestrator & MySQL configuration, detection may take ~5-10 seconds 45

46 Orchestrator HA & HTTP PROXY LAYER Orchestrator services " " " " Leader MYSQL PROXY LAYER Orchestrator is highly available Supports multiple services competing for leadership Requires highly available backend database. Supports master-master setup, and guarantees it to be collision free Backend DB 46

47 Recent developments Binary log indexing: makes for Pseudo-GTID matching within 1s-2s. Reduced recovery time Planned master takeover, forced master takeover Smarter promotion rules Fuzzy names (it s the simple stuff that makes life happier) SSL (Square contributions) Better master-master support Replication structure analysis MIT license 47

48 What s on the roadmap? Ongoing, intended Relay log alignment Semi-sync (currently via contributions) Likely Failure detection consensus / leadership handover Maybe orchestrator-agent xtrabackup Always Reliability, performance, simplification 48

49 What s on the roadmap? GitHub commitment to Orchestrator We use it, we will make it better Currently merging changes upstream GitHub will become upstream Better documentation, tutorials, sample public AMI World domination Open and grateful for Contributions Please discuss via Issues beforehand 49

50 Orchestrator/related talks Choosing a MySQL HA solution today Michael Patrick (Percona) Tuesday 19, 5:15pm Orchestrator at Square John Cesario, Grier Johnson, Brian Ip (Square) Thursday 21, 3:00pm 50

51 GitHub talks Tutorial: MySQL GTID Implementation, Maintenance, and Best Practices Gillian Gunson (GitHub), Brian Cain (Dropbox), Mark Filipi (SurveyMonkey), Monday 18, 9:30am Growing MySQL at GitHub Tom Krouper, Jonah Berquist Wednesday 20, 1:00pm Rookie DBA Mistakes: How I Screwed Up So You Don't Have To Gillian Gunson Thursday 21, 12:50pm Co-speaking: Dirty Little Secrets Jonah Berquist, Shlomi Noach Thursday 21, 3:00pm 51

52 Thank you Questions? github.com/shlomi-noach " How people build software

Practical Orchestrator

Practical Orchestrator Practical Orchestrator Shlomi Noach GitHub Percona Live 2017 How people build software 1 Agenda Setting up orchestrator Backend Discovery Refactoring Detection & recovery Scripting HA Roadmap How people

More information

Orchestrator High Availability tutorial

Orchestrator High Availability tutorial Orchestrator High Availability tutorial Shlomi Noach GitHub PerconaLive 2018 About me @github/database-infrastructure Author of orchestrator, gh-ost, freno, ccql and others. Blog at http://openark.org

More information

Autopsy of an automation disaster. Simon J Mudd (Senior Database Engineer) Percona Live, 25 th April 2017

Autopsy of an automation disaster. Simon J Mudd (Senior Database Engineer) Percona Live, 25 th April 2017 Autopsy of an automation disaster Simon J Mudd (Senior Database Engineer) Percona Live, 25 th April 2017 To err is human To really foul things up requires a computer [1] (or a script) [1]: http://quoteinvestigator.com/2010/12/07/foul-computer/

More information

Choosing a MySQL HA Solution Today. Choosing the best solution among a myriad of options

Choosing a MySQL HA Solution Today. Choosing the best solution among a myriad of options Choosing a MySQL HA Solution Today Choosing the best solution among a myriad of options Questions...Questions...Questions??? How to zero in on the right solution You can t hit a target if you don t have

More information

How to setup Orchestrator to manage thousands of MySQL servers. Simon J Mudd 3 rd October 2017

How to setup Orchestrator to manage thousands of MySQL servers. Simon J Mudd 3 rd October 2017 How to setup Orchestrator to manage thousands of MySQL servers Simon J Mudd 3 rd October 2017 Session Summary What is orchestrator and why use it? What happens as you monitor more servers? Features added

More information

MySQL High Availability

MySQL High Availability MySQL High Availability And other stuff worth talking about Peter Zaitsev CEO Moscow MySQL Users Group Meetup July 11 th, 2017 1 Few Words about Percona 2 Percona s Purpose To Champion Unbiased Open Source

More information

GitHub's online schema migrations for MySQL

GitHub's online schema migrations for MySQL GitHub's online schema migrations for MySQL Tom Krouper, Shlomi Noach GitHub Illustrated with ghosts How people build software 1 GitHub The world s largest Octocat T-shirt and stickers store And water

More information

Using MHA in and out of the Cloud. Garrick Peterson Percona University, Toronto 2013

Using MHA in and out of the Cloud. Garrick Peterson Percona University, Toronto 2013 Using MHA in and out of the Cloud Garrick Peterson Percona University, Toronto 2013 Agenda Who am I MHA Overview HA In the Cloud IP Management options Simple use case What do we use Recommendations GTID

More information

Which technology to choose in AWS?

Which technology to choose in AWS? Which technology to choose in AWS? RDS / Aurora / Roll-your-own April 17, 2018 Daniel Kowalewski Senior Technical Operations Engineer Percona 1 2017 Percona AWS MySQL options RDS for MySQL Aurora MySQL

More information

MySQL High Availability. Michael Messina Senior Managing Consultant, Rolta-AdvizeX /

MySQL High Availability. Michael Messina Senior Managing Consultant, Rolta-AdvizeX / MySQL High Availability Michael Messina Senior Managing Consultant, Rolta-AdvizeX mmessina@advizex.com / mike.messina@rolta.com Introduction Michael Messina Senior Managing Consultant Rolta-AdvizeX, Working

More information

Choosing a MySQL HA Solution Today

Choosing a MySQL HA Solution Today Choosing a MySQL HA Solution Today Choosing the best solution among a myriad of options. Michael Patrick Technical Account Manager at Percona The Evolution of HA in MySQL Blasts from the past Solutions

More information

MySQL HA Solutions. Keeping it simple, kinda! By: Chris Schneider MySQL Architect Ning.com

MySQL HA Solutions. Keeping it simple, kinda! By: Chris Schneider MySQL Architect Ning.com MySQL HA Solutions Keeping it simple, kinda! By: Chris Schneider MySQL Architect Ning.com What we ll cover today High Availability Terms and Concepts Levels of High Availability What technologies are there

More information

MySQL HA Solutions Selecting the best approach to protect access to your data

MySQL HA Solutions Selecting the best approach to protect access to your data MySQL HA Solutions Selecting the best approach to protect access to your data Sastry Vedantam sastry.vedantam@oracle.com February 2015 Copyright 2015, Oracle and/or its affiliates. All rights reserved

More information

GitHub's online schema migrations for MySQL

GitHub's online schema migrations for MySQL GitHub's online schema migrations for MySQL Jonah Berquist @github/database-infrastructure engineering manager Illustrated with ghosts (and product placement) How people build software 1 GitHub The world

More information

MySQL High Availability Solutions. Alex Poritskiy Percona

MySQL High Availability Solutions. Alex Poritskiy Percona MySQL High Availability Solutions Alex Poritskiy Percona The Five 9s of Availability Clustering & Geographical Redundancy Clustering Technologies Replication Technologies Well-Managed disasters power failures

More information

HA solution with PXC-5.7 with ProxySQL. Ramesh Sivaraman Krunal Bauskar

HA solution with PXC-5.7 with ProxySQL. Ramesh Sivaraman Krunal Bauskar HA solution with PXC-5.7 with ProxySQL Ramesh Sivaraman Krunal Bauskar Agenda What is Good HA eco-system? Understanding PXC-5.7 Understanding ProxySQL PXC + ProxySQL = Complete HA solution Monitoring using

More information

MySQL Multi-Site/Multi-Master Done Right

MySQL Multi-Site/Multi-Master Done Right MySQL Multi-Site/Multi-Master Done Right MySQL Clustering for HA and DR The Dream: Multiple, active DBMS servers with identical data over distance Too good to be true? High Performance High Availability

More information

MySQL Replication. Rick Golba and Stephane Combaudon April 15, 2015

MySQL Replication. Rick Golba and Stephane Combaudon April 15, 2015 MySQL Replication Rick Golba and Stephane Combaudon April 15, 2015 Agenda What is, and what is not, MySQL Replication Replication Use Cases Types of replication Replication lag Replication errors Replication

More information

MySQL Replication Options. Peter Zaitsev, CEO, Percona Moscow MySQL User Meetup Moscow,Russia

MySQL Replication Options. Peter Zaitsev, CEO, Percona Moscow MySQL User Meetup Moscow,Russia MySQL Replication Options Peter Zaitsev, CEO, Percona Moscow MySQL User Meetup Moscow,Russia Few Words About Percona 2 Your Partner in MySQL and MongoDB Success 100% Open Source Software We work with MySQL,

More information

Split your database. Nicolai Plum Booking.com Database Engineering

Split your database. Nicolai Plum Booking.com Database Engineering Split your database Nicolai Plum Booking.com Database Engineering Before 2 After 3 Why? Size Query time, query latency conflicting workloads Business or architecture reasons Regulatory compliance Easier

More information

Highly Available Database Architectures in AWS. Santa Clara, California April 23th 25th, 2018 Mike Benshoof, Technical Account Manager, Percona

Highly Available Database Architectures in AWS. Santa Clara, California April 23th 25th, 2018 Mike Benshoof, Technical Account Manager, Percona Highly Available Database Architectures in AWS Santa Clara, California April 23th 25th, 2018 Mike Benshoof, Technical Account Manager, Percona Hello, Percona Live Attendees! What this talk is meant to

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 ADVANCED MYSQL REPLICATION ARCHITECTURES Luís

More information

MySQL Replication : advanced features in all flavours. Giuseppe Maxia Quality Assurance Architect at

MySQL Replication : advanced features in all flavours. Giuseppe Maxia Quality Assurance Architect at MySQL Replication : advanced features in all flavours Giuseppe Maxia Quality Assurance Architect at VMware @datacharmer 1 About me Who s this guy? Giuseppe Maxia, a.k.a. "The Data Charmer" QA Architect

More information

Introduction to MySQL InnoDB Cluster

Introduction to MySQL InnoDB Cluster 1 / 148 2 / 148 3 / 148 Introduction to MySQL InnoDB Cluster MySQL High Availability made easy Percona Live Europe - Dublin 2017 Frédéric Descamps - MySQL Community Manager - Oracle 4 / 148 Safe Harbor

More information

MY CONVERSATION HAS RUN DRY

MY CONVERSATION HAS RUN DRY PARTITION TOLERANCE MY CONVERSATION HAS RUN DRY Many systems degrade, or otherwise change state, under partition BRING THE PIECES BACK TOGETHER REDISCOVER COMMUNICATION A EXAMPLE ANPLICATION 5 clients

More information

MySQL Replication Advanced Features In 20 minutes

MySQL Replication Advanced Features In 20 minutes MySQL Replication Advanced Features In 20 minutes Peter Zaitsev, CEO FOSDEM, Brussels, Belgium February 2nd, 2019 1 Question #1 Who in this room is using some kind of MySQL Replication? 2 Question #2 Which

More information

Percona XtraDB Cluster ProxySQL. For your high availability and clustering needs

Percona XtraDB Cluster ProxySQL. For your high availability and clustering needs Percona XtraDB Cluster-5.7 + ProxySQL For your high availability and clustering needs Ramesh Sivaraman Krunal Bauskar Agenda What is Good HA eco-system? Understanding PXC-5.7 Understanding ProxySQL PXC

More information

MySQL at Scale at Square

MySQL at Scale at Square MySQL at Scale at Square Bill Karwin, Square Inc. October, 2018 1 Square An honest financial network for everyone Global: USA, Canada, UK, Japan, Australia Payment transaction data stored in MySQL We are

More information

Binlog Servers at Booking.com. Jean-François Gagné jeanfrancois DOT gagne AT booking.com

Binlog Servers at Booking.com. Jean-François Gagné jeanfrancois DOT gagne AT booking.com Binlog Servers at Booking.com Jean-François Gagné jeanfrancois DOT gagne AT booking.com Presented at Percona Live Amsterdam 2015 Booking.com 1 Booking.com Based in Amsterdam since 1996 Online Hotel and

More information

Lessons from database failures

Lessons from database failures Lessons from database failures Colin Charles, Chief Evangelist, Percona Inc. colin.charles@percona.com / byte@bytebot.net http://www.bytebot.net/blog/ @bytebot on Twitter Percona Webminar 18 January 2017

More information

Migrating to Aurora MySQL and Monitoring with PMM. Percona Technical Webinars August 1, 2018

Migrating to Aurora MySQL and Monitoring with PMM. Percona Technical Webinars August 1, 2018 Migrating to Aurora MySQL and Monitoring with PMM Percona Technical Webinars August 1, 2018 Introductions Introduction Vineet Khanna (Autodesk) Senior Database Engineer vineet.khanna@autodesk.com Tate

More information

Choosing a MySQL High Availability Solution. Marcos Albe, Percona Inc. Live Webinar June 2017

Choosing a MySQL High Availability Solution. Marcos Albe, Percona Inc. Live Webinar June 2017 Choosing a MySQL High Availability Solution Marcos Albe, Percona Inc. Live Webinar June 2017 Agenda What is availability Components to build an HA solution HA options in the MySQL ecosystem Failover/Routing

More information

Vitess on Kubernetes. followed by a demo of VReplication. Jiten Vaidya

Vitess on Kubernetes. followed by a demo of VReplication. Jiten Vaidya Vitess on Kubernetes followed by a demo of VReplication Jiten Vaidya jiten@planetscale.com A word about me... Jiten Vaidya - Managed teams that operationalized Vitess at Youtube CEO at PlanetScale Founded

More information

Percona XtraDB Cluster powered by Galera. Peter Zaitsev CEO, Percona Slide Credits: Vadim Tkachenko Percona University, Washington,DC Sep 12,2013

Percona XtraDB Cluster powered by Galera. Peter Zaitsev CEO, Percona Slide Credits: Vadim Tkachenko Percona University, Washington,DC Sep 12,2013 powered by Galera Peter Zaitsev CEO, Percona Slide Credits: Vadim Tkachenko Percona University, Washington,DC Sep 12,2013 This talk High Availability Replication Cluster What is HA Availability Avail ~

More information

Primary-Backup Replication

Primary-Backup Replication Primary-Backup Replication CS 240: Computing Systems and Concurrency Lecture 7 Marco Canini Credits: Michael Freedman and Kyle Jamieson developed much of the original material. Simplified Fault Tolerance

More information

Data Integrity in Stateful Services. Percona Live, Santa Clara, 2017

Data Integrity in Stateful Services. Percona Live, Santa Clara, 2017 Data Integrity in Stateful Services Percona Live, Santa Clara, 2017 Data Integrity Bringing Sexy Back Protect the Data. -Every DBA who doesn t want to be fired Breaking Integrity Down Physical Integrity

More information

More reliability and support for PostgreSQL 10: Introducing Pgpool-II 3.7

More reliability and support for PostgreSQL 10: Introducing Pgpool-II 3.7 More reliability and support for PostgreSQL 10: Introducing Pgpool-II 3.7 PGConf.ASIA 2017 SRA OSS, Inc Japan Tatsuo Ishii Who am I? Working on OSS activities and businesses OSS activities PostgreSQL committer

More information

Percona XtraDB Cluster MySQL Scaling and High Availability with PXC 5.7 Tibor Korocz

Percona XtraDB Cluster MySQL Scaling and High Availability with PXC 5.7 Tibor Korocz Percona XtraDB Cluster MySQL Scaling and High Availability with PXC 5.7 Tibor Korocz Architect Percona University Budapest 2017.05.11 1 2016 Percona Scaling and High Availability (application) 2 Scaling

More information

Parallel DBs. April 25, 2017

Parallel DBs. April 25, 2017 Parallel DBs April 25, 2017 1 Sending Hints Rk B Si Strategy 3: Bloom Filters Node 1 Node 2 2 Sending Hints Rk B Si Strategy 3: Bloom Filters Node 1 with

More information

MySQL Backup Best Practices and Case Study:.IE Continuous Restore Process

MySQL Backup Best Practices and Case Study:.IE Continuous Restore Process MySQL Backup Best Practices and Case Study:.IE Continuous Restore Process Marcelo Altmann Senior Support Engineer - Percona Mick Begley Technical Service Manager - IE Domain Registry Agenda Agenda Why

More information

Percona XtraDB Cluster 5.7 Enhancements Performance, Security, and More

Percona XtraDB Cluster 5.7 Enhancements Performance, Security, and More Percona XtraDB Cluster 5.7 Enhancements Performance, Security, and More Michael Coburn, Product Manager, PMM Percona Live Dublin 2017 1 Your Presenter Product Manager for PMM (Percona Monitoring and Management)

More information

Lessons learned while automating MySQL in the AWS cloud. Stephane Combaudon DB Engineer - Slice

Lessons learned while automating MySQL in the AWS cloud. Stephane Combaudon DB Engineer - Slice Lessons learned while automating MySQL in the AWS cloud Stephane Combaudon DB Engineer - Slice Our environment 5 DB stacks Data volume ranging from 30GB to 2TB+. Master + N slaves for each stack. Master

More information

High Availability for Postgres using OpenSource tools. By Jobin Augustine & HariKrishna

High Availability for Postgres using OpenSource tools. By Jobin Augustine & HariKrishna High Availability for Postgres using OpenSource tools By Jobin Augustine & HariKrishna Introduction Jobin Augustine Designation : Architect Email: jobin.augustine@openscg.com Contact No: + 91 9989932600

More information

Building and Running a Solr-as-a-Service SHAI ERERA IBM

Building and Running a Solr-as-a-Service SHAI ERERA IBM Building and Running a Solr-as-a-Service SHAI ERERA IBM Who Am I? Working at IBM Social Analytics & Technologies Lucene/Solr committer and PMC member http://shaierera.blogspot.com shaie@apache.org Background

More information

ProxySQL - GTID Consistent Reads. Adaptive query routing based on GTID tracking

ProxySQL - GTID Consistent Reads. Adaptive query routing based on GTID tracking ProxySQL - GTID Consistent Reads Adaptive query routing based on GTID tracking Introduction Rene Cannao Founder of ProxySQL MySQL DBA Introduction Nick Vyzas ProxySQL Committer MySQL DBA What is ProxySQL?

More information

Percona Live Europe 2016 Use ProxySQL to Improve Your MySQL High Availability Solution Marco Tusa Manager Consulting Amsterdam, Netherlands October 3

Percona Live Europe 2016 Use ProxySQL to Improve Your MySQL High Availability Solution Marco Tusa Manager Consulting Amsterdam, Netherlands October 3 Percona Live Europe 2016 Use ProxySQL to Improve Your MySQL High Availability Solution Marco Tusa Manager Consulting Amsterdam, Netherlands October 3 5, 2016 Percona Live 2017 Use ProxySQL to Improve Your

More information

MySQL usage of web applications from 1 user to 100 million. Peter Boros RAMP conference 2013

MySQL usage of web applications from 1 user to 100 million. Peter Boros RAMP conference 2013 MySQL usage of web applications from 1 user to 100 million Peter Boros RAMP conference 2013 Why MySQL? It's easy to start small, basic installation well under 15 minutes. Very popular, supported by a lot

More information

GridDB Technical Design Document Version 1.0

GridDB Technical Design Document Version 1.0 GridDB Technical Design Document Version 1.0 Copyright 2013 TOSHIBA CORPORATION 1 Table of Contents 1. Concept of GridDB cluster... 3 2. Determination of a master node... 5 3. Determination of partition

More information

Percona Live Europe 2016 Use ProxySQL to Improve Your MySQL High Availability Solution

Percona Live Europe 2016 Use ProxySQL to Improve Your MySQL High Availability Solution Percona Live Europe 2016 Use ProxySQL to Improve Your MySQL High Availability Solution Marco Tusa Manager Consulting Amsterdam, Netherlands October 3 5, 2016 About me Marco The Grinch Open source enthusiast

More information

Design Patterns for Large- Scale Data Management. Robert Hodges OSCON 2013

Design Patterns for Large- Scale Data Management. Robert Hodges OSCON 2013 Design Patterns for Large- Scale Data Management Robert Hodges OSCON 2013 The Start-Up Dilemma 1. You are releasing Online Storefront V 1.0 2. It could be a complete bust 3. But it could be *really* big

More information

Scale out Read Only Workload by sharing data files of InnoDB. Zhai weixiang Alibaba Cloud

Scale out Read Only Workload by sharing data files of InnoDB. Zhai weixiang Alibaba Cloud Scale out Read Only Workload by sharing data files of InnoDB Zhai weixiang Alibaba Cloud Who Am I - My Name is Zhai Weixiang - I joined in Alibaba in 2011 and has been working on MySQL since then - Mainly

More information

Understanding High Availability options for PostgreSQL

Understanding High Availability options for PostgreSQL Understanding High Availability options for PostgreSQL Madan Kumar K Member of Technical Staff, ScaleGrid.io @ImMadanK High Availability 101 Redundancy is the key Standalone vs. Master-Standby Master Server

More information

Running MySQL on AWS. Michael Coburn Wednesday, April 15th, 2015

Running MySQL on AWS. Michael Coburn Wednesday, April 15th, 2015 Running MySQL on AWS Michael Coburn Wednesday, April 15th, 2015 Who am I? 2 Senior Architect with Percona 3 years on Friday! Canadian but I now live in Costa Rica I see 3-10 different customer environments

More information

The New Replication Features in MySQL 8. Luís Soares Principal Software Engineer, MySQL Replication Lead

The New Replication Features in MySQL 8. Luís Soares Principal Software Engineer, MySQL Replication Lead The New Replication Features in MySQL 8 Luís Soares (luis.soares@oracle.com) Principal Software Engineer, MySQL Replication Lead Copyright 2017, Oracle and/or its affiliates. All rights reserved. Percona

More information

How to make MySQL work with Raft. Diancheng Wang & Guangchao Bai Staff Database Alibaba Cloud

How to make MySQL work with Raft. Diancheng Wang & Guangchao Bai Staff Database Alibaba Cloud How to make MySQL work with Raft Diancheng Wang & Guangchao Bai Staff Database Engineer @ Alibaba Cloud About me Name: Guangchao Bai Location: Beijing, China Occupation: Staff Database Engineer @ Alibaba

More information

High Noon at AWS. ~ Amazon MySQL RDS versus Tungsten Clustering running MySQL on AWS EC2

High Noon at AWS. ~ Amazon MySQL RDS versus Tungsten Clustering running MySQL on AWS EC2 High Noon at AWS ~ Amazon MySQL RDS versus Tungsten Clustering running MySQL on AWS EC2 Introduction Amazon Web Services (AWS) are gaining popularity, and for good reasons. The Amazon Relational Database

More information

MySQL Replication Update

MySQL Replication Update MySQL Replication Update Lars Thalmann Development Director MySQL Replication, Backup & Connectors OSCON, July 2011 MySQL Releases MySQL 5.1 Generally Available, November 2008 MySQL

More information

CSE 444: Database Internals. Section 9: 2-Phase Commit and Replication

CSE 444: Database Internals. Section 9: 2-Phase Commit and Replication CSE 444: Database Internals Section 9: 2-Phase Commit and Replication 1 Today 2-Phase Commit Replication 2 Two-Phase Commit Protocol (2PC) One coordinator and many subordinates Phase 1: Prepare Phase 2:

More information

High availability with MariaDB TX: The definitive guide

High availability with MariaDB TX: The definitive guide High availability with MariaDB TX: The definitive guide MARCH 2018 Table of Contents Introduction - Concepts - Terminology MariaDB TX High availability - Master/slave replication - Multi-master clustering

More information

Percona Live Europe Amsterdam, Netherlands October 3 5, 2016

Percona Live Europe Amsterdam, Netherlands October 3 5, 2016 Percona Live Europe 2016 Amsterdam, Netherlands October 3 5, 2016 Microservices, MySQL & Service Discovery Abbas Ahmed : Principal Database Engineer @ rentalcars.com Twitter: @solutionguy Who am I? Principal

More information

How Facebook Got Consistency with MySQL in the Cloud Sam Dunster

How Facebook Got Consistency with MySQL in the Cloud Sam Dunster How Facebook Got Consistency with MySQL in the Cloud Sam Dunster Production Engineer Consistency Replication Replication for High Availability Facebook Replicaset Region A Slave Slave Region B Region

More information

The Exciting MySQL 5.7 Replication Enhancements

The Exciting MySQL 5.7 Replication Enhancements The Exciting MySQL 5.7 Replication Enhancements Luís Soares (luis.soares@oracle.com) Principal Software Engineer, MySQL Replication Team Lead Copyright 2016, Oracle and/or its affiliates. All rights reserved.

More information

CS /15/16. Paul Krzyzanowski 1. Question 1. Distributed Systems 2016 Exam 2 Review. Question 3. Question 2. Question 5.

CS /15/16. Paul Krzyzanowski 1. Question 1. Distributed Systems 2016 Exam 2 Review. Question 3. Question 2. Question 5. Question 1 What makes a message unstable? How does an unstable message become stable? Distributed Systems 2016 Exam 2 Review Paul Krzyzanowski Rutgers University Fall 2016 In virtual sychrony, a message

More information

Migrating to XtraDB Cluster 2014 Edition

Migrating to XtraDB Cluster 2014 Edition Migrating to XtraDB Cluster 2014 Edition Jay Janssen Managing Consultant Overview of XtraDB Cluster Percona Server + Galera Cluster of Innodb nodes Readable and Writable Virtually Synchronous All data

More information

ProxySQL Tutorial. With a GPL license! High Performance & High Availability Proxy for MySQL. Santa Clara, California April 23th 25th, 2018

ProxySQL Tutorial. With a GPL license! High Performance & High Availability Proxy for MySQL. Santa Clara, California April 23th 25th, 2018 ProxySQL Tutorial High Performance & High Availability Proxy for MySQL With a GPL license! Santa Clara, California April 23th 25th, 2018 Who we are René Cannaò ProxySQL Founder Derek Downey Director of

More information

MySQL High-Availability

MySQL High-Availability MySQL High-Availability with the Percona replication manager (PRM) Yves Trudeau April, 2014 About myself : Yves Trudeau Principal architect at Percona since 2009 With MySQL then Sun, 2007 to 2009 Focus

More information

Monitoring MySQL Performance with Percona Monitoring and Management

Monitoring MySQL Performance with Percona Monitoring and Management Monitoring MySQL Performance with Percona Monitoring and Management Your Presenters Michael Coburn - PMM Product Manager Working at Percona for almost 5 years Consultant, Manager, TAM, now Product Manager

More information

Distributed Computation Models

Distributed Computation Models Distributed Computation Models SWE 622, Spring 2017 Distributed Software Engineering Some slides ack: Jeff Dean HW4 Recap https://b.socrative.com/ Class: SWE622 2 Review Replicating state machines Case

More information

Automating Schema. Changes using

Automating Schema. Changes using Automating Schema Changes using Percona Live Open Source Database Conference 2017-04-27 12:50 PM - 1:40 PM Ballroom D " How people build software Tom Krouper @CaptainEyesight @tomkrouper Database Infrastructure

More information

High Availability Solutions for the MySQL Database

High Availability Solutions for the MySQL Database www.skysql.com High Availability Solutions for the MySQL Database Introduction This paper introduces recommendations and some of the solutions used to create an availability or high availability environment

More information

What s New in MySQL and MongoDB Ecosystem Year 2017

What s New in MySQL and MongoDB Ecosystem Year 2017 What s New in MySQL and MongoDB Ecosystem Year 2017 Peter Zaitsev CEO Percona University, Ghent June 22 nd, 2017 1 In This Presentation Few Words about Percona Few Words about Percona University Program

More information

Monitoring MySQL with Prometheus & Grafana

Monitoring MySQL with Prometheus & Grafana Monitoring MySQL with Prometheus & Grafana Julien Pivotto (@roidelapluie) Percona University Belgium June 22nd, 2017 SELECT USER(); Julien "roidelapluie" Pivotto @roidelapluie Sysadmin at inuits Automation,

More information

If Only I Could Find My Databases-Service Discovery with SmartStack and MySQL. Susanne Lehmann, Yelp

If Only I Could Find My Databases-Service Discovery with SmartStack and MySQL. Susanne Lehmann, Yelp If Only I Could Find My Databases-Service Discovery with SmartStack and MySQL Susanne Lehmann, Yelp susanne@yelp.com Me I ve been a DBA for 16 years I ve been working with all kinds of relational and NoSQL

More information

MSc, Computer & Systems TalTech. Writes on 2ndQuadrant blog From Turkey Lives in

MSc, Computer & Systems TalTech. Writes on 2ndQuadrant blog From Turkey Lives in Gülçin Yıldırım Jelínek 1 Multi-Master PostgreSQL Architectures in Cloud select * from me; Board of Directors @ PostgreSQL Europe Cloud Services Manager @ 2ndQuadrant Main Organizer @ Prague PostgreSQL

More information

Postgres Cluster and Multimaster

Postgres Cluster and Multimaster Postgres Cluster and Multimaster postgrespro.ru Ivan Panchenko Postgres Pro Cluster definition: several DBs working as one Redundancy Sharding Parallel query processing Failover Dynamic reconfiguration

More information

Binlog Servers (and MySQL) at Booking.com. Jean-François Gagné jeanfrancois DOT gagne AT booking.com Presented at Percona Live Santa Clara 2015

Binlog Servers (and MySQL) at Booking.com. Jean-François Gagné jeanfrancois DOT gagne AT booking.com Presented at Percona Live Santa Clara 2015 Binlog Servers (and MySQL) at Booking.com Jean-François Gagné jeanfrancois DOT gagne AT booking.com Presented at Percona Live Santa Clara 2015 Booking.com 2 Booking.com Based in Amsterdam since 1996 Online

More information

Run your own Open source. (MMS) to avoid vendor lock-in. David Murphy MongoDB Practice Manager, Percona

Run your own Open source. (MMS) to avoid vendor lock-in. David Murphy MongoDB Practice Manager, Percona Run your own Open source Click alternative to edit to Master Ops-Manager title style (MMS) to avoid vendor lock-in David Murphy MongoDB Practice Manager, Percona Who is this Person and What Does He Know?

More information

MySQL Group Replication & MySQL InnoDB Cluster

MySQL Group Replication & MySQL InnoDB Cluster MySQL Group Replication & MySQL InnoDB Cluster Production Ready? Kenny Gryp productions Table of Contents Group Replication MySQL Shell (AdminAPI) MySQL Group Replication MySQL Router Best Practices Limitations

More information

PRIMARY-BACKUP REPLICATION

PRIMARY-BACKUP REPLICATION PRIMARY-BACKUP REPLICATION Primary Backup George Porter Nov 14, 2018 ATTRIBUTION These slides are released under an Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) Creative Commons

More information

AlwaysOn Availability Groups: Backups, Restores, and CHECKDB

AlwaysOn Availability Groups: Backups, Restores, and CHECKDB AlwaysOn Availability Groups: Backups, Restores, and CHECKDB www.brentozar.com sp_blitz sp_blitzfirst email newsletter videos SQL Critical Care 2016 Brent Ozar Unlimited. All rights reserved. 1 What I

More information

FromDual Annual Company Meeting

FromDual Annual Company Meeting FromDual Annual Company Meeting Athens, 2013 Galera Cluster for MySQL http:// 1 / 26 About FromDual GmbH (LLC) FromDual provides neutral and independent: Consulting for MySQL Support for MySQL and Galera

More information

Technology Overview ScaleArc. All Rights Reserved.

Technology Overview ScaleArc. All Rights Reserved. 2014 ScaleArc. All Rights Reserved. Contents Contents...1 ScaleArc Overview...1 Who ScaleArc Helps...2 Historical Database Challenges...3 Use Cases and Projects...5 Sample ScaleArc Customers...5 Summary

More information

Everything You Need to Know About MySQL Group Replication

Everything You Need to Know About MySQL Group Replication Everything You Need to Know About MySQL Group Replication Luís Soares (luis.soares@oracle.com) Principal Software Engineer, MySQL Replication Lead Copyright 2017, Oracle and/or its affiliates. All rights

More information

MyRocks deployment at Facebook and Roadmaps. Yoshinori Matsunobu Production Engineer / MySQL Tech Lead, Facebook Feb/2018, #FOSDEM #mysqldevroom

MyRocks deployment at Facebook and Roadmaps. Yoshinori Matsunobu Production Engineer / MySQL Tech Lead, Facebook Feb/2018, #FOSDEM #mysqldevroom MyRocks deployment at Facebook and Roadmaps Yoshinori Matsunobu Production Engineer / MySQL Tech Lead, Facebook Feb/2018, #FOSDEM #mysqldevroom Agenda MySQL at Facebook MyRocks overview Production Deployment

More information

big picture parallel db (one data center) mix of OLTP and batch analysis lots of data, high r/w rates, 1000s of cheap boxes thus many failures

big picture parallel db (one data center) mix of OLTP and batch analysis lots of data, high r/w rates, 1000s of cheap boxes thus many failures Lecture 20 -- 11/20/2017 BigTable big picture parallel db (one data center) mix of OLTP and batch analysis lots of data, high r/w rates, 1000s of cheap boxes thus many failures what does paper say Google

More information

Switching to Innodb from MyISAM. Matt Yonkovit Percona

Switching to Innodb from MyISAM. Matt Yonkovit Percona Switching to Innodb from MyISAM Matt Yonkovit Percona -2- DIAMOND SPONSORSHIPS THANK YOU TO OUR DIAMOND SPONSORS www.percona.com -3- Who We Are Who I am Matt Yonkovit Principal Architect Veteran of MySQL/SUN/Percona

More information

Percona XtraDB Cluster

Percona XtraDB Cluster Percona XtraDB Cluster Ensure High Availability Presenter Karthik P R CEO Mydbops www.mydbops.com info@mydbops.com Mydbops Mydbops is into MySQL/MongoDB Support and Consulting. It is founded by experts

More information

CS October 2017

CS October 2017 Atomic Transactions Transaction An operation composed of a number of discrete steps. Distributed Systems 11. Distributed Commit Protocols All the steps must be completed for the transaction to be committed.

More information

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS Carnegie Mellon Univ. Dept. of Computer Science 15-415/615 - DB s C. Faloutsos A. Pavlo Lecture#23: Distributed Database Systems (R&G ch. 22) Administrivia Final Exam Who: You What: R&G Chapters 15-22

More information

Mix n Match Async and Group Replication for Advanced Replication Setups. Pedro Gomes Software Engineer

Mix n Match Async and Group Replication for Advanced Replication Setups. Pedro Gomes Software Engineer Mix n Match Async and Group Replication for Advanced Replication Setups Pedro Gomes (pedro.gomes@oracle.com) Software Engineer 4th of February Copyright 2017, Oracle and/or its affiliates. All rights reserved.

More information

Easy PostgreSQL Clustering with Patroni. Ants Aasma

Easy PostgreSQL Clustering with Patroni. Ants Aasma Easy PostgreSQL Clustering with Patroni Introduction About me Support engineer at Cybertec Helping others run PostgreSQL for 5 years. Helping myself run PostgreSQL since 7.4 days. What are we going to

More information

Using GPUaaS in Cloud Foundry

Using GPUaaS in Cloud Foundry Using GPUaaS in Cloud Foundry Agenda Introduction GPUaaS Cloud Foundry Integration 2 Technology Research Innovation Group Innovation Advanced Research Proof of Concept User Feedback Agile Roadmap 3 Technology

More information

ShardProxy Replication-Manager. DataOps - Juin 2018 Kentoku SHIBA - Stephane VAROQUI

ShardProxy Replication-Manager. DataOps - Juin 2018 Kentoku SHIBA - Stephane VAROQUI ShardProxy Replication-Manager DataOps - Juin 2018 Kentoku SHIBA - Stephane VAROQUI 1 Replication Manager MRM 2 Replication-Manager - Sharding About Sharding Frameworks They mostly don t do efficient joins

More information

Aurora, RDS, or On-Prem, Which is right for you

Aurora, RDS, or On-Prem, Which is right for you Aurora, RDS, or On-Prem, Which is right for you Kathy Gibbs Database Specialist TAM Katgibbs@amazon.com Santa Clara, California April 23th 25th, 2018 Agenda RDS Aurora EC2 On-Premise Wrap-up/Recommendation

More information

MongoDB. David Murphy MongoDB Practice Manager, Percona

MongoDB. David Murphy MongoDB Practice Manager, Percona MongoDB Click Replication to edit Master and Sharding title style David Murphy MongoDB Practice Manager, Percona Who is this Person and What Does He Know? Former MongoDB Master Former Lead DBA for ObjectRocket,

More information

Various MySQL High Availability (HA) Solutions

Various MySQL High Availability (HA) Solutions Various MySQL High Availability (HA) Solutions Percona Live MySQL Conference, London, Oct 24 th and 25 th, 2011 Oli Sennhauser Senior MySQL Consultant at FromDual GmbH oli.sennhauser@fromdual.com www.fromdual.com

More information

MariaDB 10.3 vs MySQL 8.0. Tyler Duzan, Product Manager Percona

MariaDB 10.3 vs MySQL 8.0. Tyler Duzan, Product Manager Percona MariaDB 10.3 vs MySQL 8.0 Tyler Duzan, Product Manager Percona Who Am I? My name is Tyler Duzan Formerly an operations engineer for more than 12 years focused on security and automation Now a Product Manager

More information

Primary/Backup. CS6450: Distributed Systems Lecture 3/4. Ryan Stutsman

Primary/Backup. CS6450: Distributed Systems Lecture 3/4. Ryan Stutsman Primary/Backup CS6450: Distributed Systems Lecture 3/4 Ryan Stutsman Material taken/derived from Princeton COS-418 materials created by Michael Freedman and Kyle Jamieson at Princeton University. Licensed

More information

MySQL Performance Improvements

MySQL Performance Improvements Taking Advantage of MySQL Performance Improvements Baron Schwartz, Percona Inc. Introduction About Me (Baron Schwartz) Author of High Performance MySQL 2 nd Edition Creator of Maatkit, innotop, and so

More information