UPGRADING A MESOS CLUSTER

Size: px
Start display at page:

Download "UPGRADING A MESOS CLUSTER"

Transcription

1 MesosCon Greg Mann UPGRADING A MESOS CLUSTER 2016 Mesosphere, Inc. All Rights Reserved. 1

2 Greg Mann Software Engineer Mesos contributor Computational chemist Croissant 2016 Mesosphere, Inc. All Rights Reserved. 2

3 THE DREAM Happy user Happy user Happy user Happy user Web Service Mesos Mesosphere, Inc. All Rights Reserved. 3

4 THE DREAM Happy user Happy user Happy user Happy user Web Service Mesos Mesos Mesosphere, Inc. All Rights Reserved. 4

5 THE DREAM Web Service Mesos Mesos Mesosphere, Inc. All Rights Reserved. 5

6 DEPRECATION & UPGRADES Mesos is quickly adding new features, and version 1.0 is nearly upon us - the upgrade policy is changing Mesosphere, Inc. All Rights Reserved. 6

7 DEPRECATION & UPGRADES For all Mesos versions: Functionality will be deprecated for 6 months before removal 2016 Mesosphere, Inc. All Rights Reserved. 7

8 DEPRECATION & UPGRADES For Mesos versions 0.28 and earlier: Any Mesos master or agent is compatible within +/- 1 minor version during rolling upgrades 2016 Mesosphere, Inc. All Rights Reserved. 8

9 DEPRECATION & UPGRADES From version 1.0 onward: All stable versions are loosely compatible Loose compatibility requires no dependence on deprecated functionality This means that intermediate minor releases can be skipped in a rolling upgrade as long as no deprecated features are being used! 2016 Mesosphere, Inc. All Rights Reserved. 9

10 WHAT CAN CHANGE? 2016 Mesosphere, Inc. All Rights Reserved. 10

11 WHAT CAN CHANGE? Knowledge of the potential changes from one version of Mesos to the next tells us what to look for during an upgrade Mesosphere, Inc. All Rights Reserved. 11

12 PROTOCOL BUFFERS Messages sent between master and agent are Google protobuf messages Members of these messages may change: Can be added or removed Field type may change from optional to required or vice-versa 2016 Mesosphere, Inc. All Rights Reserved. 12

13 PROTOCOL BUFFERS Messages sent between master and agent are Google protobuf messages : message ReservationInfo { required string principal = 1; } Members of these messages may change: Can be added or removed Field type may change from optional to required or vice-versa , with checks to ensure it s set: message ReservationInfo { optional string principal = 1; } : checks removed 2016 Mesosphere, Inc. All Rights Reserved. 13

14 PROTOCOL BUFFERS : message ReservationInfo { required string principal = 1; } When an incompatible message is received by a master or agent, it can be silently dropped! , with checks to ensure it s set: message ReservationInfo { optional string principal = 1; } : checks removed 2016 Mesosphere, Inc. All Rights Reserved. 14

15 MESOS APIs Mesos defines APIs for developers to code against: Native scheduler/executor interface (libmesos) HTTP scheduler/executor API interface Authenticators Authorizers Isolators Anonymous HTTP operator API 2016 Mesosphere, Inc. All Rights Reserved. 15

16 MESOS APIs Mesos defines APIs for developers to code against: Native scheduler/executor interface (libmesos) HTTP scheduler/executor API interface Authenticators Authorizers Isolators Anonymous HTTP operator API This includes the SchedulerDriver and ExecutorDriver APIs Provides C++, Java, and Python libraries for writing frameworks/executors. Schedulers and executors should make use of the same library that the Mesos master/agent are linked against Mesosphere, Inc. All Rights Reserved. 16

17 MESOS APIs Mesos defines APIs for developers to code against: Native scheduler/executor interface (libmesos) HTTP scheduler/executor API interface Authenticators Authorizers Isolators Anonymous HTTP operator API New APIs, which will eventually replace the native interfaces Temporary scheduler and executor shims in Java will enable drop-in replacement Frameworks and executors can simply adhere to the appropriate versioned API, beginning with v1 in Mesos Mesosphere, Inc. All Rights Reserved. 17

18 MESOS APIs Mesos defines APIs for developers to code against: Native scheduler/executor interface (libmesos) HTTP scheduler/executor API interface Authenticators Authorizers Isolators Anonymous HTTP operator API Mesos supports both master and agent modules Hooks are provided: their placement within the scheduler/executor life cycle may change Should be recompiled with the appropriate libmesos 2016 Mesosphere, Inc. All Rights Reserved. 18

19 MESOS APIs Mesos defines APIs for developers to code against: Native scheduler/executor interface (libmesos) HTTP scheduler/executor API interface Authenticators Authorizers Isolators Anonymous HTTP operator API For Mesos 0.28 and earlier, the operator HTTP endpoints are not versioned. The changelog of each release covers changes. Beginning with Mesos 1.0, the operator API is versioned. The API versions correspond to major release versions Mesosphere, Inc. All Rights Reserved. 19

20 ABI COMPATIBILITY Language bindings and module libraries should all be built using the same environment and dependencies Anything depending on libmesos should use the latest, upgraded version 2016 Mesosphere, Inc. All Rights Reserved. 20

21 UPGRADE PROCEDURE 2016 Mesosphere, Inc. All Rights Reserved. 21

22 UPGRADE PROCEDURE Scheduler ZooKeeper Quorum 1. Recompile Mesos modules 2. Masters: rolling upgrade 3. Agents: rolling upgrade 4. Upgrade frameworks Leading Master Mesos Master Mesos Master Agent Agent Task Task 2016 Mesosphere, Inc. All Rights Reserved. 22

23 UPGRADE PROCEDURE Scheduler ZooKeeper Quorum 1. Recompile Mesos modules 2. Masters: rolling upgrade 3. Agents: rolling upgrade 4. Upgrade frameworks Leading Master Mesos Master Mesos Master Agent Agent Task Task 2016 Mesosphere, Inc. All Rights Reserved. 23

24 UPGRADE PROCEDURE 1. Recompile Mesos modules s should be built using the same versions of dependencies as the Mesos master/agent. s should use the same libmesos as the master/agent. Note: bundled dependencies in Mesos may change from one version to the next! 2016 Mesosphere, Inc. All Rights Reserved. 24

25 UPGRADE PROCEDURE Scheduler ZooKeeper Quorum 1. Recompile Mesos modules 2. Masters: rolling upgrade 3. Agents: rolling upgrade 4. Upgrade frameworks Leading Master Mesos Master Mesos Master Agent Agent Task Task 2016 Mesosphere, Inc. All Rights Reserved. 25

26 UPGRADE PROCEDURE 2. Masters: rolling upgrade 1. Stop the master process 2. Remove old master, install new 3. Start the new, upgraded master 4. Wait for the replicated log to finish recovering: Successfully recovered registrar 5. Return to step #1 on another node 2016 Mesosphere, Inc. All Rights Reserved. 26

27 UPGRADE PROCEDURE Scheduler ZooKeeper Quorum 1. Recompile Mesos modules 2. Masters: rolling upgrade 3. Agents: rolling upgrade 4. Upgrade frameworks Leading Master Mesos Master Mesos Master Agent Agent Task Task 2016 Mesosphere, Inc. All Rights Reserved. 27

28 UPGRADE PROCEDURE 3. Agents: rolling upgrade 1. Stop the agent process 2. Remove old agent, install new 3. Start the new, upgraded agent 4. Wait for the agent to re-register and complete recovery: Finished recovery 5. Return to step #1 on another node 2016 Mesosphere, Inc. All Rights Reserved. 28

29 UPGRADE PROCEDURE Scheduler ZooKeeper Quorum 1. Recompile Mesos modules 2. Masters: rolling upgrade 3. Agents: rolling upgrade 4. Upgrade schedulers Leading Master Mesos Master Mesos Master Agent Agent Task Task 2016 Mesosphere, Inc. All Rights Reserved. 29

30 UPGRADE PROCEDURE 4. Upgrade frameworks 1. Stop scheduler 2. If needed, push new executor 3. Start upgraded framework 4. Allow framework to re-register with the master; this will trigger the reregistered() callback Frameworks should be built using the same libmesos as the new master and agent. Highly available frameworks may require a rolling upgrade. Frameworks will interact with masters and agents of two different versions during upgrade! 2016 Mesosphere, Inc. All Rights Reserved. 30

31 FURTHER ADVICE & POTENTIAL PITFALLS 2016 Mesosphere, Inc. All Rights Reserved. 31

32 FURTHER ADVICE Upgrade the leading master last. If the leading master is upgraded first, a leader election will occur. Scheduler ZooKeeper Quorum Leading Master Mesos Master Mesos Master Agent Agent Task Task 2016 Mesosphere, Inc. All Rights Reserved. 32

33 FURTHER ADVICE Upgrade the leading master last. If the leading master is upgraded first, a leader election will occur. Scheduler ZooKeeper Quorum This implies a minimum of two leader elections during upgrade. Mesos Master Leading Master Mesos Master Upgrading the leader last permits an upgrade with just one election Agent Task Agent Task 2016 Mesosphere, Inc. All Rights Reserved. 33

34 FURTHER ADVICE Backup ZooKeeper state ZooKeeper is used for master leader election, and some frameworks may store state here Scheduler ZooKeeper Quorum A successful upgrade will not impact the ZooKeeper ensemble, but a backup provides insurance Leading Master Mesos Master Mesos Master Agent Agent Task Task 2016 Mesosphere, Inc. All Rights Reserved. 34

35 FURTHER ADVICE Use different copies of libmesos for different components Mesosphere, Inc. All Rights Reserved. 35

36 FURTHER ADVICE Don t apply configuration and binary changes to the cluster simultaneously Mesosphere, Inc. All Rights Reserved. 36

37 FURTHER ADVICE Don t apply configuration and binary changes to the cluster simultaneously. 1. If a config change is needed to avoid deprecated features, do this first 2. Upgrade binaries 3. Upgrade configurations to use new features if needed 2016 Mesosphere, Inc. All Rights Reserved. 37

38 POTENTIAL PITFALLS A flapping master or agent process may occur when they re run under an init system. The program repeatedly fails, while the init system keeps restarting Mesosphere, Inc. All Rights Reserved. 38

39 POTENTIAL PITFALLS A flapping master or agent process may occur when they re run under an init system. If the master/agent configuration isn t compatible with the new binary after upgrade, it will flap. The program repeatedly fails, while the init system keeps restarting. If the agent cannot reach the master, it will flap Mesosphere, Inc. All Rights Reserved. 39

40 VISIT OUR BOOTH Located at D1, near the main entrance FirstName Office Hours Tuesday 1-3pm Wednesday 4:40-5:15pm Learn more by visiting DCOS.io and Mesosphere.com 2016 Mesosphere, Inc. All Rights Reserved. 40

41 THANK 2016 Mesosphere, Inc. All Rights Reserved. 41

SECURING A MARATHON INSTALLATION 2016

SECURING A MARATHON INSTALLATION 2016 MesosCon EU 2016 - Gastón Kleiman SECURING A MARATHON INSTALLATION 2016 2016 Mesosphere, Inc. All Rights Reserved. 1 Gastón Kleiman Distributed Systems Engineer Marathon/Mesos contributor gaston@mesosphere.io

More information

@joerg_schad Nightmares of a Container Orchestration System

@joerg_schad Nightmares of a Container Orchestration System @joerg_schad Nightmares of a Container Orchestration System 2017 Mesosphere, Inc. All Rights Reserved. 1 Jörg Schad Distributed Systems Engineer @joerg_schad Jan Repnak Support Engineer/ Solution Architect

More information

Distributed Data on Distributed Infrastructure. Claudius Weinberger & Kunal Kusoorkar, ArangoDB Jörg Schad, Mesosphere

Distributed Data on Distributed Infrastructure. Claudius Weinberger & Kunal Kusoorkar, ArangoDB Jörg Schad, Mesosphere Distributed Data on Distributed Infrastructure Claudius Weinberger & Kunal Kusoorkar, ArangoDB Jörg Schad, Mesosphere Kunal Kusoorkar Director Solutions Engineering, ArangoDB @neunhoef Jörg Schad Claudius

More information

CONTINUOUS DELIVERY WITH MESOS, DC/OS AND JENKINS

CONTINUOUS DELIVERY WITH MESOS, DC/OS AND JENKINS APACHE MESOS NYC MEETUP SEPTEMBER 22, 2016 CONTINUOUS DELIVERY WITH MESOS, DC/OS AND JENKINS WHO WE ARE ROGER IGNAZIO SUNIL SHAH Tech Lead at Mesosphere @rogerignazio Product Manager at Mesosphere @ssk2

More information

CONTINUOUS DELIVERY WITH DC/OS AND JENKINS

CONTINUOUS DELIVERY WITH DC/OS AND JENKINS SOFTWARE ARCHITECTURE NOVEMBER 15, 2016 CONTINUOUS DELIVERY WITH DC/OS AND JENKINS AGENDA Presentation Introduction to Apache Mesos and DC/OS Components that make up modern infrastructure Running Jenkins

More information

Secrets Management in Mesos. Vinod Kone Greg Mann

Secrets Management in Mesos. Vinod Kone Greg Mann Secrets Management in Mesos Vinod Kone (vinodkone@apache.org) Greg Mann (grag@apache.org) Introduction Vinod Kone Apache Mesos committer / PMC member Tech Lead Manager @ Mesosphere JIRA / ReviewBoard username:

More information

Issues Fixed in DC/OS

Issues Fixed in DC/OS Release Notes for 1.10.4 These are the release notes for DC/OS 1.10.4. DOWNLOAD DC/OS OPEN SOURCE Issues Fixed in DC/OS 1.10.4 CORE-1375 - Docker executor does not hang due to lost messages. DOCS-2169

More information

This document provides instructions for upgrading a DC/OS cluster.

This document provides instructions for upgrading a DC/OS cluster. Upgrading ENTERPRISE This document provides instructions for upgrading a DC/OS cluster. If this upgrade is performed on a supported OS with all prerequisites fulfilled, this upgrade should preserve the

More information

APACHE COTTON. MySQL on Mesos. Yan Xu xujyan

APACHE COTTON. MySQL on Mesos. Yan Xu xujyan APACHE COTTON MySQL on Mesos Yan Xu xujyan 1 SHORT HISTORY Mesos: cornerstone of Twitter s compute platform. MySQL: backbone of Twitter s data platform. Mysos: started as a hackweek project @twitter. Apache

More information

2016 Mesosphere, Inc. All Rights Reserved.

2016 Mesosphere, Inc. All Rights Reserved. MesosCon 2016 - Qian Zhang (IBM China), Avinash Sridharan, Jie Yu (Mesosphere) Container Network Interface (CNI) for Mesos: The `network/cni` isolator. 1 Qian Zhang Software Engineer zhangqxa@cn.ibm.com

More information

Secrets Management in Mesos

Secrets Management in Mesos Secrets Management in Mesos Vinod Kone (vinodkone@apache.org) MesosCon EU 2017 About me Apache Mesos PMC and Committer Engineering Manager for Mesos team @ Mesosphere Previously Tech Lead for Mesos team

More information

利用 Mesos 打造高延展性 Container 環境. Frank, Microsoft MTC

利用 Mesos 打造高延展性 Container 環境. Frank, Microsoft MTC 利用 Mesos 打造高延展性 Container 環境 Frank, Microsoft MTC About Me Developer @ Yahoo! DevOps @ HTC Technical Architect @ MSFT Agenda About Docker Manage containers Apache Mesos Mesosphere DC/OS application = application

More information

1.0. Vinod Kone Anand Mazumdar

1.0. Vinod Kone Anand Mazumdar 1.0 Vinod Kone (vinod@mesosphere.io) Anand Mazumdar (anand@mesosphere.io) MESOS 1.0 1.0 Released on July 27th 2016 Biggest release ever! 1.0.1 Released on Aug 24th 2016 Please use this one! WHAT S IN 1.0?

More information

Introduction to Mesos and the Datacenter Operating System

Introduction to Mesos and the Datacenter Operating System Introduction to Mesos and the Datacenter Operating System Artem Harutyunyan (artem@mesosphere.io) 2016 Mesosphere, Inc. All Rights Reserved. INTRO $ whoami ARTEM HARUTYUNYAN ALICE Offline (2004-2010) AliEn

More information

Installing SmartSense on HDP

Installing SmartSense on HDP 1 Installing SmartSense on HDP Date of Publish: 2018-07-12 http://docs.hortonworks.com Contents SmartSense installation... 3 SmartSense system requirements... 3 Operating system, JDK, and browser requirements...3

More information

Building/Running Distributed Systems with Apache Mesos

Building/Running Distributed Systems with Apache Mesos Building/Running Distributed Systems with Apache Mesos Philly ETE April 8, 2015 Benjamin Hindman @benh $ whoami 2007-2012 2009-2010 - 2014 my other computer is a datacenter my other computer is a datacenter

More information

CS 138: Google. CS 138 XVII 1 Copyright 2016 Thomas W. Doeppner. All rights reserved.

CS 138: Google. CS 138 XVII 1 Copyright 2016 Thomas W. Doeppner. All rights reserved. CS 138: Google CS 138 XVII 1 Copyright 2016 Thomas W. Doeppner. All rights reserved. Google Environment Lots (tens of thousands) of computers all more-or-less equal - processor, disk, memory, network interface

More information

Upgrade Guide. Platform Compatibility. Dell SonicWALL Aventail E-Class SRA 10.7 Upgrade Guide. Secure Remote Access

Upgrade Guide. Platform Compatibility. Dell SonicWALL Aventail E-Class SRA 10.7 Upgrade Guide. Secure Remote Access Secure Remote Access Dell SonicWALL Aventail E-Class SRA 10.7 This document describes the process of obtaining your Dell SonicWALL Aventail E-Class SRA firmware update file, verifying it, and installing

More information

Hortonworks SmartSense

Hortonworks SmartSense Hortonworks SmartSense Installation (January 8, 2018) docs.hortonworks.com Hortonworks SmartSense: Installation Copyright 2012-2018 Hortonworks, Inc. Some rights reserved. The Hortonworks Data Platform,

More information

Container Orchestration on Amazon Web Services. Arun

Container Orchestration on Amazon Web Services. Arun Container Orchestration on Amazon Web Services Arun Gupta, @arungupta Docker Workflow Development using Docker Docker Community Edition Docker for Mac/Windows/Linux Monthly edge and quarterly stable

More information

CS 138: Google. CS 138 XVI 1 Copyright 2017 Thomas W. Doeppner. All rights reserved.

CS 138: Google. CS 138 XVI 1 Copyright 2017 Thomas W. Doeppner. All rights reserved. CS 138: Google CS 138 XVI 1 Copyright 2017 Thomas W. Doeppner. All rights reserved. Google Environment Lots (tens of thousands) of computers all more-or-less equal - processor, disk, memory, network interface

More information

A Distributed System Case Study: Apache Kafka. High throughput messaging for diverse consumers

A Distributed System Case Study: Apache Kafka. High throughput messaging for diverse consumers A Distributed System Case Study: Apache Kafka High throughput messaging for diverse consumers As always, this is not a tutorial Some of the concepts may no longer be part of the current system or implemented

More information

Dynamic Reconfiguration of Primary/Backup Clusters

Dynamic Reconfiguration of Primary/Backup Clusters Dynamic Reconfiguration of Primary/Backup Clusters (Apache ZooKeeper) Alex Shraer Yahoo! Research In collaboration with: Benjamin Reed Dahlia Malkhi Flavio Junqueira Yahoo! Research Microsoft Research

More information

Mesosphere and Percona Server for MongoDB. Jeff Sandstrom, Product Manager (Percona) Ravi Yadav, Tech. Partnerships Lead (Mesosphere)

Mesosphere and Percona Server for MongoDB. Jeff Sandstrom, Product Manager (Percona) Ravi Yadav, Tech. Partnerships Lead (Mesosphere) Mesosphere and Percona Server for MongoDB Jeff Sandstrom, Product Manager (Percona) Ravi Yadav, Tech. Partnerships Lead (Mesosphere) Mesosphere DC/OS MICROSERVICES, CONTAINERS, & DEV TOOLS DATA SERVICES,

More information

Mesosphere and Percona Server for MongoDB. Peter Schwaller, Senior Director Server Eng. (Percona) Taco Scargo, Senior Solution Engineer (Mesosphere)

Mesosphere and Percona Server for MongoDB. Peter Schwaller, Senior Director Server Eng. (Percona) Taco Scargo, Senior Solution Engineer (Mesosphere) Mesosphere and Percona Server for MongoDB Peter Schwaller, Senior Director Server Eng. (Percona) Taco Scargo, Senior Solution Engineer (Mesosphere) Mesosphere DC/OS MICROSERVICES, CONTAINERS, & DEV TOOLS

More information

Building a Data-Friendly Platform for a Data- Driven Future

Building a Data-Friendly Platform for a Data- Driven Future Building a Data-Friendly Platform for a Data- Driven Future Benjamin Hindman - @benh 2016 Mesosphere, Inc. All Rights Reserved. INTRO $ whoami BENJAMIN HINDMAN Co-founder and Chief Architect of Mesosphere,

More information

Architecting for Failure in a Containerized World. Tom Faulhaber Infolace

Architecting for Failure in a Containerized World. Tom Faulhaber Infolace Architecting for Failure in a Containerized World Tom Faulhaber Infolace How can container tech help us build robust systems? Key takeaway: an architectural toolkit for building robust systems with

More information

Distributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA

Distributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA Distributed CI: Scaling Jenkins on Mesos and Marathon Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA About Me Roger Ignazio QE Automation Engineer Puppet Labs, Inc. @rogerignazio Mesos In Action

More information

Understanding Feature and Network Services in Cisco Unified Serviceability

Understanding Feature and Network Services in Cisco Unified Serviceability CHAPTER 10 Understanding Feature and Network Services in Cisco Unified Serviceability May 19, 2009 Cisco Unified Serviceability service management includes working with feature and network services and

More information

OpenWhisk on Mesos. Tyson Norris/Dragos Dascalita Haut, Adobe Systems, Inc.

OpenWhisk on Mesos. Tyson Norris/Dragos Dascalita Haut, Adobe Systems, Inc. OpenWhisk on Mesos Tyson Norris/Dragos Dascalita Haut, Adobe Systems, Inc. OPENWHISK ON MESOS SERVERLESS BACKGROUND OPERATIONAL EVOLUTION SERVERLESS BACKGROUND CUSTOMER FOCUSED DEVELOPMENT SERVERLESS BACKGROUND

More information

Understanding High Availability options for PostgreSQL

Understanding High Availability options for PostgreSQL Understanding High Availability options for PostgreSQL Madan Kumar K Member of Technical Staff, ScaleGrid.io @ImMadanK High Availability 101 Redundancy is the key Standalone vs. Master-Standby Master Server

More information

KIP-43: Kafka SASL enhancements

KIP-43: Kafka SASL enhancements KIP-43: Kafka SASL enhancements Status Motivation Public Interfaces Configuration options Protocol changes Proposed Changes SASL configuration SASL mechanism Support for multiple mechanisms in a broker

More information

High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack

High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack Eve Kleinknecht Principal Product Manager Thorsten Früauf Principal Software Engineer November 18, 2015 Safe Harbor Statement

More information

Module 1: Understanding and Installing Internet Information Services

Module 1: Understanding and Installing Internet Information Services Course Outline Module 1: Understanding and Installing Internet Information Services In this module, you will learn about the infrastructure prerequisites for using Microsoft Internet Information Services

More information

ZooKeeper & Curator. CS 475, Spring 2018 Concurrent & Distributed Systems

ZooKeeper & Curator. CS 475, Spring 2018 Concurrent & Distributed Systems ZooKeeper & Curator CS 475, Spring 2018 Concurrent & Distributed Systems Review: Agreement In distributed systems, we have multiple nodes that need to all agree that some object has some state Examples:

More information

6 Months Training Module in MS SQL SERVER 2012

6 Months Training Module in MS SQL SERVER 2012 6 Months Training Module in MS SQL SERVER 2012 Module 1 Installing and Configuring Windows Server 2012 Installing and Managing Windows Server 2012 Windows Server 2012 Overview Installing Windows Server

More information

dbdeployer the future of MySQL sandboxes Giuseppe Maxia Software explorer

dbdeployer the future of MySQL sandboxes Giuseppe Maxia Software explorer dbdeployer the future of MySQL sandboxes Giuseppe Maxia Software explorer About me Who's this guy? Giuseppe Maxia, a.k.a. "The Data Charmer" Software Explorer at VMware Several decades development and

More information

Seagull: A distributed, fault tolerant, concurrent task runner. Sagar Patwardhan

Seagull: A distributed, fault tolerant, concurrent task runner. Sagar Patwardhan Seagull: A distributed, fault tolerant, concurrent task runner Sagar Patwardhan sagarp@yelp.com Yelp s Mission Connecting people with great local businesses. Yelp scale Outline What is Seagull? Why did

More information

Replication in Distributed Systems

Replication in Distributed Systems Replication in Distributed Systems Replication Basics Multiple copies of data kept in different nodes A set of replicas holding copies of a data Nodes can be physically very close or distributed all over

More information

Road to Auto Scaling

Road to Auto Scaling Road to Auto Scaling Varun Thacker Lucidworks Apache Lucene/Solr Committer, and PMC member Agenda APIs Metrics Recipes Auto-Scale Triggers SolrCloud Overview ZooKee per Lots Shard 1 Leader Shard 3 Replica

More information

Container 2.0. Container: check! But what about persistent data, big data or fast data?!

Container 2.0. Container: check! But what about persistent data, big data or fast data?! @unterstein @joerg_schad @dcos @jaxdevops Container 2.0 Container: check! But what about persistent data, big data or fast data?! 1 Jörg Schad Distributed Systems Engineer @joerg_schad Johannes Unterstein

More information

Read the following information carefully, before you begin an upgrade.

Read the following information carefully, before you begin an upgrade. Read the following information carefully, before you begin an upgrade. Review Supported Upgrade Paths, page 1 Review Time Taken for Upgrade, page 1 Review Available Cisco APIC-EM Ports, page 2 Securing

More information

Scale your Docker containers with Mesos

Scale your Docker containers with Mesos Scale your Docker containers with Mesos Timothy Chen tim@mesosphere.io About me: - Distributed Systems Architect @ Mesosphere - Lead Containerization engineering - Apache Mesos, Drill PMC / Committer

More information

Marathon has a timer metric that determines how long an event has taken place. Timer does not exist for Mesos observability metrics.

Marathon has a timer metric that determines how long an event has taken place. Timer does not exist for Mesos observability metrics. Performance Monitoring Here are some recommendations for monitoring a DC/OS cluster. You can use any monitoring tools. The endpoints listed below will help you troubleshoot when issues occur. Your monitoring

More information

Scalable task distribution with Scala, Akka and Mesos. Dario

Scalable task distribution with Scala, Akka and Mesos. Dario Scalable task distribution with Scala, Akka and Mesos Dario Rexin @evonox What is Mesos? 2 What is Mesos? Apache open source project Distributed systems kernel Multi resource scheduler (CPU, Memory, Ports,

More information

AGILE DEVELOPMENT AND PAAS USING THE MESOSPHERE DCOS

AGILE DEVELOPMENT AND PAAS USING THE MESOSPHERE DCOS Sunil Shah AGILE DEVELOPMENT AND PAAS USING THE MESOSPHERE DCOS 1 THE DATACENTER OPERATING SYSTEM (DCOS) 2 DCOS INTRODUCTION The Mesosphere Datacenter Operating System (DCOS) is a distributed operating

More information

MesosCon Qian Zhang (IBM China), Jie Yu (Mesosphere) OCI Support in Mesos Mesosphere, Inc. All Rights Reserved. 1

MesosCon Qian Zhang (IBM China), Jie Yu (Mesosphere) OCI Support in Mesos Mesosphere, Inc. All Rights Reserved. 1 MesosCon 2017 - Qian Zhang (IBM China), Jie Yu (Mesosphere) OCI Support in Mesos 2016 Mesosphere, Inc. All Rights Reserved. 1 Qian Zhang Software Engineer Jie Yu Software Engineer zhq527725@gmail.com jie@mesosphere.io

More information

ZooKeeper Dynamic Reconfiguration

ZooKeeper Dynamic Reconfiguration by Table of contents 1 Overview... 2 2 Changes to Configuration Format...2 2.1 Specifying the client port... 2 2.2 The standaloneenabled flag...3 2.3 Dynamic configuration file...3 2.4 Backward compatibility...

More information

Installing SQL Server Developer Last updated 8/28/2010

Installing SQL Server Developer Last updated 8/28/2010 Installing SQL Server Developer Last updated 8/28/2010 1. Run Setup.Exe to start the setup of SQL Server 2008 Developer 2. On some OS installations (i.e. Windows 7) you will be prompted a reminder to install

More information

Performance and Forgiveness. June 23, 2008 Margo Seltzer Harvard University School of Engineering and Applied Sciences

Performance and Forgiveness. June 23, 2008 Margo Seltzer Harvard University School of Engineering and Applied Sciences Performance and Forgiveness June 23, 2008 Margo Seltzer Harvard University School of Engineering and Applied Sciences Margo Seltzer Architect Outline A consistency primer Techniques and costs of consistency

More information

Memory may be insufficient. Memory may be insufficient.

Memory may be insufficient. Memory may be insufficient. Error code Less than 200 Error code Error type Description of the circumstances under which the problem occurred Linux system call error. Explanation of possible causes Countermeasures 1001 CM_NO_MEMORY

More information

ProIO Key Concepts. ProIO is for PROS! right in the name

ProIO Key Concepts. ProIO is for PROS! right in the name ProIO David Blyth The Project Inspired by works from S. Chekanov and A. Kiselev Lives at https://github.com/decibelcoo per/proio Ooh, shiny badges! Continuous Integration: no code merges without sufficient

More information

Fault Domains in Mesos. Vinod Kone

Fault Domains in Mesos. Vinod Kone Fault Domains in Mesos Vinod Kone (vinodkone@apache.org) About me Apache Mesos PMC and Committer Engineering Manager for Mesos team @ Mesosphere Previously Tech Lead for Mesos team @ Twitter PhD in Computer

More information

Reliable Crash Detection and Failover with Orchestrator

Reliable Crash Detection and Failover with Orchestrator 1 Reliable Crash Detection and Failover with Orchestrator Shlomi Noach, PerconaLive 2016 " How people build software Agenda Orchestrator Topologies, crash scenarios Crash detection methods Promotion complexity

More information

Acronis Data Cloud plugin for ConnectWise Automate

Acronis Data Cloud plugin for ConnectWise Automate Acronis Data Cloud plugin for ConnectWise Automate USER S GUIDE Revision: 17.01.2018 Table of contents 1 Introduction...3 2 What s new in Update 3...3 3 What s new in Update 2...3 4 What s new in Update

More information

What s New in Oracle Cloud Infrastructure Object Storage Classic. Topics: On Oracle Cloud. Oracle Cloud

What s New in Oracle Cloud Infrastructure Object Storage Classic. Topics: On Oracle Cloud. Oracle Cloud Oracle Cloud What's New in Classic E71883-15 February 2018 What s New in Oracle Cloud Infrastructure Object Storage Classic This document describes what's new in Classic on all the infrastructure platforms

More information

Deep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services

Deep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services Deep Dive Amazon Kinesis Ian Meyers, Principal Solution Architect - Amazon Web Services Analytics Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure

More information

Abstract. Introduction

Abstract. Introduction Highly Available In-Memory Metadata Filesystem using Viewstamped Replication (https://github.com/pkvijay/metadr) Pradeep Kumar Vijay, Pedro Ulises Cuevas Berrueco Stanford cs244b-distributed Systems Abstract

More information

Streaming Log Analytics with Kafka

Streaming Log Analytics with Kafka Streaming Log Analytics with Kafka Kresten Krab Thorup, Humio CTO Log Everything, Answer Anything, In Real-Time. Why this talk? Humio is a Log Analytics system Designed to run on-prem High volume, real

More information

Scaling the Yelp s logging pipeline with Apache Kafka. Enrico

Scaling the Yelp s logging pipeline with Apache Kafka. Enrico Scaling the Yelp s logging pipeline with Apache Kafka Enrico Canzonieri enrico@yelp.com @EnricoC89 Yelp s Mission Connecting people with great local businesses. Yelp Stats As of Q1 2016 90M 102M 70% 32

More information

Building and Running a Solr-as-a-Service SHAI ERERA IBM

Building and Running a Solr-as-a-Service SHAI ERERA IBM Building and Running a Solr-as-a-Service SHAI ERERA IBM Who Am I? Working at IBM Social Analytics & Technologies Lucene/Solr committer and PMC member http://shaierera.blogspot.com shaie@apache.org Background

More information

Voltha Architecture in a clustered HA configuration. Sergio Slobodrian, Ciena CORD Build Wed, November 7 th, 2017

Voltha Architecture in a clustered HA configuration. Sergio Slobodrian, Ciena CORD Build Wed, November 7 th, 2017 Voltha Architecture in a clustered HA configuration Sergio Slobodrian, Ciena CORD Build Wed, November 7 th, 2017 Abstract This talk will dive in on VOLTHA s clustered high availability architecture including

More information

Methods of Distributed Processing for Combat Simulation Data Generation

Methods of Distributed Processing for Combat Simulation Data Generation 22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3 to 8 December 2017 mssanz.org.au/modsim2017 Methods of Distributed Processing for Combat Simulation Data Generation

More information

Maintaining a Microsoft SQL Server 2005 Database Course 2780: Three days; Instructor-Led

Maintaining a Microsoft SQL Server 2005 Database Course 2780: Three days; Instructor-Led Maintaining a Microsoft SQL Server 2005 Database Course 2780: Three days; Instructor-Led Introduction This three-day instructor-led course provides students with product knowledge and skills needed to

More information

Error code. Description of the circumstances under which the problem occurred. Less than 200. Linux system call error.

Error code. Description of the circumstances under which the problem occurred. Less than 200. Linux system call error. Error code Less than 200 Error code Error type Description of the circumstances under which the problem occurred Linux system call error. Explanation of possible causes Countermeasures 1001 CM_NO_MEMORY

More information

{ REST } vs. Battle of API s

{ REST } vs. Battle of API s { REST } vs Battle of API s Software Engineer at Sensedia Who am I? MBA in java projects Java and microservice enthusiastic Microservices Agenda REST grpc Demo Questions Moving to Microservices Monolith

More information

MySQL Multi-Site/Multi-Master Done Right

MySQL Multi-Site/Multi-Master Done Right MySQL Multi-Site/Multi-Master Done Right MySQL Clustering for HA and DR The Dream: Multiple, active DBMS servers with identical data over distance Too good to be true? High Performance High Availability

More information

ZooKeeper Getting Started Guide

ZooKeeper Getting Started Guide by Table of contents 1 Getting Started: Coordinating Distributed Applications with ZooKeeper...2 1.1 Pre-requisites... 2 1.2 Download... 2 1.3 Standalone Operation... 2 1.4 Managing ZooKeeper Storage...3

More information

UNIT IV -- TRANSPORT LAYER

UNIT IV -- TRANSPORT LAYER UNIT IV -- TRANSPORT LAYER TABLE OF CONTENTS 4.1. Transport layer. 02 4.2. Reliable delivery service. 03 4.3. Congestion control. 05 4.4. Connection establishment.. 07 4.5. Flow control 09 4.6. Transmission

More information

Cisco Expressway Cluster Creation and Maintenance

Cisco Expressway Cluster Creation and Maintenance Cisco Expressway Cluster Creation and Maintenance Deployment Guide Cisco Expressway X8.6 July 2015 Contents Introduction 4 Prerequisites 5 Upgrading an X8.n cluster to X8.6 6 Prerequisites 6 Upgrade Expressway

More information

Distributed Systems. 10. Consensus: Paxos. Paul Krzyzanowski. Rutgers University. Fall 2017

Distributed Systems. 10. Consensus: Paxos. Paul Krzyzanowski. Rutgers University. Fall 2017 Distributed Systems 10. Consensus: Paxos Paul Krzyzanowski Rutgers University Fall 2017 1 Consensus Goal Allow a group of processes to agree on a result All processes must agree on the same value The value

More information

Docker DCA EXAM. m/ Product: Demo. For More Information: Docker Certified Associate

Docker DCA EXAM.   m/ Product: Demo. For More Information:   Docker Certified Associate Page No 1 https://www.dumpsplanet.com m/ Docker DCA EXAM Docker Certified Associate Product: Demo For More Information: DCA-dumps Question: 1 Which of the following is supported by control groups? A. Manage

More information

Lessons Learned: Deploying Microservices Software Product in Customer Environments Mark Galpin, Solution Architect, JFrog, Inc.

Lessons Learned: Deploying Microservices Software Product in Customer Environments Mark Galpin, Solution Architect, JFrog, Inc. Lessons Learned: Deploying Microservices Software Product in Customer Environments Mark Galpin, Solution Architect, JFrog, Inc. Who s speaking? Mark Galpin Solution Architect @jfrog magalpin Microservices

More information

Hortonworks SmartSense

Hortonworks SmartSense Hortonworks SmartSense Installation (April 3, 2017) docs.hortonworks.com Hortonworks SmartSense: Installation Copyright 2012-2017 Hortonworks, Inc. Some rights reserved. The Hortonworks Data Platform,

More information

Acronis Data Cloud plugin for ConnectWise Automate

Acronis Data Cloud plugin for ConnectWise Automate Acronis Data Cloud plugin for ConnectWise Automate USER'S GUIDE Revision: 24.08.2018 Table of contents 1 Introduction...3 2 What's new in Update 4...3 3 What's new in Update 3...3 4 What's new in Update

More information

Přehled novinek v Hyper-V 2016 Kamil Roman

Přehled novinek v Hyper-V 2016 Kamil Roman Přehled novinek v Hyper-V 2016 Kamil Roman Mail: IT@KamilRT.net Twitter: @KamilRT blog: ITblog.KamilRT.net 1 2 3 Rising number of organizations suffer from breaches 1 1 2 2 3 3 3 4 Shielded VMs Shielded

More information

Docker LibNetwork Plugins. Explorer s Tale

Docker LibNetwork Plugins. Explorer s Tale Docker LibNetwork Plugins Explorer s Tale Why am I here? I read a code I re-read the code I realized that the code is in GO! I re-re-read the code Finally, I fixed the code Now, I can tell a story about

More information

Cisco TelePresence VCS Cluster Creation and Maintenance

Cisco TelePresence VCS Cluster Creation and Maintenance Cisco TelePresence VCS Cluster Creation and Maintenance Deployment Guide Cisco VCS X8.5 Cisco TMS 13.2 or later December 2014 Contents Introduction 4 Prerequisites 5 Upgrading an X7.1 or later cluster

More information

PostgreSQL migration from AWS RDS to EC2

PostgreSQL migration from AWS RDS to EC2 PostgreSQL migration from AWS RDS to EC2 Technology lover Worked as Software Engineer, Team lead, DevOps, DBA, Data analyst Sr. Tech Architect at Coverfox Email me at mistryhitul007@gmail.com Tweet me

More information

Security by Default. Overview CHAPTER

Security by Default. Overview CHAPTER CHAPTER 3 This section contains the following topics: Overview, page 3-1 Trust Verification Service, page 3-2 Initial Trust List, page 3-2 Autoregistration, page 3-3 Supported Cisco Unified IP Phones,

More information

High Availability for Postgres using OpenSource tools. By Jobin Augustine & HariKrishna

High Availability for Postgres using OpenSource tools. By Jobin Augustine & HariKrishna High Availability for Postgres using OpenSource tools By Jobin Augustine & HariKrishna Introduction Jobin Augustine Designation : Architect Email: jobin.augustine@openscg.com Contact No: + 91 9989932600

More information

SEGR 550 Distributed Computing. Final Exam, Fall 2011

SEGR 550 Distributed Computing. Final Exam, Fall 2011 SEGR 550 Distributed Computing Final Exam, Fall 2011 (100 points total) 1) This is a take-home examination. You must send your solutions in a PDF or text file to zhuy@seattleu.edu by the deadline. Late

More information

Create an application with local persistent volumes

Create an application with local persistent volumes Local Persistent Volumes When you specify a local volume or volumes, tasks and their associated data are pinned to the node they are first launched on and will be relaunched on that node if they terminate.

More information

The Emergence of the Datacenter Developer. Tobi Knaup, Co-Founder & CTO at

The Emergence of the Datacenter Developer. Tobi Knaup, Co-Founder & CTO at The Emergence of the Datacenter Developer Tobi Knaup, Co-Founder & CTO at Mesosphere @superguenter A Brief History of Operating Systems 2 1950 s Mainframes Punchcards No operating systems Time Sharing

More information

Create OData API for Use With Salesforce Connect

Create OData API for Use With Salesforce Connect Create OData API for Use With Salesforce Connect The following steps describe how to set up and configure Jitterbit LIVE API Platform to expose data in an easy to consume, secure form so that Salesforce

More information

Stonesoft Management Center. Release Notes Revision A

Stonesoft Management Center. Release Notes Revision A Stonesoft Management Center Release Notes 6.1.3 Revision A Contents About this release on page 2 System requirements on page 2 Build version on page 3 Compatibility on page 4 New features on page 5 Enhancements

More information

Distributed Computation Models

Distributed Computation Models Distributed Computation Models SWE 622, Spring 2017 Distributed Software Engineering Some slides ack: Jeff Dean HW4 Recap https://b.socrative.com/ Class: SWE622 2 Review Replicating state machines Case

More information

Servicing ACNS Devices and Origin Servers

Servicing ACNS Devices and Origin Servers CHAPTER 13 This chapter explains how you can minimize the impact upon content delivery services when you perform maintenance on your ACNS network devices, such as replacing failed hardware or adding or

More information

Using DC/OS for Continuous Delivery

Using DC/OS for Continuous Delivery Using DC/OS for Continuous Delivery DevPulseCon 2017 Elizabeth K. Joseph, @pleia2 Mesosphere 1 Elizabeth K. Joseph, Developer Advocate, Mesosphere 15+ years working in open source communities 10+ years

More information

Port Usage Information for the IM and Presence Service

Port Usage Information for the IM and Presence Service Port Usage Information for the Service Port usage overview, page 1 Information collated in table, page 1 service port list, page 2 Port usage overview This document provides a list of the and ports that

More information

McAfee Network Security Platform 8.3

McAfee Network Security Platform 8.3 8.3.7.28-8.3.3.9 Manager-Mxx30-series Release Notes McAfee Network Security Platform 8.3 Revision C Contents About this release New features Enhancements Resolved issues Installation instructions Known

More information

@unterstein #bedcon. Operating microservices with Apache Mesos and DC/OS

@unterstein #bedcon. Operating microservices with Apache Mesos and DC/OS @unterstein @dcos @bedcon #bedcon Operating microservices with Apache Mesos and DC/OS 1 Johannes Unterstein Software Engineer @Mesosphere @unterstein @unterstein.mesosphere 2017 Mesosphere, Inc. All Rights

More information

Database Mirroring & Snapshots SQL Server 2008

Database Mirroring & Snapshots SQL Server 2008 Database Mirroring & Snapshots DB Architechs, USA 1 DB Architechs, France Agenda SQL Server User Group Database Mirroring Database Mirroring 2005 vs 2008 benchmark Database Snapshots SQL Performance &

More information

Automation of Rolling Upgrade for Hadoop Cluster without Data Loss and Job Failures. Hiroshi Yamaguchi & Hiroyuki Adachi

Automation of Rolling Upgrade for Hadoop Cluster without Data Loss and Job Failures. Hiroshi Yamaguchi & Hiroyuki Adachi Automation of Rolling Upgrade for Hadoop Cluster without Data Loss and Job Failures Hiroshi Yamaguchi & Hiroyuki Adachi About Us 2 Hiroshi Yamaguchi Hiroyuki Adachi Hadoop DevOps Engineer Hadoop Engineer

More information

Building Durable Real-time Data Pipeline

Building Durable Real-time Data Pipeline Building Durable Real-time Data Pipeline Apache BookKeeper at Twitter @sijieg Twitter Background Layered Architecture Agenda Design Details Performance Scale @Twitter Q & A Publish-Subscribe Online services

More information

Intra-cluster Replication for Apache Kafka. Jun Rao

Intra-cluster Replication for Apache Kafka. Jun Rao Intra-cluster Replication for Apache Kafka Jun Rao About myself Engineer at LinkedIn since 2010 Worked on Apache Kafka and Cassandra Database researcher at IBM Outline Overview of Kafka Kafka architecture

More information

SCALING LIKE TWITTER WITH APACHE MESOS

SCALING LIKE TWITTER WITH APACHE MESOS Philip Norman & Sunil Shah SCALING LIKE TWITTER WITH APACHE MESOS 1 MODERN INFRASTRUCTURE Dan the Datacenter Operator Alice the Application Developer Doesn t sleep very well Loves automation Wants to control

More information

Network Security Platform 8.1

Network Security Platform 8.1 8.1.7.100-8.1.3.130 Manager-M-series Release Notes Network Security Platform 8.1 Revision A Contents About this release New features Enhancements Resolved issues Installation instructions Known issues

More information

Replication. Version

Replication. Version Replication Version 2018.3 Contents Before you start... 3 Principles... 4 Prerequisites... 5 Initial Steps... 6 Post Setup... 7 Supported Operating Systems... 7 Perform the Setup... 8 Read Committed Snapshot

More information