A Whirlwind Tour of Apache Mesos

Similar documents
SCALING LIKE TWITTER WITH APACHE MESOS

SAMPLE CHAPTER IN ACTION. Roger Ignazio. FOREWORD BY Florian Leibert MANNING

POWERING THE INTERNET WITH APACHE MESOS

Building a Data-Friendly Platform for a Data- Driven Future

@unterstein #bedcon. Operating microservices with Apache Mesos and DC/OS

Key aspects of cloud computing. Towards fuller utilization. Two main sources of resource demand. Cluster Scheduling

Building/Running Distributed Systems with Apache Mesos

Scale your Docker containers with Mesos

利用 Mesos 打造高延展性 Container 環境. Frank, Microsoft MTC

A Platform for Fine-Grained Resource Sharing in the Data Center

Mesosphere and the Enterprise: Run Your Applications on Apache Mesos. Steve Wong Open Source Engineer {code} by Dell

Key aspects of cloud computing. Towards fuller utilization. Two main sources of resource demand. Cluster Scheduling

Deploying Applications on DC/OS

AGILE DEVELOPMENT AND PAAS USING THE MESOSPHERE DCOS

Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS

Mesosphere and Percona Server for MongoDB. Peter Schwaller, Senior Director Server Eng. (Percona) Taco Scargo, Senior Solution Engineer (Mesosphere)

The SMACK Stack: Spark*, Mesos*, Akka, Cassandra*, Kafka* Elizabeth K. Dublin Apache Kafka Meetup, 30 August 2017.

CONTINUOUS DELIVERY WITH MESOS, DC/OS AND JENKINS

MESOS A State-Of-The-Art Container Orchestrator Mesosphere, Inc. All Rights Reserved. 1

Mesosphere and Percona Server for MongoDB. Jeff Sandstrom, Product Manager (Percona) Ravi Yadav, Tech. Partnerships Lead (Mesosphere)

The Datacenter Needs an Operating System

CONTINUOUS DELIVERY WITH DC/OS AND JENKINS

@joerg_schad Nightmares of a Container Orchestration System

Container Orchestration on Amazon Web Services. Arun

The Emergence of the Datacenter Developer. Tobi Knaup, Co-Founder & CTO at

Using DC/OS for Continuous Delivery

Sunil Shah SECURE, FLEXIBLE CONTINUOUS DELIVERY PIPELINES WITH GITLAB AND DC/OS Mesosphere, Inc. All Rights Reserved.

Introduction to Mesos and the Datacenter Operating System

Storm. Distributed and fault-tolerant realtime computation. Nathan Marz Twitter

Advantages of using DC/OS Azure infrastructure and the implementation architecture Bill of materials used to construct DC/OS and the ACS clusters

Container 2.0. Container: check! But what about persistent data, big data or fast data?!

Networking & Security for Mesos

Servers & Developers. Julian Nadeau Production Engineer

Deploy Like A Boss Oliver Nicholas

Large-scale cluster management at Google with Borg

what is cloud computing?

Mesos: Mul)programing for Datacenters

An Introduction to Apache Spark

Processing of big data with Apache Spark

Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center

DATA SCIENCE USING SPARK: AN INTRODUCTION

REAL-TIME ANALYTICS WITH APACHE STORM

How Container Schedulers and Software-based Storage will Change the Cloud

Twitter data Analytics using Distributed Computing

Fault Domains in Mesos. Vinod Kone

Practical Considerations for Multi- Level Schedulers. Benjamin

Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion Stoica. University of California, Berkeley nsdi 11

STORM AND LOW-LATENCY PROCESSING.

Supporting GPUs in Docker Containers on Apache Mesos

APACHE COTTON. MySQL on Mesos. Yan Xu xujyan

Containerization Dockers / Mesospere. Arno Keller HPE

Index. Raul Estrada and Isaac Ruiz 2016 R. Estrada and I. Ruiz, Big Data SMACK, DOI /

MANAGING MESOS, DOCKER, AND CHRONOS WITH PUPPET

An Introduction to Kubernetes

Spark Overview. Professor Sasu Tarkoma.

An Enhanced Approach for Resource Management Optimization in Hadoop

Mesos: A Pla+orm for Fine- Grained Resource Sharing in the Data Center

SECURING A MARATHON INSTALLATION 2016

Resilient Distributed Datasets

Before proceeding with this tutorial, you must have a good understanding of Core Java and any of the Linux flavors.

Important DevOps Technologies (3+2+3days) for Deployment

DC/OS Metrics. (formerly known as Project Ambrose) Application and Resource Metrics in DC/OS Enterprise. Nick Parker at..

How to Keep UP Through Digital Transformation with Next-Generation App Development

BASIC INTER/INTRA IPC. Operating System (Linux, Windows) HARDWARE. Scheduling Framework (Mesos, YARN, etc) HERON S GENERAL-PURPOSE ARCHITECTURE

Stream Processing on IoT Devices using Calvin Framework

Reactive App using Actor model & Apache Spark. Rahul Kumar Software

Scalable Streaming Analytics

Intra-cluster Replication for Apache Kafka. Jun Rao

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Webinar Series TMIP VISION

Network Function Virtualization over Open DC/OS Yung-Han Chen

Big Data Security. Facing the challenge

Analytic Cloud with. Shelly Garion. IBM Research -- Haifa IBM Corporation

Page 1. Goals for Today" Background of Cloud Computing" Sources Driving Big Data" CS162 Operating Systems and Systems Programming Lecture 24

Personal Statement. Skillset I MongoDB / Cassandra / Redis / CouchDB. My name is Dale-Kurt Murray. I'm a Solutiof

Distributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA

Onto Petaflops with Kubernetes

Scheduling Applications at Scale

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect

Kubernetes: Integration vs Native Solution

Hadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved

Cloud Computing & Visualization

MapReduce, Hadoop and Spark. Bompotas Agorakis

Everything You Ever Wanted To Know About Resource Scheduling... Almost

Distributed ETL. A lightweight, pluggable, and scalable ingestion service for real-time data. Joe Wang

COSC 6339 Big Data Analytics. Introduction to Spark. Edgar Gabriel Fall What is SPARK?

arxiv: v1 [cs.ro] 2 May 2018

Apache Ignite TM - In- Memory Data Fabric Fast Data Meets Open Source

Real-time personal trainer on the SMACK Jan Anirvan Chakraborty

Armon HASHICORP

IBM Planning Analytics Workspace Local Distributed Soufiane Azizi. IBM Planning Analytics

Distributed Data on Distributed Infrastructure. Claudius Weinberger & Kunal Kusoorkar, ArangoDB Jörg Schad, Mesosphere

CSE 444: Database Internals. Lecture 23 Spark

Jupyter and Spark on Mesos: Best Practices. June 21 st, 2017

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info

Storm. Distributed and fault-tolerant realtime computation. Nathan Marz Twitter

Orchestration Ownage: Exploiting Container-Centric Datacenter Platforms

Integrating Apache Mesos with Science Gateways via Apache Airavata

PROFILING BASED REDUCE MEMORY PROVISIONING FOR IMPROVING THE PERFORMANCE IN HADOOP

UNIFY DATA AT MEMORY SPEED. Haoyuan (HY) Li, Alluxio Inc. VAULT Conference 2017

Transcription:

A Whirlwind Tour of Apache Mesos

About Herdy Senior Software Engineer at Citadel Technology Solutions (Singapore) The eternal student Find me on the internet: _hhandoko hhandoko hhandoko https://au.linkedin.com/in/herdyhandoko

Presentation Overview Problem Domains Mesos Fundamentals Mesos Frameworks Mesos in the Real-World Demo! Image source: https://mesosphere.com/wp-content/uploads/2015/04/dcossdashboard.jpg

Once Upon a Tweet I ve heard of: LAMP WIMP MEAN But what is SMACK? Source: https://twitter.com/theotown/status/643377504527495168

Mesos in One Paragraph Apache Mesos abstracts CPU, memory, storage, and other compute resources away from machines (physical or virtual), enabling fault-tolerant and elastic distributed systems to easily be built and run effectively. Image source: https://mesosphere.com/wp-content/themes/mesosphere/library/images/views/why-mesos/mesos-architecture.png

Mesos in One Sentence Operations / DevOps Developers / Data Scientist Next-Generation Cluster Manager Distributed Systems SDK

Mesos in One Sentence (cont d) Datacentre timesharing Image source: http://www.computersciencelab.com/computerhistory/htmlhelp/images2/ibm7094.jpg

Problem Domain: Static Partitioning Many and complex provisioning scripts Snowflake servers No automated failure handling Repartition takes hours or days

Problem Domain: Resource Management Low utilisation rate (i.e. waste) Hard to predict workload Application performance jitter Scale and capacity are coupled Image source: http://www.slideshare.net/mesosphere/apache-mesos-and-mesosphere-live-webcast-by-ceo-and-cofounder-florian-li

The Inspiration: Google Borg Top Secret orchestration system (in use since ~2004) Efficiently parcels work across Google s vast fleet of computer servers Google is building Omega (Borg vnext) Source: https://www.wired.com/2013/03/google-borg-twitter-mesos/all/

The Birth of Apache Mesos A research project at the University of California Berkeley Hindman s initial ideas from working with many-cores Intel processor (64 128 cores) Hindman teamed up with Kowinski and Zaharia who was working on software platform that work on massive data centres Twitter took a keen interest and further developed Mesos (as an opensource project) Becomes an Apache project in 2013 Source: https://www.wired.com/2013/03/google-borg-twitter-mesos/all/

Mesos Analogy to an Operating System Linux Mesos

Mesos vs Virtualization Virtualization Mesos

Mesos Architecture ZooKeeper coordinate master nodes and elect leader Mesos master manage agents and schedule Tasks Mesos agents make Offers and run Tasks

Key Concepts Frameworks Mesos understands the technical primitives of distributed computing but have no intelligence on how to do it Frameworks tell Mesos (kernel) how to run the applications A framework comprises of Scheduler and Executor Resource offers Agents advertise available resources Offers can contain user-defined attributes Resource isolation via LXC Resource allocation Roles Weights Resource Reservations

Two-tier Scheduling 1. Agents offer resources 2. Allocator decides where to offer the resources 3. Framework may accept an offer and execute a task in an agent, or 4. Framework may reject the offer and it will be passed along

App Specific Frameworks

General Purpose Framework: Marathon Container and framework orchestration platform Runs long running services (`init.d`), e.g. web applications Features High availability (active / passive) Service discovery & load balancing Health checks Event subscription REST API Image source: https://mesosphere.com/wp-content/themes/mesosphere/library/images/assets/continuous-deployment/marathon2.png

General Purpose Framework: Chronos Fault-tolerant jobs scheduler for Mesos Distributed `cron` Features Distributed and fault-tolerant Supports bash and custom executor Schedules based on ISO8601 repeating interval notation Handles jobs dependencies Image source: https://mesos.github.io/chronos/img/chronos_ui-1.png

Framework: Aurora Service orchestration framework Functionality-wise, combined Marathon + Chronos, and so much more Twitter wanted an all-in-one framework for total control Image source: http://aurora.apache.org/documentation/latest/images/components.png

BYO Framework Existing frameworks provide good coverage of most use cases (80/20) Hadoop: Batch processing Storm: Stream processing Chronos: Task scheduling Marathon / Aurora: long-running services

Custom Framework Demo!

Demo Resources Rendler Code: https://github.com/mesosphere/rendler

Mesos in Production Today

Mesos and Mesosphere Mesos is the name of the opensource Apache project Mesosphere (Mesosphere Inc.) is the company which commercializes the open source project and provides consulting services

DC/OS Demo!

Demo Resources DC/OS Installation Instructions: https://dcos.io/docs/1.7/administration/installing/cloud/packet/ Packet Hosting: https://www.packet.net Hashicorp s Terraform: https://www.hashicorp.com/terraform.html Mesosphere Tweeter App: https://github.com/mesosphere/tweeter

Predictive Scheduler: Quasar Resource efficient and QoSaware cluster manager Uses fast classification techniques in Machine Learning to profile workloads Image source: http://regmedia.co.uk/2014/02/27/quasar.jpg

Mesos on Windows Mesosphere is working with Microsoft to port Apache Mesos to work with Windows Servers Platform-specific tasks will be run on the supported nodes Image source: https://media.licdn.com/mpr/mpr/shrinknp_800_800/aaeaaqaaaaaaaarraaaajgvhztyyodflltvkzmmtnguzmi05mzrlltcyzwvlzme1ytu2ma.jpg

Fit for Purpose? Good Fit Stateless systems Web applications Spark Hadoop Poor Fit Stateful systems* Relational Database Distributed systems Cassandra *Note: Support for persistent storage volumes is under active development

Whitepapers Hindman, B., Konwinski, A., Zaharia, M., Ghodsi, A., Joseph, A.D., Katz, R.H., Shenker, S. and Stoica, I., 2011, March. Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center. In NSDI (Vol. 11, pp. 22-22). Verma, A., Pedrosa, L., Korupolu, M., Oppenheimer, D., Tune, E. and Wilkes, J., 2015, April. Large-scale cluster management at Google with Borg. In Proceedings of the Tenth European Conference on Computer Systems (p. 18). ACM.

Books

Last But Not Least

Thanks!