Scaling Data Spotify

Size: px
Start display at page:

Download "Scaling Data Spotify"

Transcription

1 Scaling Data Spotify matti@spotify.com kalvans@spotify.com

2 Mārtiņš Kalvāns Matti Pehrs

3 Agenda 1. Data at Spotify 2. Summer of Challenges & Victory Datamon Styx GABO

4 Spotify big-data context Over 100 million monthly active users Over 30 million song Over 2 billion playlists Active in 60 markets

5 Data is at the heart of Spotify In Monthly Royalty Report In Monthly Royalty Report Weekly Billboard Daily reports to partners... - AB-Testing Discover weekly Daily Mix...

6 Our growth in Data Users Developers +50 TB/day +100M Users +60 TB/day +10k M/R jobs

7 Autonomy & Dependencies Team B Team A Team C Hadoop

8 Autonomy & Dependencies

9 Autonomy & Dependencies

10 Autonomy & Dependencies

11 Summer of Incidents

12 Summer of Incidents A strain of incidents

13 Summer of Incidents A strain of incidents War-room

14 Summer of Incidents A strain of incidents War-room Hadoop on it s knees

15 Summer of Incidents A strain of incidents War-room Hadoop on it s knees Event Delivery Catch up

16 Summer of Incidents A strain of incidents War-room Hadoop on it s knees Event Delivery Catch up Reprocessing of data

17 Summer of Incidents A strain of incidents War-room Hadoop on it s knees Event Delivery Catch up Reprocessing of data Hard to debug data issues

18 Challenges and the path to victory...

19 Challenges and the path to victory Early Warning Datamon - Data monitoring

20 Challenges and the path to victory Early Warning Datamon - Data monitoring 2. Debuggability & Control Styx - Scheduling and control

21 Challenges and the path to victory Early Warning Datamon - Data monitoring 2. Debuggability & Control Styx - Scheduling and control 3. Automate Capacity GABO - Event Delivery

22 Challenges and the path to victory Early Warning Datamon - Data monitoring 2. Debuggability & Control Styx - Scheduling and control 3. Automate Capacity GABO - Event Delivery

23 Early Warning - Datamon

24 Early Warning - Datamon Unified view Ownership Alignment between teams Clear ownership of data SLA Alert on late data

25 Early Warning - Datamon Define terminology Provide metadata language Implement a Datamon service

26 Challenges and the path to victory Early Warning Datamon - Data monitoring 2. Debuggability & Control Styx - Scheduling and control 3. Automate Capacity GABO - Event Delivery

27 Debuggability & Control - Styx - Execution control - Self service for data users - Execution information - Expose debug information - Execution isolation - Docker for data jobs The river Styx

28 Debuggability & Control - Styx Execution control Centralized execution API

29 Debuggability & Control - Styx Execution control Centralized execution API Backfilling and reprocessing

30 Debuggability & Control - Styx Execution control Execution information Timeline

31 Debuggability & Control - Styx Execution control Execution information Timeline Google Cloud Logging

32 Debuggability & Control - Styx Execution control Execution information Execution isolation Docker

33 Challenges and the path to victory Early Warning Datamon - Data monitoring 2. Debuggability & Control Styx - Scheduling and control 3. Automate Capacity GABO - Event Delivery

34 Automate Capacity - GABO/Event Delivery Complex and manual config

35 Automate Capacity - GABO/Event Delivery Complex and manual config Pubsub & Dataflow streaming

36 Automate Capacity - GABO/Event Delivery Complex and manual config Pubsub & Dataflow streaming Pubsubs at scale

37 Automate Capacity - GABO/Event Delivery Complex and manual config Pubsub & Dataflow streaming Pubsubs at scale Dataflow streaming

38 Automate Capacity - GABO/Event Delivery Complex and manual config Pubsub & Dataflow streaming Pubsubs at scale Dataflow streaming :-( 2 micro services + 1 Map/Reduce job

39 Automate Capacity - GABO/Event Delivery Complex and manual config Pubsub & Dataflow streaming Pubsubs at scale Dataflow streaming :-( 2 micro services + 1 Map/Reduce job Autoscaling & The Stuffer

40 GABO - WIP Handles at least 10x our load Darkloading Autoscale everything Self service

41 Summary Make sure you have the right tools to deal with data incidents Make sure you have time to implement the tools you need Remember that your capacity model can fail at larger scale Keep track of your scale and Automate, automate, automate...

42 Thank you! Want to join the band?

Streaming analytics better than batch - when and why? _Adam Kawa - Dawid Wysakowicz_

Streaming analytics better than batch - when and why? _Adam Kawa - Dawid Wysakowicz_ Streaming analytics better than batch - when and why? _Adam Kawa - Dawid Wysakowicz_ About Us At GetInData, we build custom Big Data solutions Hadoop, Flink, Spark, Kafka and more Our team is today represented

More information

Big Data Infrastructure at Spotify

Big Data Infrastructure at Spotify Big Data Infrastructure at Spotify Wouter de Bie Team Lead Data Infrastructure September 26, 2013 2 Who am I? According to ZDNet: "The work they have done to improve the Apache Hive data warehouse system

More information

Data Processing with Apache Beam (incubating) and Google Cloud Dataflow

Data Processing with Apache Beam (incubating) and Google Cloud Dataflow Data Processing with Apache Beam (incubating) and Google Cloud Dataflow Jelena Pjesivac-Grbovic Staff software engineer Cloud Big Data In collaboration with Frances Perry, Tayler Akidau, and Dataflow team

More information

NEW STRATEGIES FOR EFFECTIVE

NEW STRATEGIES FOR EFFECTIVE NEW STRATEGIES FOR EFFECTIVE EMAIL Presented By: Brendan Cameron, Senior Project Manager, Ecommerce Department February 26, 2015 AGENDA WHAT IS BROADCAST EMAIL? GENERATING CONTENT ACQUIRING CONTACTS CREATING

More information

Google Cloud Dataflow

Google Cloud Dataflow Google Cloud Dataflow A Unified Model for Batch and Streaming Data Processing Jelena Pjesivac-Grbovic STREAM 2015 Agenda 1 Data Shapes 2 Data Processing Tradeoffs 3 Google s Data Processing Story 4 Google

More information

Revamped and Automated the infrastructure for NTN Buzztime

Revamped and Automated the infrastructure for NTN Buzztime Revamped and Automated the infrastructure for NTN Buzztime Executive Summary NTN Buzztime Inc. was looking for scalable infrastructure with a new platform that could support display of real-time restaurant

More information

The Technology of the Business Data Lake. Appendix

The Technology of the Business Data Lake. Appendix The Technology of the Business Data Lake Appendix Pivotal data products Term Greenplum Database GemFire Pivotal HD Spring XD Pivotal Data Dispatch Pivotal Analytics Description A massively parallel platform

More information

An Introduction to The Beam Model

An Introduction to The Beam Model An Introduction to The Beam Model Apache Beam (incubating) Slides by Tyler Akidau & Frances Perry, April 2016 Agenda 1 Infinite, Out-of-order Data Sets 2 The Evolution of the Beam Model 3 What, Where,

More information

StreamSets Control Hub Installation Guide

StreamSets Control Hub Installation Guide StreamSets Control Hub Installation Guide Version 3.2.1 2018, StreamSets, Inc. All rights reserved. Table of Contents 2 Table of Contents Chapter 1: What's New...1 What's New in 3.2.1... 2 What's New in

More information

Scaling with Continuous Deployment

Scaling with Continuous Deployment Scaling with Continuous Deployment Web 2.0 Expo New York, NY, September 29, 2010 Brett G. Durrett (@bdurrett) Vice President Engineering & Operations, IMVU, Inc. 0 An online community where members use

More information

Scaling Marketplaces at Thumbtack QCon SF 2017

Scaling Marketplaces at Thumbtack QCon SF 2017 Scaling Marketplaces at Thumbtack QCon SF 2017 Nate Kupp Technical Infrastructure Data Eng, Experimentation, Platform Infrastructure, Security, Dev Tools Infrastructure from early beginnings You see that?

More information

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions

More information

Evolution of Big Data Facebook. Architecture Summit, Shenzhen, August 2012 Ashish Thusoo

Evolution of Big Data Facebook. Architecture Summit, Shenzhen, August 2012 Ashish Thusoo Evolution of Big Data Architectures@ Facebook Architecture Summit, Shenzhen, August 2012 Ashish Thusoo About Me Currently Co-founder/CEO of Qubole Ran the Data Infrastructure Team at Facebook till 2011

More information

Apache Beam. Modèle de programmation unifié pour Big Data

Apache Beam. Modèle de programmation unifié pour Big Data Apache Beam Modèle de programmation unifié pour Big Data Who am I? Jean-Baptiste Onofre @jbonofre http://blog.nanthrax.net Member of the Apache Software Foundation

More information

DevOps in the Cloud A pipeline to heaven?! Robert Cowham BCS CMSG Vice Chair

DevOps in the Cloud A pipeline to heaven?! Robert Cowham BCS CMSG Vice Chair DevOps in the Cloud A pipeline to heaven?! Robert Cowham BCS CMSG Vice Chair Agenda Definitions, History & Background Cloud intro DevOps Pipelines Docker containers Examples 2 Definitions DevOps Agile

More information

Google GSuite Intro Demo of GSuite and GCP integration

Google GSuite Intro Demo of GSuite and GCP integration Google GSuite Intro Demo of GSuite and GCP integration May 2017 Sara Djelassi - Sales Steve Mansfield - PSO 7 Cloud products with 1 billion users ML is core to differentiating Google services Search Search

More information

SOCIAL MEDIA. Charles Murphy

SOCIAL MEDIA. Charles Murphy SOCIAL MEDIA Charles Murphy Social Media Overview 1. Introduction 2. Social Media Areas Blogging Bookmarking Deals Location-based Music Photo sharing Video 3. The Fab Four FaceBook Google+ Linked In Twitter

More information

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Copyright 2012, Oracle and/or its affiliates. All rights reserved. 1 Big Data Connectors: High Performance Integration for Hadoop and Oracle Database Melli Annamalai Sue Mavris Rob Abbott 2 Program Agenda Big Data Connectors: Brief Overview Connecting Hadoop with Oracle

More information

TOWARDS PORTABILITY AND BEYOND. Maximilian maximilianmichels.com DATA PROCESSING WITH APACHE BEAM

TOWARDS PORTABILITY AND BEYOND. Maximilian maximilianmichels.com DATA PROCESSING WITH APACHE BEAM TOWARDS PORTABILITY AND BEYOND Maximilian Michels mxm@apache.org DATA PROCESSING WITH APACHE BEAM @stadtlegende maximilianmichels.com !2 BEAM VISION Write Pipeline Execute SDKs Runners Backends !3 THE

More information

CONTINUOUS DELIVERY WITH EFFICIENT AUTOMATED TESTING IN CLOUD

CONTINUOUS DELIVERY WITH EFFICIENT AUTOMATED TESTING IN CLOUD Budapest, 26-28 October 2016 CONTINUOUS DELIVERY WITH EFFICIENT AUTOMATED TESTING IN CLOUD Presented by [Teresa Song] All rights reserved Agenda Product Introduction Challenge & Solution 2 All rights reserved

More information

EXIN BCS SIAM Foundation. Sample Exam. Edition

EXIN BCS SIAM Foundation. Sample Exam. Edition EXIN BCS SIAM Foundation Sample Exam Edition 201704 Copyright EXIN Holding B.V. and BCS, 2017. All rights reserved. EXIN is a registered trademark. SIAM is a registered trademark. ITIL is a registered

More information

Pay TV solution from ADB

Pay TV solution from ADB Pay TV solution from ADB Complete solution for broadcast and broadband environment Integrated with personalised recommendations Consistent content discovery across multiple devices Entire functionality

More information

CIS 612 Advanced Topics in Database Big Data Project Lawrence Ni, Priya Patil, James Tench

CIS 612 Advanced Topics in Database Big Data Project Lawrence Ni, Priya Patil, James Tench CIS 612 Advanced Topics in Database Big Data Project Lawrence Ni, Priya Patil, James Tench Abstract Implementing a Hadoop-based system for processing big data and doing analytics is a topic which has been

More information

Please give me your feedback

Please give me your feedback #HPEDiscover Please give me your feedback Session ID: B4385 Speaker: Aaron Spurlock Use the mobile app to complete a session survey 1. Access My schedule 2. Click on the session detail page 3. Scroll down

More information

Music, Radio & Podcasts

Music, Radio & Podcasts Music, Radio & Podcasts *Buying Music *Streaming Music *Radio Online *Podcasts Buying Music (downloading): itunes Store, Amazon. Single tracks are mostly $1.29. Older music is less. Album prices vary.

More information

/ Cloud Computing. Recitation 3 Sep 13 & 15, 2016

/ Cloud Computing. Recitation 3 Sep 13 & 15, 2016 15-319 / 15-619 Cloud Computing Recitation 3 Sep 13 & 15, 2016 1 Overview Administrative Issues Last Week s Reflection Project 1.1, OLI Unit 1, Quiz 1 This Week s Schedule Project1.2, OLI Unit 2, Module

More information

UNIFY DATA AT MEMORY SPEED. Haoyuan (HY) Li, Alluxio Inc. VAULT Conference 2017

UNIFY DATA AT MEMORY SPEED. Haoyuan (HY) Li, Alluxio Inc. VAULT Conference 2017 UNIFY DATA AT MEMORY SPEED Haoyuan (HY) Li, CEO @ Alluxio Inc. VAULT Conference 2017 March 2017 HISTORY Started at UC Berkeley AMPLab In Summer 2012 Originally named as Tachyon Rebranded to Alluxio in

More information

CASE STUDY FINANCE Digital Outsource Services

CASE STUDY FINANCE Digital Outsource Services CASE STUDY FINANCE Digital Outsource Services How the team catch database deployment problems with Redgate SQL Monitor "...we connect to all the servers from the one interface. It lets us see everything

More information

Digital Enterprise Platform for Live Business. Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU

Digital Enterprise Platform for Live Business. Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU Digital Enterprise Platform for Live Business Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU Rethinking the Future Competing in today s marketplace means leveraging

More information

Navigating Cisco Prime Internetwork Performance Monitor Tasks in LMS 4.1

Navigating Cisco Prime Internetwork Performance Monitor Tasks in LMS 4.1 CHAPTER 7 Navigating Cisco Prime Internetwork Performance Monitor Tasks in LMS 4.1 Cisco Prime Monitor (IPM) is a network troubleshooting application that gauges network response time and availability.

More information

The communications, media, and entertainment industry and the cloud.

The communications, media, and entertainment industry and the cloud. Responding to Change The communications, media, and entertainment industry and the cloud. Cloud survey report: Industry experts explore the current and planned cloud adoption strategies of senior IT professionals.

More information

Splitted-Desktop Systems (SDS) Jean-Marie Verdun President

Splitted-Desktop Systems (SDS) Jean-Marie Verdun President Splitted-Desktop Systems (SDS) Jean-Marie Verdun President We rethink everything SDS is a french design house focusing on designing today s computer and datacenter infrastructure. We care about our earth,

More information

Sentinet for BizTalk Server VERSION 2.2

Sentinet for BizTalk Server VERSION 2.2 for BizTalk Server VERSION 2.2 for BizTalk Server 1 Contents Introduction... 2 SOA Repository... 2 Security... 3 Mediation and Virtualization... 3 Authentication and Authorization... 4 Monitoring, Recording

More information

OpenShift Roadmap Enterprise Kubernetes for Developers. Clayton Coleman, Architect, OpenShift

OpenShift Roadmap Enterprise Kubernetes for Developers. Clayton Coleman, Architect, OpenShift OpenShift Roadmap Enterprise Kubernetes for Developers Clayton Coleman, Architect, OpenShift What Is OpenShift? Application-centric Platform INFRASTRUCTURE APPLICATIONS Use containers for efficiency Hide

More information

Zero to Microservices in 5 minutes using Docker Containers. Mathew Lodge Weaveworks

Zero to Microservices in 5 minutes using Docker Containers. Mathew Lodge Weaveworks Zero to Microservices in 5 minutes using Docker Containers Mathew Lodge (@mathewlodge) Weaveworks (@weaveworks) https://www.weave.works/ 2 Going faster with software delivery is now a business issue Software

More information

Migrating massive monitoring to Bigtable without downtime. Martin Parm, Infrastructure Engineer for Monitoring

Migrating massive monitoring to Bigtable without downtime. Martin Parm, Infrastructure Engineer for Monitoring Migrating massive monitoring to Bigtable without downtime Martin Parm, Infrastructure Engineer for Monitoring This is a big deal. -- Nicholas Harteau/VP, Engineering & Infrastructure https://news.spotify.com/dk/2016/02/23/announcing-spotify-infrastructures-googley-future/

More information

this is not a love song How Spotify Uses Tableau Public Skyler Johnson

this is not a love song How Spotify Uses Tableau Public Skyler Johnson this is not a love song How Spotify Uses Tableau Public Skyler Johnson Data Visualization Designer Spotify Intro Tableau Desktop/Server @ Spotify 2018.1, 8-core Primary & Worker usually upgrade soon

More information

CS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014

CS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014 CS15-319 / 15-619 Cloud Computing Recitation 3 September 9 th & 11 th, 2014 Overview Last Week s Reflection --Project 1.1, Quiz 1, Unit 1 This Week s Schedule --Unit2 (module 3 & 4), Project 1.2 Questions

More information

Container Orchestration on Amazon Web Services. Arun

Container Orchestration on Amazon Web Services. Arun Container Orchestration on Amazon Web Services Arun Gupta, @arungupta Docker Workflow Development using Docker Docker Community Edition Docker for Mac/Windows/Linux Monthly edge and quarterly stable

More information

Data in the Cloud and Analytics in the Lake

Data in the Cloud and Analytics in the Lake Data in the Cloud and Analytics in the Lake Introduction Working in Analytics for over 5 years Part the digital team at BNZ for 3 years Based in the Auckland office Preferred Languages SQL Python (PySpark)

More information

USERS CONFERENCE Copyright 2016 OSIsoft, LLC

USERS CONFERENCE Copyright 2016 OSIsoft, LLC Bridge IT and OT with a process data warehouse Presented by Matt Ziegler, OSIsoft Complexity Problem Complexity Drives the Need for Integrators Disparate assets or interacting one-by-one Monitoring Real-time

More information

Scaling LEGO & Spotify. Henrik

Scaling LEGO & Spotify. Henrik Consultant www.crisp.se Scaling Agile @ LEGO & Spotify EA-träff Okt 2017 henrik.kniberg@crisp.se @HenrikKniberg Dad Climate guy Organizational coach & Change Instigator Author Not too hard A bit trickier

More information

First Look at Built-in Autoscaling and Alerting. Paul blog.paulbouwer.com

First Look at Built-in Autoscaling and Alerting. Paul blog.paulbouwer.com First Look at Built-in Autoscaling and Alerting Paul Bouwer @pbouwer blog.paulbouwer.com The power of cloud economics is elasticity - the ability to pay for resources only when they are needed and to

More information

Improving efficiency of Twitter Infrastructure using Chargeback

Improving efficiency of Twitter Infrastructure using Chargeback Improving efficiency of Twitter Infrastructure using Chargeback @vinucharanya @micheal AGENDA Brief History Problem Chargeback Engineering Challenges The product Impact Future Getty Images from http://www.fifa.com/worldcup/news/y=2010/m=7/news=pride-for-africa-spain-strike-gold-2247372.html

More information

End to End Analysis on System z IBM Transaction Analysis Workbench for z/os. James Martin IBM Tools Product SME August 10, 2015

End to End Analysis on System z IBM Transaction Analysis Workbench for z/os. James Martin IBM Tools Product SME August 10, 2015 End to End Analysis on System z IBM Transaction Analysis Workbench for z/os James Martin IBM Tools Product SME August 10, 2015 Please note IBM s statements regarding its plans, directions, and intent are

More information

Fundamentals of Stream Processing with Apache Beam (incubating)

Fundamentals of Stream Processing with Apache Beam (incubating) Google Docs version of slides (including animations): https://goo.gl/yzvlxe Fundamentals of Stream Processing with Apache Beam (incubating) Frances Perry & Tyler Akidau @francesjperry, @takidau Apache

More information

Introduction to Apache Beam

Introduction to Apache Beam Introduction to Apache Beam Dan Halperin JB Onofré Google Beam podling PMC Talend Beam Champion & PMC Apache Member Apache Beam is a unified programming model designed to provide efficient and portable

More information

Tour of Database Platforms as a Service. June 2016 Warner Chaves Christo Kutrovsky Solutions Architect

Tour of Database Platforms as a Service. June 2016 Warner Chaves Christo Kutrovsky Solutions Architect Tour of Database Platforms as a Service June 2016 Warner Chaves Christo Kutrovsky Solutions Architect Bio Solutions Architect at Pythian Specialize high performance data processing and analytics 15 years

More information

Acronis Data Cloud plugin for ConnectWise Automate

Acronis Data Cloud plugin for ConnectWise Automate Acronis Data Cloud plugin for ConnectWise Automate USER'S GUIDE Revision: 24.08.2018 Table of contents 1 Introduction...3 2 What's new in Update 4...3 3 What's new in Update 3...3 4 What's new in Update

More information

DevOps on AWS Deep Dive on Continuous Delivery and the AWS Developer Tools

DevOps on AWS Deep Dive on Continuous Delivery and the AWS Developer Tools DevOps on AWS Deep Dive on Continuous Delivery and the AWS Developer Tools Woody Borraccino, AWS Solutions Architect May 4, 2016, Stockholm 2016, Amazon Web Services, Inc. or its Affiliates. All rights

More information

Automation of Rolling Upgrade for Hadoop Cluster without Data Loss and Job Failures. Hiroshi Yamaguchi & Hiroyuki Adachi

Automation of Rolling Upgrade for Hadoop Cluster without Data Loss and Job Failures. Hiroshi Yamaguchi & Hiroyuki Adachi Automation of Rolling Upgrade for Hadoop Cluster without Data Loss and Job Failures Hiroshi Yamaguchi & Hiroyuki Adachi About Us 2 Hiroshi Yamaguchi Hiroyuki Adachi Hadoop DevOps Engineer Hadoop Engineer

More information

Uptime and Proactive Support Services

Uptime and Proactive Support Services Uptime and Proactive Support Services We ll accelerate your journey to sustainable IT optimisation and ensure that your technology is delivering all that it can. We ll keep your IT infrastructure up and

More information

Processing Data Like Google Using the Dataflow/Beam Model

Processing Data Like Google Using the Dataflow/Beam Model Todd Reedy Google for Work Sales Engineer Google Processing Data Like Google Using the Dataflow/Beam Model Goals: Write interesting computations Run in both batch & streaming Use custom timestamps Handle

More information

Automating, Securing, and Managing Cox Automotive's (AutoTrader) Big Data Infrastructure

Automating, Securing, and Managing Cox Automotive's (AutoTrader) Big Data Infrastructure Automating, Securing, and Managing Cox Automotive's (AutoTrader) Big Data Infrastructure with Ansible, Insights, and Satellite Abdi Hersi, Senior Linux Engineer Cox Automotive Will Nix, Principal Technical

More information

Cloudera Impala Headline Goes Here

Cloudera Impala Headline Goes Here Cloudera Impala Headline Goes Here JusAn Erickson Senior Product Manager Speaker Name or Subhead Goes Here February 2013 DO NOT USE PUBLICLY PRIOR TO 10/23/12 Agenda Intro to Impala Architectural Overview

More information

Assignment 3 ITCS-6010/8010: Cloud Computing for Data Analysis

Assignment 3 ITCS-6010/8010: Cloud Computing for Data Analysis Assignment 3 ITCS-6010/8010: Cloud Computing for Data Analysis Due by 11:59:59pm on Tuesday, March 16, 2010 This assignment is based on a similar assignment developed at the University of Washington. Running

More information

Using DC/OS for Continuous Delivery

Using DC/OS for Continuous Delivery Using DC/OS for Continuous Delivery DevPulseCon 2017 Elizabeth K. Joseph, @pleia2 Mesosphere 1 Elizabeth K. Joseph, Developer Advocate, Mesosphere 15+ years working in open source communities 10+ years

More information

@unterstein #bedcon. Operating microservices with Apache Mesos and DC/OS

@unterstein #bedcon. Operating microservices with Apache Mesos and DC/OS @unterstein @dcos @bedcon #bedcon Operating microservices with Apache Mesos and DC/OS 1 Johannes Unterstein Software Engineer @Mesosphere @unterstein @unterstein.mesosphere 2017 Mesosphere, Inc. All Rights

More information

Microservices Architekturen aufbauen, aber wie?

Microservices Architekturen aufbauen, aber wie? Microservices Architekturen aufbauen, aber wie? Constantin Gonzalez, Principal Solutions Architect glez@amazon.de, @zalez 30. Juni 2016 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

More information

Cloudline Autonomous Driving Solutions. Accelerating insights through a new generation of Data and Analytics October, 2018

Cloudline Autonomous Driving Solutions. Accelerating insights through a new generation of Data and Analytics October, 2018 Cloudline Autonomous Driving Solutions Accelerating insights through a new generation of Data and Analytics October, 2018 HPE big data analytics solutions power the data-driven enterprise Secure, workload-optimized

More information

Architecture of Flink's Streaming Runtime. Robert

Architecture of Flink's Streaming Runtime. Robert Architecture of Flink's Streaming Runtime Robert Metzger @rmetzger_ rmetzger@apache.org What is stream processing Real-world data is unbounded and is pushed to systems Right now: people are using the batch

More information

MEDIAMPLIFY : A Cloud to Cable TV Platform for Music, TV, and Video Dr. Edwin A. Hernandez Chief Technology Officer EGLA COMMUNICATIONS

MEDIAMPLIFY : A Cloud to Cable TV Platform for Music, TV, and Video Dr. Edwin A. Hernandez Chief Technology Officer EGLA COMMUNICATIONS MEDIAMPLIFY : Amplify your reach A Cloud to Cable TV Platform for Music, TV, and Video Dr. Edwin A. Hernandez Chief Technology Officer Mediamplify is the one stop shop multi-platform media distribution

More information

WGAW Start Button Please remember, Start Button does not replace your Declaration of Earnings

WGAW Start Button Please remember, Start Button does not replace your Declaration of Earnings Please remember, Start Button does not replace your Declaration of Earnings Overview Start Button is a tool to: Inform the Guild when writers begin working on a step and when the draft is scheduled to

More information

Connected vehicle cloud Commercial presentation

Connected vehicle cloud Commercial presentation Connected vehicle cloud 17.1 Commercial presentation 2017.07.01 The evolution of the automotive industry 2017 2020 2025 2030 A vehicle can generate 1 GB of data per second By 2020 automakers could make

More information

TC2-Computer Literacy Mr. Sencer February 4, 2010

TC2-Computer Literacy Mr. Sencer February 4, 2010 TC2-Computer Literacy Mr. Sencer February 4, 2010 What is a network? A network is a collection of computers and devices connected together, sometimes wirelessly, via communications devices. When a computer

More information

Bite of Science Austin Alfredo Pérez Applications Engineer, Silicon Labs. January 27, 2016

Bite of Science Austin Alfredo Pérez Applications Engineer, Silicon Labs. January 27, 2016 Bite of Science Austin Alfredo Pérez Applications Engineer, Silicon Labs January 27, 2016 1 Agenda About Me Silicon Labs Who are we? What do we do? Applications Engineering Functions Skills needed IoT

More information

Orchestrating an OpenStack* based IoT Smart Home

Orchestrating an OpenStack* based IoT Smart Home Orchestrating an OpenStack* based IoT Smart Home Michael Kadera, John Geier, Dr. Yih Leong Sun Intel Open Source Technology Center 26th October, Wednesday, 17:55-18:35 *Other names and brands may be claimed

More information

The OpenCirrus TM Project: A global Testbed for Cloud Computing R&D

The OpenCirrus TM Project: A global Testbed for Cloud Computing R&D The OpenCirrus TM Project: A global Testbed for Cloud Computing R&D Marcel Kunze Steinbuch Centre for Computing (SCC) Karlsruhe Institute of Technology (KIT) Germany KIT The cooperation of Forschungszentrum

More information

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

How can you implement this through a script that a scheduling daemon runs daily on the application servers? You ve been tasked with implementing an automated data backup solution for your application servers that run on Amazon EC2 with Amazon EBS volumes. You want to use a distributed data store for your backups

More information

The Mobile World Introduction

The Mobile World Introduction TABLE OF CONTENTS The Mobile World 3 SMS for Courier & Postal Services - Introduction 7 SMS for Courier & Postal Services Outbound SMS 8 SMS for Courier & Postal Services Inbound SMS 10 Technical Overview

More information

Using Apache Beam for Batch, Streaming, and Everything in Between. Dan Halperin Apache Beam PMC Senior Software Engineer, Google

Using Apache Beam for Batch, Streaming, and Everything in Between. Dan Halperin Apache Beam PMC Senior Software Engineer, Google Abstract Apache Beam is a unified programming model capable of expressing a wide variety of both traditional batch and complex streaming use cases. By neatly separating properties of the data from run-time

More information

Streaming Auto-Scaling in Google Cloud Dataflow

Streaming Auto-Scaling in Google Cloud Dataflow Streaming Auto-Scaling in Google Cloud Dataflow Manuel Fahndrich Software Engineer Google Addictive Mobile Game https://commons.wikimedia.org/wiki/file:globe_centered_in_the_atlantic_ocean_(green_and_grey_globe_scheme).svg

More information

Better skilled workforce

Better skilled workforce Better skilled workforce for the New Style of Business HPE Education Services November 20, 2015 Education is the most powerful weapon which you can use to change the world Nelson Mandela The New Style

More information

Container 2.0. Container: check! But what about persistent data, big data or fast data?!

Container 2.0. Container: check! But what about persistent data, big data or fast data?! @unterstein @joerg_schad @dcos @jaxdevops Container 2.0 Container: check! But what about persistent data, big data or fast data?! 1 Jörg Schad Distributed Systems Engineer @joerg_schad Johannes Unterstein

More information

Andrew Durant/Ellen Sullivan

Andrew Durant/Ellen Sullivan AGENDA ITEM: 3.5 INFORMATION MANAGEMENT, TECHNOLOGY AND GOVERNANCE COMMITTEE DATE OF MEETING: 2 OCTOBER 2018 Subject : Approved and Presented by: Prepared by: Other Committees and meetings considered at:

More information

Data Management at Cloud Scale CommVault Simpana v10. VMware Partner Exchange Session SPO2308 February 2013

Data Management at Cloud Scale CommVault Simpana v10. VMware Partner Exchange Session SPO2308 February 2013 Data Management at Cloud Scale CommVault Simpana v10 VMware Partner Exchange Session SPO2308 February 2013 Agenda Breakout Session: Wednesday, Feb 27, 11:00 AM - 12:00 PM Data Management at Cloud Scale

More information

TSAR A TimeSeries AggregatoR. Anirudh Todi TSAR

TSAR A TimeSeries AggregatoR. Anirudh Todi TSAR TSAR A TimeSeries AggregatoR Anirudh Todi Twitter @anirudhtodi TSAR What is TSAR? What is TSAR? TSAR is a framework and service infrastructure for specifying, deploying and operating timeseries aggregation

More information

Building Next- GeneraAon Data IntegraAon Pla1orm. George Xiong ebay Data Pla1orm Architect April 21, 2013

Building Next- GeneraAon Data IntegraAon Pla1orm. George Xiong ebay Data Pla1orm Architect April 21, 2013 Building Next- GeneraAon Data IntegraAon Pla1orm George Xiong ebay Data Pla1orm Architect April 21, 2013 ebay Analytics >50 TB/day new data 100+ Subject Areas >100 PB/day Processed >100 Trillion pairs

More information

Value of YouTube to the music industry Paper V Direct value to the industry

Value of YouTube to the music industry Paper V Direct value to the industry Value of YouTube to the music industry Paper V Direct value to the industry June 2017 RBB Economics 1 1 Introduction The music industry has undergone significant change over the past few years, with declining

More information

Avoiding Storage Service Disruptions with Availability Intelligence

Avoiding Storage Service Disruptions with Availability Intelligence Avoiding Storage Service Disruptions with Availability Intelligence Brent Phillips, Managing Director, Americas Brett Allison, Director of Technical Services www.intellimagic.com 1 Today s Agenda 1. Availability

More information

BUSINESS CHALLENGE SOLUTION BENEFITS ABOUT AUTO & GENERAL GROUP (SOUTHEAST ASIA)

BUSINESS CHALLENGE SOLUTION BENEFITS ABOUT AUTO & GENERAL GROUP (SOUTHEAST ASIA) CASE STUDY IZO PRIVATE CLOUD ENABLES AUTO & GENERAL GROUP (SOUTHEAST ASIA) TO ACHIEVE BORDERLESS GROWTH WITH SUCCESSFUL LAUNCH OF NEW BUSINESSES IN SOUTHEAST ASIAN MARKETS. TARGETING COMPETITIVE MARKETS,

More information

Best Practices and Performance Tuning on Amazon Elastic MapReduce

Best Practices and Performance Tuning on Amazon Elastic MapReduce Best Practices and Performance Tuning on Amazon Elastic MapReduce Michael Hanisch Solutions Architect Amo Abeyaratne Big Data and Analytics Consultant ANZ 12.04.2016 2016, Amazon Web Services, Inc. or

More information

GlobalNOC Services Update Internet2 Global Summit

GlobalNOC Services Update Internet2 Global Summit GlobalNOC Services Update 2015 Internet2 Global Summit Annual Report http://globalnoc.iu.edu/annual-report/2014/ 4/28/15 Service Desk Year in Review: Welcomed ARE-ON and OSHEAN to the GlobalNOC Family

More information

Lecture 20: WSC, Datacenters. Topics: warehouse-scale computing and datacenters (Sections )

Lecture 20: WSC, Datacenters. Topics: warehouse-scale computing and datacenters (Sections ) Lecture 20: WSC, Datacenters Topics: warehouse-scale computing and datacenters (Sections 6.1-6.7) 1 Warehouse-Scale Computer (WSC) 100K+ servers in one WSC ~$150M overall cost Requests from millions of

More information

Jenkins: A complete solution. From Continuous Integration to Continuous Delivery For HSBC

Jenkins: A complete solution. From Continuous Integration to Continuous Delivery For HSBC Jenkins: A complete solution From Integration to Delivery For HSBC Rajesh Kumar DevOps Architect @RajeshKumarIN www.rajeshkumar.xyz Agenda Why Jenkins? Introduction and some facts about Jenkins Supported

More information

Design Patterns for the Cloud. MCSN - N. Tonellotto - Distributed Enabling Platforms 68

Design Patterns for the Cloud. MCSN - N. Tonellotto - Distributed Enabling Platforms 68 Design Patterns for the Cloud 68 based on Amazon Web Services Architecting for the Cloud: Best Practices Jinesh Varia http://media.amazonwebservices.com/aws_cloud_best_practices.pdf 69 Amazon Web Services

More information

Connecting your Microservices and Cloud Services with Oracle Integration CON7348

Connecting your Microservices and Cloud Services with Oracle Integration CON7348 Connecting your Microservices and Cloud Services with Oracle Integration CON7348 Robert Wunderlich Sr. Principal Product Manager September 19, 2016 Copyright 2016, Oracle and/or its affiliates. All rights

More information

Meeting researchers needs in mining web archives: the experience of the National Library of France

Meeting researchers needs in mining web archives: the experience of the National Library of France Meeting researchers needs in mining web archives: the experience of the National Library of France Sara Aubry, IT Department Peter Stirling, Legal Deposit Department Bibliothèque nationale de France LIBER

More information

ADABAS & NATURAL 2050+

ADABAS & NATURAL 2050+ ADABAS & NATURAL 2050+ Guido Falkenberg SVP Global Customer Innovation DIGITAL TRANSFORMATION #WITHOUTCOMPROMISE 2017 Software AG. All rights reserved. ADABAS & NATURAL 2050+ GLOBAL INITIATIVE INNOVATION

More information

Enjoy the hottest Streaming Music with NAS

Enjoy the hottest Streaming Music with NAS Enjoy the hottest Streaming Music with NAS Why Should You Give Spotify a Try How to Use Spotify with QNAP NAS Versatile Audio Output Approach Versatile HD Station Manipulation Approach Why Should You Give

More information

Apigee Edge Cloud. Supported browsers:

Apigee Edge Cloud. Supported browsers: Apigee Edge Cloud Description Apigee Edge Cloud is an API management platform to securely deliver and manage all APIs. Apigee Edge Cloud manages the API lifecycle with capabilities that include, but are

More information

A Public-Sector Guide to UPS Procurement

A Public-Sector Guide to UPS Procurement A Public-Sector Guide to UPS Procurement HPE HYBRID IT AND HPE NIMBLE STORAGE New HPE Demand Just Generation Right Programs IT and Revealed Aruba: New Demand Generation Programs Revealed EXECUTIVE SUMMARY

More information

Deployment Planning Guide

Deployment Planning Guide Deployment Planning Guide Community 1.5.1 release The purpose of this document is to educate the user about the different strategies that can be adopted to optimize the usage of Jumbune on Hadoop and also

More information

A Disciplined Approach to Cyber Security Transformation

A Disciplined Approach to Cyber Security Transformation A Disciplined Approach to Cyber Security Transformation Information Protection and Business Resiliency December 2014 Key takeaways from today s cyber security discussion 1. Our colleagues are not waving

More information

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

MAPR DATA GOVERNANCE WITHOUT COMPROMISE MAPR TECHNOLOGIES, INC. WHITE PAPER JANUARY 2018 MAPR DATA GOVERNANCE TABLE OF CONTENTS EXECUTIVE SUMMARY 3 BACKGROUND 4 MAPR DATA GOVERNANCE 5 CONCLUSION 7 EXECUTIVE SUMMARY The MapR DataOps Governance

More information

BUILDING A SCALABLE MOBILE GAME BACKEND IN ELIXIR. Petri Kero CTO / Ministry of Games

BUILDING A SCALABLE MOBILE GAME BACKEND IN ELIXIR. Petri Kero CTO / Ministry of Games BUILDING A SCALABLE MOBILE GAME BACKEND IN ELIXIR Petri Kero CTO / Ministry of Games MOBILE GAME BACKEND CHALLENGES Lots of concurrent users Complex interactions between players Persistent world with frequent

More information

MapReduce for Graph Algorithms

MapReduce for Graph Algorithms Seminar: Massive-Scale Graph Analysis Summer Semester 2015 MapReduce for Graph Algorithms Modeling & Approach Ankur Sharma ankur@stud.uni-saarland.de May 8, 2015 Agenda 1 Map-Reduce Framework Big Data

More information

For Performance and Scalability, Amadeus Chooses Data Center

For Performance and Scalability, Amadeus Chooses Data Center For Performance and Scalability, Amadeus Chooses Data Center Amadeus is the leader in technology solutions for the global travel and tourism industry. Through its flagship product Altéa Customer Management

More information

Seagull: A distributed, fault tolerant, concurrent task runner. Sagar Patwardhan

Seagull: A distributed, fault tolerant, concurrent task runner. Sagar Patwardhan Seagull: A distributed, fault tolerant, concurrent task runner Sagar Patwardhan sagarp@yelp.com Yelp s Mission Connecting people with great local businesses. Yelp scale Outline What is Seagull? Why did

More information

Avanan for G Suite. Technical Overview. Copyright 2017 Avanan. All rights reserved.

Avanan for G Suite. Technical Overview. Copyright 2017 Avanan. All rights reserved. Avanan for G Suite Technical Overview Contents Intro 1 How Avanan Works 2 Email Security for Gmail 3 Data Security for Google Drive 4 Policy Automation 5 Workflows and Notifications 6 Authentication 7

More information