Scaling Data @ Spotify
- Chloe Warren
- 5 years ago
Transcription
1 Scaling Data @ Spotify: matti@spotify.com, kalvans@spotify.com
2 Mārtiņš Kalvāns, Matti Pehrs
3 Agenda: 1. Data at Spotify; 2. Summer of Challenges & Victory (Datamon, Styx, GABO)
4 Spotify big-data context: over 100 million monthly active users; over 30 million songs; over 2 billion playlists; active in 60 markets
5 Data is at the heart of Spotify: Monthly Royalty Report; Weekly Billboard; daily reports to partners...; A/B testing; Discover Weekly; Daily Mix...
6 Our growth in data (users, developers): +50 TB/day; +100M users; +60 TB/day; +10k M/R jobs
7 Autonomy & Dependencies Team B Team A Team C Hadoop
11-17 Summer of Incidents: a strain of incidents; war room; Hadoop on its knees; Event Delivery catch-up; reprocessing of data; hard-to-debug data issues
18-22 Challenges and the path to victory: 1. Early Warning: Datamon - Data monitoring; 2. Debuggability & Control: Styx - Scheduling and control; 3. Automate Capacity: GABO - Event Delivery
23 Early Warning - Datamon
24 Early Warning - Datamon: Unified view; Ownership; Alignment between teams; Clear ownership of data; SLA; Alert on late data
25 Early Warning - Datamon: Define terminology; Provide metadata language; Implement a Datamon service
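The "alert on late data" idea from the Datamon slides can be sketched roughly as below. This is only an illustration and not Datamon's actual interface: the `Dataset` record, its fields, the `is_late` helper and all example values are hypothetical, chosen to show how ownership and SLA metadata could drive a lateness check.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class Dataset:
    """Hypothetical metadata record: who owns a dataset and when it is due."""
    name: str
    owner: str       # team responsible for the data
    sla: timedelta   # maximum allowed delay after the partition closes


def is_late(ds: Dataset, partition_end: datetime,
            delivered_at: Optional[datetime], now: datetime) -> bool:
    """Return True if the partition missed its SLA deadline."""
    deadline = partition_end + ds.sla
    if delivered_at is not None:
        # Delivered: late only if it arrived after the deadline.
        return delivered_at > deadline
    # Not delivered yet: late once the deadline has passed.
    return now > deadline


ds = Dataset(name="plays", owner="team-a", sla=timedelta(hours=3))
end = datetime(2017, 1, 2, 0, 0)
if is_late(ds, end, delivered_at=None, now=datetime(2017, 1, 2, 4, 0)):
    print(f"ALERT {ds.owner}: {ds.name} partition ending {end:%Y-%m-%d %H:%M} is late")
```

Keeping the owner next to the SLA in one record means an alert can be routed to the responsible team without a separate lookup.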
27 Debuggability & Control - Styx: Execution control (self-service for data users); Execution information (expose debug information); Execution isolation (Docker for data jobs). Image: the river Styx
28-29 Debuggability & Control - Styx: Execution control: centralized execution API; backfilling and reprocessing
30-31 Debuggability & Control - Styx: Execution information: timeline; Google Cloud Logging
32 Debuggability & Control - Styx: Execution isolation: Docker
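Backfilling and reprocessing through a centralized execution API amounts to re-triggering a workflow once per missing partition. The sketch below shows that loop under stated assumptions: daily partitions, a half-open date range, and a `trigger` callback standing in for whatever the execution API exposes. The function names and the "royalty-report" workflow id are hypothetical and are not Styx's actual API.

```python
from datetime import date, timedelta
from typing import Callable, Iterator, List


def daily_partitions(start: date, end: date) -> Iterator[date]:
    """Yield every daily partition in the half-open range [start, end)."""
    day = start
    while day < end:
        yield day
        day += timedelta(days=1)


def backfill(trigger: Callable[[str, date], None], workflow: str,
             start: date, end: date) -> List[date]:
    """Re-trigger `workflow` once per partition, oldest first."""
    done = []
    for day in daily_partitions(start, end):
        trigger(workflow, day)  # stand-in for a call to a central execution API
        done.append(day)
    return done


triggered = []
backfill(lambda wf, day: triggered.append((wf, day.isoformat())),
         "royalty-report", date(2017, 1, 1), date(2017, 1, 4))
print(triggered)
```

Routing every run through one `trigger` entry point is what makes the backfill observable and controllable in one place, which is the point the slides make about execution control.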
34-39 Automate Capacity - GABO/Event Delivery: complex and manual config; Pubsub & Dataflow streaming; Pubsubs at scale; Dataflow streaming :-(; 2 microservices + 1 Map/Reduce job; autoscaling & The Stuffer
40 GABO - WIP: handles at least 10x our load; darkloading; autoscale everything; self-service
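As a rough illustration of what "autoscale everything" can mean for a queue consumer, the sketch below sizes a worker pool from the current backlog and the incoming rate. It is a generic backlog-based rule, not the GABO implementation: the function name, its parameters and the numbers in the example are all assumptions.

```python
import math


def desired_workers(backlog_msgs: int, incoming_per_sec: float,
                    per_worker_per_sec: float, drain_seconds: float,
                    min_workers: int = 1, max_workers: int = 100) -> int:
    """Workers needed to keep up with input and drain the backlog in time."""
    # Rate required = live traffic + backlog spread over the drain window.
    needed_rate = incoming_per_sec + backlog_msgs / drain_seconds
    workers = math.ceil(needed_rate / per_worker_per_sec)
    # Clamp so a bad measurement cannot scale to zero or without bound.
    return max(min_workers, min(max_workers, workers))


print(desired_workers(backlog_msgs=600_000, incoming_per_sec=5_000,
                      per_worker_per_sec=1_000, drain_seconds=600))
```

The `max_workers` clamp is itself a capacity assumption, so it needs revisiting as load grows, in line with the summary's warning that a capacity model can fail at larger scale.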
41 Summary: Make sure you have the right tools to deal with data incidents. Make sure you have time to implement the tools you need. Remember that your capacity model can fail at larger scale. Keep track of your scale and automate, automate, automate...
42 Thank you! Want to join the band?