Monasca. Monitoring/Logging-as-a-Service (at-scale)

Size: px
Start display at page:

Download "Monasca. Monitoring/Logging-as-a-Service (at-scale)"

Transcription

1 Monasca Monitoring/Logging-as-a-Service (at-scale)

2 Speaker Roland Hochmuth Hewlett Packard Enterprise Fort Collins, Colorado, USA

3 Agenda Describe how to build a highly scalable monitoring and logging as a service platform Architectural and design principles Scale, HA Provide an overview of Monasca Features API Demo

4 What is Monitoring-as-a-Service? A Monitoring or Logging solution deployed as Software-as-a-Service E.g. CloudWatch, Datadog, New Relic, Librato, Loggly and many others First-class, preferably RESTful HTTP API Authentication Multi-tenancy Provides self-provisioning to users/tenants of the service Designed to be highly reliable and operate at scale Historically run by an operations team doing web services

5 What is OpenStack? OpenStack is a cloud operating system that controls large pools of compute, storage, and networking resources Open-source alternative to AWS, Microsoft Azure, Google Cloud and other cloud services Deployed in both public and private clouds

6 What is Monasca? Open-source Monitoring/Logging-as-a-Service platform for OpenStack Authentication currently via OpenStack Identity Service (Keystone) Microservices message-bus based architecture First-class RESTful API Push-based metrics Consolidates Operational Monitoring, Monitoring-as-a-Service, Metering & Billing and more Designed for elastic cloud environments/deployments High-availability / clustering built-in Horizontally scalable and vertically 4 tiered/layered architecture Capable of long-term data retention to address metering, SLA, capacity planning, trend analysis, post-hoc RCA, and other use cases Extensible and Composable

7 The Log The Log: What every software engineer should know about real-time data's unifying abstraction Log: An append-only, totally-ordered sequence of records ordered by time From To

8 Monitoring Architecture

9 Kafka A performant, distributed, durable, publish/subscribe messaging and stream processing system Metrics, logs and events are published to topics in Kafka Microservices register in a "consumer group" as a consumer Microservices "subscribe" to topics and consume metrics/logs and events Messages are replicated per consumer group Messages are load-balanced across all consumers in a consumer group Can add/remove micro-services to handle load or mitigate problems As micro-services expand/contract the partitions are automatically re-balanced At-least-once semantic guarantees on message delivery Also used for domain events, notification retry events, periodic notifications, grouping notifcations and other areas Always accept data, never drop data, true elasticity Loggly:

10 CQRS Command Query Responsibility Segregation (CQRS) CQRS involves splitting an application into two parts internally: 1. Command side ordering the system to update state 2. Query side that gets information without changing state Advantages Decouples the read/write load. Allows each to be scaled independently Read store can be optimized for the query pattern of the application Reference Event sourcing, CQRS, stream processing and Apache Kafka

11 Microservices Microservices are small, autonomous, decoupled services that are deployed independenty and work together as a single application Communication between services occurs via a network Services need to be able to change independently of each other, and be deployed by themselves without requiring consumers to change Benefits: Resilience Scale Ease of deployment Organizational Alignment Optimized for Change/Replaceability

12 POST Metrics Sequence

13 Domain Events Sequence

14 Deployment Models (HA/Scale) Many ways to deploy Monasca Typically deployed in a clustered/ha configuration using three nodes or greater If any node or microservice fails, the cluster remains operational Partitions in Kafka are redistributed among the remaining components Preferably, the database is run on a separate layer from the other components/microservices Note, Monasca can also be deployed on a single-node, non-clustered Has also been containerized and run in Kubernetes

15 Metrics Model POST /v2.0/metrics { name: http_status, dimensions: { url: cluster: c1, control_plane: ccp, service: compute } timestamp: 0, /* milliseconds */ value: 1.0, } value_meta: { status_code: 500, msg: Internal server error } Simple, concise, multi-dimensional flexible description Name (string) Dimensions: Dictionary of user-defined (key, value) pairs that are used to uniquely identify a metric Optional dictionary of user-defined (key, value) pairs that can be used to describe a measurement Normally used for errors and messages

16 Push vs Pull Monitoring-as-a-Service Can't always pull due to firewalls and network issues Low-latency: sub-second latency difficult for pull model Doesn't require service discovery and registration As entities are deployed, they can start sending metrics without have to be discovered or registered Events Temporary caching/buffering of metrics/events while service unreachable.

17 Monasca API Primary point for pushing metrics and handling queries Authenticates all requests against the Keystone identity service Note, auth tokens are cached to reduce the load on Keystone Resources: Metrics, Alarm Definitions, Alarms and Notification Methods API Specification: Horizontally scalable Publishes metrics to Kafka Queries timeseries DB for measurements and statistics Queries Config DB for alarms, alarm definitions and notification methods

18 Persister Consumes both metrics and alarm state transition events from Kafka Stores temporarily in-memory and does batch writes to the TSDB, based on batch size or time, to optimize write performance At-least once message delivery semantics: No metrics or alarm state transition events are lost The Kafka consumer offset for each batch is only updated after successfully storing the metric or alarm state transition event Note, duplicates are possible HA/fault-tolerance: Multiple persisters run simultaneously and balance load If a persister fails, the load is automatically re-balanced across the remaining persisters.

19 Time Series Databases Used for storing: Metrics Alarm state history Two databases supported: 1. Vertica Enterprise class, proprietary, closed-source, clustered, HA, analytics database Excels at time-series 2. InfluxDB Open-source single-node time-series DB Clustering is closed-source Note, can replicate to multiple instances of InfluxDB using Kafka Investigating support for additional databases

20 Config Database Stores all "transactional" data for Monasca such as Alarm Definitions Alarms Notification Methods MySQL and Postgres supported Typically deployed in a clustered or HA configuration

21 Threshold Engine Near real-time stream processing, clustered and highly available threshold engine Based on Apache Storm Consumes metrics from Kafka Creates alarms based on metrics that match patterns specified in the alarm definition Evaluates whether metrics exceed threshold Publishes alarm state transition events to Kafka Supports both simple and compound alarm expressions

22 Notification Engine Consumes "alarm state transition events" from Kafka produced by the Threshold Engine Evaluates whether notifications should be sent based on actions specified in the alarm definition. OK, ALARM and UNDETERMINED actions Supports , PagerDuty, webhooks, HipChat, Slack and JIRA Dynamic plugins supported Supports both "one-shot" and "periodic" notifications If sending to the notification address fails, then notification is published to retry topic in Kafka, and retried later Grouping notifications: In progress

23 Kafka Message Schema JSON messages published/consumed to/from Kafka by Monasca micro-services Well-defined schema is published at:

24 Metrics Create, query and get statistics for metrics GET, POST /v2.0/metrics GET /v2.0/metrics/names: Returns the unique metric names GET /v2.0/metrics/dimension/names Returns the unique dimension names GET /v2.0/metrics/dimension/names/values Returns the unique dimension values

25 Measurements GET /v2.0/metrics/measurements Returns a list of measurements Query parameters Name and dimensions to filter by Start_time and end_time Offset and limit merge_metrics: allow multiple metrics to be combined into a single list of measurements. group_by: list of columns to group the metrics to be returned. Allows multiple unique metrics to be returned in a single query.

26 Statistics GET /v2.0/metrics/statistics Query parameters Name and dimensions to filter by Start_time and end_time Statistics: avg, min, max, sum and count Period: The time period to aggregate measurements by Offset, limit merge_metrics: allow multiple metrics to be combined into a single list of statistics group_by: list of columns to group the metrics to be returned. Allows multiple unique metrics to be returned in a single query.

27 Metrics Names GET /v2.0/metrics/names Returns a list of the unique metric names Query parameters Dimensions Offset, limit

28 Metric Dimension Names GET /v2.0/metrics/dimensions/names List the dimension names Query parameters Metric name Offset, limit

29 Metric Dimension Values GET /v2.0/metrics/dimensions/names/values List the dimension values Query parameters Metric name Dimension name Offset, limit

30 Alarm Definitions POST, GET /v2.0/alarm-definitions Alarm definitions are templates that are used to automatically and dynamically create alarms based on matching metric names and dimensions One alarm definition can result in zero or more alarms. Simple grammar for creating compound alarm expressions: avg(cpu.user_perc{}) > 85 or avg(disk.read_ops{device=vda}, 120) > 1000 Alarm states (OK, ALARM and UNDETERMINED) Actions associated with alarms for state transitions User assigned severity (LOW, MEDIUM, HIGH, CRITICAL) Thresholds can be dynamically adjusted via PATCH Minimal lifecycle management, alarm_lifecycle_state and link.

31 List Alarms GET /v2.0/alarms Query parameters: metric_name - Name of metric to filter by metric_dimensions State: OK, ALARM or UNDETERMINED. Severity: One or more severities to filter by, separated with, ex. severity=low MEDIUM state_updated_start_time : The start time in ISO 8601 combined date and time format in UTC. Offset, limit sort_by

32 Alarms GET, PUT, PATCH, DELETE /v2.0/alarms/{alarm-id} Alarms created by the Threshold Engine based on matching alarm definitions. When new nodes or components are deployed, alarms are automatically created Alarms are resources within Monasca. They have a resource ID and lifecycle. By default, three states: OK, ALARM and UNDETERMINED UNDETERMINED state occurs when metrics are no longer being received Deterministic alarms, two states: OK and ALARM Used for systems where metrics are sporadic. E.g. Creating metrics when errors in log files occur, and no metrics, when there aren't any errors.

33 Alarm Counts GET /v2.0/alarms/count Query the total number of alarms in the OK, ALARM or UNDETERMINED state, and their severities, grouped by metrics dimension, such as OpenStack service, state and severity. Used for summary dashboards

34 Example: Helion Ops Console

35 Alarm History GET /v2.0/alarms/state-history Lists the alarm state history for alarms Query Parameters: Dimensions to filter on Start/end timestamp Offset, limit GET /v2.0/alarms/{alarm-id}/state-history Lists the alarm state history for a specific alarm

36 Notification Methods POST, GET, DELETE /v2.0/notification-methods Notification methods are associated with Actions in alarm definitions. Example: POST /v2.0/notification-methods { "name":"name of notification method", "type":" ", "address":"john.doe@hp.com" }

37 Monasca Agent System metrics (cpu, memory, network, filesystem, ) Service metrics MySQL, Kafka, and many others Application metrics Built-in Statsd daemon Python monasca-statsd library: Adds support for dimensions VM system metrics Open vswitch metrics Active checks HTTP status checks and response times System up/down checks (ping and ssh) Runs any Nagios plugin or check_mk Extensible/Pluggable: Additional services can be easily added

38 Agent details The Agent Forwarder buffers metrics for a short time to increase the size of the http request body (number of metrics) sent to the Monasca API. The Agent request an auth token from the Keystone Identity service which is supplied on all requests. The Monasca Agent and API caches Monasca Agent and API caches Monasca Agent and API caches auth tokens in-memory to reduce the round-trip authorization requests to Keystone If network connectivity between the Agent and API occurs the Agent will buffer metrics and send when connectivity is restored Metrics are submitted using a agent role, which only allows metrics to be POST d to the metrics endpoint

39 Grafana/Monasca Integration Datasource: A datasource that can be added to the Grafana dashboard to enable Monasca Keystone authentication Support for Alerting will be added in Grafana 4.

40 Grafana Monasca Data Source

41 Logging Architecture

42 Logging API POST /v3.0/logs Batch log messages in a single http request Global / local / mixed dimensions Similar to dimensions in metrics. JSON only Specification Queries not done via API, but via Tenantized version of Kibana

43 Log Model { "dimensions": { "hostname":"devstack", "service":"monitoring", "component":"monasca-api" } "logs":[ { "message":"msg1", "dimensions": { "service":"compute", "component":"nova-api", "path":"/var/log/mysql.log" } }, { "message":"msg2", "dimensions": { "path":"/var/log/monasca/monasca-api.log" } } ] }

44 Log Agents Logstash Beaver Logspout: Under Investigation

45 Kibana Integration Keystone authentication support for Kibana Authentication plugin: Note: In progress of moving to official OpenStack repo

46 Composabilty: Logging/Metrics

47 Transform and Analytics Engine

48 Monasca Transform A new micro-service in Monasca that aggregates and transforms metrics. Currently based on Apache Spark Streaming. Use Cases: Object Storage Disk Capacity Object Storage Capacity Compute Host Capacity VM Capacity More to come Metrics are aggregated and published every hour. Currently in deployment in HPE Helion OpenStack 4.0. OpenStack project/repo

49 Monasca Analytics A framework that adds data science tools (parsers, algorithms, etc). Features include: Algorithmic flow definition, enabling sharing of complex algorithmic recipes Thin orchestration layer that instantiates an execution environment. Focused on: Anomaly detection Reducing alert fatigue via alarm clustering (unsupervised machine learning). Example algorithms: One Class SVM and LiNGAM. Status: Under Development OpenStack project/repo

50 Distributions & Deployments Charter Communications: Monasca and Grafana is currently deployed in production private cloud Monitoring-as-a-Service Use cases supported with Grafana as the Visualization Dashboard 2 datacenters, compute nodes, 1000 VMs, 11,000 metrics/sec FIWARE Lab: Hewlett Packard Enterprise: Cloud System, Helion OpenStack Supported and tested up to 65K metrics/sec injest rates. Fujitsu: FUJITSU Software ServerView Cloud Monitoring Manager. NEC: Planning to include Monasca in "Cloud Solution Menus" solution. Others

51 Statistics: Mitaka/Newton Release Organizations: Contributors: Commits: Reviews: Lines of code: ,370

52 Ecosystem Hewlett Packard Enterprise Fujitsu Charter Communications NEC Cisco Cloudbase Solutions SUSE SolidFire SAP Cray Inc. FIWARE Lab Mirantis Broadcom

53 Containers and Kubernetes New Monasca Agent Plugins Docker plugin cadviser plugin Kubernetes plugin: Monitors both Kubernetes control plane and containers Prometheus client plugin: Scrapes apps Mesos pugin Containerization of Monasca Heapster Monasca data sink

54 Next Steps Containerizing Monasca Monitoring containers and container managers, such as Kubernetes Grouping notifications

Overview. SUSE OpenStack Cloud Monitoring

Overview. SUSE OpenStack Cloud Monitoring Overview SUSE OpenStack Cloud Monitoring Overview SUSE OpenStack Cloud Monitoring Publication Date: 08/04/2017 SUSE LLC 10 Canal Park Drive Suite 200 Cambridge MA 02141 USA https://www.suse.com/documentation

More information

FUJITSU Software ServerView Cloud Monitoring Manager V1.0. Overview

FUJITSU Software ServerView Cloud Monitoring Manager V1.0. Overview FUJITSU Software ServerView Cloud Monitoring Manager V1.0 Overview J2UL-2073-01ENZ0(00) November 2015 Trademarks Copyright FUJITSU LIMITED 2015 LINUX is a registered trademark of Linus Torvalds. The OpenStack

More information

The Art of Container Monitoring. Derek Chen

The Art of Container Monitoring. Derek Chen The Art of Container Monitoring Derek Chen 2016.9.22 About me DevOps Engineer at Trend Micro Agile transformation Micro service and cloud service Docker integration Monitoring system development Automate

More information

Designing MQ deployments for the cloud generation

Designing MQ deployments for the cloud generation Designing MQ deployments for the cloud generation WebSphere User Group, London Arthur Barr, Senior Software Engineer, IBM MQ 30 th March 2017 Top business drivers for cloud 2 Source: OpenStack user survey,

More information

FUJITSU Software ServerView Cloud Monitoring Manager V1.1. Release Notes

FUJITSU Software ServerView Cloud Monitoring Manager V1.1. Release Notes FUJITSU Software ServerView Cloud Monitoring Manager V1.1 Release Notes J2UL-2170-01ENZ0(00) July 2016 Contents Contents About this Manual... 4 1 What's New?...6 1.1 Performance Improvements... 6 1.2

More information

Datasheet FUJITSU Software Cloud Monitoring Manager V2.0

Datasheet FUJITSU Software Cloud Monitoring Manager V2.0 Datasheet FUJITSU Software Cloud Monitoring Manager V2.0 Cloud Monitoring Manager supports DevOps teams to keep maximum control of their OpenStack Cloud OpenStack is complex and highly distributed. Gaining

More information

WHITEPAPER. MemSQL Enterprise Feature List

WHITEPAPER. MemSQL Enterprise Feature List WHITEPAPER MemSQL Enterprise Feature List 2017 MemSQL Enterprise Feature List DEPLOYMENT Provision and deploy MemSQL anywhere according to your desired cluster configuration. On-Premises: Maximize infrastructure

More information

The SMACK Stack: Spark*, Mesos*, Akka, Cassandra*, Kafka* Elizabeth K. Dublin Apache Kafka Meetup, 30 August 2017.

The SMACK Stack: Spark*, Mesos*, Akka, Cassandra*, Kafka* Elizabeth K. Dublin Apache Kafka Meetup, 30 August 2017. Dublin Apache Kafka Meetup, 30 August 2017 The SMACK Stack: Spark*, Mesos*, Akka, Cassandra*, Kafka* Elizabeth K. Joseph @pleia2 * ASF projects 1 Elizabeth K. Joseph, Developer Advocate Developer Advocate

More information

StreamSets Control Hub Installation Guide

StreamSets Control Hub Installation Guide StreamSets Control Hub Installation Guide Version 3.2.1 2018, StreamSets, Inc. All rights reserved. Table of Contents 2 Table of Contents Chapter 1: What's New...1 What's New in 3.2.1... 2 What's New in

More information

Datasheet FUJITSU Software ServerView Cloud Monitoring Manager V1.1

Datasheet FUJITSU Software ServerView Cloud Monitoring Manager V1.1 Datasheet FUJITSU Software ServerView Cloud Monitoring Manager V1.1 Datasheet FUJITSU Software ServerView Cloud Monitoring Manager V1.1 A Monitoring Cloud Service for Enterprise OpenStack Systems Cloud

More information

Cisco Tetration Analytics

Cisco Tetration Analytics Cisco Tetration Analytics Enhanced security and operations with real time analytics John Joo Tetration Business Unit Cisco Systems Security Challenges in Modern Data Centers Securing applications has become

More information

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti S3IT: Service and Support for Science IT Cloud middleware Part2: Let s pick one cloud IaaS middleware: OpenStack Sergio Maffioletti S3IT: Service and Support for Science IT, University of Zurich http://www.s3it.uzh.ch/

More information

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

How can you implement this through a script that a scheduling daemon runs daily on the application servers? You ve been tasked with implementing an automated data backup solution for your application servers that run on Amazon EC2 with Amazon EBS volumes. You want to use a distributed data store for your backups

More information

Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS

Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS ContainerCon @ Open Source Summit North America 2017 Elizabeth K. Joseph @pleia2 1 Elizabeth K. Joseph, Developer Advocate

More information

Containerization Dockers / Mesospere. Arno Keller HPE

Containerization Dockers / Mesospere. Arno Keller HPE Containerization Dockers / Mesospere Arno Keller HPE What is the Container technology Hypervisor vs. Containers (Huis vs artement) A container doesn't "boot" an OS instead it loads the application and

More information

Monitoring system for geographically distributed datacenters based on Openstack. Gioacchino Vino

Monitoring system for geographically distributed datacenters based on Openstack. Gioacchino Vino Monitoring system for geographically distributed datacenters based on Openstack Gioacchino Vino Tutor: Dott. Domenico Elia Tutor: Dott. Giacinto Donvito Borsa di studio GARR Orio Carlini 2016-2017 INFN

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

70-532: Developing Microsoft Azure Solutions

70-532: Developing Microsoft Azure Solutions 70-532: Developing Microsoft Azure Solutions Objective Domain Note: This document shows tracked changes that are effective as of January 18, 2018. Create and Manage Azure Resource Manager Virtual Machines

More information

70-532: Developing Microsoft Azure Solutions

70-532: Developing Microsoft Azure Solutions 70-532: Developing Microsoft Azure Solutions Exam Design Target Audience Candidates of this exam are experienced in designing, programming, implementing, automating, and monitoring Microsoft Azure solutions.

More information

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Using the SDACK Architecture to Build a Big Data Product Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver Outline A Threat Analytic Big Data product The SDACK Architecture Akka Streams and data

More information

"Charting the Course... H8Q14S HPE Helion OpenStack. Course Summary

Charting the Course... H8Q14S HPE Helion OpenStack. Course Summary Course Summary Description This course will take students through an in-depth look at HPE Helion OpenStack V5.0. The course flow is optimized to address the high-level architecture and HPE Helion OpenStack

More information

How to Keep UP Through Digital Transformation with Next-Generation App Development

How to Keep UP Through Digital Transformation with Next-Generation App Development How to Keep UP Through Digital Transformation with Next-Generation App Development Peter Sjoberg Jon Olby A Look Back, A Look Forward Dedicated, data structure dependent, inefficient, virtualized Infrastructure

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

Deep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services

Deep Dive Amazon Kinesis. Ian Meyers, Principal Solution Architect - Amazon Web Services Deep Dive Amazon Kinesis Ian Meyers, Principal Solution Architect - Amazon Web Services Analytics Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure

More information

Beyond 1001 Dedicated Data Service Instances

Beyond 1001 Dedicated Data Service Instances Beyond 1001 Dedicated Data Service Instances Introduction The Challenge Given: Application platform based on Cloud Foundry to serve thousands of apps Application Runtime Many platform users - who don

More information

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme CNA2080BU Deep Dive: How to Deploy and Operationalize Kubernetes Cornelia Davis, Pivotal Nathan Ness Technical Product Manager, CNABU @nvpnathan #VMworld #CNA2080BU Disclaimer This presentation may contain

More information

OpenStack Mitaka Release Overview

OpenStack Mitaka Release Overview OpenStack Mitaka Release Overview Brad Topol, Ph.D., IBM Distinguished Engineer, OpenStack Core Contributor Davanum Srinivas, Community Architect and OpenStack TC Member / Principal Software Engineer,

More information

How to re-invent your IT Architecture. André Christ, Co-CEO LeanIX

How to re-invent your IT Architecture. André Christ, Co-CEO LeanIX How to re-invent your IT Architecture André Christ, Co-CEO LeanIX 2012 founded 30 employees > 80 customers 150 % motivated 2 OUR MISSION Become global #1 SaaS helping companies to modernize their IT architectures

More information

Qualys Cloud Platform

Qualys Cloud Platform 18 QUALYS SECURITY CONFERENCE 2018 Qualys Cloud Platform Looking Under the Hood: What Makes Our Cloud Platform so Scalable and Powerful Dilip Bachwani Vice President, Engineering, Qualys, Inc. Cloud Platform

More information

/ Cloud Computing. Recitation 5 February 14th, 2017

/ Cloud Computing. Recitation 5 February 14th, 2017 15-319 / 15-619 Cloud Computing Recitation 5 February 14th, 2017 1 Overview Administrative issues Office Hours, Piazza guidelines Last week s reflection Project 2.1, OLI Unit 2 modules 5 and 6 This week

More information

Fluentd + MongoDB + Spark = Awesome Sauce

Fluentd + MongoDB + Spark = Awesome Sauce Fluentd + MongoDB + Spark = Awesome Sauce Nishant Sahay, Sr. Architect, Wipro Limited Bhavani Ananth, Tech Manager, Wipro Limited Your company logo here Wipro Open Source Practice: Vision & Mission Vision

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

Using Prometheus with InfluxDB for metrics storage

Using Prometheus with InfluxDB for metrics storage Using Prometheus with InfluxDB for metrics storage Roman Vynar Senior Site Reliability Engineer, Quiq September 26, 2017 About Quiq Quiq is a messaging platform for customer service. https://goquiq.com

More information

A10 HARMONY CONTROLLER

A10 HARMONY CONTROLLER DATA SHEET A10 HARMONY CONTROLLER AGILE MANAGEMENT, AUTOMATION, ANALYTICS FOR MULTI-CLOUD ENVIRONMENTS PLATFORMS A10 Harmony Controller provides centralized agile management, automation and analytics for

More information

Important DevOps Technologies (3+2+3days) for Deployment

Important DevOps Technologies (3+2+3days) for Deployment Important DevOps Technologies (3+2+3days) for Deployment DevOps is the blending of tasks performed by a company's application development and systems operations teams. The term DevOps is being used in

More information

WHITE PAPER. RedHat OpenShift Container Platform. Benefits: Abstract. 1.1 Introduction

WHITE PAPER. RedHat OpenShift Container Platform. Benefits: Abstract. 1.1 Introduction WHITE PAPER RedHat OpenShift Container Platform Abstract Benefits: Applications are designed around smaller independent components called microservices. Elastic resources: Scale up or down quickly and

More information

@joerg_schad Nightmares of a Container Orchestration System

@joerg_schad Nightmares of a Container Orchestration System @joerg_schad Nightmares of a Container Orchestration System 2017 Mesosphere, Inc. All Rights Reserved. 1 Jörg Schad Distributed Systems Engineer @joerg_schad Jan Repnak Support Engineer/ Solution Architect

More information

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4 Cloud & container monitoring 04.05.2018, Lars Michelsen Some cloud definitions Applications Data Runtime Middleware O/S Virtualization Servers Storage Networking Software-as-a-Service (SaaS) Applications

More information

Continuous delivery while migrating to Kubernetes

Continuous delivery while migrating to Kubernetes Continuous delivery while migrating to Kubernetes Audun Fauchald Strand Øyvind Ingebrigtsen Øvergaard @audunstrand @oyvindio FINN Infrastructure History Kubernetes at FINN Agenda Finn Infrastructure As

More information

MQ Monitoring on Cloud

MQ Monitoring on Cloud MQ Monitoring on Cloud Suganya Rane Digital Automation, Integration & Cloud Solutions Agenda Metrics & Monitoring Monitoring Options AWS ElasticSearch Kibana MQ CloudWatch on AWS Prometheus Grafana MQ

More information

Prometheus For Big & Little People Simon Lyall

Prometheus For Big & Little People Simon Lyall Prometheus For Big & Little People Simon Lyall Sysadmin (it says DevOps Engineer in my job title) Large Company, Auckland, New Zealand Use Prometheus at home on workstations, home servers and hosted Vms

More information

Network Automation using modern tech. Egor Krivosheev 2degrees

Network Automation using modern tech. Egor Krivosheev 2degrees Network Automation using modern tech Egor Krivosheev 2degrees Key parts of network automation today Streaming Telemetry APIs SNMP and screen scraping are still around NETCONF RFC6241 XML encoding Most

More information

Application Centric Microservices Ken Owens, CTO Cisco Intercloud Services. Redhat Summit 2015

Application Centric Microservices Ken Owens, CTO Cisco Intercloud Services. Redhat Summit 2015 Application Centric Microservices Ken Owens, CTO Cisco Intercloud Services Redhat Summit 2015 Agenda Introduction Why Application Centric Application Deployment Options What is Microservices Infrastructure

More information

Kubernetes: Twelve KeyFeatures

Kubernetes: Twelve KeyFeatures Kubernetes: Twelve KeyFeatures Kubernetes is a Greek word which means helmsman, or the pilot of a ship. It is an open source project that was started by Google and derived from Borg, which is used inside

More information

OpenShift 3 Technical Architecture. Clayton Coleman, Dan McPherson Lead Engineers

OpenShift 3 Technical Architecture. Clayton Coleman, Dan McPherson Lead Engineers OpenShift 3 Technical Architecture Clayton Coleman, Dan McPherson Lead Engineers Principles The future of *aas Redefine the Application Networked components wired together Not just a web frontend anymore

More information

DevOps CICD PopUp. Software Defined Application Delivery Fabric. Frey Khademi. Systems Engineering DACH. Avi Networks

DevOps CICD PopUp. Software Defined Application Delivery Fabric. Frey Khademi. Systems Engineering DACH. Avi Networks DevOps CICD PopUp Software Defined Application Delivery Fabric Systems Engineering DACH Frey Khademi Avi Networks Agenda Avi Introduction - Overview - Architecture - Use Cases Demo Integration Building

More information

Upcoming Services in OpenStack Rohit Agarwalla, Technical DEVNET-1102

Upcoming Services in OpenStack Rohit Agarwalla, Technical DEVNET-1102 Upcoming Services in OpenStack Rohit Agarwalla, Technical Leader roagarwa@cisco.com, @rohitagarwalla DEVNET-1102 Agenda OpenStack Overview Upcoming Services Trove Sahara Ironic Magnum Kolla OpenStack Overview

More information

Wrapp. Powered by AWS EC2 Container Service. Jude D Souza Solutions Wrapp Phone:

Wrapp. Powered by AWS EC2 Container Service. Jude D Souza Solutions Wrapp Phone: Containers @ Wrapp Powered by AWS EC2 Container Service Jude D Souza Solutions Architect @ Wrapp Phone: +46 767085740 Email: jude@wrapp.com About Me Jude D Souza Stockholm, Sweden ß Karachi, Pakistan jude@wrapp.com

More information

Best Practice Deployment of F5 App Services in Private Clouds. Henry Tam, Senior Product Marketing Manager John Gruber, Sr. PM Solutions Architect

Best Practice Deployment of F5 App Services in Private Clouds. Henry Tam, Senior Product Marketing Manager John Gruber, Sr. PM Solutions Architect Best Practice Deployment of F5 App Services in Private Clouds Henry Tam, Senior Product Marketing Manager John Gruber, Sr. PM Solutions Architect Agenda 1 2 3 4 5 The trend of data center, private cloud

More information

Cloud Monitoring as a Service. Built On Machine Learning

Cloud Monitoring as a Service. Built On Machine Learning Cloud Monitoring as a Service Built On Machine Learning Table of Contents 1 2 3 4 5 6 7 8 9 10 Why Machine Learning Who Cares Four Dimensions to Cloud Monitoring Data Aggregation Anomaly Detection Algorithms

More information

Cisco Tetration Analytics

Cisco Tetration Analytics Cisco Tetration Analytics Enhanced security and operations with real time analytics Christopher Say (CCIE RS SP) Consulting System Engineer csaychoh@cisco.com Challenges in operating a hybrid data center

More information

Getting Started With Amazon EC2 Container Service

Getting Started With Amazon EC2 Container Service Getting Started With Amazon EC2 Container Service Emeka Igbokwe Solution Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Agenda Containers EC2 Container Service EC2 Container

More information

Monitoring MySQL Performance with Percona Monitoring and Management

Monitoring MySQL Performance with Percona Monitoring and Management Monitoring MySQL Performance with Percona Monitoring and Management Your Presenters Michael Coburn - PMM Product Manager Working at Percona for almost 5 years Consultant, Manager, TAM, now Product Manager

More information

Thales PunchPlatform Agenda

Thales PunchPlatform Agenda Thales PunchPlatform Agenda What It Does Building Blocks PunchPlatform team Deployment & Operations Typical Setups Customers and Use Cases RoadMap 1 What It Does Compose Arbitrary Industrial Data Processing

More information

Professional PostgreSQL monitoring made easy. Kaarel Moppel - p2d2.cz 2019 Prague

Professional PostgreSQL monitoring made easy. Kaarel Moppel - p2d2.cz 2019 Prague Professional PostgreSQL monitoring made easy Kaarel Moppel - p2d2.cz 2019 Prague Who? Kaarel Moppel Senior Database Consultant km@cybertec.at PostgreSQL Database Services Training Consulting High Availability

More information

@unterstein #bedcon. Operating microservices with Apache Mesos and DC/OS

@unterstein #bedcon. Operating microservices with Apache Mesos and DC/OS @unterstein @dcos @bedcon #bedcon Operating microservices with Apache Mesos and DC/OS 1 Johannes Unterstein Software Engineer @Mesosphere @unterstein @unterstein.mesosphere 2017 Mesosphere, Inc. All Rights

More information

Container 2.0. Container: check! But what about persistent data, big data or fast data?!

Container 2.0. Container: check! But what about persistent data, big data or fast data?! @unterstein @joerg_schad @dcos @jaxdevops Container 2.0 Container: check! But what about persistent data, big data or fast data?! 1 Jörg Schad Distributed Systems Engineer @joerg_schad Johannes Unterstein

More information

Lenses 2.1 Enterprise Features PRODUCT DATA SHEET

Lenses 2.1 Enterprise Features PRODUCT DATA SHEET Lenses 2.1 Enterprise Features PRODUCT DATA SHEET 1 OVERVIEW DataOps is the art of progressing from data to value in seconds. For us, its all about making data operations as easy and fast as using the

More information

Distributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA

Distributed CI: Scaling Jenkins on Mesos and Marathon. Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA Distributed CI: Scaling Jenkins on Mesos and Marathon Roger Ignazio Puppet Labs, Inc. MesosCon 2015 Seattle, WA About Me Roger Ignazio QE Automation Engineer Puppet Labs, Inc. @rogerignazio Mesos In Action

More information

Towards a Real- time Processing Pipeline: Running Apache Flink on AWS

Towards a Real- time Processing Pipeline: Running Apache Flink on AWS Towards a Real- time Processing Pipeline: Running Apache Flink on AWS Dr. Steffen Hausmann, Solutions Architect Michael Hanisch, Manager Solutions Architecture November 18 th, 2016 Stream Processing Challenges

More information

Monitoring MySQL Performance with Percona Monitoring and Management

Monitoring MySQL Performance with Percona Monitoring and Management Monitoring MySQL Performance with Percona Monitoring and Management Santa Clara, California April 23th 25th, 2018 MIchael Coburn, Product Manager Your Presenter Product Manager for PMM (also Percona Toolkit

More information

UMP Alert Engine. Status. Requirements

UMP Alert Engine. Status. Requirements UMP Alert Engine Status Requirements Goal Terms Proposed Design High Level Diagram Alert Engine Topology Stream Receiver Stream Router Policy Evaluator Alert Publisher Alert Topology Detail Diagram Alert

More information

Data Acquisition. The reference Big Data stack

Data Acquisition. The reference Big Data stack Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Data Acquisition Corso di Sistemi e Architetture per Big Data A.A. 2017/18 Valeria Cardellini The reference

More information

When (and how) to move applications from VMware to Cisco Metacloud

When (and how) to move applications from VMware to Cisco Metacloud White Paper When (and how) to move applications from VMware to Cisco Metacloud What You Will Learn This white paper will explain when to migrate various applications running in VMware virtual machines

More information

Containers Infrastructure for Advanced Management. Federico Simoncelli Associate Manager, Red Hat October 2016

Containers Infrastructure for Advanced Management. Federico Simoncelli Associate Manager, Red Hat October 2016 Containers Infrastructure for Advanced Management Federico Simoncelli Associate Manager, Red Hat October 2016 About Me Kubernetes Decoupling problems to hand out to different teams Layer of abstraction

More information

Architectural challenges for building a low latency, scalable multi-tenant data warehouse

Architectural challenges for building a low latency, scalable multi-tenant data warehouse Architectural challenges for building a low latency, scalable multi-tenant data warehouse Mataprasad Agrawal Solutions Architect, Services CTO 2017 Persistent Systems Ltd. All rights reserved. Our analytics

More information

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development:: Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized

More information

Cisco Container Platform

Cisco Container Platform Cisco Container Platform Pradnesh Patil Suhail Syed Cisco Spark How Questions? Use Cisco Spark to communicate with the speaker after the session 1. Find this session in the Cisco Live Mobile App 2. Click

More information

Using DC/OS for Continuous Delivery

Using DC/OS for Continuous Delivery Using DC/OS for Continuous Delivery DevPulseCon 2017 Elizabeth K. Joseph, @pleia2 Mesosphere 1 Elizabeth K. Joseph, Developer Advocate, Mesosphere 15+ years working in open source communities 10+ years

More information

Building a Data-Friendly Platform for a Data- Driven Future

Building a Data-Friendly Platform for a Data- Driven Future Building a Data-Friendly Platform for a Data- Driven Future Benjamin Hindman - @benh 2016 Mesosphere, Inc. All Rights Reserved. INTRO $ whoami BENJAMIN HINDMAN Co-founder and Chief Architect of Mesosphere,

More information

IBM Planning Analytics Workspace Local Distributed Soufiane Azizi. IBM Planning Analytics

IBM Planning Analytics Workspace Local Distributed Soufiane Azizi. IBM Planning Analytics IBM Planning Analytics Workspace Local Distributed Soufiane Azizi IBM Planning Analytics IBM Canada - Cognos Ottawa Lab. IBM Planning Analytics Agenda 1. Demo PAW High Availability on a Prebuilt Swarm

More information

Regain control thanks to Prometheus. Guillaume Lefevre, DevOps Engineer, OCTO Technology Etienne Coutaud, DevOps Engineer, OCTO Technology

Regain control thanks to Prometheus. Guillaume Lefevre, DevOps Engineer, OCTO Technology Etienne Coutaud, DevOps Engineer, OCTO Technology Regain control thanks to Prometheus Guillaume Lefevre, DevOps Engineer, OCTO Technology Etienne Coutaud, DevOps Engineer, OCTO Technology About us Guillaume Lefevre DevOps Engineer, OCTO Technology @guillaumelfv

More information

The intelligence of hyper-converged infrastructure. Your Right Mix Solution

The intelligence of hyper-converged infrastructure. Your Right Mix Solution The intelligence of hyper-converged infrastructure Your Right Mix Solution Applications fuel the idea economy SLA s, SLA s, SLA s Regulations Latency Performance Integration Disaster tolerance Reliability

More information

Flip the Switch to Container-based Clouds

Flip the Switch to Container-based Clouds Flip the Switch to Container-based Clouds B I L L B O R S A R I D I R E C T O R, S Y S T E M S E N G I N E E R I N G 1 November 2017 1 2017 Datera Datera at a Glance Founded 2013 Smart storage for clouds

More information

Developing Microsoft Azure Solutions: Course Agenda

Developing Microsoft Azure Solutions: Course Agenda Developing Microsoft Azure Solutions: 70-532 Course Agenda Module 1: Overview of the Microsoft Azure Platform Microsoft Azure provides a collection of services that you can use as building blocks for your

More information

Docker and Oracle Everything You Wanted To Know

Docker and Oracle Everything You Wanted To Know Docker and Oracle Everything You Wanted To Know June, 2017 Umesh Tanna Principal Technology Sales Consultant Oracle Sales Consulting Centers(SCC) Bangalore Safe Harbor Statement The following is intended

More information

Deploying Applications on DC/OS

Deploying Applications on DC/OS Mesosphere Datacenter Operating System Deploying Applications on DC/OS Keith McClellan - Technical Lead, Federal Programs keith.mcclellan@mesosphere.com V6 THE FUTURE IS ALREADY HERE IT S JUST NOT EVENLY

More information

Data pipelines with PostgreSQL & Kafka

Data pipelines with PostgreSQL & Kafka Data pipelines with PostgreSQL & Kafka Oskari Saarenmaa PostgresConf US 2018 - Jersey City Agenda 1. Introduction 2. Data pipelines, old and new 3. Apache Kafka 4. Sample data pipeline with Kafka & PostgreSQL

More information

Course Outline. Lesson 2, Azure Portals, describes the two current portals that are available for managing Azure subscriptions and services.

Course Outline. Lesson 2, Azure Portals, describes the two current portals that are available for managing Azure subscriptions and services. Course Outline Module 1: Overview of the Microsoft Azure Platform Microsoft Azure provides a collection of services that you can use as building blocks for your cloud applications. Lesson 1, Azure Services,

More information

Building a Kubernetes on Bare-Metal Cluster to Serve Wikipedia. Alexandros Kosiaris Giuseppe Lavagetto

Building a Kubernetes on Bare-Metal Cluster to Serve Wikipedia. Alexandros Kosiaris Giuseppe Lavagetto Building a Kubernetes on Bare-Metal Cluster to Serve Wikipedia Alexandros Kosiaris Giuseppe Lavagetto Introduction The Wikimedia Foundation is the organization running the infrastructure supporting Wikipedia

More information

New Data Architectures For Netflow Analytics NANOG 74. Fangjin Yang - Imply

New Data Architectures For Netflow Analytics NANOG 74. Fangjin Yang - Imply New Data Architectures For Netflow Analytics NANOG 74 Fangjin Yang - Cofounder @ Imply The Problem Comparing technologies Overview Operational analytic databases Try this at home The Problem Netflow data

More information

Performance Monitoring and Management of Microservices on Docker Ecosystem

Performance Monitoring and Management of Microservices on Docker Ecosystem Performance Monitoring and Management of Microservices on Docker Ecosystem Sushanta Mahapatra Sr.Software Specialist Performance Engineering SAS R&D India Pvt. Ltd. Pune Sushanta.Mahapatra@sas.com Richa

More information

Techno Expert Solutions

Techno Expert Solutions Course Content of Microsoft Windows Azzure Developer: Course Outline Module 1: Overview of the Microsoft Azure Platform Microsoft Azure provides a collection of services that you can use as building blocks

More information

Merging Enterprise Applications with Docker* Container Technology

Merging Enterprise Applications with Docker* Container Technology Solution Brief NetApp Docker Volume Plugin* Intel Xeon Processors Intel Ethernet Converged Network Adapters Merging Enterprise Applications with Docker* Container Technology Enabling Scale-out Solutions

More information

GoDocker. A batch scheduling system with Docker containers

GoDocker. A batch scheduling system with Docker containers GoDocker A batch scheduling system with Docker containers Web - http://www.genouest.org/godocker/ Code - https://bitbucket.org/osallou/go-docker Twitter - #godocker Olivier Sallou IRISA - 2016 CC-BY-SA

More information

Microservice Powered Orchestration

Microservice Powered Orchestration Microservice Powered Orchestration Huabing Zhao ZTE, System Engineer, Network Management &, OPEN-O Common PTL zhao.huabing@zte.com.cn Zhaoxing Meng ZTE, NFV&SDN Architect, Network Management &, OPEN-O

More information

FUJITSU Software ServerView Cloud Monitoring Manager V1.1. CMM Operator's Guide

FUJITSU Software ServerView Cloud Monitoring Manager V1.1. CMM Operator's Guide FUJITSU Software ServerView Cloud Monitoring Manager V1.1 CMM Operator's Guide J2UL-2076-02ENZ0(00) July 2016 Contents Contents About this Manual... 5 1 Introduction... 8 1.1 Basic Usage Scenario... 9

More information

Information Security Policy

Information Security Policy Information Security Policy Information Security is a top priority for Ardoq, and we also rely on the security policies and follow the best practices set forth by AWS. Procedures will continuously be updated

More information

Federated Prometheus Monitoring at Scale

Federated Prometheus Monitoring at Scale Federated Prometheus Monitoring at Scale LungChih Tung Oath Nandhakumar Venkatachalam Oath Team Core Platform Team powering all Yahoo Media Products Yahoo Media Products Homepage, News Finance Sports,

More information

Module Day Topic. 1 Definition of Cloud Computing and its Basics

Module Day Topic. 1 Definition of Cloud Computing and its Basics Module Day Topic 1 Definition of Cloud Computing and its Basics 1 2 3 1. How does cloud computing provides on-demand functionality? 2. What is the difference between scalability and elasticity? 3. What

More information

How we built a highly scalable Machine Learning platform using Apache Mesos

How we built a highly scalable Machine Learning platform using Apache Mesos How we built a highly scalable Machine Learning platform using Apache Mesos Daniel Sârbe Development Manager, BigData and Cloud Machine Translation @ SDL Co-founder of BigData/DataScience Meetup Cluj,

More information

CONTAINERS AND MICROSERVICES WITH CONTRAIL

CONTAINERS AND MICROSERVICES WITH CONTRAIL CONTAINERS AND MICROSERVICES WITH CONTRAIL Scott Sneddon Sree Sarva DP Ayyadevara Sr. Director Sr. Director Director Cloud and SDN Contrail Solutions Product Line Management This statement of direction

More information

Monitor your containers with the Elastic Stack. Monica Sarbu

Monitor your containers with the Elastic Stack. Monica Sarbu Monitor your containers with the Elastic Stack Monica Sarbu Monica Sarbu Team lead, Beats team monica@elastic.co 3 Monitor your containers with the Elastic Stack Elastic Stack 5 Beats are lightweight shippers

More information

Data Sheet. Monitoring Automation for Web-Scale Networks MONITORING AUTOMATION FOR WEB-SCALE NETWORKS -

Data Sheet. Monitoring Automation for Web-Scale Networks MONITORING AUTOMATION FOR WEB-SCALE NETWORKS - Data Sheet Monitoring Automation for Web-Scale Networks CLOUD-BASED MONITORING AUTOMATION FOR WEB-SCALE NETWORKS NetSpyGlass (NSG) is cloud-based, network monitoring automation for network operators seeking

More information

Microservices Lessons Learned From a Startup Perspective

Microservices Lessons Learned From a Startup Perspective Microservices Lessons Learned From a Startup Perspective Susanne Kaiser @suksr CTO at Just Software @JustSocialApps Each journey is different People try to copy Netflix, but they can only copy what they

More information

/ Cloud Computing. Recitation 5 September 26 th, 2017

/ Cloud Computing. Recitation 5 September 26 th, 2017 15-319 / 15-619 Cloud Computing Recitation 5 September 26 th, 2017 1 Overview Administrative issues Office Hours, Piazza guidelines Last week s reflection Project 2.1, OLI Unit 2 modules 5 and 6 This week

More information

Professional PostgreSQL monitoring made easy. Kaarel Moppel Kaarel Moppel

Professional PostgreSQL monitoring made easy. Kaarel Moppel  Kaarel Moppel Professional PostgreSQL monitoring made easy Kaarel Moppel Kaarel Moppel Why to monitor Failure / Downtime detection Slowness / Performance analysis Proactive predictions Maybe wasting money? Kaarel Moppel

More information

Think Small to Scale Big

Think Small to Scale Big Think Small to Scale Big Intro to Containers for the Datacenter Admin Pete Zerger Principal Program Manager, MVP pete.zerger@cireson.com Cireson Lee Berg Blog, e-mail address, title Company Pete Zerger

More information

Overview SENTINET 3.1

Overview SENTINET 3.1 Overview SENTINET 3.1 Overview 1 Contents Introduction... 2 Customer Benefits... 3 Development and Test... 3 Production and Operations... 4 Architecture... 5 Technology Stack... 7 Features Summary... 7

More information

OpenStack Magnum Pike and the CERN cloud. Spyros

OpenStack Magnum Pike and the CERN cloud. Spyros OpenStack Magnum Pike and the CERN cloud Spyros Trigazis @strigazi OpenStack Magnum OpenStack Magnum #openstack-containers Kubernetes, Docker Swarm, Apache Mesos, DC/OS (experimental) aas Deep integration

More information