Monitor Cassandra audit log

Size: px
Start display at page:

Download "Monitor Cassandra audit log"

Transcription

1 Monitor Cassandra audit log This is a tutorial about how to create a new eagle application step by step, though it is using cassandra query monitoring as example, but it could be extended to any log-based monitoring cases. 1. Flow Stream into Kafka 2. Startup Eagle Server 3. Create New Monitoring Application 4. Start Monitoring Topology 5. Validate with Sample Policy a. Define sample policy with eagle b. Trigger alert with sample event c. Review generated alert 1. Flow Stream into Kafka Raw Query Log Sample: host:/ source:/ user:cassandra timestamp: category:query type:cql_select ks:dg_keyspace cf: customer_details operation:select id,name, ,contact from customer_details masked_columns:[ ,contact] other_columns:[id,name] Create Kafka topic: cassandra_querylog_sandbox $KAFKA_HOME/bin/kafka-topics.sh --create --topic cassandra_querylog_sandbox --zookeeper localhost: partitions 1 -- replication-factor 1 Parsed Query Log and Flow into Kafka "host": "/ ", "source": "/ ", "user": "jaspa", "timestamp": , "category": "QUERY", "type": "CQL_SELECT", "ks": "dg_keyspace", "cf": "customer_details", "operation": "CQL_SELECT", "masked_columns": "bank ccno ip name sal ssn tel url", "other_columns": "id npi" 2. Startup Eagle Server

2 $EAGLE_HOME/bin/eagle-service.sh start 3. Create New Monitoring Application Site: sandbox Application: Group: Cassandra Name: cassandraquerylog Source Stream: Name: cassandraquerylogstream Executor: cassandraquerylogexecutor Schema: cf string query column family ks string query keyspace host string the host that current metric comes form type string query type user string query user category string query category timestamp long query timestamp masked_columns string query masked_columns operation string query operation other_columns string query other_columns source string source host Here is the script for defining the metadata CassandraQueryLogMonitorMetadata.sh # EAGLE_SERVICE_HOST, default is `hostname -f`

3 export EAGLE_SERVICE_HOST=localhost # EAGLE_SERVICE_PORT, default is 9099 export EAGLE_SERVICE_PORT=9099 # EAGLE_SERVICE_USER export EAGLE_SERVICE_USER=admin # EAGLE_SERVICE_PASSWORD export EAGLE_SERVICE_PASSWD=secret curl -u $EAGLE_SERVICE_USER:$EAGLE_SERVICE_PASSWD -X POST -H 'Content- Type:application/json' \ " /entities?servicename=siteapplicationservice" \ -d ' [ "tags": "site":"sandbox", "application":"cassandraquerylog" "enabled": true, "config": "" ] ' curl -u $EAGLE_SERVICE_USER:$EAGLE_SERVICE_PASSWD -X POST -H 'Content- Type:application/json' \ " /entities?servicename=applicationdescservice" \ -d ' [ "tags": "application":"cassandraquerylog" "description":"cassandra Query Log Monitoring", "alias":"querylogmonitor", "groupname":"cassandra", "config":"", "features":["common","metadata"] ] ' ## AlertStreamService echo "" echo "Importing AlertStreamService for HDFS... " curl -u $EAGLE_SERVICE_USER:$EAGLE_SERVICE_PASSWD -X POST -H 'Content- Type:application/json' \ " /entities?servicename=alertstreamservice" \ -d ' [ "tags": "application":"cassandraquerylog",

4 "streamname":"cassandraquerylogstream" "description":"cassandra query log data source stream" ] ' ## AlertExecutorService: what alert streams are consumed by alert executor echo "" echo "Importing AlertExecutorService for HDFS... " curl -u $EAGLE_SERVICE_USER:$EAGLE_SERVICE_PASSWD -X POST -H 'Content- Type:application/json' \ " /entities?servicename=alertexecutorservice" \ -d ' [ "tags": "application":"cassandraquerylog", "alertexecutorid":"cassandraquerylogexecutor", "streamname":"cassandraquerylogstream" "description":"executor for cassandra query log stream" ] ' ## AlertStreamSchemaService: schema for event from alert stream echo "" echo "Importing AlertStreamSchemaService for HDFS... " curl -u $EAGLE_SERVICE_USER:$EAGLE_SERVICE_PASSWD -X POST -H 'Content- Type:application/json' \ " /entities?servicename=alertstreamschemaservice" \ -d ' [ "tags": "attrname": "host" "attrdescription": "the host that current metric comes form", "tags": "attrname": "source" "attrdescription": "source host",

5 "tags": "attrname": "user" "attrdescription": "query user", "tags": "attrname": "timestamp" "attrdescription": "query timestamp", "attrtype": "long", "tags": "attrname": "category" "attrdescription": "query category", "tags": "attrname": "type" "attrdescription": "query type", "tags": "attrname": "ks"

6 "attrdescription": "query keyspace", "tags": "attrname": "cf" "attrdescription": "query column family", "tags": "attrname": "operation" "attrdescription": "query operation", "tags": "attrname": "masked_columns" "attrdescription": "query masked_columns", "tags": "attrname": "other_columns" "attrdescription": "query other_columns", ] ' ## Finished echo ""

7 echo "Finished initialization for eagle topology" After successfully loading above metadata, the new application of Cassandra Query Monitoring Application could be found in Eagle Web after refreshing the page as following: 4. Start Monitoring Topology 1. Create a new topology configuration file named "cassandra-querylog-sandbox.conf" to connect source kafka topic: cassandra_querylog_sandbox as following: cassandra-querylog-sandbox.conf # Licensed to the Apache Software Foundation (ASF) under one or more # contributor license agreements. See the NOTICE file distributed with # this work for additional information regarding copyright ownership. # The ASF licenses this file to You under the Apache License, Version 2.0 # (the "License"); you may not use this file except in compliance with # the License. You may obtain a copy of the License at # # # # Unless required by applicable law or agreed to in writing, software # distributed under the License is distributed on an "AS IS" BASIS, # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. # See the License for the specific language governing permissions and # limitations under the License. "envcontextconfig" : "env" : "storm", "mode" : "cluster", "topologyname" : "cassandrawquerylogmonitortopology",

8 "stormconfigfile" : "cassandraw-querlog-storm.yaml", "parallelismconfig" : "cassandraquerylogstream" : 1, "cassandraquerylogexecutor*" : 1 "datasourceconfig": "topic" : "cassandra_querylog_sandbox", "zkconnection" : "sandbox.hortonworks.com:2181", "zkconnectiontimeoutms" : 15000, "consumergroupid" : "eagle.consumer", "fetchsize" : , "deserializerclass" : "org.apache.eagle.datastream.storm. JsonMessageDeserializer", "transactionzkservers" : "sandbox.hortonworks.com", "transactionzkport" : 2181, "transactionzkroot" : "/consumers", "transactionstateupdatems" : 2000 "alertexecutorconfigs" : "cassandraquerylogexecutor" : "parallelism" : 1, "partitioner" : "org.apache.eagle.policy. DefaultPolicyPartitioner" "needvalidation" : "true" "eagleprops" : "site" : "sandbox", "datajoinpollintervalsec" : 30, "mailhost" : "mailhost.com", "mailsmtpport":"25", "maildebug" : "true", "balancepartitionenabled" : true, #"partitionrefreshintervalinmin" : 60, #"kafkastatisticrangeinmin" : 60, "eagleservice": "host": "localhost", "port": 9099, "username": "admin", "password": "secret" "readhdfsusercommandpatternfrom" : "file" "dynamicconfigsource" : "enabled" : true, "initdelaymillis" : 0, "delaymillis" : 30000

9 2. Start monitoring topology./bin/kafka-stream-monitor.sh cassandraquerylogstream cassandraquerylogexecutor cassandra-querylog-sandbox.conf 5. Validate with Sample Policy a. Define sample policy with eagle from cassandraquerylogstream[(ks == 'dg_keyspace') and (cf == 'customer_details')] select * insert into outputstream; b. Trigger alert with sample event $ $KAFKA_HOME/bin/kafka-console-producer.sh --topic cassandra_querylog_sandbox --broker-list sandbox.hortonworks.com:6667 "host": "/ ","source": "/ ","user": "jaspa", "timestamp": , "category": "QUERY", "type": "CQL_SELECT", "ks": "dg_keyspace","cf": "customer_details","operation": "CQL_SELECT","masked_columns": "bank ccno ip name sal ssn tel url"," other_columns": "id npi" c. Review generated alert

10

Hortonworks Data Platform

Hortonworks Data Platform Data Governance () docs.hortonworks.com : Data Governance Copyright 2012-2017 Hortonworks, Inc. Some rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform

More information

Configuring NiFi Authentication and Proxying with Apache Knox

Configuring NiFi Authentication and Proxying with Apache Knox 3 Configuring NiFi Authentication and Proxying with Apache Knox Date of Publish: 2018-08-13 http://docs.hortonworks.com Contents...3 Configuring NiFi for Knox Authentication... 3 Configuring Knox for NiFi...

More information

Hortonworks Data Platform

Hortonworks Data Platform Data Governance () docs.hortonworks.com : Data Governance Copyright 2012-2017 Hortonworks, Inc. Some rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform

More information

Source, Sink, and Processor Configuration Values

Source, Sink, and Processor Configuration Values 3 Source, Sink, and Processor Configuration Values Date of Publish: 2018-12-18 https://docs.hortonworks.com/ Contents... 3 Source Configuration Values...3 Processor Configuration Values... 5 Sink Configuration

More information

Let the data flow! Data Streaming & Messaging with Apache Kafka Frank Pientka. Materna GmbH

Let the data flow! Data Streaming & Messaging with Apache Kafka Frank Pientka. Materna GmbH Let the data flow! Data Streaming & Messaging with Apache Kafka Frank Pientka Wer ist Frank Pientka? Dipl.-Informatiker (TH Karlsruhe) Verheiratet, 2 Töchter Principal Software Architect in Dortmund Fast

More information

1 Big Data Hadoop. 1. Introduction About this Course About Big Data Course Logistics Introductions

1 Big Data Hadoop. 1. Introduction About this Course About Big Data Course Logistics Introductions Big Data Hadoop Architect Online Training (Big Data Hadoop + Apache Spark & Scala+ MongoDB Developer And Administrator + Apache Cassandra + Impala Training + Apache Kafka + Apache Storm) 1 Big Data Hadoop

More information

Hortonworks Data Platform

Hortonworks Data Platform Data Governance () docs.hortonworks.com : Data Governance Copyright 2012-2017 Hortonworks, Inc. Some rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform

More information

Apache Directory Studio Apache DS. User's Guide

Apache Directory Studio Apache DS. User's Guide Apache Directory Studio Apache DS User's Guide Apache Directory Studio Apache DS: User's Guide Version 2.0.0.v20180908-M14 Copyright 2006-2018 Apache Software Foundation Licensed to the Apache Software

More information

Installing Apache Atlas

Installing Apache Atlas 3 Installing Apache Atlas Date of Publish: 2018-04-01 http://docs.hortonworks.com Contents Apache Atlas prerequisites... 3 Migrating Atlas metadata when upgrading to HDP-3.0+... 3 Overview... 3 Migrate

More information

Apache Directory Studio. User's Guide

Apache Directory Studio. User's Guide Apache Directory Studio User's Guide Apache Directory Studio: User's Guide Version 2.0.0.v20180908-M14 Copyright 2006-2018 Apache Software Foundation Licensed to the Apache Software Foundation (ASF) under

More information

bin/kafka-preferred-replica-election.sh --zookeeper localhost:12913/kafka --path-to-json-file topicpartitionlist.json

bin/kafka-preferred-replica-election.sh --zookeeper localhost:12913/kafka --path-to-json-file topicpartitionlist.json Replication tools 1. Preferred Replica Leader Election Tool FAQ What happens if the preferred replica is not in the ISR? How to find if all the partitions have been moved to the "preferred replica" after

More information

Bitnami Kafka for Huawei Enterprise Cloud

Bitnami Kafka for Huawei Enterprise Cloud Bitnami Kafka for Huawei Enterprise Cloud Description Apache Kafka is publish-subscribe messaging rethought as a distributed commit log. How to start or stop the services? Each Bitnami stack includes a

More information

Innovatus Technologies

Innovatus Technologies HADOOP 2.X BIGDATA ANALYTICS 1. Java Overview of Java Classes and Objects Garbage Collection and Modifiers Inheritance, Aggregation, Polymorphism Command line argument Abstract class and Interfaces String

More information

rkafka rkafka is a package created to expose functionalities provided by Apache Kafka in the R layer. Version 1.1

rkafka rkafka is a package created to expose functionalities provided by Apache Kafka in the R layer. Version 1.1 rkafka rkafka is a package created to expose functionalities provided by Apache Kafka in the R layer. Version 1.1 Wednesday 28 th June, 2017 rkafka Shruti Gupta Wednesday 28 th June, 2017 Contents 1 Introduction

More information

Data Governance Overview

Data Governance Overview 3 Data Governance Overview Date of Publish: 2018-04-01 http://docs.hortonworks.com Contents Apache Atlas Overview...3 Apache Atlas features...3...4 Apache Atlas Overview Apache Atlas Overview Apache Atlas

More information

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : PH NO: 9963799240, 040-40025423

More information

Apache Ranger User Guide

Apache Ranger User Guide Apache Ranger 0.5 - User Guide USER GUIDE Version : 0.5.0 September 2015 About this document Getting started General Features Login to the system: Log out to the system: Service Manager (Access Manager)

More information

UMP Alert Engine. Status. Requirements

UMP Alert Engine. Status. Requirements UMP Alert Engine Status Requirements Goal Terms Proposed Design High Level Diagram Alert Engine Topology Stream Receiver Stream Router Policy Evaluator Alert Publisher Alert Topology Detail Diagram Alert

More information

Big Data Syllabus. Understanding big data and Hadoop. Limitations and Solutions of existing Data Analytics Architecture

Big Data Syllabus. Understanding big data and Hadoop. Limitations and Solutions of existing Data Analytics Architecture Big Data Syllabus Hadoop YARN Setup Programming in YARN framework j Understanding big data and Hadoop Big Data Limitations and Solutions of existing Data Analytics Architecture Hadoop Features Hadoop Ecosystem

More information

Lenses 2.1 Enterprise Features PRODUCT DATA SHEET

Lenses 2.1 Enterprise Features PRODUCT DATA SHEET Lenses 2.1 Enterprise Features PRODUCT DATA SHEET 1 OVERVIEW DataOps is the art of progressing from data to value in seconds. For us, its all about making data operations as easy and fast as using the

More information

Hortonworks DataFlow

Hortonworks DataFlow Getting Started with Streaming Analytics () docs.hortonworks.com : Getting Started with Streaming Analytics Copyright 2012-2018 Hortonworks, Inc. Some rights reserved. Except where otherwise noted, this

More information

Big Data Architect.

Big Data Architect. Big Data Architect www.austech.edu.au WHAT IS BIG DATA ARCHITECT? A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional

More information

Policy Manager for IBM WebSphere DataPower 7.2: Configuration Guide

Policy Manager for IBM WebSphere DataPower 7.2: Configuration Guide Policy Manager for IBM WebSphere DataPower 7.2: Configuration Guide Policy Manager for IBM WebSphere DataPower Configuration Guide SOAPMDP_Config_7.2.0 Copyright Copyright 2015 SOA Software, Inc. All rights

More information

Creating a Multi-Container Pod

Creating a Multi-Container Pod CHAPTER 13 Creating a Multi-Container Pod A Pod is the atomic unit of an application managed by Kubernetes. A Pod has a single filesystem and IP Address; the containers in the Pod share the filesystem

More information

Apache Karaf Cave 4.x - Documentation

Apache Karaf Cave 4.x - Documentation Apache Karaf Cave 4.x - Documentation Apache Software Foundation Apache Karaf Cave 4.x - Documentation Overview User Guide 1. Installation 1.1. Pre-installation requirements 1.2. Registration of the Apache

More information

Building Event Driven Architectures using OpenEdge CDC Richard Banville, Fellow, OpenEdge Development Dan Mitchell, Principal Sales Engineer

Building Event Driven Architectures using OpenEdge CDC Richard Banville, Fellow, OpenEdge Development Dan Mitchell, Principal Sales Engineer Building Event Driven Architectures using OpenEdge CDC Richard Banville, Fellow, OpenEdge Development Dan Mitchell, Principal Sales Engineer October 26, 2018 Agenda Change Data Capture (CDC) Overview Configuring

More information

DC/OS Metrics. (formerly known as Project Ambrose) Application and Resource Metrics in DC/OS Enterprise. Nick Parker at..

DC/OS Metrics. (formerly known as Project Ambrose) Application and Resource Metrics in DC/OS Enterprise. Nick Parker at.. DC/OS Metrics (formerly known as Project Ambrose) Application and Resource Metrics in DC/OS Enterprise Nick Parker at.. 1 Introduction Nick Parker DC/OS Slack: chat.dcos.io DC/OS Mailing List: users@dcos.io

More information

Symantec Enterprise Security Manager Baseline Policy Manual for CIS Benchmark. AIX 5.3 and 6.1

Symantec Enterprise Security Manager Baseline Policy Manual for CIS Benchmark. AIX 5.3 and 6.1 Symantec Enterprise Security Manager Baseline Policy Manual for CIS Benchmark AIX 5.3 and 6.1 Symantec Enterprise Security Manager Baseline Policy Manual for CIS Benchmark for AIX 5.3 and 6.1 The software

More information

Hortonworks Cybersecurity Package

Hortonworks Cybersecurity Package Tuning Guide () docs.hortonworks.com Hortonworks Cybersecurity : Tuning Guide Copyright 2012-2018 Hortonworks, Inc. Some rights reserved. Hortonworks Cybersecurity (HCP) is a modern data application based

More information

IMPLEMENTING A LAMBDA ARCHITECTURE TO PERFORM REAL-TIME UPDATES

IMPLEMENTING A LAMBDA ARCHITECTURE TO PERFORM REAL-TIME UPDATES IMPLEMENTING A LAMBDA ARCHITECTURE TO PERFORM REAL-TIME UPDATES by PRAMOD KUMAR GUDIPATI B.E., OSMANIA UNIVERSITY (OU), INDIA, 2012 A REPORT submitted in partial fulfillment of the requirements of the

More information

Setup Watson Explorer Fundational Components on Linux

Setup Watson Explorer Fundational Components on Linux Setup Watson Explorer 11.0.0.1 Fundational Components on Linux Abstract This guide are my step by step guide to setup WEX 11.0.0.1 on one machine. This virtual machine are Linux CentOS 6.7 and will be

More information

APACHE COTTON. MySQL on Mesos. Yan Xu xujyan

APACHE COTTON. MySQL on Mesos. Yan Xu xujyan APACHE COTTON MySQL on Mesos Yan Xu xujyan 1 SHORT HISTORY Mesos: cornerstone of Twitter s compute platform. MySQL: backbone of Twitter s data platform. Mysos: started as a hackweek project @twitter. Apache

More information

Confluent Developer Training for Apache Kafka Exercise Manual B7/801/A

Confluent Developer Training for Apache Kafka Exercise Manual B7/801/A Confluent Developer Training for Apache Kafka Exercise Manual B7/801/A Table of Contents Introduction................................................................ 1 Hands-On Exercise: Using Kafka s

More information

Kafka pours and Spark resolves! Alexey Zinovyev, Java/BigData Trainer in EPAM

Kafka pours and Spark resolves! Alexey Zinovyev, Java/BigData Trainer in EPAM Kafka pours and Spark resolves! Alexey Zinovyev, Java/BigData Trainer in EPAM With IT since 2007 With Java since 2009 With Hadoop since 2012 With Spark since 2014 With EPAM since 2015 About Contacts E-mail

More information

Hortonworks DataFlow Sam Lachterman Solutions Engineer

Hortonworks DataFlow Sam Lachterman Solutions Engineer Hortonworks DataFlow Sam Lachterman Solutions Engineer 1 Hortonworks Inc. 2011 2017. All Rights Reserved Disclaimer This document may contain product features and technology directions that are under development,

More information

Oracle Fusion Middleware Oracle Stream Analytics Install Guide for Hadoop 2.7 and Higher

Oracle Fusion Middleware Oracle Stream Analytics Install Guide for Hadoop 2.7 and Higher Oracle Fusion Middleware Oracle Stream Analytics Install Guide for Hadoop 2.7 and Higher 18.1.0.0.0 E93122-01 May 2018 Oracle Fusion Middleware Oracle Stream Analytics Install Guide for Hadoop 2.7 and

More information

Oracle Fusion Middleware

Oracle Fusion Middleware Oracle Fusion Middleware User's Guide for Oracle Business Intelligence Data Warehouse Administration Console 11g Release 1 (11.1.1) E14849-06 November 2012 Explains how to use the Data Warehouse Administration

More information

Symantec Enterprise Security Manager Baseline Policy Manual for CIS Benchmark. For Red Hat Enterprise Linux 5

Symantec Enterprise Security Manager Baseline Policy Manual for CIS Benchmark. For Red Hat Enterprise Linux 5 Symantec Enterprise Security Manager Baseline Policy Manual for CIS Benchmark For Red Hat Enterprise Linux 5 Symantec ESM Baseline Policy Manual for CIS Benchmark for Red Hat Enterprise Linux 5 The software

More information

Using PCF Ops Manager to Deploy Hyperledger Fabric

Using PCF Ops Manager to Deploy Hyperledger Fabric Using PCF Ops Manager to Deploy Hyperledger Fabric By VMware Introduction Pivotal Cloud Foundry Operations Manager (PCF Ops Manager for short) is a graphical interface used to configure and deploy various

More information

Introducing Kafka Connect. Large-scale streaming data import/export for

Introducing Kafka Connect. Large-scale streaming data import/export for Introducing Kafka Connect Large-scale streaming data import/export for Kafka @tlberglund My Secret Agenda 1. Review of Kafka 2. Why do we need Connect? 3. How does Connect work? 4. Tell me about these

More information

Delving Deep into Hadoop Course Contents Introduction to Hadoop and Architecture

Delving Deep into Hadoop Course Contents Introduction to Hadoop and Architecture Delving Deep into Hadoop Course Contents Introduction to Hadoop and Architecture Hadoop 1.0 Architecture Introduction to Hadoop & Big Data Hadoop Evolution Hadoop Architecture Networking Concepts Use cases

More information

Cloud Stream Service. User Guide. Issue 18 Date HUAWEI TECHNOLOGIES CO., LTD.

Cloud Stream Service. User Guide. Issue 18 Date HUAWEI TECHNOLOGIES CO., LTD. Issue 18 Date 2018-11-30 HUAWEI TECHNOLOGIES CO., LTD. Copyright Huawei Technologies Co., Ltd. 2018. All rights reserved. No part of this document may be reproduced or transmitted in any form or by any

More information

Migrating massive monitoring to Bigtable without downtime. Martin Parm, Infrastructure Engineer for Monitoring

Migrating massive monitoring to Bigtable without downtime. Martin Parm, Infrastructure Engineer for Monitoring Migrating massive monitoring to Bigtable without downtime Martin Parm, Infrastructure Engineer for Monitoring This is a big deal. -- Nicholas Harteau/VP, Engineering & Infrastructure https://news.spotify.com/dk/2016/02/23/announcing-spotify-infrastructures-googley-future/

More information

Issues Fixed in DC/OS

Issues Fixed in DC/OS Release Notes for 1.10.4 These are the release notes for DC/OS 1.10.4. DOWNLOAD DC/OS OPEN SOURCE Issues Fixed in DC/OS 1.10.4 CORE-1375 - Docker executor does not hang due to lost messages. DOCS-2169

More information

User Agent Preparing the Windows Environment and Installing the User Agent. How-To

User Agent Preparing the Windows Environment and Installing the User Agent. How-To User Agent 1.1.5 Preparing the Windows Environment and Installing the User Agent How-To CONTENTS Introduction to the User Agent... 2 Selecting the User Account to Run the User Agent... 3 Setting Access

More information

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION

More information

WeChat Adobe Campaign Integration - User Guide

WeChat Adobe Campaign Integration - User Guide WeChat Adobe Campaign Integration - User Guide Table of Contents 1. Verticurl App Account Creation... 1 2. Configuration Setup in Verticurl App... 2 3. Configure QR Code Service... 3 3.1 QR code service

More information

Sizing Guidelines and Performance Tuning for Intelligent Streaming

Sizing Guidelines and Performance Tuning for Intelligent Streaming Sizing Guidelines and Performance Tuning for Intelligent Streaming Copyright Informatica LLC 2017. Informatica and the Informatica logo are trademarks or registered trademarks of Informatica LLC in the

More information

ESET REMOTE ADMINISTRATOR PLUG-IN FOR KASEYA. Technical Setup and User Guide

ESET REMOTE ADMINISTRATOR PLUG-IN FOR KASEYA. Technical Setup and User Guide ESET REMOTE ADMINISTRATOR PLUG-IN FOR KASEYA Technical Setup and User Guide ESET REMOTE ADMINISTRATOR PLUG-INFOR KASEYA Copyright 2014 by ESET, spol. s r.o. ESET REMOTE ADMINISTRATOR Plug-in FOR KASEYA

More information

Understanding Parsers

Understanding Parsers HCP 1 Date of Publish: 2018-10-15 http://docs.hortonworks.com Contents 3 Java Parsers..3 General Purpose Parsers 3 Parser Configuration 4 Example: fieldtransformation Configuration 5 HCP Parsers are pluggable

More information

ExtraHop 6.0 ExtraHop REST API Guide

ExtraHop 6.0 ExtraHop REST API Guide ExtraHop 6.0 ExtraHop REST API Guide 2018 ExtraHop Networks, Inc. All rights reserved. This manual in whole or in part, may not be reproduced, translated, or reduced to any machinereadable form without

More information

Auto Management for Apache Kafka and Distributed Stateful System in General

Auto Management for Apache Kafka and Distributed Stateful System in General Auto Management for Apache Kafka and Distributed Stateful System in General Jiangjie (Becket) Qin Data Infrastructure @LinkedIn GIAC 2017, 12/23/17@Shanghai Agenda Kafka introduction and terminologies

More information

StorageTapper. Real-time MySQL Change Data Uber. Ovais Tariq, Shriniket Kale & Yevgeniy Firsov. October 03, 2017

StorageTapper. Real-time MySQL Change Data Uber. Ovais Tariq, Shriniket Kale & Yevgeniy Firsov. October 03, 2017 StorageTapper Real-time MySQL Change Data Streaming @ Uber Ovais Tariq, Shriniket Kale & Yevgeniy Firsov October 03, 2017 Overview What we will cover today Background & Motivation High Level Features System

More information

Data Acquisition. The reference Big Data stack

Data Acquisition. The reference Big Data stack Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Data Acquisition Corso di Sistemi e Architetture per Big Data A.A. 2016/17 Valeria Cardellini The reference

More information

Altiris IT Analytics Solution 7.1 from Symantec User Guide

Altiris IT Analytics Solution 7.1 from Symantec User Guide Altiris IT Analytics Solution 7.1 from Symantec User Guide Altiris IT Analytics Solution 7.1 from Symantec User Guide The software described in this book is furnished under a license agreement and may

More information

Microsoft Exchange Server 2007 Implementation and Maintenance

Microsoft Exchange Server 2007 Implementation and Maintenance Microsoft Exchange Server 2007 Implementation and Maintenance Chapter 1 Exchange Server 2007 Deployment 1.1 Overview, Hardware & Editions 1.2 Exchange Server, Windows & Active Directory 1.3 Administration

More information

In dropwizard metric framework, it use separate thread to scan the metrics map and do flush at a fixed interval.

In dropwizard metric framework, it use separate thread to scan the metrics map and do flush at a fixed interval. Runtime statistics As a monitoring system, user may ask question like: Is the system healthy? what's the heat point of the system? How many events is processed per second? Is the data well partitioned

More information

DATA SCIENCE USING SPARK: AN INTRODUCTION

DATA SCIENCE USING SPARK: AN INTRODUCTION DATA SCIENCE USING SPARK: AN INTRODUCTION TOPICS COVERED Introduction to Spark Getting Started with Spark Programming in Spark Data Science with Spark What next? 2 DATA SCIENCE PROCESS Exploratory Data

More information

Chat Channels Via Kafka (with Grading Notes)

Chat Channels Via Kafka (with Grading Notes) SE424: Distributed Systems Assignment 2 Semester 1 5778 Due: 10 Jan 2017 Chat Channels Via Kafka (with Grading Notes) In this assignment, we ll take the work from the previous assignment on channelized

More information

Apache Storm. Hortonworks Inc Page 1

Apache Storm. Hortonworks Inc Page 1 Apache Storm Page 1 What is Storm? Real time stream processing framework Scalable Up to 1 million tuples per second per node Fault Tolerant Tasks reassigned on failure Guaranteed Processing At least once

More information

Kafka Integration User Guide. Functional Area: Kafka Integration. Geneos Release: v4.7. Document Version: v1.0.0

Kafka Integration User Guide. Functional Area: Kafka Integration. Geneos Release: v4.7. Document Version: v1.0.0 Kafka Integration User Guide Functional Area: Kafka Integration Geneos Release: v4.7 Document Version: v1.0.0 Date Published: 23 July 2018 Copyright 2018. ITRS Group Ltd. All rights reserved. Information

More information

Installation of Actiheart Data Analysis Suite:

Installation of Actiheart Data Analysis Suite: Installation of Actiheart Data Analysis Suite: Currently software is only compatible with XP platform and version 6 of Java. Installation requires: - Windows XP platform - MySQL installation - Folders:

More information

Apache Kafka a system optimized for writing. Bernhard Hopfenmüller. 23. Oktober 2018

Apache Kafka a system optimized for writing. Bernhard Hopfenmüller. 23. Oktober 2018 Apache Kafka...... a system optimized for writing Bernhard Hopfenmüller 23. Oktober 2018 whoami Bernhard Hopfenmüller IT Consultant @ ATIX AG IRC: Fobhep github.com/fobhep whoarewe The Linux & Open Source

More information

RSA NetWitness Logs. Salesforce. Event Source Log Configuration Guide. Last Modified: Wednesday, February 14, 2018

RSA NetWitness Logs. Salesforce. Event Source Log Configuration Guide. Last Modified: Wednesday, February 14, 2018 RSA NetWitness Logs Event Source Log Configuration Guide Salesforce Last Modified: Wednesday, February 14, 2018 Event Source Product Information: Vendor: Salesforce Event Source: CRM Versions: API v1.0

More information

Configuration Guide. Requires Vorex version 3.9 or later and VSA version or later. English

Configuration Guide. Requires Vorex version 3.9 or later and VSA version or later. English Kaseya v2 Integration of VSA with Vorex Configuration Guide Requires Vorex version 3.9 or later and VSA version 9.3.0.11 or later English September 15, 2017 Copyright Agreement The purchase and use of

More information

FUJITSU Cloud Service K5. API User Guide. Version 1.1 FUJITSU LIMITED

FUJITSU Cloud Service K5. API User Guide. Version 1.1 FUJITSU LIMITED FUJITSU Cloud Service K5 API User Guide Version 1.1 FUJITSU LIMITED Preface Purpose of This Document This guide describes preparation of the environment for services, which are used with API (Application

More information

Working with Database Connections. Version: 18.1

Working with Database Connections. Version: 18.1 Working with Database Connections Version: 18.1 Copyright 2018 Intellicus Technologies This document and its content is copyrighted material of Intellicus Technologies. The content may not be copied or

More information

LAB EXERCISE: RedHat OpenShift with Contrail 5.0

LAB EXERCISE: RedHat OpenShift with Contrail 5.0 LAB EXERCISE: RedHat OpenShift with Contrail 5.0 Author Savithru Lokanath , v1.4 Prerequisites An AWS account with permissions to CRUD Compute, Storage & Network resources CentOS

More information

DefendX Software Control-Audit for Hitachi Installation Guide

DefendX Software Control-Audit for Hitachi Installation Guide DefendX Software Control-Audit for Hitachi Installation Guide Version 4.1 This guide details the method for the installation and initial configuration of DefendX Software Control-Audit for NAS, Hitachi

More information

Kafka Connect the Dots

Kafka Connect the Dots Kafka Connect the Dots Building Oracle Change Data Capture Pipelines With Kafka Mike Donovan CTO Dbvisit Software Mike Donovan Chief Technology Officer, Dbvisit Software Multi-platform DBA, (Oracle, MSSQL..)

More information

Deploying SQL Stream Processing in Kubernetes with Ease

Deploying SQL Stream Processing in Kubernetes with Ease Deploying SQL Stream Processing in Kubernetes with Ease Andrew Stevenson CTO Landoop Big Data Fast Data Financial Markets andrew@landoop.com www.landoop.com Antonios Chalkiopoulos CEO Landoop Big Data

More information

Red Hat CloudForms 4.6

Red Hat CloudForms 4.6 Red Hat CloudForms 4.6 Scanning Container Images in CloudForms with OpenSCAP Configuring OpenSCAP in CloudForms for Scanning Container Images Last Updated: 2018-05-24 Red Hat CloudForms 4.6 Scanning Container

More information

ExtraHop 7.5 ExtraHop REST API Guide

ExtraHop 7.5 ExtraHop REST API Guide ExtraHop 7.5 ExtraHop REST API Guide 2018 ExtraHop Networks, Inc. All rights reserved. This manual in whole or in part, may not be reproduced, translated, or reduced to any machinereadable form without

More information

IBM Financial Transactions Repository Version IBM Financial Transactions Repository Guide IBM

IBM Financial Transactions Repository Version IBM Financial Transactions Repository Guide IBM IBM Financial Transactions Repository Version 2.0.2 IBM Financial Transactions Repository Guide IBM Note Before using this information and the product it supports, read the information in Notices. Product

More information

Building Scalable and Extendable Data Pipeline for Call of Duty Games: Lessons Learned. Yaroslav Tkachenko Senior Data Engineer at Activision

Building Scalable and Extendable Data Pipeline for Call of Duty Games: Lessons Learned. Yaroslav Tkachenko Senior Data Engineer at Activision Building Scalable and Extendable Data Pipeline for Call of Duty Games: Lessons Learned Yaroslav Tkachenko Senior Data Engineer at Activision 1+ PB Data lake size (AWS S3) Number of topics in the biggest

More information

Sandbox Setup Guide for HDP 2.2 and VMware

Sandbox Setup Guide for HDP 2.2 and VMware Waterline Data Inventory Sandbox Setup Guide for HDP 2.2 and VMware Product Version 2.0 Document Version 10.15.2015 2014-2015 Waterline Data, Inc. All rights reserved. All other trademarks are the property

More information

Hadoop. Course Duration: 25 days (60 hours duration). Bigdata Fundamentals. Day1: (2hours)

Hadoop. Course Duration: 25 days (60 hours duration). Bigdata Fundamentals. Day1: (2hours) Bigdata Fundamentals Day1: (2hours) 1. Understanding BigData. a. What is Big Data? b. Big-Data characteristics. c. Challenges with the traditional Data Base Systems and Distributed Systems. 2. Distributions:

More information

Technical Note. System Roles in ZL Unified Archive

Technical Note. System Roles in ZL Unified Archive Technical Note System Roles in ZL Unified Archive Version 8.0.1 March 2015 Copyright 2015 ZL Technologies, Inc. All rights reserved ZL Technologies, Inc. ( ZLTI, formerly known as ZipLip ) and its licensors

More information

f5-icontrol-rest Documentation

f5-icontrol-rest Documentation f5-icontrol-rest Documentation Release 1.3.10 F5 Networks Aug 04, 2018 Contents 1 Overview 1 2 Installation 3 2.1 Using Pip................................................. 3 2.2 GitHub..................................................

More information

Symantec Enterprise Security Manager IBM DB2 Modules User Guide for Windows and UNIX. Version 4.6

Symantec Enterprise Security Manager IBM DB2 Modules User Guide for Windows and UNIX. Version 4.6 Symantec Enterprise Security Manager IBM DB2 Modules User Guide for Windows and UNIX Version 4.6 Symantec Enterprise Security Manager IBM DB2 Modules User Guide The software described in this book is furnished

More information

Symantec Enterprise Security Manager IBM DB2 Modules User Guide for Windows and UNIX. Version 4.2

Symantec Enterprise Security Manager IBM DB2 Modules User Guide for Windows and UNIX. Version 4.2 Symantec Enterprise Security Manager IBM DB2 Modules User Guide for Windows and UNIX Version 4.2 Symantec Enterprise Security Manager IBM DB2 Modules User Guide The software described in this book is furnished

More information

5 reasons why choosing Apache Cassandra is planning for a multi-cloud future

5 reasons why choosing Apache Cassandra is planning for a multi-cloud future White Paper 5 reasons why choosing Apache Cassandra is planning for a multi-cloud future Abstract We have been hearing for several years now that multi-cloud deployment is something that is highly desirable,

More information

Cloudera Manager Quick Start Guide

Cloudera Manager Quick Start Guide Cloudera Manager Guide Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans contained in this

More information

Google Search Appliance

Google Search Appliance Google Search Appliance Configuring GSA Mirroring Google Search Appliance software version 7.2 Google, Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 www.google.com GSA-MIR_100.08 December 2013

More information

Panopticon Designer, Server & Streams Release Notes. Version 17.0

Panopticon Designer, Server & Streams Release Notes. Version 17.0 Panopticon Designer, Server & Streams Release Notes Version 17.0 Datawatch Corporation makes no representation or warranties with respect to the contents of this manual or the associated software and especially

More information

Big Data Development CASSANDRA NoSQL Training - Workshop. November 20 to (5 days) 9 am to 5 pm HOTEL DUBAI GRAND DUBAI

Big Data Development CASSANDRA NoSQL Training - Workshop. November 20 to (5 days) 9 am to 5 pm HOTEL DUBAI GRAND DUBAI Big Data Development CASSANDRA NoSQL Training - Workshop November 20 to 24 2016 (5 days) 9 am to 5 pm HOTEL DUBAI GRAND DUBAI ISIDUS TECH TEAM FZE PO Box 9798 Dubai UAE, email training-coordinator@isidusnet

More information

IBM Tivoli Netcool Performance Manager Big Data Extension1.4.3 Document Revision R2E1. Administering Big Data Extension IBM

IBM Tivoli Netcool Performance Manager Big Data Extension1.4.3 Document Revision R2E1. Administering Big Data Extension IBM IBM Tivoli Netcool Performance Manager Big Data Extension1.4.3 Document Revision R2E1 Administering Big Data Extension IBM Note Before using this information and the product it supports, read the information

More information

Altus Data Engineering

Altus Data Engineering Altus Data Engineering Important Notice 2010-2018 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, and any other product or service names or slogans contained in this document are trademarks

More information

Exam C IBM Cloud Platform Application Development v2 Sample Test

Exam C IBM Cloud Platform Application Development v2 Sample Test Exam C5050 384 IBM Cloud Platform Application Development v2 Sample Test 1. What is an advantage of using managed services in IBM Bluemix Platform as a Service (PaaS)? A. The Bluemix cloud determines the

More information

IBM Data Replication for Big Data

IBM Data Replication for Big Data IBM Data Replication for Big Data Highlights Stream changes in realtime in Hadoop or Kafka data lakes or hubs Provide agility to data in data warehouses and data lakes Achieve minimum impact on source

More information

Hortonworks Cybersecurity Platform

Hortonworks Cybersecurity Platform Installation () docs.hortonworks.com : Installation Copyright 2012-2018 Hortonworks, Inc. Some rights reserved. (HCP) is a modern data application based on Apache Metron, powered by Apache Hadoop, Apache

More information

iway iway Big Data Integrator New Features Bulletin and Release Notes Version DN

iway iway Big Data Integrator New Features Bulletin and Release Notes Version DN iway iway Big Data Integrator New Features Bulletin and Release Notes Version 1.5.1 DN3502232.0517 Active Technologies, EDA, EDA/SQL, FIDEL, FOCUS, Information Builders, the Information Builders logo,

More information

IBM Network Performance Insight Document Revision R2E1. Upgrading Network Performance Insight IBM

IBM Network Performance Insight Document Revision R2E1. Upgrading Network Performance Insight IBM IBM Network Performance Insight 1.2.1 Document Revision R2E1 Upgrading Network Performance Insight IBM Note Before using this information and the product it supports, read the information in Notices on

More information

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development:: Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized

More information

Exporting a Confluence space as a Website

Exporting a Confluence space as a Website Exporting a Confluence space as a Website This how-to explains how to export a Confluence space using the AutoExport Plugin. How it works. What it does. The AutoExport Plugin for Confluence is a plugin

More information

Changing the IP Address and Hostname for Cisco Unified Communications Manager Release 8.6(1)

Changing the IP Address and Hostname for Cisco Unified Communications Manager Release 8.6(1) Changing the IP Address and Hostname for Cisco Unified Communications Manager Release 8.6(1) Published: May 24, 2011 Revised: June 28, 2012 This document provides the steps to change the IP address or

More information

iway Big Data Integrator New Features Bulletin and Release Notes

iway Big Data Integrator New Features Bulletin and Release Notes iway Big Data Integrator New Features Bulletin and Release Notes Version 1.5.2 DN3502232.0717 Active Technologies, EDA, EDA/SQL, FIDEL, FOCUS, Information Builders, the Information Builders logo, iway,

More information

Oracle Cloud Using Oracle Event Hub Cloud Service

Oracle Cloud Using Oracle Event Hub Cloud Service Oracle Cloud Using Oracle Event Hub Cloud Service E78653-16 June 2018 Oracle Cloud Using Oracle Event Hub Cloud Service, E78653-16 Copyright 2017, 2018, Oracle and/or its affiliates. All rights reserved.

More information

Contents. Getting Set Up Contents 2

Contents. Getting Set Up Contents 2 Getting Set Up Contents 2 Contents Getting Set Up... 3 Best Practices...3 Installing the JAR File... 3 Configuring Community Manager Reports...4 Configure the Analytics Database...4 Enable the Analytics

More information

VST Hospital Administrator Guide. Version 2.0.4

VST Hospital Administrator Guide. Version 2.0.4 VST Hospital Administrator Guide Version 2.0.4 Notice Copyright 2002- Vocera Communications, Inc. All rights reserved. Vocera is a registered trademark of Vocera Communications, Inc. This software is licensed,

More information