Streaming Integration and Intelligence For Automating Time Sensitive Events


Ted Fish, Director Sales, Midwest
ted@striim.com
312-330-4929

Striim Executive Summary: Delivering Data for Time-Sensitive Processes & Decisions
- Founded: in 2012 by leaders of GoldenGate Software and BEA/WebLogic
- Lead investors: Summit Partners, Intel Capital, Atlantic Bridge, and Dell
- Customers: deployments in financial services, telco, healthcare, retail, and IoT

Sample Customers
- Financial Services
- Transportation & Logistics
- Telco, Manufacturing
- Retail, High Tech/IoT

Striim Awards @ 2017 Strata Data Conference
- Best Big Data Technology for Real-Time Analytics
- Top 5 Vendors to Watch
- Best IoT Platform
https://www.datanami.com/this-just-in/striim-wins-two-datanami-readers-editors-choice-awards/

Time-Sensitive Processes Across All Industries: What Are Yours?

Financial Services
- Anti-money laundering
- Fraud prevention
- Risk management
- VIP customer service

Healthcare
- Proactive illness detection
- Staff allocation optimization
- Point-of-care compliance
- Eligibility verification

Manufacturing
- Quality management
- Predictive maintenance
- Equipment monitoring
- Capacity optimization

Retail
- Security threat detection
- Real-time offers
- Geo-targeted marketing
- Dynamic pricing

Communications
- Network health monitoring
- Predict network failures
- Proactive service outreach
- Location-based advertising

Transportation/Logistics
- Connected car
- Predictive maintenance
- Asset tracking
- Route optimization

Insurance
- Claim fraud detection
- Agent fraud detection
- Risk-based policy pricing
- Agency performance
- Usage-based insurance

Public Sector
- Crime detection and prevention
- Cyber security
- Traffic management
- Connected city

IT
- Cyber security
- Replication validation
- API usage monitoring
- SLA monitoring

Challenge: Supporting Time-Sensitive Decisions, Processes, Insights
Lack of Data for Real-Time Events
[Diagram: "Data is New" on one side, "Data is Old" on the other, with the integration options in between]
- Integration options: Messaging, ETL & Batch, Legacy Replication & CDC
- Other diagram labels: Customers, Websites, Files, Orders / Sales, Applications & Databases, Inventory, Network, Security, Devices/Mobile/IoT, Messaging, Cloud, Data Warehouse, Big Data, Process Automation

How Striim Transforms Your Organization
- In-Memory Streaming Integration & Intelligence
- End-to-End Solution
- 24/7 Enterprise-Grade Platform
- Safety Before the Data Lands
- Automate Processes, Deliver Insight, Make Timely Decisions
[Diagram: "Data is New" feeding Time-Sensitive Events, contrasted with "Data is Old"]
- Diagram labels: Websites, Customers, Cloud, Orders, Files, Applications & Databases, Inventory, Messaging, Network, Security, Data Warehouse, Devices/Mobile/IoT, Big Data

Striim: Fast Data, In-Flight Intelligence, Fast Decisions
[Diagram: streaming events flow through streaming integration and streaming intelligence to streaming data delivery and to real-time insights and action]
- Streaming Events (Data is New): Customers, Orders / Sales, Inventory, Security, Databases (CDC), Log files, Sensors, IoT, Edge; Any Messaging, Built-In Kafka
- Streaming Integration: Filtering, Transformation, Aggregation, Enrichment, External Context
- Streaming Intelligence: Pattern Matching, Complex Event Rules, Multi-Stream Correlation, Anomaly Detection
- Real-Time Insights & Action: Alerts, Triggers, Machine Learning/AI Models, Real-Time Visualization, Time-Sensitive Decisions & Processes
- Streaming Data Delivery (Fast Deliver to Anywhere): Files, Big Data & NoSQL, Messaging, Data Warehouse, Cloud, Process Automation

Visualization & Drilldowns Through Streaming Dashboards

Real-Time Threat/Fraud Security Monitoring: Largest Credit Card Company
Correlate logs from multiple security products to identify cross-domain issues or exploits that are not obvious from a single security product.
- Source logs in real time from multiple security products
- Ingest and analyze security log and session data, capturing all events from 50+ siloed security solutions
- Look for patterns of activity across logs that indicate exploits or anomalies
- Provide a real-time monitoring dashboard and immediate alerts
[Dashboard label: blacklist locations]
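The cross-product correlation described on this slide can be sketched in a few lines of Python. This is only an illustration of the idea, not Striim's implementation: the event tuple layout, the `correlate` function, the 60-second window, and the sample product names are all assumptions made for the example.

```python
from collections import defaultdict, deque

def correlate(events, window_seconds=60, min_products=2):
    """Yield (timestamp, source_ip, products) whenever one source IP has been
    reported by at least `min_products` distinct products inside the window.

    `events` is assumed to be an iterable of (timestamp, product, source_ip)
    tuples, already ordered by timestamp (hypothetical layout)."""
    recent = defaultdict(deque)  # source_ip -> deque of (timestamp, product)
    for ts, product, source_ip in events:
        window = recent[source_ip]
        window.append((ts, product))
        # Evict entries that have fallen out of the time window.
        while window and ts - window[0][0] > window_seconds:
            window.popleft()
        products = {p for _, p in window}
        if len(products) >= min_products:
            yield ts, source_ip, sorted(products)

# Hypothetical log events from three security products.
events = [
    (0, "firewall", "10.0.0.5"),
    (10, "ids", "10.0.0.9"),
    (30, "proxy", "10.0.0.5"),
    (200, "ids", "10.0.0.5"),
]
alerts = list(correlate(events))
print(alerts)
```

Only the second event for 10.0.0.5 raises an alert here, because the firewall and proxy reports fall inside the same window; the later report arrives after the earlier ones have expired.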

End-to-End: In-Memory Integration, Intelligence, Kafka - Hybrid

Sources
- Parsers: Delimited, JSON, XML, Free Text, Binary, Name/Value, Zipped, AVRO, OGG Trail, Apache Log, Sys Log, MS Event Log, Mail Log, SNMP, CollectD, CEF, DHCP Log, WCF, +Others
- Databases: JDBC/SQL, Oracle CDC, MS/SQL CDC, MySQL CDC, HPE NSK, Salesforce
- Files: Log Files, System Files, Batch Files
- Network: TCP, UDP, HTTP, MQTT, Netflow, PCAP
- Messaging: Kafka, Flume, JMS, AMQP
- Big Data: HDFS, HBase, Hive
- RESTful API

Platform
- Continuous Data Collection, Stream Processing, Streaming Analytics, Continuous Results Delivery
- Runs on: Virtual Machines, Operating Systems, IoT Gateways, Cloud

Targets
- Databases: JDBC/SQL, Oracle, MS/SQL, MySQL, Teradata
- Files
- Network: MQTT
- Messaging: Kafka, JMS, AMQP
- Big Data: HDFS, HBase, Hive, Hazelcast
- Cloud: Azure Blob, Azure SQL DB, Amazon S3, Amazon Redshift, Google BigQuery
- Alerting: Email, SMS
- Formats: Delimited, JSON, XML, Template, AVRO

Striim Next Generation Architecture
Components: Sources, Streams, Windows, Queries, UDFs, Caches, Targets

Sources & Parsers
- RDBMS: Generic JDBC/SQL, Oracle CDC, MS/SQL CDC, HPE NonStop CDC
- Files: CSV/TSV, JSON, XML, Apache Avro, Free-form
- Network: TCP, UDP, HTTP
- Message Queues: Kafka/Flume, JMS
- BigData: HDFS, Hive

Applications
- Business-Level Logic With (extended) SQL

Platform
- Continuous Query Processor
- Distributed In-Memory Cache
- Distributed In-Memory Store
- Kafka Streams (optional)
- Distributed Indexed Store (ES)
- Scalable IMC Cluster: Node 1, Node 2, Node 3, ... Node n
- External Context
- Real-time Dashboards

Targets & Formatters
- Alerting: Email, SMS
- Message Queues: JMS, Kafka
- Cloud: Google BigQuery, MS Azure SQL, AWS Redshift
- DB Persistence: JDBC/SQL, Oracle, MS/SQL, Teradata, HPE NonStop
- File Persistence: CSV/TSV, JSON, XML
- BigData: HDFS, HBase, Hive
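To make the roles of streams, windows, queries, and caches more concrete, here is a minimal Python sketch of a continuous query that keeps a per-key sliding window and enriches each event from a cache. It is a generic illustration under assumed names (`windowed_totals`, `customer_id`, `amount`, and the sample cache contents are hypothetical) and does not use Striim's SQL dialect or APIs.

```python
from collections import deque

def windowed_totals(stream, cache, window_size=3):
    """Continuous-query sketch: for every incoming event, emit the event
    enriched from `cache` plus a sum over the last `window_size` amounts
    seen for the same customer."""
    windows = {}  # customer_id -> sliding window of recent amounts
    for event in stream:
        window = windows.setdefault(event["customer_id"], deque(maxlen=window_size))
        window.append(event["amount"])  # the oldest amount drops out automatically
        yield {
            **event,
            "customer_name": cache.get(event["customer_id"], "unknown"),
            "window_total": sum(window),
        }

# Hypothetical reference data (the "cache") and input stream.
customers = {1: "Acme"}
stream = [
    {"customer_id": 1, "amount": 10},
    {"customer_id": 1, "amount": 20},
    {"customer_id": 2, "amount": 5},
    {"customer_id": 1, "amount": 30},
    {"customer_id": 1, "amount": 40},
]
results = list(windowed_totals(stream, customers))
for row in results:
    print(row)
```

Because the function is a generator, each result is produced as soon as its input event arrives, which mirrors the event-at-a-time behavior of a continuous query.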

Kafka Enhancements: Integration, Performance, Support
Performance, Scalability, Security, Easier Manageability
- Kafka built in: for persisting data and for performance
- Continuous collection: collect data and deliver it to Kafka
- Intelligence: on streaming Kafka data
- Visualization and drill-downs: on streaming data
- SQL queries against Kafka, without coding
- Performance: each writer dynamically processes 24 parallel threads
- Enterprise grade: scalability, reliability, security
- Exactly-Once Processing (E1P) from sources to targets
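The slide does not say how exactly-once processing is achieved. One common general pattern, shown below purely as an assumption-laden sketch, is to commit each delivered event and a checkpoint of the last applied sequence number in the same target transaction, so that events replayed after a failure are skipped. The table names, the `deliver` function, and the use of SQLite as the target are all hypothetical.

```python
import sqlite3

def open_target():
    """Create an in-memory target with a data table and a one-row checkpoint."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE events (seq INTEGER PRIMARY KEY, payload TEXT)")
    db.execute("CREATE TABLE checkpoint (id INTEGER PRIMARY KEY, last_seq INTEGER)")
    db.execute("INSERT INTO checkpoint VALUES (1, -1)")
    db.commit()
    return db

def deliver(db, batch):
    """Apply (seq, payload) pairs, skipping any already covered by the checkpoint.
    Returns the number of events actually written."""
    (last,) = db.execute("SELECT last_seq FROM checkpoint WHERE id = 1").fetchone()
    applied = 0
    with db:  # one transaction: rows and checkpoint commit together or not at all
        for seq, payload in batch:
            if seq <= last:
                continue  # replayed event that was already delivered
            db.execute("INSERT INTO events VALUES (?, ?)", (seq, payload))
            db.execute("UPDATE checkpoint SET last_seq = ? WHERE id = 1", (seq,))
            last = seq
            applied += 1
    return applied

db = open_target()
first = deliver(db, [(0, "a"), (1, "b")])
replay = deliver(db, [(1, "b"), (2, "c")])  # seq 1 is redelivered after a restart
rows = db.execute("SELECT seq, payload FROM events ORDER BY seq").fetchall()
print(first, replay, rows)
```

The replayed event with sequence 1 is ignored, so the target ends up with each event exactly once even though the source delivered one of them twice.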

Built-In CDC: Change Data Capture / Replication, Database to Database
Striim leveraged its GoldenGate development expertise.
- CDC sources: Oracle, MS SQL Server, HPE NonStop, MySQL
- On-premises database to on- or off-premises databases
- Read the current base tables to perform the initial load to the target data tables
- Start change data capture on the source database to read transaction logs
- Striim transforms change records to DML operations and applies the changes to DBMS destinations
- Add Kafka persistent streams for mission-critical E1P reliability and fault-tolerant replication
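The step of turning change records into DML operations can be illustrated with a small Python sketch. The change-record layout used here (`table`, `op`, `key`, `after`) and the `to_dml` helper are assumptions invented for the example; real CDC records and Striim's own format will differ. SQLite stands in for the target database.

```python
import sqlite3

def to_dml(change):
    """Translate one change record into (sql, params).
    Table and column names are assumed to come from a trusted source,
    since they are interpolated into the statement text."""
    table, op = change["table"], change["op"]
    if op == "INSERT":
        cols = list(change["after"])
        placeholders = ", ".join("?" for _ in cols)
        sql = f"INSERT INTO {table} ({', '.join(cols)}) VALUES ({placeholders})"
        return sql, [change["after"][c] for c in cols]
    keys = list(change["key"])
    where = " AND ".join(f"{k} = ?" for k in keys)
    key_values = [change["key"][k] for k in keys]
    if op == "UPDATE":
        cols = list(change["after"])
        set_clause = ", ".join(f"{c} = ?" for c in cols)
        sql = f"UPDATE {table} SET {set_clause} WHERE {where}"
        return sql, [change["after"][c] for c in cols] + key_values
    if op == "DELETE":
        return f"DELETE FROM {table} WHERE {where}", key_values
    raise ValueError(f"unsupported operation: {op}")

# Hypothetical target table and change stream.
target = sqlite3.connect(":memory:")
target.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT)")
changes = [
    {"table": "orders", "op": "INSERT", "after": {"id": 1, "status": "new"}},
    {"table": "orders", "op": "INSERT", "after": {"id": 2, "status": "new"}},
    {"table": "orders", "op": "UPDATE", "key": {"id": 1}, "after": {"status": "shipped"}},
    {"table": "orders", "op": "DELETE", "key": {"id": 2}},
]
for change in changes:
    sql, params = to_dml(change)
    target.execute(sql, params)
target.commit()
rows = target.execute("SELECT id, status FROM orders").fetchall()
print(rows)
```

Applying the changes in log order matters: the update and delete only make sense after the inserts they refer to.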

IoT Projects Exploit Striim's Platform
[Diagram labels: Edge, Hub, Cloud; Protocol Translation; Exported ML Model; Machine Learning; Continuous Real-Time Processing; Intelligence at the Edge; Visualization & Automation]

Machine Learning and AI: Operationalize with Seamless Integration
Collect data to train a model, then use the model in Striim to perform real-time anomaly detection or predictions.
- Source data from databases, files, Kafka, etc.
- Process and prepare the data and write it to disk to train the machine learning model
- Export the trained model and use it in streaming analytics for real-time scoring
- Present results on a dashboard and alert on anomalies
[Diagram labels: Streaming Analytics, Process & Prepare Data, CQ, Real-Time Scoring, Server, Real-Time Dashboard, Training Files, Machine Learning, Exported Model, Wrapper Function]
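The train, export, and score loop on this slide can be sketched as follows. The "model" here is deliberately trivial (a mean and standard deviation, scored with a z-score) and is an assumption chosen to keep the example self-contained; the function names, the JSON export format, and the threshold are hypothetical and not taken from Striim.

```python
import json
import statistics

def train(values):
    """Offline step: fit a trivial model from prepared training data."""
    return {"mean": statistics.mean(values), "stdev": statistics.pstdev(values)}

def make_scorer(exported, threshold=3.0):
    """Wrapper function: load the exported model once and return a scorer
    that can be called for each streaming event.
    Assumes the training data was not constant (stdev > 0)."""
    model = json.loads(exported)
    def score(value):
        z = abs(value - model["mean"]) / model["stdev"]
        return {"value": value, "z": z, "anomaly": z > threshold}
    return score

# Hypothetical training data written out by the preparation step.
training = [100, 102, 98, 101, 99, 100, 103, 97]
exported = json.dumps(train(training))  # the "exported model"

score = make_scorer(exported)
results = [score(v) for v in [101, 99, 140]]  # events arriving on the stream
flags = [r["anomaly"] for r in results]
print(flags)
```

Separating `train` from `make_scorer` mirrors the slide: training happens offline on files, while the streaming side only loads the exported artifact and scores events as they pass through.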

Application Development Environment: Robust & Flexible
- End-to-end integrated application development
- Visual dashboard designer
- Visual application designer (Flow Designer)
- Application templates
- SQL-like programming interface
- Source data preview
- Live / ad-hoc query & parameterized query interfaces
- Predictive analytics
[Diagram labels: Applications; Visual designer; Business-Level Logic With Tungsten QL (extended SQL)]

Built-In Enterprise Operations: Clustering, HA, Fault Tolerant

Clustering: multiple-node use cases
- Scale out
  - Memory on one node not enough for holding windows/caches
  - Processing power on one node not enough to process the number of incoming events
  - Consistent hashing and distribution of data streams, caches, and results caches
  - Co-location of data and processing
  - Dynamic resizing
- Availability
  - Guarding against node outages
  - Guarding against errors on nodes
  - Dynamic resizing

Architecture
- Hybrid / public / private cloud
- Striim agents
- Mesh network
- Democratic

[Diagram: Events Partitioned Over Cluster; Collection Agents; Processing Cluster]
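Consistent hashing is what allows a cluster to be resized while moving only a fraction of the partitioned data. The Python sketch below shows the general technique with a hash ring and virtual nodes; the `HashRing` class, node names, and the choice of SHA-256 are assumptions for illustration and say nothing about how Striim implements it.

```python
import bisect
import hashlib

class HashRing:
    """Minimal consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes, vnodes=100):
        self._ring = []        # sorted list of (hash, node) points on the ring
        self._vnodes = vnodes  # points per node, to even out the distribution
        for node in nodes:
            self.add(node)

    @staticmethod
    def _hash(key):
        return int.from_bytes(hashlib.sha256(key.encode()).digest()[:8], "big")

    def add(self, node):
        for i in range(self._vnodes):
            bisect.insort(self._ring, (self._hash(f"{node}#{i}"), node))

    def remove(self, node):
        self._ring = [entry for entry in self._ring if entry[1] != node]

    def node_for(self, key):
        # First ring point clockwise from the key's hash, wrapping at the end.
        idx = bisect.bisect(self._ring, (self._hash(key), ""))
        return self._ring[idx % len(self._ring)][1]

ring = HashRing(["node1", "node2", "node3"])
keys = [f"event-{i}" for i in range(1000)]
before = {k: ring.node_for(k) for k in keys}

ring.add("node4")  # dynamic resizing: grow the cluster by one node
after = {k: ring.node_for(k) for k in keys}
moved = [k for k in keys if before[k] != after[k]]
print(f"{len(moved)} of {len(keys)} keys moved, all to node4:",
      all(after[k] == "node4" for k in moved))
```

Every key that changes owner moves to the new node, and the rest stay where they were, which is why windows and caches on the existing nodes are mostly undisturbed when the cluster grows.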

Built-In Enterprise Operations: Scalability
- Cluster scales horizontally on commodity hardware
- Lightweight agents for edge collection of data
- Consistent data partitioning
- Scalable indexed results store
[Diagram: Events Partitioned Over Cluster; Collection Agents; Processing Cluster]
[Chart: Event Rate Per Servers; rate in M events/s]

THANK YOU