Hortonworks DataFlow Sam Lachterman Solutions Engineer

Size: px
Start display at page:

Download "Hortonworks DataFlow Sam Lachterman Solutions Engineer"

Transcription

1 Hortonworks DataFlow Sam Lachterman Solutions Engineer 1 Hortonworks Inc All Rights Reserved

2 Disclaimer This document may contain product features and technology directions that are under development, may be under development in the future or may ultimately not be developed. Project capabilities are based on information that is publicly available within the Apache Software Foundation project websites ("Apache"). Progress of the project capabilities can be tracked from inception to release through Apache, however, technical feasibility, market demand, user feedback and the overarching Apache Software Foundation community development process can all effect timing and final delivery. This document s description of these features and technology directions does not represent a contractual commitment, promise or obligation from Hortonworks to deliver these features in any generally available product. Product features and technology directions are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind. Since this document contains an outline of general product development plans, customers should not rely upon it when making purchasing decisions. 2 Hortonworks Inc All Rights Reserved

3 ONLY Hortonworks Company Profile Apache 100 open source TM % Hadoop data platform Founded in ST provider to go public HADOOP IPO 4Q14 (NASDAQ: HDP) 1,000+ subscription customers technology partners ~1100 employees across 2, countries 3 Hortonworks Inc All Rights Reserved

4 Accelerating Momentum Multi-Platform Expansion: HDP & HDF International Adoption Company Founded 2011 IPO Dec 2014 Fastest company to $100mm revenue 800+ customers and growing 4 Hortonworks Inc All Rights Reserved

5 Hortonworks Data Flow 5 Hortonworks Inc All Rights Reserved

6 Connected Data Platforms 6 Hortonworks Inc All Rights Reserved

7 Hortonworks DataFlow for Data in Motion Powered by Apache NiFi, Kafka, and Storm Real-time Integrated Secure Streaming 7 Hortonworks Inc All Rights Reserved

8 Connecting Data Between Ecosystems Without Coding: 180+ Processors FTP SFTP HL7 UDP XML Hash Merge Extract Duplicate Encrypt Tail Evaluate Execute GeoEnrich Scan Replace Translate Split Fetch Convert HTTP WebSocket HTML Image Syslog AMQP All Apache project logos are trademarks of the ASF and the respective projects. 8 Hortonworks Inc All Rights Reserved Route Text Route Content Route Context Control Rate Distribute Load Generate Table Fetch Jolt Transform JSON Prioritized Delivery

9 NiFi Integration of Many Things Like when the iphone was new, people compare NiFI to something they know, to create a frame of reference based on their experience iphone: a little of everything, integrated together People compare NiFi to a messaging bus (Kafka), ETL, scheduler (Oozie), CEP - Complex Event Processing (Spark/Storm), Visual UI flow designer, Forwarder (Splunk), while NiFI is an integration of many things and complements many different systems. Event Processing Messaging Bus ETL It is a telephone? Is it an client? Is it video camera? Is it digital camera? Is it a music player? Is it a gameboy? Scheduler GUI Forwarder 9 Hortonworks Inc All Rights Reserved

10 Use Cases 10 Hortonworks Inc All Rights Reserved

11 HDF: Visual Data Flow Management between Datacenter and Cloud H O R T O N W O R K S D ATA F L O W Dynamically adjust the pipeline Add and adjust data sources Visually trace the data path Add and Adjust Data Sources to maximize the opportunity that you capture from perishable insights Visually Trace the Data Path to manage the what, who, where and how around data in motion Dynamically Adjust the Pipeline to match the dataflow with your bandwidth Deliver Continuous Insights from real-time stream processing 11 Hortonworks Inc All Rights Reserved

12 Architectural Transformation Edge Analytics Machine Learning Edge Data Data in Motion (Cloud) Data at Rest (Cloud) Cloud Data in Motion Edge Data Closed Loop Analytics On-premises Data at Rest (on-premises) Data in Motion (on-premises) Data at Rest (on-premises) Deep Historical Analysis 12 Hortonworks Inc All Rights Reserved

13 Attunity Replicate-HDF Architecture In-memory optimized metadata management and data transport Data Streaming Transaction logs CDC MSG MSG MSG n 2 1 Message broker Bulk Load Data Streaming MSG MSG MSG n 2 1 Message broker 2016 Attunity

14 Optimize Log Analytics Platforms with Content Based Routing Edge analytics for costeffective and efficient movement of machine data Intelligent, content based routing, transformation and enrichment Send data to alternative systems based on value, content, priority Transformation Routing Enrichment 14 Hortonworks Inc All Rights Reserved

15 HDF Technical Overview 15 Hortonworks Inc All Rights Reserved

16 Apache NiFi What is data logistics? Guaranteed delivery Data buffering Backpressure Pressure release Prioritized queuing Flow-specific QoS Latency vs. throughput Loss tolerance Data provenance Recovery / recording a rolling log of fine-grained history Designed for extension Visual command and control Flow templates Policy-based security Clustering 16 Hortonworks Inc All Rights Reserved

17 Key Components of Streaming Analytics in Enterprise Environments Easy, Secure, Reliable Way to Get the Data You Need Because there s no data science without the data Flow Management Stream Processing Immediate and Continuous Insights Because acting on perishable insights in real time maximizes value Enterprise Services Provisioning, Management, Monitoring, Security, Audit, Compliance, Governance Because it all has to work together in an enterprise environment 17 Hortonworks Inc All Rights Reserved

18 Data in Motion Needs DataFlow Management and Stream Processing 1. Acquire truck telemetry events from different Geo Cloud Instances and from the edge (telemetry devices on certain trucks) 2. Move and Filter Data from Customer X Cloud Instances to on-premise instance 3. Intelligent Routing of data. The routing and filtering rules will be often changed at run-time. 4. Deliver the data data to various downstream systems. Flow Management 5. Parse the device data to standardized format that downstream sysem can understand 6. Enrich the data with contextual information HR, Payroll, Truck Maintenance records, weather 7. Join multiple sensor streams via attributes in real-time 1. E.g: Join separate streams like geo-location stream of a truck with the driver speed stream. 8. Detect complex patterns in real-time: 1. E.g: Over a 3 minute period, if a driver has multiple infractions across 2 different infraction types, then alert (Breaking, Speeding) 2. E.g: Over a 5 minute period, if the average speed of a driver is > 80 in known dangerous routes, then alert. Stream Processing 18 Hortonworks Inc All Rights Reserved

19 HDF Components HDF provides the flow management, stream processing, and enterprise services needed to collect, curate, analyze and act on data-in-motion across the data center and cloud. Flow Management Stream Processing Java Agent Data acquisition and delivery Simple transformation and data routing Simple event processing End to end provenance Edge intelligence & bi-directional comms Scalable data broker for streaming apps Scale out Complex transformation Complex Streaming analytics Continuous Insights SAM UI Enterprise Services Provisioning, Management, Monitoring, Security, Audit, Compliance, Governance, Multi-tenancy) SmartSense 19 Hortonworks Inc All Rights Reserved

20 Logical Architecture (Zero-Master Clustering) 20 Hortonworks Inc All Rights Reserved

21 Use cases for Schema Registry Flow Management Schema Registry usage in Flow Management Generic/flexible format conversion Generic routing, SQL-based routing for any structured data Reusable schema, avoid schema overhead Data quality validation based on schema Stream Processing Schema Registry usage in Stream Processing Schema evolution and version compatibility Consumers and producers can evolve at different rates 21 Hortonworks Inc All Rights Reserved

22 What is Schema Registry? What Value Does it Provide? What is Schema Registry? A shared repository of schemas that allows applications to flexibly interact with each other - in order to save or retrieve schemas for the data they need to access What Value does Schema Registry Provide? 1. Data Governance Provide reusable schema (centralized registry) Define relationship between schemas (version management) Enable generic format conversion, and generic routing (schema validation) 2. Operational Efficiency To avoid attaching schema to every piece of data (centralized registry) Consumers and producers can evolve at different rates (version management) Data quality (schema validation) Example Use Register Schemas for Kafka Topics to be used by consumers of Kafka Topic (e.g: Streaming Analytics Manager) 22 Hortonworks Inc All Rights Reserved

23 Truck CSV data NiFi TailFile Attribute: Content: CSV bytes schema fingerprint SplitText Attribute: Content: CSV bytes schema fingerprint ExtractText ConvertFormat Attribute: Schema Version Schema name Schema Content: CSV bytes FetchSchema FromRegistry Attribute: Schema Version Schema name Content: CSV bytes Input CS Output CS CSV xml Avro JSON CSV xml Avro JSON 23 Hortonworks Inc All Rights Reserved

24 Truck CSV data NiFi TailFile Attribute: Without schema Attribute: No generic format conversion: only Content: SplitText JSON to Avro, CSV to Avro, etc Content: 1 to 1 mapping CSV bytes in conversion: 100-column CSV to 100-field CSV bytes JSON schema fingerprint schema fingerprint ExtractText ConvertFormat CovertFormat Input CS Output CS Attribute: Schema Version Schema name Schema Content: CSV bytes FetchSchema FromRegistry Attribute: Schema Version Schema name Content: CSV bytes CSV xml Avro JSON CSV xml Avro JSON With schema Generic format conversion: single processor + plugin controller services Flexible conversion: 100-column CSV to 5-field JSON 24 Hortonworks Inc All Rights Reserved

25 Streaming Analytics Manager What is it? A tool used to design, develop, deploy and manage streaming analytics app using a drag drop visualize paradigm Streaming Analytics Manager will allow you to build streaming analytics applications that do event correlation, context enrichment, complex pattern matching, analytical aggregations and create alerts/notifications when insights are discovered. The tool is agnostic to the underlying streaming engine and will support multiple streaming substrates (e.g: Storm, Spark Streaming, Flink) Extensibility is a first class citizen (add custom sinks, processors, spouts, etc..) Guiding Principle Write Complex Streaming Applications With No Code 25 Hortonworks Inc All Rights Reserved

26 Streaming Analytics Manager Key Capabilities Building Stream Apps using the following primitives Connecting to Streams Joining Streams Forking Streams Aggregations over Windows Stream Analytics Descriptive, Predictive, Prescriptive Rules Engine Transformations Filtering and Routing Notifications / Alerts Deploying Stream Apps Deploying the Stream App on a a supported streaming engine Monitoring the Stream App with app specific metrics. 26 Hortonworks Inc All Rights Reserved

27 Truck Driver Violation Pattern - Streaming Analytics Design 27 Hortonworks Inc All Rights Reserved

28 Demo 28 Hortonworks Inc All Rights Reserved

29 DATAWORKS SUMMIT/HADOOP SUMMIT JUNE 13 15, 2017 San Jose McHenry Convention Center REGISTER NOW AND SAVE $1,000 REGISTER NOW! > dataworkssummit.com

Hortonworks and The Internet of Things

Hortonworks and The Internet of Things Hortonworks and The Internet of Things Dr. Bernhard Walter Solutions Engineer About Hortonworks Customer Momentum ~700 customers (as of November 4, 2015) 152 customers added in Q3 2015 Publicly traded

More information

Hortonworks DataFlow. Accelerating Big Data Collection and DataFlow Management. A Hortonworks White Paper DECEMBER Hortonworks DataFlow

Hortonworks DataFlow. Accelerating Big Data Collection and DataFlow Management. A Hortonworks White Paper DECEMBER Hortonworks DataFlow Hortonworks DataFlow Accelerating Big Data Collection and DataFlow Management A Hortonworks White Paper DECEMBER 2015 Hortonworks DataFlow 2015 Hortonworks www.hortonworks.com 2 Contents What is Hortonworks

More information

Schema Registry Overview

Schema Registry Overview 3 Date of Publish: 2018-11-15 https://docs.hortonworks.com/ Contents...3 Examples of Interacting with Schema Registry...4 Schema Registry Use Cases...6 Use Case 1: Registering and Querying a Schema for

More information

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time

More information

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Raanan Dagan and Rohit Pujari September 25, 2017 Washington, DC Forward-Looking Statements During the course of this presentation, we may

More information

IBM Data Replication for Big Data

IBM Data Replication for Big Data IBM Data Replication for Big Data Highlights Stream changes in realtime in Hadoop or Kafka data lakes or hubs Provide agility to data in data warehouses and data lakes Achieve minimum impact on source

More information

Flash Storage Complementing a Data Lake for Real-Time Insight

Flash Storage Complementing a Data Lake for Real-Time Insight Flash Storage Complementing a Data Lake for Real-Time Insight Dr. Sanhita Sarkar Global Director, Analytics Software Development August 7, 2018 Agenda 1 2 3 4 5 Delivering insight along the entire spectrum

More information

DATA INTEGRATION PLATFORM CLOUD. Experience Powerful Data Integration in the Cloud

DATA INTEGRATION PLATFORM CLOUD. Experience Powerful Data Integration in the Cloud DATA INTEGRATION PLATFORM CLOUD Experience Powerful Integration in the Want a unified, powerful, data-driven solution for all your data integration needs? Oracle Integration simplifies your data integration

More information

Hortonworks DataFlow

Hortonworks DataFlow Getting Started with Streaming Analytics () docs.hortonworks.com : Getting Started with Streaming Analytics Copyright 2012-2018 Hortonworks, Inc. Some rights reserved. Except where otherwise noted, this

More information

Information empowerment for your evolving data ecosystem

Information empowerment for your evolving data ecosystem Information empowerment for your evolving data ecosystem Highlights Enables better results for critical projects and key analytics initiatives Ensures the information is trusted, consistent and governed

More information

Lenses 2.1 Enterprise Features PRODUCT DATA SHEET

Lenses 2.1 Enterprise Features PRODUCT DATA SHEET Lenses 2.1 Enterprise Features PRODUCT DATA SHEET 1 OVERVIEW DataOps is the art of progressing from data to value in seconds. For us, its all about making data operations as easy and fast as using the

More information

Oracle Big Data Connectors

Oracle Big Data Connectors Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process

More information

Fluentd + MongoDB + Spark = Awesome Sauce

Fluentd + MongoDB + Spark = Awesome Sauce Fluentd + MongoDB + Spark = Awesome Sauce Nishant Sahay, Sr. Architect, Wipro Limited Bhavani Ananth, Tech Manager, Wipro Limited Your company logo here Wipro Open Source Practice: Vision & Mission Vision

More information

Oracle Big Data SQL. Release 3.2. Rich SQL Processing on All Data

Oracle Big Data SQL. Release 3.2. Rich SQL Processing on All Data Oracle Big Data SQL Release 3.2 The unprecedented explosion in data that can be made useful to enterprises from the Internet of Things, to the social streams of global customer bases has created a tremendous

More information

Oracle GoldenGate for Big Data

Oracle GoldenGate for Big Data Oracle GoldenGate for Big Data The Oracle GoldenGate for Big Data 12c product streams transactional data into big data systems in real time, without impacting the performance of source systems. It streamlines

More information

FAQs. Business (CIP 2.2) AWS Market Place Troubleshooting and FAQ Guide

FAQs. Business (CIP 2.2) AWS Market Place Troubleshooting and FAQ Guide FAQs 1. What is the browser compatibility for logging into the TCS Connected Intelligence Data Lake for Business Portal? Please check whether you are using Mozilla Firefox 18 or above and Google Chrome

More information

Overview SENTINET 3.1

Overview SENTINET 3.1 Overview SENTINET 3.1 Overview 1 Contents Introduction... 2 Customer Benefits... 3 Development and Test... 3 Production and Operations... 4 Architecture... 5 Technology Stack... 7 Features Summary... 7

More information

Building a Data Strategy for a Digital World

Building a Data Strategy for a Digital World Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service

More information

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development:: Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized

More information

Gain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.

Gain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved. Gain Insights From Unstructured Data Using Pivotal HD 1 Traditional Enterprise Analytics Process 2 The Fundamental Paradigm Shift Internet age and exploding data growth Enterprises leverage new data sources

More information

Cisco Tetration Analytics

Cisco Tetration Analytics Cisco Tetration Analytics Enhanced security and operations with real time analytics John Joo Tetration Business Unit Cisco Systems Security Challenges in Modern Data Centers Securing applications has become

More information

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways

More information

Hortonworks DataFlow

Hortonworks DataFlow Hortonworks DataFlow Schema Registry User Guide (January 31, 2018) docs.hortonworks.com Hortonworks DataFlow: Schema Registry User Guide Copyright 2012-2018 Hortonworks, Inc. Some rights reserved. Except

More information

SoftNAS Cloud Platinum Edition

SoftNAS Cloud Platinum Edition Control Any Data. Any Cloud. Anywhere. SoftNAS Cloud Platinum Edition Cuts public storage costs up to 67% through auto tiering and native support for both block and object storage on AWS and Azure Accelerates

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : PH NO: 9963799240, 040-40025423

More information

Down the event-driven road: Experiences of integrating streaming into analytic data platforms

Down the event-driven road: Experiences of integrating streaming into analytic data platforms Down the event-driven road: Experiences of integrating streaming into analytic data platforms Dr. Dominik Benz, Head of Machine Learning Engineering, inovex GmbH Confluent Meetup Munich, 8.10.2018 Integrate

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

Alexander Klein. #SQLSatDenmark. ETL meets Azure

Alexander Klein. #SQLSatDenmark. ETL meets Azure Alexander Klein ETL meets Azure BIG Thanks to SQLSat Denmark sponsors Save the date for exiting upcoming events PASS Camp 2017 Main Camp 05.12. 07.12.2017 (04.12. Kick-Off abends) Lufthansa Training &

More information

MapR Enterprise Hadoop

MapR Enterprise Hadoop 2014 MapR Technologies 2014 MapR Technologies 1 MapR Enterprise Hadoop Top Ranked Cloud Leaders 500+ Customers 2014 MapR Technologies 2 Key MapR Advantage Partners Business Services APPLICATIONS & OS ANALYTICS

More information

Data Acquisition. The reference Big Data stack

Data Acquisition. The reference Big Data stack Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Data Acquisition Corso di Sistemi e Architetture per Big Data A.A. 2016/17 Valeria Cardellini The reference

More information

Getting personal with your customers and GDPR

Getting personal with your customers and GDPR Getting personal with your customers and GDPR A practical approach to a secure, governed 360 degree customer view Darren Brunt Presales Director UK&I, Talend Colm Moynihan Partner Presales Manager EMEA,

More information

Data Acquisition. The reference Big Data stack

Data Acquisition. The reference Big Data stack Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Data Acquisition Corso di Sistemi e Architetture per Big Data A.A. 2017/18 Valeria Cardellini The reference

More information

PSOACI Tetration Overview. Mike Herbert

PSOACI Tetration Overview. Mike Herbert Tetration Overview Mike Herbert Cisco Spark How Questions? Use Cisco Spark to communicate with the speaker after the session 1. Find this session in the Cisco Live Mobile App 2. Click Join the Discussion

More information

microsoft

microsoft 70-775.microsoft Number: 70-775 Passing Score: 800 Time Limit: 120 min Exam A QUESTION 1 Note: This question is part of a series of questions that present the same scenario. Each question in the series

More information

Stages of Data Processing

Stages of Data Processing Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,

More information

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been

More information

Cloudline Autonomous Driving Solutions. Accelerating insights through a new generation of Data and Analytics October, 2018

Cloudline Autonomous Driving Solutions. Accelerating insights through a new generation of Data and Analytics October, 2018 Cloudline Autonomous Driving Solutions Accelerating insights through a new generation of Data and Analytics October, 2018 HPE big data analytics solutions power the data-driven enterprise Secure, workload-optimized

More information

Configuring and Deploying Hadoop Cluster Deployment Templates

Configuring and Deploying Hadoop Cluster Deployment Templates Configuring and Deploying Hadoop Cluster Deployment Templates This chapter contains the following sections: Hadoop Cluster Profile Templates, on page 1 Creating a Hadoop Cluster Profile Template, on page

More information

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications

More information

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

MAPR DATA GOVERNANCE WITHOUT COMPROMISE MAPR TECHNOLOGIES, INC. WHITE PAPER JANUARY 2018 MAPR DATA GOVERNANCE TABLE OF CONTENTS EXECUTIVE SUMMARY 3 BACKGROUND 4 MAPR DATA GOVERNANCE 5 CONCLUSION 7 EXECUTIVE SUMMARY The MapR DataOps Governance

More information

Building LinkedIn s Real-time Data Pipeline. Jay Kreps

Building LinkedIn s Real-time Data Pipeline. Jay Kreps Building LinkedIn s Real-time Data Pipeline Jay Kreps What is a data pipeline? What data is there? Database data Activity data Page Views, Ad Impressions, etc Messaging JMS, AMQP, etc Application and

More information

Leverage the Oracle Data Integration Platform Inside Azure and Amazon Cloud

Leverage the Oracle Data Integration Platform Inside Azure and Amazon Cloud Leverage the Oracle Data Integration Platform Inside Azure and Amazon Cloud WHITE PAPER / AUGUST 8, 2018 DISCLAIMER The following is intended to outline our general product direction. It is intended for

More information

High Volume Messaging with IBM MessageSight for use in Mobile, Web and M2M solutions

High Volume Messaging with IBM MessageSight for use in Mobile, Web and M2M solutions High Volume Messaging with IBM MessageSight for use in Mobile, Web and M2M solutions Dave Locke IBM Software Group Trademark Statement IBM and the IBM logo are trademarks of International Business Machines

More information

Overview. SUSE OpenStack Cloud Monitoring

Overview. SUSE OpenStack Cloud Monitoring Overview SUSE OpenStack Cloud Monitoring Overview SUSE OpenStack Cloud Monitoring Publication Date: 08/04/2017 SUSE LLC 10 Canal Park Drive Suite 200 Cambridge MA 02141 USA https://www.suse.com/documentation

More information

How Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera,

How Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera, How Apache Hadoop Complements Existing BI Systems Dr. Amr Awadallah Founder, CTO Cloudera, Inc. Twitter: @awadallah, @cloudera 2 The Problems with Current Data Systems BI Reports + Interactive Apps RDBMS

More information

Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap. Pedro Alves

Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap. Pedro Alves Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap Pedro Alves Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our current intended product direction.

More information

Architectural challenges for building a low latency, scalable multi-tenant data warehouse

Architectural challenges for building a low latency, scalable multi-tenant data warehouse Architectural challenges for building a low latency, scalable multi-tenant data warehouse Mataprasad Agrawal Solutions Architect, Services CTO 2017 Persistent Systems Ltd. All rights reserved. Our analytics

More information

Technical Sheet NITRODB Time-Series Database

Technical Sheet NITRODB Time-Series Database Technical Sheet NITRODB Time-Series Database 10X Performance, 1/10th the Cost INTRODUCTION "#$#!%&''$!! NITRODB is an Apache Spark Based Time Series Database built to store and analyze 100s of terabytes

More information

Data Governance Overview

Data Governance Overview 3 Data Governance Overview Date of Publish: 2018-04-01 http://docs.hortonworks.com Contents Apache Atlas Overview...3 Apache Atlas features...3...4 Apache Atlas Overview Apache Atlas Overview Apache Atlas

More information

Ian Choy. Technology Solutions Professional

Ian Choy. Technology Solutions Professional Ian Choy Technology Solutions Professional XML KPIs SQL Server 2000 Management Studio Mirroring SQL Server 2005 Compression Policy-Based Mgmt Programmability SQL Server 2008 PowerPivot SharePoint Integration

More information

BIG DATA REVOLUTION IN JOBRAPIDO

BIG DATA REVOLUTION IN JOBRAPIDO BIG DATA REVOLUTION IN JOBRAPIDO Michele Pinto Big Data Technical Team Leader @ Jobrapido Big Data Tech 2016 Firenze - October 20, 2016 ABOUT ME NAME Michele Pinto LINKEDIN https://www.linkedin.com/in/pintomichele

More information

Cisco Tetration Analytics

Cisco Tetration Analytics Cisco Tetration Analytics Enhanced security and operations with real time analytics Christopher Say (CCIE RS SP) Consulting System Engineer csaychoh@cisco.com Challenges in operating a hybrid data center

More information

An Architecture for Intelligent Data Processing on IoT Edge Devices

An Architecture for Intelligent Data Processing on IoT Edge Devices 2017 UKSim-AMSS 19th International Conference on Modelling & Simulation An Architecture for Intelligent on IoT s Roger Young, Sheila Fallon, Paul Jacob Software Research Institute, Athlone Institute of

More information

Solace JMS Broker Delivers Highest Throughput for Persistent and Non-Persistent Delivery

Solace JMS Broker Delivers Highest Throughput for Persistent and Non-Persistent Delivery Solace JMS Broker Delivers Highest Throughput for Persistent and Non-Persistent Delivery Java Message Service (JMS) is a standardized messaging interface that has become a pervasive part of the IT landscape

More information

WHITEPAPER. MemSQL Enterprise Feature List

WHITEPAPER. MemSQL Enterprise Feature List WHITEPAPER MemSQL Enterprise Feature List 2017 MemSQL Enterprise Feature List DEPLOYMENT Provision and deploy MemSQL anywhere according to your desired cluster configuration. On-Premises: Maximize infrastructure

More information

Industrial IoT: Architecture Framework Use Cases. Artur Borycki Teradata Labs

Industrial IoT: Architecture Framework Use Cases. Artur Borycki Teradata Labs Industrial IoT: Architecture Framework Use Cases Artur Borycki Teradata Labs IoT represents more than just things : It must represent systems (and systems of systems) The Internet of Things: It s About

More information

Datameer for Data Preparation:

Datameer for Data Preparation: Datameer for Data Preparation: Explore, Profile, Blend, Cleanse, Enrich, Share, Operationalize DATAMEER FOR DATA PREPARATION: EXPLORE, PROFILE, BLEND, CLEANSE, ENRICH, SHARE, OPERATIONALIZE Datameer Datameer

More information

Streaming Integration and Intelligence For Automating Time Sensitive Events

Streaming Integration and Intelligence For Automating Time Sensitive Events Streaming Integration and Intelligence For Automating Time Sensitive Events Ted Fish Director Sales, Midwest ted@striim.com 312-330-4929 Striim Executive Summary Delivering Data for Time Sensitive Processes

More information

The Now Platform Reference Guide

The Now Platform Reference Guide The Now Platform Reference Guide A tour of key features and functionality START Introducing the Now Platform Digitize your business with intelligent apps The Now Platform is an application Platform-as-a-Service

More information

Building a Scalable Recommender System with Apache Spark, Apache Kafka and Elasticsearch

Building a Scalable Recommender System with Apache Spark, Apache Kafka and Elasticsearch Nick Pentreath Nov / 14 / 16 Building a Scalable Recommender System with Apache Spark, Apache Kafka and Elasticsearch About @MLnick Principal Engineer, IBM Apache Spark PMC Focused on machine learning

More information

Title DC Automation: It s a MARVEL!

Title DC Automation: It s a MARVEL! Title DC Automation: It s a MARVEL! Name Nikos D. Anagnostatos Position Network Consultant, Network Solutions Division Classification ISO 27001: Public Data Center Evolution 2 Space Hellas - All Rights

More information

Stanislav Harvan Internet of Things

Stanislav Harvan Internet of Things Stanislav Harvan v-sharva@microsoft.com Internet of Things IoT v číslach Gartner: V roku 2020 bude na Internet pripojených viac ako 25mld zariadení: 1,5mld smart TV 2,5mld pc 5mld smart phone 16mld dedicated

More information

HDInsight > Hadoop. October 12, 2017

HDInsight > Hadoop. October 12, 2017 HDInsight > Hadoop October 12, 2017 2 Introduction Mark Hudson >20 years mixing technology with data >10 years with CapTech Microsoft Certified IT Professional Business Intelligence Member of the Richmond

More information

Security and Performance advances with Oracle Big Data SQL

Security and Performance advances with Oracle Big Data SQL Security and Performance advances with Oracle Big Data SQL Jean-Pierre Dijcks Oracle Redwood Shores, CA, USA Key Words SQL, Oracle, Database, Analytics, Object Store, Files, Big Data, Big Data SQL, Hadoop,

More information

COMPUTE CLOUD SERVICE. Moving to SPARC in the Oracle Cloud

COMPUTE CLOUD SERVICE. Moving to SPARC in the Oracle Cloud COMPUTE CLOUD SERVICE Moving to A new way forward: SPARC in the. Does your IT organization support mission-critical applications that would benefit from the unsurpassed security performance of SPARC servers

More information

HDP Security Overview

HDP Security Overview 3 HDP Security Overview Date of Publish: 2018-07-15 http://docs.hortonworks.com Contents HDP Security Overview...3 Understanding Data Lake Security... 3 What's New in This Release: Knox... 5 What's New

More information

HDP Security Overview

HDP Security Overview 3 HDP Security Overview Date of Publish: 2018-07-15 http://docs.hortonworks.com Contents HDP Security Overview...3 Understanding Data Lake Security... 3 What's New in This Release: Knox... 5 What's New

More information

USERS CONFERENCE Copyright 2016 OSIsoft, LLC

USERS CONFERENCE Copyright 2016 OSIsoft, LLC Bridge IT and OT with a process data warehouse Presented by Matt Ziegler, OSIsoft Complexity Problem Complexity Drives the Need for Integrators Disparate assets or interacting one-by-one Monitoring Real-time

More information

Powerful Insights with Every Click. FixStream. Agentless Infrastructure Auto-Discovery for Modern IT Operations

Powerful Insights with Every Click. FixStream. Agentless Infrastructure Auto-Discovery for Modern IT Operations Powerful Insights with Every Click FixStream Agentless Infrastructure Auto-Discovery for Modern IT Operations The Challenge AIOps is a big shift from traditional ITOA platforms. ITOA was focused on data

More information

docs.hortonworks.com

docs.hortonworks.com docs.hortonworks.com : Getting Started Guide Copyright 2012, 2014 Hortonworks, Inc. Some rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform for storing,

More information

Event Streams using Apache Kafka

Event Streams using Apache Kafka Event Streams using Apache Kafka And how it relates to IBM MQ Andrew Schofield Chief Architect, Event Streams STSM, IBM Messaging, Hursley Park Event-driven systems deliver more engaging customer experiences

More information

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD Azure Data Factory VS. SSIS Reza Rad, Consultant, RADACAD 2 Please silence cell phones Explore Everything PASS Has to Offer FREE ONLINE WEBINAR EVENTS FREE 1-DAY LOCAL TRAINING EVENTS VOLUNTEERING OPPORTUNITIES

More information

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap Steve Tuck Senior Director, Product Strategy Todd Blackmon Senior Director, Sales Consulting David Gengenbach Sales Consultant

More information

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme CNA2080BU Deep Dive: How to Deploy and Operationalize Kubernetes Cornelia Davis, Pivotal Nathan Ness Technical Product Manager, CNABU @nvpnathan #VMworld #CNA2080BU Disclaimer This presentation may contain

More information

TIBCO Spotfire Statement of Direction. Spotfire Product Management

TIBCO Spotfire Statement of Direction. Spotfire Product Management TIBCO Spotfire Statement of Direction Spotfire Product Management CONFIDENTIALITY The following information is confidential information of TIBCO Software Inc. Use, duplication, transmission, or republication

More information

Optimized Data Integration for the MSO Market

Optimized Data Integration for the MSO Market Optimized Data Integration for the MSO Market Actions at the speed of data For Real-time Decisioning and Big Data Problems VelociData for FinTech and the Enterprise VelociData s technology has been providing

More information

Enable IoT Solutions using Azure

Enable IoT Solutions using Azure Internet Of Things A WHITE PAPER SERIES Enable IoT Solutions using Azure 1 2 TABLE OF CONTENTS EXECUTIVE SUMMARY INTERNET OF THINGS GATEWAY EVENT INGESTION EVENT PERSISTENCE EVENT ACTIONS 3 SYNTEL S IoT

More information

The IBM MobileFirst Platform

The IBM MobileFirst Platform The IBM MobileFirst Platform Curtis Miles IBM MobileFirst Solution Architect April 14, 2015 What is the IBM MobileFirst Platform? A modular set " of libraries, tools, and runtimes " that help you " easily

More information

Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager

Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager Please Note IBM s statements regarding its plans, directions, and intent are subject to change or

More information

Syncsort DMX-h. Simplifying Big Data Integration. Goals of the Modern Data Architecture SOLUTION SHEET

Syncsort DMX-h. Simplifying Big Data Integration. Goals of the Modern Data Architecture SOLUTION SHEET SOLUTION SHEET Syncsort DMX-h Simplifying Big Data Integration Goals of the Modern Data Architecture Data warehouses and mainframes are mainstays of traditional data architectures and still play a vital

More information

2013 AWS Worldwide Public Sector Summit Washington, D.C.

2013 AWS Worldwide Public Sector Summit Washington, D.C. 2013 AWS Worldwide Public Sector Summit Washington, D.C. EMR for Fun and for Profit Ben Butler Sr. Manager, Big Data butlerb@amazon.com @bensbutler Overview 1. What is big data? 2. What is AWS Elastic

More information

for Multi-Services Gateways

for Multi-Services Gateways KURA an OSGi-basedApplication Framework for Multi-Services Gateways Introduction & Technical Overview Pierre Pitiot Grenoble 19 février 2014 Multi-Service Gateway Approach ESF / Increasing Value / Minimizing

More information

MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS

MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS SUJEE MANIYAM FOUNDER / PRINCIPAL @ ELEPHANT SCALE www.elephantscale.com sujee@elephantscale.com HI, I M SUJEE MANIYAM Founder / Principal @ ElephantScale

More information

by Cisco Intercloud Fabric and the Cisco

by Cisco Intercloud Fabric and the Cisco Expand Your Data Search and Analysis Capability Across a Hybrid Cloud Solution Brief June 2015 Highlights Extend Your Data Center and Cloud Build a hybrid cloud from your IT resources and public and providerhosted

More information

A Single Source of Truth

A Single Source of Truth A Single Source of Truth is it the mythical creature of data management? In the world of data management, a single source of truth is a fully trusted data source the ultimate authority for the particular

More information

The SMACK Stack: Spark*, Mesos*, Akka, Cassandra*, Kafka* Elizabeth K. Dublin Apache Kafka Meetup, 30 August 2017.

The SMACK Stack: Spark*, Mesos*, Akka, Cassandra*, Kafka* Elizabeth K. Dublin Apache Kafka Meetup, 30 August 2017. Dublin Apache Kafka Meetup, 30 August 2017 The SMACK Stack: Spark*, Mesos*, Akka, Cassandra*, Kafka* Elizabeth K. Joseph @pleia2 * ASF projects 1 Elizabeth K. Joseph, Developer Advocate Developer Advocate

More information

Shine a Light on Dark Data with Vertica Flex Tables

Shine a Light on Dark Data with Vertica Flex Tables White Paper Analytics and Big Data Shine a Light on Dark Data with Vertica Flex Tables Hidden within the dark recesses of your enterprise lurks dark data, information that exists but is forgotten, unused,

More information

Modernizing Business Intelligence and Analytics

Modernizing Business Intelligence and Analytics Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from

More information

Certified Big Data and Hadoop Course Curriculum

Certified Big Data and Hadoop Course Curriculum Certified Big Data and Hadoop Course Curriculum The Certified Big Data and Hadoop course by DataFlair is a perfect blend of in-depth theoretical knowledge and strong practical skills via implementation

More information

Evolution of an Apache Spark Architecture for Processing Game Data

Evolution of an Apache Spark Architecture for Processing Game Data Evolution of an Apache Spark Architecture for Processing Game Data Nick Afshartous WB Analytics Platform May 17 th 2017 May 17 th, 2017 About Me nafshartous@wbgames.com WB Analytics Core Platform Lead

More information

Building Event Driven Architectures using OpenEdge CDC Richard Banville, Fellow, OpenEdge Development Dan Mitchell, Principal Sales Engineer

Building Event Driven Architectures using OpenEdge CDC Richard Banville, Fellow, OpenEdge Development Dan Mitchell, Principal Sales Engineer Building Event Driven Architectures using OpenEdge CDC Richard Banville, Fellow, OpenEdge Development Dan Mitchell, Principal Sales Engineer October 26, 2018 Agenda Change Data Capture (CDC) Overview Configuring

More information

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018 Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning

More information

iway iway Big Data Integrator New Features Bulletin and Release Notes Version DN

iway iway Big Data Integrator New Features Bulletin and Release Notes Version DN iway iway Big Data Integrator New Features Bulletin and Release Notes Version 1.5.0 DN3502232.1216 Active Technologies, EDA, EDA/SQL, FIDEL, FOCUS, Information Builders, the Information Builders logo,

More information

Installing an HDF cluster

Installing an HDF cluster 3 Installing an HDF cluster Date of Publish: 2018-08-13 http://docs.hortonworks.com Contents Installing Ambari...3 Installing Databases...3 Installing MySQL... 3 Configuring SAM and Schema Registry Metadata

More information

FUJITSU Software ServerView Cloud Monitoring Manager V1.0. Overview

FUJITSU Software ServerView Cloud Monitoring Manager V1.0. Overview FUJITSU Software ServerView Cloud Monitoring Manager V1.0 Overview J2UL-2073-01ENZ0(00) November 2015 Trademarks Copyright FUJITSU LIMITED 2015 LINUX is a registered trademark of Linus Torvalds. The OpenStack

More information

Solving the Really Big Tech Problems with IoT Data Security and Privacy

Solving the Really Big Tech Problems with IoT Data Security and Privacy Solving the Really Big Tech Problems with IoT Data Security and Privacy HPE Security Data Security March 16, 2017 IoT Everywhere - Promising New Value Manufacturing Energy / Utilities Banks / Financial

More information

IBM Db2 Warehouse on Cloud

IBM Db2 Warehouse on Cloud IBM Db2 Warehouse on Cloud February 01, 2018 Ben Hudson, Offering Manager Noah Kuttler, Product Marketing CALL LOGISTICS Data Warehouse Community Share. Solve. Do More. There are 2 options to listen to

More information

Creating a Recommender System. An Elasticsearch & Apache Spark approach

Creating a Recommender System. An Elasticsearch & Apache Spark approach Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused

More information

MarkLogic 8 Overview of Key Features COPYRIGHT 2014 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

MarkLogic 8 Overview of Key Features COPYRIGHT 2014 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. MarkLogic 8 Overview of Key Features Enterprise NoSQL Database Platform Flexible Data Model Store and manage JSON, XML, RDF, and Geospatial data with a documentcentric, schemaagnostic database Search and

More information