IBM Db2 Event Store Simplifying and Accelerating Storage and Analysis of Fast Data. IBM Db2 Event Store

Size: px
Start display at page:

Download "IBM Db2 Event Store Simplifying and Accelerating Storage and Analysis of Fast Data. IBM Db2 Event Store"

Transcription

1 IBM Db2 Event Store Simplifying and Accelerating Storage and Analysis of Fast Data IBM Db2 Event Store

2 Disclaimer The information contained in this presentation is provided for informational purposes only. While efforts were made to verify the completeness and accuracy of the information contained in this presentation, it is provided as is, without warranty of any kind, express or implied. In addition, this information is based on IBM s current product plans and strategy, which are subject to change by IBM without notice. IBM shall not be responsible for any damages arising out of the use of, or otherwise related to, this presentation or any other documentation. Nothing contained in this presentation is intended to, or shall have the effect of: Creating any warranty or representation from IBM (or its affiliates or its or their suppliers and/or licensors); or Altering the terms and conditions of the applicable license agreement governing the use of IBM software. Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here.

3 The Challenges of Fast Data Data is arriving faster than ever before Billions of events processed every day Evident cross industry and driven by IoT Must land data quickly, or throw it away Total data is large, and growing rapidly Storing all events implies large data sets Storage costs are significant, and must be managed Data is useless without fast insights Data value decays rapidly over time Insights must derived quickly, and used advanced analytics (ML) Data availability without duplication Data must be available to the entire organization without requiring replication or duplication Maintain data in open format for future-proofing

4 Existing Solution for Fast Data Lambda Architecture Batch Layer Query-friendly Storage Data Stream Queries Speed Layer Store data quickly

5 Existing Solution for Fast Data Lambda Architecture Batch Layer Query-friendly Storage Complex Two separate architectures to deploy/manage Data Stream Queries Speed Layer Store data quickly

6 Existing Solution for Fast Data Lambda Architecture Batch Layer Query-friendly Storage Complex Two separate architectures to deploy/manage Data Stream Queries Speed Layer Store data quickly Costly Two copies of the data

7 Existing Solution for Fast Data Lambda Architecture Batch Layer Query-friendly Storage Complex Two separate architectures to deploy/manage Data Stream Queries Complex Queries need to consult two data stores Speed Layer Store data quickly Costly Two copies of the data

8 Alternative Approach Kappa Architecture Data Stream Queries Speed Layer Store data quickly

9 Alternative Approach Kappa Architecture Better Only writing data once Data Stream Queries Speed Layer Store data quickly

10 Alternative Approach Kappa Architecture Better Only writing data once Data Stream Better Less infrastructure to manage Speed Layer Store data quickly Queries

11 Alternative Approach Kappa Architecture Data Stream Better Only writing data once Better Less infrastructure to manage Worse Queries are much less efficient Queries Speed Layer Store data quickly

12 Alternative Approach Modified Lambda Data Stream Ingest Layer Store data quickly Query Layer Query-friendly Storage Queries

13 Alternative Approach Modified Lambda Extract, Transform, Load ETL Data Stream Ingest Layer Store data quickly Query Layer Query-friendly Storage Queries There is no such thing as a new idea. It is impossible. We simply take a lot of old ideas and put them into a sort of mental kaleidoscope. We give them a turn and they make new and curious combinations. Mark Twain

14 Alternative Approach Modified Lambda Complex Still maintaining two separate stores Data Stream Ingest Layer Store data quickly Query Layer Query-friendly Storage Queries

15 Alternative Approach Modified Lambda Complex Still maintaining two separate stores Data Stream Ingest Layer Store data quickly Query Layer Query-friendly Storage Queries Costly Still two copies of the data

16 Alternative Approach Modified Lambda Complex Still maintaining two separate stores Incomplete Queries do not consider all data Data Stream Ingest Layer Store data quickly Query Layer Query-friendly Storage Queries Costly Still two copies of the data

17 Surely there must be a better way

18 What is IBM Db2 Event Store? A unified offering for Fast Data which delivers IBM Db2 Event Store 1 2 Lightning Fast Ingest 1 Million inserts per second per node Ingest scales linearly with added nodes Data ingested quickly, then refined and enriched Real-time Analytics Real-time analytics over ALL ingested data Super-fast lookups and intelligent scans Integrated machine learning capabilities 3 Integrated and Highly Available Packaged and integrated with IBM Data Science experience; available Streams sink Remains available on node failure Architected to scale to very large clusters 4 Built for Data Sharing and Efficiency Writes to shared storage in Parquet format Able to leverage low-cost object storage Single copy of the data

19 Understanding the Engine and Components IBM Db2 Event Store Open Access IBM Streams High Speed Ingest Real-Time Insights Machine Learning IBM Parquet Compatible BIGSQL tools IBM Event Store Cluster Highly Available Distributed Storage Open Data Format In-Memory Data grid

20 IBM Db2 Event Store A combination of IBM assets + Open Source

21 How Event Store Manages Data IBM Db2 Event Store Maintains two data tiers Most recent data stored locally and replicated for HA Data and index cached locally for fast access Rest of data stored in shared storage layer Data is sharded (by hash) and is logically owned ( mastered ) by a given node Shared storage layer Allows for fast recovery from failures with logical remastering Reduces storage costs when using object storage Provides high availability Separates compute and storage Node A Node B Node C IBM Event Store Engine IBM Event Store Engine Log (on SSD) Log (on SSD) Log (on SSD) Compressed Parquet data Shared Storage IBM Event Store Engine Cache Cache Cache

22 IBM Event Store Ingest 1. Ingest occurs using Python, Scala or Java API, or Streams sink 2. Batches of rows are formed and sent asynchronously from client to the appropriate Event Store nodes 3. Rows are placed in the queryable log, replicated to replica nodes, and reply is sent to client 4. After some time, data in logs is formed into Parquet blocks and written to the storage layer 1 Log (on SSD) Event Store Client Streaming Application Node A Node B Node C IBM Event Store Engine IBM Event Store Engine Log (on SSD) Compressed Parquet data Shared Storage 2 IBM Event Store Engine IBM Db2 Event Store Log (on SSD) Configurable Replication 3 Share 4

23 IBM Event Store Analytics 1. Analytics queries through Spark like Scala API, or through Python/REST 2. Queries are sent to either Event Store nodes, or vanilla Spark nodes depending on performance requirements and whether most recent (ungroomed) data is required a) Query is sent to Event Store nodes to retrieve most recent data and combine with groomed data in cache or in storage layer IBM BLUSpark Event Store Engine Engine Cache 2a Event Store Client Analytical Application Node A Node B Node C Spark Executor 1 Log (on SSD) Spark Executor IBM BLUSpark Event Store Engine Engine Cache 2a Log (on SSD) 2b Spark Executor IBM Db2 Event Store b) Query is sent to Spark node(s) to read data all but most recent data from storage Compressed Parquet data Storage

24 IBM Event Store Availability - Ingest Provides HA on all data Shared data leverages replicated storage Log data is replicated by IBM Event Store On insert In the presence of node failures, inserts continue to be processed So long as configured replication factor is achievable Event Store Client Streaming Application Node A Node B Node C IBM Event Store Engine IBM Event Store Engine IBM Event Store Engine Log (on SSD) Log (on SSD) Log (on SSD) IBM Db2 Event Store Configurable Replication Share Compressed Parquet data Storage

25 IBM Event Store Availability - Query Provides HA on all data Shared data leverages replicated storage Log data is replicated by IBM Event Store On query In the presence of node failures, queries continue to be processed So long as configured number of query replicas are reachable Data in storage layer is directly queryable regardless of the state of IBM Event Store nodes IBM Event Store Engine Event Store Client Transactional Application Node A Node B Node C Spark Executor Spark Executor IBM BLUSpark Event Store Engine Compressed Parquet data Spark Executor BLUSpark IBM Event Store Engine Log (on SSD) Log (on SSD) Log (on SSD) IBM Db2 Event Store Storage

26 Demo

27

28

29 Offerings

30 IBM Event Store Offerings IBM Db2 Event Store Developer Edition Free Download and Go edition Great for getting started, writing your first application Packaged with Desktop version of DSX Runs on MacOS, Linux, Windows Download from Enterprise Edition Production level offering Includes high availability, monitoring UI, REST API Packaged with DSX Local Comes in 1 and 3 node installers

31 Recent Enhancements

32 IBM Db2 Event Store Enterprise Version IBM Db2 Event Store Time To Live (TTL) What Continuous data ingest without ever increasing storage requirements Why Data is plentiful, but storage is costly and data value degrades with time How Retention time specified in hours at table creation time, data automatically deleted when retention time is exceeded JDBC Connectivity What Query a database using standard JDBC interface Why Leverage existing development skill and established tooling How Included with Event Store Enterprise v1.1.2 ( As well as Performance and stability improvements

33 Thank you For more information, visit: ibm.biz/eventstore Contact eventstore@ca.ibm.com IBM Hybrid Cloud / 2018 / 2018 IBM Corporation

34 Db2 Community Share. Solve. Do More. IBM Db2 Event Store Community JOIN the Db2 Community and keep an eye out for upcoming webinars, forum posts, blogs and more. Have questions about this content? Have suggestions for upcoming topics? Post them to the Forum or send an to acommunity Manager:

IBM Db2 Warehouse on Cloud

IBM Db2 Warehouse on Cloud IBM Db2 Warehouse on Cloud February 01, 2018 Ben Hudson, Offering Manager Noah Kuttler, Product Marketing CALL LOGISTICS Data Warehouse Community Share. Solve. Do More. There are 2 options to listen to

More information

Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager

Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager Please Note IBM s statements regarding its plans, directions, and intent are subject to change or

More information

Build and Deploy Stored Procedures with IBM Data Studio

Build and Deploy Stored Procedures with IBM Data Studio Build and Deploy Stored Procedures with IBM Data Studio December 19, 2013 Presented by: Anson Kokkat, Product Manager, Optim Database Tools 1 DB2 Tech Talk series host and today s presenter: Rick Swagerman,

More information

Lambda Architecture for Batch and Stream Processing. October 2018

Lambda Architecture for Batch and Stream Processing. October 2018 Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.

More information

that will impact New IoT Technology Trends Production Automation

that will impact New IoT Technology Trends Production Automation New IoT Technology Trends that will impact Production Automation Alexander Körner, Software Solution Architect Watson IoT Electronics Industry Lab, Munich IBM Deutschland GmbH @AlexKoeMuc 19. Juni 2018

More information

Accelerate Big Data Insights

Accelerate Big Data Insights Accelerate Big Data Insights Executive Summary An abundance of information isn t always helpful when time is of the essence. In the world of big data, the ability to accelerate time-to-insight can not

More information

Reducing MIPS Using InfoSphere Optim Query Workload Tuner TDZ-2755A. Lloyd Matthews, U.S. Senate

Reducing MIPS Using InfoSphere Optim Query Workload Tuner TDZ-2755A. Lloyd Matthews, U.S. Senate Reducing MIPS Using InfoSphere Optim Query Workload Tuner TDZ-2755A Lloyd Matthews, U.S. Senate 0 Disclaimer Copyright IBM Corporation 2010. All rights reserved. U.S. Government Users Restricted Rights

More information

20 years of Lotus Notes and a look into the next 20 years Lotusphere Comes To You

20 years of Lotus Notes and a look into the next 20 years Lotusphere Comes To You 20 years of Lotus Notes and a look into the next 20 years Lotusphere Comes To You Kevin Cavanaugh, Vice President, Messaging and Collaboration Lotus Software and WebSphere Portal email@us.ibm.com Organizations

More information

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways

More information

Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP

Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP 07.29.2015 LANDING STAGING DW Let s start with something basic Is Data Lake a new concept? What is the closest we can

More information

Db2 Analytics Accelerator V5.1 What s new in PTF 5

Db2 Analytics Accelerator V5.1 What s new in PTF 5 Ute Baumbach, Christopher Watson IBM Boeblingen Laboratory Db2 Analytics Accelerator V5.1 What s new in PTF 5 Legal Disclaimer IBM Corporation 2017. All Rights Reserved. The information contained in this

More information

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications

More information

IBM Social Rendering Templates for Digital Data Connector

IBM Social Rendering Templates for Digital Data Connector IBM Social Rendering Templates for Digital Data Dr. Dieter Buehler Software Architect WebSphere Portal / IBM Web Content Manager Social Rendering Templates for DDC- Overview This package demonstrates how

More information

IBM Infrastructure Suite for z/vm and Linux: Introduction IBM Tivoli OMEGAMON XE on z/vm and Linux

IBM Infrastructure Suite for z/vm and Linux: Introduction IBM Tivoli OMEGAMON XE on z/vm and Linux IBM Infrastructure Suite for z/vm and Linux: Introduction IBM Tivoli OMEGAMON XE on z/vm and Linux August/September 2015 Please Note IBM s statements regarding its plans, directions, and intent are subject

More information

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015 Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document

More information

zspotlight: Spark on z/os

zspotlight: Spark on z/os zspotlight: Spark on z/os Avijit Chatterjee, Ph.D. achatter@us.ibm.com, @ChatterAvijit STSM, IBM Competitive Project Office 1 CEOs are increasingly focused on customers as individuals leveraging contextual

More information

Bringing Data to Life

Bringing Data to Life Bringing Data to Life Data management and Visualization Techniques Benika Hall Rob Harrison Corporate Model Risk March 16, 2018 Introduction Benika Hall Analytic Consultant Wells Fargo - Corporate Model

More information

Innovate 2013 Automated Mobile Testing

Innovate 2013 Automated Mobile Testing Innovate 2013 Automated Mobile Testing Marc van Lint IBM Netherlands 2013 IBM Corporation Please note the following IBM s statements regarding its plans, directions, and intent are subject to change or

More information

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks Asanka Padmakumara ETL 2.0: Data Engineering with Azure Databricks Who am I? Asanka Padmakumara Business Intelligence Consultant, More than 8 years in BI and Data Warehousing A regular speaker in data

More information

What s New in the IBM Lotus Notes Client. Kevin O Connell, Consulting Manager, IBM Asia Pacific

What s New in the IBM Lotus Notes Client. Kevin O Connell, Consulting Manager, IBM Asia Pacific Technical Track What s New in the IBM Lotus Notes Client Kevin O Connell, Consulting Manager, IBM Asia Pacific ID101 What's New in the IBM Lotus Notes Client Kevin O'Connell Asia Pacific Consulting Manager

More information

IBM DB2 Analytics Accelerator Trends and Directions

IBM DB2 Analytics Accelerator Trends and Directions March, 2017 IBM DB2 Analytics Accelerator Trends and Directions DB2 Analytics Accelerator for z/os on Cloud Namik Hrle IBM Fellow Peter Bendel IBM STSM Disclaimer IBM s statements regarding its plans,

More information

REST APIs on z/os. How to use z/os Connect RESTful APIs with Modern Cloud Native Applications. Bill Keller

REST APIs on z/os. How to use z/os Connect RESTful APIs with Modern Cloud Native Applications. Bill Keller REST APIs on z/os How to use z/os Connect RESTful APIs with Modern Cloud Native Applications Bill Keller bill.keller@us.ibm.com Important Disclaimer IBM s statements regarding its plans, directions and

More information

Introduction to Hadoop. Owen O Malley Yahoo!, Grid Team

Introduction to Hadoop. Owen O Malley Yahoo!, Grid Team Introduction to Hadoop Owen O Malley Yahoo!, Grid Team owen@yahoo-inc.com Who Am I? Yahoo! Architect on Hadoop Map/Reduce Design, review, and implement features in Hadoop Working on Hadoop full time since

More information

Databricks, an Introduction

Databricks, an Introduction Databricks, an Introduction Chuck Connell, Insight Digital Innovation Insight Presentation Speaker Bio Senior Data Architect at Insight Digital Innovation Focus on Azure big data services HDInsight/Hadoop,

More information

IBM InfoSphere Data Replication s Change Data Capture (CDC) Fast Apply IBM Corporation

IBM InfoSphere Data Replication s Change Data Capture (CDC) Fast Apply IBM Corporation IBM InfoSphere Data Replication s Change Data Capture (CDC) Fast Apply Agenda - Overview of Fast Apply - When to use Fast Apply - The available strategies & when to use - Common concepts - How to configure

More information

Oracle TimesTen Scaleout: Revolutionizing In-Memory Transaction Processing

Oracle TimesTen Scaleout: Revolutionizing In-Memory Transaction Processing Oracle Scaleout: Revolutionizing In-Memory Transaction Processing Scaleout is a brand new, shared nothing scale-out in-memory database designed for next generation extreme OLTP workloads. Featuring elastic

More information

Managing IoT and Time Series Data with Amazon ElastiCache for Redis

Managing IoT and Time Series Data with Amazon ElastiCache for Redis Managing IoT and Time Series Data with ElastiCache for Redis Darin Briskman, ElastiCache Developer Outreach Michael Labib, Specialist Solutions Architect 2016, Web Services, Inc. or its Affiliates. All

More information

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development:: Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized

More information

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions

More information

The Evolution of Big Data Platforms and Data Science

The Evolution of Big Data Platforms and Data Science IBM Analytics The Evolution of Big Data Platforms and Data Science ECC Conference 2016 Brandon MacKenzie June 13, 2016 2016 IBM Corporation Hello, I m Brandon MacKenzie. I work at IBM. Data Science - Offering

More information

Big data streaming: Choices for high availability and disaster recovery on Microsoft Azure. By Arnab Ganguly DataCAT

Big data streaming: Choices for high availability and disaster recovery on Microsoft Azure. By Arnab Ganguly DataCAT : Choices for high availability and disaster recovery on Microsoft Azure By Arnab Ganguly DataCAT March 2019 Contents Overview... 3 The challenge of a single-region architecture... 3 Configuration considerations...

More information

Product Overview Analyst s Notebook Analyst s Notebook is a standalone desktop product for a single user Allows quick collation and visualization of unstructured or structured data Incorporates powerful

More information

5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992

5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992 2014-05-20 MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992 @SoQooL http://blog.mssqlserver.se Mattias.Lind@Sogeti.se 1 The evolution of the Microsoft data platform

More information

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Cloudera s Developer Training for Apache Spark and Hadoop delivers the key concepts and expertise need to develop high-performance

More information

Massive Scalability With InterSystems IRIS Data Platform

Massive Scalability With InterSystems IRIS Data Platform Massive Scalability With InterSystems IRIS Data Platform Introduction Faced with the enormous and ever-growing amounts of data being generated in the world today, software architects need to pay special

More information

Migrate from Netezza Workload Migration

Migrate from Netezza Workload Migration Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with

More information

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) Microsoft Solution Latest Sl Area Refresh No. Course ID Run ID Course Name Mapping Date 1 AZURE202x 2 Microsoft

More information

In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet

In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet Ema Iancuta iorhian@gmail.com Radu Chilom radu.chilom@gmail.com Big data analytics / machine learning 6+ years

More information

Approaching the Petabyte Analytic Database: What I learned

Approaching the Petabyte Analytic Database: What I learned Disclaimer This document is for informational purposes only and is subject to change at any time without notice. The information in this document is proprietary to Actian and no part of this document may

More information

Datacenter replication solution with quasardb

Datacenter replication solution with quasardb Datacenter replication solution with quasardb Technical positioning paper April 2017 Release v1.3 www.quasardb.net Contact: sales@quasardb.net Quasardb A datacenter survival guide quasardb INTRODUCTION

More information

MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS

MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS SUJEE MANIYAM FOUNDER / PRINCIPAL @ ELEPHANT SCALE www.elephantscale.com sujee@elephantscale.com HI, I M SUJEE MANIYAM Founder / Principal @ ElephantScale

More information

Caching patterns and extending mobile applications with elastic caching (With Demonstration)

Caching patterns and extending mobile applications with elastic caching (With Demonstration) Ready For Mobile Caching patterns and extending mobile applications with elastic caching (With Demonstration) The world is changing and each of these technology shifts has potential to make a significant

More information

Energy Management with AWS

Energy Management with AWS Energy Management with AWS Kyle Hart and Nandakumar Sreenivasan Amazon Web Services August [XX], 2017 Tampa Convention Center Tampa, Florida What is Cloud? The NIST Definition Broad Network Access On-Demand

More information

Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation. Chris Herrera Hashmap

Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation. Chris Herrera Hashmap Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation Chris Herrera Hashmap Topics Who - Key Hashmap Team Members The Use Case - Our Need for a Memory

More information

Capture Business Opportunities from Systems of Record and Systems of Innovation

Capture Business Opportunities from Systems of Record and Systems of Innovation Capture Business Opportunities from Systems of Record and Systems of Innovation Amit Satoor, SAP March Hartz, SAP PUBLIC Big Data transformation powers digital innovation system Relevant nuggets of information

More information

Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect

Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect Intro to Big Data on AWS Igor Roiter Big Data Cloud Solution Architect Igor Roiter Big Data Cloud Solution Architect Working as a Data Specialist for the last 11 years 9 of them as a Consultant specializing

More information

Unifying Big Data Workloads in Apache Spark

Unifying Big Data Workloads in Apache Spark Unifying Big Data Workloads in Apache Spark Hossein Falaki @mhfalaki Outline What s Apache Spark Why Unification Evolution of Unification Apache Spark + Databricks Q & A What s Apache Spark What is Apache

More information

Qunar Performs Real-Time Data Analytics up to 300x Faster with Alluxio

Qunar Performs Real-Time Data Analytics up to 300x Faster with Alluxio CASE STUDY Qunar Performs Real-Time Data Analytics up to 300x Faster with Alluxio Xueyan Li, Lei Xu, and Xiaoxu Lv Software Engineers at Qunar At Qunar, we have been running Alluxio in production for over

More information

ELTMaestro for Spark: Data integration on clusters

ELTMaestro for Spark: Data integration on clusters Introduction Spark represents an important milestone in the effort to make computing on clusters practical and generally available. Hadoop / MapReduce, introduced the early 2000s, allows clusters to be

More information

CICS V5.4 open beta and beyond

CICS V5.4 open beta and beyond CICS V5.4 open beta and beyond Alexander David Brown IBM UK Ltd. Date of presentation (01/10/2016) Session GB Preface IBM s statements regarding its plans, directions and intent are subject to change or

More information

Introduction to K2View Fabric

Introduction to K2View Fabric Introduction to K2View Fabric 1 Introduction to K2View Fabric Overview In every industry, the amount of data being created and consumed on a daily basis is growing exponentially. Enterprises are struggling

More information

5 Fundamental Strategies for Building a Data-centered Data Center

5 Fundamental Strategies for Building a Data-centered Data Center 5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse

More information

Technical Sheet NITRODB Time-Series Database

Technical Sheet NITRODB Time-Series Database Technical Sheet NITRODB Time-Series Database 10X Performance, 1/10th the Cost INTRODUCTION "#$#!%&''$!! NITRODB is an Apache Spark Based Time Series Database built to store and analyze 100s of terabytes

More information

Saving ETL Costs Through Data Virtualization Across The Enterprise

Saving ETL Costs Through Data Virtualization Across The Enterprise Saving ETL Costs Through Virtualization Across The Enterprise IBM Virtualization Manager for z/os Marcos Caurim z Analytics Technical Sales Specialist 2017 IBM Corporation What is Wrong with Status Quo?

More information

IBM B2B INTEGRATOR BENCHMARKING IN THE SOFTLAYER ENVIRONMENT

IBM B2B INTEGRATOR BENCHMARKING IN THE SOFTLAYER ENVIRONMENT IBM B2B INTEGRATOR BENCHMARKING IN THE SOFTLAYER ENVIRONMENT 215-4-14 Authors: Deep Chatterji (dchatter@us.ibm.com) Steve McDuff (mcduffs@ca.ibm.com) CONTENTS Disclaimer...3 Pushing the limits of B2B Integrator...4

More information

New Oracle NoSQL Database APIs that Speed Insertion and Retrieval

New Oracle NoSQL Database APIs that Speed Insertion and Retrieval New Oracle NoSQL Database APIs that Speed Insertion and Retrieval O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 6 1 NEW ORACLE NoSQL DATABASE APIs that SPEED INSERTION AND RETRIEVAL Introduction

More information

Databricks Delta: Bringing Unprecedented Reliability and Performance to Cloud Data Lakes

Databricks Delta: Bringing Unprecedented Reliability and Performance to Cloud Data Lakes Databricks Delta: Bringing Unprecedented Reliability and Performance to Cloud Data Lakes AN UNDER THE HOOD LOOK Databricks Delta, a component of the Databricks Unified Analytics Platform*, is a unified

More information

What's New in IBM Notes 9.0 Social Edition IBM Corporation

What's New in IBM Notes 9.0 Social Edition IBM Corporation What's New in IBM Notes 9.0 Social Edition IBM Client Strategy The flexible and comprehensive collaboration solution the client the server Universal access Remain productive regardless of location Browser

More information

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

Blended Learning Outline: Cloudera Data Analyst Training (171219a) Blended Learning Outline: Cloudera Data Analyst Training (171219a) Cloudera Univeristy s data analyst training course will teach you to apply traditional data analytics and business intelligence skills

More information

Built for Speed: Comparing Panoply and Amazon Redshift Rendering Performance Utilizing Tableau Visualizations

Built for Speed: Comparing Panoply and Amazon Redshift Rendering Performance Utilizing Tableau Visualizations Built for Speed: Comparing Panoply and Amazon Redshift Rendering Performance Utilizing Tableau Visualizations Table of contents Faster Visualizations from Data Warehouses 3 The Plan 4 The Criteria 4 Learning

More information

WHITEPAPER. MemSQL Enterprise Feature List

WHITEPAPER. MemSQL Enterprise Feature List WHITEPAPER MemSQL Enterprise Feature List 2017 MemSQL Enterprise Feature List DEPLOYMENT Provision and deploy MemSQL anywhere according to your desired cluster configuration. On-Premises: Maximize infrastructure

More information

Home of Redis. April 24, 2017

Home of Redis. April 24, 2017 Home of Redis April 24, 2017 Introduction to Redis and Redis Labs Redis with MySQL Data Structures in Redis Benefits of Redis e 2 Redis and Redis Labs Open source. The leading in-memory database platform,

More information

Creating a Recommender System. An Elasticsearch & Apache Spark approach

Creating a Recommender System. An Elasticsearch & Apache Spark approach Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused

More information

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context 1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes

More information

Netezza The Analytics Appliance

Netezza The Analytics Appliance Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for

More information

TECHED USER CONFERENCE MAY 3-4, 2016

TECHED USER CONFERENCE MAY 3-4, 2016 TECHED USER CONFERENCE MAY 3-4, 2016 Bruce Beaman, Senior Director Adabas and Natural Product Marketing Software AG Software AG s Future Directions for Adabas and Natural WHAT CUSTOMERS ARE TELLING US

More information

When, Where & Why to Use NoSQL?

When, Where & Why to Use NoSQL? When, Where & Why to Use NoSQL? 1 Big data is becoming a big challenge for enterprises. Many organizations have built environments for transactional data with Relational Database Management Systems (RDBMS),

More information

IBM DB2 Analytics Accelerator

IBM DB2 Analytics Accelerator June, 2017 IBM DB2 Analytics Accelerator DB2 Analytics Accelerator for z/os on Cloud for z/os Update Peter Bendel IBM STSM Disclaimer IBM s statements regarding its plans, directions, and intent are subject

More information

IBM Terms of Use SaaS Specific Offering Terms. IBM DB2 on Cloud. 1. IBM SaaS. 2. Charge Metrics

IBM Terms of Use SaaS Specific Offering Terms. IBM DB2 on Cloud. 1. IBM SaaS. 2. Charge Metrics IBM Terms of Use SaaS Specific Offering Terms IBM DB2 on Cloud The Terms of Use ( ToU ) is composed of this IBM Terms of Use - SaaS Specific Offering Terms ( SaaS Specific Offering Terms ) and a document

More information

Security and Performance advances with Oracle Big Data SQL

Security and Performance advances with Oracle Big Data SQL Security and Performance advances with Oracle Big Data SQL Jean-Pierre Dijcks Oracle Redwood Shores, CA, USA Key Words SQL, Oracle, Database, Analytics, Object Store, Files, Big Data, Big Data SQL, Hadoop,

More information

Cloud Analytics and Business Intelligence on AWS

Cloud Analytics and Business Intelligence on AWS Cloud Analytics and Business Intelligence on AWS Enterprise Applications Virtual Desktops Sharing & Collaboration Platform Services Analytics Hadoop Real-time Streaming Data Machine Learning Data Warehouse

More information

Fluentd + MongoDB + Spark = Awesome Sauce

Fluentd + MongoDB + Spark = Awesome Sauce Fluentd + MongoDB + Spark = Awesome Sauce Nishant Sahay, Sr. Architect, Wipro Limited Bhavani Ananth, Tech Manager, Wipro Limited Your company logo here Wipro Open Source Practice: Vision & Mission Vision

More information

TPF Users Group Fall 2008 Title: z/tpf Support for OpenLDAP

TPF Users Group Fall 2008 Title: z/tpf Support for OpenLDAP z/tpf V1.1 Title: z/tpf Support for OpenLDAP Name: Mark Cooper Venue: Main Tent AIM Enterprise Platform Software IBM z/transaction Processing Facility Enterprise Edition 1.1.0 Any reference to future plans

More information

Optimizing Data Transformation with Db2 for z/os and Db2 Analytics Accelerator

Optimizing Data Transformation with Db2 for z/os and Db2 Analytics Accelerator Optimizing Data Transformation with Db2 for z/os and Db2 Analytics Accelerator Maryela Weihrauch, IBM Distinguished Engineer, WW Analytics on System z March, 2017 Please note IBM s statements regarding

More information

Scale-Out Architectures for Secondary Storage

Scale-Out Architectures for Secondary Storage Technology Insight Paper Scale-Out Architectures for Secondary Storage NEC is a Pioneer with HYDRAstor By Steve Scully, Sr. Analyst February 2018 Scale-Out Architectures for Secondary Storage 1 Scale-Out

More information

Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape

Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape Tom Clark IBM Distinguished Engineer, Chief Architect Software 1 Data growth is continuing to explode Sensors & Devices

More information

Scaling Marketplaces at Thumbtack QCon SF 2017

Scaling Marketplaces at Thumbtack QCon SF 2017 Scaling Marketplaces at Thumbtack QCon SF 2017 Nate Kupp Technical Infrastructure Data Eng, Experimentation, Platform Infrastructure, Security, Dev Tools Infrastructure from early beginnings You see that?

More information

OFA Developer Workshop 2014

OFA Developer Workshop 2014 OFA Developer Workshop 2014 Shared Memory Communications over RDMA (SMC-R): Update Jerry Stevens IBM sjerry@us.ibm.com Trademarks, copyrights and disclaimers IBM, the IBM logo, and ibm.com are trademarks

More information

What is Gluent? The Gluent Data Platform

What is Gluent? The Gluent Data Platform What is Gluent? The Gluent Data Platform The Gluent Data Platform provides a transparent data virtualization layer between traditional databases and modern data storage platforms, such as Hadoop, in the

More information

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics Optimizing and Modeling SAP Business Analytics for SAP HANA Iver van de Zand, Business Analytics Early data warehouse projects LIMITATIONS ISSUES RAISED Data driven by acquisition, not architecture Too

More information

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training::

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training:: Module Title Duration : Cloudera Data Analyst Training : 4 days Overview Take your knowledge to the next level Cloudera University s four-day data analyst training course will teach you to apply traditional

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

Service Description. IBM DB2 on Cloud. 1. Cloud Service. 1.1 IBM DB2 on Cloud Standard Small. 1.2 IBM DB2 on Cloud Standard Medium

Service Description. IBM DB2 on Cloud. 1. Cloud Service. 1.1 IBM DB2 on Cloud Standard Small. 1.2 IBM DB2 on Cloud Standard Medium Service Description IBM DB2 on Cloud This Service Description describes the Cloud Service IBM provides to Client. Client means the company and its authorized users and recipients of the Cloud Service.

More information

The webinar will start soon... Elasticsearch Performance Optimisation

The webinar will start soon... Elasticsearch Performance Optimisation The webinar will start soon... Performance Optimisation 1 whoami Alan Hardy Sr. Solutions Architect NEMEA 2 Webinar Housekeeping & Logistics Slides and recording will be available following the webinar

More information

Behind the Glitz - Is Life Better on Another Database Platform?

Behind the Glitz - Is Life Better on Another Database Platform? Behind the Glitz - Is Life Better on Another Database Platform? Rob Bestgen bestgen@us.ibm.com DB2 for i CoE We know the stories My Boss thinks we should move to SQL Server Oracle is being considered for

More information

Apache Ignite and Apache Spark Where Fast Data Meets the IoT

Apache Ignite and Apache Spark Where Fast Data Meets the IoT Apache Ignite and Apache Spark Where Fast Data Meets the IoT Denis Magda GridGain Product Manager Apache Ignite PMC http://ignite.apache.org #apacheignite #denismagda Agenda IoT Demands to Software IoT

More information

Fast Innovation requires Fast IT

Fast Innovation requires Fast IT Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:

More information

Software Defined Storage for the Evolving Data Center

Software Defined Storage for the Evolving Data Center Software Defined Storage for the Evolving Data Center Petter Sveum Information Availability Solution Lead EMEA Technology Practice ATTENTION Forward-looking Statements: Any forward-looking indication of

More information

QLIK INTEGRATION WITH AMAZON REDSHIFT

QLIK INTEGRATION WITH AMAZON REDSHIFT QLIK INTEGRATION WITH AMAZON REDSHIFT Qlik Partner Engineering Created August 2016, last updated March 2017 Contents Introduction... 2 About Amazon Web Services (AWS)... 2 About Amazon Redshift... 2 Qlik

More information

Apache Spark 2.0. Matei

Apache Spark 2.0. Matei Apache Spark 2.0 Matei Zaharia @matei_zaharia What is Apache Spark? Open source data processing engine for clusters Generalizes MapReduce model Rich set of APIs and libraries In Scala, Java, Python and

More information

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for

More information

Optimizing Data Integration Solutions by Customizing the IBM InfoSphere Information Server Deployment Architecture IBM Redbooks Solution Guide

Optimizing Data Integration Solutions by Customizing the IBM InfoSphere Information Server Deployment Architecture IBM Redbooks Solution Guide Optimizing Data Integration Solutions by Customizing the IBM InfoSphere Information Server Deployment Architecture IBM Redbooks Solution Guide IBM InfoSphere Information Server provides a unified data

More information

Data Analytics at Logitech Snowflake + Tableau = #Winning

Data Analytics at Logitech Snowflake + Tableau = #Winning Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief

More information

Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Love to Scale

Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Love to Scale Apache Spark for RDBMS Practitioners: How I Learned to Stop Worrying and Love to Scale Luca Canali, CERN About Luca Data Engineer and team lead at CERN Hadoop and Spark service, database services 18+ years

More information

microsoft

microsoft 70-775.microsoft Number: 70-775 Passing Score: 800 Time Limit: 120 min Exam A QUESTION 1 Note: This question is part of a series of questions that present the same scenario. Each question in the series

More information

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security Bringing OpenStack to the Enterprise An enterprise-class solution ensures you get the required performance, reliability, and security INTRODUCTION Organizations today frequently need to quickly get systems

More information

IBM Europe Announcement ZP , dated November 6, 2007

IBM Europe Announcement ZP , dated November 6, 2007 IBM Europe Announcement ZP07-0484, dated November 6, 2007 IBM WebSphere Front Office for Financial Markets V2.0 and IBM WebSphere MQ Low Latency Messaging V2.0 deliver high speed and high throughput market

More information

MapR Enterprise Hadoop

MapR Enterprise Hadoop 2014 MapR Technologies 2014 MapR Technologies 1 MapR Enterprise Hadoop Top Ranked Cloud Leaders 500+ Customers 2014 MapR Technologies 2 Key MapR Advantage Partners Business Services APPLICATIONS & OS ANALYTICS

More information

Building a Data Strategy for a Digital World

Building a Data Strategy for a Digital World Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service

More information

Using WebSphere Application Server Optimized Local Adapters (WOLA) to Integrate COBOL and zaap-able Java

Using WebSphere Application Server Optimized Local Adapters (WOLA) to Integrate COBOL and zaap-able Java Using WebSphere Application Server Optimized Local Adapters (WOLA) to Integrate COBOL and zaap-able Java David Follis IBM March 12, 2014 Session Number 14693 Insert Custom Session QR if Desired. Trademarks

More information