Big Data analytics in insurance

Size: px
Start display at page:

Download "Big Data analytics in insurance"

Transcription

1 Big Data analytics in insurance

2 Who we are Experts At Your Service > Over 50 specialists in IT infrastructure > Certified, experienced, passionate Based In Switzerland > 100% self-financed Swiss company > Over CHF 8 mio. Turnover Leading In Infrastructure Services > More than 150 customers in CH, D & F > Over 50 SLAs dbi FlexService contracted Big Data analytics in insurance Page 2

3 About us Yann Neuhaus / David Hueber Chief Sales Officer / CEO yann.neuhaus[at]dbi-services.com / david.hueber[at]dbi-services.com Big Data analytics in insurance Page 3

4 Agenda 1.Introduction 2.Relational data model 3.Testing infrastructures 4.Big Data process 5.Conclusion Big Data analytics in insurance Page 4

5 Introduction > Big Data processes > Objectives > High level architecture Big Data analytics in insurance Page 5

6 Introduction Big Data Analytics Business Intelligence process Business requirements Old model Rigid model, Alteration, Long time Implementation Data source Data assessment Data cleansing Analyze Scope Collect Clean Visualization Page 6

7 Introduction Big Data Analytics Big Data process Business requirements New model Fast and Flexible process Data Data source source Data integration Data assessment Data exploration Data cleansing Business validation Analyze Process repetition Identification Semi/unstructured data Analyze, Patterns Implement solutions New sources, patterns Page 7

8 Introduction Objectives Analyzing the risks of an insurance portfolio Big Data Analytics for insurance Business Comparison between relational and Big Data data model > Implementation of relational data model > Implementation of a Big Data process > Choosing the right Big Data tools Page 8

9 Introduction Objectives DATA Data consolidation from multiple sources DATA DATA STATISTICS Analyze the risks of insurance customer BUSINESS Generation of new business use cases BUSINESS STATISTICS Big Data = Get the true about all data Page 9

10 Introduction Disclaimer BE CARFUL WE ARE NOT >Insurance Business Analyst >Data scientists It s a demonstration use case Not intended to production environments Big Data infrastructure project is first of all about choosing the right tool for your use case > Large ecosystem Page 10

11 Introduction High level architecture Customer Data - Exploration - Analyze - Processing - Manipulation Assess the risk of an insurance portfolio Page 11

12 Relational data model > Architecture > Accident fact table > Challenges Big Data analytics in insurance Page 12

13 Relational data model Architecture DB1 an OLTP database DB2 a DataWareHouse VEH Ref Tables VEH (tmp) Ref Tables CSV CSV CSV CUST CONTR Replication CUST (tmp) CONTR (tmp) Processing Data Load STG_ACC MV_CONTR FACT_ACC REPORTING Page 13

14 Relational data model Accident fact table Age range Gender Accident fact table Type vehicle Brand vehicle New dimension? Page 14

15 Relational data model Challenges Relational data model offer 3 main challenges for Big Data Performance > Data loading is slow with relational model (ACID vs CAP) Modeling > Adding a new dimension in the fact table consumes performances and leads to fragmentation Infrastructure > Increasing the data volume needs to scale up the infrastructure Page 15

16 Testing infrastructures 1 > Infrastructures comparison Big Data analytics in insurance Page 16

17 Testing infrastructures Infrastructures comparison Relation data model 2 nodes (1 OLTP 1 DWH) 4 GB Memory per nodes 2 vcpu per nodes 100 GB per nodes Oracle 12.2 Enterprise Edition Cloudera On-Premise cluster 5 nodes 8 GB Memory Cloudera Manager 4 GB Memory data nodes 1 vcpu per nodes 100 GB per nodes Cloudera EDH edition Oracle Big Data Appliance 6 nodes 64 GB Memory 2 * 22 CPU per nodes 96 TB per nodes Cloudera EDH edition Cloudera Cloud cluster 5 nodes EC2 15 GB Memory per nodes 4 vcpu per nodes 500 GB per nodes Cloudera EDH edition Page 17

18 Big Data process > Reference architecture > Data ingestion > Data analysis > Data processing > Data visualization Start Demo (on Cloudera master):./data_analysis.bash 5 Big Data analytics in insurance Page 18

19 Big Data process Reference architecture Data Sources Data Ingestion Data Analysis Data Processing Data Visualization CSV Customer Data CSV Page 19

20 Big Data process Data Ingestion Importing data from multiple sources into HDFS Data Lake Data sources > Customer csv (extracted from relational database) > Open Data csv (1 file per year in our business case) HDFS import methods > WebHDFS > HTTPFS > HDFS copy from local Page 20

21 Big Data process Data Ingestion > OpenData import ~]$ curl -i -X PUT -L " centos" -T /home/centos/opendata2009.csv ~]$ curl -i -X PUT -L " centos" -T /home/centos/opendata2010.csv > Customer data import ~]$ curl -i -X PUT -L " -T /home/centos/customer.csv Page 21

22 Big Data process Data ingestion Time results for data import 12 GB Data import Time (minutes) Page 22

23 Big Data process Data analysis Creation of Hive meta-data > Hive DDL > Creation of insurance database > Creation of tables > Accident data > customer Pre-analysis the data > Data manipulation with SQL queries External tables for data processing > Creation of external tables with transformations > customer1 & customer2 (2 transformation steps) > opendata ( transformed accident data) Page 23

24 Big Data process Data analysis Time results for data analysis Data analysis Time (minutes) Page 24

25 Big Data process Follow up Demo :./data_processing.bash Big Data analytics in insurance Page 25

26 Big Data process Data processing Retrieving external tables from HDFS > customer2 > opendata Usage of Spark for data processing > Spark SQL queries > Creation of Spark dataframes Implementation of python scripts for data counting > Persist results into HDFS in csv format Page 26

27 Big Data process Data processing Time results for data processing Data processing Time (minutes) Page 27

28 Big Data process Data visualization Creation of Hive external tables for OpenData and customer > per age range > per gender > per vehicle type > per vehicle brand ODBC connection for retrieving data from Hive > Extract tables from Hive insurance database ODBC connection with QlikView or Excel for reporting Follow up Demo :./data_reporting.bash Page 28

29 Conclusion > Big Data advantages > Advantages vs Drawbacks Big Data analytics in insurance Page 29

30 Conclusion Big Data advantages Flexible Architecture > Standard Big Data architecture > Lambda architecture Schemaless data model > No integrity constraints > Several way of doing Adding a new dimension in the fact table > Same data model > Same architecture > No performances decrease Page 30

31 Conclusion Advantages vs Drawbacks Flexibility for data modeling Performances New technologies Infrastructures are not easy to deploy New business generation Compatibility with common B.I tools Lack of human resources Page 31

32 Basel Delémont Zürich Any questions? Please do ask! Nyon We would love to boost }5 your IT-Infrastructure How about you? Big Data analytics in insurance Page 32

Tuning slow queries after an upgrade

Tuning slow queries after an upgrade Tuning slow queries after an upgrade Who we are Experts At Your Service > Over 50 specialists in IT infrastructure > Certified, experienced, passionate Based In Switzerland > 100% self-financed Swiss company

More information

Implement SAML 2.0 SSO in WLS using IDM Federation Services

Implement SAML 2.0 SSO in WLS using IDM Federation Services Implement SAML 2.0 SSO in WLS using IDM Federation Services Who we are Experts At Your Service > Over 60 specialists in IT infrastructure > Certified, experienced, passionate Based In Switzerland > 100%

More information

Infrastructure Consolidation with OCI

Infrastructure Consolidation with OCI Infrastructure Consolidation with OCI Who we are Experts At Your Service Over 60 specialists in IT infrastructure Certified, experienced, passionate Based In Switzerland 100% self-financed Swiss company

More information

CC13c LifeCycle Management. Infrastructure at your Service.

CC13c LifeCycle Management. Infrastructure at your Service. CC13c LifeCycle Management Infrastructure at your Service. About me Infrastructure at your Service. Wisson Delivery Manager Principal Consultant +41 79 819 25 98 gerard.wisson [ at ] dbi-services.com Page

More information

Incrementally Updating Backups Tips and Tricks

Incrementally Updating Backups Tips and Tricks Infrastructure at your Service. Incrementally Updating Backups Tips and Tricks Oracle 12.1.0.2 - Linux x86 64bit About me Infrastructure at your Service. William Sescu Consultant +41 78 674 12 90 william.sescu@dbi-services.com

More information

Infrastructure at your Service. DOAG Webinar. ODA 12c new features. ODA 12c new features

Infrastructure at your Service. DOAG Webinar. ODA 12c new features. ODA 12c new features Infrastructure at your Service. DOAG Webinar Infrastructure at your Service. About us David Hueber COO Principal Consultant Mobile +41 79 963 43 68 david-.hueber@dbi-services.com www.dbi-services.com Page

More information

Quo Vadis SQLTXPLAIN

Quo Vadis SQLTXPLAIN Quo Vadis SQLTXPLAIN Who we are Experts At Your Service > Over 50 specialists in IT infrastructure > Certified, experienced, passionate Based In Switzerland > 100% self-financed Swiss company > Over CHF8

More information

Infrastructure at your Service. Oracle over Docker. Oracle over Docker

Infrastructure at your Service. Oracle over Docker. Oracle over Docker Infrastructure at your Service. Infrastructure at your Service. About me David Hueber COO Principal Consultant Mobile +41 79 963 43 68 david-.hueber@dbi-services.com www.dbi-services.com Page 2 18.11.2015

More information

DOAG 2013 Oracle 12c Backup & Recovery

DOAG 2013 Oracle 12c Backup & Recovery Infrastructure at your Service. DOAG 2013 Oracle 12c Backup & Recovery Infrastructure at your Service. About me David Hueber Head of Service Management Senior Consultant Mobile +41 79 963 43 68 david.hueber@dbi-services.com

More information

Databases Clone using ACFS. Infrastructure at your Service.

Databases Clone using ACFS. Infrastructure at your Service. Infrastructure at your Service. About me Infrastructure at your Service. David Hueber COO Principal Consultant +41 79 963 43 68 david.hueber[at]dbi-services.com Page 2 Who we are dbi services Experts At

More information

Monitoring WebLogic with WLDF

Monitoring WebLogic with WLDF Monitoring WebLogic with WLDF Infrastructure at your Service. Infrastructure at your Service. About me Gérard Wisson Delivery Manager Senior Consultant Mobile +41 79 819 25 98 Gerard.wisson@dbi-services.com

More information

Infrastructure at your Service. In-Memory-Pläne für den 12.2-Optimizer: Teuer oder billig?

Infrastructure at your Service. In-Memory-Pläne für den 12.2-Optimizer: Teuer oder billig? Infrastructure at your Service. In-Memory-Pläne für den 12.2-Optimizer: Teuer oder billig? About me Infrastructure at your Service. Clemens Bleile Senior Consultant Oracle Certified Professional DB 11g,

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

RDBMS & WLS Consolidation with ODA X5-2

RDBMS & WLS Consolidation with ODA X5-2 RDBMS & WLS Consolidation with ODA X5-2 Who we are Experts At Your Service > Over 50 specialists in IT infrastructure > Certified, experienced, passionate Based In Switzerland > 100% self-financed Swiss

More information

Innovatus Technologies

Innovatus Technologies HADOOP 2.X BIGDATA ANALYTICS 1. Java Overview of Java Classes and Objects Garbage Collection and Modifiers Inheritance, Aggregation, Polymorphism Command line argument Abstract class and Interfaces String

More information

Modernizing Business Intelligence and Analytics

Modernizing Business Intelligence and Analytics Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from

More information

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Cloudera s Developer Training for Apache Spark and Hadoop delivers the key concepts and expertise need to develop high-performance

More information

Oracle Big Data Discovery

Oracle Big Data Discovery Oracle Big Data Discovery Turning Data into Business Value Harald Erb Oracle Business Analytics & Big Data 1 Safe Harbor Statement The following is intended to outline our general product direction. It

More information

English Edition Infrastructure at your Service. Ingenuity. Inventio. Insurance. Insight. Intensity. Influence. Ingenuity.

English Edition Infrastructure at your Service. Ingenuity. Inventio. Insurance. Insight. Intensity. Influence. Ingenuity. English Edition 2017 Infrastructure at your Service. ight Ingenuity Insurance Insight Inventio Intensity Ingenuity Influence Invention Editorial Dear Readers, It is my pleasure to present our company,

More information

Benchmarks Prove the Value of an Analytical Database for Big Data

Benchmarks Prove the Value of an Analytical Database for Big Data White Paper Vertica Benchmarks Prove the Value of an Analytical Database for Big Data Table of Contents page The Test... 1 Stage One: Performing Complex Analytics... 3 Stage Two: Achieving Top Speed...

More information

Data in the Cloud and Analytics in the Lake

Data in the Cloud and Analytics in the Lake Data in the Cloud and Analytics in the Lake Introduction Working in Analytics for over 5 years Part the digital team at BNZ for 3 years Based in the Auckland office Preferred Languages SQL Python (PySpark)

More information

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training::

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training:: Module Title Duration : Cloudera Data Analyst Training : 4 days Overview Take your knowledge to the next level Cloudera University s four-day data analyst training course will teach you to apply traditional

More information

Making the Most of Hadoop with Optimized Data Compression (and Boost Performance) Mark Cusack. Chief Architect RainStor

Making the Most of Hadoop with Optimized Data Compression (and Boost Performance) Mark Cusack. Chief Architect RainStor Making the Most of Hadoop with Optimized Data Compression (and Boost Performance) Mark Cusack Chief Architect RainStor Agenda Importance of Hadoop + data compression Data compression techniques Compression,

More information

Big Data solution benchmark

Big Data solution benchmark Big Data solution benchmark Introduction In the last few years, Big Data Analytics have gained a very fair amount of success. The trend is expected to grow rapidly with further advancement in the coming

More information

Modeling. Preparation. Operationalization. Profile Explore. Model Testing & Validation. Feature & Algorithm Selection. Transform Cleanse Denormalize

Modeling. Preparation. Operationalization. Profile Explore. Model Testing & Validation. Feature & Algorithm Selection. Transform Cleanse Denormalize Preparation Modeling Ingest Transform Cleanse Denormalize Profile Explore Visualize Feature & Algorithm Selection Model Testing & Validation Operationalization Models Visualizations Deploy Apps, Services

More information

SQL Server 2019 Big Data Clusters

SQL Server 2019 Big Data Clusters SQL Server 2019 Big Data Clusters Ben Weissman @bweissman > SOLISYON GMBH > FÜRTHER STRAßE 212 > 90429 NÜRNBERG > +49 911 990077 20 Who am I? Ben Weissman @bweissman b.weissman@solisyon.de http://biml-blog.de/

More information

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks Asanka Padmakumara ETL 2.0: Data Engineering with Azure Databricks Who am I? Asanka Padmakumara Business Intelligence Consultant, More than 8 years in BI and Data Warehousing A regular speaker in data

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment

More information

Welcome to the Era of Open Analytics

Welcome to the Era of Open Analytics Welcome to the Era of Open Analytics Sumit Sarkar, Chief Data Evangelist, Progress sumit.sarkar@progress.com @SAsInSumit www.linkedin.com/in/meetsumit The embedded analytics market is estimated to nearly

More information

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development:: Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized

More information

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

Blended Learning Outline: Cloudera Data Analyst Training (171219a) Blended Learning Outline: Cloudera Data Analyst Training (171219a) Cloudera Univeristy s data analyst training course will teach you to apply traditional data analytics and business intelligence skills

More information

CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI)

CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI) CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI) The Certificate in Software Development Life Cycle in BIGDATA, Business Intelligence and Tableau program

More information

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales

More information

BIG DATA COURSE CONTENT

BIG DATA COURSE CONTENT BIG DATA COURSE CONTENT [I] Get Started with Big Data Microsoft Professional Orientation: Big Data Duration: 12 hrs Course Content: Introduction Course Introduction Data Fundamentals Introduction to Data

More information

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION

More information

Exam Questions

Exam Questions Exam Questions 70-775 Perform Data Engineering on Microsoft Azure HDInsight (beta) https://www.2passeasy.com/dumps/70-775/ NEW QUESTION 1 You are implementing a batch processing solution by using Azure

More information

Approaching the Petabyte Analytic Database: What I learned

Approaching the Petabyte Analytic Database: What I learned Disclaimer This document is for informational purposes only and is subject to change at any time without notice. The information in this document is proprietary to Actian and no part of this document may

More information

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD Azure Data Factory VS. SSIS Reza Rad, Consultant, RADACAD 2 Please silence cell phones Explore Everything PASS Has to Offer FREE ONLINE WEBINAR EVENTS FREE 1-DAY LOCAL TRAINING EVENTS VOLUNTEERING OPPORTUNITIES

More information

Fast, In-Memory Analytics on PPDM. Calgary 2016

Fast, In-Memory Analytics on PPDM. Calgary 2016 Fast, In-Memory Analytics on PPDM Calgary 2016 In-Memory Analytics A BI methodology to solve complex and timesensitive business scenarios by using system memory as opposed to physical disk, by increasing

More information

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Big Data Hadoop Developer Course Content Who is the target audience? Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Complete beginners who want to learn Big Data Hadoop Professionals

More information

AWS Serverless Architecture Think Big

AWS Serverless Architecture Think Big MAKING BIG DATA COME ALIVE AWS Serverless Architecture Think Big Garrett Holbrook, Data Engineer Feb 1 st, 2017 Agenda What is Think Big? Example Project Walkthrough AWS Serverless 2 Think Big, a Teradata

More information

Accelerate Big Data Insights

Accelerate Big Data Insights Accelerate Big Data Insights Executive Summary An abundance of information isn t always helpful when time is of the essence. In the world of big data, the ability to accelerate time-to-insight can not

More information

Oracle Big Data. A NA LYT ICS A ND MA NAG E MENT.

Oracle Big Data. A NA LYT ICS A ND MA NAG E MENT. Oracle Big Data. A NALYTICS A ND MANAG E MENT. Oracle Big Data: Redundância. Compatível com ecossistema Hadoop, HIVE, HBASE, SPARK. Integração com Cloudera Manager. Possibilidade de Utilização da Linguagem

More information

Big Data Analytics using Apache Hadoop and Spark with Scala

Big Data Analytics using Apache Hadoop and Spark with Scala Big Data Analytics using Apache Hadoop and Spark with Scala Training Highlights : 80% of the training is with Practical Demo (On Custom Cloudera and Ubuntu Machines) 20% Theory Portion will be important

More information

Understanding the latent value in all content

Understanding the latent value in all content Understanding the latent value in all content John F. Kennedy (JFK) November 22, 1963 INGEST ENRICH EXPLORE Cognitive skills Data in any format, any Azure store Search Annotations Data Cloud Intelligence

More information

Security and Performance advances with Oracle Big Data SQL

Security and Performance advances with Oracle Big Data SQL Security and Performance advances with Oracle Big Data SQL Jean-Pierre Dijcks Oracle Redwood Shores, CA, USA Key Words SQL, Oracle, Database, Analytics, Object Store, Files, Big Data, Big Data SQL, Hadoop,

More information

VOLTDB + HP VERTICA. page

VOLTDB + HP VERTICA. page VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics

More information

Azure SQL Data Warehouse. Andrija Marcic Microsoft

Azure SQL Data Warehouse. Andrija Marcic Microsoft Azure SQL Data Warehouse Andrija Marcic Microsoft End to end platform built for the cloud Hadoop SQL Azure SQL Data Warehouse Azure SQL Database App Service Intelligent App Azure Machine Learning Power

More information

EsgynDB Enterprise 2.0 Platform Reference Architecture

EsgynDB Enterprise 2.0 Platform Reference Architecture EsgynDB Enterprise 2.0 Platform Reference Architecture This document outlines a Platform Reference Architecture for EsgynDB Enterprise, built on Apache Trafodion (Incubating) implementation with licensed

More information

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Raanan Dagan and Rohit Pujari September 25, 2017 Washington, DC Forward-Looking Statements During the course of this presentation, we may

More information

Introducing Microsoft SQL Server 2016 R Services. Julian Lee Advanced Analytics Lead Global Black Belt Asia Timezone

Introducing Microsoft SQL Server 2016 R Services. Julian Lee Advanced Analytics Lead Global Black Belt Asia Timezone Introducing Microsoft SQL Server 2016 R Services Julian Lee Advanced Analytics Lead Global Black Belt Asia Timezone SQL Server 2016: Everything built-in built-in built-in built-in built-in built-in $2,230

More information

Databricks, an Introduction

Databricks, an Introduction Databricks, an Introduction Chuck Connell, Insight Digital Innovation Insight Presentation Speaker Bio Senior Data Architect at Insight Digital Innovation Focus on Azure big data services HDInsight/Hadoop,

More information

Bull Fast Track/PDW and Big Data

Bull Fast Track/PDW and Big Data Bull Fast Track/PDW and Big Data Add High Performance BI to your Big Data Roger Van Unen Expert Microsoft / BI roger.van-unen@bull.net http://www.bull.fr/bi/fastrack.html Michael Schmitter BI Sales Germany

More information

The age of Big Data Big Data for Oracle Database Professionals

The age of Big Data Big Data for Oracle Database Professionals The age of Big Data Big Data for Oracle Database Professionals Oracle OpenWorld 2017 #OOW17 SessionID: SUN5698 Tom S. Reddy tom.reddy@datareddy.com About the Speaker COLLABORATE & OpenWorld Speaker IOUG

More information

Data Analytics using MapReduce framework for DB2's Large Scale XML Data Processing

Data Analytics using MapReduce framework for DB2's Large Scale XML Data Processing IBM Software Group Data Analytics using MapReduce framework for DB2's Large Scale XML Data Processing George Wang Lead Software Egnineer, DB2 for z/os IBM 2014 IBM Corporation Disclaimer and Trademarks

More information

Oracle Big Data Fundamentals Ed 2

Oracle Big Data Fundamentals Ed 2 Oracle University Contact Us: 1.800.529.0165 Oracle Big Data Fundamentals Ed 2 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, you learn about big data, the technologies

More information

microsoft

microsoft 70-775.microsoft Number: 70-775 Passing Score: 800 Time Limit: 120 min Exam A QUESTION 1 Note: This question is part of a series of questions that present the same scenario. Each question in the series

More information

Oracle Big Data Connectors

Oracle Big Data Connectors Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process

More information

Microsoft Big Data and Hadoop

Microsoft Big Data and Hadoop Microsoft Big Data and Hadoop Lara Rubbelke @sqlgal Cindy Gross @sqlcindy 2 The world of data is changing The 4Vs of Big Data http://nosql.mypopescu.com/post/9621746531/a-definition-of-big-data 3 Common

More information

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Copyright 2012, Oracle and/or its affiliates. All rights reserved. 1 Storage Innovation at the Core of the Enterprise Robert Klusman Sr. Director Storage North America 2 The following is intended to outline our general product direction. It is intended for information

More information

REGULATORY REPORTING FOR FINANCIAL SERVICES

REGULATORY REPORTING FOR FINANCIAL SERVICES REGULATORY REPORTING FOR FINANCIAL SERVICES Gordon Hughes, Global Sales Director, Intel Corporation Sinan Baskan, Solutions Director, Financial Services, MarkLogic Corporation Many regulators and regulations

More information

Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22

Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22 Take P, R or U and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22 Oliver Engels CEO, oh22data AG @oengels Datamonster from Germany MS Data Platform MVP President of PASS Germany

More information

Kubernetes for Stateful Workloads Benchmarks

Kubernetes for Stateful Workloads Benchmarks Kubernetes for Stateful Workloads Benchmarks Baremetal Like Performance for For Big Data, Databases And AI/ML Executive Summary Customers are actively evaluating stateful workloads for containerization

More information

Ian Choy. Technology Solutions Professional

Ian Choy. Technology Solutions Professional Ian Choy Technology Solutions Professional XML KPIs SQL Server 2000 Management Studio Mirroring SQL Server 2005 Compression Policy-Based Mgmt Programmability SQL Server 2008 PowerPivot SharePoint Integration

More information

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time

More information

BigInsights and Cognos Stefan Hubertus, Principal Solution Specialist Cognos Wilfried Hoge, IT Architect Big Data IBM Corporation

BigInsights and Cognos Stefan Hubertus, Principal Solution Specialist Cognos Wilfried Hoge, IT Architect Big Data IBM Corporation BigInsights and Cognos Stefan Hubertus, Principal Solution Specialist Cognos Wilfried Hoge, IT Architect Big Data 2013 IBM Corporation A Big Data architecture evolves from a traditional BI architecture

More information

BI4Dynamics NAV Installation Requirements

BI4Dynamics NAV Installation Requirements BI4Dynamics NAV Installation Requirements Last update: January 2018 BI4Dynamics NAV version: 6.1 Document version: 2.0 BI4Dynamics NAV System Requirements Page 1 1. SYSTEM REQUIREMENTS BI4Dynamics can

More information

Stages of Data Processing

Stages of Data Processing Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,

More information

Hadoop Overview. Lars George Director EMEA Services

Hadoop Overview. Lars George Director EMEA Services Hadoop Overview Lars George Director EMEA Services 1 About Me Director EMEA Services @ Cloudera Consulting on Hadoop projects (everywhere) Apache Committer HBase and Whirr O Reilly Author HBase The Definitive

More information

Automated Netezza to Cloud Migration

Automated Netezza to Cloud Migration Automated Netezza to Cloud Migration CASE STUDY Client Overview Our client is a government-sponsored enterprise* that provides financial products and services to increase the availability and affordability

More information

Orchestration of Data Lakes BigData Analytics and Integration. Sarma Sishta Brice Lambelet

Orchestration of Data Lakes BigData Analytics and Integration. Sarma Sishta Brice Lambelet Orchestration of Data Lakes BigData Analytics and Integration Sarma Sishta Brice Lambelet Introduction The Five Megatrends Driving Our Digitized World And Their Implications for Distributed Big Data Management

More information

BEST BIG DATA CERTIFICATIONS

BEST BIG DATA CERTIFICATIONS VALIANCE INSIGHTS BIG DATA BEST BIG DATA CERTIFICATIONS email : info@valiancesolutions.com website : www.valiancesolutions.com VALIANCE SOLUTIONS Analytics: Optimizing Certificate Engineer Engineering

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Oracle CoreTech Update OASC Opening 17. November 2014

Oracle CoreTech Update OASC Opening 17. November 2014 Oracle CoreTech Update OASC Opening 17. November 2014 Roger Wullschleger Senior Manager Sales Consulting CoreTech Oracle Software (Schweiz) GmbH Copyright 2014, Oracle and/or its affiliates. All rights

More information

Data Lake Based Systems that Work

Data Lake Based Systems that Work Data Lake Based Systems that Work There are many article and blogs about what works and what does not work when trying to build out a data lake and reporting system. At DesignMind, we have developed a

More information

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways

More information

SOFTWARE DEVELOPMENT: DATA SCIENCE

SOFTWARE DEVELOPMENT: DATA SCIENCE PROFESSIONAL CAREER TRAINING INSTITUTE SOFTWARE DEVELOPMENT: DATA SCIENCE www.pcti.edu/data-science applicant@pcti.edu 832-484-9100 PROGRAM OVERVIEW Prepare for a life changing career as a data scientist

More information

Big Data Architect.

Big Data Architect. Big Data Architect www.austech.edu.au WHAT IS BIG DATA ARCHITECT? A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional

More information

HDInsight > Hadoop. October 12, 2017

HDInsight > Hadoop. October 12, 2017 HDInsight > Hadoop October 12, 2017 2 Introduction Mark Hudson >20 years mixing technology with data >10 years with CapTech Microsoft Certified IT Professional Business Intelligence Member of the Richmond

More information

Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST

Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0 WEBINAR MAY 15 th, 2018 1PM EST 10AM PST Welcome and Logistics If you have problems with the sound on your computer, switch

More information

Microsoft Perform Data Engineering on Microsoft Azure HDInsight.

Microsoft Perform Data Engineering on Microsoft Azure HDInsight. Microsoft 70-775 Perform Data Engineering on Microsoft Azure HDInsight http://killexams.com/pass4sure/exam-detail/70-775 QUESTION: 30 You are building a security tracking solution in Apache Kafka to parse

More information

IBM DB2 Analytics Accelerator Trends and Directions

IBM DB2 Analytics Accelerator Trends and Directions March, 2017 IBM DB2 Analytics Accelerator Trends and Directions DB2 Analytics Accelerator for z/os on Cloud Namik Hrle IBM Fellow Peter Bendel IBM STSM Disclaimer IBM s statements regarding its plans,

More information

MapR Enterprise Hadoop

MapR Enterprise Hadoop 2014 MapR Technologies 2014 MapR Technologies 1 MapR Enterprise Hadoop Top Ranked Cloud Leaders 500+ Customers 2014 MapR Technologies 2 Key MapR Advantage Partners Business Services APPLICATIONS & OS ANALYTICS

More information

White Paper / Azure Data Platform: Ingest

White Paper / Azure Data Platform: Ingest White Paper / Azure Data Platform: Ingest Contents White Paper / Azure Data Platform: Ingest... 1 Versioning... 2 Meta Data... 2 Foreword... 3 Prerequisites... 3 Azure Data Platform... 4 Flowchart Guidance...

More information

Configuration Guide for Veeam Backup & Replication with the HPE Hyper Converged 250 System

Configuration Guide for Veeam Backup & Replication with the HPE Hyper Converged 250 System Configuration Guide for Veeam Backup & Replication with the HPE Hyper Converged 250 System 1 + 1 = 3 HPE + Veeam Better Together Contents Intended audience...3 Veeam Backup & Replication overview...3 Adding

More information

WHITEPAPER. MemSQL Enterprise Feature List

WHITEPAPER. MemSQL Enterprise Feature List WHITEPAPER MemSQL Enterprise Feature List 2017 MemSQL Enterprise Feature List DEPLOYMENT Provision and deploy MemSQL anywhere according to your desired cluster configuration. On-Premises: Maximize infrastructure

More information

<Insert Picture Here> Introduction to Big Data Technology

<Insert Picture Here> Introduction to Big Data Technology Introduction to Big Data Technology The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into

More information

@Pentaho #BigDataWebSeries

@Pentaho #BigDataWebSeries Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of

More information

Dealing with Data Especially Big Data

Dealing with Data Especially Big Data Dealing with Data Especially Big Data INFO-GB-2346.01 Fall 2017 Professor Norman White nwhite@stern.nyu.edu normwhite@twitter Teaching Assistant: Frenil Sanghavi fps241@stern.nyu.edu Administrative Assistant:

More information

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage Performance Study of Microsoft SQL Server 2016 Dell Engineering February 2017 Table of contents

More information

Přehled novinek v SQL Server 2016

Přehled novinek v SQL Server 2016 Přehled novinek v SQL Server 2016 Martin Rys, BI Competency Leader martin.rys@adastragrp.com https://www.linkedin.com/in/martinrys 20.4.2016 1 BI Competency development 2 Trends, modern data warehousing

More information

Oracle made it easy: Cloud DB Vergleich

Oracle made it easy: Cloud DB Vergleich Oracle made it easy: Cloud DB Vergleich MATTHIAS FUCHS, ESENTRI BORYS NESELOVSKYI, OPITZ CONSULTING DOAG 2018 KONFERENZ, NÜRNBERG Cloud Angebote für Oracle Datenbank ORACLE CLOUD Oracle Datenbank Microsoft

More information

Cloud Analytics and Business Intelligence on AWS

Cloud Analytics and Business Intelligence on AWS Cloud Analytics and Business Intelligence on AWS Enterprise Applications Virtual Desktops Sharing & Collaboration Platform Services Analytics Hadoop Real-time Streaming Data Machine Learning Data Warehouse

More information

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for

More information

Rickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers

Rickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers Rickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers Watson Data Platform Reference Architecture Business

More information

R Language for the SQL Server DBA

R Language for the SQL Server DBA R Language for the SQL Server DBA Beginning with R Ing. Eduardo Castro, PhD, Principal Data Analyst Architect, LP Consulting Moderated By: Jose Rolando Guay Paz Thank You microsoft.com idera.com attunity.com

More information

The Evolution of Big Data Platforms and Data Science

The Evolution of Big Data Platforms and Data Science IBM Analytics The Evolution of Big Data Platforms and Data Science ECC Conference 2016 Brandon MacKenzie June 13, 2016 2016 IBM Corporation Hello, I m Brandon MacKenzie. I work at IBM. Data Science - Offering

More information

4th National Conference on Electrical, Electronics and Computer Engineering (NCEECE 2015)

4th National Conference on Electrical, Electronics and Computer Engineering (NCEECE 2015) 4th National Conference on Electrical, Electronics and Computer Engineering (NCEECE 2015) Benchmark Testing for Transwarp Inceptor A big data analysis system based on in-memory computing Mingang Chen1,2,a,

More information

Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation. Chris Herrera Hashmap

Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation. Chris Herrera Hashmap Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation Chris Herrera Hashmap Topics Who - Key Hashmap Team Members The Use Case - Our Need for a Memory

More information