DecisionCAMP 2016: Solving the last mile in model based development

Similar documents
Introduction to K2View Fabric

Data Management Glossary

The Truth About Test Data Management & Its Impact on Agile Development

Test Data Management Data Sheet

CIB Session 12th NoSQL Databases Structures

Architectural challenges for building a low latency, scalable multi-tenant data warehouse

Provide Real-Time Data To Financial Applications

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS

What is Gluent? The Gluent Data Platform

Data Informatics. Seon Ho Kim, Ph.D.

Incremental Updates VS Full Reload

Modern Data Warehouse The New Approach to Azure BI

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

Lambda Architecture for Batch and Stream Processing. October 2018

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024

Delivering a 360 o View in Healthcare and Life Sciences With Agile Data

Developing Microsoft Azure Solutions (70-532) Syllabus

Course Outline: Designing, Optimizing, and Maintaining a Database Administrative Solution for Microsoft SQL Server 2008

Qlik Sense Enterprise architecture and scalability

IBM Campaign Version-independent Integration with IBM Engage Version 1 Release 3.1 April 07, Integration Guide IBM

CISC 7610 Lecture 4 Approaches to multimedia databases. Topics: Document databases Graph databases Metadata Column databases

CHAPTER2 UNDERSTANDING WINDOWSAZURE PLATFORMARCHITECTURE

IBM Campaign Version-independent Integration with IBM Watson Campaign Automation Version 1 Release 1.5 February, Integration Guide IBM

Developing Microsoft Azure Solutions (70-532) Syllabus

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

1Z0-526

Automated Netezza Migration to Big Data Open Source

Esri and MarkLogic: Location Analytics, Multi-Model Data

What's New in SAS Data Management

SAP NLS Update Roland Kramer, SAP EDW (BW/HANA), SAP SE PBS Customer Information Day, July 1st, 2016

Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers

Stages of Data Processing

In-Memory Computing EXASOL Evaluation

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics

Developing Microsoft Azure Solutions

SmartData Fabric distributed virtual data, graph data and master data management, analytics and security. Solutions and Key Features Revision 2.

Realizing the Full Potential of MDM 1

WHITEPAPER. MemSQL Enterprise Feature List

Performance Evaluation of Redis and MongoDB Databases for Handling Semi-structured Data

Enterprise Open Source Databases

Informatica Power Center 10.1 Developer Training

OLAP Introduction and Overview

Big data easily, efficiently, affordably. UniConnect 2.1

Informatica Cloud Data Integration Winter 2017 December. What's New

Microsoft vision for a new era

MOC 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Hadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved

COURSE 10977A: UPDATING YOUR SQL SERVER SKILLS TO MICROSOFT SQL SERVER 2014

Big Data Hadoop Stack

IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1

Manual Trigger Sql Server 2008 Examples Insert Update

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization

MICROSOFT BUSINESS INTELLIGENCE

Big Data Architect.

ASG WHITE PAPER DATA INTELLIGENCE. ASG s Enterprise Data Intelligence Solutions: Data Lineage Diving Deeper

SHAREPOINT 2016 ADMINISTRATOR BOOTCAMP 5 DAYS

Oracle NoSQL Database Enterprise Edition, Version 18.1

Db2 Analytics Accelerator V5.1 What s new in PTF 5

Spread the Database Love with Heterogeneous Replication. MC Brown, VP, Products

SQL Server Everything built-in

ETL is No Longer King, Long Live SDD

Big Data Analytics. Rasoul Karimi

Turning Relational Database Tables into Spark Data Sources

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk

Shine a Light on Dark Data with Vertica Flex Tables

white paper Aster Data ncluster In - database Analytics with R

How Insurers are Realising the Promise of Big Data

Synchronization mechanisms between SAP BW and SAP HANA authorizations

Exam /Course 20767B: Implementing a SQL Data Warehouse

OPERATIONALIZING MACHINE LEARNING USING GPU ACCELERATED, IN-DATABASE ANALYTICS

Top 7 Data API Headaches (and How to Handle Them) Jeff Reser Data Connectivity & Integration Progress Software

Oracle NoSQL Database Enterprise Edition, Version 18.1

Migrate from Netezza Workload Migration

Evaluation Checklist Data Warehouse Automation

Introduction to SAP HANA and what you can build on it. Jan 2013 Balaji Krishna Product Management, SAP HANA Platform

Extend NonStop Applications with Cloud-based Services. Phil Ly, TIC Software John Russell, Canam Software

microsoft

Processing big data with modern applications: Hadoop as DWH backend at Pro7. Dr. Kathrin Spreyer Big data engineer

Lenses 2.1 Enterprise Features PRODUCT DATA SHEET

70-532: Developing Microsoft Azure Solutions

Implementing a Data Warehouse with Microsoft SQL Server 2012/2014 (463)

SAP IQ - Business Intelligence and vertical data processing with 8 GB RAM or less

Azure-persistence MARTIN MUDRA

FAQs. Business (CIP 2.2) AWS Market Place Troubleshooting and FAQ Guide

Oracle #1 RDBMS Vendor

Data Analytics using MapReduce framework for DB2's Large Scale XML Data Processing

A c t i v e w o r k s p a c e f o r e x t e r n a l d a t a a g g r e g a t i o n a n d S e a r c h. 1

Fast Track Model Based Design and Development with Oracle9i Designer. An Oracle White Paper August 2002

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Migrating Mappings and Mapplets from a PowerCenter Repository to a Model Repository

Security and Performance advances with Oracle Big Data SQL

Inventory (input to ECOMP and ONAP Roadmaps)

Integrate MATLAB Analytics into Enterprise Applications

Key Features. High-performance data replication. Optimized for Oracle Cloud. High Performance Parallel Delivery for all targets

Integrating your CX, ERP and HCM Clouds with your On-premises Applications CON7012

IBM Data Replication for Big Data

Transcription:

DecisionCAMP 2016: Solving the last mile in model based development Larry Goldberg July 2016 www.sapiensdecision.com

The Problem We are seeing very significant improvement in development Cost/Time/Quality. But the last mile in implementing decisions remains data integration: the Cost of this step remains high and the time unimproved by model based development Development Cost/time before Decision Modeling Cost/time after Decision Modeling Data Integration Cost/time before Decision Modeling Cost/time after Decision Modeling 2 www.sapiensdecision.com

The Solution: The Business Logical Unit The Business Logical Unit is the name given to a model of the information inputs and outputs of a given decision model Thus we are able to extend the model based paradigm into the data integration implementation for decision models. The Business Logical model is implemented in a tool we call Sapiens Decision InfoHub (DI) Our experience of this tool in the field shows: Reduction in time and cost of data integration in the development and change cycles Improved data governance Significantly enhanced decision execution performance Strengthened security of the data in the execution environment 3 www.sapiensdecision.com

The Business Logical Unit vs Classic Data Approaches TRADITIONAL RDBMS Data is scattered across multiple systems (e.g. CRM, Billing, etc.) Each system is independent Very hard to access all data corresponding to one entity (e.g. customer) UNSTRUCTURED BIG DATA Data is stored and distributed in big data system (e.g. Hadoop) No business logic in storage Data access for one entity requires lookup through massive amount of data BUSINESS LOGICAL UNIT Data is represented by BLU Distributed storage on demand Data access for one instance is a core feature Aligns with DM Glossary giving a business view of the data required for a given decision 4 www.sapiensdecision.com

The Business Logical Unit* *Patent Pending 5 www.sapiensdecision.com

Business Logical Unit Definitions BUSINESS LOGICAL UNIT (BLU) Generic word used to encapsulate the concept behind the business-oriented representation of data in the Decision BUSINESS LOGICAL UNIT TYPE (BLUT) Specific type of Business Logical Unit representing business-oriented data (e.g. Policy Renewal, Product Pricing, Patient Claim Eligibility, etc.) LUTs are configured in DI Studio BUSINESS LOGICAL UNIT INSTANCE (BLUI) Instance of a Logical Unit Type (e.g. Policy 123, Product N23RX, Patient 2344433456) Instances are the micro-databases stored in the DI servers 6 www.sapiensdecision.com

Relationship of Business Logical Unit, Type and Instance to The Decision Model Business Logical Unit Type (BLUT) Policy Renewal Business Logical Unit (BLU) Source Table representation Decision Model Input representation Decision Model Output representation Each input and output is represented in the BLU Business Logical Unit Instance (BLUI) The Decision Model 7 www.sapiensdecision.com

Relationship of Business Logical Unit, Type and Instance to The Decision Model Business Logical Unit Type (BLUT) Policy Renewal Business Logical Unit (BLU) Source Table representation Decision Model Input representation Decision Model Output representation Each input and output is represented in the BLU Business Logical Unit Instance (BLUI) The Decision Model 8 www.sapiensdecision.com

Flexible Synchronization ON-DEMAND SYNC On-demand calls triggered by web services, batch scripts or directly querying InfoHub (administrative mode). EVENT-BASED SYNC InfoHub synchronization can be triggered using the principles of Change Data Capture (CDC). ALWAYSYNC Timer based synchronizations Using and combining these synchronization strategies ensures that the data is available as needed by the execution environment 9 www.sapiensdecision.com

Row Level Security Hierarchical Encryption-Key Schema (HEKS) implemented for two BLU types 1 Master Key allowing full access 2 Type Keys restricting access to 2 different BLU Types 6 Instance Keys, 3 for each BLU Type restricting access at the BLU Instance level 10 www.sapiensdecision.com

Architecture KEY TO THE LAYERS IN THE DIAGRAM CONFIGURATION: The versioned configuration of every Logical Unit Type, accessed through administration tools (Admin Manager, Studio and Web Admin interfaces). WEB/DATABASE SERVICES: Communicates with user applications: either via direct queries (database services) or via web services. AUTHENTICATION ENGINE: Manages user access control and restrictions. MASKING LAYER: Optional, allows real time masking of sensitive data. PROCESSING ENGINE: Where every data computation is managed; it uses the principles of massive parallel processing and map-reduce in order execute operations. SMART DATA CONTROLLER: Drives the real-time synchronization of data to InfoHub. ETL LAYER: Embedded migration layer, allowing for automated ETL on retrieval. ENCRYPTION ENGINE: Manages the granular encryption of each data set. LU STORAGE MANAGER: Compresses and send data to the distributed database for storage. InfoHub leverages Cassandra as the distributed database. The communication between the distributed databases is very straight forward, making InfoHub a flexible solution that can be adapted to any other distributed database. Sapiens DECISION InfoHub Sapiens DM Sapiens DE/External RDBMS Third Party Reporting/BI Tools INFOHUB STUDIO 11 www.sapiensdecision.com

Web/Database Services Exposes DI s functions as APIs Support REST APIs Can output XML or JSON formats Access to the APIs is subject to Authentication Built-in function to extract the LU, or part of the LU as JSON/XML Provides JDBC protocol to access the distributed database directly, and interrogate it using SQL USER APPLICATIONS WEB/DATABASE SERVICES AUTHENTICATION ENGINE MASKING LAYER (OPTIONAL) PROCESSING ENGINE SMART DATA CONTROLLER ENCRYPTION ENGINE LU STORAGE MANAGER DISTRIBUTED DATABASE (Cassandra) 12 www.sapiensdecision.com

Authentication & Encryption Encryption Each LU Type / Instance is encrypted using a specific key Master Key LU Type Key LU Instance Key Authentication Users are assigned roles Access to LU instances can be specified at User level Access to methods (that access LU instances) is specified at role level USER APPLICATIONS WEB/DATABASE SERVICES AUTHENTICATION ENGINE MASKING LAYER (OPTIONAL) PROCESSING ENGINE SMART DATA CONTROLLER ENCRYPTION ENGINE LU STORAGE MANAGER DISTRIBUTED DATABASE (Cassandra) 13 www.sapiensdecision.com

Processing Engine Checks Synchronization status Manages SQL queries Handles necessary operations and computations Initiates and Synchronizes Workers to distribute the work Runs Aggregations using Map-Reduce algorithm USER APPLICATIONS WEB/DATABASE SERVICES 7 AUTHENTICATION ENGINE DATA MASKING (OPTIONAL) PROCESSING ENGINE Data Access Complete 1 Data Access Request 6 2 MPP MANAGER 5 4 WORKER 2 1 3 4 WORKER SMART DATA CONTROLLER ENCRYPTION ENGINE LU STORAGE MANAGER DISTRIBUTED DATABASE (Cassandra) 14 www.sapiensdecision.com

Smart Data Controller Retrieves data from distributed database storage to memory (if not already there) Checks Schema version against last published schema version Checks logical unit instance data timestamp against AlwaySync timer If required, Triggers ETL for data synchronization ETL services synchronizes data with source systems and sends it to the Processing Engine USER APPLICATIONS WEB/DATABASE SERVICES AUTHENTICATION ENGINE MASKING LAYER (OPTIONAL) PROCESSING ENGINE SMART DATA CONTROLLER ENCRYPTION ENGINE LU STORAGE MANAGER DISTRIBUTED DATABASE (Cassandra) 15 www.sapiensdecision.com

i 1 Decision Model Information Requirements Based on Process, regulation and Policy knowledge sources, requirements for decision logic emerge Persistence of the output in the BLU (BLUI) for analytics and decision optimization Logic and Glossary designed in the decision modeler Results of execution are persisted into DI for analytics and decision optimization 6 5 2 A Business Logical Unit is generated from Glossary of the Decision 7 Analytics and Machine Learning on inputs/outputs of decisions Decision Input 4 3 Output External source 1 The data sources required by the BLU are discovered partially by autodiscovery Business Logical Unit Type Internal DI Query on external sources External source 2 External source 3 BLU ETL Input table of LU with data fully transformed from input A Model Based Logic and Information Development Cycle Models Deployed to Execution Environment Business Logical Unit is populated through ETL and Synchronization rules One example of input transformation: could be regular expression, or one of many built in templates, or Java code 16 www.sapiensdecision.com

Key Features RISK-FREE INTEGRATION Fully Integrated with DECISION Suite, decision based business level model Embedded ETL Embedded Web-Services SQL support & REST-Like APIs Flexible Synchronization HIGH-END PERFORMANCE Logical Unit Storage In-Memory Computation Map-Reduce Massive Parallel Processing MODERN ARCHITECTURE ENHANCED SECURITY Fully distributed Linearly scalable Highly Available No Single Point of Failure Embedded Replication Row-Level security using HEKS Complete User Access Control Data Masking Library 18 www.sapiensdecision.com

Thank You Please visit us at www.sapiensdecision.com Larry Goldberg Evangelist Office: (919) 405-1515 Mobile: (919) 633-8686 larry.goldberg@sapiens.com www.sapiensdecision.com