SmartData Fabric (SDF) aka SmartData Lake (SDL) aka Distributed Data Virtualization (DDV) Basic Overview

Size: px
Start display at page:

Download "SmartData Fabric (SDF) aka SmartData Lake (SDL) aka Distributed Data Virtualization (DDV) Basic Overview"

Transcription

1 Smart Fabric (SDF) aka Smart Lake (SDL) aka Distributed Virtualization (DDV) Basic Overview October 2018 Revision 8.6 Copyright 2018 WhamTech, Inc. 1

2 Smart Fabric Unique indexed adapter-based data virtualization, federation and integration for: Advanced data access and data security seen as a security solution discovery, quality, standardization, governance and relationships mapping Virtual data warehouse and/or virtual data mart lake + data management + master data management = data reservoir provisioning for highly curated, self-serve reporting, BI and analytics Interoperability with write-back to data sources not just app to app Seamless, automatic and near real-time updateable distributed master data management Virtual graph database and link analysis, and interactive graph/link visualization Standards such as ODBC, JDBC, REST APIs and SQL, and standard applications Revision 8.6 Copyright 2018 WhamTech, Inc. 2

3 Smart Fabric (SDF) Transparent virtual distributed data management layer that plugs-and-plays in existing IT infrastructures Complements and leverages existing IT systems, tools and applications Key differentiator: Federated adapters that Read, Transform and Index (RTI) data, wherever it resides, process queries against these indexes, and read and transform results data from sources Addresses upfront data discovery, security, quality, standards, MDM and other data-related processes Leaves and guards data in sources, copies (e.g., data lake) or stored in indexes a hybrid approach ORGANIZATION B EXTERNAL COMMERCIAL/PUBLIC DATA SOURCE ORGANIZATION A = SDF Federation Server = SDF Adapter = SDF CLOUD DATA SOURCE ORGANIZATION C Governance ORGANIZATION S OWN SYSTEMS OF REFERENCE DS1 DS2 DS3 DS4 DS5 DS6 DS7 DS8 DS9 DS10 DS11 DS12 Revision 8.6 Copyright 2018 WhamTech, Inc. 3 ETL MDM

4 Smartphone CRM app invoking new BPM-based workflows with write-back to legacy systems through standard APIs/data services Patient-centric smartphone app interacts with legacy data sources through new workflows developed and orchestrated by BPM software BPM workflows interact with data source through standard FHIR REST APIs provided as data services BPM workflows both read and write back to legacy data sources Public Cloud Applications Applications EIQ Adapter EIQ Adapter EIQ Federation Server EIQ Adapter Applications Applications Web Server BPM Workflow Web Server FHIR REST APIs Web Server EIQ Federation Server Applications Applications Hybrid Cloud 2.0 Applications EIQ Federation Server Applications EIQ Adapter EIQ Adapter Applications Patient Management EHR Type 1 Labs Hybrid Cloud 1.0 sources remain on premise Local Smart Fabric deployed on premise sources remain on premise Local Smart Fabric deployed on Cloud Applications EIQ Federation Server Applications EIQ Adapter EIQ Adapter Applications Patient Management EHR Type 2 Applications Patient Management EHR Type 3 ORGANIZATION A ORGANIZATION B ORGANIZATION C Revision 8.6 Copyright 2018 WhamTech, Inc. 4

5 New paradigm read, transform and index (RTI) ETL is No Longer King, Long Live SDD (aka RTI) - How to Close the Loop from Discovery (data) to Information (value-added data) to Insights (analytics) to Outcomes (business improvements) Paper presented at DATAVERSITY Architecture Summit in Chicago, IL, in November 2017 DATA analytics DATA integration Master DATA management DATA relationships DATA management The increasing value of DATA Raw DATA Revision 8.6 Copyright 2018 WhamTech, Inc. 5

6 are powerful as they enable processes and drive value Build and maintain DATA PROFILES using raw indexes for data discovery (metadata), data matching within and across data sources, and developing and testing data transforms Support FORRESTER ZERO TRUST DATA SECURITY FRAMEWORK discover, INDEX, classify and secure PCI, PHI, PII, GDPR, etc. PRE-PROCESS DATA to build and maintain production indexes to address data management fundamentals data usually discarded Index POINTERS REFER TO DATA in sources, copies, e.g., Lake, or stored in indexes pointers retained in results data = full traceability Enable HIGH PERFORMANCE, DISTRIBUTED PARALLEL QUERY PROCESSSING through standard drivers, APIs, Web/data services, SQL and other query languages MONITOR DATA SOURCES for content and relationships in near real-time, and process events Enable VIRTUAL GRAPH DATABASE, link analysis and graph/link visualization EOS Revision 8.6 Copyright 2018 WhamTech, Inc. 6

7 SDF EIQ Adapter index and query process DISCOVERY INITIAL INDEX BUILD CONTINUOUS INDEX UPDATE QUERY PROCESSING RESULTS RETRIEVAL BI / Analytics / Application(s) STANDARD DRIVER SQL Query Transform EIQ Product front-end Application to Standard View Mapping EIQ Federation Server EIQ Adapter Standard View Mapping to EIQ DEVELOP and TEST Profiler Read Transform Index (RTI) Tool USED BY BUILD Discovery Automatic Query Processing EIQ Transforms/clean-ups Update Server Result-set data source pointers Retrieval source-specific Other data source EIQ Adapters and EIQ Federation Servers EIQ Adapter EIQ Federation Server CONVENTIONAL DRIVER OR BULK LOAD USER API / DRIVER MESSAGE QUEUE Source Transaction Log Source Revision 8.6 Copyright 2018 WhamTech, Inc. 7

8 Basic Smart Fabric architecture Applications WhamTech Smart Fabric (SDF) ADMINISTRATION/ CONFIGURATION Virtual network, data source and advanced data discovery Standard drivers, APIs, Web/data services, SQL and other query languages Virtual data security Virtual event processing Virtual cybersecurity Virtual MDM WhamTech EIQ Federation Servers Virtual reporting, BI and analytics WhamTech EIQ Adapters (indexed and conventional federated) Independent structured and unstructured, and indexed views preprocessing (read, but not usually stored) Virtual interactive link analysis and visualization ACCESS SECURITY Optional Lake Centralized or Distributed Sources Mainframes bases Files Logs Office docs Applications Web docs Social media Big Streaming Cloud IoT Revision 8.6 Copyright 2018 WhamTech, Inc. 8

9 Smart Fabric Keyword Descriptions (1 of 2) DISTRIBUTED aka FEDERATED (not centralized) VIRTUAL (leaves data where it resides uses federated adapters) SECURITY (data and access to it) DATA MANAGEMENT (data discovery, classification, security, processing/analytics, cleansing, transformation, standardization, mapping to a standard data view, linking/matching within and across data sources, indexing and query processing) MASTER DATA MANAGEMENT (hybrid [limited repository + full registry] near realtime distributed, seamless and automatic integration with data access) ANALYTICS (built-in, externally run against and highly curated data provisioning for) INDEPENDENT (of where data resides and associated systems, and configurations) INTEROPERABLE (can write back to data sources) Revision 8.6 Copyright 2018 WhamTech, Inc. 9

10 Smart Fabric Keyword Descriptions (2 of 2) INDEXES SYNCED TO DATA SOURCES IN REAL-TIME TO BATCH (twelve change data capture options) VIRTUAL GRAPH DATABASE (semantic model) VIRTUAL LINK ANALYSIS (find connections between entities, n degrees of separation) GRAPH/LINK VISUALIZATION (highly interactive thin client, OEM tool) RUNS IN CLOUD, ON PREMISE, IN DATA CENTERS OR AS HYBRID (including Hybrid Cloud 2.0) EVENT PROCESSING (as indexes and index views are being updated) ULTIMATE METADATA MANAGEMENT (complete on all data) BUILDS AND SUPPORTS DATA GOVERNANCE (bottom-up/edge-in) EOS Revision 8.6 Copyright 2018 WhamTech, Inc. 10

11 SDF better than or as good as alternatives No. Feature Smart Fabric Warehouse Conventional Federated Search 1 Minimal time to implement and add new sources 2 Relatively low cost and high ROI 3 Flexibility of use 4 Actively monitor data sources ( ) or 5 Unstructured data 6 Unlimited query options and performance ( )* 7 Denormalized views ( )* ( ) 8 Relationship/link mapping or or 9 Write back to data sources or 10 No major schema transforms or 11 source changes readily accommodated 12 Full text search *with data marts Revision 8.6 Copyright 2018 WhamTech, Inc. 11 Lake

12 SDF combines best of and overcomes worst of alternatives No. Feature Smart Fabric Warehouse Conventional Federated Search 13 Clean and usable data 14 Consistent and multiple indexes and types ( ) 15 Install nothing on data source systems 16 Pre-aggregated, calculated and join views 17 Results when data sources unavailable or 18 Row, column and data element security ( ) 19 Structured data or 20 stays in original format 21 remains in source or 22 User-level access to source data 23 Latest data available 24 Drill-down capability ( ) Lake Revision 8.6 Copyright 2018 WhamTech, Inc. 12

13 SDF slightly disadvantaged compared to alternatives No. Feature Smart Fabric Warehouse Conventional Federated Search 25 No index or query load on data sources ( ) 26 source owners not aware of queries ( ) 27 Archive options ( )* ( ) 28 Good for application data sources ( ) 29 Minimal additional system cost ( ) 30 No need for data or index update process *Can store and index either in own or third-party database: 1. Changed data for archive 2. Derived data (aggregations, calculations) 3. Any designated data Can index original format archived data, e.g., mainframe files stored for SOX compliance Lake Revision 8.6 Copyright 2018 WhamTech, Inc. 13

14 F I R E W A L L F I R E W A L L Smart Fabric security-centric distributed virtual data, master data and graph data management, and analytics Example multiple data source SDF configuration RDBMS Social Media Feed EIQ SuperAdapter EIQ SuperAdapter EIQ Federation Server Adapters and federation servers independently configurable and accessible at multiple levels Potential LIFO/FIFO query processing Hadoop EIQ SuperAdapter Mainframe ERP System EIQ Conventional Adapter EIQ SuperAdapter EIQ Federation Server EIQ Federation Server TCP / IP WhamTech ODBC/JDBC Driver, APIs, Web/data services Application(s) Salesforce 3 rd Party Adapter Revision 8.6 Copyright 2018 WhamTech, Inc. 14

15 Example shared-nothing architecture Out-of-the-box configurable backup, failover and load balancing = high availability Application(s) EIQ Federation Server can be multiple sharded segments or replicated copies EIQ Federation Server EIQ Federation Server EIQ Federation Server EIQ SuperAdapter EIQ SuperAdapter EIQ SuperAdapter EIQ SuperAdapter EIQ SuperAdapter EIQ SuperAdapter Source Revision 8.6 Copyright 2018 WhamTech, Inc. 15

16 Initial EIQ Adapter configuration, index build and data view mapping Discovery and raw indexbased Profiling Metadata Discovery and Semantic Mapping Classification and Security Develop and test Transforms using profiles Distributed Metadata Repository, incl. Governance Source Discovery Network Asset and Device Discovery Source Read, Transform/ clean-up (and Index) EIQ EIQ Adapter* w/sdv** Automated Discovery and Classification (ADDC) thus far Twelve ways to build and maintain indexes Index schema and names usually same as data source usually do not store data only queryable representations Alternate use of raw indexes to initially build EIQ mapped to SDV *EIQ SuperAdapter and EIQ TurboAdapter **Standard View Revision 8.6 Copyright 2018 WhamTech, Inc. 16

17 EIQ Adapter index update, query and results retrieval Distributed Metadata Repository, incl. Governance Queries resolved in the EIQ Adapter and EIQ Result-set pointers to data in source Source Read, Transform/ clean-up (and Index) User-level access Continual EIQ updates EIQ Federation Server EIQ Multiple other data sources EIQ EIQ Federation EIQ Server Server Adapter* (sub- (submiddleware) w/sdv** Middleware) w/sdv Raw results data usually transformed/cleaned-up from source Applications / middleware connect with standard drivers or Web Services and SQL*** Middleware Application(s) Results provided in almost any format *EIQ SuperAdapter and EIQ TurboAdapter **Standard View ***Future OQL, SPARQL and NoSQL options Revision 8.6 Copyright 2018 WhamTech, Inc. 17

18 Smart Fabric for data federation/abstraction Discovery Profiling Classification Categorization Access and Security Cleansing Transformation Standardization Masking Tokenization Encryption Indexing (data discarded) Standard data view mapping Indexed views BI, analytics, CRM and BPM support Link mapping/ indexing Master Management (MDM) Event processing Event correlation [Anomaly detection] High performance parallel query processing Integration (results read only) Interoperability (results read and write) Link Analysis/Graph base WhamTech key differentiators addressed upfront Revision 8.6 Copyright 2018 WhamTech, Inc. 18

19 Enterprise conventional data life-cycle Mart (DM)/ Analytics base/ Link Analysis- Graph base Acted on Analyzed Reported Related Indexed Stored Quality/improved Discarded/retained Copied Indexed Stored Created Stored Copied Quality/improved Stored Indexed Copied Quality/improved copied multiple times Log/Transaction System Operational Store (ODS) Warehouse (ETL and DW) Revision 8.6 Copyright 2018 WhamTech, Inc. 19

20 Big life-cycle similar to enterprise data life-cycle Big / Analytics base/ Link Analysis/ Graph base Acted on Analyzed Reported Related Indexed Stored Quality/improved Discarded/retained Copied Indexed Stored Created Stored Copied x 3 Quality/improved Stored Indexed Copied Quality/improved copied multiple times Log/Transaction System Big Lake/Reservoir (similar to ODS) Big Refinery (similar to ETL) Revision 8.6 Copyright 2018 WhamTech, Inc. 20

21 SDF eliminates most conventional data life-cycle stages Reported Related Indexed Stored Quality/improved Discarded/retained Acted on Analyzed Copied Indexed Stored Created Stored Copied Quality/improved Stored Indexed Copied Quality/improved Reported Discarded/retained Acted on Analyzed WhamTech Smart Fabric Created Stored Quality/improved Indexed Related Master Capabilities in the Smart Fabric support applications Revision 8.6 Copyright 2018 WhamTech, Inc. 21

22 provisioning for Big and other analytics Big / Analytics base/ Link Analysis/ Graph base Reported Indexed Stored Quality/improved Discarded/retained Acted on Analyzed Copied Created Stored Quality/improved Indexed Related Master data mapping, quality, security, masking tokenization, encryption and link mapping, and master data, addressed Eliminate up to 80% of time spent by expensive data scientists/analysts preparing data Tend towards real-time analytics and feedback to operational/transactional systems Log/Transaction System WhamTech Smart Fabric Revision 8.6 Copyright 2018 WhamTech, Inc. 22

23 SDF bridges the gap between the enterprise, and reporting, BI, analytics and other apps Integration (read only) Mapping Quality Masking, Tokenization and Security Encryption Individual Group Single customer view (read only) Interoperability (read and write) Enterprise (operational, transactional, other and external data sources) Aggregation Links/ Relationships Smart Fabric Provisioning Governance Master Reporting, BI, analytics and other apps Population Multiple entity centricities Local Queries/Results Regional Other Predictive rules/interactive CRM and BPM Global Revision 8.6 Copyright 2018 WhamTech, Inc. 23

24 Smart Fabric basic solutions Revision 8.6 Copyright 2018 WhamTech, Inc. 24

25 Smart Fabric incremental solutions Revision 8.6 Copyright 2018 WhamTech, Inc. 25

26 Sounds too good to be true where is this applied? (1 of 2) Major project(s) with a very large healthcare provider, starting with selective patient indexing/segmentation for contributing network providers 3 completed -> 100 -> 600 -> 20,000 data sources = incremental delivery Multiple other incremental use cases, e.g., matching prescriptions to diagnoses, and potential for becoming the basis for a Smart Lake Major project(s) with a very large system integrator, starting with enabling a highly security-centric single person view across multiple disparate HR systems using standard semantic view data access, MDM, event processing, BPM workflows and operational dashboards Past work with major DoD and intel government contractors high performance and complex query processing, including up to 60 billion records/day on HBase Includes federal healthcare projects with CMS, VA and other agencies Federal Fraud Fusion Center Unemployment insurance claims, Medicare, Medicaid, SSA, SARS, NIBRs, etc., MDM, data stays in sources (fed, state and local), graph and link visualization, and support SPARK and ML analytics Revision 8.6 Copyright 2018 WhamTech, Inc. 26

27 Sounds too good to be true where is this applied? (2 of 2) Work with Northrop Grumman on a number of projects POC for single patient view for NHS Trust in the UK 3 organizations, 7 data sources = 4 on premise and 2 in Cloud - Hybrid Cloud, use NHS MPI and own MPI, FHIR APIs, services and AWS POC for patient smartphone app support for interactive CRM, new BPM workflows on REST API data services, with read and write back to legacy data sources Message Bank for very large medical academic delivery system for HL7 and other messages Cassandra target data source, real-time, parse and index, VMPI, FHIR APIs, and future support for reporting, BI and analytics, including SPARK and ML Bitcoin/Blockchain transaction reporting, BI and analytics particularly, graph visualization, link analysis, SNA and MDM Potential integration and matching of unstructured healthcare insurance benefits documents with structured claims data Virtual graph database, link analysis and graph visualization using simple SQL OEM KeyLines visualization Revision 8.6 Copyright 2018 WhamTech, Inc. 27

28 Conventional vs. SDF adapters costs and ROI comparison Attribute Conventional Federated WhamTech SDF Adapters Access Adapters Costs - TCO Up to 1000 % of WhamTech 100% ROI assuming TCO as basis, and revenue gains and cost savings Capabilities Basic Advanced more capabilities for less cost Perpetual License IBM and others > 200% of 100%, starting at $10K per data source Costs CAPEX WhamTech; some freeware and Red Lease/SaaS Costs Hat < WhamTech Assume 40% of perpetual license costs per year, including maintenance and support Implementation Costs Up to 500% of WhamTech, long duration to implement Maintenance and 18% of perpetual license costs Support Costs included in lease/saas costs 40% of perpetual license costs per year, including maintenance and support 100%, relatively simple to implement = low costs and short duration 18% of perpetual license costs included in lease/saas costs Revision 8.6 Copyright 2018 WhamTech, Inc. 28

29 The End Appendix: Backup material Revision 8.6 Copyright 2018 WhamTech, Inc. 29

30 Simplified* layer capabilities *Detailed layer diagram in Appendix, slide 39 Applications Standard drivers, APIs, Web/data services, SQL and other query languages Built-in advanced capabilities RBAC and data loss prevention Link analysis/graph database Visualization Big /analytics data provisioning Built-in support for common applications Post-index, standard data view, multi-record processing Pre-index, single record processing Support for BI/analytics Master data management Indexing content, security, extracted entities, indexed views, unstructured and Link Security classification, data masking, tokenization and encryption Support for CRM Support for BPM/ decision support Event processing Event correlation [Anomaly detection] Standard data view mapping more than one possible Quality cleansing, transformation and standardization Discovery, profiling and correlation device, source and data Support for interoperability Analytics parsing, categorization, entity extraction and other analytics AUTOMATION Sources Mainframes bases Files Logs Optional Lake Centralized or Distributed Office docs Applications Web docs Social media Big Streaming Cloud IoT Revision 8.6 Copyright 2018 WhamTech, Inc. 30

31 Advanced data management Combine the best and overcome the worst of conventional approaches of data warehousing, federated data access and enterprise search through index-based federated adapters to create a Hybrid Smart Fabric/Lake without needing to copy or move data from source systems, although that is an option Access adapters and federation servers at any level and from any location with access control Include all data sources from mainframes to IoT devices, on premise, Cloud, Hybrid Cloud, external, etc. Use multiple types of indexes and indexed views distributed, 100% contiguous across data sources, columnar, file-based and contain pointers to source data Federate/distribute data governance built and maintained from the bottom up as systems are discovered, read, indexed and metadata captured combine with IAM, RBAC, etc. - can obtain (and store) a complete centralized data governance view at any time, and intervene and impose as needed Federate/distribute metadata repository, data discovery, security, quality, transforms, relationships and mapping to standard data views Seamless, automatic and near real-time updateable integration of master data management to enable single customer/patient and other entity views across the extended organization can also federate/distribute master data Revision 8.6 Copyright 2018 WhamTech, Inc. 31

32 Advanced application support Combine queries on structured, semi-structured and unstructured data Accelerate query processing on existing systems, but almost no load on data sources High performance parallel/edge query processing Enable direct access to data sources through indexes and/or use indexes to represent and use data logically as (1) objects, (2) relational, (3) hierarchical and (4) NoSQL/Big Table Built-in virtual graph database, link analysis and graph visualization - use simple SQL Event processing - monitor changes to data sources through indexes and indexed views, trigger workflows, and update applications and visualizations, e.g., operational dashboards and graphs True interoperability based on single customer/patient views with both read and write-back to data sources goal to have almost any application working with almost any data source(s) Provision highly curated data to Big /analytics in near real-time Bridge the gap between enterprise operational/transactional systems and reporting, BI, analytics and other applications tend towards closing the loop in near real-time Partnering with unique analytics management vendor, Aginity (aginity.com) Revision 8.6 Copyright 2018 WhamTech, Inc. 32

33 Advanced data security Leverage centrally managed IAM and attribute-based RBAC, e.g., Active Directory (AD) with Kerberos source stewards can have ultimate veto Supports Single Sign-On (SSO) for all data sources regardless of data source system support Be a data security gatekeeper for data sources Follow Forrester Zero Trust Security Model = Discover, index, classify and secure All results data traceable to source records Row, column and data element security Dynamic data masking, tokenization and encryption (third-party Format-Preserving Encryption (FPE)) governance from the bottom-up and/or can support a top-down tool Full auditability Support for third-party User Behavior Analytics (UBA) Alleviates/prevents insider data thefts (25%) and external origin (hacks) data thefts (75%) Leverage user logs, including queries made Revision 8.6 Copyright 2018 WhamTech, Inc. 33

34 Other advanced capabilities NEAR REAL-TIME ARCHITECTURE Edge process data Enable a near real-time data/event-driven architecture Build new workflows using BPM software on top of legacy systems to support operations, CRM, smartphone apps, IOT devices, reporting, BI and analytics STANDARDS Standard drivers, APIs, Web/data services, REST APIs, ANSI SQL and other query languages with conversion, Cloud, VMs, PMs, Windows, Linux and soon-to-be IBM Power Systems Standard data models/views, e.g., HL7 and FHIR for healthcare, NIEM for government and other areas, XBR and others for financial services, ACORD for insurance or organization s own Revision 8.6 Copyright 2018 WhamTech, Inc. 34

35 Smart Fabric example Bluemix deployment Multiple access methods Multiple query language options Multiple ways to represent data Standard data view, e.g., FHIR APIs and NIEM Cloud platform-based data services New BPM workflows running on legacy data sources Write-back to data sources VMPI-governed data access Multiple legacy data sources sources could be in multiple organizations sources could be on premise and in the Cloud Hybrid Cloud access Revision 8.6 Copyright 2018 WhamTech, Inc. 35

36 Multi-level value contribution Reduce costs Increase revenue Increase profit Improve customer experience Upsell and cross-sell customers Gain customers Improve compliance Improve reporting Tend to realtime Reduce waste Reduce liability Convert knowledge to SUCCESS OUTCOMES Visualization Link analysis Social analytics Ontology representation BI/ analytics BPM Decision support CRM/ MDM EHR/ HIE Convert information to KNOWLEDGE Indexed views Advanced text search Entity extraction Entity resolution Link / mapping Realtime alerts CEP Categorization discovery profiling Access almost any data source Integrate multiple data sources Convert data to value-added INFORMATION Leave data in sources Work with structured and unstructured data Improve data quality Build structured and unstructured indexes Update in near realtime Map to a virtual standard data view Provide basic indexed virtual DATA access, integration, sharing and interoperability Scale with distributed parallel processing Almost no load on data sources Revision 8.6 Copyright 2018 WhamTech, Inc. 36

37 Smart Fabric is location agnostic Revision 8.6 Copyright 2018 WhamTech, Inc. 37

38 Smart Fabric is configuration agnostic Revision 8.6 Copyright 2018 WhamTech, Inc. 38

39 -driven bottom-up vs top-down approach For each data source: Application/Middleware External Query in SQL WhamTech s Standard Drivers/APIs/Web Services WhamTech s Mapping Layer WhamTech s Automatic Query Processor EIQ SuperAdapter WhamTech Content WhamTech Link Quality/Parser/Entity Extraction/Other discovery and profiling Transaction Log Reader, MQ or similar Source Driver/API/Web Service Source Revision 8.6 Copyright 2018 WhamTech, Inc. 39

40 Detailed Smart Fabric layer capabilities Applications Real-time Business Intelligence Distributed Analytics Complex Event Processing Enterprise and Web Search Virtual Graph base Link Analysis Social Network Analysis Living Networks CDI-MDM/ Single Entity View Other Applications Standard drivers, APIs, Web/data services, SQL and other query languages EIQ SuperAdapter Security Layer Query Side Automatic Query Processing Real-time Monitoring and Event Processing Master Management Smart Fabric Administration and Configuration Tools Structured Master Text Extracted Entity Semantic Mapping to Standard View(s) Fuzzy Match Preaggregated Precalculated Embedded Value Join Link Denormalized Security and Privacy Access Controls Security Layer Side Cleansing, Transformation, Standardization, Masking, Tokenization and Encryption Entity Extraction/NLP/Categorization/Other Text Processing Metadata Discovery/ Profiling Device, Source and Discovery Metadata Management and Repository, incl. Governance Changed Capture Intelligent Spider Sources Network Assets and Devices Relational bases Standard, Proprietary and Web Service Drivers Mainframe data Files Changed Capture Web Services Application Drivers Applications Network Assets and Devices Enterprise documents and Spidered files from Web and other sources Revision 8.6 Copyright 2018 WhamTech, Inc. 40

41 Smart Fabric impact diagram Intra and Inter- Source Correlation Profiling Source Discovery Semantic Identification Structured Unstructured Index Preparation and Results Transformation Transform Development and Testing Masking, Tokenization and Encryption Parser Transform Entity Extraction Security Classification Categorization Security Classification Content Structured (most data discarded) Distributed Metadata Repository Distributed Metadata Standard View Standard View (indexes semantically mapped) Link Results Pointers Link Master and Results Master Master Results Link Analytics and Visualization Device/Host Unstructured (most data discarded) Results Revision 8.6 Copyright 2018 WhamTech, Inc. 41

42 Smart Fabric impact diagram for query submission APPLICATION Discovery Index Preparation and Results Transformation Content Standard View Link Master and Results 3 Structured (most data discarded) 1 1 Master Structured Unstructured Distributed Metadata Repository 3 Distributed Metadata 2 Standard View (indexes semantically mapped) Link Master 3 Link Analytics and Visualization 3 Unstructured (most data discarded) Revision 8.6 Copyright 2018 WhamTech, Inc. 42

43 Smart Fabric impact diagram for results retrieval APPLICATION Discovery Index Preparation and Results Transformation Content Standard View Link Master and Results 4 Structured Unstructured Masking, Tokenization and Encryption 5 5 Transform Structured (most data discarded) Distributed Metadata Repository 4 Distributed Metadata 6 Standard View (indexes semantically mapped) 7 Results Pointers Link Master Master 4 9 Link Analytics and Visualization Unstructured (most data discarded) 8 Results Revision 8.6 Copyright 2018 WhamTech, Inc. 43

44 Smart Fabric processes (1 of 5) Automate (using BPM) as much as possible: Deploy on AWS, Azure, IBM Bluemix, OpenStack, VMs, physical servers other cloud options available Instantiate an EIQ System Administration and Configuration Tool Instantiate a distributed network asset/device, data source and metadata repository Network asset/device discovery source discovery Using network asset/device discovery tool Using spiders for ediscovery-type documents, files, , etc. Instantiate EIQ Adapters on demand Revision 8.6 Copyright 2018 WhamTech, Inc. 44

45 Smart Fabric processes (2 of 5) discovery Optionally, with raw Link (internal and external pre-joins) identification DSL: risk classification CS: Event correlation DSL = Security Layer CS = Cybersecurity profiling for data transforms for typos, transpositions and non-standard data, e.g., name, address, phone and correction Lookup dictionaries and thesauri, USPS or other address correction, regular expressions, APIs, DLLs, transformation server, etc. DSL: Masking, tokenization or encryption for indexed data or dynamically depending on access controls Revision 8.6 Copyright 2018 WhamTech, Inc. 45

46 Smart Fabric processes (3 of 5) Multiple indexes and types, e.g., basic content, DSL: security (classification), aggregations, calculations, fuzzy, text, extracted entities and Link DSL: Can encrypt entire disc volumes, individual indexes or entire sets of indexes MDM: source-specific tables containing unique indexed primary entity IDs, and master data, links and date-time MDM = Master Management Create using Link Index process, with multi-attribute fuzzy match for composite scoring and master data rules DSL: WhamTech Security and Privacy Access Profiles (SPAPs) or other Role- Based Access Control Current: Source organization, user, role, application, target organization and data source profiles available Future: Extend for application processes Revision 8.6 Copyright 2018 WhamTech, Inc. 46

47 Smart Fabric processes (4 of 5) Hierarchies honored through joins and/or Link Inferred ontologies Reasons for hierarchies change depending on application, e.g., one vendor has multiple products and one product from multiple vendors MDM: Versioning with access to historic master data Combine with other data sources, tending towards EDW/enterprise solutions MDM: Pure registry option to replace either data source indexes or source data itself (automatically updates indexes) with master data Pure registry-based master data table, but limits options, lower performance and more complex Execute analytics, combined with other data and search/query filters, e.g., reporting, BI and link analysis/graph database Include aggregations, calculations, master data (if available) and other data, e.g., external Revision 8.6 Copyright 2018 WhamTech, Inc. 47

48 Smart Fabric processes (5 of 5) Write back selective updates/corrections to data sources with possible inverse data transforms (MDM: See previous slide) Continuously monitor metadata (index tree profiles) using stored procedures with triggers Helps identify anomalies/outliers Event processing enabled (federated solution for Oracle Event Processing) Open source and commercial BPM software for non-oracle solutions Interoperability query transformation to avoid rewriting applications Goal to enable almost any application(s) to work with almost any data source(s) Mainframe data source option files and live systems Hadoop (HBase/Hive and HDFS levels) and Cassandra options Revision 8.6 Copyright 2018 WhamTech, Inc. 48

49 Smart Fabric key differentiators expanded (1 of 2) 1. Index data regardless of where it resides in sources not copied or moved, work with copies or store in indexes a hybrid approach 2. Any data source mainframes, legacy, databases, office documents, Web, social media, streaming, Hadoop, Cassandra, structured, unstructured, semi-structured, etc. 3. Automation as much as possible detect new devices, and discover data sources and data, e.g., for IoT and networks 4. quality and other data management fundamentals addressed before any data access cleansed, transformed and standardized indexes, and results 5. Own indexes, indexed views and query processing, as part of adapters to multiple data sources parallel distributed processing linearly scalable maintained in real-time using changed data capture, incrementally and batch 6. Event processing (past federated data access partner with Oracle ) Revision 8.6 Copyright 2018 WhamTech, Inc. 49

50 Smart Fabric key differentiators expanded (2 of 2) 7. Link mapping finds relationships between entities within and across data sources and captures them in Link using (i) exact, (ii) fuzzy and (iii) composite weighted probability matches 8. Link used to (i) locate and read entities for master data, (ii) execute degrees of separation queries, (iii) accelerate joins, (iv) represent ontological/semantic models and (v) perform link analytics 9. Seamless combination of master data with operational, other enterprise and external data systems, and of use by any application 10. Link analytics for single customer/patient views, network visualization, fraud detection, debottlenecking, social media analysis, etc. 11. Highly interactive browser-based query, results and link visualization tool 12. Support for active CRM, BPM, individual analytics and machine learning Revision 8.6 Copyright 2018 WhamTech, Inc. 50

51 Smart Fabric data processes Analyzed Discarded/retained Acted on Note: not copied or moved - only results retrieved Reported (Results data quality/ transformation) Results retrieval Query processing Master data management Device/host discovery source discovery discovery and profiling security classification and link indexing quality/transformation/ masking/tokenization/encryption Revision 8.6 Copyright 2018 WhamTech, Inc. 51

52 Solutions to multiple challenging problems in one platform Applications Virtual network, data source and advanced data discovery Virtual data security Virtual event processing Virtual cybersecurity Virtual MDM Virtual reporting, BI and analytics Virtual interactive link analysis and visualization Optional Modules Distributed Virtualization (and Federation, Integration and Interoperability) Platform, aka Smart Fabric Basic Product Revision 8.6 Copyright 2018 WhamTech, Inc. 52

53 Solutions to multiple challenging problems in one platform 1. SMARTDATA FABRIC (SDF) for basic data discovery, profiling, quality, mapping, indexing, virtualization, federation, integration and interoperability as basis for capability support modules and applications 2. EXTEND SDF WITH AUTOMATED NETWORK, DATA SOURCE AND ADVANCED DATA DISCOVERY including relationships and eventual automated mapping to a standard data view 3. EXTEND SDF WITH EVENT PROCESSING to keep track of significant changes occurring in data 4. EXTEND SDF WITH DISTRIBUTED (preferably HYBRID) MDM to seamlessly combine with operational/transactional data and maintain in near real-time 5. EXTEND SDF WITH IMPROVED CYBERSECURITY through indexed federated log and other data source access, including automated anomaly detection and automated event correlation 6. EXTEND SDF WITH VIRTUAL DATA SECURITY LAYER to defend and protect data of value (i) as it is created, (ii) at rest in the source, (iii) in transit, (iv) at the recipient and (v) after no longer needed 7. EXTEND SDF WITH BI/ANALYTICS oriented virtual and materialized real-time updateable hierarchical indexed views, text analytics including entity extraction, and locally executed algorithms 8. EXTEND SDF WITH LINK ANALYSIS (and OEM LINK VISUALIZATION) for link analysis/graph database for almost any type of analytics, including virtual MDM (master patient index), cybersecurity and data security Revision 8.6 Copyright 2018 WhamTech, Inc. 53

Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1

Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1 Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1 Page 1 of 11 www.whamtech.com (972) 991-5700 info@whamtech.com August 2018 Page 2 of 11 www.whamtech.com (972) 991-5700 info@whamtech.com

More information

SmartData Fabric distributed virtual data, graph data and master data management, analytics and security. Solutions and Key Features Revision 2.

SmartData Fabric distributed virtual data, graph data and master data management, analytics and security. Solutions and Key Features Revision 2. s and Key Features Revision 2.5 Page 1 of 7 www.whamtech.com (972) 991-5700 info@whamtech.com March 2018 ID SOL1 Automated Data Discovery and Classification (ADDC) Key Feature ID KF01 KF02 KF03 Key Feature

More information

ETL is No Longer King, Long Live SDD

ETL is No Longer King, Long Live SDD ETL is No Longer King, Long Live SDD How to Close the Loop from Discovery to Information () to Insights (Analytics) to Outcomes (Business Processes) A presentation by Brian McCalley of DXC Technology,

More information

Configuration for Registry*, Repository or Hybrid** Master Data Management. *aka Federated or Virtual **aka Coexistence

Configuration for Registry*, Repository or Hybrid** Master Data Management. *aka Federated or Virtual **aka Coexistence Configuration for Registry*, Repository or Hybrid** Master Data Management *aka Federated or Virtual **aka Coexistence October 2017 (Best viewed in slideshow mode) Revision 3.6 Copyright 2017 WhamTech,

More information

Distributed Hybrid MDM, aka Virtual MDM Optional Add-on, for WhamTech SmartData Fabric

Distributed Hybrid MDM, aka Virtual MDM Optional Add-on, for WhamTech SmartData Fabric Distributed Hybrid MDM, aka Virtual MDM Optional Add-on, for WhamTech SmartData Fabric Revision 2.1 Page 1 of 17 www.whamtech.com (972) 991-5700 info@whamtech.com August 2018 Contents Introduction... 3

More information

Virtual Hybrid Master Patient Index (VMPI) Proof-of-Concept (POC) for Huntington Medical Foundation

Virtual Hybrid Master Patient Index (VMPI) Proof-of-Concept (POC) for Huntington Medical Foundation Virtual Hybrid Master Patient Index (VMPI) Proof-of-Concept (POC) for Huntington Medical Foundation August 2018 (Best viewed in slideshow mode) Revision 3.4 Copyright 2018 WhamTech, Inc. 1 VMPI POC Goals

More information

OEP Application. CEP JDBC Data Cartridge. WhamTech JDBC Driver. WhamTech EIQ Federation Servers/ EIQ Adapters. Indexes and Indexed Views

OEP Application. CEP JDBC Data Cartridge. WhamTech JDBC Driver. WhamTech EIQ Federation Servers/ EIQ Adapters. Indexes and Indexed Views OEP Application CEP JDBC Data Cartridge CEP Table WhamTech JDBC Driver WhamTech EIQ Federation Servers/ EIQ Adapters Indexes and Indexed Views Oracle Event Processing Using Indexed Virtualized and Federated

More information

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time

More information

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales

More information

WhamTech SmartData Fabric Healthcare Configurable FHIR REST APIs

WhamTech SmartData Fabric Healthcare Configurable FHIR REST APIs WhamTech SmartData Fabric Healthcare Configurable FHIR REST APIs March 2017 (Best viewed in slideshow mode) Revision 1.2 Copyright 2017 WhamTech, Inc. 1 Challenges facing interoperability in healthcare

More information

Transforming IT: From Silos To Services

Transforming IT: From Silos To Services Transforming IT: From Silos To Services Chuck Hollis Global Marketing CTO EMC Corporation http://chucksblog.emc.com @chuckhollis IT is being transformed. Our world is changing fast New Technologies New

More information

The Emerging Data Lake IT Strategy

The Emerging Data Lake IT Strategy The Emerging Data Lake IT Strategy An Evolving Approach for Dealing with Big Data & Changing Environments bit.ly/datalake SPEAKERS: Thomas Kelly, Practice Director Cognizant Technology Solutions Sean Martin,

More information

Building a Data Strategy for a Digital World

Building a Data Strategy for a Digital World Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service

More information

ADABAS & NATURAL 2050+

ADABAS & NATURAL 2050+ ADABAS & NATURAL 2050+ Guido Falkenberg SVP Global Customer Innovation DIGITAL TRANSFORMATION #WITHOUTCOMPROMISE 2017 Software AG. All rights reserved. ADABAS & NATURAL 2050+ GLOBAL INITIATIVE INNOVATION

More information

Optimized Data Integration for the MSO Market

Optimized Data Integration for the MSO Market Optimized Data Integration for the MSO Market Actions at the speed of data For Real-time Decisioning and Big Data Problems VelociData for FinTech and the Enterprise VelociData s technology has been providing

More information

Fast Innovation requires Fast IT

Fast Innovation requires Fast IT Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:

More information

PERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract

PERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract PERSPECTIVE Data Virtualization A Potential Antidote for Big Data Growing Pains Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and value. Now they

More information

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

MAPR DATA GOVERNANCE WITHOUT COMPROMISE MAPR TECHNOLOGIES, INC. WHITE PAPER JANUARY 2018 MAPR DATA GOVERNANCE TABLE OF CONTENTS EXECUTIVE SUMMARY 3 BACKGROUND 4 MAPR DATA GOVERNANCE 5 CONCLUSION 7 EXECUTIVE SUMMARY The MapR DataOps Governance

More information

@Pentaho #BigDataWebSeries

@Pentaho #BigDataWebSeries Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of

More information

USERS CONFERENCE Copyright 2016 OSIsoft, LLC

USERS CONFERENCE Copyright 2016 OSIsoft, LLC Bridge IT and OT with a process data warehouse Presented by Matt Ziegler, OSIsoft Complexity Problem Complexity Drives the Need for Integrators Disparate assets or interacting one-by-one Monitoring Real-time

More information

2013 Cisco and/or its affiliates. All rights reserved. 1

2013 Cisco and/or its affiliates. All rights reserved. 1 2013 Cisco and/or its affiliates. All rights reserved. 1 Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems Brian McCarson Sr. Principal Engineer & Sr. System

More information

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions

More information

The Value of Data Modeling for the Data-Driven Enterprise

The Value of Data Modeling for the Data-Driven Enterprise Solution Brief: erwin Data Modeler (DM) The Value of Data Modeling for the Data-Driven Enterprise Designing, documenting, standardizing and aligning any data from anywhere produces an enterprise data model

More information

Link Indexes Overcome Conventional Link Analysis Problems and Provide Other Unique Solutions

Link Indexes Overcome Conventional Link Analysis Problems and Provide Other Unique Solutions Link Indexes Overcome Conventional Link Analysis Problems and Provide Other Unique Solutions August 2018 (Best viewed in slideshow mode) Revision 3.3 Copyright 2018 WhamTech, Inc. 1 WhamTech Takes a Bottom-up

More information

Lambda Architecture for Batch and Stream Processing. October 2018

Lambda Architecture for Batch and Stream Processing. October 2018 Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.

More information

Capture Business Opportunities from Systems of Record and Systems of Innovation

Capture Business Opportunities from Systems of Record and Systems of Innovation Capture Business Opportunities from Systems of Record and Systems of Innovation Amit Satoor, SAP March Hartz, SAP PUBLIC Big Data transformation powers digital innovation system Relevant nuggets of information

More information

Enabling Data Governance Leveraging Critical Data Elements

Enabling Data Governance Leveraging Critical Data Elements Adaptive Presentation at DAMA-NYC October 19 th, 2017 Enabling Data Governance Leveraging Critical Data Elements Jeff Goins, President, Jeff.goins@adaptive.com James Cerrato, Chief, Product Evangelist,

More information

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight ESG Lab Review InterSystems Data Platform: A Unified, Efficient Data Platform for Fast Business Insight Date: April 218 Author: Kerry Dolan, Senior IT Validation Analyst Abstract Enterprise Strategy Group

More information

Introduction to K2View Fabric

Introduction to K2View Fabric Introduction to K2View Fabric 1 Introduction to K2View Fabric Overview In every industry, the amount of data being created and consumed on a daily basis is growing exponentially. Enterprises are struggling

More information

Overview of Data Services and Streaming Data Solution with Azure

Overview of Data Services and Streaming Data Solution with Azure Overview of Data Services and Streaming Data Solution with Azure Tara Mason Senior Consultant tmason@impactmakers.com Platform as a Service Offerings SQL Server On Premises vs. Azure SQL Server SQL Server

More information

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics Optimizing and Modeling SAP Business Analytics for SAP HANA Iver van de Zand, Business Analytics Early data warehouse projects LIMITATIONS ISSUES RAISED Data driven by acquisition, not architecture Too

More information

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC SAP Agile Data Preparation Simplify the Way You Shape Data Introduction SAP Agile Data Preparation Overview Video SAP Agile Data Preparation is a self-service data preparation application providing data

More information

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda Agenda Oracle9i Warehouse Review Dulcian, Inc. Oracle9i Server OLAP Server Analytical SQL Mining ETL Infrastructure 9i Warehouse Builder Oracle 9i Server Overview E-Business Intelligence Platform 9i Server:

More information

Realizing the Full Potential of MDM 1

Realizing the Full Potential of MDM 1 Realizing the Full Potential of MDM SOLUTION MDM Augmented with Data Virtualization INDUSTRY Applicable to all Industries EBSITE www.denodo.com PRODUCT OVERVIE The Denodo Platform offers the broadest access

More information

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality? Oliver Engels & Tillmann Eitelberg Big Data! Big Quality? Like to visit Germany? PASS Camp 2017 Main Camp 5.12 7.12.2017 (4.12 Kick Off Evening) Lufthansa Training & Conference Center, Seeheim SQL Konferenz

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Integrating Complex Financial Workflows in Oracle Database Xavier Lopez Seamus Hayes Oracle PolarLake, LTD 2 Copyright 2011, Oracle

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024 Current support level End Mainstream End Extended SQL Server 2005 SQL Server 2008 and 2008 R2 SQL Server 2012 SQL Server 2005 SP4 is in extended support, which ends on April 12, 2016 SQL Server 2008 and

More information

Stages of Data Processing

Stages of Data Processing Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,

More information

Energy Management with AWS

Energy Management with AWS Energy Management with AWS Kyle Hart and Nandakumar Sreenivasan Amazon Web Services August [XX], 2017 Tampa Convention Center Tampa, Florida What is Cloud? The NIST Definition Broad Network Access On-Demand

More information

Streaming Integration and Intelligence For Automating Time Sensitive Events

Streaming Integration and Intelligence For Automating Time Sensitive Events Streaming Integration and Intelligence For Automating Time Sensitive Events Ted Fish Director Sales, Midwest ted@striim.com 312-330-4929 Striim Executive Summary Delivering Data for Time Sensitive Processes

More information

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You Özgür Yiğit Oracle Data Integration, Senior Manager, ECEMEA Safe Harbor Statement The following

More information

REGULATORY REPORTING FOR FINANCIAL SERVICES

REGULATORY REPORTING FOR FINANCIAL SERVICES REGULATORY REPORTING FOR FINANCIAL SERVICES Gordon Hughes, Global Sales Director, Intel Corporation Sinan Baskan, Solutions Director, Financial Services, MarkLogic Corporation Many regulators and regulations

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Informatica Enterprise Information Catalog

Informatica Enterprise Information Catalog Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with

More information

Dell Boomi Cloud MDM Overview

Dell Boomi Cloud MDM Overview Dell Boomi Cloud MDM Overview Dell Boomi s Multi-Purpose PaaS Boomi as the Multi-Purpose PaaS for enterprise data management Move: AtomSphere Integration Manage: Master Data Management (MDM) Govern: API

More information

Oracle Big Data Connectors

Oracle Big Data Connectors Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process

More information

Data Analytics at Logitech Snowflake + Tableau = #Winning

Data Analytics at Logitech Snowflake + Tableau = #Winning Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief

More information

Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST

Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0 WEBINAR MAY 15 th, 2018 1PM EST 10AM PST Welcome and Logistics If you have problems with the sound on your computer, switch

More information

DecisionCAMP 2016: Solving the last mile in model based development

DecisionCAMP 2016: Solving the last mile in model based development DecisionCAMP 2016: Solving the last mile in model based development Larry Goldberg July 2016 www.sapiensdecision.com The Problem We are seeing very significant improvement in development Cost/Time/Quality.

More information

CIAM: Need for Identity Governance & Assurance. Yash Prakash VP of Products

CIAM: Need for Identity Governance & Assurance. Yash Prakash VP of Products CIAM: Need for Identity Governance & Assurance Yash Prakash VP of Products Key Tenets of CIAM Solution Empower consumers, CSRs & administrators Scale to millions of entities, cloud based service Security

More information

Convergence and Collaboration: Transforming Business Process and Workflows

Convergence and Collaboration: Transforming Business Process and Workflows Convergence and Collaboration: Transforming Business Process and Workflows Steven Hagan, Vice President, Server Technologies 1 Copyright 2011, Oracle and/or its affiliates. All rights Convergence & Collaboration:

More information

Přehled novinek v SQL Server 2016

Přehled novinek v SQL Server 2016 Přehled novinek v SQL Server 2016 Martin Rys, BI Competency Leader martin.rys@adastragrp.com https://www.linkedin.com/in/martinrys 20.4.2016 1 BI Competency development 2 Trends, modern data warehousing

More information

August Oracle - GoldenGate Statement of Direction

August Oracle - GoldenGate Statement of Direction August 2015 Oracle - GoldenGate Statement of Direction Disclaimer This document in any form, software or printed matter, contains proprietary information that is the exclusive property of Oracle. Your

More information

Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems

Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems 1 Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems The Defacto Choice For Convergence 2 ABSTRACT & SPEAKER BIO Dealing with enormous data growth is a key challenge for

More information

TECHED USER CONFERENCE MAY 3-4, 2016

TECHED USER CONFERENCE MAY 3-4, 2016 TECHED USER CONFERENCE MAY 3-4, 2016 Bruce Beaman, Senior Director Adabas and Natural Product Marketing Software AG Software AG s Future Directions for Adabas and Natural WHAT CUSTOMERS ARE TELLING US

More information

Advanced Solutions of Microsoft SharePoint Server 2013

Advanced Solutions of Microsoft SharePoint Server 2013 Course Duration: 4 Days + 1 day Self Study Course Pre-requisites: Before attending this course, students must have: Completed Course 20331: Core Solutions of Microsoft SharePoint Server 2013, successful

More information

Progress DataDirect For Business Intelligence And Analytics Vendors

Progress DataDirect For Business Intelligence And Analytics Vendors Progress DataDirect For Business Intelligence And Analytics Vendors DATA SHEET FEATURES: Direction connection to a variety of SaaS and on-premises data sources via Progress DataDirect Hybrid Data Pipeline

More information

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been

More information

BIG DATA COURSE CONTENT

BIG DATA COURSE CONTENT BIG DATA COURSE CONTENT [I] Get Started with Big Data Microsoft Professional Orientation: Big Data Duration: 12 hrs Course Content: Introduction Course Introduction Data Fundamentals Introduction to Data

More information

THINK DIGITAL RETHINK LEGACY

THINK DIGITAL RETHINK LEGACY THINK DIGITAL RETHINK LEGACY Adabas & 2050+ Platform Strategy & Roadmap Bruce Beddoe VP Adabas Systems 1 % BUSINESS & MISSION-CRITICAL 2 For internal use only Billions invested in DIFFERENTIATING business

More information

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk Raanan Dagan and Rohit Pujari September 25, 2017 Washington, DC Forward-Looking Statements During the course of this presentation, we may

More information

Government IT Modernization and the Adoption of Hybrid Cloud

Government IT Modernization and the Adoption of Hybrid Cloud Government IT Modernization and the Adoption of Hybrid Cloud An IDC InfoBrief, Sponsored by VMware June 2018 Federal and National Governments Are at an Inflection Point Federal and national governments

More information

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization Composite Software, Inc. June 2011 TABLE OF CONTENTS INTRODUCTION... 3 DATA FEDERATION... 4 PROBLEM DATA CONSOLIDATION

More information

Getting personal with your customers and GDPR

Getting personal with your customers and GDPR Getting personal with your customers and GDPR A practical approach to a secure, governed 360 degree customer view Darren Brunt Presales Director UK&I, Talend Colm Moynihan Partner Presales Manager EMEA,

More information

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways

More information

Architecting Microsoft Azure Solutions (proposed exam 535)

Architecting Microsoft Azure Solutions (proposed exam 535) Architecting Microsoft Azure Solutions (proposed exam 535) IMPORTANT: Significant changes are in progress for exam 534 and its content. As a result, we are retiring this exam on December 31, 2017, and

More information

Introduction to Federation Server

Introduction to Federation Server Introduction to Federation Server Alex Lee IBM Information Integration Solutions Manager of Technical Presales Asia Pacific 2006 IBM Corporation WebSphere Federation Server Federation overview Tooling

More information

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp. Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020

More information

Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers

Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers Oracle zsig Conference IBM LinuxONE and z System Servers Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers Sam Amsavelu Oracle on z Architect IBM Washington

More information

CipherCloud CASB+ Connector for ServiceNow

CipherCloud CASB+ Connector for ServiceNow ServiceNow CASB+ Connector CipherCloud CASB+ Connector for ServiceNow The CipherCloud CASB+ Connector for ServiceNow enables the full suite of CipherCloud CASB+ capabilities, in addition to field-level

More information

IBM Cloud Security for the Cloud. Amr Ismail Security Solutions Sales Leader Middle East & Pakistan

IBM Cloud Security for the Cloud. Amr Ismail Security Solutions Sales Leader Middle East & Pakistan IBM Cloud Security for the Cloud Amr Ismail Security Solutions Sales Leader Middle East & Pakistan Today s Drivers for Cloud Adoption ELASTIC LOWER COST SOLVES SKILLS SHORTAGE RAPID INNOVATION GREATER

More information

Hybrid Data Platform

Hybrid Data Platform UniConnect-Powered Data Aggregation Across Enterprise Data Warehouses and Big Data Storage Platforms A Percipient Technology White Paper Author: Ai Meun Lim Chief Product Officer Updated Aug 2017 2017,

More information

IBM services and technology solutions for supporting GDPR program

IBM services and technology solutions for supporting GDPR program IBM services and technology solutions for supporting GDPR program 1 IBM technology solutions as key enablers - Privacy GDPR Program Work-stream IBM software 2.1 Privacy Risk Assessment and Risk Treatment

More information

Title DC Automation: It s a MARVEL!

Title DC Automation: It s a MARVEL! Title DC Automation: It s a MARVEL! Name Nikos D. Anagnostatos Position Network Consultant, Network Solutions Division Classification ISO 27001: Public Data Center Evolution 2 Space Hellas - All Rights

More information

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG The #1 Challenge in Successfully Deploying a Data Catalog The data cataloging space is relatively new. As a result, many organizations don

More information

Big Data with Hadoop Ecosystem

Big Data with Hadoop Ecosystem Diógenes Pires Big Data with Hadoop Ecosystem Hands-on (HBase, MySql and Hive + Power BI) Internet Live http://www.internetlivestats.com/ Introduction Business Intelligence Business Intelligence Process

More information

Modernizing Business Intelligence and Analytics

Modernizing Business Intelligence and Analytics Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from

More information

How Insurers are Realising the Promise of Big Data

How Insurers are Realising the Promise of Big Data How Insurers are Realising the Promise of Big Data Jason Hunter, CTO Asia-Pacific, MarkLogic A Big Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies

More information

Information Management Fundamentals by Dave Wells

Information Management Fundamentals by Dave Wells Information Management Fundamentals by Dave Wells All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks

More information

MetaMatrix Enterprise Data Services Platform

MetaMatrix Enterprise Data Services Platform MetaMatrix Enterprise Data Services Platform MetaMatrix Overview Agenda Background What it does Where it fits How it works Demo Q/A 2 Product Review: Problem Data Challenges Difficult to implement new

More information

What is Gluent? The Gluent Data Platform

What is Gluent? The Gluent Data Platform What is Gluent? The Gluent Data Platform The Gluent Data Platform provides a transparent data virtualization layer between traditional databases and modern data storage platforms, such as Hadoop, in the

More information

Qualys Cloud Platform

Qualys Cloud Platform 18 QUALYS SECURITY CONFERENCE 2018 Qualys Cloud Platform Looking Under the Hood: What Makes Our Cloud Platform so Scalable and Powerful Dilip Bachwani Vice President, Engineering, Qualys, Inc. Cloud Platform

More information

Shine a Light on Dark Data with Vertica Flex Tables

Shine a Light on Dark Data with Vertica Flex Tables White Paper Analytics and Big Data Shine a Light on Dark Data with Vertica Flex Tables Hidden within the dark recesses of your enterprise lurks dark data, information that exists but is forgotten, unused,

More information

<Insert Picture Here> Enterprise Data Management using Grid Technology

<Insert Picture Here> Enterprise Data Management using Grid Technology Enterprise Data using Grid Technology Kriangsak Tiawsirisup Sales Consulting Manager Oracle Corporation (Thailand) 3 Related Data Centre Trends. Service Oriented Architecture Flexibility

More information

Understanding the latent value in all content

Understanding the latent value in all content Understanding the latent value in all content John F. Kennedy (JFK) November 22, 1963 INGEST ENRICH EXPLORE Cognitive skills Data in any format, any Azure store Search Annotations Data Cloud Intelligence

More information

Sentinet for BizTalk Server SENTINET

Sentinet for BizTalk Server SENTINET Sentinet for BizTalk Server SENTINET Sentinet for BizTalk Server 1 Contents Introduction... 2 Sentinet Benefits... 3 SOA and API Repository... 4 Security... 4 Mediation and Virtualization... 5 Authentication

More information

TRUSTED IT: REDEFINE SOCIAL, MOBILE & CLOUD INFRASTRUCTURE. John McDonald

TRUSTED IT: REDEFINE SOCIAL, MOBILE & CLOUD INFRASTRUCTURE. John McDonald TRUSTED IT: REDEFINE SOCIAL, MOBILE & CLOUD INFRASTRUCTURE John McDonald 1 What is Trust? Can I trust that my assets will be available when I need them? Availability Critical Assets Security Can I trust

More information

EMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content

EMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content DATA SHEET EMC Documentum xdb High-performance native XML database optimized for storing and querying large volumes of XML content The Big Picture Ideal for content-oriented applications like dynamic publishing

More information

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours Advanced Solutions of Microsoft SharePoint Server 2013 Course 20332 36 Contact Hours Course Overview This course examines how to plan, configure, and manage a Microsoft SharePoint Server 2013 environment.

More information

Smart Data Center From Hitachi Vantara: Transform to an Agile, Learning Data Center

Smart Data Center From Hitachi Vantara: Transform to an Agile, Learning Data Center Smart Data Center From Hitachi Vantara: Transform to an Agile, Learning Data Center Leverage Analytics To Protect and Optimize Your Business Infrastructure SOLUTION PROFILE Managing a data center and the

More information

Accelerating Digital Transformation with InterSystems IRIS and vsan

Accelerating Digital Transformation with InterSystems IRIS and vsan HCI2501BU Accelerating Digital Transformation with InterSystems IRIS and vsan Murray Oldfield, InterSystems Andreas Dieckow, InterSystems Christian Rauber, VMware #vmworld #HCI2501BU Disclaimer This presentation

More information

Syncsort DMX-h. Simplifying Big Data Integration. Goals of the Modern Data Architecture SOLUTION SHEET

Syncsort DMX-h. Simplifying Big Data Integration. Goals of the Modern Data Architecture SOLUTION SHEET SOLUTION SHEET Syncsort DMX-h Simplifying Big Data Integration Goals of the Modern Data Architecture Data warehouses and mainframes are mainstays of traditional data architectures and still play a vital

More information

Rickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers

Rickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers Rickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers Watson Data Platform Reference Architecture Business

More information

Red Hat JBoss Middleware Integration Products Roadmap. Ken Johnson Director, Product Management, Red Hat

Red Hat JBoss Middleware Integration Products Roadmap. Ken Johnson Director, Product Management, Red Hat Red Hat JBoss Middleware Integration Products Roadmap Ken Johnson Director, Product Management, Red Hat The Plan... Integration Products Overview Product-by-product Intro Roadmap Cross-product initiatives

More information

IBM Data Replication for Big Data

IBM Data Replication for Big Data IBM Data Replication for Big Data Highlights Stream changes in realtime in Hadoop or Kafka data lakes or hubs Provide agility to data in data warehouses and data lakes Achieve minimum impact on source

More information

Advanced Solutions of Microsoft SharePoint 2013

Advanced Solutions of Microsoft SharePoint 2013 Course 20332A :Advanced Solutions of Microsoft SharePoint 2013 Page 1 of 9 Advanced Solutions of Microsoft SharePoint 2013 Course 20332A: 4 days; Instructor-Led About the Course This four-day course examines

More information

ANY Data for ANY Application Exploring IBM Data Virtualization Manager for z/os in the era of API Economy

ANY Data for ANY Application Exploring IBM Data Virtualization Manager for z/os in the era of API Economy ANY Data for ANY Application Exploring IBM for z/os in the era of API Economy Francesco Borrello francesco.borrello@it.ibm.com IBM z Analytics Traditional Data Integration Inadequate No longer Viable to

More information