APPLYING KNOWLEDGE BASED AI TO MODERN DATA MANAGEMENT. Mani Keeran, CFA Gi Kim, CFA Preeti Sharma

Similar documents
FIBO Operational Ontologies Briefing for the Object Management Group

The Emerging Data Lake IT Strategy

The Financial Industry Business Ontology

FIBO Shared Semantics. Ontology-based Financial Standards Thursday Nov 7 th 2013

Linked Data: Fast, low cost semantic interoperability for health care?

FIBO Metadata in Ontology Mapping

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER NOVEMBER 2016

Semantics in the Financial Industry: the Financial Industry Business Ontology

Practical Semantic Applications Master Title for Oil and Gas Asset Reporting. Information Integration David Price, TopQuadrant

A Formal Definition of RESTful Semantic Web Services. Antonio Garrote Hernández María N. Moreno García

Financial Dataspaces: Challenges, Approaches and Trends

Advances In Data Integration: The No ETL Approach. Marcos A. Campos, Principle Consultant, The Cognatic Group. capsenta.com. Sponsored by Capsenta

Update on data interoperability activities ECB 1 June Michal Piechocki Chairman Frankfurt Group Technical Workshop

Enabling Data Governance Leveraging Critical Data Elements

geospatial querying ApacheCon Big Data Europe 2015 Budapest, 28/9/2015

Chevron Position Paper for W3C Workshop on Semantic Web in Oil & Gas Industry

The DPM metamodel detail

FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA

An overview of RDB2RDF techniques and tools

Creating a Virtual Knowledge Base for Financial Risk and Reporting

Metatomix Semantic Platform

Semantic Web Fundamentals

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

Development of guidelines for publishing statistical data as linked open data

Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company

Semantic Integration with Apache Jena and Apache Stanbol

Using Linked Data Concepts to Blend and Analyze Geospatial and Statistical Data Creating a Semantic Data Platform

Thematix consultancy in semantic technology and business

Publishing Statistical Data and Geospatial Data as Linked Data Creating a Semantic Data Platform

Data on the Web. Human Information Interaction for Knowledge Extraction, Interaction, Utilization, Decision making HI-I-KEIUD

REGULATORY REPORTING FOR FINANCIAL SERVICES

Semantic Enrichment ARMA Chicago Spring Seminar April 18, 2018

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

Semantic Web Fundamentals

EPIM ReportingHub (ERH) A new powerful knowledge sharing platform. Houston, March 2012 Ph.D. Kari Anne Haaland Thorsen

Integration of INSPIRE & SDMX data infrastructures for the 2021 population and housing census

New Approach to Graph Databases

Fusion Registry 9 SDMX Data and Metadata Management System

COMPUTER AND INFORMATION SCIENCE JENA DB. Group Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara

Industry Adoption of Semantic Web Technology

Reducing Consumer Uncertainty

Orchestrating Music Queries via the Semantic Web

Comparative Study of RDB to RDF Mapping using D2RQ and R2RML Mapping Languages

YADSL. Models DSLFIN Miami Beach. (Yet Another DSL --> (Domain Specific Language)) John Davies Co-founder, CTO - C24

Linked Data and RDF. COMP60421 Sean Bechhofer

Semantic Queries and Mediation in a RESTful Architecture

Knowledge Integration Environment

The Model-Driven Semantic Web Emerging Standards & Technologies

CHAPTER 1 INTRODUCTION

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

The Information Platform of the Future. MarkLogic and Smartlogic

Semantic Web Test

OpenBudgets.eu: Fighting Corruption with Fiscal Transparency. Project Number: Start Date of Project: Duration: 30 months

Demo: Linked Open Statistical Data for the Scottish Government

Open And Linked Data Oracle proposition Subtitle

Data Governance for the Connected Enterprise

Data Points Structure Explanatory Documentation

The Role of Metadata in a Data Governance Strategy

The Semantic Web Revisited. Nigel Shadbolt Tim Berners-Lee Wendy Hall

Oracle Buys Automated Applications Controls Leader LogicalApps

Linked Open Data: a short introduction

RKB, sameas and dotac

Using RDF to Model the Structure and Process of Systems

Unlocking the full potential of location-based services: Linked Data driven Web APIs

On the use of Abstract Workflows to Capture Scientific Process Provenance

overcome the fragmentation of multiple data formats and communication protocols;

The MUSING Approach for Combining XBRL and Semantic Web Data. ~ Position Paper ~

Readme file for Oracle Spatial and Graph and OBIEE Sample Application (V305) VirtualBox

Towards Open Innovation with Open Data Service Platform

Enhancing Security Exchange Commission Data Sets Querying by Using Ontology Web Language

ISA Action 1.17: A Reusable INSPIRE Reference Platform (ARE3NA)

Design & Manage Persistent URIs

NIEM. National. Information. Exchange Model. NIEM and Information Exchanges. <Insert Picture Here> Deploy. Requirements. Model Data.

Building a Data Strategy for a Digital World

Linked Data: What Now? Maine Library Association 2017

DarkoKravos, PMP. Dodd Frank Title VII Recordkeeping. Record keeping changes impacting business and technology

Towards the Semantic Desktop. Dr. Øyvind Hanssen University Library of Tromsø

BLU AGE 2009 Edition Agile Model Transformation

Enterprise Information Integration using Semantic Web Technologies:

Contribution of OCLC, LC and IFLA

Terminologies, Knowledge Organization Systems, Ontologies

Semantic Annotation, Search and Analysis

Introduction to Web Services & SOA

Information Modeling Service-Oriented Architecture. Galen Mulrooney (contractor to VHA) June 3, 2009 V 1.1

Welcome to INFO216: Advanced Modelling

"Leveraging FIBO with Semantic Analysis to Perform On-Boarding, KYC and CDD" Bryan Bell & Elisa Kendall

Lecture Telecooperation. D. Fensel Leopold-Franzens- Universität Innsbruck

Electronic Health Records with Cleveland Clinic and Oracle Semantic Technologies

Introduction to Web Services & SOA

Structured Data To RDF II Deliverable D4.3.2

Management of Complex Product Ontologies Using a Web-Based Natural Language Processing Interface

Development of an Ontology-Based Portal for Digital Archive Services

Ontology Servers and Metadata Vocabulary Repositories

Resilient Linked Data. Dave Reynolds, Epimorphics

The JORD Project. The JORD Project (POSC Caesar & FIATECH, ISO15926) Joint Operational Reference Data

Kresge Business Library Instruction Overview: Selected Financial Databases

Transcription:

APPLYING KNOWLEDGE BASED AI TO MODERN DATA MANAGEMENT Mani Keeran, CFA Gi Kim, CFA Preeti Sharma

2 What we are going to discuss During last two decades, majority of information assets have been digitized in the financial sector. However, relational databases cannot answer critical business inquiries easily because its model lacks of semantics of business processes while focusing on efficient physical mechanism of data. Now new ways of storing data is being matured. RDF/OWL, one of most prominent Semantic Web technologies can be one of options for enabling datastore incorporating human intelligence. We will present how experimental Enterprise Knowledge Graph was created out of legacy RDB tables. It will include inferences features in querying. We consider this machine readable data format as potential backend of AI mechanism in Financial Information Management.

3 CHALLENGES IN FINANCIAL SECTORS Is my database intelligent enough to answer my questionnaires?

4 Example Questionnaire #1: Am I allowed to make this contract with them? Finding a counterparty for a certain trade is restricted by various regulatory requirements and internal rules. This process is defined as a multi-step decision making tree, which is conducted by manual workflow or predefined codes. US Person Trying to make a Swap contract Has not approved by the Committee yet Non-US Person Fund A? Counterparty X Dodd Frank Compliant Under EU Regulation Is this a valid trade?

5 Example Questionnaire #2: What is our exposure to Lehman Brothers? When Lehman Brother failed, many firms were not able to aggregate exposure because their systems were not ready for this. An human understands exposure should include various relationships between two parties, but a machine needs explicit code that implements the logic to integrate various sources. Swap Counterparty Security Issuer Prime Broker Subsidiary 1 Subsidiary 2 Lehman Brothers Company A General Partner Money Market Subsidiary 3 Subsidiary 4

6 Relational Database is not Intelligent because it lacks of Semantics in its physical data models Conceptual Describes Semantics of a business domain Captures Business Requirements in terms of Entities and Relationships Logical System Model captures details of the entities in terms of attributes Communicates Design Details Physical Technology Model/Physical Design Implementation of Specific Use Case Business metadata is separated from physical implementation Semantics Implied in Data Model Semantics Embedded in Application Logic

7 KNOWLEDGE BASED AI Enlightening Financial Information System Using Ontology, FIBO and Triple Store

8 Moving to AI-enabled Data What is AI? AI is the study of how to make computers do things at which, at the moment, people are better Elaine Rich, 1986 What people are better in using data than databases? People understand context of data. People can associate relevant information together even when they are not obtained at the same time, same place. People can leverage both of explicit and implicit relationships of information

9 Knowledge Based AI Reasoning Knowledge Base Machine Learning Deep Learning Machine Readable Business Knowledge Knowledge from Learning

10 From Database to Knowledge Base Database Knowledge Base Structured collection of records or data. (Wikipedia) Organized in such a way that a program can quickly select desired pieces. (webopedia) Machine-readable resource for the dissemination of information. A Dynamic Resource that may itself have the capacity to learn, as part of an AI expert system. (Techtarget) Stores knowledge in a computer-readable form, usually for the purpose of having automated reasoning applied to them. (Wikipedia)

11 Ontology Data model that captures knowledge of a domain as a set of concepts & relationships between these concepts. Allows relationships to become visible and not dependent on code that embeds business rules. Semantic Industry Standards - Resource Descriptor Framework (RDF) and Web Ontology Language (OWL) govern the construction of Ontologies

12 Ontology Model as Knowledge Base Financial Institution type Legal Entity Country type type is domiciled in Brokerage Firm issues Issuer buy/sell securities Common Stock has an issuer Class Relationship

13 Triple Store as Knowledge Base Storage Triples is the simplest instance format of RDF/OWL expression. A Triplestore is a database for triples. Data is indexed for the graph data structure OWL expression Issuer issues Common Stock Apple Inc. Triple Instance Subject Predicate Object Issues Apple Common Stock (APPL) Fund A has position issues US is domiciled in Apple Inc. APPL

14 OWL Language of Ontology Web Ontology Language from W3C Computational logic-based, used to express knowledge which can be understood and processed by machines Ontologies described using OWL can be processed by a reasoner which infers knowledge based on the asserted facts in the Ontology Has various constructs to create Ontologies and support reasoning e.g. Class EquivalentClass TransitiveProperty FunctionalProperty Rdfs:label etc.

15 Financial Industry Business Ontology (FIBO) What is FIBO? an industry initiative to define financial industry terms, definitions and synonyms using semantic web principles such as RDF/OWL and OMG modeling standards such as UML. a joint effort by the Object Management Group (OMG) and the Enterprise Data Management (EDM) Council. includes Ontologies for Financial Instruments, Business Entities, Market Data etc. driven by regulatory and industry requirement for data quality and transparency. Benefits provides common reference model to integrate disparate technical systems and message formats within an organization and across industry. aims to bring transparency in the financial system. aims to improve regulatory reporting by providing clear meaning of data. Each industry is coming up with its own standard ontologies: CDISC : Clinical Research HL7 : Health Care ACORD : Insurance

16 FIBO as Knowledge Base Language for Finance Any type of Legacy data can be a part - Both of Structured/unstructured data can be interpreted into Ontology Language using FIBO vocabularies. FIBO brings the parts together - FIBO pre-defines implicit/explicit business rules between objects in standardized manner. - Undiscovered relationship can be added as combination of standard vocabularies using query grammar. Inference: Make the sum of the parts whole - Business rules are included as a part of the data model. - A query will leverage business rules defined in FIBO

17 EKG; ENTERPRISE KNOWLEDGE GRAPH Proof of Concept

18 Introducing EKG Enterprise Knowledge Graph Enterprise Data Platform Operational Zone Analytical Zone EKG Zone Normalized Model Support Operational Reporting Data Distribution Dimensional Model Support Analytical Reporting Connected to BI Layer Graph Data Model Support Semantic Query Extended data service with Inference feature Integration Zone Data Producers, Data Lake

19 Logical View of PoC Legacy Tables EKG (Triplestore) Query Issuers of Securities Bloomberg File Counterparty Legal Entities Agents Brokers Investment Advisors Use legacy tables as sources. Translate the legacy data into triples and store them. Do queries with Business questions.

20 PoC Technology Legacy Database Input: 10 tables w/ +250k rows Define mappings using Excel Import FIBO RDF/OWL files Triple Store* Results: +1.4 million triples File feed * AllegroGraph 6.3 or Oracle 12c Spatial and Graph Linux Server Data Files Mapping Files API over HTTP Resource Mapper Key Components Resource Mapper - Developed as an in-house Java code leveraging external libraries for triple store communication. Mapping Files - CSV format files that contain mapping rules between tables and triples. Triple Store - Implemented on AllegroGraph Free Edition (Oracle 12c was also tested) FIBO Extension - Relevant vocabularies picked from published FIBO version - Additional vocabularies for extension

21 Rule-based Triple translation SUBJECT FLAG PREDICATE OBJECT CONTEXT BASE_URI_S BASE_URI_O risk_cntry URI http://www.w3.org/1999/02/22 rdf syntax nhttp://spec.edmcouncil.org/fibo/red/fnd/plachttp://franklintemhttp://franklintempleton.com/fibopoc/country# risk_cntry URI http://www.w3.org/1999/02/22 rdf syntax nhttp://www.w3.org/2002/07/owl#individual http://franklintemhttp://franklintempleton.com/fibopoc/country# bbcompid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://www.semanticweb.org/gkim/ontologiehttp://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbcompid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://www.w3.org/2002/07/owl#individual http://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbcompid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://spec.edmcouncil.org/fibo/red/be/corp http://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbcompid Literal http://www.w3.org/2000/01/rdf schema#labcomp_name http://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbcompid Resource http://spec.edmcouncil.org/fibo/red/be/ow bbultimate_parentid http://franklintemhttp://franklintempletohttp://franklintempleton.com/fibopoc/bloo bbultimate_parentid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://www.semanticweb.org/gkim/ontologiehttp://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbultimate_parentid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://www.w3.org/2002/07/owl#individual http://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbultimate_parentid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://spec.edmcouncil.org/fibo/red/be/own http://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbparentid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://www.semanticweb.org/gkim/ontologiehttp://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# bbparentid URI http://www.w3.org/1999/02/22 rdf syntax nhttp://www.w3.org/2002/07/owl#individual http://franklintemhttp://franklintempleton.com/fibopoc/bloombergidentifier# Column Usage Sample value SUBJECT A column which has the name of Subject <name of a column> resource FLAG Defines a type of a Object in a statement. URI (Bare URI), Literal (Value), Resource PREDICATE A URI of a Predicate http://www.omg.org/spec/edmc FIBO/BE/LegalEntities/LegalPersons/isOrganizedIn OBJECT A URI, a column which have a value of Literal or a name of Object resource http://www.omg.org/spec/edmc FIBO/BE/LegalEntities/LegalPersons/LegalEntity BASE_URI_Subject Base URI for Subject a resource http://franklintempleton.com/fibopoc/securityissuer# BASE_URI_Object Base URI for Object a resource http://franklintempleton.com/fibopoc/securityissuer# CONTEXT URI of graph http://franklintempleton.com/fibopoc/legalentity

22 Technical Environment 1. Triple Store : Allegrograph an RDF graph database 2. Visual Discovery Tool : Gruff a triple store browser 3. Query Language: SPARQL RDF query language SELECT?contract WHERE {?contract <hascounterparty>?counterparty.?counterparty.<islegalentity>?lege } Gruff SPARQL Triple Store AllegroGraph

23 USE CASE DEMO

24 Use Case #1: Am I allowed to make this contract with them? US Person Trying to make a Swap contract Has not approved by the Committee yet Non-US Person Fund A? Counterparty X Dodd Frank Compliant Under EU Regulation Is this a valid trade?

25 Use Case #1: Sample Process