Knowledge Base for Business Intelligence

Similar documents
Web of Data: P2P Business domain translation of problem spaces. Semantic Business Integration (WIP draft)

COSC 3351 Software Design. An Introduction to UML (I)

Web of Data: Business domain translation of problem spaces. Semantic Business Integration (WIP draft)

Graph Database. Relation

How Co-Occurrence can Complement Semantics?

User Configurable Semantic Natural Language Processing

6234A - Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services

Hyperdata: Update APIs for RDF Data Sources (Vision Paper)

Web of Data: Business domain translation of problem spaces. Semantic Business Integration (WIP draft)

Browsing the World in the Sensors Continuum. Franco Zambonelli. Motivations. all our everyday objects all our everyday environments

BUILDING THE SEMANTIC WEB

Querying Data with Transact SQL

Semantic Web Test

Implementation Techniques

An overview of Graph Categories and Graph Primitives

GeoTemporal Reasoning for the Social Semantic Web

Upgrading your End User Skills to SharePoint 2013 Course 55026A; 3 Days, Instructor-led

UML Primer. -Elango Sundaram

DIONYSUS: Towards Query-aware Distributed Processing of RDF Graph Streams

Course Modules for MCSA: SQL Server 2016 Database Development Training & Certification Course:

Jans Aasman, Ph.D. CEO Franz Inc GeoSpatial with AllegroGraph

Metamodeling. Janos Sztipanovits ISIS, Vanderbilt University

Event Stores (I) [Source: DB-Engines.com, accessed on August 28, 2016]

Digital Public Space: Publishing Datasets

Dexter respects users privacy by storing users local data and evaluating queries client sided inside their browsers.

Graph Databases. Graph Databases. May 2015 Alberto Abelló & Oscar Romero

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

Instance linking. Leif Granholm Trimble Senior Vice President BIM Ambassador

Contents Contents 1 Introduction Entity Types... 37

ER modeling. Lecture 4

MICROSOFT BUSINESS INTELLIGENCE

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Semantics and Ontologies for Geospatial Information. Dr Kristin Stock

Reasoning on Business Processes and Ontologies in a Logic Programming Environment

OVERVIEW OF RELATIONAL DATABASES: KEYS

"Charting the Course to Your Success!" MOC D Querying Microsoft SQL Server Course Summary

Outline Introduction Triple Storages Experimental Evaluation Conclusion. RDF Engines. Stefan Schuh. December 5, 2008

Title. Prolog, Rules, Reasoning and SPARQLing Magic in the real world. Franz Inc

An Archiving System for Managing Evolution in the Data Web

Microsoft End to End Business Intelligence Boot Camp

Temporal Qualification in Topic Maps

0.1 Knowledge Organization Systems for Semantic Web

Harvesting RDF triples

AVANTUS TRAINING PTE LTD

Extracting knowledge from Ontology using Jena for Semantic Web

Structure of This Presentation

MS-55045: Microsoft End to End Business Intelligence Boot Camp

Graph Data Management Systems in New Applications Domains. Mikko Halin

10778A: Implementing Data Models and Reports with Microsoft SQL Server 2012

CSE 21 Mathematics for Algorithm and System Analysis

XML Systems & Benchmarks

MANAGING DATA(BASES) USING SQL (NON-PROCEDURAL SQL, X401.9)

Mobile Wireless Sensor Network enables convergence of ubiquitous sensor services

Database Management

An Entity Name Systems (ENS) for the [Semantic] Web

Advanced Multidimensional Reporting

"Charting the Course... MOC A: SharePoint 2016 Site Collections and Site Owner Administration. Course Summary

Querying Microsoft SQL Server

Linked Data and RDF. COMP60421 Sean Bechhofer

Querying Data with Transact-SQL

Understanding Billions of Triples with Usage Summaries

Creational. Structural

COURSE OUTLINE: Querying Microsoft SQL Server

ADVANCED DATABASES ; Spring 2015 Prof. Sang-goo Lee (11:00pm: Mon & Wed: Room ) Advanced DB Copyright by S.-g.

Copyright 2012 Taxonomy Strategies. All rights reserved. Semantic Metadata. A Tale of Two Types of Vocabularies

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018

Quadstream. Algorithms for Multi-scale Polygon Generalization. Curran Kelleher December 5, 2012

A Deductive System for Annotated RDFS

Benchmarking Database Representations of RDF/S Stores

Introduction to UML What is UML? Motivations for UML Types of UML diagrams UML syntax Descriptions of the various diagram types Rational Rose (IBM.. M

CSE 544 Data Models. Lecture #3. CSE544 - Spring,

Concept Map Based Semantic Representation and Knowledge Visualization

Unified Modeling Language (UML)

Site Design. SWE 432, Fall 2017 Design and Implementation of Software for the Web

FOUNDATIONS OF DATABASES AND QUERY LANGUAGES

Querying Data with Transact SQL Microsoft Official Curriculum (MOC 20761)

Novel System Architectures for Semantic Based Sensor Networks Integraion

Toward Semantic Data Stream Technologies and Applications

SEMANTIC ANALYSIS TYPES AND DECLARATIONS

The first thing you see when you open NVivo is a list of recent projects you have worked with in program. Today we will create a new project.

HyperGraphDB. Motivation, Architecture and Applications

Hierarchies. QlikView Technical Brief. 26 Nov 2013, HIC

RDFPath. Path Query Processing on Large RDF Graphs with MapReduce. 29 May 2011

Towards Efficient Semantically Enriched Complex Event Processing and Pattern Matching

DISCUSSION 5min 2/24/2009. DTD to relational schema. Inlining. Basic inlining

Curriculum Guide. ThingWorx

The RMap Project: Linking the Products of Research and Scholarly Communication Tim DiLauro

Modelling with Classes. CITS1220 Software Engineering

Trellis Magento 2 Salsify Connector

CSIT5300: Advanced Database Systems

Enterprise Data Catalog for Microsoft Azure Tutorial

Semantic Web in a Constrained Environment

Semantic Annotation, Search and Analysis

Chapter B: Hierarchical Model

Developing SQL Data Models(768)

FedX: A Federation Layer for Distributed Query Processing on Linked Open Data

Overview. Data-mining. Commercial & Scientific Applications. Ongoing Research Activities. From Research to Technology Transfer

Annotation Component in KiWi

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.

7. Query Processing and Optimization

Transcription:

Knowledge Base for Business Intelligence System for population and linking of knowledge bases dealing with data, information and knowledge comming from heterogeneous data sources to provide pluggable Business Intelligence insight into applications. Semantic Web (RDF Triples) based information manipulation and P2P node deployment architecture. Nodes clients/services provides endpointds for their sources (data sources or also another Nodes). Augmented indexing, search and browsing facilities, analysis and mining, pluggable into existing deployments leverages the existing potential of applications to be Business Intelligence aware.

Triples (time based) Albeit its lack of popularity in some environments, RDF triples are an exceptional serialization format. And when it comes to time related events or information, data can be serialized using the quad-extension mechanism which allows to attach a context resource to a triple, let say, time related knowledge. One of our system s attempts is to have things sorted by time and then arrange them in sets partitioned in base of that ordering. Time based triples may come from everywhere. It may seem difficult at first to think ordering all the events in a relational database. But think of Web related events: social media from which extract statements via NLP, page navigation data, mobile apps users behavior, IoT(s). All these are strongly time bound sources of data.

Sets arrangement For the internal representation of triple s knowledge we have choosen a set model oriented approach. This will hopefully allows us later to model easily Predicates upon those sets and a a graph representation mechanism based in the so called Nodes which are a definition of a Predicate for the Node itself and two other Predicates for its possible matching input/output nodes. The set space of abstract Resource(s) is partitioned into Subjects (S), Predicates (P) and Objects (O). Resources can be anyone of these so they are represented in the form of an identifier ID = [CTX] [ID] [ID] [ID] being each part its corresponding SPO value and CTX is time data. Nesting (reification) of triples is allowable.

Sets arrangement (cont.) Subjects intersects whit Objects to form Predicate Kinds. Objects intersects with Predicates to form Subject Kinds. Predicates intersects with Subjects to form Object Kinds. Then, all SPO intersects refer to the Triples set itselff. When we say Kind for each of these SPOs we mean exactly that. A set of Predicates and its Object values characterizes a set of Subjects, given, for example, for its sex, age, etc. Idem for Predicates and Objects. Due to Kinds being the definition of sort of types we need to partition this sets temporally and use this knowledge to partition SPO sets. With temporally we mean that, for example, some Predicates usually occurs before/after anothers.

Sets arrangement (cont) Sets arrangement consists of the partition of sets in the relative/abstract concept of classes, instances and occurrences. The more external a set is, the more before temporally is of all its contents. So, for example, one is first a son of, then a father of and then a grandfather of (Kinds). A Kind doesn t have occurrences. So classes enclose other classes and instances. Instances encloses other instances and occurrences. Occurrences may enclose what may be interlinked with others SPO partitions. For each SPO set its occurrences are of its set type. S class/inst: Predicate / Object P class/inst: Object / Subject O class/inst: Subject / Predicate

Subjects Object Kinds Predicates Triples Predicate Kinds Subject Kinds Resources Objects

Class1 Instance 1 Occurrence 1 Class 2 Class 3

Inference Temporal logic may be implemented using quad context in triples by retrieving / adding a source timestamp to triples, an ordering / offset context to Kinds and by logical operators over the set of SPOs by using three bits to tag time to occurrences in its context: each one regarding something happening in the past, present, future or their combinations. A possible implementation of the set partitions could be a two-keyed hash map in which one can traverse classes, instances and occurrences, query for possible keys / paths and retrieve values based on the closure of the sets hierarchy. Multiple occurrences of the same SPO(s) are different set elements because of their instances / classes.

Inference (cont) Predicates: predicates are triples of the form pred: set(pred, pred, pred), where pred may be any Resource type (SPO, Kind, Triple) or even another reified pred. Sets API based resolution. Reified response: ((person, leads, organization), bornin, (place, country, AR )) Type subsumption: organization: company, club, etc. S/O(s) as predicates: use Kind predicates. Nodes: detect Predicate patterns (Node types). Aggregate Node predicates and their parent / child (graph in/out) Nodes (predicates). Example: Movie database: categories, films, roles and actors, ratings.

Application Object Model Application Object Model for ease of translation of the underlying core components interfaces into a graph / facade and to provide a foundation in which to build upon the proposed P2P client / service abstractions (protocols) Model - types : Type[] - entities : Entity[] Type - propertytypes : Type[] (Map<String, Type[]>) - entities : Entity[] Entity - type : Type - properties : Property[] - payload : Object Property (extends Entity) - value : Entity