Using Linked Data Concepts to Blend and Analyze Geospatial and Statistical Data Creating a Semantic Data Platform

Similar documents
Publishing Statistical Data and Geospatial Data as Linked Data Creating a Semantic Data Platform

GeoSPARQL Support and Other Cool Features in Oracle 12c Spatial and Graph Linked Data Seminar Culture, Base Registries & Visualisations

Open And Linked Data Oracle proposition Subtitle

Developing GeoSPARQL Applications with Oracle Spatial and Graph

IT Infrastructure for BIM and GIS 3D Data, Semantics, and Workflows

Linked Data and Geosparql

Semantic Web Fundamentals

geospatial querying ApacheCon Big Data Europe 2015 Budapest, 28/9/2015

Semantic Web Fundamentals

Publishing the 15 th Italian Population and Housing Census in Linked Open Data

Verarbeitung von Vektor- und Rasterdaten auf der Hadoop Plattform DOAG Spatial and Geodata Day 2016

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

SEXTANT 1. Purpose of the Application

Land Information Management and its (3D) Database Foundation

ISA Action 1.17: A Reusable INSPIRE Reference Platform (ARE3NA)

Building Virtual Earth Observatories Using Scientific Database, Semantic Web and Linked Geospatial Data Technologies

Readme file for Oracle Spatial and Graph and OBIEE Sample Application (V305) VirtualBox

INSPIRE & Linked Data: Bridging the Gap Part II: Tools for linked INSPIRE data

COMPUTER AND INFORMATION SCIENCE JENA DB. Group Abhishek Kumar Harshvardhan Singh Abhisek Mohanty Suhas Tumkur Chandrashekhara

Semantic Integration with Apache Jena and Apache Stanbol

Serving Ireland s Geospatial as Linked Data on the Web

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

Reducing Consumer Uncertainty

Representing and Querying Linked Geospatial Data

Transformative characteristics and research agenda for the SDI-SKI step change: A Cadastral Case Study

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

CREATING VIRTUAL SEMANTIC GRAPHS ON TOP OF BIG DATA FROM SPACE. Konstantina Bereta and Manolis Koubarakis

Orchestrating Music Queries via the Semantic Web

Transformative characteristics and research agenda for the SDI-SKI step change:

THE GETTY VOCABULARIES TECHNICAL UPDATE

Semantic Web Information Management

Map Visualization in Analytic Applications LJ Qian, Director of Software Development David Lapp, Product Manager Oracle

R2RML by Assertion: A Semi-Automatic Tool for Generating Customised R2RML Mappings

Bring Location Intelligence To Big Data Applications on Hadoop, Spark, and NoSQL

Land Administration and Management: Big Data, Fast Data, Semantics, Graph Databases, Security, Collaboration, Open Source, Shareable Information

Study and guidelines on Geospatial Linked Data as part of ISA Action 1.17 Resource Description Framework

An overview of RDB2RDF techniques and tools

Ivan Herman. F2F Meeting of the W3C Business Group on Oil, Gas, and Chemicals Houston, February 13, 2012

APPLYING KNOWLEDGE BASED AI TO MODERN DATA MANAGEMENT. Mani Keeran, CFA Gi Kim, CFA Preeti Sharma

Semantics. Matthew J. Graham CACR. Methods of Computational Science Caltech, 2011 May 10. matthew graham

Methodological Guidelines for Publishing Linked Data

Strabon. Semantic support for EO Data Access in TELEIOS. Presenter: George Garbis

The Data Web and Linked Data.

Oracle Spatial Technologies: An Update. Xavier Lopez Director, Spatial Technologies Oracle Corporation

Linked Data Practices for the Geospatial Community

Semantic Interoperability of Basic Data in the Italian Public Sector Giorgia Lodi

Optimising a Semantic IoT Data Hub

Big, Linked and Open Earth Observation Data: the Projects TELEIOS and LEO

Introduction to the Dimensionally Extended 9 Intersection Model (DE-9IM) in PostgreSQL/PostGIS Tutorial

Linked Data Evolving the Web into a Global Data Space

Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr.

Deploying Spatial Applications in Oracle Public Cloud

Web Ontology Editor: architecture and applications

Querying the Semantic Web

Publishing Official Classifications in Linked Open Data

FAGI-gis: A tool for fusing geospatial RDF data

GEOSPATIAL ENGINEERING COMPETENCIES. Geographic Information Science

DBpedia-An Advancement Towards Content Extraction From Wikipedia

Linked Data and RDF. COMP60421 Sean Bechhofer

a paradigm for the Introduction to Semantic Web Semantic Web Angelica Lo Duca IIT-CNR Linked Open Data:

Linked data and its role in the semantic web. Dave Reynolds, Epimorphics

Oracle Spatial and Graph: Benchmarking a Trillion Edges RDF Graph ORACLE WHITE PAPER NOVEMBER 2016

Semantic challenges in sharing dataset metadata and creating federated dataset catalogs

BSC Smart Cities Initiative

The MEG Metadata Schemas Registry Schemas and Ontologies: building a Semantic Infrastructure for GRIDs and digital libraries Edinburgh, 16 May 2003

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009

Interacting with Linked Data Part I: General Introduction

Semantic Web: Core Concepts and Mechanisms. MMI ORR Ontology Registry and Repository

Disclaimer MULTIMODEL DATABASE WITH ORACLE DATABASE 18C

Enterprise Information Integration using Semantic Web Technologies:

Linked open data at Insee. Franck Cotton Guillaume Mordant

PUBLICATION OF INSPIRE-BASED AGRICULTURAL LINKED DATA

Using RDF to Model the Structure and Process of Systems

Advances in Data Integration & Representation in Systems Biology

Introduction to Geodatabase and Spatial Management in ArcGIS. Craig Gillgrass Esri

What to expect from MySQL 8.0?

Oracle APEX 18.1 New Features

Semantic Queries and Mediation in a RESTful Architecture

Day 2. RISIS Linked Data Course

Spatial Data on the Web

May 21, 2014 Walter E. Washington Convention Center Washington, DC USA. Copyright 2014, Oracle and/or its affiliates. All rights reserved.

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

Oracle Spatial Summit

Semantic Annotation, Search and Analysis

Information Workbench

An Evaluation of Geo-Ontology Representation Languages for Supporting Web Retrieval of Geographical Information

May 22, 2013 Ronald Reagan Building and International Trade Center Washington, DC USA

COMP9321 Web Application Engineering

Introducing Linked Data

COMP9321 Web Application Engineering

Property graphs vs Semantic Graph Databases. July 2014

Publishing Linked Statistical Data: Aragón, a case study.

The Butterfly Effect. A proposal for distribution and management for butterfly data programs. Dave Waetjen SESYNC Butterfly Workshop May 10, 2012

case study The Asset Description Metadata Schema (ADMS) A common vocabulary to publish semantic interoperability assets on the Web July 2011

The Emerging Web of Linked Data

A Conceptual Design Towards Semantic Geospatial Data Access

Welcome to INFO216: Advanced Modelling

Open Geospatial Consortium Inc.

Transcription:

Using Linked Data Concepts to Blend and Analyze Geospatial and Statistical Data Creating a Semantic Data Platform Hans Viehmann Product Manager EMEA ORACLE Corporation October 17, 2018 @SpatialHannes

Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle s products remains at the sole discretion of Oracle. 3

What is Linked Data Concept of publishing and interlinking structured data on the web Moving from documents to useable data Based on W3C standards Resource Description Framework (RDF), OWL, SPARQL... Originally developed by Tim Berners-Lee Design principles Use Uniform Resource Identifiers (URIs) to uniquely identify things (data entities) Use HTTP URLs, corresponding to these URIs, so that information can be retrieved Provide metadata using open standards such as RDF Include links to related URIs, so that people can discover more things 4

Why Linked Data for Statistics Agencies? Data dissemination is key to the work of NSIs Production of accurate data is not everything Need to broaden reach Simplifying access Various models to represent data in the past eg. SDMX (Statistical Data and Metadata Exchange) Linked Data allow to Associate data with metadata (or meaning) major step towards AI Use a common vocabulary Refer to data owner/authoritative datasource or other 3rd party datasets 5

Two example projects Based on Oracle Spatial and Graph as Linked Data Platform CensLOD project, ISTAT, Italy Publishing 2015 census data as linked (open) data Infrastructure development to publish Linked Open Data Project details kindly provided by Monica Scannapieco E-stat Portal, National Statistics Center of Japan (NSTAC) Expanding e-stat portal (www.e-stat.go.jp) to publish statistics as linked open data Using RDF Data Cube vocabulary Geometries based on Statistical standard area, small area, and grid square code Material kindly provided by Shoki Nishimura (NSTAC)

General project flow Domain analysis and ontology definition Using Protégé or Topbraid Composer as ontology editor Creation of subject-predicateobject triples Mapping of source data Inferencing Publishing Setup of SPARQL endpoint Development of UI Image courtesy of: Istat, Italy

Ontology definition ISTAT CensLOD Territorial data ontology, census data ontology Modeled in OWL using Protégé Based on existing meta-ontologies SKOS and XKOS: skos:concept, ADMS: adms:assetrepository, Data Cube Vocabulary: qb:dataset, qb:observation, PROV: prov:wasgeneratedby, GeoNames: gn:name, gn:countrycode, gn:parentcountry, Territorial data resulting in 95 entities and 200 rules, eg. using EquivalentTo to link entities to respective Geonames entity

Generating triples Describing mapping rules using R2RML standard eg. associating column names with entities Choosing rulebase and possible optimizations Using inferencing engine to materialize additional triples for performance Creating RDF Views on relational data (optional) No duplication of data and storage Image courtesy of: NSTAC, Japan

Publishing Creating a SPARQL endpoint For machine-to-machine access For advanced users Linked Data interface Faceted search/graph browser For basic users GUI to download datasets For basic users For advanced users Image courtesy of: Istat, Italy

Linked Geodata and Semantic GIS GIS applications with semantically complex thematic aspects Logical reasoning to classify features, eg. land cover type, suitable farm land, etc., combined with spatial queries Linking to available datasources (geonames.org, dbpedia,...) Requirements Consistent modeling of geospatial data, both simple and complex Gemetric functions and topological queries based on Spatial indexing Conceptual solution provided by OGC standards Simple Features as WKT literals, eg. "Point(-83.4 34.3)"^^ogc:wktLiteral Queries in GeoSPARQL

Integrating Geometry Image courtesy of: NSTAC, Japan

Integrating Geometry Storing boundary data with corresponding geometry Either locally or remote Retrieving statistical data associated with target area GeoSPAQRL FILTER statement including geometry Can be federated query across SPARQL endpoints Image courtesy of: NSTAC, Japan

Example Federated Queries blogs.oracle.com/oraclespatial Fundamental concept of Semantic Technologies Implemented through SERVICE clause in SPARQL Federated SPARQL query inside Oracle Database is possible In this case SPARQL in SQL Also works with other SPARQL clients Take predicate pushdown into account

Single platform for geospatial and linked data Transformation and Modeling Tools Relational2RDF Plug-in for Protégé Topbraid Composer integration Support for Apache Jena Natural Language Processing Extraction (partners) Load, Query and Inferencing RDF/OWL Data Management SQL & SPARQL Query OWL Inferencing Semantic Rules Semantic Indexing Scalability & Security SQL Developer integration Solution Development and Analytic Tools Java, HTTP access JSON, XML output Graph visualization (Cytoscape) Oracle Advanced Analytics (R, Mining) Oracle Business Intelligence (OBIEE) Map (GIS) Visualization

Geospatial Linked Data Platform for NSIs Graph Analysis and Reporting GeoSPARQL, SQL, Java, Linked Data Graph Data Management Query Engine Ontologies Inferencing Engine RDF Graph Layer Enterprise Data Stores Statistics Data Geometry Data Public LOD endpoints (DBPedia, GeoNames, )

Linked Data Platform at NSTAC Moved from database consolidation platform to Oracle Cloud Increasing data volume from 500M RDF triples to > 2bn triples by end- 2018 Planned work: Expand number of datasets Generate RDF triple from legacy data semi-automatically

Resources Oracle Spatial and Graph OTN product page here White papers, software downloads, documentation and videos Performance White Paper on 1 Trillion Triple Benchmark Oracle Big Data Lite Virtual Machine - a free sandbox to get started: www.oracle.com/technetwork/database/bigdata-appliance/oracle-bigdatalite-2104726.html Hands On Lab with RDF Graph data included Content also available on GITHub under http://github.com/oracle/bigdatalite/ Blog examples, tips & tricks: blogs.oracle.com/oraclespatial @OracleBigData, @SpatialHannes, @JeanIhm Oracle Spatial and Graph Group

Appendix OGC GeoSPARQL Support in Oracle Spatial and Graph 12c

OGC GeoSPARQL GeoSPARQL A Geographic Query Language for RDF Data OGC Standard (document 11-052r4) Published in June 2012 Submitting Organizations Traverse Technologies, Inc.

Why GeoSPARQL? Linked Geo Data Many Linked Open Data (LOD) datasets have geospatial components Barriers to integration Vendor-specific geometry support Different vocabularies W3C Basic Geo, GML XMLLiteral, Vendor-specific Different spatial reference systems WGS84 Lat-Long, British National Grid

Why GeoSPARQL? Semantic GIS GIS applications with semantically complex thematic aspects Logical reasoning to classify features Land cover type, suitable farm land, etc. Complex Geometries Polygons and Multi-Polygons with 1000 s of points Complex Spatial Operations Union, Intersection, Buffers, etc. Find parcels with an area of at least 3 sq. miles that touch a local feeder road and are inside an area of suitable farm land.

From SPARQL to GeoSPARQL RDF Data :res1 rdf:type :House. :res1 :baths "2.5"^^xsd:decimal. :res1 :bedrooms "3"^^xsd:decimal. :res2 rdf:type :Condo. :res2 :baths "2"^^xsd:decimal. :res2 :bedrooms "2"^^xsd:decimal. :res3 rdf:type :House :res3 :baths "1.5"^^xsd:decimal. :res3 :bedrooms "3"^^xsd:decimal. SPARQL Query SELECT?r?ba?br WHERE {?r rdf:type :House.?r :baths?ba.?r :bedrooms?br } Result Bindings?r?ba?br =================== :res1 "2.5" "3" :res3 "1.5" "3"

From SPARQL to GeoSPARQL RDF Data :res1 rdf:type :House. :res1 :baths "2.5"^^xsd:decimal. :res1 :bedrooms "3"^^xsd:decimal. :res2 rdf:type :Condo. :res2 :baths "2"^^xsd:decimal. :res2 :bedrooms "2"^^xsd:decimal. :res3 rdf:type :House :res3 :baths "1.5"^^xsd:decimal. :res3 :bedrooms "3"^^xsd:decimal. SPARQL Query SELECT?r?ba?br WHERE {?r rdf:type :House.?r :baths?ba.?r :bedrooms?br FILTER (?ba > 2) } Result Bindings?r?ba?br =================== :res1 "2.5" "3"

From SPARQL to GeoSPARQL Spatial RDF Data :res1 rdf:type :House. This is what GeoSPARQL :res1 :baths "2.5"^^xsd:decimal. standardizes :res1 :bedrooms "3"^^xsd:decimal. :res1 ogc:hasgeometry :geom1. :geom1 ogc:aswkt "POINT(-122.25 37.46)"^^ogc:wktLiteral. :res3 rdf:type :House Vocabulary & :res3 :baths "1.5"^^xsd:decimal. Datatypes :res3 :bedrooms "3"^^xsd:decimal. :res3 ogc:hasgeometry :geom3. :geom3 ogc:aswkt "POINT(-122.24 37.47)"^^ogc:wktLiteral. Find houses within a search polygon GeoSPARQL Query Extension SELECT?r?ba?br Functions WHERE {?r rdf:type :House.?r :baths?ba.?r :bedrooms?br.?r ogc:hasgeometry?g.?g ogc:aswkt?wkt FILTER(ogcf:sfWithin(?wkt, "POLYGON( )"^^ogc:wktliteral)) }

RDB2RDF for viewing Spatial Data as RDF Relational Data id int baths number bedrooms number HOUSE table geom SDO_GEOMETRY 1 2.5 3 POINT(-122.25 37.46) 3 1.5 3 POINT(-122.24 37.47) RDB2RDF: Direct Mapping sem_apis.create_rdfview_model ( House_Model, sys.odcivarchar2list('house'), 'http://dm/ ); <http://dm/rdfuser.house/id=1> rdf:type <http://dm/rdfuser.house>; :baths "2.5"^^xsd:decimal; :bedrooms "3"^^xsd:decimal; :geom "POINT( )"^^ogc:wktliteral. <http://dm/rdfuser.house/id=3> rdf:type <http://dm/rdfuser.house>; :baths "1.5"^^xsd:decimal; :bedrooms "3"^^xsd:decimal; :geom "POINT( )"^^ogc:wktliteral. RDF View (of Relational Data)

House table RDB2RDF for viewing Spatial Data as RDF Relational Data id int baths number bedrooms number geom SDO_GEOMETRY 1 2.5 3 POINT(-122.25 37.46) 3 1.5 3 POINT(-122.24 37.47) RDB2RDF: Direct Mapping sem_apis.create_rdfview_model ( House_Model, sys.odcivarchar2list('house'), 'http://dm/ ); Querying RDF View PREFIX : <http://dm/rdfuser.house#>. SELECT?r?ba?br WHERE {?r rdf:type <http://dm/rdfuser.house>; :baths?ba; :bedrooms?br; :geom?wkt. FILTER (ogcf:sfwithin(?wkt, "POLYGON( )"^^ogc:wktliteral) ) }

GeoSPARQL Support in Oracle Oracle Spatial and Graph supports the following conformance classes for GeoSPARQL Core Topology Vocabulary Extension (Simple Features) Geometry Extension (WKT, 1.2.0) Geometry Topology Extension (Simple Features, WKT, 1.2.0) RDFS Entailment Extension (Simple Features, WKT, 1.2.0)

Builds on the power of Oracle Spatial Efficient Spatial Indexing Spatial Reference Systems Built-in support for 1000 s of SRS Coordinate system transformations applied transparently during indexing and query Geometry Types Support OGC Simple Features geometry types Point, Line, Polygon Multi-Point, Multi-Line, Multi-Polygon Geometry Collection Up to 500,000 vertices per Geometry

GeoSPARQL New 12.2 Features New utility functions Support for EPSG SRID URIs Revised Geometry Storage Scheme big performance gain SDO_JOIN Spatial Aggregates 3D Support