When using this architecture for accessing distributed services, however, query broker and/or caches are recommendable for performance reasons.

Similar documents
LifeWatch/EnvEurope User Forum Use Case Ecology

EnvThes - interlinked thesaurus for long term ecological research, monitoring, and experiments

SEXTANT 1. Purpose of the Application

Reducing Consumer Uncertainty

Automatic Creation of INSPIRE Meta-information from SWE Services

INSPIRE & Linked Data: Bridging the Gap Part II: Tools for linked INSPIRE data

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

/// INTEROPERABILITY BETWEEN METADATA STANDARDS: A REFERENCE IMPLEMENTATION FOR METADATA CATALOGUES

FP7-INFRASTRUCTURES Grant Agreement no Scoping Study for a pan-european Geological Data Infrastructure D 4.4

(Geo)DCAT-AP Status, Usage, Implementation Guidelines, Extensions

The GeoPortal Cookbook Tutorial

The European Commission s science and knowledge service. Joint Research Centre

PortalU, a Tool to Support the Implementation of the Shared Environmental Information System (SEIS) in Germany

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University

> Semantic Web Use Cases and Case Studies

GeoDCAT-AP Representing geographic metadata by using the "DCAT application profile for data portals in Europe"

THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I)

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

CREATING SMART TRANSPORT SERVICES BY FACILITATING THE RE-USE OF OPEN GIS DATA

Multi-disciplinary Interoperability: the EuroGEOSS Operating Capacities

Relation between Geospatial information projects related to GBIF

Compass INSPIRE Services. Compass INSPIRE Services. White Paper Compass Informatics Limited Block 8, Blackrock Business

Metadata of geographic information

Enabling Efficient Discovery of and Access to Spatial Data Services. CHARVAT, Karel, et al. Abstract

Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr.

A5.2-D3 [3.5] Workflow Design and Construction Service Component Specification. Eva Klien (FHG), Christine Giger (ETHZ), Dániel Kristóf (FOMI)

Capturing ICAN end-user requirements for NETMAR

INSPIRE: The ESRI Vision. Tina Hahn, GIS Consultant, ESRI(UK) Miguel Paredes, GIS Consultant, ESRI(UK)

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Leveraging metadata standards in ArcGIS to support Interoperability. Aleta Vienneau and Marten Hogeweg

GeoPortal 4 Everybody

Validation experience

Spatial Data on the Web

Spatial Data on the Web

Harvesting Open Government Data with DCAT-AP

ISA Action 1.17: A Reusable INSPIRE Reference Platform (ARE3NA)

DanubeGIS User Manual Document number: Version: 1 Date: 11-Nov-2016

The GEO Discovery and Access Broker

Levels Of Data Interoperability In The Emerging North American Groundwater Data Network

Extension of INSPIRE Download Services TG for Observation Data

INSPIRE Geoportal Rich user experience across member states services

DATA SHARING AND DISCOVERY WITH ARCGIS SERVER GEOPORTAL EXTENSION. Clive Reece, Ph.D. ESRI Geoportal/SDI Solutions Team

Knowledge Inventory for hydrogeology research

IT Infrastructure for BIM and GIS 3D Data, Semantics, and Workflows

SII Law Organization Coordination activities Examples of good practices Education Technical matters Success stories Challenges

Introduction to INSPIRE. Network Services

ECP-2007-GEO OneGeology-Europe. Annex 1: Cookbook

Study and guidelines on Geospatial Linked Data as part of ISA Action 1.17 Resource Description Framework

EarthLookCZ as Czech way to GMES

Linked Open Data and Semantic Technologies for Research in Agriculture and Forestry

The Butterfly Effect. A proposal for distribution and management for butterfly data programs. Dave Waetjen SESYNC Butterfly Workshop May 10, 2012

Publishing Statistical Data and Geospatial Data as Linked Data Creating a Semantic Data Platform

Harvard Hypermap: An Open Source Framework for Making the World's Geospatial Information more Accessible

Cross-Nature, first steps

How to become an INSPIRE node and fully exploit the investments made?

Uniform Resource Management

Linking and Finding Earth Observation (EO) Data on the Web

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

A USER S GUIDE TO REGISTERING AND MAINTAINING DATA SERVICES IN HIS CENTRAL 2.0

Sensor Data Management

HUMBOLDT Application Scenario: Protected Areas

geospatial querying ApacheCon Big Data Europe 2015 Budapest, 28/9/2015

SAFER the GIGAS Effect

Esri Geoportal Server

The What, Why, Who and How of Where: Building a Portal for Geospatial Data. Alan Darnell Director, Scholars Portal

Outline. The Collaborative Research Platform for Data Curation and Repositories: CKAN For ANGIS Data Portal. Open Access & Open Data.

EARTH OBSERVATION DATA INTEROPERABILITY ARRANGEMENT WITH ONTOLOGY REGISTRY

Description. Speaker Patrizia Monteduro (International Consultant, FAO) TRAINING GEONETWORK OPENSOURCE Islamabad, Pakistan, Jan 29-31, 2014

Using Linked Data Concepts to Blend and Analyze Geospatial and Statistical Data Creating a Semantic Data Platform

Interoperability in Science Data: Stories from the Trenches

The EOC Geoservice: Standardized Access to Earth Observation Data Sets and Value Added Products ABSTRACT

Semantic Technologies for Nuclear Knowledge Modelling and Applications

Initial Operating Capability & The INSPIRE Community Geoportal

The geospatial metadata catalogue. FOSS4G Barcelona. Jeroen Ticheler. Founder and chair. Director

Development of a Ground Water Data Portal for Interoperable Data Exchange and Mediation within the NGWMN

Agris on-line Papers in Economics and Informatics. Geospatial Data Infrastructure Components Deployed for LTER -Europe Researchers Community

METAINFORMATION INFRASTRUCTURE FOR GEOSPATIAL INFORMATION

Quality Assured (QA) data

Cataloguing GI Functions provided by Non Web Services Software Resources Within IGN

UN-LINKS Why, What and How. United Nations Library and Information Network for Knowledge Sharing. #FAOdata.

Heterogeneous Missions Accessibility: Interoperability for Earth Observation

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS

INSPIRE overview and possible applications for IED and E-PRTR e- Reporting Alexander Kotsev

Discovery and Access of Geospatial Resources Using GIS Portal Toolkit Marten Hogeweg Product Manager GIS Portal Toolkit

SEMANTIC SOLUTIONS FOR OIL & GAS: ROLES AND RESPONSIBILITIES

Terminologies, Knowledge Organization Systems, Ontologies

Linked Data Practices for the Geospatial Community

INSPIRE Download Service

Welcome. to Pre-bid meeting. Karnataka State Spatial Data Infrastructure (KSSDI) Project, KSCST, Bangalore.

Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty

D6.1.2 Piloting Plan

Title: Author(s)/Organisation(s): Working Group: References: Quality Assurance: A5.2-D3 [3.7] Information Grounding Service Component Specification

PRODUCT BROCHURE ERDAS APOLLO MANAGING AND SERVING GEOSPATIAL INFORMATION

Serving Ireland s Geospatial as Linked Data on the Web

ERDAS APOLLO Managing and Serving Geospatial Information

The Semantic Sensor Network Ontology A Generic Language to Describe Sensor Assets

1. PUBLISHABLE SUMMARY

Introduction to SDIs (Spatial Data Infrastructure)

INSPIRE & Environment Data in the EU

Transcription:

Integration of semantics, data and geospatial information for LTER Abstract The long term ecological monitoring and research network (LTER) in Europe[1] provides a vast amount of data with regard to drivers and impacts of environmental change (Mirtl & Krauze 2007, Mirtl et al. 2013). In a test within the EnvEurope (LIFE08 ENV/IT/000399[4]) and ExpeER projects those data have been exposed via a linked service and SPARQL endpoint (D2RQ) and an OGC SOS to find out best practices for data services. A SNORQL editor, a SPARQL to EXCEL tool and, most important of all, an R SPARQL plugin have been tested as clients. This worked quite well and SPARQLing the D2RQ service and the thesaurus SPARQL endpoint proofed feasible. When using this architecture for ing distributed services, however, query broker and/or caches are recommendable for performance reasons. Top requirement, however would be the integration of spatial data and semantics. Although geosparql is already a W3C and OGC recommendation and a geosparql extension to the D2RQ service exists, we could not test that so far because of the lack of geosparql clients. SPARQLing data sources is nice for the IT-trained domain specialists, our normal domain specialist, however would need applications with simple and easy to use GUIS, based on the SPARQL interface. LTER data challenges The long term ecological monitoring and research network (LTER) in Europe[1] provides a vast amount of data with regard to drivers and impacts of environmental change (Mirtl & Krauze 2007, Mirtl et al. 2013). A variety of ecosystems are monitored touching research topics of several different disciplines, like biodiversity research, soil science, air pollution measurement, groundwater and surface water analysis, meteorology, hydrology and others. In addition to the variety of disciplines involved, data management and analysis is fragmented and carried out by several dislocated institutes which are responsible for the monitored sites. Because of the distributed responsibility, the implemented data management systems vary from collections of simple spreadsheet tables, via distributed domain specific databases to centralized data management systems. Within those systems data are organized using different data models, and related vocabularies, taxonomies and code lists. Existing semantic de facto standards like the EUNIS habitat list[2], the Catalogue of Life, or the World Reference Base for Soil Resources[3] are often not considered depending on the autonomous decision of each data manager and provider. Overcoming the described semantic heterogeneity, resulting from the variety of disciplines and institutes, is crucial for the intended exchange of metadata and data. The establishment and use of a common set of exactly defined concepts is a first step into the direction of semantic harmonization; using those concepts to annotate data and metadata and finally using them

directly in the local data management systems, should be the subsequent activities (Schentz et al. 2005, Adamescu et al. 2010). LTER Europe and related projects like EnvEurope (LIFE08 ENV/IT/000399[4]) or ExpeER[5] are working towards a harmonisation of methods and data flows for long term ecological research, monitoring and experiments. LTER data architecture INSPIRE ILTER Metadata repository Thessaurus & Ref. lists Catalogue service Any data provider EnvEurope Portal Formatiert: Schriftart: 10 Pt. Fig 1. LTER data architecture ( Johannes Peterseil) A DRUPAL based metadata container is provided to manage data related and site related metadata. Those metadata are tagged with keywords from the common thesaurus. The metadata repository provides a download service in EML format and INSPIRE (close to ISO19115) format and a CSW (OGC- catalogue service) interface. So far there also is a central data cache, used for experiments with data services:in order to provide recommendations for the LTER Europe network a two-fold strategy was applied. First a linked service using D2RQ was established providing online to data. With a SPARQL plugin for R statistical software the data from distributed resources can directly be used in the analysis or displayed. Second a SOS server and SWE client provided by 52 North are used to evaluate XML based data services. These services can also be directly consumed by R statistical software or other clients.

A SKOS/RDF based thesaurus (EnvThes) serves as a source of keywords for the metadata system and as knowledge base, to which pointers (URIs of the concepts), entered into the data records, are pointing. LTER Linked service (proof of concept) record base Ontology URL of concept Thesaurus Fig 2: D2RQ service (with regard to the mapping to a common ontology) It should be proven, that exposing data of a relational database via a linked interface and SPARQL endpoint, is possible and that existing knowledge bases can be integrated. Furthermore the question also was, whether or not this service could be used to link distributed databases. Because of the lack of time, the default ontology generated by the D2RQ software, was used as common ontology for the mapping process. With some little modifications it served as bases for the class-mapping and EnvThes, the thesaurus started within the EnvEurope project, was used to map some database records to a common vocabulary (instance mapping). As SPARQL clients we tested: (1) The built- in SNORQL editor (2) A download service for EXCEL spreadsheets http://www.snee.com/sparql/spreadsheetsparql.html (3) A SPARQL plugin for the statistic package R http://linkedscience.org/tools/sparql-package-for-r/

All the 3 clients proofed to work sufficiently, offering an IT-trained domain expert ( at an IT level that could be described as can work with MS- ), the possibility to start further analysis of data starting with the SPARQL query. When using the R plugin it is additionally possible to establish some predefined queries for reuse by people who are not familiar with SPARQL. Within our tests we also could do a SPARQL query over the D2RQ service and EnvThes, thus eg. getting a list of parameters from the D2RQ service and their definitions from EnvThes. The performance of the D2RQ service is not exhilarating and the service does not do any query optimization. But the general test- outcome was, that this service offers the possibility to map data to a common vocabulary and make several of such services ible for distributed SPARQL queries.

Further work needed (1) Top priority of future work is the full integration of spatial information and semantic information. At the Austrian environment agency, the integration of geographic data geo geo record base Ontology Geographic Information (shape files) and other information within ORACLE databases is state of the art. Mostly geographic information results from import of shape files. This geographic information usually is displayed with ESRI tools working in a local client server environment or exposed via WFS (OGC web feature service) or WMS (OGC web map services). The possibility to offer the geographic data via a geosparql endpoint technically is ready, but never was tested by us, because of the lack of geosparql clients. (2) Whilst SPARQLing data is a nice possibility for domain experts with advanced IT skills, we are so far missing tools and/or toolsets for the not so IT-trained users. (3) Performance is crucial for domain specialists who want to do evaluations based on distributed data and/or knowledge. To our experience querybrokers and caches at the site of the analyst are strongly needed. For us real-time analysis is not dominant, so that a harvesting mechanism bringing regularly data to the data mining site could be a good solution.