Publishing Official Classifications in Linked Open Data

Giorgia Lodi**, Antonio Maccioni**, Monica Scannapieco*, Mauro Scanu*, Laura Tosco*
* Istituto Nazionale di Statistica (Istat)
** Agenzia per l'Italia Digitale (AgID)

Motivations
- At the end of 2012, the Agency for Digital Italy (AgID) published the first version of the National Guidelines on the use of Linked Open Data (LOD) as the paradigm for enabling semantic interoperability between public administrations.
- Among the key datasets, AgID identified official classifications as cross-domain data to be published in LOD.

Monica Scannapieco, SemStat2014, Riva del Garda, 19/10/2014

Aims of the Project
The Italian National Institute of Statistics (Istat) and AgID launched a joint project:
- To model official classifications (Ateco 2007, the Classification of Economic Activity, and COFOG, the Classification of the Functions of Government) in LOD using standard ontologies (SKOS, XKOS, ADMS, ...)
- To certify data by provenance using the PROV framework, i.e. to document (i) the overall process of creating a classification and (ii) the creation and publication of its LOD version
- To publish the result on a SPARQL endpoint hosted by AgID: http://spcdata.digitpa.gov.it

Statistical Official Classifications
Official statistical classifications:
- Can be reused in different production processes
- Promote harmonization
Important models:
- Neuchâtel Terminology Model Classification (2004)
- Generic Statistical Information Model (GSIM) (2013)
[Figure: GSIM UML schema]

ATECO 2007
- Classificazione delle Attività Economiche (ATECO, Classification of Economic Activity), version 2007
- Italian counterpart of the NACE classification (http://epp.eurostat.ec.europa.eu/portal/page/portal/nacerev2/introduction)
- Consists of the following levels:

ATECO             NACE
Sezioni           Sections
Divisioni         Divisions
Gruppi            Groups
Classi            Classes
Categorie         -
Sottocategorie    -
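The level of an item can be read off the shape of its code. The following is only a minimal sketch, assuming the usual ATECO 2007 notation (a single letter for Sezioni, then two to six digits with a dot after the second and fourth digit, e.g. 01.11.10). The function name `ateco_level` is hypothetical and not part of the project.

```python
import re

# Assumed ATECO 2007 notation: a section is a single letter; every other
# level is 2 to 6 digits, with a dot after the 2nd and after the 4th digit.
LEVEL_BY_DIGITS = {
    2: "Divisioni",
    3: "Gruppi",
    4: "Classi",
    5: "Categorie",
    6: "Sottocategorie",
}

# Accepts 01, 01.1, 01.11, 01.11.1 and 01.11.10, but not e.g. 01.1.1
CODE_PATTERN = re.compile(r"\d{2}(\.\d(\d(\.\d\d?)?)?)?")


def ateco_level(code: str) -> str:
    """Return the ATECO level name for a code such as 'A', '01.1' or '01.11.10'."""
    code = code.strip()
    if re.fullmatch(r"[A-Z]", code):
        return "Sezioni"
    if not CODE_PATTERN.fullmatch(code):
        raise ValueError(f"not a recognised ATECO code: {code!r}")
    digits = code.replace(".", "")
    return LEVEL_BY_DIGITS[len(digits)]


for sample in ["A", "01", "01.1", "01.11", "01.11.1", "01.11.10"]:
    print(sample, "->", ateco_level(sample))
```

Per the table above, only the first four levels have a NACE counterpart; Categorie and Sottocategorie are specific to ATECO.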

PROV Ontology
- W3C recommendation, OWL2 ontology
- Provides a set of classes, properties, and restrictions to represent provenance information
- Publishing, together with the data, who is responsible for them: the person, institution, or administration that manages, creates, or manipulates the data
- Certifying data: data are certified if they are published by the party responsible for them
- Provenance of data: information about the entities and processes involved in producing and publishing the data
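As an illustration of the kind of statements PROV makes possible, here is a minimal sketch in plain Python that emits N-Triples. The PROV-O terms are real; the `http://example.org/` URIs, the resource names, and the helper functions are assumptions made for the example and do not reflect the project's actual provenance model.

```python
PROV = "http://www.w3.org/ns/prov#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/"  # hypothetical base URI, not the project's real one


def provenance_triples(dataset: str, activity: str, agent: str, source: str):
    """Describe who produced a dataset, through which activity, and from what."""
    d, a, g, s = EX + dataset, EX + activity, EX + agent, EX + source
    return [
        (d, RDF_TYPE, PROV + "Entity"),
        (s, RDF_TYPE, PROV + "Entity"),
        (a, RDF_TYPE, PROV + "Activity"),
        (g, RDF_TYPE, PROV + "Agent"),
        (d, PROV + "wasGeneratedBy", a),     # the LOD version results from an activity
        (d, PROV + "wasDerivedFrom", s),     # ... and derives from the original source
        (d, PROV + "wasAttributedTo", g),    # the responsible party for the data
        (a, PROV + "wasAssociatedWith", g),  # the agent that carried out the activity
    ]


def to_ntriples(triples) -> str:
    """Serialize URI-only triples as N-Triples, one statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


# Hypothetical resource names, for illustration only
triples = provenance_triples("ateco2007-lod", "lod-publication", "istat", "ateco2007-csv")
print(to_ntriples(triples))
```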

XKOS Ontology
- XKOS: eXtended Knowledge Organization System
- Data Documentation Initiative (DDI)
- Extension of SKOS (Simple Knowledge Organization System)
- Manages statistical classifications
- Implements requirements laid out in ISO standards (ISO 704:2000, revised 2009, and ISO 1087-1:2000)
- Represents statistical classifications with their structure, textual properties, and relations between classifications

XKOS and SKOS
XKOS uses SKOS classes and related properties (codes, labels, etc.) to represent:
- Classification items (skos:Concept)
- Classification scheme (skos:ConceptScheme)
- Classification (skos:Concept)
[Figure: SKOS diagram with skos:Concept, skos:Collection, skos:inScheme, skos:member, skos:broader, skos:narrower]
SKOS vocabulary: loosely defined notions of "broader than", "narrower than", and "related to" relationships.
Statistical classifications (XKOS):
- Formally defined hierarchical relations, referred to as generic (generic-specific) and partitive (whole-part)
- Levels that correspond to increasingly detailed views of the field covered
- Associations that are more specific than the generic "related to"; causal, sequential, and temporal relations are defined
- A classification scheme (skos:ConceptScheme) may be attached to a classification using xkos:belongsTo
XKOS adds classes and properties to describe the links between these objects, e.g.: xkos:belongsTo, xkos:follows, xkos:supersedes, xkos:classifiedUnder, xkos:ClassificationLevel, xkos:depth, xkos:organizedBy, xkos:levels, xkos:Correspondence, xkos:madeOf
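To make the combination of SKOS and XKOS concrete, the sketch below generates Turtle for a two-level fragment of a classification. It is an assumption-laden illustration, not the project's model: the `ex:` namespace, the level names, and the function `classification_turtle` are hypothetical, and the two items use the NACE Rev. 2 English labels for Section A and Division 01.

```python
PREFIXES = """\
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix xkos: <http://rdf-vocabulary.ddialliance.org/xkos#> .
@prefix ex:   <http://example.org/ateco2007/> .
"""

# (notation, English label, depth, parent notation)
ITEMS = [
    ("A", "Agriculture, forestry and fishing", 1, None),
    ("01", "Crop and animal production, hunting and related service activities", 2, "A"),
]
# Hypothetical local names for the levels; every level here has at least one member
LEVELS = {1: "sections", 2: "divisions"}


def classification_turtle(items=ITEMS, levels=LEVELS) -> str:
    """Emit a concept scheme, its ordered levels, and its items as Turtle."""
    blocks = [PREFIXES]
    # xkos:levels points to an ordered RDF list of levels, most aggregated first
    level_list = " ".join(f"ex:{name}" for _, name in sorted(levels.items()))
    blocks.append(f"ex:scheme a skos:ConceptScheme ;\n    xkos:levels ( {level_list} ) .\n")
    for depth, name in sorted(levels.items()):
        members = ", ".join(f"ex:{n}" for n, _, d, _ in items if d == depth)
        blocks.append(
            f"ex:{name} a xkos:ClassificationLevel ;\n"
            f"    xkos:depth {depth} ;\n"
            f"    skos:member {members} .\n"
        )
    for notation, label, _, parent in items:
        body = [
            f"ex:{notation} a skos:Concept",
            "skos:inScheme ex:scheme",
            f'skos:notation "{notation}"',
            f'skos:prefLabel "{label}"@en',
        ]
        if parent is not None:
            body.append(f"skos:broader ex:{parent}")
        blocks.append(" ;\n    ".join(body) + " .\n")
    return "\n".join(blocks)


print(classification_turtle())
```

The items carry the plain SKOS hierarchy (skos:broader), while the levels are the XKOS addition: each xkos:ClassificationLevel groups the items of one depth through skos:member.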

LinkedATECO 2007
- Available at the AgID SPARQL endpoint: http://spcdata.digitpa.gov.it/
- Uses different ontologies and vocabularies
- Realization steps:
  1. Provenance modeling
  2. Classification modeling
  3. Classification distribution
  4. Implementation
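A SPARQL endpoint of this kind is typically queried over HTTP. The sketch below only builds the request URL following the SPARQL 1.1 Protocol (a GET request with a `query` parameter) and does not send it. The `/sparql` path is an assumption, since the slides give only the host, and the endpoint may no longer be reachable; the query itself is a generic SKOS query, not one taken from the project.

```python
from urllib.parse import urlencode

# Assumed endpoint path: the slides only give http://spcdata.digitpa.gov.it/
ENDPOINT = "http://spcdata.digitpa.gov.it/sparql"

# Generic query: list a few classification items with their code and label
QUERY = """\
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
SELECT ?item ?code ?label WHERE {
  ?item a skos:Concept ;
        skos:notation ?code ;
        skos:prefLabel ?label .
} LIMIT 10
"""


def sparql_get_url(endpoint: str, query: str) -> str:
    """Build a SPARQL 1.1 Protocol GET URL carrying the query string."""
    return endpoint + "?" + urlencode({"query": query})


url = sparql_get_url(ENDPOINT, QUERY)
print(url)
```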

Provenance Modeling

Classification Modeling

Classification Distribution
- ADMS (Asset Description Metadata Schema): describes semantic assets, defined as highly reusable metadata
- DCAT (Data Catalog Vocabulary): inserts the classification as a dataset in our data catalogue and decouples the abstract notion of a dataset from its actual implementation
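The decoupling that DCAT provides can be sketched as one abstract dataset with several concrete distributions. This is a minimal illustration under stated assumptions: the DCAT and Dublin Core terms are real, while the `Dataset` and `Distribution` classes and every `http://example.org/` URI are hypothetical.

```python
from dataclasses import dataclass, field

DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


@dataclass
class Distribution:
    """One concrete, downloadable form of a dataset."""
    uri: str
    access_url: str
    media_type: str


@dataclass
class Dataset:
    """The abstract dataset, independent of any file format."""
    uri: str
    title: str
    distributions: list = field(default_factory=list)

    def triples(self):
        yield (self.uri, RDF_TYPE, DCAT + "Dataset")
        yield (self.uri, DCT + "title", self.title)
        for dist in self.distributions:
            yield (self.uri, DCAT + "distribution", dist.uri)
            yield (dist.uri, RDF_TYPE, DCAT + "Distribution")
            yield (dist.uri, DCAT + "accessURL", dist.access_url)
            yield (dist.uri, DCAT + "mediaType", dist.media_type)


# Hypothetical URIs, for illustration only
ateco = Dataset(
    "http://example.org/dataset/ateco2007",
    "ATECO 2007",
    [
        Distribution("http://example.org/dist/ateco2007-ttl",
                     "http://example.org/files/ateco2007.ttl", "text/turtle"),
        Distribution("http://example.org/dist/ateco2007-csv",
                     "http://example.org/files/ateco2007.csv", "text/csv"),
    ],
)
triples = list(ateco.triples())
for t in triples:
    print(t)
```

Adding a new serialization then means adding a distribution; the dataset URI and its description stay unchanged.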

Implementation
- Data import: CSV file from http://sistemaclassificazioni.istat.it/class/sistemaclassificazioni/index.php into a relational database
- Pre-processing: e.g. adding new attributes composed from existing fields
- Data mapping: D2RQ framework
- Publishing the resulting RDF data: D2RQ framework
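The steps above can be sketched end to end with the Python standard library. Note that this deliberately swaps in a hand-written mapping where the project used the D2RQ framework, so it shows the shape of the pipeline rather than the actual implementation. The CSV column names, the `http://example.org/` URI base, and the function names are all assumptions.

```python
import csv
import io
import sqlite3

SKOS = "http://www.w3.org/2004/02/skos/core#"
BASE = "http://example.org/ateco2007/"  # hypothetical URI base

# Stand-in for the downloaded CSV file; the column names are assumptions
SAMPLE_CSV = '''code,label,parent
A,"Agriculture, forestry and fishing",
01,"Crop and animal production, hunting and related service activities",A
'''


def load(conn, text):
    """Data import: CSV rows into a relational table."""
    conn.execute("CREATE TABLE item (code TEXT PRIMARY KEY, label TEXT, parent TEXT)")
    rows = [(r["code"], r["label"], r["parent"] or None)
            for r in csv.DictReader(io.StringIO(text))]
    conn.executemany("INSERT INTO item VALUES (?, ?, ?)", rows)


def preprocess(conn):
    """Pre-processing: add an attribute composed from existing fields."""
    conn.execute("ALTER TABLE item ADD COLUMN uri TEXT")
    conn.execute("UPDATE item SET uri = ? || code", (BASE,))


def map_to_triples(conn):
    """Data mapping: each row becomes a few SKOS statements (D2RQ's role in the project)."""
    query = ("SELECT i.uri, i.code, i.label, p.uri FROM item i "
             "LEFT JOIN item p ON i.parent = p.code ORDER BY i.code")
    triples = []
    for uri, code, label, parent_uri in conn.execute(query):
        triples.append((uri, SKOS + "notation", code))
        triples.append((uri, SKOS + "prefLabel", label))
        if parent_uri:
            triples.append((uri, SKOS + "broader", parent_uri))
    return triples


conn = sqlite3.connect(":memory:")
load(conn, SAMPLE_CSV)
preprocess(conn)
triples = map_to_triples(conn)
for t in triples:
    print(t)
```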

Conclusions
Contribution of the work:
- Technical: modeling aspects of OS classifications by making use of ontologies specific to the statistical domain, like XKOS, as well as more general-purpose ontologies, like PROV.
- Methodological: data interoperability is made possible by a twofold effort:
  - Shared formats and models, i.e., technological standards like the RDF framework and ontologies like XKOS
  - Content harmonization, i.e., common domain-specific information concepts
- OS classifications in LOD as a first step towards content harmonization.