Linked Open Data Cloud. John P. McCrae, Thierry Declerck

Similar documents
Semantic Web Fundamentals

Semantic Web Fundamentals

Position Paper: Interoperability Challenges for Linguistic Linked Data

TODO. LLOD and corpora

Building the Multilingual Web of Data. Integrating NLP with Linked Data and RDF using the NLP Interchange Format

Guidelines for Multilingual Linked Data generation and publication

STS Infrastructural considerations. Christian Chiarcos

Datos abiertos de Interés Lingüístico

Maximising (Re)Usability of Language Resources using Linguistic Linked Data

Putting ontologies to work in NLP

WordNets and TEI-LEX. John P. McCrae, Thierry Declerck

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Serving Ireland s Geospatial as Linked Data on the Web

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision

Semantic Web: Core Concepts and Mechanisms. MMI ORR Ontology Registry and Repository

Resilient Linked Data. Dave Reynolds, Epimorphics

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

University of Rome Tor Vergata DBpedia Manuel Fiorelli

Language Resources and Linked Data

Knowledge-Driven Video Information Retrieval with LOD

The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data

ISA Action 1.17: A Reusable INSPIRE Reference Platform (ARE3NA)

Interactive Knowledge Stack A Software Architecture for Semantic Content Management Systems

Linked data implementations who, what, why?

Programming Technologies for Web Resource Mining

Methodology and tools for Multilingual Linguistic Linked Data generation

Proposal for Implementing Linked Open Data on Libraries Catalogue

The P2 Registry

digm to show or to reveal as in paradigm where para is a pattern and digm the act of revealing

Harvesting Open Government Data with DCAT-AP

Linked Data Evolving the Web into a Global Data Space

DBpedia Data Processing and Integration Tasks in UnifiedViews

lemon: An Ontology-Lexicon model for the Multilingual Semantic Web

Introducing FREME: Deploying Linguistic Linked Data

Accessing information about Linked Data vocabularies with vocab.cc

Database of historical places, persons, and lemmas

From Dictionaries to Cross-lingual Lexical Resources

Multi-agent and Semantic Web Systems: Linked Open Data

Implementing a Variety of Linguistic Annotations

DCMI Abstract Model - DRAFT Update

Semantically enhancing SensorML with controlled vocabularies in the marine domain

The RMap Project: Linking the Products of Research and Scholarly Communication Tim DiLauro

BIBLIOGRAPHIC REFERENCE DATA STANDARD

1. CONCEPTUAL MODEL 1.1 DOMAIN MODEL 1.2 UML DIAGRAM

KNOWLEDGE GRAPHS. Lecture 2: Encoding Graphs with RDF. TU Dresden, 23th Oct Markus Krötzsch Knowledge-Based Systems

Data is the new Oil (Ann Winblad)

Web Ontology for Software Package Management

Linked Data. Department of Software Enginnering Faculty of Information Technology Czech Technical University in Prague Ivo Lašek, 2011

Data Governance for the Connected Enterprise

Building Blocks of Linked Data

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28 APRIL 2016

Semantic Web and Natural Language Processing

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia framework (MPEG-21) Part 21: Media Contract Ontology

XLIFF 2.0 AND ENRICHMENT WORKFLOWS IN THE BROWSER

FIBO Metadata in Ontology Mapping

Qualifications Dataset Register User Manual: Publishing Workflow

Semantic Web. Tahani Aljehani

Background and Context for CLASP. Nancy Ide, Vassar College

Developing markup metaschemas to support interoperation among resources with different markup schemas

Linked Data Semantic Web Technologies 1 (2010/2011)

The German DBpedia: A Sense Repository for Linking Entities

Semantic Web Information Management

The MEG Metadata Schemas Registry Schemas and Ontologies: building a Semantic Infrastructure for GRIDs and digital libraries Edinburgh, 16 May 2003

Towards an Integrated Information Framework for Service Technicians

Expressing language resource metadata as Linked Data: A potential agenda for the Open Language Archives Community

Dative: from collaborative database to Archival Information Package

Europeana update: aspects of the data

Semantic Web Systems Linked Open Data Jacques Fleuriot School of Informatics

BIBLID (2004) 93:1 pp (2004.6) 209. NBINet NBINet 92

SEXTANT 1. Purpose of the Application

Day 2. RISIS Linked Data Course

Linked Open Data: a short introduction

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Unlocking the full potential of location-based services: Linked Data driven Web APIs

An e-infrastructure for Language Documentation on the Web

Web Service: annotateservice. Operations. annotate. The service performs semantic annotation of textual documents provided as plain text or as XML.

Semantic Web. Ontology Pattern. Gerd Gröner, Matthias Thimm. Institute for Web Science and Technologies (WeST) University of Koblenz-Landau

Technische Universität Dresden Fakultät Informatik. Wikidata. A Free Collaborative Knowledge Base. Markus Krötzsch TU Dresden.

DBpedia-An Advancement Towards Content Extraction From Wikipedia

DBpedia Extracting structured data from Wikipedia

Ontology Servers and Metadata Vocabulary Repositories

Rupert Westenthaler. Open Annotation Support for Apache Stanbol

Linked Data in Translation-Kits

The Semantic Web Revisited. Nigel Shadbolt Tim Berners-Lee Wendy Hall

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

Building a missing item in INSPIRE: The Re3gistry

(Geo)DCAT-AP Status, Usage, Implementation Guidelines, Extensions

Semantic Web for Earth and Environmental Terminology (SWEET) Status, Future Development and Community Building

Linking Distributed Data across the Web

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Why You Should Care About Linked Data and Open Data Linked Open Data (LOD) in Libraries

Towards a joint service catalogue for e-infrastructure services

Semantic Web. MPRI : Web Data Management. Antoine Amarilli Friday, January 11th 1/29

Introduction. October 5, Petr Křemen Introduction October 5, / 31

On the use of Abstract Workflows to Capture Scientific Process Provenance

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

B2FIND and Metadata Quality

Assisting IoT Projects and Developers in Designing Interoperable Semantic Web of Things Applications

Language Resources and Linked Data (EKAW 2014, Linköping, Sweden)

Transcription:

Linked Open Data Cloud John P. McCrae, Thierry Declerck

Hitchhiker s guide to the Linked Open Data Cloud

DBpedia Largest node in the linked open data cloud Nucleus for a web of open data Most data is derived by parsing Wikipedia E.g., https://en.wikipedia.org/wiki/c++ => http://dbpedia.org/resource/c++ Uses transparent content negotiation

Transparent content negotiation I want to know about C++ and I understand RDF and HTML curl -H "Accept: application/rdf+xml;text/html" -I \ http://dbpedia.org/resource/c++ Go to this location for the RDF/XML version HTTP/1.1 303 See Other... Location: http://dbpedia.org/data/c++.xml...

Transparent content negotiation I want to know about C++ and I only know HTML curl -H "Accept: text/html" -I \ http://dbpedia.org/resource/c++ Go to this location for the HTML version Use /resource/ HTTP/1.1 303 See Other URL to refer... Location: http://dbpedia.org/page/c++ to concept...

DBpedia Pages Ontology properties Links to other resources

DBpedia ontology Axioms External Links Labels

WikiData RDF Version: https://www.wikidata.org/entity/q240 7

BabelNet Dictionary compiled from Wikipedia (Open Mulitlingual) WordNet Wiktionary OmegaWiki WikiData

LexVo Assigns URIs to words (strings in a language) Contains links to WordNet, FrameNet etc Definitions of ISO Language Codes

Domain datasets 100 s of domain specific datasets

LexInfo BabelNet DBpedia LexVo

Reusing URIs

Why reuse URIs Data interoperability Queries work over multiple datasets Semantic definitions allows alignments to be reasoned (Often) the creators of the URIs have good idea on how data should be structured

Challenges of interoperability Linguistic Differences The Fulton County Grand Jury said Friday Susanne AT NP1s NNL1cb JJ NN1c VVDv NPD1 Penn DT NNP NNP NNP NNP VBD NNP Differences in Granularity

Language codes en fr de Problem: 7,000+ languages and more dialects, but only 262=676 codes th br? br = Breton

ISO Language Codes pms Piedmontese ang Anglo-Saxon fr-ca Québécois 3-Letter codes with region cover minority, historical languages, right?

Variability How to tag this talk? en? en-latn? (As it is not written in Cyrillic) en-bg or en-100 (As it is presented in Bulgaria) en-gb or en-826 (As is is composed in British English) en-latn-gb? Region subtags are used to indicate linguistic variations associated with or appropriate to a specific country, territory, or region. Typically, a region subtag is used to indicate variations such as regional dialects or usage, or region-specific spelling conventions. It can also be used to indicate that content is expressed in a way that is appropriate for use throughout a region, for instance, Spanish content tailored to be useful throughout Latin America. -- RFC 5646

Glottolog Identifies languoids (language varieties) Uses URLs http://glottolog.org/resource/languoid/id/queb1 247 More information can be found by following the link

Linked Open Vocabularies http://lov.linkeddata.es/

ISOcat Effort to standardize linguistic vocabulary from ISO Technical Committee Standardized Data Categories in a Registry Discontinued in December 2014

Problems with ISOcat According to Schuurman et al. Too easy to get a login Out-of-control Entries were copies of other entries People sometimes copied an entry, just in order to make sure the original owner would not change the entry without them knowing it Complexity - Too many obligatory and overly technical fields As an alternative the CLARIN concept registry is (still) being introduced. I. Schuurman, M. Windhouwer, O. Ohren, D. Zeman, CLARIN Concept Registry: The new semantic registry, in CLARIN 2015 Selected Papers (2015), pp. 62 70

LexInfo Ontology for associat[ing] linguistic information with respect to any level of linguistic description and expressivity to elements in an ontology Expands OntoLex-Lemon with a set of general categories

LexInfo - Properties and Values Properties and open-world (nonexhaustive) list of values

LexInfo - Verb Frames Verb frames with formal definitions

LexInfo - Arguments Hierarchies of arguments to be used in the frames

OLiA Ontologies of Linguistic Annotation Modular architecture for describing annotation schemes: Reference Model: Common terminology (similar to LexInfo) Annotation Model: Describes a particular annotation scheme Linking Model: Describes the linking between the reference and annotation

GOLD - General Ontology Linguistic Description Quite popular Defines many terms Loose semantics Sometimes has range and domains on properties Not clear how this fits together

Submitting to the LOD Cloud

Go to lod-cloud.net

Fill in the form

Fields Identifier unique alphanumeric string Title Full name in English Description 2-10 sentence description in English Full Download A link to the complete dataset, ideally as compressed N-Triples SPARQL Endpoint If available Other Download Other formats for download or partial downloads

Fields (2) Example A single resource that resolves Keywords Domain Defines the colour in the diagram Website Contact Point Links Number of triples linking to another dataset in the cloud Size Number of triples in this dataset Namespace, DOI, Image (if desired)

Stars for metadata quality Availability of resource

Services using linked data

Service-oriented architectures It is implemented a self-contained operation unit. It is a black box for its consumers, which only need to know the interface, not the implementation. It may consist of other underlying services. Interoperability is a significant challenge here

Service chains Translation DE => EN Often tricky to do in practice! Sentiment Analysis (EN) Parser (EN)

Issues with service chains Services are often components of pipelines without clear usage to the end user The technology readiness level of services is often quite low, with little documentation or graphical user interface, Services are hard to install often requiring compiling from source or specialized libraries not found in major software repositories.

Teanga RDF and Linked Data to provide service interoperability Docker to enable easy install and usage Attractive Web Front-End (Bootstrap, AngularJS, NodeJS) Graceful control of errors

LAPPS Grid Defines key vocabularies for service interoperability LAPPS Interchange Format (JSON-LD) Web Service Exchange Vocabulary Human-in-the-loop workflow construction using Galaxy

Summary

Summary Linked Open Data Cloud Big Many relevant tools Fragmented Interoperability is less terrible than other systems

Thanks. This publication has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289, co-funded by the European Regional Development Fund