Europeana update: aspects of the data

Similar documents
The Europeana Data Model and Europeana Libraries Robina Clayphan

Europeana and the Mediterranean Region

& Interoperability Issues

Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire & Antoine Isaac

Linked Open Europeana: Semantics for the Digital Humanities

MINT METADATA INTEROPERABILITY SERVICES

Europeana Data Model. Stefanie Rühle (SUB Göttingen) Slides by Valentine Charles

EUROPEANA METADATA INGESTION , Helsinki, Finland

Achieving interoperability between the CARARE schema for monuments and sites and the Europeana Data Model

Europeana Linked Open Data data.europeana.eu

Sharing Archival Metadata MODULE 20. Aaron Rubinstein

Europeana Creative. EDM Endpoint. Custom Views.

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

When Semantics support Multilingual Access to Cultural Heritage The Europeana Case. Valentine Charles and Juliane Stiller

The Local Amsterdam Cultural Heritage Linked Open Data Network

Europeana Creative. EDM Endpoint. Custom Views

Information und Wissen: global, sozial und frei?

The Europeana Data Model, current status

The CARARE project: modeling for Linked Open Data

Introduction

ECLAP Kick-off An Aggregator Project for EUROPEANA

Metadata Issues in Long-term Management of Data and Metadata

Linked European Television Heritage

Fondly Collisions: Archival hierarchy and the Europeana Data Model

Europeana Data Model Primer

Contribution of OCLC, LC and IFLA

Elevating Natural History Museums Cultural Collections to the Linked Data Cloud

Ontology Servers and Metadata Vocabulary Repositories

An aggregation system for cultural heritage content

(Geo)DCAT-AP Status, Usage, Implementation Guidelines, Extensions

> Semantic Web Use Cases and Case Studies

Building Consensus: An Overview of Metadata Standards Development

Joining the BRICKS Network - A Piece of Cake

Linked Open Europeana: Semantic Leveraging of European Cultural Heritage

Vocabulary Alignment for archaeological Knowledge Organization Systems

Introduction to Metadata for digital resources (2D/3D)

Reducing Consumer Uncertainty

Digital Public Space: Publishing Datasets

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Resilient Linked Data. Dave Reynolds, Epimorphics

CARARE 2.0: a metadata schema for 3D Cultural Objects

Mapping from Flat or Hierarchical Metadata Schemas to a Semantic Web Ontology. Justyna Walkowska, Marcin Werla

Linked Data. The World is Your Database

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

Linking library data: contributions and role of subject data. Nuno Freire The European Library

From Open Data to Data- Intensive Science through CERIF

Semantic challenges in sharing dataset metadata and creating federated dataset catalogs

Linked Data and RDF. COMP60421 Sean Bechhofer

DEMYSTIFYING PUBLISHING TO EUROPEANA: A PRACTICAL WORKFLOW FOR CONTENT PROVIDERS

Deliverable Number: D 5.3. Title of the Deliverable: Metadata gateway. Dissemination Level: Contractual Date of Delivery to EC: Month 12

Nuno Freire National Library of Portugal Lisbon, Portugal

Reflecting on digital library technologies

Semantic Web Fundamentals

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

Serving Ireland s Geospatial as Linked Data on the Web

4) DAVE CLARKE. OASIS: Constructing knowledgebases around high resolution images using ontologies and Linked Data

Ontologies and thesauri How to answer complex questions using interoperability?

Not Just for Geeks: A practical approach to linked data for digital collections managers

Linked Data and cultural heritage data: an overview of the approaches from Europeana and The European Library

Interoperability & Archives in the European Commission

Linked.Art & Vocabularies: Linked Open Usable Data

Linking Knowledge Organization Systems via Wikidata

The Local Amsterdam Cultural Heritage Linked Open Data Network. Lukas Koster Library of the University of Amsterdam.

The Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts. Leslie Carr

Linked Data: What Now? Maine Library Association 2017

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Why You Should Care About Linked Data and Open Data Linked Open Data (LOD) in Libraries

Building a missing item in INSPIRE: The Re3gistry

Digital Library Interoperability. Europeana

Guidelines for Multilingual Linked Data generation and publication

Semantic Web Fundamentals

a paradigm for the Semantic Web Linked Data Angelica Lo Duca IIT-CNR Linked Open Data:

The CoBiS Linked Open Data Project and Portal

The role of vocabularies for estimating carbon footprint for food recipies using Linked Open Data

Metadata for general purposes

Integration of Heterogeneous Metadata in Europeana. Cesare Concordia Institute of Information Science and Technology-CNR

SPARQL เอกสารหล ก ใน มคอ.3

Linked Open Data: a short introduction

case study The Asset Description Metadata Schema (ADMS) A common vocabulary to publish semantic interoperability assets on the Web July 2011

Comparing Open Source Digital Library Software

Research Data Repository Interoperability Primer

Utilizing, creating and publishing Linked Open Data with the Thesaurus Management Tool PoolParty

National Documentation Centre Open access in Cultural Heritage digital content

DELIVERABLE. D2.2 Intermediate Version of the Interoperability Infrastructure

Europeana and semantic alignment of vocabularies

Data is the new Oil (Ann Winblad)

How We Build RIC-O: Principles

Persistent Identifiers for Audiovisual Archives and Cultural Heritage

The Semantic Web DEFINITIONS & APPLICATIONS

GeoDCAT-AP Representing geographic metadata by using the "DCAT application profile for data portals in Europe"

Ontologies SKOS. COMP62342 Sean Bechhofer

Evolving Europeana s Metadata: from ESE to EDM

Europeana Organization Profile

Semantiska webben DFS/Gbg

The P2 Registry

Linked open data at Insee. Franck Cotton Guillaume Mordant

Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr.

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

Transcription:

Europeana update: aspects of the data Robina Clayphan, Europeana Foundation European Film Gateway Workshop, 30 May 2011, Frankfurt/Main

Overview The Europeana Data Model (EDM) Data enrichment activity at Europeana The linked open data pilot

Rationale of EDM Starting point: ESE (Europeana Semantic Elements) represents lowest common denominator for object metadata forces interoperability flat model, mostly with text string values one-to-one principle violated major drawback: richness of the original metadata is lost Goal to move to a model that allows more sophisticated representation of data preserve original data while still allowing for interoperability Semantic Web representation Semantic linking between objects

EDM requirements 1. Distinguish between the real world object (painting, book, program) and its digital representation 2. And the object and the metadata record describing the object. 3. Allow multiple records for same object, containing potentially contradictory statements about an object 4. Support for objects that are composed of other objects 5. Standard metadata format that can be specialized 6. Standard vocabulary format that can be specialized 7. EDM should be based on existing standards

EDM basics OAI ORE for organization of metadata about an object Requirements 1-4 (Distinguishing between objects) Dublin Core for metadata representation Requirement 5 SKOS for vocabulary representation Requirement 6 OAI ORE, Dublin Core and SKOS together fulfil Requirement-7

The general picture Cultural artefact Buildling Sculpture Networked objects Painting

EDM Classes

EDM Properties (excluding ESE)

The Example 1 from Direction des Musees de France 9

The Example 2 from the Louvre 10

Aggregation organizes data of a single provider: example 1 provenance metadata digital representation aggregation Cultural heritage object 11

provenance metadata Two providers and two aggregations (the same object) aggregation of DMF v aggregation of Louvre Cultural heritage object provenance metadata 12

Creating EDM data Apply the different properties to the different objects in the model The properties are all the ESE elements plus the new ones defined for EDM The objects are the classes defined in EDM Not all defined objects will be implemented in the first instance

Creating EDM data Key objects for providers are the information resources ore:aggregation, ens:providedcho, ens:webresource And the non-information resources (contextual) ens:agent, ens:place, ens:timespan, ens:concept Object templates http://europeanalabs.eu/wiki/edmobjecttemplatesproviders Guidelines are being prepared to help in the creation of EDM data

Creating EDM data

Data Enrichment Activity

Data Enrichment Activity Clean and enrich existing Europeana data with standard, multi-lingual terms and references Who, what, where, when, Agent, concept, place, time period Two approaches Apply an existing thesaurus or controlled vocabulary Create an ontology and mapping using Annocultur (or a mix of the two)

When time periods Mixed values in the data Work to identify dates from text value Replace a string with a four digit year value Apply chronological and historical periods No existing usable ontology created an ad hoc ontology from the values found in the data http://annocultor.eu/time/

Linked Open Data Pilot

Linked Open Data Pilot Applying linked data techniques to Europeana data More than just adding links in the metadata values Publishing Europeana data using standard web technologies RDF/XML syntax, identifying resources with URIs Enabling linking to (more) semantically related resources on the web Contribute a large cultural heritage data set to the community Relieves participating data providers from having to implement their own infrastructure

Preconditions Needed a suitable data model EDM in RDF Transform from source ESE a flat model that uses simple string values and breaks the one-to-one rule Needed data agreements for exposing data in the public domain Call for volunteer providers who could allow their data to be exposed this way The pilot is a separate system from the operational Europeana system

Extraction of the volunteers data from Europeana Conversion of the existing ESE data to EDM format Resulting in RDF/XML representation of the data Assigning URIs to the EDM resources thereby created aggregations and proxies Process Incorporating the Europeana-generated links to semantically related resources - from the enrichment activity Adding links to LOD services of providers where known National Library of Hungary, Swedish culture aggregator owl:sameas statements using values in dc:identifier

End result Generation of data dumps Made available for download at http://data.europeana.eu/downloads Ingested into an RDF store searchable as a SPARQL endpoint http://data.europeana.eu From 9 direct providers 300 indirect providers 16 countires Data set of 184m triples before links to any other data sources.

Lessons learned Difficulties in converting from ESE to EDM Knowing which resource any particular ESE element belonged to Use of proxies attaching metadata about e.g. a painting to a resource that is not standing for that painting Meta-level information about the data (licensing, provenance) OAI-ORE resource maps not yet established as the accepted solution Dereferencable HTTP URI design difficulties in persistence between the pilot and the Europeana system RDF Storage and Scalability Only tested for some SPARQL queries so far

Thank you! robinaclayphan@kb.nl

Moving to EDM Migration of existing data to EDM Default mapping from existing ESE data to EDM Needs fine tuning for different sectors (and maybe collections) Finalise definition of which properties belong to which class Object templates Finalise XML schema Finalise which parts of EDM will be implemented first Providers can continue to supply ESE it is a subset of EDM Use controlled vocabularies where possible Ideally use identifiers for e.g. names and places where available Use standard representations of dates etc consistent values in text strings

Enrichment work http://www.europeanalabs.eu/wiki/edmprototypingtask21 EConnect Gazeteer http://europeana-geo.isti.cnr.it/gazetteer/homepage.action And Geoparser http://www.europeana.eu/portal/thoughtlab_enrichingmetadata.html Alignment of controlled vocabularies for subject http://europeanalabs.eu/wiki/wp12vocabularies Authority files for Names VIAF EFG and HOPE

URLs to enrichment work http://europeanalabs.eu/wiki/edmprototypingtask21#relevantworkdone :EuropeanaOffice http://europeanalabs.eu/wiki/edmprototypingtask21annocultor An example of enriched object for concept and period is at http://europeana.eu/portal/record/09405o/452a91de6e574b3130449a4 1FB00D34E5A06EA43.html And one for place and agent is at http://europeana.eu/portal/record/02428/c5f60468b9b2c138a8d9e746 D01B4558F91F286B.html On these pages, you must click on "More" and then "Show Enrichment Fields"

Read about it https://version1.europeana.eu/web/guest/technical-requirements/ All ESE documents and Guidelines http://version1.europeana.eu/web/europeana-project/technicaldocuments/ EDM v5.2 EDM Primer