Persistent Identifiers for Audiovisual Archives and Cultural Heritage

Similar documents
PID System for eresearch

Report on the European Resolution Discovery Service (ERDS) Meeting (Feb 17/18, 2010)

WORKSHOP 28 TH /29 TH APRIL Christine Staiger

PIDs for CLARIN. Daan Broeder CLARIN / Max-Planck Institute for Psycholinguistics

Paul Doorenbosch KB, National Library of the Netherlands Open Repositories 2010, Madrid,

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

CLARIN s central infrastructure. Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

Dutch View on URN:NBN and Related PID Services

CODA CATCHPlus Open Document Annotation. Hennie Brugman OAC II Project Review meeting Chicago July 26-27, 2012

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

Big Data, exploiter de grands volumes de données

From The European Library to The European Digital Library. Jill Cousins Inforum, Prague, May 2007

Persistent Identifiers

Introduction

Key Elements of Global Data Infrastructures

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

Multimedia Project Presentation

Title Version Author(s) Date Status Distribution 1 Introduction 2 OpenSKOS community 2.1 OpenSKOS user stories

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS

OAI-PMH for dummies: how to build an institutional repository with limited resources?

Persistent identifiers, long-term access and the DiVA preservation strategy

The Documentalist Support System: a Web-Services based Tool for Semantic Annotation and Browsing

Persistent identifiers: jnbn, a JEE application for the management of a national NBN infrastructure

Utilizing PBCore as a Foundation for Archiving and Workflow Management

EUDAT Training 2 nd EUDAT Conference, Rome October 28 th Introduction, Vision and Architecture. Giuseppe Fiameni CINECA Rob Baxter EPCC EUDAT members

The National Digital Library Finna Among Digital Research Infrastructures in Finland

EUDAT. Towards a Collaborative Data Infrastructure. Ari Lukkarinen CSC-IT Center for Science, Finland NORDUnet 2012 Oslo, 18 August 2012

The e-depot in practice. Barbara Sierman Digital Preservation Officer Madrid,

Europeana update: aspects of the data

DRIVER Step One towards a Pan-European Digital Repository Infrastructure

The Biblioteca de Catalunya and Europeana

FLAT: A CLARIN-compatible repository solution based on Fedora Commons

Joining the BRICKS Network - A Piece of Cake

EUDAT Towards a Collaborative Data Infrastructure

Networking European Digital Repositories

Persistent and unique Identifiers

Implementation of the Data Seal of Approval

Developing a social science data platform. Ron Dekker Director CESSDA

Europeana, the prototype EDLfoundation Europeana Network Europeana, vs. 1.0 ThoughtLab Technical requirements

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

Post Digitization: Challenges in Managing a Dynamic Dataset. Jasper Faase, 12 April 2012

Linked European Television Heritage

The DART-Europe E-theses Portal

Using EUDAT services to replicate, store, share, and find cultural heritage data

NARCIS: The Gateway to Dutch Scientific Information

D6.1 Project External Website, project flyer and social media presence

Nuno Freire National Library of Portugal Lisbon, Portugal

DOIs for Research Data

1. General requirements

Networking European Digital Repositories

Digital Curation and Preservation: Defining the Research Agenda for the Next Decade

Science Europe Consultation on Research Data Management

Share.TEC System Architecture

Digital Library Interoperability. Europeana

The Semantic Web DEFINITIONS & APPLICATIONS

The Design of a DLS for the Management of Very Large Collections of Archival Objects

Remote Workflow Enactment using Docker and the Generic Execution Framework in EUDAT

EUROPEANA METADATA INGESTION , Helsinki, Finland

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

CONTENTdm & The Digital Collection Gateway New Looks for Discovery and Delivery

Data Discovery - Introduction

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

A distributed network of digital heritage information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

Metadata and Encoding Standards for Digital Initiatives: An Introduction

AGGREGATIVE DATA INFRASTRUCTURES FOR THE CULTURAL HERITAGE

Registry Interchange Format: Collections and Services (RIF-CS) explained

Handles at LC as of July 1999

From Individual Solutions to Generic Tools Digitization at the Max Planck Society. Digitization Day 2012, Geneva Andrea Kulas

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Using Persistent Identifiers at

Records Management and Interoperability Initiatives and Experiences in the EU and Estonia

EUDAT. Towards a pan-european Collaborative Data Infrastructure

CLARIN for Linguists Portal & Searching for Resources. Jan Odijk LOT Summerschool Nijmegen,

Building for the Future

Promoting semantic interoperability between public administrations in Europe

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Services pour identifier et valoriser : l enregistrement de données via DataCite

DCH-RP and PREFORMA Two case studies on the digital preservation of cultural heritage

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Resilient Linked Data. Dave Reynolds, Epimorphics

Ontology Servers and Metadata Vocabulary Repositories

The Sunshine State Digital Network

EUDAT - Open Data Services for Research

The Open Archives Initiative and the Sheet Music Consortium

How can CLARIN archive and curate my resources?

From Open Data to Data- Intensive Science through CERIF

SciX Open, self organising repository for scientific information exchange. D15: Value Added Publications IST

Interoperability and transparency The European context

Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire & Antoine Isaac

Towards repository interoperability

National Documentation Centre Open access in Cultural Heritage digital content

Survey of Existing Services in the Mathematical Digital Libraries and Repositories in the EuDML Project

Robin Wilson Director. Digital Identifiers Metadata Services

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Mass Digitisation Enabling Access, Use and Reuse

Design & Manage Persistent URIs

XML-based production of Eurostat publications

Transcription:

Persistent Identifiers for Audiovisual Archives and Cultural Heritage Hennie Brugman Technical coordinator CATCHPlus Max-Planck-Institute for Psycholinguistics Netherlands Institute for Sound and Vision

Summary CATCH & CATCHPlus Requirements from CATCHPlus, CH and AV Our solution Base technology Identifier management Organisational embedding Application to collections Concluding remarks

CATCH & CATCHPlus CATCH research program by NWO (14 projects) CATCHPlus valorisation project 8 subprojects at large CH institutions Connected by common services Vocabulary services Annotation services Infrastructural: OAI-PMH, persistent identifiers Project bureau hosted by Netherlands Institute for Sound and Vision www.catchplus.nl

Requirements from CATCHPlus, Cultural Heritage and Audiovisual Archives

Requirements (1) Software support Good resolving service available Proven technology, stable and 100% reliable Scalable, with respect to Number of identifiers Performance Globally working solution Redundant hosting and service providing Identification of parts of objects (AV, CH) Possibility to associate metadata with an identifier (AV, CH) Actionable : identifiers can be resolved using http URIs Support for identifier management tasks

Requirements (2) Identifier management Identifier management should be independent of System management Web server management Hosting of resolution services Can be done from the context of a collection management system typically by a responsible collection manager Is efficient, powerful and simple

Requirements (3) Organisation, policy What choices are made by partner institutions? (the fewer flavours, the better) Reliability and sustainability of the service providers Limited and controlable costs Freedom to switch between service providers

CATCHPlus solution

CATCHPlus solution: base technology Based on Handle technology Best meets our requirements http://handle.net/ One Local Handle System and Handle prefix per participating Naming Authority Hosted by SARA, (eventually) mirrored by other EPIC partners (redundant hosting) Redundant resolving is inherent to Handle System

CATCHPlus solution: identifier management REST web service For resolving, searching, creation and management of Handles (in one s own Naming Authority) over internet Also support for batch operations ( move collection ) SARA has built the first version for CATCHPlus Available as Open source Ambition: uniform redundant service by EPIC User interface will be developed (Q1-2, 2010) Prototype for evaluation by collection managers

CATCHPlus solution: organisational embedding EPIC (European Persistent Identifier Consortium) SARA (Netherlands), CSC (Finland), GWDG (Germany) Redundant and reliable PID services for escience and eculture in Europe Based on Handles European mirror of Global Handle Repository Governance structure with technical board and board of stakeholders

Application to data sets

Collections and data sets Currently assigning identifiers to: Concepts for the CATCHPlus Vocabulary Repository A subcollection of the Sound and Vision archive Several Dutch cultural heritage institutions and projects expressed interest

Application to data sets Some questions to answer first... What are the objects to assign persistent identifiers to? (versions, metadata records, formats, composite objects...) Is there a relation with already existing identifiers? What syntax to use? Include semantics in your PIDs? Where do your PIDs resolve to, especially for objects that do not have a web representation of their own? Who is responsible for identifier creation and management? What garantees can be made with regard to persistence? Who does hosting? Who provides services?

Steps For existing objects Determine your policies Determine what URLs to resolve to Create and publish PIDs for these URLs Locally store association of URLs and proprietary identifiers For all externally visible metadata: replace proprietary identifiers with PIDs For new objects Ultimately, integrate PID creation and management in your collection management tools and workflows

Objects: Sound and Vision pilot metadata descriptions at level of broadcasts Open data set: polygoon journaal Existing identifiers: task identifiers Resolve to metadata record implies: resolve to dynamically created html page Persistent identifiers are published using OAI-PMH Published metadata refers back to same dynamic web page OAI data provider uses PID service to find handles for internal identifiers/urls

Concluding remarks External accessibility of data and service depends on one resolver service: should be no single point of failure Identifier management is an extra task that explicitly has to be dealt with Explicit commitments with respect to persistency have to be made, and kept Identifier management requires tool support (otherwise too labour intensive and error prone) (Re)organizing your data internally becomes easier Publishing parts of your collections on the internet becomes easier, more consistent and more sustainable

Questions?