Nuno Freire National Library of Portugal Lisbon, Portugal


Date submitted: 05/07/2010

UNIMARC in The European Library and related projects

Nuno Freire
National Library of Portugal
Lisbon, Portugal
E-mail: nuno.freire@bnportugal.pt

Meeting: 148. UNIMARC

WORLD LIBRARY AND INFORMATION CONGRESS: 76TH IFLA GENERAL CONFERENCE AND ASSEMBLY
10-15 August 2010, Gothenburg, Sweden
http://www.ifla.org/en/ifla76

Abstract: This paper provides an overview of the use of UNIMARC in The European Library service and in related projects, highlighting the current and future prospects for handling and exploiting UNIMARC data in these contexts. It covers UNIMARC conversion and harvesting in specific profiles, distributed search and retrieval, FRBRization, rights management and data provision to Europeana.

INTRODUCTION

The European Library [1] was created in 2005 as a service of the Conference of the European National Libraries [2] (CENL). It aims to offer a common access point to the distributed collections of the 48 national libraries of Europe in 32 languages. The European Library is a free service and currently gives access to 150 million bibliographic records across Europe, describing both digital and analogue resources.

[Slide: What is The European Library?]
The European Library: an online service for all European national libraries which provides access to their catalogues and digital collections via one central, multilingual web interface (portal).
Libraries:
- Alternate point of access
- Exposure to a wider audience
- Platform for collaboration
Users:
- Single point of access
- Search multiple libraries and collections
- Multilingual interface
- Online exhibitions
www.theeuropeanlibrary.org

The European Library of today is the result of a number of earlier successful projects:

- Project Gabriel (Gateway and Bridge to Europe's National Libraries) ran from 1996 to 2004 and established a joint web portal of European national libraries.
- TEL Project (The European Library: Gateway to Europe's Knowledge) ran from 2001 to 2004 and laid the foundations of the current service. Nine national libraries participated: Finland, Germany, Italy (Florence), Italy (Rome), Netherlands, Portugal, Slovenia, Switzerland and United Kingdom.

The success of The European Library has led to further developments and improvements. The number of member national libraries grew from the initial 9 to the current 48. This growth was supported by European co-funded projects. The main goal of TEL-ME-MOR (2005-2007) was to support, stimulate and facilitate the participation of the 10 national libraries from the New Member States, which were partners in the project. The EDL project (2006-2007) worked towards integrating additional national libraries and enhancing the multilingual capabilities of The European Library portal. Together, the two projects brought the collections of 19 national libraries into the service.

TELplus was another project in the context of The European Library, undertaken by a consortium of 26 partners, including national libraries and research centres. TELplus provided value-adding services and products for The European Library. Its primary objectives were to improve the architectural performance of the service, to enhance multilingual access functionalities, and to provide access to full-text documents originating from OCR processes.

Metadata usage at The European Library

Collections from the national libraries can be made available in The European Library portal via three communication protocols: the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), Z39.50 or Search/Retrieve via URL (SRU).

[Figure: Harvesting via OAI-PMH. Elements shown: TEL server(s), OAI-PMH connections to the libraries, central index]

The choice of communication protocol greatly influences the functionality that the portal can provide to the end user. Although all three protocols provide a standard for communication between the portal and the libraries' systems, the underlying communication paradigms are substantially different. OAI-PMH's design allows the portal to harvest all metadata records from the libraries into a central repository, whereas Z39.50 and SRU were designed for remote search and retrieval, so the metadata records remain only at the data provider.
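The difference between the two paradigms shows up in the shape of the requests themselves. The following is a minimal sketch, assuming hypothetical endpoint URLs and helper names (`oai_list_records_url`, `sru_search_url`) that are not part of The European Library's software; only the request parameters come from the OAI-PMH 2.0 and SRU 1.1 specifications. A harvester asks for every record and pages through the result with a resumption token, while a search client sends a query and receives only the matching records.

```python
from urllib.parse import urlencode


def oai_list_records_url(base_url, metadata_prefix, set_spec=None, resumption_token=None):
    """Build an OAI-PMH ListRecords request (harvesting: fetch all records)."""
    if resumption_token:
        # In OAI-PMH 2.0 resumptionToken is an exclusive argument:
        # it must not be combined with metadataPrefix or set.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def sru_search_url(base_url, cql_query, start_record=1, maximum_records=10):
    """Build an SRU searchRetrieve request (remote search: fetch only matches)."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",  # assumed protocol version for this sketch
        "query": cql_query,
        "startRecord": start_record,
        "maximumRecords": maximum_records,
    }
    return base_url + "?" + urlencode(params)


# Hypothetical endpoints, for illustration only.
print(oai_list_records_url("http://example.org/oai", "marcxchange"))
print(sru_search_url("http://example.org/sru", "dc.title=lusiadas"))
```

In the harvesting case the portal repeats the first call with each returned resumption token until none is left, and then indexes the records centrally. In the search case every end-user query results in a new request to the library's server.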

[Figure: Search and retrieval. Elements shown: TEL server(s), central Z39.50 gateway, Z39.50 connections to the libraries, central index, query]

[Figure: Metadata formats. Elements shown: MARC formats, TEL server(s), central Z39.50 gateway, Z39.50, OAI-PMH, MARC formats and TEL-AP for objects, central index, query]

The European Library portal relies on a metadata application profile based on Dublin Core to achieve interoperability between the diverse bibliographic metadata formats used by the European national libraries: The European Library Application Profile for Objects (TEL-AP). The portal and the central index are designed to work with this metadata format. Libraries' metadata is generally in one of the formats of the MARC family, although other formats are also in use, particularly within digital and specialized collections. Conversion from these formats to the TEL-AP may be performed at the national libraries or at The European Library. In the second case, conversion may be performed in the central Z39.50 gateway or during ingest into the central index.

UNIMARC at The European Library

A study performed in 2007 addressed the use of bibliographic data formats in national libraries [3]. The study included twenty-nine national libraries and found that UNIMARC or a derivative was in use at ten of them: Albania, Bulgaria, Cyprus, France, Greece, Lithuania, Portugal, Republic of Macedonia, Serbia, and Slovenia.

[Slide: UNIMARC usage in The European Library. At the National Libraries]
From a study conducted in 2007 by Christoph Schmidt-Supprian:
- The study included 29 national libraries
- Concerning data formats and descriptive rules, international standards were clearly dominant
- Among 29 libraries, ten used UNIMARC or a derivative: Albania, Bulgaria, Cyprus, France, Greece, Lithuania, Portugal, Republic of Macedonia, Serbia, and Slovenia

Within The European Library, UNIMARC has been supported since the original project, in which three of the eight participating countries used UNIMARC. Conversion of UNIMARC to TEL-AP is supported by the Z39.50 gateway and by the OAI-PMH metadata harvesting system. This conversion is documented in The European Library Handbook, which is available in the members area of the portal. An XSLT implementation of the conversion from UNIMARC to TEL-AP is available and is included in the OAI-PMH software made available for libraries (http://repox.ist.utl.pt).
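To give a feel for what such a conversion involves, here is a minimal sketch in Python. It is not the mapping documented in The European Library Handbook, nor the XSLT implementation mentioned above: the record representation, the function name `unimarc_to_dc` and the small mapping table are assumptions made for illustration. The table covers only a handful of well-known UNIMARC subfields and maps them to plain Dublin Core elements, on which TEL-AP is based.

```python
# Illustrative mapping only: (UNIMARC tag, subfield code) -> Dublin Core element.
# The real UNIMARC to TEL-AP conversion is far more detailed.
UNIMARC_TO_DC = {
    ("200", "a"): "title",       # title proper
    ("700", "a"): "creator",     # personal name, primary responsibility
    ("210", "c"): "publisher",   # name of publisher
    ("210", "d"): "date",        # date of publication
    ("101", "a"): "language",    # language of the text
    ("010", "a"): "identifier",  # ISBN
}


def unimarc_to_dc(record):
    """Convert a simplified UNIMARC record to a dict of Dublin Core values.

    `record` maps each tag to a list of field occurrences; each occurrence
    is a list of (subfield code, value) pairs. This in-memory layout is an
    assumption of the sketch, not a standard serialization.
    """
    dc = {}
    for tag, fields in record.items():
        for subfields in fields:
            for code, value in subfields:
                element = UNIMARC_TO_DC.get((tag, code))
                if element:
                    dc.setdefault(element, []).append(value.strip())
    return dc


example = {
    "200": [[("a", "Os Lusíadas"), ("f", "Luís de Camões")]],
    "700": [[("a", "Camões"), ("b", "Luís de")]],
    "101": [[("a", "por")]],
}
print(unimarc_to_dc(example))
```

Subfields that are not in the table, such as 200 $f or 700 $b in the example, are simply dropped. That loss of detail is typical when a rich MARC record is flattened into a Dublin Core based profile.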

[Slide: UNIMARC usage in The European Library]
Transformation of UNIMARC to the TEL-AP for objects:
- Supported in the central Z39.50 gateway
- Documentation for the transformation of UNIMARC is available in The European Library Handbook (members area)
- XSLT stylesheets for the transformation of UNIMARC are available in the OAI-PMH software made available for libraries: http://repox.ist.utl.pt

Other recent or ongoing developments in The European Library include support for UNIMARC. These are described in the following sections.

UNIMARC in Project TELplus

Project TELplus, which ended in December 2009, included UNIMARC support in two of its work packages.

Work package 2, "Improving the usability of The European Library through OAI-PMH compliancy", aimed at increasing the adoption of OAI-PMH within the participating libraries of The European Library. It provided tools and guidelines for the implementation of OAI-PMH on libraries' catalogues and collections. It also addressed the central usage of OAI-PMH at The European Library, with the development of tools and their integration with the central index. It was in this work package that the XSLT implementation of the UNIMARC conversion to TEL-AP was developed and made available for use in the national libraries and in The European Library.

[Slide: UNIMARC usage in Project TELplus WP2]
WP2: Improving the usability of The European Library through OAI-PMH compliancy
- Provide tools and guidelines for the implementation of OAI-PMH on libraries' catalogues and collections
- Provide tools to expand The European Library central index
- Implemented transformation of UNIMARC to TEL-AP for objects:
  - Based on XSLT stylesheets (for MARCXML and MarcXchange)
  - In the OAI-PMH software made available for libraries, allowing libraries to transform their UNIMARC records to TEL-AP for objects
  - In the central OAI-PMH software of The European Library, allowing UNIMARC metadata to be harvested and transformed centrally to TEL-AP for objects

Work package 3 of TELplus, "Improving accessibility", aimed at improving accessibility in three ways: by enhancing full-text searching, by aggregating related resources, and by investigating automatic vocabulary mappings. It included a task in which UNIMARC was used: setting up a FRBR repository for The European Library. This task had the support of several national libraries, which contributed bibliographic and authority data to the experiment and acted as advisors.

[Slide: UNIMARC usage in Project TELplus WP3]
WP3, task 3.3: FRBR aggregation, search and browsing
- The purpose of this task was to develop solutions to support alternative search services in The European Library according to the FRBR paradigm
- The work focused on a specific collection, to make it easier to assess the results: the works of the Nobel Prize winners in Literature
- Proof of concept available at: http://digmap2.ist.utl.pt:8080/lgtefrbr/evaluation.html
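The core idea of FRBR aggregation can be sketched in a few lines of Python. This is not the algorithm used in the TELplus proof of concept, whose details are not given here: the flat record layout, the field names and the matching key are all assumptions made for illustration. The sketch clusters bibliographic records (manifestations) into works by a normalized author and title key, preferring a uniform title when the record has one so that translations fall into the same work as the original.

```python
import unicodedata
from collections import defaultdict


def normalize(text):
    """Lower-case, strip diacritics and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    kept = "".join(ch if ch.isalnum() else " " for ch in stripped.lower())
    return " ".join(kept.split())


def work_key(record):
    """Key identifying the work a record belongs to (hypothetical heuristic)."""
    title = record.get("uniform_title") or record["title"]
    return (normalize(record["author"]), normalize(title))


def group_into_works(records):
    """Cluster record identifiers by work key."""
    works = defaultdict(list)
    for record in records:
        works[work_key(record)].append(record["id"])
    return dict(works)


records = [
    {"id": "pt-1", "author": "Saramago, José", "title": "Ensaio sobre a cegueira"},
    {"id": "pt-2", "author": "SARAMAGO, Jose", "title": "Ensaio sobre a Cegueira."},
    {"id": "en-1", "author": "Saramago, José", "title": "Blindness",
     "uniform_title": "Ensaio sobre a cegueira"},
    {"id": "pt-3", "author": "Saramago, José", "title": "Memorial do convento"},
]
print(group_into_works(records))
```

A string key like this is fragile in practice. Names are recorded differently across national catalogues, which is one reason authority records are useful as a source of controlled name and title forms.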

The main objective of this task was to investigate possible improvements to search and accessibility of the contents of The European Library by implementing the FRBR model in the search functionality of the portal. The work focused on a specific collection rather than on the complete catalogues, since a controlled collection would allow a better evaluation of the results. The chosen collection was the works of the Nobel Prize winners in Literature.

[Slide: UNIMARC usage in Project TELplus WP3]
- Both bibliographic and authority records from library members of The European Library were gathered
- 5 out of 12 libraries that contributed to the collection sent UNIMARC metadata records
- FRBRization of UNIMARC data was conducted for the following national libraries:
  - French National Library
  - Martynas Mazvydas National Library of Lithuania
  - National Library of Portugal
  - National Library of Serbia
  - Royal Library of Belgium

Both bibliographic and authority records from library members of The European Library were gathered, and a FRBRization process was applied to them. Five of the twelve libraries that contributed to the collection sent UNIMARC metadata records. The FRBRization of UNIMARC data was addressed specifically and applied to the collections of the French National Library, the Martynas Mazvydas National Library of Lithuania, the National Library of Portugal, the National Library of Serbia, and the Royal Library of Belgium. This task resulted in a proof of concept which is still available at: http://digmap2.ist.utl.pt:8080/lgtefrbr/evaluation.html

UNIMARC in Project ARROW

Project ARROW [4] (Accessible Registries of Rights Information and Orphan Works) addresses the interoperability of rights information throughout the process of rights clearance for digitization purposes. It covers the identification of the work, the clarification of its intellectual property rights status and the identification of the rights holders.

[Slide: UNIMARC usage in Project ARROW]
ARROW: Accessible Registries of Rights Information and Orphan Works towards Europeana
Consortium of European national libraries, publishers and collective management organisations, aiming to:
- Deliver a rights information infrastructure operable within and across borders
- Facilitate identification of rights holders (authors/visual artists/publishers)
- Facilitate the identification of the IPR status
- Provide test beds for:
  - Business models for digital libraries
  - Registries for Orphan Works (OW) and Out of Print Works (OPW)
  - Rights clearance mechanisms
  - Model licenses developed by the EC High Level Expert Group (HLEG)
  - Interoperability and criteria for interoperability

ARROW depends on the availability of existing bibliographic data and rights information. There is already an established and generally well-regarded information infrastructure for print material, through national bibliographies, books in print and the databases of rights organisations. Currently, these sources are not interoperable because of differences in data collection policies and data schemas. Bibliographic databases rarely include metadata about rights ownership and usage policies. Such information is usually held in a wide array of formats by publishers, collecting societies and authors.

Bibliographic data from the catalogues of Europe's national libraries is one of the key data sources in ARROW. For this reason, ARROW is building on the existing interoperability achieved through The European Library, using it as a core source of bibliographic data.

[Slide: UNIMARC usage in Project ARROW]
The European Library, as an aggregator of bibliographic metadata from the national libraries:
- Will be the main source of information to identify all manifestations related to a work
- Early results from the project identified that TEL-AP for objects would not fulfil the requirements of the project
- As a result, a new central index based on MARC formats is being prepared:
  - Support for MARC21 already implemented
  - Support for UNIMARC is under development

Early results from the project showed that TEL-AP would not fulfil the requirements of the project. As a result, a new central index based on MARC formats is being prepared at The European Library. The first library to have its catalogue in ARROW, by means of the UNIMARC format, is the Bibliothèque nationale de France.

UNIMARC in data provision to Europeana

The European Library is also an aggregator for Europeana. Although most of the metadata provided so far by The European Library to Europeana has been based on the TEL-AP, in the future it will be based on the MARC formats. The release of the new Europeana Data Model, planned for Spring 2011, will allow the semantic quality of the metadata from the MARC formats to be better preserved when it reaches Europeana. Conversion of UNIMARC to the Europeana Data Model will be a priority for The European Library.

[Slide: Future work]
In the context of The European Library as an aggregator for Europeana:
- New Europeana Data Model (EDM) planned for the Danube release (Spring 2011)
- The European Library will support the transformation of UNIMARC to EDM

References

[1] The European Library portal is available at: http://www.theeuropeanlibrary.org
[2] Members of CENL are the national librarians of the Council of Europe member states. Further information about CENL is available at: http://www.cenl.org/
[3] Schmidt-Supprian, Christoph. 2007. Controlled Vocabularies in a Multilingual Federated Search Environment: The Example of The European Library.
[4] http://www.arrow-net.eu