A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research

A Repository of Metadata Crosswalks
Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young
OCLC Online Computer Library Center, Office of Research
DLF-2004 Spring Forum, April 21, 2004

Outline of this talk
- Crosswalks and metadata translation
- Solution 1: A collection of XSLT scripts
- Solution 2: Pathfinders for crosswalks
- Solution 3: A crosswalk repository
- Open issues
- Project status

Our research project goals
- A robust design for metadata translation
- A clean separation of: document data model schema translations machinery
- Support for current practice and foreseeable innovation
- A metadata translation system/toolkit
- An unplugged service for metadata translation
- A place for human input (intellectual mappings) in an automated system

Our system design
[Diagram labels: Crosswalk repository client; A metadata crosswalk; Record translation client; A record; Metadata Translator; A transformed record]

The problem
- Reusing metadata often requires translation
- Available translation options typically:
  - Support single use cases
  - Produce low-fidelity translations
  - Are cumbersome to maintain

MARC record:
100 $a Shakespeare, William, $d 1564-1616.
245 $a Hamlet.
260 $a New York : $b Penguin Books, $c c2003.

Dublin Core record:
<dc:creator>Shakespeare, William, 1564-1616</dc:creator>
<dc:title>Hamlet</dc:title>
<dc:publisher>Penguin Books</dc:publisher>
<dc:date>2003</dc:date>

What is a crosswalk?
"Crosswalks are used to translate between different metadata element sets. The elements (or fields) in one metadata set are correlated with the elements of another metadata set that have the same or similar meanings. This is also sometimes called semantic mapping."
Source: Canadian Heritage Information Network (http://www.chin.gc.ca/english/)
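The definition above can be made concrete with a small sketch. It treats a crosswalk as a lookup table from source elements to target elements and applies it to the Hamlet record from the problem slide. The table `MARC_TO_DC`, the function `translate`, and the dictionary layout of the record are all assumptions made for illustration; they are not the project's actual data model.

```python
# Hypothetical crosswalk: (MARC tag, subfield code) -> Dublin Core element.
# Only the fields shown in the Hamlet example are covered.
MARC_TO_DC = {
    ("100", "a"): "dc:creator",
    ("100", "d"): "dc:creator",
    ("245", "a"): "dc:title",
    ("260", "b"): "dc:publisher",
    ("260", "c"): "dc:date",
}


def translate(record, crosswalk):
    """Apply a crosswalk to a record given as {tag: {subfield: value}}."""
    collected = {}
    for tag, subfields in record.items():
        for code, value in subfields.items():
            target = crosswalk.get((tag, code))
            if target is None:
                # Unmapped subfields (here 260 $a) are silently dropped,
                # which is one way a translation loses fidelity.
                continue
            # Trim trailing cataloguing punctuation from the value.
            collected.setdefault(target, []).append(value.strip(" .,:"))
    return {element: ", ".join(values) for element, values in collected.items()}


hamlet = {
    "100": {"a": "Shakespeare, William,", "d": "1564-1616."},
    "245": {"a": "Hamlet."},
    "260": {"a": "New York :", "b": "Penguin Books,", "c": "c2003."},
}
dc_record = translate(hamlet, MARC_TO_DC)
```

With this table the date comes out as "c2003", whereas the slide's Dublin Core record shows 2003. An element-to-element mapping alone does not normalize values, so a real crosswalk would also need value-level rules.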

An example: Dublin Core to Encoded Archival Description (EAD)

Why use XSLT for crosswalks?
- It capitalizes on current trends that model structured text in XML.
- It is a reasonable solution for lightweight processes and simple semantic mappings.
So, an XSLT repository would:
- Reduce duplication of effort.
- Promote the use of standards.

Which crosswalks are equivalent? A test client If they're not equivalent, how do they differ? Which crosswalks have XML schemas that match my data?

A crosswalk pathfinder

Some problems
In the XSLT collection:
- Information needed for executing the scripts is missing.
- Undocumented XSLT scripts aren't crosswalks, as we have defined them.
- Syntax and semantics have been dissociated.
- The collection can't be mined.
In the pathfinder page:
- The documents vary in scope and granularity.
- Executable code is often difficult to locate.
- Pathfinders aren't designed for browsing and searching.

The crosswalk repository (3.0)
Components of our solution:
- Model the crosswalk as a complex object using the Library of Congress Metadata Encoding and Transmission Standard (METS).
- Assemble the records into a searchable repository built on Open Archives Initiative (OAI) standards.

A crosswalk as a METS record
- Describe the crosswalk object in the METS header.
- Assemble and identify six objects in the METS structural map:
  - The source metadata schema
  - The target metadata schema
  - The crosswalk
  - Human-readable and executable versions of each
- Associate metadata for each file in the METS Descriptive Metadata Section.
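As a rough illustration of the steps above, the following sketch builds a METS skeleton with Python's standard library. It contains a header, one descriptive metadata section per file, a file section with six files, and a structural map pointing at them. The identifiers, the `TYPE` values, the `MDTYPE="DC"` choice, and the `example.org` URLs are placeholders, and the output has not been validated against the METS schema. The project's actual records may be organized differently.

```python
import xml.etree.ElementTree as ET

METS = "http://www.loc.gov/METS/"
XLINK = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS)
ET.register_namespace("xlink", XLINK)

# Three components, each in two versions: six objects in total.
COMPONENTS = ["source-schema", "target-schema", "crosswalk"]
VERSIONS = ["human-readable", "executable"]


def m(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS}}}{tag}"


def build_crosswalk_mets(label, base_url):
    """Return a METS skeleton for one crosswalk (hypothetical IDs and URLs)."""
    ids = [f"{c}-{v}" for c in COMPONENTS for v in VERSIONS]
    mets = ET.Element(m("mets"), {"LABEL": label})
    ET.SubElement(mets, m("metsHdr"))  # describes the crosswalk object
    for fid in ids:
        # One descriptive metadata section per file; its content is omitted.
        dmd = ET.SubElement(mets, m("dmdSec"), {"ID": f"dmd-{fid}"})
        ET.SubElement(dmd, m("mdWrap"), {"MDTYPE": "DC"})
    file_grp = ET.SubElement(ET.SubElement(mets, m("fileSec")), m("fileGrp"))
    for fid in ids:
        file_el = ET.SubElement(file_grp, m("file"), {"ID": fid})
        ET.SubElement(
            file_el,
            m("FLocat"),
            {"LOCTYPE": "URL", f"{{{XLINK}}}href": f"{base_url}/{fid}"},
        )
    struct_map = ET.SubElement(mets, m("structMap"))
    root_div = ET.SubElement(struct_map, m("div"), {"TYPE": "crosswalk-object"})
    for fid in ids:
        div = ET.SubElement(root_div, m("div"), {"TYPE": fid, "DMDID": f"dmd-{fid}"})
        ET.SubElement(div, m("fptr"), {"FILEID": fid})
    return mets


record = build_crosswalk_mets("MARC to Dublin Core", "http://example.org/crosswalks")
xml_text = ET.tostring(record, encoding="unicode")
```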

OCLC's OAI repository
"The Open Archives Initiative develops and promotes interoperability standards to facilitate the efficient dissemination of content. Primary among these is the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)."
Source: oclc.org/research
- Collects publicly accessible XML-encoded metadata on a range of research projects into a searchable collection, including thesauri and Electronic Theses and Dissertations.
- Includes XML tools required to manage the data, such as XSLT scripts.
- Serves as a testbed for research on information registries, URL maintenance, and searching.
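For readers unfamiliar with OAI-PMH, a harvesting request is an HTTP request whose query string names one of six protocol verbs plus its arguments. The sketch below only builds such a URL and sends nothing. `BASE_URL` is a made-up placeholder rather than the OCLC endpoint, and `oai_request` is a helper name invented here.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; substitute the base URL of a real OAI-PMH repository.
BASE_URL = "http://repository.example.org/oai"

# The six verbs defined by OAI-PMH 2.0.
OAI_VERBS = {
    "Identify",
    "ListMetadataFormats",
    "ListSets",
    "ListIdentifiers",
    "ListRecords",
    "GetRecord",
}


def oai_request(verb, **arguments):
    """Build (but do not send) an OAI-PMH request URL."""
    if verb not in OAI_VERBS:
        raise ValueError(f"not an OAI-PMH verb: {verb}")
    return f"{BASE_URL}?{urlencode({'verb': verb, **arguments})}"


# oai_dc is the metadata format every OAI-PMH repository must support.
url = oai_request("ListRecords", metadataPrefix="oai_dc")
```

A repository that exposes its crosswalk records as METS would presumably advertise a corresponding metadata prefix through `ListMetadataFormats`; the slides do not name that prefix.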

A crosswalk METS record in the OAI repository

What the METS encoding solves
- The semantic and syntactic information required for interpreting and executing a crosswalk is collected into a single object.
- The repository is searchable by humans and automated processes.
- Services can be built on top of it.
- It encourages the development and standardization of crosswalks.
These outcomes are possible because every component in the system is a standard.

Some possible services
- Translations
- Queries: Which encodings have been done for a given metadata schema or namespace?
- Interactions with data: Given the XML schema and namespace referenced in my data, does this repository have any XSLT scripts that process it?
- Documentation: The METS crosswalk object can be associated with a given set of records to document which standards/versions/scripts were used to convert it.
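The "interactions with data" service can be sketched as a lookup keyed on the namespace of a document's root element. This is a simplification, since the slide mentions the XML schema as well as the namespace, and matching on the schema is left out. The in-memory `REGISTRY`, the `CrosswalkEntry` class, and the script file names are hypothetical stand-ins for the repository and its records.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass(frozen=True)
class CrosswalkEntry:
    source_namespace: str
    target_namespace: str
    script: str  # hypothetical XSLT file name


# Hypothetical registry contents; the namespace URIs are the published ones
# for MARCXML, Dublin Core elements 1.1, and EAD 2002.
REGISTRY = [
    CrosswalkEntry(
        "http://www.loc.gov/MARC21/slim",
        "http://purl.org/dc/elements/1.1/",
        "marc2dc.xsl",
    ),
    CrosswalkEntry(
        "http://purl.org/dc/elements/1.1/",
        "urn:isbn:1-931666-22-9",
        "dc2ead.xsl",
    ),
]


def root_namespace(xml_text):
    """Return the namespace URI of the root element, or '' if it has none."""
    tag = ET.fromstring(xml_text).tag
    return tag[1:].split("}", 1)[0] if tag.startswith("{") else ""


def scripts_for(xml_text, registry=REGISTRY):
    """List scripts whose source namespace matches the data's root namespace."""
    namespace = root_namespace(xml_text)
    return [entry.script for entry in registry if entry.source_namespace == namespace]


my_data = '<record xmlns="http://www.loc.gov/MARC21/slim"/>'
matches = scripts_for(my_data)
```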

Open issue 1: Are crosswalks a potential standard? Or just a local solution for the management of heterogeneous data?
"The crosswalk is a preliminary one aimed at transforming relatively simple METS documents"
Source: Yee and Beaubien, 2003
"Crosswalks that extend interoperability are essential so that the digital library collections can be accessible through a variety of portals and search interfaces. As more organizations share what they have learned the development of crosswalks will be better understood and more easily accomplished."
Source: Lightle and Ridgway, 2003

Open issue 2: Is XSLT the best tool for metadata translation?
- XSLT is cumbersome when there is a need for high-fidelity translations.
- More precise associations between syntax and semantics may be necessary.
- The supporting documentation required for verifiable translation is daunting:
  Metadata Schema X (versions * encodings) * Metadata Schema Y (versions * encodings)
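The product on the slide grows quickly. As an illustration with invented numbers, suppose schemas X and Y each exist in three versions and two encodings. The sketch below enumerates the source and target variants and counts the translations from X to Y that would each need documentation. The version and encoding labels are placeholders.

```python
from itertools import product


def variants(versions, encodings):
    """Every (version, encoding) combination of one metadata schema."""
    return list(product(versions, encodings))


def translation_count(x_versions, x_encodings, y_versions, y_encodings):
    """Count X-to-Y translations: (versions * encodings) for X times the same for Y."""
    x_variants = variants(x_versions, x_encodings)
    y_variants = variants(y_versions, y_encodings)
    return len(list(product(x_variants, y_variants)))


# Placeholder labels: three versions and two encodings on each side.
count = translation_count(
    ["v1", "v2", "v3"], ["enc-a", "enc-b"],
    ["v1", "v2", "v3"], ["enc-a", "enc-b"],
)
```

Here `count` is (3 * 2) * (3 * 2) = 36 for a single pair of schemas in one direction.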

Project status
- The OCLC OAI repository is accessible at: http://errol.oclc.org/xmlregistry.oclc.org.html
- Advanced searching using the SRW (Search/Retrieve Web service) protocol is currently being implemented.
- The registry is being populated with crosswalk records.
We welcome your comments and participation!

For further information
- Metadata Schema Transformation Services: http://oclc.org/research/projects/mswitch/1_schematrans.htm
- The Open Archives Initiative Project: http://oclc.org/research/projects/oai/default.htm
- Two Paths to Interoperable Metadata: http://oclc.org/research/publications/archive/2003/godbydc2003.pdf

References
- Raymond Yee and Rick Beaubien. 2003. A Preliminary Crosswalk from METS to IMS Content Packaging. Library Hi Tech.
- Kimberly Lightle and Judith S. Ridgway. 2003. Generation of XML Records across Multiple Metadata Standards. D-Lib Magazine. http://dlib.org/dlib/september03/lightle/09lightle.html