A methodology for Sharing Archival Descriptive Metadata in a Distributed Environment

Nicola Ferro and Gianmaria Silvello, Information Management Research Group (IMS), Department of Information Engineering, University of Padua, Italy

Outline: The Nature of Archives; Network of Digital Archives; Digital Libraries Technologies and Digital Archives; Encoded Archival Description Metadata Format; Nested Sets Methodology; Conclusions

Archives Archives keep the context and the network of relationships. Archives have a hierarchical structure: archival bond. Archival descriptions need to be able to express and maintain hierarchical structure and relationships.

Archival Descriptions The International Council on Archives has developed a general standard for archival description called the General International Standard Archival Description, ISAD(G). Archival descriptions produced according to the ISAD(G) standard take the form of a tree which represents the relationships among more general and more specific archive units, going from the root to the leaves of the tree. Reference: International Council on Archives. ISAD(G): General International Standard Archival Description, 2nd edition. Ottawa: International Council on Archives, 1999.

Archival Descriptive Metadata Archival descriptive metadata should meet the following three main requisites: 1. Context: archival descriptive metadata have to retain information about the context of a given record. 2. Hierarchy: archival descriptive metadata have to reflect the archive organization which is described in a multi-leveled fashion. 3. Variable Granularity: archival descriptive metadata have to facilitate access to the requested items.

Network of Digital Archives [Figure: a network connecting Archive A, Archive B, Archive C, Archive D, and Archive E.] Heterogeneity issues. Archives have a fixed tree structure. Archives must preserve their autonomy and independence. Difficulties in exchanging archival information embedded in a tree hierarchy.

Trees mapped into Sets Archive descriptions assume a tree structure. It is difficult to share trees between archives and to access a precise element of the tree without accessing the whole hierarchy.

Nested Sets Model [Figure: a fonds drawn as a set containing three sub-fonds sets, which in turn contain series sets.] Sets allow elements to be accessed with variable granularity. Through nested sets it is possible to express hierarchy and retain context information. An organization of nested sets is flexible and well suited to a distributed environment.
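A minimal sketch of the tree-to-sets idea may help here. It assumes a small hypothetical archive tree (the unit names are invented for illustration) and maps every unit to the set of units in its subtree, so that hierarchy shows up as set inclusion. It is an illustration of the general nested-sets idea, not the authors' implementation.

```python
# Hypothetical archive tree: each key is an archival unit, each value its children.
TREE = {
    "fonds": ["subfonds-1", "subfonds-2"],
    "subfonds-1": ["series-1", "series-2"],
    "subfonds-2": ["series-3"],
    "series-1": [],
    "series-2": [],
    "series-3": [],
}


def nested_sets(tree, root):
    """Map every node to the set of nodes in its subtree (itself included)."""
    sets = {}

    def visit(node):
        members = {node}
        for child in tree[node]:
            members |= visit(child)
        sets[node] = members
        return members

    visit(root)
    return sets


SETS = nested_sets(TREE, "fonds")

# A child set is a proper subset of its parent set; sibling sets are disjoint.
print(SETS["series-1"] < SETS["subfonds-1"] < SETS["fonds"])
print(SETS["subfonds-1"].isdisjoint(SETS["subfonds-2"]))
```

Each set can be handed over on its own (variable granularity), while the inclusion chain from a set up to the fonds preserves its context.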

Digital Libraries Digital Library Systems (DLSs) are the technology of choice for managing the information resources of different kinds of organizations. The need for interoperability among different systems is a compelling issue (DELOS Reference Model). Europeana, the European digital library, museum and archive, is a 2-year project that will give users direct access to some 2 million digital objects. This figure is taken from the Europeana leaflet available at: http://www.europeana.eu

OAI-PMH The Open Archives Initiative promotes interoperability through the Protocol for Metadata Harvesting (OAI-PMH). The Dublin Core metadata format is the lowest common denominator in OAI-PMH. OAI-PMH is the de facto standard for metadata exchange. It is based on the distinction between two main components: the Data Provider and the Service Provider.
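To make the Data Provider / Service Provider split concrete, here is a small sketch of how a Service Provider could build harvesting requests. The base URL is a hypothetical placeholder, and `oai_request` is a helper name introduced only for this example; the verb `ListRecords` and the arguments `metadataPrefix` and `set` come from the OAI-PMH protocol.

```python
from urllib.parse import urlencode

# Hypothetical Data Provider endpoint (an assumption for this example).
BASE_URL = "http://archive.example.org/oai"


def oai_request(verb, **arguments):
    """Build an OAI-PMH request URL for the given verb and arguments."""
    return BASE_URL + "?" + urlencode({"verb": verb, **arguments})


# Harvest all records as unqualified Dublin Core.
LIST_RECORDS = oai_request("ListRecords", metadataPrefix="oai_dc")

# Harvest only one (hypothetical) set; the colon is percent-encoded in the URL.
LIST_SET = oai_request("ListRecords", metadataPrefix="oai_dc", set="fonds:series1")

print(LIST_RECORDS)
print(LIST_SET)
```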

OAI Sets OAI sets enable logical data partitioning by defining groups of records. An OAI set is defined by three main components: 1. setSpec 2. setName 3. setDescription. The organization of OAI sets may be flat or hierarchical. Harvesting procedures: incremental and selective harvesting. Harvesting from a set which has subsets will cause the repository to return the metadata in the specified set and, recursively, in all its subsets.
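The recursive behaviour of selective harvesting can be sketched as follows. The example assumes hierarchical setSpec values written as colon-separated paths, which is the OAI-PMH convention; the record identifiers, set names, and the `harvest` function are invented for illustration and only simulate what a repository would return.

```python
# Hypothetical records, each tagged with the setSpec of the set it belongs to.
RECORDS = [
    {"identifier": "oai:example:1", "setSpec": "fonds"},
    {"identifier": "oai:example:2", "setSpec": "fonds:subfonds1"},
    {"identifier": "oai:example:3", "setSpec": "fonds:subfonds1:series1"},
    {"identifier": "oai:example:4", "setSpec": "fonds:subfonds2"},
]


def harvest(records, set_spec):
    """Return identifiers of records in set_spec or in any of its subsets."""
    prefix = set_spec + ":"
    return [
        record["identifier"]
        for record in records
        if record["setSpec"] == set_spec or record["setSpec"].startswith(prefix)
    ]


# Harvesting a sub-fonds returns its own record and those of its series,
# but nothing from the sibling sub-fonds.
print(harvest(RECORDS, "fonds:subfonds1"))
```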

Digital Libraries and Digital Archives The use of OAI-PMH is not widespread in the archival context. Dublin Core metadata format seems to flatten out the archive structure. EAD: Encoded Archival Description. EAD is a standard defined by The Library of Congress in partnership with the Society of American Archivists. EAD reflects and emphasizes ISAD(G).

EAD Structure and Puzzles

    <ead>
      <eadheader> [...] </eadheader>
      <archdesc level="fonds">
        [...]
        <did> [...] </did>
        <dsc>
          [...]
          <c01> [...] </c01>
          <c01>
            [...]
            <c02> [...] </c02>
          </c01>
        </dsc>
      </archdesc>
    </ead>

Automatic processing: There are several degrees of freedom in tagging practice. Levels: The level of description needs to be inferred by navigating the upper components. Size: Sharing and searching archival descriptions may be made difficult by the large size of EAD files and their deep hierarchical structure. User needs: Users are often interested in item-level information, which is typically buried very deep in the hierarchy and difficult to reach. Archival metadata requirements: EAD complies with both the context and hierarchy requirements, but it disregards the variable granularity one.
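The "Levels" and "User needs" points can be illustrated with a short sketch that walks an EAD-like tree and records, for each component, the chain of ancestors that has to be traversed to reach it. The sample document, its titles, and the helper names are assumptions made for this example; real EAD files carry namespaces and many more elements.

```python
import xml.etree.ElementTree as ET

# A tiny EAD-like document; titles and level values are invented.
EAD = """
<ead>
  <eadheader/>
  <archdesc level="fonds">
    <did><unittitle>Example fonds</unittitle></did>
    <dsc>
      <c01 level="subfonds"><did><unittitle>Sub-fonds A</unittitle></did></c01>
      <c01 level="subfonds">
        <did><unittitle>Sub-fonds B</unittitle></did>
        <c02 level="series"><did><unittitle>Series B1</unittitle></did></c02>
      </c01>
    </dsc>
  </archdesc>
</ead>
"""


def is_component(element):
    """True for <archdesc>, unnumbered <c>, and numbered <c01>, <c02>, ..."""
    tag = element.tag
    return tag in ("archdesc", "c") or (tag.startswith("c") and tag[1:].isdigit())


def component_paths(element, ancestors=()):
    """Yield (path of titles from the root, level attribute) per component."""
    if is_component(element):
        title = element.findtext("did/unittitle", default="(untitled)")
        ancestors = ancestors + (title,)
        yield ancestors, element.get("level")
    for child in element:
        yield from component_paths(child, ancestors)


PATHS = list(component_paths(ET.fromstring(EAD)))
for path, level in PATHS:
    print(" > ".join(path), "|", level)
```

Reaching "Series B1" requires parsing the whole file and descending through every enclosing component, which is the access cost the slide points at.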

Benefits of the Nested Sets Methodology The methodology addresses the shortcomings of EAD when it is used in a distributed environment and when variable granularity access to the resources is required. EAD items are mapped into separate DC metadata records, which are shareable and natively supported by OAI-PMH. Context and hierarchy are expressed in a straightforward manner by exploiting native functionalities of OAI-PMH, leveraging the role of OAI sets. This approach keeps archival metadata independent of the original EAD file without losing any context information. The approach can also be applied independently of the EAD standard; indeed, archival description metadata can be created from scratch by exploiting OAI sets and DC records.

Nested Sets Methodology

Nested Sets Methodology Internal nodes are mapped into sets.
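The slide's figure is not available, so the following is only one possible reading of the mapping, under stated assumptions: every internal node becomes an OAI set whose setSpec is the colon-separated path from the root, and every leaf becomes a DC-like record placed in the set of its parent. The tree, the unit names, and `map_tree` are hypothetical.

```python
# Hypothetical archive tree as (name, children) pairs.
ARCHIVE_TREE = ("fonds", [
    ("subfonds1", [("series1", []), ("series2", [])]),
    ("subfonds2", []),
])


def map_tree(node, parent_spec=None):
    """Return (setSpecs for internal nodes, records for leaves)."""
    name, children = node
    set_specs, records = [], []
    if children:
        # Internal node: becomes a set nested under its parent's setSpec.
        spec = name if parent_spec is None else parent_spec + ":" + name
        set_specs.append(spec)
        for child in children:
            child_specs, child_records = map_tree(child, spec)
            set_specs += child_specs
            records += child_records
    else:
        # Leaf: becomes a record that belongs to the set of its parent.
        records.append({"title": name, "setSpec": parent_spec})
    return set_specs, records


SET_SPECS, DC_RECORDS = map_tree(ARCHIVE_TREE)
print(SET_SPECS)
for record in DC_RECORDS:
    print(record)
```

In this sketch the setSpec of a record carries its context, so a record can be exchanged on its own without shipping the whole tree.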

Conclusions We defined the requisites which must be satisfied in order to obtain shareable metadata and to retain all the fundamental characteristics of archival resources. We presented a methodology for creating shareable archival descriptive metadata which exploits the synergy between OAI-PMH and DC. This methodology allows archival descriptions to be shared in a distributed environment. EAD metadata can be mapped into our methodology without losing information. The methodology can also be applied backwards, generating a new EAD file whose structure differs slightly from the original one but which carries the same informational content.

Conclusions Thank you! Questions? Gianmaria Silvello Department of Information Engineering University of Padova silvello@dei.unipd.it