Adding OAI ORE Support to Repository Platforms

Similar documents
Introducing Manakin: Overview & Architecture. Scott Phillips, Cody Green, Alexey Maslov, Adam Mikeal, and John Leggett

A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Comparing Open Source Digital Library Software

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

OPENAIRE FP7 POST-GRANT OPEN ACCESS PILOT

The RAMLET project Use cases

Building for the Future

Building a Digital Repository on a Shoestring Budget

Using DSpace for Digitized Collections. Lisa Spiro, Marie Wise, Sidney Byrd & Geneva Henry Rice University. Open Repositories 2007 January 23, 2007

The OAIS Reference Model: current implementations

Data Exchange and Conversion Utilities and Tools (DExT)

Registry Interchange Format: Collections and Services (RIF-CS) explained

Research Data Repository Interoperability Primer

Fedora Commons Update

Ing. José A. Mejía Villar M.Sc. Computing Center of the Alfred Wegener Institute for Polar and Marine Research

The Design of a DLS for the Management of Very Large Collections of Archival Objects


OAI-PMH. DRTC Indian Statistical Institute Bangalore

PROCESS HISTORY METADATA PEGGY GRIESINGER NATIONAL DIGITAL STEWARDSHIP RESIDENT MUSEUM OF MODERN ART DECEMBER 4 T H, 2014

Problem: Solution: No Library contains all the documents in the world. Networking the Libraries

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

A service-oriented national e-thesis information system and repository

An overview of the OAIS and Representation Information

Assessment of product against OAIS compliance requirements

B2SAFE metadata management

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University

Igitur Archive: Institutional Repository Utrecht University. May , Martin Slabbertje

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

The Metadata Challenge:

CARARE Training Workshops

Building a Digital Library Software

Wendy Thomas Minnesota Population Center NADDI 2014

The Ohio State University's Knowledge Bank: An Institutional Repository in Practice

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Session Two: OAIS Model & Digital Curation Lifecycle Model

Its All About The Metadata

Chuck Cartledge, PhD. 25 February 2018

Presented by Dr Joanne Evans, Centre for Organisational and Social informatics Faculty of IT, Monash University Designing for interoperability

How to contribute information to AGRIS

Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008

Repository Interoperability

Survey of Existing Services in the Mathematical Digital Libraries and Repositories in the EuDML Project

Architecture domain. Leonardo Candela. DL.org Autumn School Athens, 3-8 October th October 2010

Creating a National Federation of Archives using OAI-PMH

The DART-Europe E-theses Portal

Development of an Ontology-Based Portal for Digital Archive Services

An aggregation system for cultural heritage content

Long-term digital preservation of UNSWorks

Developing an Institutional Repository Service in Chinese Academy of Sciences

SECTION 10 EXCHANGE PROTOCOL

A service-oriented national e-theses information system and repository

Chinese-European Workshop on Digital Preservation. Beijing (China), July 14 16, 2004

Report on Enhancing Interoperability between existing Open Access Publication Infrastructures

ACDH AUSTRIAN CENTRE FOR DIGITAL HUMANITIES

Exploring Open Source Solutions in the Management of ETD Processes CHETAN S SONAWANE KMC COLLEGE, INDIA

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics

National Documentation Centre Open access in Cultural Heritage digital content

Persistent identifiers, long-term access and the DiVA preservation strategy

Purpose: A dynamic approach to make legacy databases like CDS/ISIS, interoperable with OAI-compliant digital libraries (DL).

oatd.org Discovery for Open Access Theses and Dissertations An ASERL Webinar, October 15, 2013 These slides:

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure

Geospatial Multistate Archive and Preservation Partnership Metadata Comparison

OAI-ORE. A non-technical introduction to: (

The adore Federation Architecture

SNHU Academic Archive Policies

Edinburgh Research Explorer

Search Interoperability, OAI, and Metadata

Bridging Continents. Kazu Yamaji National Institute of Informatics JAPAN

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Showing it all a new interface for finding all Norwegian research output

DSpace Fedora. Eprints Greenstone. Handle System

Interoperability for Digital Libraries

OpenData Hackathon Δημόσια, Ανοικτά Δεδομένα H εμπειρία του Εθνικού Κέντρου Τεκμηρίωσης

Non-text theses as an integrated part of the University Repository

Assessment of product against OAIS compliance requirements

Fedora Relationships and Information Network Overlays. CS 431 April 19, 2006 Carl Lagoze Cornell University

ORCA-Registry v2.4.1 Documentation

DA-NRW: a distributed architecture for long-term preservation

The Case for Interoperability for Open Access Repositories

Fedora Commons: Taking on the Challenge of the Next Generation of Scholarly Communication

Signed metadata : method and application

Digital Preservation with Special Reference to the Open Archival Information System (OAIS) Reference Model: An Overview

Infrastructure for the UK

Metadata Harvesting Framework

RDDS: Metadata Development

Digital Library Interoperability. Europeana

SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH

Introduction to Federico 2.0 and Fedora Commons

ProQuest Dissertations and Theses Overview. Austin McLean and Marlene Coles CGS Summer Workshop, July 2017

An Architecture to Share Metadata among Geographically Distributed Archives

Institutional repositories: description of VITAL as an example of a Fedora-based digital assets management system.

The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data

The Open Archives Initiative in Practice:

Richard Marciano Alexandra Chassanoff David Pcolar Bing Zhu Chien-Yi Hu. March 24, 2010

Research Data Management and Institutional Repositories

BPMN Processes for machine-actionable DMPs

Transcription:

Adding OAI ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library OR 09

Texas Digital Library Use Case for OAI OREO Overview Mapping ORE model to DSpace architecture Implementation ti Results and Implications

Texas Digital Library State wide initiative Eighteen members Public/Private Small/Medium/Large /L

Electronic Theses and Dissertations Federated Collection Built on top of DSpace/Manakin

Current Federation Method Performed via scripted ingest process New batch every semester Manual corrections to existing content

Replacement Requirements Perform maintenance automatically Detect changes in existing content Support interchange of metadata and content

Harvesting Solution Use the Open Archives Initiative Protocol for Metadata Harvesting Member institutions as data providers TDL Federated Repository as a service provider * Open Archives Initiative Protocol for Metadata Harvesting http://www.openarchives.org/pmh/

OAI PMH, advantages Ubiquitous Supports selective harvesting Tracks changes Can be automated

OAI PMH, obstacles No existing harvesting solution for DSpace Supports harvesting of metadata specifically

Disseminating content How do you disseminate content through a metadata harvesting protocol? Wrap it in a packaging format Include the metadata Encode the references to the files Harvest the package

METS, advantages Metadata Encoding and Transmission Standard Maintained by the Library of Congress Mature standard Widely adopted d * Metadata Encoding and Transmission Standard, Library of Congress http://www.loc.gov/standards/mets/

Packaging, disadvantages Complete packaging format Open to interpretation Ambiguities at the OAI PMH layer

OAI ORE ORE Open Archives Initiative Object Reuse and Exchange defines dfi standards d for the description i and exchange of aggregations of Web resources. Specialized Simple * Open Archives Initiative Object Reuse and Exchange http://www.openarchives.org/ore/

Mapping DSpace to OAI ORE ORE Abstract Data Model DSpace architecture The Mapping

ORE Data Model Aggregations Aggregated Resources Resource Maps

Aggregation (A) Describes a setof resources Conceptual construct

Aggregated Resource (AR) Object of interest Part of an aggregation Can itself be an aggregation

Aggregated Resource (AR) Object of interest Part of an aggregation Can itself be an aggregation

Resource Map (ReM) Describes an aggregation Enumerates its aggregated resources Can be serialized in RDF or Atom XML

DSpace Model v1.x Communities Collections Items Bundles Bitstreams

ORE DSpace

Mapping

Mapping

Mapping

Bundles?

Bundles, Potential Options Bundles as Aggregations of Bitstreams Bundles as filters for Aggregated Resources Bundles as DSpace specific metadata

Bundles, Observations By default, specialized for internal tasks Extendible for any use Obscured from the end user

DSpace Bundles

Serialization in Atom

ORE Dissemination Implementation ORE Harvesting Automation

Interfacing with DSpace Web UI LNI and SWORD Ingest and export scripts Crosswalks Ingestion Dissemination

ORE Dissemination Crosswalk Requires: A DSpace Item Produces: Atom serialized ORE ReM

ORE Dissemination via OAI PMH Dissemination crosswalk produces ORE ReMs from DSpace Items OAI PMH provider disseminates them

ORE Harvesting Item level ORE ReM interpreter Collection level OAI PMH harvester Repository level harvest scheduler

ORE Ingestion Crosswalk Requires: A DSpace Item Atom serialized ORE ReM Produces: A DSpace Item with Bitstreams created from AR s

OAI PMH Harvester Queries remote OAI PMH providers Processes responses as individual records Implemented at Collection level

Collection Settings Source of collection s content OAI PMH provider information Harvesting Level

Collection Source

OAI PMH Settings OAI PMH Provider OAI Set Id DMD Format

Harvest Level

Harvesting a Collection Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Harvest Metadata Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Metadata Replicated Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 1: Metadata Only Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Harvest ORE ReMs Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 2: Metadata + Content Ref s Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 2: Metadata + Content Ref s Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 3: Metadata + Content Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Case 3: Metadata + Content Local collection (OAI-PMH harvester) Remote collection (OAI-PMH provider)

Harvest Scheduling System Monitors harvested collections Starts harvests at regular intervals Alerts administrators i of errors

The Primary Use Case Results TDL in General The Greater Web The Greater Web Community

Harvesting using PMH+ORE Federated ETD collection currently in pre production production at TDL Addresses primary requirements Performs maintenance automatically Detects changes in existing content Supports interchange of metadata and content

Other Possibilities Specialized DSpace instances Flexible repository architecture Interoperability with other repository systems Large-scale ETD repositories: A case study of a digital library application, Adam Mikeal, James Creel, Alexey Maslov, Scott Phillips, John Leggett, Mark McFarland. JCDL 2009

Current Priorities Live deployment at TDL Release to the open source community Integration into DSpace 1.6

National Leadership Grant #LG 05 07 0095 07 07 0095 07

Questions?