http://resolver.caltech.edu/caltechlib:spoiti05

Caltech CODA
- http://coda.caltech.edu
- CODA: Collection of Digital Archives
- Caltech Scholarly Communication
- 15 production archives
- 3102 records
- Theses, technical reports, conference proceedings, oral histories, refereed articles

We Want Federation
- Search all archives at once (federated search)
- Browse all authors, and all records from a given author, in one place (electronic CV)

OAI-PMH Can Help
- Open Archives Initiative Protocol for Metadata Harvesting
- http://www.openarchives.org
- Two-tier model: Data Providers and Service Providers
- Service Providers harvest metadata from Data Providers via the OAI protocol

Data Providers Expose Metadata
- All records must be described by a minimal set of metadata:
  - Author
  - Title
  - Abstract
  - Submission date
  - URL to record
  - Unique identifier
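
As a rough illustration of the minimal metadata set above, the sketch below serializes one record as unqualified Dublin Core (oai_dc). The field-to-element mapping (author to dc:creator, abstract to dc:description, and so on) is a common convention assumed here, not something the slides specify. The `Record` class, the `to_oai_dc` helper, and the sample values are hypothetical.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass

OAI_DC = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("oai_dc", OAI_DC)
ET.register_namespace("dc", DC)


@dataclass
class Record:
    """Hypothetical container for the minimal metadata set."""
    author: str
    title: str
    abstract: str
    submitted: str   # submission date, e.g. "2005-01-01"
    url: str         # URL to the full record
    identifier: str  # unique identifier (carried in the OAI header, not in dc)


def to_oai_dc(record: Record) -> str:
    """Serialize the descriptive fields as an oai_dc:dc element (assumed mapping)."""
    root = ET.Element(f"{{{OAI_DC}}}dc")
    mapping = (
        ("creator", record.author),
        ("title", record.title),
        ("description", record.abstract),
        ("date", record.submitted),
        ("identifier", record.url),
    )
    for tag, value in mapping:
        ET.SubElement(root, f"{{{DC}}}{tag}").text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = Record("Doe, Jane", "A Sample Thesis", "An abstract.",
                    "2005-01-01", "http://example.org/1", "oai:example.org:1")
    print(to_oai_dc(sample))
```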

Service Providers
- Metadata is routinely harvested and stored in a central database
- The central database is the foundation for federated services
- Examples: DP9, Celestial, Google Scholar

Federation using OAI
- A collection of records must be described with a common, minimal set of metadata
- Data Provider tools expose the metadata over HTTP using OAI-PMH
- Service Providers use OAI-PMH to harvest Data Providers, index the content, and produce a new service (such as search) or act as Data Providers themselves

Data Provider Requirements
- Expose metadata by responding to simple commands
- Respond using XML over HTTP
- Commands: Identify, GetRecord, ListIdentifiers, ListMetadataFormats, ListRecords, ListSets
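
The commands listed above are sent as ordinary HTTP requests with a `verb` parameter. A minimal sketch of building such request URLs follows. The base URL `http://example.org/oai` and the helper name `build_request` are placeholders for illustration, not part of the Caltech setup.

```python
from urllib.parse import urlencode

# The six commands a Data Provider must answer.
VERBS = {
    "Identify", "GetRecord", "ListIdentifiers",
    "ListMetadataFormats", "ListRecords", "ListSets",
}


def build_request(base_url: str, verb: str, **arguments: str) -> str:
    """Return the request URL for one OAI-PMH command (hypothetical helper)."""
    if verb not in VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    return base_url + "?" + urlencode({"verb": verb, **arguments})


if __name__ == "__main__":
    base = "http://example.org/oai"  # placeholder base URL
    print(build_request(base, "Identify"))
    print(build_request(base, "ListRecords", metadataPrefix="oai_dc"))
```

The response to each request is an XML document, so a harvester only needs an HTTP client and an XML parser.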

OAI Repository Explorer
- Helps evaluate and validate a Data Provider implementation
- Provide an OAI Base URL and send it queries
- Example Base URL: http://caltechcstr.library.caltech.edu/perl/oai2
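
To give a flavor of what validating a Data Provider can involve, the sketch below checks that an Identify response carries the elements OAI-PMH 2.0 requires. This is only one simple check written for illustration. It is not how the Repository Explorer itself works, and `missing_identify_fields` is a hypothetical helper that operates on response text you have already fetched.

```python
import xml.etree.ElementTree as ET

OAI = "{http://www.openarchives.org/OAI/2.0/}"

# Elements that an OAI-PMH 2.0 Identify response must contain.
REQUIRED = (
    "repositoryName", "baseURL", "protocolVersion", "adminEmail",
    "earliestDatestamp", "deletedRecord", "granularity",
)


def missing_identify_fields(xml_text):
    """Return the required Identify elements that are absent or empty."""
    root = ET.fromstring(xml_text)
    identify = root.find(OAI + "Identify")
    if identify is None:
        return list(REQUIRED)
    return [
        name for name in REQUIRED
        if not (identify.findtext(OAI + name) or "").strip()
    ]
```

An empty list means the response passed this particular check; it says nothing about the other five commands.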

Data Provider Tools
- http://www.openarchives.org/tools/tools.html
- Currently 26 tools freely available to help implement OAI
- Most of the implementation burden is placed on Service Providers, not Data Providers

Eprints at Caltech
- Eprints.org is a scholarly communication archiving software package
- It is also an OAI Data Provider
- All Caltech CODA archives are Data Providers
- Most run on eprints.org; Theses runs on VT ETDdb

The Problem
- Each Service Provider must harvest each of our 15 archives individually
- This discourages participation
- It is unnecessary, provided we can build a local Service Provider (a union catalog of all of CODA)

The Solution
- Design the Caltech CODA Union Catalog
- Locally harvest each archive into a central database using OAI-PMH
- Implement this database as an OAI Data Provider
- Instruct all outside harvesters to use this one Data Provider rather than the 15 individually
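
The local harvesting step might look roughly like the sketch below: issue ListRecords with the oai_dc format, collect the records, and follow resumption tokens until the Data Provider reports no more pages. This is a minimal sketch, not the harvester Caltech built. The function names are hypothetical, and `fetch` is passed in as a parameter so the loop can be exercised without a network connection.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import urlopen

OAI = "{http://www.openarchives.org/OAI/2.0/}"
DC = "{http://purl.org/dc/elements/1.1/}"


def http_fetch(url):
    """Default fetcher: return the raw response body for a request URL."""
    with urlopen(url) as response:
        return response.read()


def parse_page(xml_text):
    """Extract records and the resumption token from one ListRecords page."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iter(OAI + "record"):
        header = rec.find(OAI + "header")
        if header is None or header.get("status") == "deleted":
            continue  # skip records the Data Provider marks as deleted
        entry = {"identifier": header.findtext(OAI + "identifier")}
        for field in ("creator", "title", "description", "identifier"):
            key = "url" if field == "identifier" else field
            entry[key] = [e.text for e in rec.iter(DC + field)]
        records.append(entry)
    token = root.findtext(f".//{OAI}resumptionToken")
    return records, (token or "").strip() or None


def harvest(base_url, fetch=http_fetch):
    """Yield every oai_dc record from one archive, page by page."""
    params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    while True:
        records, token = parse_page(fetch(base_url + "?" + urlencode(params)))
        yield from records
        if token is None:
            return
        # A resumption token replaces all other arguments on follow-up requests.
        params = {"verb": "ListRecords", "resumptionToken": token}
```

Running `harvest` once per archive base URL and writing the results into one store gives the central database described above.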

EPrints.org as SP
- Build a harvesting routine to feed metadata into another instance of eprints.org using OAI-PMH
- Eprints.org does the rest:
  - Browse screens
  - Search interface
  - Data Provider

End Result
- The Caltech Union Catalog will contain all 3100 CODA records in one database
- The metadata describing the records will be only the oai_dc subset (author, title, abstract, unique ID, URL to target)
- Each record in the union catalog will contain a link back to the full record in the harvested archive
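
One way to picture the union catalog is a single table keyed on the unique identifier, with a column that links back to the source archive. The sketch below uses SQLite purely for illustration; the plan in these slides is to use another eprints.org instance, and the table name, column names, and helper functions here are all assumptions.

```python
import sqlite3


def open_catalog(path=":memory:"):
    """Create (or open) a hypothetical union catalog table."""
    db = sqlite3.connect(path)
    db.execute(
        """CREATE TABLE IF NOT EXISTS union_catalog (
               oai_id     TEXT PRIMARY KEY,  -- unique identifier
               archive    TEXT NOT NULL,     -- which archive it came from
               author     TEXT,
               title      TEXT,
               abstract   TEXT,
               source_url TEXT NOT NULL      -- link back to the full record
           )"""
    )
    return db


def store(db, archive, record):
    """Insert a harvested record, replacing any earlier copy with the same ID."""
    db.execute(
        "INSERT OR REPLACE INTO union_catalog VALUES (?, ?, ?, ?, ?, ?)",
        (
            record["identifier"], archive, record.get("author"),
            record.get("title"), record.get("abstract"), record["url"],
        ),
    )
    db.commit()
```

Keying on the identifier means a re-harvest updates existing rows instead of duplicating them.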

End Result
- There will be one place for all harvesters to obtain Caltech records, instead of 15
- Use eprints to provide the local federated search interface across all our archives
- Author browse pages (like a CV)
- Centralized RSS (eprints.org supports this)
- Centralized access statistics

Challenges
- Centralized Browse by Author requires an author name identifier (authority)
- Implement an OAI harvester to feed the Union Catalog (based on eprints.org)
- Customize eprints.org to import records provided by this harvester
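
The author authority problem can be pictured as follows: the same person appears under several name forms across archives, so a browse page needs a stable identifier to group them. The sketch below is purely illustrative. The authority table, the identifier scheme, and the names are all invented, and the slides do not say how Caltech intends to build its authority file.

```python
from collections import defaultdict

# Hypothetical authority file: name variant -> stable author identifier.
AUTHORITY = {
    "Doe, Jane": "author:0001",
    "Doe, J.": "author:0001",
    "J. Doe": "author:0001",
    "Roe, Richard": "author:0002",
}


def browse_by_author(records, authority):
    """Group record identifiers under one key per author."""
    pages = defaultdict(list)
    for record in records:
        for name in record["creator"]:
            name = name.strip()
            # Names missing from the authority file are kept apart for review.
            key = authority.get(name, "unresolved:" + name)
            pages[key].append(record["identifier"])
    return dict(pages)
```

Without such a mapping, "Doe, Jane" and "Doe, J." would produce two separate browse pages for one person.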

Summary
- Using OAI-PMH for federated searching requires three steps:
  1. Define a minimal metadata set for all records
  2. Wrap a Data Provider service around each collection of records to expose metadata
  3. Harvest metadata centrally, then produce a service (such as search and browse)
- Skip step three if you're satisfied with existing OAI Service Providers (DP9, Google, Celestial, etc.)
