Search Interoperability, OAI, and Metadata

Similar documents
Integrating Access to Digital Content

Metadata aggregation for digital libraries

The Sunshine State Digital Network

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

The Design of a DLS for the Management of Very Large Collections of Archival Objects

DLF Update on Metadata Services

IMLS National Leadership Grant LG "Proposal for IMLS Collection Registry and Metadata Repository"

IMLS Digital Collections and Content

Aquifer Gap Analysis Task Group Final Report August 15, 2006

Hello, I m Melanie Feltner-Reichert, director of Digital Library Initiatives at the University of Tennessee. My colleague. Linda Phillips, is going

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Creating actionable URLs for the DLFAquifer asset action portal

A methodology for Sharing Archival Descriptive Metadata in a Distributed Environment

What Is OAI PMH Good For?

Harvesting Metadata Using OAI-PMH

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

Ponds, Lakes, Ocean: Pooling Digitized Resources and DPLA. Emily Jaycox, Missouri Historical Society SLRLN Tech Expo 2018

IMLS Digital Collections and Content

AN EXPLORATORY STUDY OF THE DESCRIPTION FIELD IN THE DIGITAL PUBLIC LIBRARY OF AMERICA

Open Archive Solutions to Traditional Archive/Library Cooperation By DONATELLA CASTELLI

Creating Metadata Best Practices for CONTENTdm Users

DLF Aquifer: The Final Story. Katherine Kott, Susan Harum, Kat Hagedorn, Tom Habing DLF Spring Forum, Raleigh, NC May 5, 2009

Building Virtual Collections

IMLS Digital Collections and Content

Registry Interchange Format: Collections and Services (RIF-CS) explained

Interoperability and Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)

Image Database Task Force Final Report to the Board

MuseKnowledge Hybrid Search

An introduction to OAI-PMH

Creating a National Federation of Archives using OAI-PMH

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Getting Started with the Digital Commonwealth. Robin L. Dale Director of Digital & Preservation Services LYRASIS

Creative Commons Attribution-NonCommercial-ShareAlike 2.5 License.

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

Introduction to Archivists Toolkit Version (update 5)

OAI-PMH. DRTC Indian Statistical Institute Bangalore

The MEG Metadata Schemas Registry Schemas and Ontologies: building a Semantic Infrastructure for GRIDs and digital libraries Edinburgh, 16 May 2003

Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008

OpenAIRE Guidelines Promoting Repositories Interoperability and Supporting Open Access Funder Mandates

Metadata Sharing Policy


SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH

Sharing Archival Metadata MODULE 20. Aaron Rubinstein

Europeana, the prototype EDLfoundation Europeana Network Europeana, vs. 1.0 ThoughtLab Technical requirements

Alphabet Soup: A Metadata Overview Melanie Schlosser Metadata Librarian

Developing Seamless Discovery of Scholarly and Trade Journal Resources Via OAI and RSS Chumbe, Santiago Segundo; MacLeod, Roddy

The Ohio State University's Knowledge Bank: An Institutional Repository in Practice

How to contribute information to AGRIS

Problem: Solution: No Library contains all the documents in the world. Networking the Libraries

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University

OAI-PMH for dummies: how to build an institutional repository with limited resources?

OAI/PMH Metadata Conformance to DLF/ Aquifer MODS Guidelines

12/16/2010 Best Practices for CONTENTdm and other OAI- PMH compliant repositories creating shareable metadata

Research on the Interoperability Architecture of the Digital Library Grid

For those of you who may not have heard of the BHL let me give you some background. The Biodiversity Heritage Library (BHL) is a consortium of

Metadata: The Theory Behind the Practice

The Open Archives Initiative and the Sheet Music Consortium

Metadata Standards and Applications

OAI Static Repositories (work area F)

Metadata for Digital Collections: A How-to-Do-It Manual

Draft for discussion, by Karen Coyle, Diane Hillmann, Jonathan Rochkind, Paul Weiss

Building for the Future

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics

International Implementation of Digital Library Software/Platforms 2009 ASIS&T Annual Meeting Vancouver, November 2009

Digibess: thanks Islandora! Arcidosso Italy March, 20-22, Giancarlo Birello, Anna Perin IT office and Library CNR-Ceris

Developing Shareable Metadata for DPLA

Introduction. Search and Discovery across Collections: the IMLS Digital Collections and Content Project

MINT METADATA INTEROPERABILITY SERVICES

Search and Discovery across Collections -- the IMLS Digital Collections and Content Project

The Scottish Collections Network: landscaping the Scottish common information environment. Gordon Dunsire

Is Quality Metadata Shareable Metadata? The Implications of Local Metadata Practices for Federated Collections

Comparing Curricula for Digital Library. Digital Curation Education

Nuno Freire National Library of Portugal Lisbon, Portugal

Promoting shareability: Metadata activities of the DLF Aquifer initiative

EXTENDING OAI-PMH PROTOCOL WITH DYNAMIC SETS DEFINITIONS USING CQL LANGUAGE

Increasing access to OA material through metadata aggregation

Adding OAI ORE Support to Repository Platforms

Ontology Servers and Metadata Vocabulary Repositories

Metadata Cataloging. regarding items. For the assignment, I chose to outline some fields from three different

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

Research Data Repository Interoperability Primer

Guidelines for Developing Digital Cultural Collections

Networking European Digital Repositories

NDL Search -future development- Yasunao Kobayashi Assistant Director Library Support Division, Kansai-kan of the National Diet Library

The Open Archives Initiative in Practice:

Comparing Open Source Digital Library Software

ACDH AUSTRIAN CENTRE FOR DIGITAL HUMANITIES

The Biblioteca de Catalunya and Europeana

OAI-ORE experiments at the University of Illinois Library at Urbana-Champaign

The MetaArchive of Southern Digital Culture. Building a Multi-Institutional Digital Preservation Network

On Being a Hub: Some Details behind Providing Metadata for the Digital Public Library of America

Its All About The Metadata

Institutional repositories: description of VITAL as an example of a Fedora-based digital assets management system.

Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH. Chuck Koscher Technology Director, CrossRef

Metadata Harvesting Framework

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Guidelines for preparing a Z39.50/SRU target to enable metadata harvesting

Transcription:

Search Interoperability, OAI, and Metadata An Introduction to the OAI Protocol for Metadata Harvesting Sarah Shreeves University of Illinois at Urbana-Champaign November 30, 2006 This work is licensed under the Creative Commons AttributionNonCommercial-ShareAlike 2.5 License.

Scenario: An undergraduate is writing a paper comparing immigration in the early 20th century to immigration now and has to include a variety of primary sources

IMLS funded digital collections with relevant content The problem: The user has to access each collection individually. Wastes time and makes it harder to get work done. A partial solution: The OAI Protocol for Metadata Harvesting provides a relatively low barrier means for integrated access to the metadata describing items in these collections.

Outline Search interoperability basics What the OAI protocol is & what it is not Examples of OAI enabled services How it works (basically) Challenges for data / service providers

Search interoperability the ability to perform a search over diverse sets of metadata records and obtain meaningful results. Priscilla Caplan Metadata Fundamentals for All Librarians

Keys to Search Interoperability Communication protocol (Z39.50, OAI, etc.) Organizational commitment Standards Standards And more Standards

Sharing metadata: Federated search The distributed databases are searched directly. <title>my resource</ title> <date>04 Mill? <title>my resource</ title> <date>04 <title>my resource</ title> <date>04 For Example: Z39.50, SRU/SRW

Sharing metadata: Data aggregation The user searches a pre-aggregated database of metadata from diverse sources. Mill? <title>my resource</ title> <date>04 For Example: Search engines, union catalogs, OAI

Why share metadata? Benefits to users One-stop searching Aggregation of subject-specific resources Benefits to institutions Increased exposure for collections Broader user base Bringing together of distributed collections Don t expect users will know about your collection and remember to visit it.

Examples of OAI Service Providers OAIster: http://oaister.umdl.umich.edu/o/oaister/ Engineering, Computer Science, and Physics: http://g118.grainger.uiuc.edu/engroai/ CIC Metadata Portal: http://nergal.grainger.uiuc.edu/cgi/b/bib/oaister IMLS Digital Collections and Content: http://imlsdcc.grainger.uiuc.edu/

The OAI-PMH is a tool Moves metadata (not content for the most part yet) from a data provider to a service provider (or harvester) A set of rules that defines the communication between two systems (like FTP and HTTP) Facilitates the aggregation of metadata (like a union catalog) Developed in 2001 out of the eprint/pre-print community

Basic OAI-PMH Concepts Aggregated search rather than Federated search OAI-PMH based upon HTTP and XML Data providers support OAI PMH as a means to expose metadata Service providers harvests metadata from data providers via the OAI-PMH OAI-PMH requires use of simple Dublin Core BUT supports and encourages use of other metadata schemas

Sample OAI Request

OAI-PMH is not. Metadata A search tool A database Open Access

UIUC Library and OAI Early testers of the protocol in 2000 and 2001 Received Mellon grant in 2001 in first wave of establishing the protocol and have since received several grants to build OAI aggregations Currently have data providers for CONTENTdm, IDEALS, Archives, Aerial Photographs, and others. Have been active in the continued development of the protocol and associated activities since Static repository development Best practices for OAI which led to an IMLS training grant Implementation Guidelines for Shareable MODS Records Will be working on the next initiative out of the OAI: ORE (Object Reuse and Exchange): http://www.openarchives.org/ore/ Tim Cole and Muriel Foulonneau currently working on a book on OAI

Metadata challenge the ability to perform a search over diverse sets of metadata records and obtain meaningful results. Priscilla Caplan Metadata Fundamentals for All Librarians

OAI Dublin Core DC is OAI s lowest common denominator BUT OAI supports & encourages use of other community-driven metadata schemas

Metadata Interoperability Semantics Content rules How are values for the metadata elements selected and represented? Syntax What is the metadata format used? Mapping from one format to another How are the metadata elements encoded in machine readable form? Documentation

Dublin Core record retrieved via the OAI Protocol What does this record describe? identifier: http://name.university.edu/ic-fish3icx0802]1004_112 publisher: Museum of Zoology, Fish Field Notes format: jpeg rights: These pages may be freely searched and displayed. Permission must be received for subsequent distribution in print or electronically. type: image subject: 1926-05-18; 1926; 0812; 18; Trib. to Sixteen Cr. Trib. Pine River, Manistee R.; JAM26-460; 05; 1926/05/18; R10W; S26; S27; T21N language: UND source: Michigan 1926 Metzelaar, 1926--1926; description: Flora and Fauna of the Great Lakes Region

Dublin Core record harvested via OAI How about this one? title: subject: subject: subject: subject: subject: publisher: date: type: identifier: relation: relation: relation: relation: relation: (Woman Holding a Pie) LNG42122.5 Berkeley; male; outdoors; yard; stair Dorothea Lange Collection The War Years (1942-1944) Office of War Information (OWI) Woman Holding a Pie Museum of [state] 1944 image http://www.orgname.org/idnumber http://orgname.org/findaid/idnumber id:/13030/tf9779p783 http://www.orgname.org/ http://findaid.org.org/findaid/... http://www.orgname.edu/project/

Metadata for different communities

Metadata for different communities

Loss of Context: Record in OAI aggregation

Context: Record in native database

Loss of context / data

Loss of context / data

Granularity of Description: Excerpt of Metadata Record Describing American Woven Coverlet

Granularity of Description: Excerpt of Metadata Record Describing "Cotton coverlet with embroidered butterfly design"

????? GEM Collection Registries Photograph from Indiana University Charles W. Cushman Collection

Shareable metadata defined Promotes search interoperability - the ability to perform a search over diverse sets of metadata records and obtain meaningful results (Priscilla Caplan) Is human understandable outside of its local context Is useful outside of its local context Preferably is machine processable

Recap OAI protocol is a tool OAI is easy - metadata is hard Better metadata = better interoperability

Contact Information Sarah Shreeves Coordinator, IDEALS University of Illinois Library at Urbana-Champaign Email: sshreeve@uiuc.edu Phone: 217-244-3877 This work is licensed under the Creative Commons Attribution-NonCommercialShareAlike 2.5 License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/2.5/ or send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.