Metadata Harvesting Framework

Size: px
Start display at page:

Download "Metadata Harvesting Framework"

Transcription

1 Metadata Harvesting Framework Library User 3. Provide searching, browsing, and other services over the data. Service Provider (TEL, NSDL) Harvested Records 1. Service Provider polls periodically for new records OAI protocol (over http) 2. New records downloaded and cached by the Service Provider Data Providers: (collection builders) OAI workshop, December 11, Multiple representations of an object MARC Record In XML Dublin Core Record In XML Qualified Dublin Core Record In XML MODS record In XML Honoré Daumier Lithograph (Brandeis University) OAI workshop, December 11,

2 HTTP and XML The OAI-PMH is an almost stateless request/response protocol Requests and responses are sent using the HTTP protocol Requests are made using HTTP GET/POST operations Responses are returned as well-formed, valid XML documents OAI workshop, December 11, Well-formed and Valid XML Correct <car> <make>dodge</make> <model>spirit</model> <year>1994</year> <owner> <name>you</name> <plate>co</plate> </owner> </car> Incorrect <car> <make>dodge</make> <model>spirit</model> <year>1994 <owner> <plate>co</plate> <name>you</name> </car> </owner> OAI workshop, December 11,

3 DTD, Schemas & Namespace DTD s: Document Type Definition Describe the elements of XML instance documents Not well-formed XML Some data-typing Namespaces harder to deal with Namespace: Schemas Describe the elements of XML instance documents Well-formed XML Strong data-typing Namespaces are easier to deal with Collection of related element names identified by a name label (e.g. dc) OAI workshop, December 11, XML Namespaces and Schema Consistency and data quality is ensured through XML schemas and schema validation Two separate XML namespaces are used: One that defines the OAI-PMH response Another that defines the metadata records contained in the response e.g. the record-level schema Example: entifier=oai:dlese.org:dlese OAI workshop, December 11,

4 OAI repositories can be organized in sets OAI-PMH mechanism to allow for harvesting of subcollections Semantics for sets are defined outside of the protocol Sets are defined by conventions established between data and service providers, or just by the data provider Sets can be established that enable querying (e.g. by topic, author name, subject area, etc.) Example: The Open Digital Library (Suleman, 2001) OAI workshop, December 11, OAI repositories can be organized in sets What do sets represent? Journals: issues Institutional repositories: Departments, research centers, etc. Set representations may be constrained by the software package used. EPrint Archives: Subject, Publication Status Cultural Heritage Repositories: Collections with Intent 5 April, 2006 OAI workshop, December 11,

5 Requirements to be a Data Provider Source of metadata Human or automated resource catalogers Metadata mappings Crosswalks from native formats to DC or other formats Server technology Handled by the OAI software Datestamps Indicates when the item was last changed (handled by the OAI software) Deletions Indicates if the item has been deleted and should be removed (handled by the OAI software) Unique identifiers Used to uniquely identify each item across repositories OAI workshop, December 11, Examples of repositories OAForum Information Resource Database is no longer active Refer to UKOLN site: More repositories at: OAI workshop, December 11,

6 Examples of services OAI workshop, December 11, The OAI-PMH OAI-PMH Requests Identify ListMetadataFormats ListSets GetRecord ListIdentifiers ListRecords Resumption Tokens Used for flow control when large responses are required OAI workshop, December 11,

7 OAI-PMH: overview and structure model OAI workshop, December 11, Key Definitions Harvester: client application issuing OAI-PMH requests Repository: network accessible server, able to process OAI- PMH requests correctly Set: optional construct for grouping items in a repository OAI workshop, December 11,

8 Key Definitions Resource: object the metadata is "about", nature of resources is not defined in the OAI- PMH resources may be digital or non-digital Item: component of an repository from which metadata about a resource can be disseminated; has an unique identifier Record: metadata in a specific metadata format Identifier: unique key for an item in a repository OAI workshop, December 11, Protocol Details: Records A record is the metadata of a resource in a specific format. A record has three parts: a header and metadata, both of which are mandatory, and an optional about statement. Each of these is made up of various components as set out below. header (mandatory) - identifier (mandatory: 1 only) - datestamp (mandatory: 1 only) - setspec elements (optional: 0, 1 or more) - status attribute for deleted item metadata (mandatory) - XML encoded metadata with root tag, namespace - repositories must support Dublin Core, may support other formats about (optional) - rights statements - provenance statements OAI workshop, December 11,

9 Protocol Details: Datestamps A datestamp is the date of last modification of a metadata record. Datestamp is a mandatory characteristic of every item. It has two possible levels of granularity: YYYY-MM-DD YYYY-MM-DDThh:mm:ssZ. The function of the datestamp is to provide information on metadata that enables selective harvesting using from and until arguments. Its applications are in incremental update mechanisms. It gives either the date of creation, last modification, or deletion. Deletion is covered with three support levels: no persistent transient. OAI workshop, December 11, Protocol Details: Metadata schema OAI-PMH supports dissemination of multiple metadata formats from a repository. The properties of metadata formats are: id string to specify the format (metadataprefix) metadata schema URL (XML schema to test validity) XML namespace URI (global identifier for metadata format) Repositories must be able to disseminate unqualified Dublin Core. The Dublin Core Metadata Element Set contains 15 elements. All elements are optional, and all elements may be repeated. Further arbitrary metadata formats can be defined and transported via the OAI-PMH. Any returned metadata must comply with an XML namespace specification. OAI workshop, December 11,

10 Protocol Details: Sets Sets enable a logical partitioning of repositories. They are optional - archives do not have to define Sets. There are no recommendations for the implementation of Sets. Sets are not necessarily exhaustive of the content of a repository. They are not necessarily strictly hierarchical. It is important and necessary to have negotiated agreements within communities defining useful sets for the communities. function: selective harvesting (set parameter) applications: subject gateways, dissertation search engine, and others examples publication types (thesis, article,?) document types (text, audio, image,?) content sets, according to DNB (medicine, biology,?) OAI workshop, December 11, Protocol Details: Request format Requests must be submitted using the GET or POST methods of HTTP, and repositories must support both methods. At least one key=value pair: verb=requesttype (where RequestType is some type of request such as ListRecords) must be provided. Additional key=value pairs depend on the request type. example for GET request: x=oai_dc The encoding of special characters must be supported; for example, ":" (host port separator) becomes "%3A" OAI workshop, December 11,

11 Protocol Details: Response Responses are formatted as HTTP responses. The content type must be text/xml. HTTP-based status codes, as distinguished from OAI-PMH errors, such as 302 (redirect) and 503 (service not available) may be returned. Compression codes are optional in OAI-PMH, only identity encoding is mandatory. The response format must be well-formed XML with markup as follows: 1. XML declaration (<?xml version="1.0" encoding="utf-8"?>) 2. root element named OAI-PMH with three attributes (xmlns, xmlns:xsi, xsi:schemalocation) 3. three child elements 1. responsedate (UTC datetime) 2. request (the request that generated this response) 3. a) error (in case of an error or exception condition) b) element with the name of the OAI-PMH request OAI workshop, December 11, Protocol Details: Flow control Four of the request types return a list of entries. Three of them may reply with 'large' lists. OAI-PMH supports partitioning. Those managing a repository make the decisions on partitioning: whether to partition and how. The response to a request includes: incomplete list resumption token expiration date, size of complete list, cursor (optional) For a new request with same request type: resumption token as parameter all other parameters omitted! The response includes the next (which may be the last) section of the list and a resumption token. That resumption token is empty if the last section of the list is enclosed. OAI workshop, December 11,

12 Protocol Details: Flow control OAI workshop, December 11, Protocol Details: Errors and exceptions Repositories must indicate OAI-PMH errors by the inclusion of one or more error elements. The defined error identifiers are: badargument badresumptiontoken badverb cannotdisseminateformat iddoesnotexist norecordsmatch nometadataformats nosethierarchy OAI workshop, December 11,

13 Request types There are six different request types: Identify ListMetadataFormats ListSets ListIdentifiers ListRecords GetRecord A harvester is not required to use all types. A repository must implement all types. There are required and optional arguments, depending on request types. OAI workshop, December 11, Request types: Identify function description of an archive example archive.org/oai-script?verb=identify parameters none errors / exceptions badargument (e.g. archive.org/oaiscript?verb=identify&set=biology) response format OAI workshop, December 11,

14 Request types: Identify Response format Element Example repositoryname My Archive baseurl protocolversion 2.0 earliestdatestamp deleterecords no, transient, persistent granularity YYY-MM-DD, YYYY-MM-DDThh:mm:ssZ admin compression deflate, compress description oai-identifier, eprints, friends, Ordinality: 1 = mandatory, 1 only; + = mandatory, 1 only; * = optional, 0 or more Ordinality * * Online example: OAI workshop, December 11, Request types: ListMetadataFormats function retrieve available metadata formats from archive example archive.org/oai-script?verb=listmetadataformats& identifier=oai:huberlin.de: parameters identifier (optional) errors / exceptions badargument iddoesnotexist e.g. archive.org/oai-script?verb=listmetadataformats &identifier=really-wrong-identifier nometadataformats Online examples OAI workshop, December 11,

15 Request types: ListSets function retrieve set structure of a repository example archive.org/oai-script?verb=listsets parameters resumptiontoken (exclusive) errors / exceptions badargument badresumptiontoken e.g. archive.org/oai-script?verb=listsets &resumptiontoken=any-wrong-token nosethierarchy Online examples OAI workshop, December 11, Request types: ListIdentifiers function abbreviated form of ListRecords, retrieving only headers example archive.org/oai-script?verb=listidentifiers& metadataprefix=oai_dc&from= parameters from (optional) until (optional) metadataprefix (required) set (optional) resumptiontoken (exclusive) errors / exceptions badargument (e.g.?&from= :45:00) badresumptiontoken cannotdisseminateformat norecordsmatch nosethierarchy online example OAI workshop, December 11,

16 Request types: ListRecords function harvest records from a repository example archive.org/oai-script?verb=listrecords& metadataprefix=oai_dc&set=biology parameters from (optional) until (optional) metadataprefix (required) set (optional) resumptiontoken (exclusive) errors / exceptions badargument badresumptiontoken cannotdisseminateformat norecordsmatch nosethierarchy Online example &set=bnd&metadataPrefix=tel OAI workshop, December 11, Request types: GetRecord function retrieve individual metadata record from a repository example archive.org/oai-script?verb=getrecord& identifier=oai:huberlin.de: & metadataprefix=oai_dc parameters identifier (required) metadataprefix (required) errors / exceptions badargument cannotdisseminateformat iddoesnotexist online examples OAI workshop, December 11,

17 Turn key systems and modules CWIS : ContentDM : Digitool : DSpace : EPrints : DLXS: OAICat: XMLFile: DLESE OAI software: More tools at: OAI workshop, December 11, References 1. Building Interoperable Digital Libraries: A Practical Guide to creating Open Archives, Hussein Suleman, JCDL 2001 Tutorial. 2. A Framework for Building Open Digital Libraries, Hussein Suleman and Edward A. Fox, in D-Lib Magazine, December, The Open Archives Initiative 4. DLF/NSDL best practices for OAI and shareable metadata 5. Open Archives Forum OAI workshop, December 11,

Introduction to the OAI Protocol for Metadata Harvesting Version 2.0. Hussein Suleman Virginia Tech DLRL 17 June 2002

Introduction to the OAI Protocol for Metadata Harvesting Version 2.0. Hussein Suleman Virginia Tech DLRL 17 June 2002 Introduction to the OAI Protocol for Metadata Harvesting Version 2.0 Hussein Suleman Virginia Tech DLRL 17 June 2002 Version 2.0 Already? Why? What are you guys thinking? But we didn t implemented version

More information

Tutorial. Open Archive Initiative

Tutorial. Open Archive Initiative Tutorial Open Archive Initiative Uwe Müller Computer- und Medienservice, Humboldt-Universität zu Berlin u.mueller@cms.hu-berlin.de Dr. Heinrich Stamerjohanns Institute for Science Networking, Universität

More information

Building Interoperable and Accessible ETD Collections: A Practical Guide to Creating Open Archives

Building Interoperable and Accessible ETD Collections: A Practical Guide to Creating Open Archives Building Interoperable and Accessible ETD Collections: A Practical Guide to Creating Open Archives Hussein Suleman, hussein@vt.edu Digital Library Research Laboratory Virginia Tech 1. Introduction What

More information

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University Using metadata for interoperability CS 431 February 28, 2007 Carl Lagoze Cornell University What is the problem? Getting heterogeneous systems to work together Providing the user with a seamless information

More information

OAI-PMH. DRTC Indian Statistical Institute Bangalore

OAI-PMH. DRTC Indian Statistical Institute Bangalore OAI-PMH DRTC Indian Statistical Institute Bangalore Problem: No Library contains all the documents in the world Solution: Networking the Libraries 2 Problem No digital Library is expected to have all documents

More information

Building Interoperable Digital Libraries: A Practical Guide to creating Open Archives

Building Interoperable Digital Libraries: A Practical Guide to creating Open Archives Building Interoperable Digital Libraries: A Practical Guide to creating Open Archives Hussein Suleman, hussein@vt.edu Digital Library Research Laboratory Virginia Tech 1. Introduction What is the OAI?

More information

Problem: Solution: No Library contains all the documents in the world. Networking the Libraries

Problem: Solution: No Library contains all the documents in the world. Networking the Libraries OAI-PMH Problem: No Library contains all the documents in the world Solution: Networking the Libraries 2 Problem No digital Library is expected to have all documents in the world Solution Networking the

More information

OAI-PMH repositories: Quality issues regarding metadata and protocol compliance

OAI-PMH repositories: Quality issues regarding metadata and protocol compliance OAI-PMH repositories: Quality issues regarding metadata and protocol compliance Tim Cole (University of Illinois at UC) & Simeon Warner (Cornell University) OAI4 @ CERN, Geneva, 20 October 2005 Schedule

More information

OAI-PMH implementation and tools guidelines

OAI-PMH implementation and tools guidelines ECP-2006-DILI-510003 TELplus OAI-PMH implementation and tools guidelines Deliverable number Dissemination level D-2.1 Public Delivery date 31 May 2008 Status Final v1.1 Author(s) Diogo Reis(IST), Nuno

More information

Version 2 of the OAI-PMH & some other stuff

Version 2 of the OAI-PMH & some other stuff Version 2 of the OAI-PMH & some other stuff 2 nd Workshop on the OAI, CERN Geneva, October 17 th 2002 Herbert Van de Sompel Los Alamos National Laboratory Carl Lagoze Cornell University about OAI-PMH v.2.0

More information

The Open Archives Initiative Protocol for Metadata Harvesting: An Introduction

The Open Archives Initiative Protocol for Metadata Harvesting: An Introduction DRTC Workshop on Digital Libraries: Theory and Practice March 2003 DRTC, Bangalore The Open Archives Initiative Protocol for Metadata Harvesting: An Introduction Documentation Research and Training Centre

More information

Network Information System. NESCent Dryad Subcontract (Year 1) Metacat OAI-PMH Project Plan 25 February Mark Servilla

Network Information System. NESCent Dryad Subcontract (Year 1) Metacat OAI-PMH Project Plan 25 February Mark Servilla Network Information System NESCent Dryad Subcontract (Year 1) Metacat OAI-PMH Project Plan 25 February 2009 Mark Servilla servilla@lternet.edu LTER Network Office Department of Biology, MSC03 2020 1 University

More information

Publishing Based on Data Provider

Publishing Based on Data Provider Publishing Based on Data Provider Version 16 and later Please note: Implementation of the following OAI tools requires an additional license agreement with Ex Libris. To learn more about licensing this

More information

The Open Archives Initiative Protocol for Metadata Harvesting

The Open Archives Initiative Protocol for Metadata Harvesting Page 1 of 34 The Open Archives Initiative Protocol for Metadata Harvesting Protocol Version 2.0 of 2002-06-14 Document Version 2003/02/21T00:00:00Z http://www.openarchives.org/oai/2.0/openarchivesprotocol.htm

More information

IMu OAI-PMH Web Service

IMu OAI-PMH Web Service IMu Documentation IMu OAI-PMH Web Service Document Version 1.1 EMu Version 4.00 IMu Version 1.0.03 www.kesoftware.com 2012 KE Software. All rights reserved. Contents SECTION 1 OAI-PMH Concepts 1 What

More information

Metadata aggregation for digital libraries

Metadata aggregation for digital libraries ICDAT 2005 Metadata aggregation for digital libraries Muriel Foulonneau () Grainger Engineering Library University of Illinois at Urbana-Champaign USA June 2005 Outlines Role and practices of actors in

More information

Exposing and Harvesting Metadata Using the OAI Metadata Harvesting Protocol: A Tutorial

Exposing and Harvesting Metadata Using the OAI Metadata Harvesting Protocol: A Tutorial Page 1 of 11 High Energy Physics Libraries Webzine Home Editorial Board Contents Issue 4 HEP Libraries Webzine Issue 4 / June 2001 Abstract Exposing and Harvesting Metadata Using the OAI Metadata Harvesting

More information

IVOA Registry Interfaces Version 0.1

IVOA Registry Interfaces Version 0.1 IVOA Registry Interfaces Version 0.1 IVOA Working Draft 2004-01-27 1 Introduction 2 References 3 Standard Query 4 Helper Queries 4.1 Keyword Search Query 4.2 Finding Other Registries This document contains

More information

The multi-faceted use of the OAI-PMH in the LANL Repository

The multi-faceted use of the OAI-PMH in the LANL Repository The multi-faceted use of the OAI-PMH in the LANL Repository Henry N. Jerez hjerez@lanl.gov Xiaoming Liu liu_x@lanl.gov Patrick Hochstenbach hochsten@lanl.gov Digital Library Research & Prototyping Team

More information

http://resolver.caltech.edu/caltechlib:spoiti05 Caltech CODA http://coda.caltech.edu CODA: Collection of Digital Archives Caltech Scholarly Communication 15 Production Archives 3102 Records Theses, technical

More information

OAI Static Repositories (work area F)

OAI Static Repositories (work area F) IMLS Grant Partner Uplift Project OAI Static Repositories (work area F) Serhiy Polyakov Mark Phillips May 31, 2007 Draft 3 Table of Contents 1. Introduction... 1 2. OAI static repositories... 1 2.1. Overview...

More information

Integrating Access to Digital Content

Integrating Access to Digital Content Integrating Access to Digital Content OR OAI is easy, metadata is hard Sarah Shreeves University of Illinois at Urbana-Champaign Why Integrate Access? Increase access to your collections 37% of visits

More information

The Open Archives Initiative and the Sheet Music Consortium

The Open Archives Initiative and the Sheet Music Consortium The Open Archives Initiative and the Sheet Music Consortium Jon Dunn, Jenn Riley IU Digital Library Program October 10, 2003 Presentation outline Jon: OAI introduction Sheet Music Consortium background

More information

Harvesting Metadata Using OAI-PMH

Harvesting Metadata Using OAI-PMH Harvesting Metadata Using OAI-PMH Roy Tennant California Digital Library Outline The Open Archives Initiative OAI-PMH The Harvesting Process Harvesting Problems Steps to a Fruitful Harvest A Harvesting

More information

RVOT: A Tool For Making Collections OAI-PMH Compliant

RVOT: A Tool For Making Collections OAI-PMH Compliant RVOT: A Tool For Making Collections OAI-PMH Compliant K. Sathish, K. Maly, M. Zubair Computer Science Department Old Dominion University Norfolk, Virginia USA {kumar_s,maly,zubair}@cs.odu.edu X. Liu Research

More information

Outline of the course

Outline of the course Outline of the course Introduction to Digital Libraries (15%) Description of Information (30%) Access to Information (30%) User Services (10%) Additional topics (15%) Buliding of a (small) digital library

More information

An introduction to OAI-PMH

An introduction to OAI-PMH CARLI DCUG Metadata Matters Webinar Series An introduction to OAI-PMH Library Digital Content Access Lead Head, Mathematics Library Prof. of Library Administration Prof. of Library & Info. Science (with

More information

Interoperability and Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)

Interoperability and Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) 338 Interoperability and Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Martha Latika Alexander J N Gautam Abstract Interoperability refers to the ability of a Digital Library to work

More information

Creating a National Federation of Archives using OAI-PMH

Creating a National Federation of Archives using OAI-PMH Creating a National Federation of Archives using OAI-PMH Luís Miguel Ferros 1, José Carlos Ramalho 1 and Miguel Ferreira 2 1 Departament of Informatics University of Minho Campus de Gualtar, 4710 Braga

More information

Open Archives Initiative protocol development and implementation at arxiv

Open Archives Initiative protocol development and implementation at arxiv Open Archives Initiative protocol development and implementation at arxiv Simeon Warner (Los Alamos National Laboratory, USA) (simeon@lanl.gov) OAI Open Day, Washington DC 23 January 2001 1 What is arxiv?

More information

CodeSharing: a simple API for disseminating our TEI encoding. Martin Holmes

CodeSharing: a simple API for disseminating our TEI encoding. Martin Holmes CodeSharing: a simple API for disseminating our TEI encoding 1. Introduction Martin Holmes Although the TEI Guidelines are full of helpful examples, and other inititatives such as TEI By Example have made

More information

Harvesting Statistical Metadata from an Online Repository for Data Analysis and Visualization

Harvesting Statistical Metadata from an Online Repository for Data Analysis and Visualization Sem Gebresilassie Harvesting Statistical Metadata from an Online Repository for Data Analysis and Visualization Concept application on Theseus Helsinki Metropolia University of Applied Sciences Bachelor

More information

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector Relais Culture Europe mfoulonneau@relais-culture-europe.org Community report A community report on

More information

Digital Library Curriculum Development Module 5-d: Protocols (Last Updated: )

Digital Library Curriculum Development Module 5-d: Protocols (Last Updated: ) Digital Library Curriculum Development Module 5-d: Protocols (Last Updated: 2009-10-09) 1. Module name: Protocols 2. Scope This module addresses the concepts, development and implementation of digital

More information

Joining the BRICKS Network - A Piece of Cake

Joining the BRICKS Network - A Piece of Cake Joining the BRICKS Network - A Piece of Cake Robert Hecht and Bernhard Haslhofer 1 ARC Seibersdorf research - Research Studios Studio Digital Memory Engineering Thurngasse 8, A-1090 Wien, Austria {robert.hecht

More information

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Metadata and Encoding Standards for Digital Initiatives: An Introduction Metadata and Encoding Standards for Digital Initiatives: An Introduction Maureen P. Walsh, The Ohio State University Libraries KSU-SLIS Organization of Information 60002-004 October 29, 2007 Part One Non-MARC

More information

How to contribute information to AGRIS

How to contribute information to AGRIS How to contribute information to AGRIS Guidelines on how to complete your registration form The dashboard includes information about you, your institution and your collection. You are welcome to provide

More information

Corso di Biblioteche Digitali

Corso di Biblioteche Digitali Corso di Biblioteche Digitali Vittore Casarosa casarosa@isti.cnr.it tel. 050-315 3115 cell. 348-397 2168 Ricevimento dopo la lezione o per appuntamento Valutazione finale 70-75% esame orale 25-30% progetto

More information

Comparing Open Source Digital Library Software

Comparing Open Source Digital Library Software Comparing Open Source Digital Library Software George Pyrounakis University of Athens, Greece Mara Nikolaidou Harokopio University of Athens, Greece Topic: Digital Libraries: Design and Development, Open

More information

A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research

A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research A Repository of Metadata Crosswalks Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research DLF-2004 Spring Forum April 21, 2004 Outline of this

More information

Applying SOAP to OAI-PMH

Applying SOAP to OAI-PMH Applying SOAP to OAI-PMH Sergio Congia, Michael Gaylord, Bhavik Merchant, and Hussein Suleman Department of Computer Science, University of Cape Town Private Bag, Rondebosch, 7701, South Africa {scongia,

More information

Orbis Cascade Alliance Content Creation & Dissemination Program Digital Collections Service. Enabling OAI & Mapping Fields in Digital Commons

Orbis Cascade Alliance Content Creation & Dissemination Program Digital Collections Service. Enabling OAI & Mapping Fields in Digital Commons Orbis Cascade Alliance Content Creation & Dissemination Program Digital Collections Service Enabling OAI & Mapping Fields in Digital Commons Produced by the Digital Collections Working Group of the Content

More information

2nd Technical Validation Questionnaire - interim results -

2nd Technical Validation Questionnaire - interim results - 2nd Technical Validation Questionnaire - interim results - Birgit Matthaei Humboldt-University, Berlin, Germany Electronic Publishing Group Computer- and Mediaservice birgit.matthaei@cms.hu-berlin.de Why

More information

The Observation of Bahasa Indonesia Official Computer Terms Implementation in Scientific Publication

The Observation of Bahasa Indonesia Official Computer Terms Implementation in Scientific Publication Journal of Physics: Conference Series PAPER OPEN ACCESS The Observation of Bahasa Indonesia Official Computer Terms Implementation in Scientific Publication To cite this article: D Gunawan et al 2018 J.

More information

NEEO TECHNICAL GUIDELINES FOR THE

NEEO TECHNICAL GUIDELINES FOR THE NEEO TECHNICAL GUIDELINES FOR THE EXCHANGE OF USAGE METADATA DRAFT Version 1.4 NEEO WP5 Author: Benoit Pauwels Date Version 9/9/2008 0.1 Initial skeleton document 20/11/2008 1.0 Introduce SWUP: the Scholarly

More information

Harvester Service Technical and User Guide 5 June 2008

Harvester Service Technical and User Guide 5 June 2008 Harvester Service Technical and User Guide 5 June 2008 1. Purpose...2 2. Overview...2 3. Services...3 4. Custom Harvests...5 5. Notes on Harvest Flow...6 6. Source Code Overview...6 1 1. Purpose The purpose

More information

Design of The PORTA EUROPA Portal (PEP) Pilot Project

Design of The PORTA EUROPA Portal (PEP) Pilot Project Design of The PORTA EUROPA Portal (PEP) Pilot Project Marco Pirri Maria Chiara Pettenati Electronics and Telecommunications Department University of Florence (Italy) Library European University Institute

More information

A Novel Architecture of Agent based Crawling for OAI Resources

A Novel Architecture of Agent based Crawling for OAI Resources A Novel Architecture of Agent based Crawling for OAI Resources Shruti Sharma YMCA University of Science & Technology, Faridabad, INDIA shruti.mattu@yahoo.co.in J.P.Gupta JIIT University, Noida, India jp_gupta/jiit@jiit.ac.in

More information

arxiv, the OAI, and peer review

arxiv, the OAI, and peer review arxiv, the OAI, and peer review Simeon Warner (arxiv, Los Alamos National Laboratory, USA) (simeon@lanl.gov) Workshop on OAI and peer review journals in Europe, Geneva, 22 24 March 2001 1 What is arxiv?

More information

Metadata Standards and Applications

Metadata Standards and Applications Clemson University TigerPrints Presentations University Libraries 9-2006 Metadata Standards and Applications Scott Dutkiewicz Clemson University Derek Wilmott Clemson University, rwilmot@clemson.edu Follow

More information

CARARE Training Workshops

CARARE Training Workshops CARARE Training Workshops Stein Runar Bergheim Asplan Viak Internet as CARARE is funded by the European Commission's ICT Policy Support Programme Introduction to Repox An OAI-PMH tool developed within

More information

Indonesian Citation Based Harvester System

Indonesian Citation Based Harvester System n Citation Based Harvester System Resmana Lim Electrical Engineering resmana@petra.ac.id Adi Wibowo Informatics Engineering adiw@petra.ac.id Raymond Sutjiadi Research Center raymondsutjiadi@petra.ac.i

More information

EXTENDING OAI-PMH PROTOCOL WITH DYNAMIC SETS DEFINITIONS USING CQL LANGUAGE

EXTENDING OAI-PMH PROTOCOL WITH DYNAMIC SETS DEFINITIONS USING CQL LANGUAGE EXTENDING OAI-PMH PROTOCOL WITH DYNAMIC SETS DEFINITIONS USING CQL LANGUAGE Cezary Mazurek Poznań Supercomputing and Networking Center Noskowskiego 12/14, 61-704 Poznań, Poland Marcin Werla Poznań Supercomputing

More information

Flexible Design for Simple Digital Library Tools and Services

Flexible Design for Simple Digital Library Tools and Services Flexible Design for Simple Digital Library Tools and Services Lighton Phiri Hussein Suleman Digital Libraries Laboratory Department of Computer Science University of Cape Town October 8, 2013 SARU archaeological

More information

Expected and Unexpected Synergies

Expected and Unexpected Synergies Page 1 of 8 Search Back Issues Author Index Title Index Contents D-Lib Magazine February 2005 Volume 11 Number 2 ISSN 1082-9873 SRW/U with OAI Expected and Unexpected Synergies Robert Sanderson University

More information

adore: a modular, standards-based Digital Object Repository

adore: a modular, standards-based Digital Object Repository adore: a modular, standards-based Digital Object Repository Herbert Van de Sompel, Jeroen Bekaert, Xiaoming Liu, Luda Balakireva, Thorsten Schwander Los Alamos National Laboratory, Research Library {herbertv,

More information

Research on the Interoperability Architecture of the Digital Library Grid

Research on the Interoperability Architecture of the Digital Library Grid Research on the Interoperability Architecture of the Digital Library Grid HaoPan Department of information management, Beijing Institute of Petrochemical Technology, China, 102600 bjpanhao@163.com Abstract.

More information

Increasing access to OA material through metadata aggregation

Increasing access to OA material through metadata aggregation Increasing access to OA material through metadata aggregation Mark Jordan Simon Fraser University SLAIS Issues in Scholarly Communications and Publishing 2008-04-02 1 We will discuss! Overview of metadata

More information

SDMX self-learning package XML based technologies used in SDMX-IT TEST

SDMX self-learning package XML based technologies used in SDMX-IT TEST SDMX self-learning package XML based technologies used in SDMX-IT TEST Produced by Eurostat, Directorate B: Statistical Methodologies and Tools Unit B-5: Statistical Information Technologies Last update

More information

Publications Repository Based on OAI-PMH 2.0 Using Google App Engine

Publications Repository Based on OAI-PMH 2.0 Using Google App Engine TELKOMNIKA, Vol.12, No.1, March 2014, pp. 251 ~ 262 ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013 DOI: 10.12928/TELKOMNIKA.v12i1.1789 251 Publications Repository Based on OAI-PMH

More information

Metadata: The Theory Behind the Practice

Metadata: The Theory Behind the Practice Metadata: The Theory Behind the Practice Item Type Presentation Authors Coleman, Anita Sundaram Citation Metadata: The Theory Behind the Practice 2002-04, Download date 06/07/2018 12:18:20 Link to Item

More information

OAI AND AMF FOR ACADEMIC SELF-DOCUMENTATION

OAI AND AMF FOR ACADEMIC SELF-DOCUMENTATION OAI AND AMF FOR ACADEMIC SELF-DOCUMENTATION Pavel I. Braslavsky Institute of Engineering Science Ural Branch, Russian Academy of Sciences Komsomolskaya 34 620219 Ekaterinburg Russia pb@imach.uran.ru Thomas

More information

mod_oai: An Apache Module for Metadata Harvesting

mod_oai: An Apache Module for Metadata Harvesting mod_oai: An Apache Module for Metadata Harvesting Michael L. Nelson 1, Herbert Van de Sompel 2, Xiaoming Liu 2, Terry L. Harrison 1, Nathan McFarland 2 1 Old Dominion University, Department of Computer

More information

The NSDL Repository and API

The NSDL Repository and API The NSDL Repository and API January 9, 2007 Contents 1 Basic Data Model 2 1.1 Object Types............................. 3 1.2 Object Content............................ 3 1.3 Object Identity............................

More information

Metadata Workshop 3 March 2006 Part 1

Metadata Workshop 3 March 2006 Part 1 Metadata Workshop 3 March 2006 Part 1 Metadata overview and guidelines Amelia Breytenbach Ria Groenewald What metadata is Overview Types of metadata and their importance How metadata is stored, what metadata

More information

A methodology for Sharing Archival Descriptive Metadata in a Distributed Environment

A methodology for Sharing Archival Descriptive Metadata in a Distributed Environment A methodology for Sharing Archival Descriptive Metadata in a Distributed Environment Nicola Ferro and Gianmaria Silvello Information Management Research Group (IMS) Department of Information Engineering

More information

Guidelines for Developing Digital Cultural Collections

Guidelines for Developing Digital Cultural Collections Guidelines for Developing Digital Cultural Collections Eirini Lourdi Mara Nikolaidou Libraries Computer Centre, University of Athens Harokopio University of Athens Panepistimiopolis, Ilisia, 15784 70 El.

More information

Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH. Chuck Koscher Technology Director, CrossRef

Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH. Chuck Koscher Technology Director, CrossRef Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH Chuck Koscher Technology Director, CrossRef ckoscher@crossref.org Scholarly Publishing Trends Everything is online if it s not online, it

More information

oatd.org Discovery for Open Access Theses and Dissertations An ASERL Webinar, October 15, 2013 These slides:

oatd.org Discovery for Open Access Theses and Dissertations An ASERL Webinar, October 15, 2013 These slides: oatd.org Discovery for Open Access Theses and Dissertations An ASERL Webinar, October 15, 2013 These slides: http://goo.gl/muxq15 Thomas Dowling dowlintp@wfu.edu I Can Haz ASERL ETDs? 34 of 37 ASERL universities

More information

Chuck Cartledge, PhD. 25 February 2018

Chuck Cartledge, PhD. 25 February 2018 Big Data: Data Wrangling Boot Camp Web Crawling with R and OAI-PMH Chuck Cartledge, PhD 25 February 2018 1/21 Table of contents (1 of 1) 1 Intro. 2 OAI-PMH What is OAI-PMH 3 Hands-on 4 Q & A 5 Conclusion

More information

Questionnaire for effective exchange of metadata current status of publishing houses

Questionnaire for effective exchange of metadata current status of publishing houses Questionnaire for effective exchange of metadata current status of publishing houses In 2011, important priorities were set in order to realise green publications in the open access movement in Germany.

More information

The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data

The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Bernhard Haslhofer University of Vienna Dept. of Distributed and Multimedia Systems Vienna, Austria bernhard.haslhofer@univie.ac.at ABSTRACT Many

More information

ORCA-Registry v2.4.1 Documentation

ORCA-Registry v2.4.1 Documentation ORCA-Registry v2.4.1 Documentation Document History James Blanden 26 May 2008 Version 1.0 Initial document. James Blanden 19 June 2008 Version 1.1 Updates for ORCA-Registry v2.0. James Blanden 8 January

More information

Overview NSDL Collection Representation and Information Flow

Overview NSDL Collection Representation and Information Flow Overview NSDL Collection Representation and Information Flow October 24, 2006 Introduction The intent of this document is to be a brief overview of the current state of collections and associated processing/representation

More information

Interoperability for Digital Libraries

Interoperability for Digital Libraries DRTC Workshop on Semantic Web 8 th 10 th December, 2003 DRTC, Bangalore Paper: C Interoperability for Digital Libraries Michael Shepherd Faculty of Computer Science Dalhousie University Halifax, NS, Canada

More information

GMA-PSMH: A Semantic Metadata Publish-Harvest Protocol for Dynamic Metadata Management Under Grid Environment

GMA-PSMH: A Semantic Metadata Publish-Harvest Protocol for Dynamic Metadata Management Under Grid Environment GMA-PSMH: A Semantic Metadata Publish-Harvest Protocol for Dynamic Metadata Management Under Grid Environment Yaping Zhu, Ming Zhang, Kewei Wei, and Dongqing Yang School of Electronics Engineering and

More information

SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH

SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH VERSION 1.4 27 March 2018 EDULIB, S.R.L. MUSE KNOWLEDGE HEADQUARTERS Calea Bucuresti, Bl. 27B, Sc. 1, Ap. 10, Craiova 200675, România phone +40 251 413 496

More information

Open Archives Initiative Object Reuse & Exchange. Resource Map Discovery

Open Archives Initiative Object Reuse & Exchange. Resource Map Discovery Open Archives Initiative Object Reuse & Exchange Resource Map Discovery Michael L. Nelson * Carl Lagoze, Herbert Van de Sompel, Pete Johnston, Robert Sanderson, Simeon Warner OAI-ORE Specification Roll-Out

More information

Open Archives Forum - Technical Validation -

Open Archives Forum - Technical Validation - Open Archives Forum - Technical Validation - Birgit Matthaei Humboldt University Berlin, Germany Computer and Media Service, Electronic Publishing Group birgit.matthaei@cms.hu-berlin.de Creating Information

More information

Developing an Institutional Repository Service in Chinese Academy of Sciences

Developing an Institutional Repository Service in Chinese Academy of Sciences Developing an Institutional Repository Service in Chinese Academy of Sciences Zhongming Zhu, Jianxia Ma Lanzhou Branch of National Science Library, CAS Zhixiong Zhang National Science Library, CAS Sino-German

More information

How to Create a Custom Ingest Form

How to Create a Custom Ingest Form How to Create a Custom Ingest Form The following section presumes that you are using the Virtual Machine Image or are visiting http://sandbox.islandora.ca OR that you have installed and configured the

More information

Metadata. Week 4 LBSC 671 Creating Information Infrastructures

Metadata. Week 4 LBSC 671 Creating Information Infrastructures Metadata Week 4 LBSC 671 Creating Information Infrastructures Muddiest Points Memory madness Hard drives, DVD s, solid state disks, tape, Digitization Images, audio, video, compression, file names, Where

More information

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward Comp 336/436 - Markup Languages Fall Semester 2017 - Week 4 Dr Nick Hayward XML - recap first version of XML became a W3C Recommendation in 1998 a useful format for data storage and exchange config files,

More information

Metadata Standards and Applications. 4. Metadata Syntaxes and Containers

Metadata Standards and Applications. 4. Metadata Syntaxes and Containers Metadata Standards and Applications 4. Metadata Syntaxes and Containers Goals of Session Understand the origin of and differences between the various syntaxes used for encoding information, including HTML,

More information

INTRO INTO WORKING WITH MINT

INTRO INTO WORKING WITH MINT INTRO INTO WORKING WITH MINT TOOLS TO MAKE YOUR COLLECTIONS WIDELY VISIBLE BERLIN 16/02/2016 Nikolaos Simou National Technical University of Athens What is MINT? 2 Mint is a herb having hundreds of varieties

More information

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS) Institutional Repository using DSpace Yatrik Patel Scientist D (CS) yatrik@inflibnet.ac.in What is Institutional Repository? Institutional repositories [are]... digital collections capturing and preserving

More information

Using the WorldCat Digital Collection Gateway

Using the WorldCat Digital Collection Gateway Using the WorldCat Digital Collection Gateway This tutorial leads you through the steps for configuring your CONTENTdm collections for use with the Digital Collection Gateway and using the Digital Collection

More information

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly COAR Interoperability Roadmap Uppsala, May 21, 2012 COAR General Assembly 1 Background COAR WG2 s main objective for 2011-2012 was to facilitate a discussion on interoperability among Open Access repositories.

More information

Cross-domain Metadata Interoperability for Integrated Information Services

Cross-domain Metadata Interoperability for Integrated Information Services Cross-domain Metadata Interoperability for Integrated Information Services Xiaolin Zhang Library of Chinese Academy of Sciences 20 th International CODATA Conference Beijing, China, 2006.10.22-26 Cross-domain

More information

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication Citation for published version: Patel, M & Duke, M 2004, 'Knowledge Discovery in an Agents Environment' Paper presented at European Semantic Web Symposium 2004, Heraklion, Crete, UK United Kingdom, 9/05/04-11/05/04,.

More information

Harvesting of Additional Metadata Schema into DSpace through OAI-PMH: Issues and Challenges

Harvesting of Additional Metadata Schema into DSpace through OAI-PMH: Issues and Challenges Harvesting of Additional Metadata Schema into DSpace through OAI-PMH: Issues and Challenges Abstract 1 Anup Das* NDL Project Staff Central Library, IIT Kharagpur West Bengal, India E-mail: anupdas1704@gmail.com

More information

MuseKnowledge Hybrid Search

MuseKnowledge Hybrid Search MuseKnowledge Hybrid Search MuseGlobal, Inc. One Embarcadero Suite 500 San Francisco, CA 94111 415 896-6873 www.museglobal.com MuseGlobal S.A Calea Bucuresti Bl. 27B, Sc. 1, Ap. 10 Craiova, România 40

More information

Repository Interoperability

Repository Interoperability Repository Interoperability Open Repositories 2006 Sydney, January 31 to February 3, 2006 University of Sydney May 21, 2008 www.harvestroad.com.au Contents Alt-i-lab 2005 Demonstration Case Study Open

More information

Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008

Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008 Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008 1. Module name: Metadata 2. Scope: This module addresses uses of metadata and some specific metadata standards that may be

More information

Digital Libraries: Interoperability

Digital Libraries: Interoperability Digital Libraries: Interoperability RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Interoperability...............................................

More information

RDF and Digital Libraries

RDF and Digital Libraries RDF and Digital Libraries Conventions for Resource Description in the Internet Commons Stuart Weibel purl.org/net/weibel December 1998 Outline of Today s Talk Motivations for developing new conventions

More information

OAI (Open Archives Initiative) Suite Version 3.0. Introductory Guide for New Users

OAI (Open Archives Initiative) Suite Version 3.0. Introductory Guide for New Users OAI (Open Archives Initiative) Suite Version 3.0 Introductory Guide for New Users Any comments or requests for change to this user guide should be referred to:- Axiell Ltd. Hall View Drive Bilborough Nottingham,

More information

Go Sugimoto, Kerstin Arnold, Wim van Dongen, Yoann Moranville Reviewer: Lucile Grand

Go Sugimoto, Kerstin Arnold, Wim van Dongen, Yoann Moranville Reviewer: Lucile Grand Handbook for the ingestion of content Deliverable D.4.2 Project URL: http://www.apenet.eu / Portal URL http://www.archivesportaleurope.eu Deliverable number/name: D.4.2. Handbook for the ingestion of content

More information

Package rdryad. June 18, 2018

Package rdryad. June 18, 2018 Type Package Title Access for Dryad Web Services Package rdryad June 18, 2018 Interface to the Dryad ``Solr'' API, their ``OAI-PMH'' service, and fetch datasets. Dryad () is a curated

More information

Lessons Learned in Implementing the Extended Date/Time Format in a Large Digital Library

Lessons Learned in Implementing the Extended Date/Time Format in a Large Digital Library Lessons Learned in Implementing the Extended Date/Time Format in a Large Digital Library Hannah Tarver University of North Texas Libraries, USA hannah.tarver@unt.edu Mark Phillips University of North Texas

More information