Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH. Chuck Koscher Technology Director, CrossRef

Similar documents
Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University


Increasing access to OA material through metadata aggregation

OAI-PMH. DRTC Indian Statistical Institute Bangalore

Making ETDs count in UK repositories. Paul Needham, Cranfield University ETD2014, 24 th July 2014

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Problem: Solution: No Library contains all the documents in the world. Networking the Libraries

Infrastructure for the UK

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

Indonesian Citation Based Harvester System

Hello, I m Melanie Feltner-Reichert, director of Digital Library Initiatives at the University of Tennessee. My colleague. Linda Phillips, is going

Taylor & Francis Online. A User Guide.

Harvesting Metadata Using OAI-PMH

Getting technical an overview

Discovery to Delivery with WorldCat Local

Implementing EDS. A ten step summary, as experienced at Hofstra University Library

SciX Open, self organising repository for scientific information exchange. D15: Value Added Publications IST

The bx Scholarly Recommender Service. Nettie Lagace Product Manager, Ex Libris

Scopus Development Focus

eresearch Australia The Elephant in the Room! Open Access Archiving and other Gateways to e-research Richard Levy

Using Metadata for the Interlinking of Digitized Mathematics

Crossref DOIs help to uniquely identify and therefore link content

Flexible Design for Simple Digital Library Tools and Services

Getting (slightly) technical an introduction to linking. Ross MacIntyre Mimas Service Manager The University of Manchester

Creating a National Federation of Archives using OAI-PMH

ALL content should be created with findability in mind.

Making scholarly statistics count in UK repositories. RSP Statistics Webinar Paul Needham, Cranfield University 26 February 2013

Getting Started: Primo Central Index. 1. Getting Started: Primo Central Index. 1.1 Primo Central Index. Notes:

Data Publishing and Data Linking Introducing SCHOLIX

MuseKnowledge Hybrid Search

Developing Seamless Discovery of Scholarly and Trade Journal Resources Via OAI and RSS Chumbe, Santiago Segundo; MacLeod, Roddy

Open Archives Initiative protocol development and implementation at arxiv

Metadata Standards & Applications. 7. Approaches to Models of Metadata Creation, Storage, and Retrieval

Definition and Basic Concepts

Institutional Repository Usage Statistics

Mimas Service Manager The University of Manchester. UKSG InSERT Webinar July 2010

How to contribute information to AGRIS

21st century library: The preferred starting point for serious research? Helle Lauridsen Market Manager, EMEA

Online Copywriting Tips and Search Engine Optimisation.

Pipe Dreams: Harvesting Local Collections into Primo Using OAI-PMH

The Open Archives Initiative and the Sheet Music Consortium

Metadata Harvesting Framework

IVOA Registry Interfaces Version 0.1

Integrating Access to Digital Content

Local Metadatamanagement in a global environment

DOIs for Research Data

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

arxiv, the OAI, and peer review

SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH

The Observation of Bahasa Indonesia Official Computer Terms Implementation in Scientific Publication

OAI AND AMF FOR ACADEMIC SELF-DOCUMENTATION

Two Traditions of Metadata Development

4th EBIB Conference Internet in libraries Open Access Torun, December 7-8, 2007

Introduction to the OAI Protocol for Metadata Harvesting Version 2.0. Hussein Suleman Virginia Tech DLRL 17 June 2002

LUND UNIVERSITY Open Access Journals dissemination and integration in modern library services

The VDOT Research Library and TRID. September 12, 2012 Ken Winter, MLIS Associate Director, VDOT Research Library

Networking European Digital Repositories

Repository Metadata: Challenges of Interoperability

Network Information System. NESCent Dryad Subcontract (Year 1) Metacat OAI-PMH Project Plan 25 February Mark Servilla

Networking European Digital Repositories

Getting (slightly) technical an introduction to linking

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Citation Services for Institutional Repositories: Citebase Search. Tim Brody Intelligence, Agents, Multimedia Group University of Southampton

<Metadata>! What is Crossref metadata Who uses Crossref metadata services How Crossref provides this content

WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research code4lib 2006 Oregon State University, Corvallis, Oregon 15 February 2006

Questionnaire for effective exchange of metadata current status of publishing houses

RVOT: A Tool For Making Collections OAI-PMH Compliant

Customising Location of Knowledge

Discovery To Delivery In Digital Library Environment Library and

For those of you who may not have heard of the BHL let me give you some background. The Biodiversity Heritage Library (BHL) is a consortium of

a. Can you provide website addresses for live implementations that you feel are suitable representatives for this group?

How Primo Works VE. 1.1 Welcome. Notes: Published by Articulate Storyline Welcome to how Primo works.

Metadata Standards and Applications

Brown University Libraries Technology Plan

SobekCM. Compiled for presentation to the Digital Library Working Group School of Oriental and African Studies

CodeSharing: a simple API for disseminating our TEI encoding. Martin Holmes

Getting technical an overview

The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data

EXTENDING OAI-PMH PROTOCOL WITH DYNAMIC SETS DEFINITIONS USING CQL LANGUAGE

oatd.org Discovery for Open Access Theses and Dissertations An ASERL Webinar, October 15, 2013 These slides:

adore: a modular, standards-based Digital Object Repository

Using DSpace for Digitized Collections. Lisa Spiro, Marie Wise, Sidney Byrd & Geneva Henry Rice University. Open Repositories 2007 January 23, 2007

Bridging Continents. Kazu Yamaji National Institute of Informatics JAPAN

Electronic Resources Simplexity

Search Interoperability, OAI, and Metadata

CARARE Training Workshops

Whitepaper. Solving Complex Hierarchical Data Integration Issues. What is Complex Data? Types of Data

Orbis Cascade Alliance Content Creation & Dissemination Program Digital Collections Service. Enabling OAI & Mapping Fields in Digital Commons

Comparing Open Source Digital Library Software

Networking European Digital Repositories

Joining the BRICKS Network - A Piece of Cake

Citation Services for Institutional Repositories: Citebase Search. Tim Brody Intelligence, Agents, Multimedia Group University of Southampton

Main focus of the of the presentation

Follow this and additional works at:

CURZON PR BUYER S GUIDE WEBSITE DEVELOPMENT

Discovery services: next generation of searching scholarly information

The New Digest. Examples of the new Staff and Student Digest s:

The Complete. Solution. Fast, simple access to all of the library s full-text content (electronic and print)

Interoperability for Digital Libraries

Transcription:

Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH Chuck Koscher Technology Director, CrossRef ckoscher@crossref.org

Scholarly Publishing Trends Everything is online if it s not online, it doesn t exist Everything is interlinked if it s not linked it doesn t exist Breaking of barriers between academic and consumer behavior user expectations are set by Google, ebay, etc. Journal brand strong but moving to article economy Economic models changing Open Access Technical Reports and other grey lit are now findable Books going online Find-ability precedes usability, you can not use what you can not find" STM-TMR 2006 Amanda Spiteri, Marketing Director Elsevier

Getting noticed requires a store window One window: Your Web page but there are billions of web pages Users must know the URL User might have brand affinity User must read their RSS feeds Content may be indexed by a search engine

There are lots of windows among others

Metadata distribution via standardized methods is the bridge to these windows for your content Strength Complexity Targeted use RSS Wide adoption, great support, browser integrations, mass-user appeal. Simple to create and distribute. Just create an XML file and stick it on your web server Distribution of newsy data most often for human consumption OpenURL All inclusive specification, well positioned for advanced or diverse applications. Simple to complex syntax, only the more basic examples are human readable. Software implementation can be complex, lots of decision paths. Distribution of metadata or content of individual items, most likely implemented as part of a linking system. OAI-PMH Robust well thought out transaction model. Very extensible and adaptable. Wide spread adoption within the industry. Implementation is moderate to complex. Good frameworks (OCLC) available. Requires substantial resources (compute and human) for any non-trivial repository. Distribution of large volumes of metadata most likely to automated harvesters.

OpenURL is packaging OpenURL is a transport syntax (a box), a way to send Meta Data Context Object is an internal wrapper (box within a box) Complexity stems from the number of ways you can accomplish the same task: send metadata to a service (a resolver) OpenURL Meta Data OpenURL Context Object Meta Data OpenURL referent reference OpenURL Context Object referent reference OpenURL context reference Context Object OpenURL context reference Context Object Meta Data Meta Data Meta Data referent reference Meta Data

OpenURL basic example http://www.crossref.org/openurl?url_ver=z39.88-2004 &rft_id=info:doi/10.1361/15477020418786&noredirect=true

OpenURL: In-Line context object example http://www.crossref.org/openurl? url_ver=z39.88-2004 &url_tim=2004-01-09 &url_ctx_fmt=info:aofi/fmt:akev:amtx:actx &ctx_ver=z39.88-2004 &ctx_enc=info:aofi/enc:autf-8 &ctx_id=345871 &ctx_tim=2002-03-20t08:a55:a12z &rft_val_fmt=info:aofi/fmt:akev:mtx:journal &rft.atitle=isolation+of+a+common+receptor+for+coxsackie+b &rft.jtitle=science &rft.aulast=bergelson &rft.auinit=j &rft.date=1997 &rft.volume=275 &rft.spage=1320 &rft.epage=1323 &rfe_val_fmt=info:ofi/fmt:kev:mtx:journal &rfe.atitle=p27-p16+chimera:+a+superior+antiproliferative &rfe.jtitle=molecular+therapy&rfe.aulast=mcarthur &rfe.aufirst=james&rfe.date=2001 &rfe.volume=3 &rfe.issue=1 &rfe.spage=8 &rfe.epage=13 &req_ref_fmt=http://lib.caltech.edu/fmt/ldap-mtx.html &req_ref=http://ldap.caltech.edu/janed/record.txt

http://www.crossref.org/openurl NISO Z39.88-2004 OpenURL is a very comprehensive framework! CrossRef implemented the San Antonio Profile #1 The basic inline by value model might address a high percentage of actual needs By consolidating metadata in one place (CrossRef), publishers have created an ideal circumstance for a single resolver to reach a large amount of content. An OpenURL solution is not embodied in a single place. It is a community of contributors using a common language. OpenURL is the Esperanto of linking. No CrossRef account needed, available free to the public Number of resolutions in 2006 => 608,756

OAI-PMH is a set of commands used to pull metadata from a compliant repository Verb Identify ListMetadata Formats ListSets ListIdentifiers ListRecords GetRecord Use Ask a repository to tell you about itself. Ask a repository which formats (XML schemas) data is available in. Compliant repositories support Dublin Core. Ask a repository to list the hierarchical structure it uses to organize itself Ask a repository to list the identifiers in the whole repository or a particular set Ask a repository to return the metadata for all records in the repository or those in a given set Ask the repository for the metadata of a given identifier. Example oai.crossref.org/oaihandler?verb=identify oai.crossref.org/oaihandler?verb=listmetadata Formats oai.crossref.org/oaihandler?verb=identify oai.crossref.org/oaihandler?verb=listidentifiers oai.crossref.org/oaihandler?verb=listrecords& SetSpec=10.1002:300:1999 oai.crossref.org/oaihandler?verb=getrecord &metadataprefix=cr_unixml &identifier=info:doi/10.1002/jnr.490010101

OAI-PMH sample responses - Identify

OAI-PMH sample responses ListSets ListSets&resumptionToken=1160597811347!698!205002

OAI-PMH sample response verb=getrecord&metadataprefix=cr_unixml&identifier=info:doi/10.1002/jnr.490010101

OAI-PMH sample response verb=listidentifiers&metadataprefix=cr_unixml&set=10.1002:297:2004

CrossRef s OAI-PMH Mission December 2005 CrossRef announced a Web Services initiative Provide a central point for the distribution of metadata from 100s of publishers, for millions of identifiers Utilize common/existing distribution protocols and technology Targeted at consumers of mass quantities of metadata. Active: MS Academic Live and Scirus (search engines) Looking: EBSCO, Euopean Biomatics Institute, others Is not open (e.g. it is not free), uses IP authentication for access control Recipient identified by 2 IP address ranges Content can be selectively mapped to a recipient (opt-in/opt-out) at the publisher or title level

RSS CrossRef is not currently operating any RSS feeds (we have Blogs which are kinda sorta the same thing) Members view RSS feeds as a way to reach out and touch end users and bring them to the member s site For end uses: OpenURL is like plumbing ( Intel inside ), they really don t care OAI-PMH is a what? RSS they ve probably heard of (blogs) and may even know how to use CrossRef members have recognized the need to establish guidelines on content composition by feed type. e.g. a TOC feed should be organized the same way from one publisher to the next in order to avoid end user confusion. (a NISO initiative?)

RSS syndication Of course RSS is used for syndication as well Example: Syndication feed Google accepts RSS (Real Simple Syndication) 2.0 and Atom 0.3 feeds. Generally, you would use this format only if your site already has a syndication feed. Note that this method may not let Google know about all the URLs in your site, since the feed may only provide information on recent URLs. http://www.google.com/support/webmasters/bin/answer.py?answer=34656&ctx=sibling Google uses the <link> field in your feed to gather URLs from your site and uses the modified date field (the <pubdate> field for RSS feeds and the <modified> date for Atom feeds) to learn when each URL was last modified Make sure that the feed is located in the highest-level directory you want search engines to crawl

Conclusion Bringing users to content requires metadata distribution Be complete (article title, all authors, citations) Be accurate (author=given-name + surname, not the entire byline) Use a widely accepted (and expressive) format: NLM, DC, CrossRef Position metadata for discovery Aggregated distribution like CrossRef s PMH service Register as a PMH data provider (http://www.openarchives.org/data/registerasprovider.html) Find syndication channels (syndication.iop.org, Feedzilla, MedicineNet)