arxiv, the OAI, and peer review

Similar documents
Open Archives Initiative protocol development and implementation at arxiv

Version 2 of the OAI-PMH & some other stuff

OAI-PMH. DRTC Indian Statistical Institute Bangalore

RVOT: A Tool For Making Collections OAI-PMH Compliant

Problem: Solution: No Library contains all the documents in the world. Networking the Libraries


Exposing and Harvesting Metadata Using the OAI Metadata Harvesting Protocol: A Tutorial

OAI-PMH implementation and tools guidelines

Building Interoperable and Accessible ETD Collections: A Practical Guide to Creating Open Archives

Better interoperability through the Open Archives Initiative

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University

OAI AND AMF FOR ACADEMIC SELF-DOCUMENTATION

Interoperability and Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)

The Open Archives Initiative Protocol for Metadata Harvesting: An Introduction

Building Interoperable Digital Libraries: A Practical Guide to creating Open Archives

Interoperability, Metadata, and Complex Objects

Open Archives Forum - Technical Validation -

Harvesting Metadata Using OAI-PMH

IVOA Registry Interfaces Version 0.1

Establishing New Levels of Interoperability for Web-Based Scholarship

Creating a National Federation of Archives using OAI-PMH

BIBLID (2004) 93:1 pp (2004.6) 209. NBINet NBINet 92

Integrating Access to Digital Content

OAI Static Repositories (work area F)

Metadata Harvesting Framework

The Open Archives Initiative and the Sheet Music Consortium

OAI-ORE. A non-technical introduction to: (

Open Access Publishing with arxiv. Tommy Ohlsson KTH Royal Institute of Technology

Open Archives Initiative Object Reuse & Exchange. Resource Map Discovery

A Novel Architecture of Agent based Crawling for OAI Resources

RN Workshop Series on Innovations in Scholarly Communication: plementing the Benefits of OAI (OAI3)

The multi-faceted use of the OAI-PMH in the LANL Repository

PhysDis Electronic Dissertations in Physics

adore: a modular, standards-based Digital Object Repository

CORE: Improving access and enabling re-use of open access content using aggregations

The OAIS Reference Model: current implementations

Tutorial. Open Archive Initiative

Developing Seamless Discovery of Scholarly and Trade Journal Resources Via OAI and RSS Chumbe, Santiago Segundo; MacLeod, Roddy

An introduction to OAI-PMH

The Point of View of the Publisher

Indonesian Citation Based Harvester System

Introduction to the OAI Protocol for Metadata Harvesting Version 2.0. Hussein Suleman Virginia Tech DLRL 17 June 2002

Implementing OAI Data and Service Providers

Digitometric Services for Open Archives Environments

Open Archives Initiative Object Reuse and Exchange Technical Committee Meeting, May 29, Edited by: Carl Lagoze & Herbert Van de Sompel

Open Archives Initiative Object Reuse & Exchange. Resource Map Discovery

CodeSharing: a simple API for disseminating our TEI encoding. Martin Holmes

Repository models and policies for preservation

Hello, I m Melanie Feltner-Reichert, director of Digital Library Initiatives at the University of Tennessee. My colleague. Linda Phillips, is going

Initial Experiences Re-exporting Duplicate and Similarity Computations with an OAI-PMH Aggregator

SciX Open, self organising repository for scientific information exchange. D15: Value Added Publications IST

Types of Language Resource. The OLAC Metadata Set and Controlled Vocabularies. The Language Resources Community. Now: Underdevelopment

The Observation of Bahasa Indonesia Official Computer Terms Implementation in Scientific Publication

Taking D2D Services to the Users with OpenURL, RSS, and OAI-PMH. Chuck Koscher Technology Director, CrossRef

Archon - A Digital Library that Federates Physics Collections

Network Information System. NESCent Dryad Subcontract (Year 1) Metacat OAI-PMH Project Plan 25 February Mark Servilla

Expected and Unexpected Synergies

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Outline of the course

OAI-PMH repositories: Quality issues regarding metadata and protocol compliance

2nd Technical Validation Questionnaire - interim results -

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

mod_oai: An Apache Module for Metadata Harvesting

Applying SOAP to OAI-PMH

Scholarly collaboration platforms

Digital Objects, Data Models, and Surrogates. Carl Lagoze Computing and Information Science Cornell University

OpenData Hackathon Δημόσια, Ανοικτά Δεδομένα H εμπειρία του Εθνικού Κέντρου Τεκμηρίωσης

NRF Open Access Statement

A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research

Joining the BRICKS Network - A Piece of Cake

Networking European Digital Repositories

Towards repository interoperability

A metadata framework to support scholarly communication

Putting Open Access into Practice

Registry Interchange Format: Collections and Services (RIF-CS) explained

SciVerse Scopus. Date: 21 Sept, Coen van der Krogt Product Sales Manager Presented by Andrea Kmety

CARARE Training Workshops

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

Open Access to Publications in H2020

Invenio: A Modern Digital Library for Grey Literature

Lessons Learned with Arc, an OAI-PMH Service Provider

EXTENDING OAI-PMH PROTOCOL WITH DYNAMIC SETS DEFINITIONS USING CQL LANGUAGE

Engaging and Connecting Faculty:

Developing ArXivSI to Help Scientists to Explore the Research Papers in ArXiv

Increasing access to OA material through metadata aggregation

Chuck Cartledge, PhD. 25 February 2018

Integrating research data into the publication workflow: the ebank UK experience

Corso di Biblioteche Digitali

The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data

For Attribution: Developing Data Attribution and Citation Practices and Standards

DRIVER Step One towards a Pan-European Digital Repository Infrastructure

An RDF NetAPI. Andy Seaborne. Hewlett-Packard Laboratories, Bristol

USING TIMED-RELEASE CRYPTOGRAPHY TO MITIGATE PRESERVATION RISK OF EMBARGO PERIODS

Surveying the Digital Library Landscape

Flexible Design for Simple Digital Library Tools and Services

Development of an Ontology-Based Portal for Digital Archive Services

Showing it all a new interface for finding all Norwegian research output

ResourceSync Towards a Web-Based Approach for Resource Synchronization

From Fulltext Documents to Structured Citations: CERN automated Solution

Networking European Digital Repositories

Transcription:

arxiv, the OAI, and peer review Simeon Warner (arxiv, Los Alamos National Laboratory, USA) (simeon@lanl.gov) Workshop on OAI and peer review journals in Europe, Geneva, 22 24 March 2001 1

What is arxiv? http://arxiv.org/ aka Los Alamos e-print archive, formerly xxx. ( e-prints may be unpublished works, pre-prints or published works.) Unrefereed author self-archiving. No-fee retrieval by users worldwide.

Evolution Aug 1991 Physics e-print archive started: hep-th archive with email interface. 1992 ftp interface added. hep-ph and hep-lat added locally; alggeom, astro-ph and cond-mat added remotely. Dec 1993 Web interface added. Nov 1994 Data at some remote archives (using the same software) moved to main site, the remote sites become mirrors. Jun 1995 Automatic PostScript generation from TEX source. Apr 1996 PDF generation added. Jun 1996 Web upload facility added. from 1996 Worldwide mirror network grows. from 1999 arxiv involved in the OAI.

The present Covers physics, math, computer science, and non-linear systems. Serves over 70,000 users in over 100 countries. Estimated 13 million downloads in 2000. Over 30,000 new submissions in 2000, over 150,000 e-prints total (approximately linear growth in submission rate, 3500 extra each year). 99% of submissions entirely automated. Submission via web (68%), email (27%) and ftp (5%). Some journals now accept and arxiv identifiers instead of requiring direct submission (e.g. APS: Phys. Rev. D, Elsevier: Phys. Lett. B). Los Alamos site funded by DOE and NSF; mirror sites funded locally.

Involvement of arxiv in the OAI The meeting held in Santa Fe in 1999, from which OAI has emerged, was organized by Paul Ginsparg (arxiv), Rick Luce and Herbert Van de Sompel. arxiv has continued to be actively involved in both management and technical development. The subset Dienst protocol resulting from the Santa Fe meeting was implemented at arxiv by 15 February 2000. Initial focus of the OAI was e-print archive interoperability. While the scope of OAI has expanded considerably, the e-print community has led the protocol development. The e-print community is, so far, the only community to have defined community specific formats for use in the OAI (e.g. description section of the Identify verb).

OAI protocol v1.0 Protocol for metadata harvesting data providers e.g. arxiv service providers e.g. arc 5 verbs: Identify, ListSets, ListMetadataFormats, GetRecord, ListIdentifiers, ListRecords Concepts in protocol: identifiers, datestamps, sets, deleted records, metadata formats, and flow control.

Metadata formats OAI supports parallel metadata sets; arxiv disseminates metadata in the following formats: oai dc Dublin Core encoded in XML. oai rfc1807 RFC1807 encoded in XML. arxiv Test-bed for new internal XML metadata format. arxivold XML encoded version of current internal metadata format. amf Academic Metadata Format (draft by Krichel and Warner).

Flow control Avoid accidental DoS attack arxiv particularly vulnerable (on-the-fly PS/PDF generation) arxiv implementation: Implement partial response resumptiontoken. Implement delay with HTTP 503 and Retry-After. Successfully avoids compliant harvesters from getting blocked (e.g. arc).

OAI repositories as of 8 March 2001 arxiv (Simeon Warner) - 155522 identifiers (duplicates, 1000 identifiers/block) OCLC Theses and Dissertations Repository (Jeff Young) - 102762 (100 identifiers/block) NACA (Michael Nelson) - 6352 identifiers (all in one reply) M.I.T. Theses - 5196 identifiers (all in one reply) The Oxford Text Archive - 1290 identifiers ( 50 identifiers/block)

OAI repositories as of 8 March 2001 (contd. 1) Perseus Digital Library - 1030 identifiers (all in one reply) CogPrints - 1028 identifiers (all in one reply, eprints.org s/w) NSDL at Cornell - 870 identifiers (30 identifiers/blocks) PhysNet, Oldenburg, Germany - 472 identifiers (200 identifiers/block) Humboldt University of Berlin - 464 identifiers (200 identifiers/block) Resource Discovery Network - 388 identifiers A Celebration of Women Writers - 142 identifiers European Language Resources Association - 183 identifiers

OAI repositories as of 8 March 2001 (contd. 2) Linguistic Data Consortium - 216 identifiers University of Tennessee Libraries - 201 identifiers (20 identifiers/block) The Natural Language Software Registry - 78 identifiers CDLCIAS - 15 identifiers (3 deleted, eprints.org s/w) CDLDERM - 2 identifiers (eprints.org s/w) Tobacco Control Digital Repository - 1 identifier (eprints.org s/w)

Who is using flow control? arxiv - partial response and retry-after OCLC TDR - partial response Oxford Text Archive - partial response and retry-after NSDL - partial response

Overlays to arxiv Advances in Theoretical and Mathematical Physics (ATMP) Subscriptions for paper copy, peer reviewed. http://pascal.intlpress.com/journals/atmp/ Geometry and Topology Subscriptions for paper copy, peer reviewed, also keeps local copies. http://www.maths.warwick.ac.uk/gt/gtmono.html JHEP - The Journal of High Energy Physics No formal duplication mechanism but almost all papers also on arxiv http://jhep.cern.ch/

Cost per article $$ / article 100000 10000 1000 100 average research cost ( > $50k/article) "high-end" commercial publisher revenue ($10k-$20k/article) aggregate average publisher revenue (~$4k/article) "non-profit" publisher revenue ($1k-$2k/article) electronic "start-up" publisher revenue ($500-$1k/article) "web printer" (~$100/article) 10 1 "arxiv" ($1-$5/article) source: Paul Ginsparg arxiv is inexpensive peer review adds $500 per article

What does arxiv provide? Minimal screening (we hope to improve moderator coverage) Low level of formatting control Size control (important for worldwide access) free access long term availability

arxiv and peer review Easy to think of arxiv as passively orthogonal to peer review (perhaps Elsevier does?). in some fields (notably hep-th), arxiv makes peer review obsolete for scientific communication because of the speed with which the field evolves. arxiv could support separation of publication and peer review by storing certification information.

That s all folks...