Digital Objects, Data Models, and Surrogates. Carl Lagoze Computing and Information Science Cornell University

Similar documents
OAI-ORE. A non-technical introduction to: (

Obtain functionality

Interoperability, Metadata, and Complex Objects

Engaging and Connecting Faculty:

Fedora. An Architecture for Complex Objects and their Relationships. Carl Lagoze, Sandy Payette, Edwin Shin, Chris Wilper

Fedora. CS 431 April 17, 2006 Carl Lagoze Cornell University. Acknowledgements: Sandy Payette (Cornell)

Fedora: A network overlay approach to federated searching

Repository models and policies for preservation

Using metadata for interoperability. CS 431 February 28, 2007 Carl Lagoze Cornell University

The adore Federation Architecture

Using MPEG-21 DIP and NISO OpenURL for the Dynamic Dissemination of Complex Digital Objects in the Los Alamos National Laboratory Digital Library

Comparing Open Source Digital Library Software

FLAT: A CLARIN-compatible repository solution based on Fedora Commons

Representing and Storing Complex Digital Objects Fedora

arxiv:cs/ v1 [cs.dl] 1 Jun 2006

Combining Content, Semantic Relationships, and Web Services Fedora

Fedora Commons Update

Repository Replication Using NNTP and SMTP

Fedora Relationships and Information Network Overlays. CS 431 April 19, 2006 Carl Lagoze Cornell University

ACDH AUSTRIAN CENTRE FOR DIGITAL HUMANITIES

BPMN Processes for machine-actionable DMPs

adore: a modular, standards-based Digital Object Repository

Fedora Commons: Taking on the Challenge of the Next Generation of Scholarly Communication

DA-NRW: a distributed architecture for long-term preservation

Part 2: Current State of OAR Interoperability. Towards Repository Interoperability Berlin 10 Workshop 6 November 2012

Open source software for building open access repositories. Imma Subirats Coll knowledge and information management officer FAO of the United Nations

Metadata and Encoding Standards for Digital Initiatives: An Introduction

OpenDLib: a Digital Library Service System

The multi-faceted use of the OAI-PMH in the LANL Repository

COAR Interoperability Roadmap. Uppsala, May 21, 2012 COAR General Assembly

Building a Digital Repository on a Shoestring Budget

Edinburgh Research Explorer

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

EPrints: Repositories for Grassroots Preservation. Les Carr,

OAI-Publishers in Repository Infrastructures

Building for the Future

Surveying the Digital Library Landscape

OAI Static Repositories (work area F)

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Towards Interoperable Preservation Repositories TIPR. DLF Spring Forum, 2009 Joseph Pawletko (NYU), Priscilla Caplan (FCLA), Bill Kehoe (CUL)

The European Repositories Landscape - The view from 20,000 feet

Open Archives Forum - Technical Validation -

arxiv, the OAI, and peer review

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

An overview of the OAIS and Representation Information

Persistent identifiers, long-term access and the DiVA preservation strategy

Digital Curation and Preservation: Defining the Research Agenda for the Next Decade

Open Access Statistics: Interoperable Usage Statistics for Open Access Documents

The Fedora Project. D-Lib Magazine April An Open-source Digital Object Repository Management System. Introduction

Richard Marciano Alexandra Chassanoff David Pcolar Bing Zhu Chien-Yi Hu. March 24, 2010

ScienceSifter: Facilitating Activity Awareness in Collaborative Research Groups through Focused Information Feeds

Adding OAI ORE Support to Repository Platforms

National Documentation Centre Open access in Cultural Heritage digital content

If you build it, will they come? Issues in Institutional Repository Implementation, Promotion and Maintenance

The dawning of the Dutch network of Digital Academic REpositories (DARE): a sharing experience

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group

An Architecture to Share Metadata among Geographically Distributed Archives

Interoperability and Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)

Metadata. Week 4 LBSC 671 Creating Information Infrastructures

LORE: A Compound Object Authoring and Publishing Tool for Literary Scholars based on the FRBR. Anna Gerber, Jane Hunter

Open Archives Initiative protocol development and implementation at arxiv

Share.TEC Repository System

Research Data Repository Interoperability Primer

Using MPEG-21 DIDL to Represent Complex Digital Objects in the Los Alamos National Laboratory Digital Library

Registry Interchange Format: Collections and Services (RIF-CS) explained

Research Data Management and Institutional Repositories

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

Digital Library Curriculum Development Module 5-d: Protocols (Last Updated: )

RVOT: A Tool For Making Collections OAI-PMH Compliant

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Infrastructure for the UK

Building Collaborative Tools on NSDL 2.0. Dean Krafft, Cornell University

SobekCM. Compiled for presentation to the Digital Library Working Group School of Oriental and African Studies

Igitur Archive: Institutional Repository Utrecht University. May , Martin Slabbertje

EUROPEANA METADATA INGESTION , Helsinki, Finland

Version 2 of the OAI-PMH & some other stuff

Using an Application Profile Based Service Registry

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Ing. José A. Mejía Villar M.Sc. Computing Center of the Alfred Wegener Institute for Polar and Marine Research

Persistent Identifiers for Digital Resources

mod_oai: An Apache Module for Metadata Harvesting

Grant Agreement EUROPEANA INSIDE. Production Version of Europeana Inside. Public K-INT

Preservation for Institutional Repositories: practical and invisible

The OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data

Trusted Digital Repositories. A systems approach to determining trustworthiness using DRAMBORA

On the Effective Manipulation of Digital Objects: A Prototype-Based Instantiation Approach

Open Archives Initiative Object Reuse and Exchange Technical Committee Meeting, May 29, Edited by: Carl Lagoze & Herbert Van de Sompel

Policy-Driven Repository Interoperability: Enabling Integration Patterns for irods and Fedora

Towards repository interoperability

Bibliographic Metadata Harvesting to Support the Management of an Institutional Repository

The Metadata Challenge:

A Novel Architecture of Agent based Crawling for OAI Resources

Integrating research data into the publication workflow: the ebank UK experience

ScienceDirect. BoRIS and BIA: CRIS and Institutional Repository integration at the Free University of Bozen-Bolzano

Long-term digital preservation of UNSWorks

A Distributed Digital Library System Architecture for Archive Metadata

The Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts. Leslie Carr

Digital repositories as research infrastructure: a UK perspective

Information or What is stuff? CS 431 Architecture of Web Information Systems. Carl Lagoze Cornell University Spring 2008

DSpace Fedora. Eprints Greenstone. Handle System

Transcription:

Digital Objects, Data Models, and Surrogates m Computing and Information Science Cornell University

Pathways Project NSF grant number IIS-0430906 http://www.infosci.cornell.edu/pathways/ PIs:, Sandy Payette, Herbert Van de Sompel, Simeon Warner Research Participants: Lyudmila Balakireva, Jeroen Bekaert, Xiaoming Liu, Chris Wilper, Zhiwu Xie

Lots of types of digital objects Lot s of data models.

Dienst

Fedora

OAI-PMH Resource Abstract content Item Available structured data about the resource Record Disseminated structured data about the resource

MPEG-21 DID

METS

Interoperability Layer m Obtain DSpace arxiv Fedora adore eprints Harvest Put Shared Data Model Shared Serialization of Model Shared Services on Model Individual Models and Interfaces

First pass: Graphite Model Graph-based abstraction as common reduction across heterogeneous models (Payette and Erickson) Basis for linking distributed identified resources (content, data, services)

Pathways Core: Graph-based Data Model (Bekaert, Lagoze, Liu, Payette, Van de Sompel, Warner) Abstract Concrete

Basis for a Network of Linked Objects

Why not just asset transfer? Full transfer is only necessary for some applications e.g., preservation mirroring In fact, it some cases it is forbidden and/or undesirable So, the infrastructure and model should accommodate but not be limited to transfer

By not committing to asset transfer Avoid embedding IP issues into the core interoperability layer Accommodate service-tuned asset transfer Allow live references, rather than static copies

m Model Core Requirement Identity independent of specific schemes Lineage relationships among objects evidence of workflow for evidential citation Semantics associated with entities facilitate service mapping Recursion for n-levels of entity containment Link to concrete representation Assertion of persistence levels

Data Model

m Identity

m Lineage Relationships

m Semantics

m Recursion

m Concrete Representation

m Persistence Guarantees

Serializing the Data Model Surrogate is a serialized and transportable representation of a digital object according to the model Accessed via obtain and harvest Deposited via put Prototype serializes via RDF/XML Serializations in other formats possible e.g., DIDL, METS

Surrogates <-> Identity providerinfo records obtain information {provider, id, version} in surrogate makes it an evidential record of the digital object, provided by a specific service, in specific version

Surrogate <-> Persistence hasproviderpersistence expresses commitment of provider to persistence of entity handle Commitment can vary: e.g., transient, handle persists, versions persist, object is stable

RDF/XML Representation of the Model

RDF Graph Surrogate

REPO :10 REPO :11 REPO :11 DOI:1 preferredid provider DOI:1 preferredid provider DOI:1 preferredid provider hasproviderinfo hasproviderinfo hasproviderinfo DOI:1 original Source haslineage DOI:1 in French haslineage DOI:1 in German hasdatastream hasdatastream hasdatastream re que st requ est obtain access reply reply arxiv mediate Fedora obtain access request ingest put DSpace English- > French xlate French- >German xlate