Richard Marciano, Alexandra Chassanoff, David Pcolar, Bing Zhu, Chien-Yi Hu. March 24, 2010

What is the feasibility of repository interoperability at the policy level? Can a preservation environment be assembled from two existing repositories? Can the policies of the federation be enforced across the repositories? What fundamental mechanisms are needed within a repository to implement new policies?

Persistent, Interoperable, Scalable, Distributed, Flexible, Semantic

iRODS
- open source
- federated storage
- rule engine and microservices

Fedora
- open source
- file-centric
- front-end APIs
- richer metadata

- Integrate views of content, original arrangement (hierarchy), and metadata
- Create an audit trail of policy execution events and related provenance information
- Demonstrate policy management through Fedora
- Demonstrate policy management from iRODS

The Carolina Digital Repository (CDR) is designed as a repository for material in electronic formats produced by members of the University of North Carolina at Chapel Hill community. Its chief purpose is the long-term preservation of digital assets. By preservation we mean the ability to ingest the material, index and search it, replicate it, and keep it safe from alteration. Following the standards of the Open Archival Information System (OAIS) reference model, the CDR employs Fedora for data content models and uses iRODS as a data store. The University Library is a partner in PoDRI, building on the initial Fedora-to-iRODS connector to investigate interoperability issues in greater depth.

The UNC Libraries' efforts connect technologies developed by Fedora, iRODS, PoDRI, and DCAPE, and serve as a test bed for other developing technologies. In partnership with the Renaissance Computing Institute (RENCI), Duke University, and North Carolina State University, the University Library and UNC Information Technology Services are participating in a project funded by the Triangle University Center for Advanced Studies, Inc. (TUCASI) to develop policies and infrastructures that will support the federation of independent repositories built on diverse hardware and software platforms. The CDR will form one of the core technologies for this project.

Functionality focused on core services
- Ingestion
- Indexing & Discovery
- Preservation
- Dissemination

Metadata standards
- Dublin Core based (FOXML)
- Minimal elements: Collection, Folder, Item
- Rights/Access/Use metadata (XACML based)

Where is PREMIS information stored (e.g., with an individual object or as an aggregate)? How will the collection structure be represented in the two products? How will Fedora be initialized for existing content in iRODS? How will Fedora be informed of content or metadata changes initiated directly in iRODS? How can content or metadata from Fedora be accessed by iRODS services?

1. New content ingest via Fedora
2. New content ingest via iRODS
3. Bulk registration from iRODS into Fedora
4. Update of content or metadata via Fedora
5. Update of content or metadata via iRODS
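One way to read the five use cases is by where each change originates and which repository must then be brought up to date. The sketch below only restates the list in that form; the class and field names (`Repo`, `UseCase`, `must_notify`) are hypothetical and not part of Fedora or iRODS.

```python
from dataclasses import dataclass
from enum import Enum


class Repo(Enum):
    FEDORA = "Fedora"
    IRODS = "iRODS"


@dataclass(frozen=True)
class UseCase:
    number: int
    description: str
    initiated_in: Repo  # where the ingest or update starts

    @property
    def must_notify(self) -> Repo:
        """The other repository, which has to learn about the change."""
        return Repo.IRODS if self.initiated_in is Repo.FEDORA else Repo.FEDORA


# The five use cases from the list, tagged by originating repository.
USE_CASES = [
    UseCase(1, "New content ingest via Fedora", Repo.FEDORA),
    UseCase(2, "New content ingest via iRODS", Repo.IRODS),
    UseCase(3, "Bulk registration from iRODS into Fedora", Repo.IRODS),
    UseCase(4, "Update of content or metadata via Fedora", Repo.FEDORA),
    UseCase(5, "Update of content or metadata via iRODS", Repo.IRODS),
]


def initiated_in(repo: Repo) -> list[int]:
    """Return the numbers of the use cases that start in the given repository."""
    return [uc.number for uc in USE_CASES if uc.initiated_in is repo]


if __name__ == "__main__":
    for uc in USE_CASES:
        print(f"{uc.number}. {uc.description}: notify {uc.must_notify.value}")
```

Grouping the cases this way shows that three of the five start in iRODS, which is why a notification path from iRODS into Fedora matters.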

- Use Fedora features to add and relate rich metadata, including policy, provenance, and authenticity information.
-- Fedora Digital Object (FDO) and content stores in iRODS.
-- Index selected metadata for speedy access, service bindings, etc.
-- Model the tree structure through RDF relations and navigate it through a triplestore.
-- Use iRODS rules to monitor new content for indexing and metadata extraction.
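The idea of modelling a collection tree through RDF relations can be illustrated with plain triples. This is a minimal sketch, not the project's implementation: it uses an in-memory list instead of a triplestore, uses iRODS-style paths directly as node identifiers (a real Fedora deployment would use `info:fedora/<pid>` URIs), and assumes the `isMemberOf` predicate from Fedora's external-relations vocabulary as the parent link. The function names are hypothetical.

```python
from collections import defaultdict
from posixpath import dirname

# Predicate from Fedora's RELS-EXT vocabulary, assumed here as the parent link.
IS_MEMBER_OF = "info:fedora/fedora-system:def/relations-external#isMemberOf"


def triples_for(paths):
    """Yield (child, IS_MEMBER_OF, parent) for each path and all its ancestors."""
    seen = set()
    for path in paths:
        child = path
        while True:
            parent = dirname(child)
            if parent == child or parent == "":
                break  # reached the root
            triple = (child, IS_MEMBER_OF, parent)
            if triple in seen:
                break  # this branch of the hierarchy is already recorded
            seen.add(triple)
            yield triple
            child = parent


def children_index(triples):
    """Index triples by object so that a parent's members can be looked up."""
    index = defaultdict(list)
    for subject, predicate, obj in triples:
        if predicate == IS_MEMBER_OF:
            index[obj].append(subject)
    return index


def descendants(index, node):
    """Depth-first walk of everything below `node`, in sorted order."""
    found = []
    for child in sorted(index.get(node, [])):
        found.append(child)
        found.extend(descendants(index, child))
    return found


if __name__ == "__main__":
    paths = ["/zone/coll/a.txt", "/zone/coll/sub/b.txt"]
    index = children_index(triples_for(paths))
    for node in descendants(index, "/zone/coll"):
        print(node)
```

Storing only the child-to-parent relation keeps each object's metadata self-contained; the tree is recovered by querying in the reverse direction, which is the kind of lookup a triplestore would serve.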

A microservice is triggered after each new iRODS data object is created. iRODS files are registered in Fedora through this microservice trigger, and a Fedora Digital Object (FDO) and its FOXML are created.
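The registration step can be sketched as a callback that builds a small FOXML document pointing back at the iRODS file. This is an illustration under stated assumptions, not the actual microservice: `on_data_object_created`, the `demo:` PID scheme, the `DATA` datastream ID, and the `irods.example.org` URL are all hypothetical, and the XML follows FOXML 1.1 element and attribute names in simplified form without schema validation.

```python
import xml.etree.ElementTree as ET

FOXML_NS = "info:fedora/fedora-system:def/foxml#"
MODEL_NS = "info:fedora/fedora-system:def/model#"
ET.register_namespace("foxml", FOXML_NS)


def q(tag):
    """Qualify a tag name with the FOXML namespace."""
    return f"{{{FOXML_NS}}}{tag}"


def build_foxml(pid, label, content_url, mime_type="application/octet-stream"):
    """Build a minimal FOXML object whose one datastream references external content."""
    root = ET.Element(q("digitalObject"), {"VERSION": "1.1", "PID": pid})
    props = ET.SubElement(root, q("objectProperties"))
    ET.SubElement(props, q("property"), {"NAME": MODEL_NS + "state", "VALUE": "Active"})
    ET.SubElement(props, q("property"), {"NAME": MODEL_NS + "label", "VALUE": label})
    # CONTROL_GROUP "E" marks externally referenced content: the bytes stay in iRODS.
    stream = ET.SubElement(
        root,
        q("datastream"),
        {"ID": "DATA", "STATE": "A", "CONTROL_GROUP": "E", "VERSIONABLE": "false"},
    )
    version = ET.SubElement(
        stream,
        q("datastreamVersion"),
        {"ID": "DATA.0", "LABEL": label, "MIMETYPE": mime_type},
    )
    ET.SubElement(version, q("contentLocation"), {"TYPE": "URL", "REF": content_url})
    return ET.tostring(root, encoding="unicode")


def on_data_object_created(irods_path, registry):
    """Hypothetical stand-in for the trigger: register one new iRODS file."""
    pid = f"demo:{len(registry) + 1}"  # assumed PID scheme, for illustration only
    url = "http://irods.example.org" + irods_path  # placeholder access URL
    label = irods_path.rsplit("/", 1)[-1]
    registry[irods_path] = build_foxml(pid, label, url)
    return pid


if __name__ == "__main__":
    registry = {}
    on_data_object_created("/zone/home/report.pdf", registry)
    print(registry["/zone/home/report.pdf"])
```

In a real deployment the generated FOXML would be submitted to Fedora's ingest interface rather than kept in a dictionary; the dictionary only stands in for that step.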

- iRODS storage module
- iRODS data harvester for Fedora

Continuing work on use cases
o Bulk registration from iRODS into Fedora
o Update of content or metadata via Fedora
o Update of content or metadata via iRODS

Applying preservation policies on iRODS
o integrity check
o replication
o and more...
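The two named policies, integrity check and replication, reduce to comparing checksums across copies. The sketch below shows that logic on the local filesystem as a stand-in; it does not use iRODS rules or microservices, and the function names and the choice of SHA-256 are assumptions made for illustration.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def checksum(path, algo="sha256", chunk_size=1 << 20):
    """Hash a file in chunks so that large objects do not need to fit in memory."""
    digest = hashlib.new(algo)
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def replicate(source, replica_dirs):
    """Copy `source` into each directory and confirm each copy matches the original."""
    expected = checksum(source)
    replicas = []
    for directory in replica_dirs:
        target = Path(directory) / Path(source).name
        shutil.copy2(source, target)
        if checksum(target) != expected:
            raise IOError(f"replica {target} does not match {source}")
        replicas.append(target)
    return replicas


def verify(replicas, expected):
    """Integrity check: return the replicas whose checksum no longer matches."""
    return [replica for replica in replicas if checksum(replica) != expected]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        tmp = Path(tmp)
        source = tmp / "object.dat"
        source.write_bytes(b"preserve me")
        dirs = [tmp / "resc1", tmp / "resc2"]
        for d in dirs:
            d.mkdir()
        copies = replicate(source, dirs)
        print("damaged replicas:", verify(copies, checksum(source)))
```

Recording the expected checksum at ingest and re-running `verify` on a schedule is the usual shape of such a policy; a damaged replica can then be replaced from one that still matches.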