PREMIS, METS and preservation metadata:

Transcription:

PREMIS, METS and preservation metadata: emerging trends and future directions
Eld Zierau, The Royal Library of Denmark

Introduction
My background
  Masters in Computer Science in 1989
  At the Royal Library of Denmark since 2007
    Strategy and design of preservation systems
    Creation of preservation policies and strategies
    Policies for using preservation metadata
  PhD in Digital Preservation in 2011
Currently at the Royal Library
  SIFD, the digital library: management, dissemination and preservation
  Packaging and re-packaging for the Bit Repository: WARC, METS, PREMIS
  Framework for OAIS and Distributed Digital Preservation

Contents of this presentation
Practices (at The Royal Library)
  Strategies and policies
  Putting it into practice
Challenges
  Expressing preservation levels and intellectual entities
  Preserving preservation metadata
  Expressing preservation levels and intellectual entities over time
An example on the bit level
  Risks mitigated in bit preservation
  Bit integrity/safety, confidentiality and availability
Types of preservation levels
  How to express them, also over time
Identification of intellectual entities
  How to express them, also over time
Summary

Preservation Strategies
[Diagram. Labels: Digital preservation; Logical preservation (Migration, Emulation); Bit preservation; Technology preservation; bit sequence 0101100010001000]

Currently at The Royal Library
Strategy and policies
  Bit preservation
  Logical preservation
Putting it into practice
  The chosen metadata standards
  The Digital Library infrastructure
  The Danish Bit Repository Framework

Metadata Standards and use
Inspired by the Australian way: http://www.dlib.org/dlib/march08/pearce/03pearce.html

METS document <mets>
  METS header <metsHdr>: <agent>, <altRecordID>, <metsDocumentID>
  Descriptive metadata <dmdSec>
    Wrapped MODS <mdWrap><xmlData>
  Administrative metadata <amdSec>
    Technical metadata <techMD>
    Rights metadata <rightsMD>
    Digital provenance metadata <digiprovMD>
    Analog/digital source metadata <sourceMD>
  File metadata <fileSec>
  Structural map <structMap>
  Structural link metadata <structLink>
  Behavior metadata <behaviorSec>

Wrapped content, each in <mdWrap><xmlData>:
  PREMIS object part
  MODS
  PREMIS rights part
  ???
  PREMIS agent part
  PREMIS event part
  PREMIS preservationlevel part
  Video: ???
  Sound: AES
  Images: MIX

Legend: METS element, MODS element, PREMIS element, MIX element, AES element; To be decided, Will be included, May be left out

So what to do
  PREMIS does not do it all (Richard), e.g. use other standards for technical metadata
  PREMIS: do not re-invent (Richard)
  Sustainability, community (Huw Jones, Dave)
  Rights packaging (All): as it suits your organisation
  Material representations (Angela, Robert)
  AIP: be careful, different levels, a lot of information
  Events: when useful, on different levels (audit trail)
  All sorts of agents: challenges with description
  METS: why and on what? (Steffen)
  Large METS (Steffen, Huw Jones)
  Tree structure, not a net (as e.g. web or emails)

Digital Library infrastructure
[Diagram. Labels: Preservation, Dissemination, BR, Access, Ingest, Management, Representations with metadata]

Challenges with metadata
  Preserving preservation metadata (if )
  Expressing preservation levels
  Expressing preservation levels over time
    We need to look more closely at bit preservation to define levels and levels over time
  Expressing Intellectual Entity (identifiers)
  Expressing Intellectual Entity (identifiers) over time

A General View of a Bit Repository
Elements in bit preservation:
  Number of copies
  Independence between copies
  Frequency of integrity checks
[Diagram. Labels: Bit sequences; Bit Repository; General System Layer; Pillar Layer with Pillar 1 to Pillar 8]

Integrity: Bit error
Risk: Bits can change value
  1. Error has occurred in backup
  2. File is corrupted
  3. Error is not discovered
  4. Cannot determine which file is intact
File in form of bit stream
Solutions:
  1. No backup. All are copies of data
  2. Vote on which copy is the right one

Bit error: System Layer
[Diagram. Labels: Bit Repository (BR), System Layer, Pillar Layer]
Solutions:
  1. System layer checks and follows up on the basis of comparing copies
  2. Minimum three voters, optimize by checksums

Integrity: Bit error
Risk: Bits can change value
  1. Error has occurred in backup
  2. File is corrupted
  3. Error is not discovered
  4. Cannot determine which file is intact
File in form of bit stream (A0953B7)
Solutions:
  1. No backup. All are copies of data
  2. Vote on which copy is the right one
  3. Introduce checksums of files to discover errors
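The voting and checksum solutions above can be sketched in a few lines of Python. This is only an illustration under assumptions that are not in the slides: SHA-256 as the checksum algorithm, a strict-majority rule, and the hypothetical names checksum, vote and pillar1 to pillar3.

```python
import hashlib
from collections import Counter
from typing import Optional


def checksum(data: bytes) -> str:
    """Return the SHA-256 hex digest of one copy's bit stream."""
    return hashlib.sha256(data).hexdigest()


def vote(copies: dict) -> tuple:
    """Majority vote over the checksums of all copies.

    Returns (winning_checksum, pillars_that_disagree). The winner is None
    when no checksum is held by a strict majority of the copies, in which
    case every pillar is reported as suspect.
    """
    if not copies:
        raise ValueError("at least one copy is required")
    sums = {pillar: checksum(data) for pillar, data in copies.items()}
    winner: Optional[str]
    winner, count = Counter(sums.values()).most_common(1)[0]
    if count * 2 <= len(sums):
        return None, sorted(sums)
    return winner, sorted(p for p, s in sums.items() if s != winner)


# Three copies (the slides' minimum number of voters), one with a flipped bit.
copies = {
    "pillar1": b"0101100010001000",
    "pillar2": b"0101100010001000",
    "pillar3": b"0101100010001001",
}
winner, damaged = vote(copies)
print(damaged)
```

With only two copies that disagree, vote returns None as the winner, which is the "cannot determine which file is intact" risk from the slide.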

Integrity: Bit error
Risk: The same error occurs for more copies
  1. Same hardware
  2. Same software
  3. Same vendor
Solutions (e.g. Unix, Windows, Mac OS):
  1. Different hardware solutions
  2. Different vendors
  3. Different software (OS, interpreters, etc.)

Integrity: Disasters
Risk: All copies are damaged at the same time
  1. Natural disasters
  2. Attacks in connection with war or terror
Solutions:
  1. Different geographical locations

Integrity: Organisation
Risk: Errors/mistakes are made by the same person/organisation
  1. The same person has access and has delete rights
  2. The same person makes procedural mistakes
Solutions:
  1. Different organisations
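The integrity risks above all come down to independence between copies. A rough way to check it is to list, per dimension, the values that more than one copy shares. The Copy record, its field names and the sample values below are hypothetical and only serve this sketch.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class Copy:
    """One copy of the data and the properties it depends on (illustrative)."""
    pillar: str
    hardware: str
    software: str
    vendor: str
    location: str
    organisation: str


def shared_risks(copies):
    """Map each dimension to the values that more than one copy depends on."""
    risks = {}
    for field in fields(Copy):
        if field.name == "pillar":
            continue
        seen = {}
        for copy in copies:
            seen.setdefault(getattr(copy, field.name), []).append(copy.pillar)
        shared = {value: pillars for value, pillars in seen.items() if len(pillars) > 1}
        if shared:
            risks[field.name] = shared
    return risks


copies = [
    Copy("pillar1", "disk", "Unix", "VendorX", "Copenhagen", "Library A"),
    Copy("pillar2", "tape", "Windows", "VendorX", "Aarhus", "Library B"),
    Copy("pillar3", "optical", "Mac OS", "VendorY", "Odense", "Archive C"),
]
print(shared_risks(copies))
```

An empty result means no two copies share hardware, software, vendor, location or organisation in this simplified model.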

Confidentiality
Risk: An unauthorised party gets access to confidential data
  1. Unauthorised party gets access to the Bit Repository
  2. Unauthorised party gets access to data from the Bit Repository
[Diagram label: A0953B7]
Solutions:
  1. Authentication of users of pillars with copies
  2. Encryption internally on pillar
  3. Hardware secured in locked rooms

Confidentiality System Layer Bit Repository (BR) System Layer Pillar Layer A09537

Availability
Risk: Cannot get access as required
  1. Cannot get any response on request
  2. Processing not possible in reality
Solutions:
  1. Specialised pillar with distributed architecture

Availability
[Diagram. Labels: Bit Repository (BR), System Layer, Pillar Layer]
Solutions:
  1. Redirection if access to a pillar is down
  2. Distributed requests to different pillars
  3. Scaling
  4. Diversity
  5.
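Redirection when a pillar is down can be pictured as trying the pillars in turn until one answers. The sketch below is not the Bit Repository's actual protocol; PillarDown, fetch and the two stand-in pillar functions are invented for the illustration.

```python
class PillarDown(Exception):
    """Raised when a pillar cannot answer a request."""


def fetch(file_id, pillars):
    """Try each (name, getter) pair in order and return the first answer."""
    failures = []
    for name, getter in pillars:
        try:
            return name, getter(file_id)
        except PillarDown:
            failures.append(name)  # redirect the request to the next pillar
    raise PillarDown(f"no pillar delivered {file_id!r}; tried {failures}")


def offline_pillar(file_id):
    """Stand-in for a pillar that is currently unreachable."""
    raise PillarDown("pillar unavailable")


def online_pillar(file_id):
    """Stand-in for a pillar that answers with the stored bit stream."""
    return b"0101100010001000"


served_by, data = fetch(
    "file-1", [("pillar1", offline_pillar), ("pillar2", online_pillar)]
)
print(served_by)
```

Ordering the list by access speed, for example on-line pillars before off-line ones, would be one way to combine redirection with the other solutions on the slide.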

Bit Repository Offering Solutions
[Diagram. Labels: BR; General System Layer; Pillar Layer with Pillar 1 to Pillar 9]
Pillars differ in:
  Media
  Data safety
  Access speed
  On-line / Off-line
  Organisational placement
  Geographical placement

Bit Safety
Values for preservationleveltype = BitSafety (from http://id.kb.dk/vocabulary/)

  Value      Comment
  Max        Maximum bit safety
  VeryHigh   Very high bit safety
  High       High bit safety
  Medium     Medium bit safety
  Low        Low bit safety
  VeryLow    Very low bit safety
  Min        Minimum bit safety

Policy: As high bit safety as we can get
Strategy 2013: 10 copies spread over 3 continents, on both optical and magnetic media, checked every
Strategy 2050: 8 copies; at least 2 on Mars, at least two written to DNA, checked every

Confidentiality
Values for preservationleveltype = Confidentiality (from http://id.kb.dk/vocabulary/)

  Value      Comment
  Max        Maximum confidentiality
  VeryHigh   Very high confidentiality
  High       High confidentiality
  Medium     Medium confidentiality
  Low        Low confidentiality
  VeryLow    Very low confidentiality
  Min        Minimum confidentiality

Policy: Only restricted access, where it is as hard as it can get for others when skipping encryption
Strategy 2013: No more than 2 copies, secured on off-line media
Strategy 2050: ???

Availability
Values for preservationleveltype = Availability

  Value      Comment
  Max        Maximum availability
  VeryHigh   Very high availability
  High       High availability
  Medium     Medium availability
  Low        Low availability
  VeryLow    Very low availability
  Min        Minimum availability
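Because bit safety, confidentiality and availability share one seven-step scale, the values can be treated as ordered and compared. The sketch below assumes the ordering Min < VeryLow < ... < Max implied by the comments; the Level enum and the meets helper are hypothetical names, not part of the vocabulary.

```python
from enum import IntEnum


class Level(IntEnum):
    """The seven-step scale from the slides, lowest to highest."""
    Min = 0
    VeryLow = 1
    Low = 2
    Medium = 3
    High = 4
    VeryHigh = 5
    Max = 6


def meets(offered: dict, required: dict) -> bool:
    """True when every required level type is offered at least that high."""
    return all(
        kind in offered and Level[offered[kind]] >= Level[level]
        for kind, level in required.items()
    )


required = {"BitSafety": "High", "Confidentiality": "Low"}
offered = {"BitSafety": "VeryHigh", "Confidentiality": "Low", "Availability": "Medium"}
print(meets(offered, required))
```

Keeping the requirement as named values rather than as a number of copies matches the idea that the policy values stay the same over time while the strategy behind them changes.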

Logical Preservation
Values for preservationleveltype = logicalstrategy

  Value      Comment
  Migration  Migration of digital material to keep data interpretable
  Emulation  Emulation of digital material to keep data interpretable
  Technical  Technology preservation to keep data interpretable

Preservation Level information

  preservationleveltype           Comment
  Bit safety                      Bit preservation
  Confidentiality                 Bit preservation
  Availability                    Bit preservation
  Logical Preservation Strategy   Logical preservation

Policy: With institution preservation policies
  Express values
  Same over time
Strategy: Requirements for fulfilment with current technologies

Preservation Level in metadata

<digiprovMD CREATED="2013-01-18T19:28:01.456+01:00" ID="Premis1">
  <mdWrap MDTYPE="PREMIS">
    <xmlData>
      <preservationLevel xmlns:xlink="http://www.w3.org/1999/xlink" xsi: >
        <preservationLevelValue>bitsafetyhigh</preservationLevelValue>
        <preservationLevelDateAssigned>2013-01-18T19:28:01.458+01:00</preservationLevelDateAssigned>
      </preservationLevel>
      <preservationLevel xmlns:xlink="http://www.w3.org/1999/xlink" xsi: >
        <preservationLevelValue>logicalstrategymigration</preservationLevelValue>
        <preservationLevelDateAssigned>2013-01-18T19:28:01.459+01:00</preservationLevelDateAssigned>
      </preservationLevel>
      <preservationLevel xmlns:xlink="http://www.w3.org/1999/xlink" xsi: >
        <preservationLevelValue>confidentialitylow</preservationLevelValue>
        <preservationLevelDateAssigned>2013-01-18T19:28:01.460+01:00</preservationLevelDateAssigned>
      </preservationLevel>
    </xmlData>
  </mdWrap>
</digiprovMD>
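To see how dated preservation levels could be read back "over time", here is a minimal Python sketch using the standard library's xml.etree.ElementTree. Several things are assumptions rather than content of the slides: the sample is a simplified fragment that is not schema-valid, it uses the PREMIS 2.x namespace info:lc/xmlns/premis-v2, the 2012 entry is invented to show a change, and the level type is taken to be encoded as a prefix of the value (bitsafety, confidentiality).

```python
import xml.etree.ElementTree as ET
from datetime import datetime

PREMIS = "info:lc/xmlns/premis-v2"  # PREMIS 2.x namespace
NS = {"premis": PREMIS}

# Simplified, hypothetical fragment; element names follow PREMIS 2.x.
SAMPLE = """\
<premis:object xmlns:premis="info:lc/xmlns/premis-v2">
  <premis:preservationLevel>
    <premis:preservationLevelValue>bitsafetymedium</premis:preservationLevelValue>
    <premis:preservationLevelDateAssigned>2012-06-01T10:00:00+02:00</premis:preservationLevelDateAssigned>
  </premis:preservationLevel>
  <premis:preservationLevel>
    <premis:preservationLevelValue>bitsafetyhigh</premis:preservationLevelValue>
    <premis:preservationLevelDateAssigned>2013-01-18T19:28:01+01:00</premis:preservationLevelDateAssigned>
  </premis:preservationLevel>
  <premis:preservationLevel>
    <premis:preservationLevelValue>confidentialitylow</premis:preservationLevelValue>
    <premis:preservationLevelDateAssigned>2013-01-18T19:28:02+01:00</premis:preservationLevelDateAssigned>
  </premis:preservationLevel>
</premis:object>
"""


def levels(xml_text):
    """Return (date_assigned, value) pairs, oldest first.

    Assumes every preservationLevel carries both a value and a date.
    """
    root = ET.fromstring(xml_text)
    found = []
    for level in root.iter(f"{{{PREMIS}}}preservationLevel"):
        value = level.findtext("premis:preservationLevelValue", namespaces=NS)
        assigned = level.findtext("premis:preservationLevelDateAssigned", namespaces=NS)
        found.append((datetime.fromisoformat(assigned.strip()), value.strip()))
    return sorted(found)


def current(xml_text, prefix):
    """Return the most recently assigned value starting with prefix, or None."""
    matching = [value for _, value in levels(xml_text) if value.startswith(prefix)]
    return matching[-1] if matching else None


print(current(SAMPLE, "bitsafety"))
```

Keeping the earlier entries instead of overwriting them is what makes the history of a level recoverable later.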

Identification in the future 15AE9513 Producer Object Object id. Service Provider Object id. & Service Object Consumer 15AE9513

Preservation Level in metadata

<techMD CREATED="2013-01-18T19:28:01.426+01:00" ID="PremisObject1">
  <mdWrap MDTYPE="PREMIS:OBJECT">
    <xmlData>
      <object xmlns:xlink="http://www.w3.org/1999/xlink" xsi: >
        <objectIdentifier>
          <objectIdentifierType>uuid</objectIdentifierType>
          <objectIdentifierValue>41d153d0-0099-11e2-9397-005056887b67</objectIdentifierValue>
        </objectIdentifier>
        <linkingIntellectualEntityIdentifier>
          <linkingIntellectualEntityIdentifierType>uuid</linkingIntellectualEntityIdentifierType>
          <linkingIntellectualEntityIdentifierValue>41d153d1-0099-11e2-9397-005056887b67</linkingIntellectualEntityIdentifierValue>
        </linkingIntellectualEntityIdentifier>
      </object>
    </xmlData>
  </mdWrap>
</techMD>

Summary
PREMIS, METS and preservation metadata: emerging trends and future directions
  Choosing preservation metadata standards: which and how
  Preserving preservation metadata: which and how
  Some challenges for the future
    Definition of preservation levels and intellectual entities
    Expressing preservation levels and intellectual entities
    Expressing preservation levels and intellectual entities over time

Questions and Comments