An Introduction to PREMIS. Jenn Riley Metadata Librarian IU Digital Library Program

Similar documents
The OAIS Reference Model: current implementations

An overview of the OAIS and Representation Information

The Promise of PREMIS: background, scope and purpose of the Data Dictionary for Preservation Metadata

Digits Fugit or. Preserving Digital Materials Long Term. Chris Erickson - Brigham Young University

3. Technical and administrative metadata standards. Metadata Standards and Applications

ISO INTERNATIONAL STANDARD. Information and documentation Managing metadata for records Part 2: Conceptual and implementation issues

Description Cross-domain Task Force Research Design Statement

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Slide 1 & 2 Technical issues Slide 3 Technical expertise (continued...)

Exploring the Concept of Temporal Interoperability as a Framework for Digital Preservation*

The digital preservation technological context

ISO Information and documentation Digital records conversion and migration process

From production to preservation to access to use: OAIS, TDR, and the FDLP OAIS TRAC / TDR

ISO INTERNATIONAL STANDARD. Information and documentation Records management processes Metadata for records Part 1: Principles

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Minimum Mandatory Metadata Set for RAIDmap

Its All About The Metadata

Certification Efforts at Nestor Working Group and cooperation with Certification Efforts at RLG/OCLC to become an international ISO standard

Description Cross Domain - Metadata Schema Registry Presentation to ISO Working Group Sydney, 2 November 2004

Alphabet Soup: A Metadata Overview Melanie Schlosser Metadata Librarian

Document Title Ingest Guide for University Electronic Records

AS/NZS ISO 13008:2014

Archival Information Package (AIP) E-ARK AIP version 1.0

Building Consensus: An Overview of Metadata Standards Development

Digital Preservation Standards Using ISO for assessment

Digital Library Curriculum Development Module 4-b: Metadata Draft: 6 May 2008

Robin Dale RLG

Preservation Health Check: introduction to the pilot

The Open Archives Initiative and the Sheet Music Consortium

Recordkeeping Standards Analysis of HealthConnect

Rules for Archival Description and Encoded Archival Description: Competing or Compatible Standards?

Preservation Planning in the OAIS Model

ISO ARCHIVE STANDARDS: STATUS REPORT

Digital Preservation with Special Reference to the Open Archival Information System (OAIS) Reference Model: An Overview

DRS 2 Glossary. access flag An object access flag records the least restrictive access flag recorded for one of the object s files: ο ο

Software Requirements Specification for the Names project prototype

Archivists Toolkit: Description Functional Area

Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS. Jenn Riley IU Metadata Librarian DLP Brown Bag Series February 25, 2005

Response to the CCSDS s DAI Working Group s call for corrections to the OAIS Draft for Public Examination

Management: A Guide For Harvard Administrators

PREMIS in Archivematica

Agenda. Bibliography

Data Partnerships to Improve Health Frequently Asked Questions. Glossary...9

Audit & Certification: an auditors perspective. Barbara Sierman, KB National Library of the Netherlands Royal Irish Academy, Dublin 4 june 2013

The International Journal of Digital Curation Issue 1, Volume

Long-term digital preservation of UNSWorks

Opus: University of Bath Online Publication Store

The Need for a Terminology Bridge. May 2009

ebooks Preservation at Scholars Portal Kate Davis & Grant Hurley Scholars Portal, Ontario Council of University Libraries

[MS-PICSL]: Internet Explorer PICS Label Distribution and Syntax Standards Support Document

This document is a preview generated by EVS

Digital Curators: Who, What, & How

Ensuring Proper Storage for Earth Science Data: The USGS Process to Certify Trusted Digital Repositories

Glossary of Exchange Network Related Groups

Implementation Guide for Delivery Notification in Direct

Trusted Digital Repositories. A systems approach to determining trustworthiness using DRAMBORA

RDA Resource Description and Access

DITA for Enterprise Business Documents Sub-committee Proposal Background Why an Enterprise Business Documents Sub committee

RDA: a new cataloging standard for a digital future

The International Journal of Digital Curation Issue 1, Volume

Best Practice Guidelines for the Development and Evaluation of Digital Humanities Projects

Workshop A: Using metadata to support digital preservation

Digital Preservation at NARA

For those of you who may not have heard of the BHL let me give you some background. The Biodiversity Heritage Library (BHL) is a consortium of

ISO TC46/SC11 Archives/records management

Transferring vital e-records to a trusted digital repository in Catalan public universities (the iarxiu platform)

Video Services Forum Rules of Procedure

Session Two: OAIS Model & Digital Curation Lifecycle Model

Metadata: The Theory Behind the Practice

International Audit and Certification of Digital Repositories

ETD Submission via ProQuest Step-by-Step

Long-Term Preservation Services

<goals> 10/15/11% From production to preservation to access to use: OAIS, TDR, and the FDLP

Australian Standard. Information and documentation Records management processes Metadata for records. Part 1: Principles

Network Working Group Internet-Draft October 27, 2007 Intended status: Experimental Expires: April 29, 2008

Content Management for the Defense Intelligence Enterprise

Survey of research data management practices at the University of Pretoria, South Africa: October 2009 March 2010

Draft Digital Preservation Policy for IGNCA. Dr. Aditya Tripathi Banaras Hindu University Varanasi

The Dublin Core Metadata Element Set

Susan Thomas, Project Manager. An overview of the project. Wellcome Library, 10 October

Workshop A: Using metadata to support digital preservation

Beginning To Define ebxml Initial Draft

Indexing Field Descriptions Recommended Practice

Applying Archival Science to Digital Curation: Advocacy for the Archivist s Role in Implementing and Managing Trusted Digital Repositories

B2SAFE metadata management

ISO/IEC TR TECHNICAL REPORT. Information technology Procedures for achieving metadata registry (MDR) content consistency Part 1: Data elements

Consideration of Issues and Directives Federal Energy Regulatory Commission Order No. 791 June 2, 2014

Stakeholder and community feedback. Trusted Digital Identity Framework (Component 2)

NSF Data Management Plan Template Duke University Libraries Data and GIS Services

The Sunshine State Digital Network

Digital Preservation: How to Plan

SAML V2.0 Profile for Token Correlation

[MS-DPSMDL]: Semantic Model Definition Language Data Portability Overview

ISO INTERNATIONAL STANDARD. Information and documentation Records management Part 1: General

Implementing Trusted Digital Repositories

This slide is relevant to providing either a single three hour training session or explaining how a series of shorter sessions focused on per chapter

Update on 3R Project (RDA Toolkit Restructure and Redesign Project)

The Data Management Plan: Putting policy into practice Suzanne Clarke Director, Information Resources

Preserving Electronic Mailing Lists as Scholarly Resources: The H-Net Archives

Transcription:

An Introduction to PREMIS Jenn Riley Metadata Librarian IU Digital Library Program

Outline Background and context PREMIS data model PREMIS data dictionary Implementing PREMIS Adoption and ongoing developments 2/7/2007 Digital Library Brown Bag Series 2

What is preservation metadata? Information that supports and documents the digital preservation process The problem is, we don t really know what that metadata looks like Well, we know something, but not nearly enough The community is thinking hard about the issues, but we still have very little real-world data 2/7/2007 Digital Library Brown Bag Series 3

Some related work National Library of Australia Preservation Metadata for Digital Collections (October 1999) Reference Model for an Open Archival Information System (OAIS) (June 2001) RLG/OCLC report Trusted Digital Repositories: Attributes and Responsibilities (May 2002) OCLC/RLG Metadata Framework to Support the Preservation of Digital Objects (June 2002) National Library of New Zealand Metadata Standards Framework Preservation Metadata (November 2002) RLG/NARA Audit Checklist for the Certification of Trusted Digital Repositories (August 2005) 2/7/2007 Digital Library Brown Bag Series 4

What is PREMIS? PREservation Metadata Implementation Strategies A working group of over 30 members sponsored by OCLC and RLG A data dictionary for preservation metadata included in the May 2005 final report of the working group 2/7/2007 Digital Library Brown Bag Series 5

Charge to the PREMIS working group define an implementable set of core preservation metadata elements, with broad applicability within the digital preservation community; draft a Data Dictionary to support the core preservation metadata element set; examine and evaluate alternative strategies for the encoding, storage, and management of preservation metadata within a digital preservation system, as well as for the exchange of preservation metadata among systems; conduct pilot programs for testing the group s recommendations and best practices in a variety of systems settings; and explore opportunities for the cooperative creation and sharing of preservation metadata. 2/7/2007 Digital Library Brown Bag Series 6

Working group structure Implementation Strategies Subgroup examined various strategies for encoding, storing, and managing preservation metadata within digital preservation systems performed survey of existing and planned digital preservation systems Core Elements Subgroup defined core elements drafted data dictionary 2/7/2007 Digital Library Brown Bag Series 7

How PREMIS defines preservation metadata The information a repository uses to support the digital preservation process Metadata that supports viability renderability understandability authenticity identity Mandatory elements represent the minimum amount for [a] second repository to accept custody of [a] digital object and assume responsibility for its long-term preservation 2/7/2007 Digital Library Brown Bag Series 8

PREMIS goals Build on the OAIS reference model Be implementation independent Provide a starting point for improvements and enhancements based on community experience and feedback 2/7/2007 Digital Library Brown Bag Series 9

Development strategies Paid particular attention to documenting digital provenance relationships Whenever possible the group defined elements that do not require human intervention to supply or analyze, but did not limit to these Defined semantic units rather than metadata elements 2/7/2007 Digital Library Brown Bag Series 10

Defining core Things that most working preservation repositories are likely to need to know in order to support digital preservation Core does not necessarily mean mandatory Core elements define information that a repository needs to know, regardless of how, or even whether, that information is stored Core elements support checking of: Fixity object is unchanged since some previous time Integrity compliant with relevant specifications Authenticity object is what it purports to be 2/7/2007 Digital Library Brown Bag Series 11

Outline Background and context PREMIS data model PREMIS data dictionary Implementing PREMIS Adoption and ongoing developments 2/7/2007 Digital Library Brown Bag Series 12

The PREMIS data model 2/7/2007 Digital Library Brown Bag Series 13

Intellectual entities A coherent set of content that is reasonably described as a unit Can include other Intellectual Entities May have one or more digital representations May not be managed by all repositories 2/7/2007 Digital Library Brown Bag Series 14

Objects A discrete unit of information in digital form Is a static set of bits that cannot be modified Three subtypes File Bitstream Representation 2/7/2007 Digital Library Brown Bag Series 15

File object A named and ordered sequence of bytes that is known by an operating system Defined like file in common usage No restriction on format, etc. 2/7/2007 Digital Library Brown Bag Series 16

Bitstream object Contiguous or non-contiguous data within a file that has meaningful common properties for preservation purposes Defined differently than common usage Can t span files Must have some sort of reformatting to be made into a file Weird exception: filestreams Bitstreams that don t need additional information to be transformed into a file Follow all the file rules in the data dictionary, not the bitstream rules 2/7/2007 Digital Library Brown Bag Series 17

Representation object The set of files, including structural metadata, needed for a complete and reasonable rendition of an Intellectual Entity More than one Representation may exist for each Intellectual Entity Repository doesn t necessarily have to track representations 2/7/2007 Digital Library Brown Bag Series 18

Objects example: ETD 2/7/2007 Digital Library Brown Bag Series 19

Events The Event entity aggregates metadata about actions that involve at least one object or agent known to the preservation repository Many types of Events might be of interest to a preservation repository Creation of a new version of an object Create/alter relationships Validity/integrity checking etc. All Events have outcomes Some Events have outputs 2/7/2007 Digital Library Brown Bag Series 20

Agents A person, organization, or software program associated with preservation events in the life of an object Agents represented minimally in PREMIS Means of identification Classification as person, organization, or software Assumes other initiatives will more fully define Agents Agents influence Objects only indirectly through Events 2/7/2007 Digital Library Brown Bag Series 21

Rights Rights Statements are assertions of one or more rights or permissions pertaining to an Object and/or Agent Semantic units related to rights restricted to those concerned with preservation activities All expressible as Agent A grants this permission for Object B. 3 semantic units allowed act expiration date of the permission all other terms, conditions, restrictions and/or limitations Acknowledges much more work needs to be done 2/7/2007 Digital Library Brown Bag Series 22

Relationships between entities Between objects Structural relationships Derivation relationships Dependency relationships Others defined by data model indicated in data dictionary by linking attributes 2/7/2007 Digital Library Brown Bag Series 23

Outline Background and context PREMIS data model PREMIS data dictionary Implementing PREMIS Adoption and ongoing developments 2/7/2007 Digital Library Brown Bag Series 24

The PREMIS data dictionary Defines semantic units for: Objects Events Agents Rights Intellectual Entity is out of scope because it is well served by descriptive metadata 2/7/2007 Digital Library Brown Bag Series 25

Entries include information on: Name Applicability Semantic components Examples Definition Repeatability Rationale Obligation Data constraint Object category Creation/Maintenance notes Usage notes 2/7/2007 Digital Library Brown Bag Series 26

Sample data dictionary entry 2/7/2007 Digital Library Brown Bag Series 27

Outline Background and context PREMIS data model PREMIS data dictionary Implementing PREMIS Adoption and ongoing developments 2/7/2007 Digital Library Brown Bag Series 28

Role of a preservation policy PREMIS helps a repository to implement a preservation policy; it doesn t set that policy Policy can be complicated Is descriptive metadata part of an Intellectual Entity? If so, should we treat it as a file? Is PREMIS data itself a file (or a bitstream) that is managed by the repository? etc., ad infinitum The data dictionary is only a starting point, does not include all information needed to preserve an Object 2/7/2007 Digital Library Brown Bag Series 29

Relationship to technical metadata PREMIS semantic units restricted to: intellectual characteristics characteristics common to all formats Some overlap between PREMIS semantic units and elements defined by various technical metadata standards 2/7/2007 Digital Library Brown Bag Series 30

Other types of metadata Structural and rights metadata fall at least partly within the scope of PREMIS, but perhaps not entirely Descriptive metadata is useful for defining Intellectual Entities, which are managed by some repositories 2/7/2007 Digital Library Brown Bag Series 31

Lack of relevant content standards and controlled vocabularies Only in some cases do definitions of semantic elements provide guidance on how to structure the data recorded PREMIS semantic units largely outside the scope of most existing content standards PREMIS assumes that repositories will adopt or define controlled vocabularies useful to them Perhaps a common content standard isn t needed, but the lack of one does mean more decisions have to be made when implementing a repository 2/7/2007 Digital Library Brown Bag Series 32

PREMIS conformance Local metadata can be used to extend but not modify the PREMIS semantic units The mandatory semantic units of the Data Dictionary represent the information that a preservation repository must be able to associate with any archived digital object in its possession Don t have to store PREMIS information, just have to know it Currently no formal means of stating or testing conformance 2/7/2007 Digital Library Brown Bag Series 33

XML Schemas Literal representations of the semantic units and attributes of the PREMIS data dictionary Of use for exchange of preservation objects Likely of less use for a repository s internal representation 5 separate schemas PREMIS container Object entity Event entity Agent entity Rights entity 2/7/2007 Digital Library Brown Bag Series 34

Outline Background and context PREMIS data model PREMIS data dictionary Implementing PREMIS Adoption and ongoing developments 2/7/2007 Digital Library Brown Bag Series 35

Adoption Hard to tell, since preservation repositories operate behind the scenes PREMIS Implementation Registry currently has 8 diverse entries Active implementer's discussion list Working Group won the 2005 Digital Preservation Coalition Digital Preservation Award Several PREMIS workshops scheduled 2/7/2007 Digital Library Brown Bag Series 36

Current PREMIS activity PREMIS Maintenance Activity hosted at the Library of Congress Editorial Committee named Commissioned report on Rights in the PREMIS Data Model Proposals for revisions of two semantic units in public comment period 2/7/2007 Digital Library Brown Bag Series 37

So where are we? The data dictionary looks to be having a big impact Discussion of preservation metadata is increasing It seems the PREMIS goal of a starting point has been well fulfilled PREMIS looks promising as a source of ideas for the IU DLP preservation repository 2/7/2007 Digital Library Brown Bag Series 38

For more information PREMIS Working Group site: <http://www.oclc.org/research/projects/pmwg/default.htm> PREMIS Maintenance Activity site: <http://www.loc.gov/standards/premis/> PREMIS Implementors Listserv: <http://listserv.loc.gov/listarch/pig.html> These presentation slides: <http://www.dlib.indiana.edu/~jenlrile/presentations/bbspr07/premis/premis.ppt> jenlrile@indiana.edu 2/7/2007 Digital Library Brown Bag Series 39