Safe Havens in a Choppy Sea:

Similar documents
BUILDING A NEW DIGITAL LIBRARY FOR THE NATIONAL LIBRARY OF AUSTRALIA

Mass Digitisation Enabling Access, Use and Reuse

Digital Preservation Efforts at UNLV Libraries

Compound or complex object: a set of files with a hierarchical relationship, associated with a single descriptive metadata record.

Digital The Harold B. Lee Library

PROCESSING AND CATALOGUING DATA AND DOCUMENTATION: QUALITATIVE

PROCESSING AND CATALOGUING DATA AND DOCUMENTATION - QUALITATIVE

UWDCC Case Study. Supplementing DPOE Workshop 6 June Steven Dast Digital Asset Librarian UW Digital Collections Center

DRI: Dr Aileen O Carroll Policy Manager Digital Repository of Ireland Royal Irish Academy

Archiving the Web: What can Institutions learn from National and International Web Archiving Initiatives

DRS 2 Glossary. access flag An object access flag records the least restrictive access flag recorded for one of the object s files: ο ο

Long-term digital preservation of UNSWorks

The e-depot in practice. Barbara Sierman Digital Preservation Officer Madrid,

Protecting Future Access Now Models for Preserving Locally Created Content

Assessment of product against OAIS compliance requirements

Web-based workflow software to support book digitization and dissemination. The Mounting Books project

Collection Policy. Policy Number: PP1 April 2015

Deposit-Only Service Needs Report (last edited 9/8/2016, Tricia Patterson)

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

PRODUCTION BACKBONE DAM TV PRODUCTION INGEST/DELIVERY PROCESS. Post files to PBB Storage Drop ALE into Hot Folder CONCEPTUAL ALE.

RUtgers COmmunity REpository (RUcore)

PASIG Directions & Issues

CONTENTS. Cisco Internet Streamer CDS 3.0 Software Configuration Guide iii OL CHAPTER 1 Product Overview 1-1

Archiving Full Resolution Images

Its All About The Metadata

Indiana University s Media Digitization and Preservation Initiative

2010 [XOS Vault Manual] A user guide to the XOS Digital Network Vault

DRI: Preservation Planning Case Study Getting Started in Digital Preservation Digital Preservation Coalition November 2013 Dublin, Ireland

Digits Fugit or. Preserving Digital Materials Long Term. Chris Erickson - Brigham Young University

Orbis Cascade Alliance Archives & Manuscripts Collections Service. ArchivesSpace Usage Manual: Digital Objects. Introduction to Digital Objects

Born digital Hull: early steps and lessons learnt (so far) Simon Wilson, Digital Archivist (AIMS Project)

Susan Thomas, Project Manager. An overview of the project. Wellcome Library, 10 October

Accessioning. Kevin Glick Yale University AIMS

DRS Update. HL Digital Preservation Services & Library Technology Services Created 2/2017, Updated 4/2017

A Digital Preservation Roadmap for Public Media Institutions

By: Kim Schroeder. Lecturer SLIS WSU A Presentation to the NDSA and SAA Wayne State University Student Groups

August 14th - 18th 2005, Oslo, Norway. Web crawling : The Bibliothèque nationale de France experience

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Data Exchange and Conversion Utilities and Tools (DExT)

A Primer on the Use of TimeReference:!

Archivists Toolkit: Description Functional Area

Summary of Bird and Simons Best Practices

Accessioning Born-Digital Content with BitCurator

CONTENTdm Basic Skills 1: Getting Started with CONTENTdm

University at Buffalo's NEES Equipment Site. Data Management. Jason P. Hanley IT Services Manager

Implementation of the Data Seal of Approval

Kalaivani Ananthan Version 2.0 October 2008 Funded by the Library of Congress

BPMN Processes for machine-actionable DMPs

The digital preservation technological context

The Swedish National Archives digital preservation. Mats Berggren, IT-department,

Accession Procedures Born-Digital Materials Workflow

SobekCM. Compiled for presentation to the Digital Library Working Group School of Oriental and African Studies

Richard Marciano Alexandra Chassanoff David Pcolar Bing Zhu Chien-Yi Hu. March 24, 2010

Product Brief. NEW! Rimage Archiver for P2HD Safe, Portable Storage of Panasonic P2 Card Files

DAITSS Demo Virtual Machine Quick Start Guide

access to reformatted and born digital content regardless of the challenges of media failure and technological

Using DSpace for Digitized Collections. Lisa Spiro, Marie Wise, Sidney Byrd & Geneva Henry Rice University. Open Repositories 2007 January 23, 2007

NFP Client Learning Portal

Integration of non harvested web data into an existing web archive

3. Technical and administrative metadata standards. Metadata Standards and Applications

Preserving Electronic Mailing Lists as Scholarly Resources: The H-Net Archives

HOW TO Load etheses. Background:

Recital Preservation: before they fade away. DIGITAL FRONTIERS September 2016 Houston, Texas

If you are a podcast producer, or are curious to know more about how to start a plan to keep your files around forever, you ve come to the right spot.

This presentation is on issues that span most every digitization project.

Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains

Introduction to Islandora Kim Pham, Digital Projects & Technologies Librarian (UTSC) Kelli Babcock, Digital Initiatives Librarian (UTL)

CORAL Resources Module User Guide

GETTING STARTED WITH DIGITAL COMMONWEALTH

PRESERVING DIGITAL OBJECTS

Institutional repositories: description of VITAL as an example of a Fedora-based digital assets management system.

Persistent identifiers, long-term access and the DiVA preservation strategy

Oracle BI 12c: Build Repositories

Draft Digital Preservation Policy for IGNCA. Dr. Aditya Tripathi Banaras Hindu University Varanasi

Bringing the Voices of Communities Together:

How to Create a Custom Ingest Form

CONTENT PLANNING JOB AID

What is Islandora? Islandora is an open source digital repository that preserves, manages, and showcases your institution s unique material.

User Stories : Digital Archiving of UNHCR EDRMS Content. Prepared for UNHCR Open Preservation Foundation, May 2017 Version 0.5

Digital Preservation at NARA

Digital Preservation DMFUG 2017

Getting Bits off Disks: Using open source tools to stabilize and prepare born-digital materials for long-term preservation

DC Regional Fedora Users Meeting

Practical Experiences with Ingesting Materials for Long-Term Preservation

The Canadian Information Network for Research in the Social Sciences and Humanities.

Digital s ideal partner

Implementing a Data Publishing Service via DSpace. Jon W. Dunn, Randall Floyd, Garett Montanez, Kurt Seiffert

Developing a Research Data Policy

Decision document on digital archives to be transferred or outsourced.

Managing Web Resources for Persistent Access

Creating a System for the Online Delivery of Oral History Content

WIReDSpace (Wits Institutional Repository Environment on DSpace)

1Z0-430

itx Integrated Playout Platform

Terms in the glossary are listed alphabetically. Words highlighted in bold are defined in the Glossary.

Long-Term Preservation Services

Creating a System for the Online Delivery of Oral History Content

Technical Challenges in Digital Preservation

NDSA Web Archiving Survey

Preservation Metadata: Adapting or Adopting PREMIS for APSR

Transcription:

Safe Havens in a Choppy Sea: Digital Object Management Workflows at the National Library of Australia Gerard Clifton Manager, Digital and Audio Preservation Resources National Library of Australia 1

Seascape: NLA Digital Collections Digitised materials Images Sound recordings Online materials (PANDORA) Online publications, Web sites Published offline materials Documents, maps & multimedia Floppy disc, CD, DVD Manuscript materials in digital form Digital manuscripts, papers, email 2

Presentation Overview NLA Digital Services Architecture Persistent identifiers Storage Collection and management workflows Digital Collections Manager (Digitisation) PANDAS (Web archiving) Future work 3

Digital Services Architecture Principles: Support the entire activity cycle collection, storage, management, preservation, discovery, access and delivery Integrate access to print & digital materials Support hierarchical model Provide contextual data, navigation, delivery Persistent citation and access Cathro & Boston (2003) : http://www.nla.gov.au/nla/staffpaper/2003/cathro1.html 4

Digital Services Architecture Discovery ILMS OPAC Metadata Repository & Search System Persistent Access Resolver Service Delivery Collection & Management Delivery Systems Digital Collections Manager (DCM) & Digital Archiving System (PANDAS) Storage Digital Object Storage System (DOSS) 5

Persistent Identifiers <Collection> <Work> <Sequence> <Role> <Version> nla.ms ms51-13-1296 s2 t v1 Role codes (examples) Original o Examination copy e Master m View copy v Co-master c Thumbnail t Derivative master dm Related metadata rm Support for versions For further details, refer to Dack (2001) : http://www.nla.gov.au/initiatives/persistence/piappendix1.html 6

Delivery System 7 Try this page live at : http://nla.gov.au/nla.map-nk1436

Digital Services Architecture Discovery ILMS OPAC Metadata Repository & Search System Persistent Access Resolver Service Delivery Collection & Management Delivery Systems Digital Collections Manager (DCM) & Digital Archiving System (PANDAS) Storage Digital Object Storage System (DOSS) 8

Digital Object Storage Masters Derivatives (delivery) Capacities: Disc 14 TB Tape 60 TB Further information : http://www.nla.gov.au/dsp/doss/doss.doc 9

Digital Collections Workflows Digitised images (pictures, maps, manuscripts, music) Audio materials Online materials Digitisation, Imaging Services Sound Preservation Digital Audio Workstations Digital Collections Manager (DCM) Digital Archiving Section PANDAS Digital Object Storage System (DOSS) 10

Digital Collections Manager Supports digitisation workflows and object management upload preservation masters create delivery derivatives manage file storage store descriptive, technical, process metadata find & browse works, copies, jobs, projects upload, abort, download 11

Digital Collections Manager has subunit / is subunit of Set Work hierarchies Item Component Part Work is instantiated by / is instantiation of is of/ categorises Subunit Type Derivative Profiles Copy & file characteristics & relationships Process histories Tool Role Tool Functional specifications available from: http://www.nla.gov.au/dsp/doms/dcm.html File is used for / uses is used in / uses is input for / has as input Copy Process Job is output of / produces generates / is generated by has piece / is piece of is rationale for/ has rationale in Collection Pattern User Profiles Descriptive Structural Administrative Project 12

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Creation Ingest 13

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Work details (& hierarchies) can be: Imported from catalogue Imported from finding aids in EAD Created within DCM Add Copy details for each Work Characteristics of each copy Assign persistent identifiers (based on pattern for collection) 14

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Search DCM for items Add selected items to Cart Create Job 15

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Born-digital Originals Images (TIFF) Digital audio recordings (WAV or BWF) Digitised content (from physical originals) Digitised from selected copy Professional equipment & standards Images: scanners, cameras (TIFF) Audio: tape and DAT replay, A/D converters, sound cards (WAV) QUADRIGA digital audio workstations convert WAV to BWF 16

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Special step for audio content Preprocess creates XML metadata file for conversion of WAV to BWF by QUADRIGA workstation 17

3 Create Digital Content Incorporation of metadata into BWF 18

3 Create Digital Content Incorporation of metadata into BWF 19

3 Create Digital Content Incorporation of metadata into BWF 20

3 Create Digital Content Incorporation of metadata into BWF 21

3 Create Digital Content Incorporation of metadata into BWF 22

3 Create Digital Content Incorporation of metadata into BWF 23

3 Create Digital Content Incorporation of metadata into BWF Upload Upload 24

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Content submitted to uploader Check for required files Output copy records created Create version numbers Checksum generated (MD5) Extract metadata Generate delivery copies (& records) Move files to DOSS storage 25

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Metadata recorded Descriptive & Structural metadata From Work & Copy records Subunit details Parent-child relationships Process metadata Actions (Replicate, Derive, Ingest) Input and output copies Tools Technical metadata File system metadata Format-specific metadata 26

Digitisation Workflow 1 2 3 4 Create Create Work & Request Upload Digital Copy Job to storage Content details 5 Acquit Job Jobs acquitted Copies made available to system (Access restrictions remain in force) 27

Examples Image Master copy detail 28

Examples Audio Master copy detail 29

Examples Audio Master process history 30

Examples Audio Master process detail 31

Web archiving: PANDAS PANDORA Archive PANDAS (PANDORA Digital Archiving System) 32

Web Archiving: PANDAS Workflow, gathering and management system for Web materials Uses HTTrack as gatherer PI pattern reflects gathered URLs: <Collection> <Title PI> <Instance Date> <URL> nla.arc 45920 20050222 www.nla.gov.au/webarchiving/index.html 33

Web Archiving Workflow Select Register Gain Permissions Gather schedule fiters HTTrack crawl Process QA check QA fix Initial Capture Archive Preservation Master (TAR) QA Copy Archive Master (TAR) Display copies Restrict Catalogue Set Display 34

Digital Object Management Administration Management of works, copies, relationships, metadata Data Management Redundant storage and backup Refreshment cycles Restrictions on access User authentication Delivery Persistent citation and access Online delivery 35

Future enhancements Digital Collections Manager Additional formats (e.g. PDF) Rights management PANDAS Batch loading IA Heritrix crawler and ARC format 36

Future enhancements Both Review repository layer options (e.g. FEDORA) Enhance Preservation Planning and management With APSR: Enhance support for preservation metadata (PREMIS) Investigate automated monitoring of format obsolescence (PANIC) Support for preservation actions 37

Conclusion Review our position Plot our course Plan for weather Watch the horizon 38