Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

Similar documents
DRS Update. HL Digital Preservation Services & Library Technology Services Created 2/2017, Updated 4/2017

DRS Policy Guide. Management of DRS operations is the responsibility of staff in Library Technology Services (LTS).

Next-Generation Melvyl Pilot supported by WorldCat Local: The Future of Searching

Preservation and Access of Digital Audiovisual Assets at the Guggenheim

Agenda. Bibliography

GEOSS Data Management Principles: Importance and Implementation

Developing a Framework for File Format Migrations. ipres 2015 Chapel Hill, NC 3 November 2015 Joey Heinen and Andrea Goethals

The Canadian Information Network for Research in the Social Sciences and Humanities.

Applying Archival Science to Digital Curation: Advocacy for the Archivist s Role in Implementing and Managing Trusted Digital Repositories

BUILDING A NEW DIGITAL LIBRARY FOR THE NATIONAL LIBRARY OF AUSTRALIA

NSF Data Management Plan Template Duke University Libraries Data and GIS Services

University of British Columbia Library. Persistent Digital Collections Implementation Plan. Final project report Summary version

Strategy for long term preservation of material collected for the Netarchive by the Royal Library and the State and University Library 2014

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

Data management Backgrounds and steps to implementation; A pragmatic approach.

ISO Self-Assessment at the British Library. Caylin Smith Repository

Protecting Future Access Now Models for Preserving Locally Created Content

Digital Preservation at NARA

ebooks Preservation at Scholars Portal Kate Davis & Grant Hurley Scholars Portal, Ontario Council of University Libraries

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Preserving Digital Content at Scale

DRI: Preservation Planning Case Study Getting Started in Digital Preservation Digital Preservation Coalition November 2013 Dublin, Ireland

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

CREATING DIGITAL REPOSITORIES PRESENTED BY CHAMA MPUNDU MFULA CHIEF LIBRARIAN NATIONAL ASSEMBLY OF ZAMBIA

Indiana University Research Technology and the Research Data Alliance

Developing a Research Data Policy

Digital Preservation: How to Plan

Woodson Research Center Digital Preservation Policy

Microsoft SharePoint Server 2013 Plan, Configure & Manage

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION

Managing Records in Electronic Formats. An Introduction

GUIDELINES FOR CREATION AND PRESERVATION OF DIGITAL FILES

Cloud Computing Standard 1.1 INTRODUCTION 2.1 PURPOSE. Effective Date: July 28, 2015

7.3. In t r o d u c t i o n to m e t a d a t a

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Emory Libraries Digital Collections Steering Committee Policy Suite

Helping Journals to Upgrade Data Publications for Reusable Research

NEW YORK PUBLIC LIBRARY

OnCommand Insight 7.1 Planning Guide

Indiana University s Media Digitization and Preservation Initiative

archiving with the IBM CommonStore solution

Digital Preservation Standards Using ISO for assessment

Digital Preservation and The Digital Repository Infrastructure

Digital Preservation Policy. Principles of digital preservation at the Data Archive for the Social Sciences

ORACLE SERVICES FOR APPLICATION MIGRATIONS TO ORACLE HARDWARE INFRASTRUCTURES

PASIG Directions & Issues

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets

The Data Curation Profiles Toolkit: Interview Worksheet

User Stories : Digital Archiving of UNHCR EDRMS Content. Prepared for UNHCR Open Preservation Foundation, May 2017 Version 0.5

Digital repositories as research infrastructure: a UK perspective

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

Collection Policy. Policy Number: PP1 April 2015

Science-as-a-Service

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

The iplant Data Commons

UC Irvine LAUC-I and Library Staff Research

Fundamental Issues for Electronic Records

NHSmail Migration Communications Plan Template

SIMPLIFY, AUTOMATE & TRANSFORM YOUR BUSINESS

Safeguarding Digital Heritage through Sustained Use of Legacy Software

Digital Projects/Public Viewer Digital Preservation Storage Update National Preservation Projects

Finding the right resource for you

Taking the plunge: digital archives at HSBC

CA ERwin Data Profiler

Preserving Electronic Mailing Lists as Scholarly Resources: The H-Net Archives

State Government Digital Preservation Profiles

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Choosing a Secure Cloud Service Provider

Managing Born- Digital Documents.

28 September PI: John Chip Breier, Ph.D. Applied Ocean Physics & Engineering Woods Hole Oceanographic Institution

A global open source ecosystem for power systems

January 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby:

Microsoft Core Solutions of Microsoft SharePoint Server 2013

Establishing Enterprise-Wide Data Governance

Campus IT Modernization OPERATIONAL CONTINUITY FLEXIBLE TECHNOLOGY MODERNIZED SYSTEMS

Legal Issues in Data Management: A Practical Approach

PROCESS HISTORY METADATA PEGGY GRIESINGER NATIONAL DIGITAL STEWARDSHIP RESIDENT MUSEUM OF MODERN ART DECEMBER 4 T H, 2014

DRI: Dr Aileen O Carroll Policy Manager Digital Repository of Ireland Royal Irish Academy

Fedora, Islandora, & Samvera: Requirements and Gaps. David Wilcox,

Future Trends of ILS

Collecting Public Health Data at a Global Scale. D. Cenk Erdil, PhD Marist College

STRATEGIC PLAN

Certification as a means of providing trust: the European framework. Ingrid Dillo Data Archiving and Networked Services

Data Curation Handbook Steps

Embedding Privacy by Design

Data Archiving and Networked Services. Valentijn Gilissen, MA

Big Data, exploiter de grands volumes de données

RADAR. Establishing a generic Research Data Repository: RESEARCH DATA REPOSITORY. Dr. Angelina Kraft

HIPAA Case Study. Implementing a Security Program at a Mid-size Hospital. Lehigh Valley Hospital and Health Network. Brian Martin

PRESERVING DIGITAL OBJECTS

RADAR A Repository for Long Tail Data

Cloud Computing. An introduction using MS Office 365, Google, Amazon, & Dropbox.

Prepared Testimony of. Bohdan R. Pankiw. Chief Counsel Pennsylvania Public Utility Commission. before the

Case Study: Building Permit Files

MOBIUS + ARKIVY the enterprise solution for MIFID2 record keeping

NDSA Web Archiving Survey

SciVerse Scopus. Date: 21 Sept, Coen van der Krogt Product Sales Manager Presented by Andrea Kmety

IN THE FRAME. Computacenter Public Sector Frameworks FRAMEWORK

ISO/ IEC (ITSM) Certification Roadmap

IBM TS7700 grid solutions for business continuity

Transcription:

Building on to the Digital Preservation Foundation at Harvard Library Andrea Goethals ABCD-Library Meeting June 27, 2016

What do we already have? What do we still need? Where I ll focus

DIGITAL PRESERVATION SERVICES Organizationally within HL Preservation Services Along with Media Preservation Services, Imaging Services, Weissman Preservation Center, Collections Care 2014-2020 Digital Preservation Roadmap Services Monitor and maintain long-term usability of DRS content Represent user preservation needs Consultations, advice, guidelines DRS oversight and management

WHAT IS THE DRS? Harvard-maintained service for digital collections & content for: preservation keep the content safe keep the information usable long-term on modern platforms delivery to users A service - not storage or a tool Includes preservation & IT staff actively monitoring the content and systems Includes documented policies, practices & preservation plans Uses technology & systems but these change over time

DRS ROLES DRS Business Owner Digital Preservation Services Key Responsibilities: Preservation & usage policies, strategies Manage & communicate about service Represent users & content s preservation needs Define high-level enhancement roadmaps Preservation plans Preservation outreach, consulting, guidelines DRS Technology Owner Library Technology Services Key Responsibilities: Technology & security policies, strategies Manage hardware, software, development Bug fixes & enhancements System monitoring & scaling Refine roadmaps based on resources System testing & documentation User support & training on systems

KEY POLICIES What can be deposited into the DRS Who can deposit to the DRS Obligations of collection managers Responsibilities of DRS staff Retention policies Discovery & access policies Delivery services Preservation services DRS Policy Guide: http://hul.harvard.edu/ois/systems/drs/policyguide/drs_policy_guide-printable.pdf

HIGHLIGHTS In production since 2000 Integrated with Library infrastructure & services Content from over 50 Harvard units Supports curatorial management Rich set of metadata Accepts any format but provides incentives to deposit in recommended preservable formats Supports HRCI content Proven track record Experience with several metadata migrations, maintenance of persistent names, several storage refreshes, planning for format migrations this year Never lost a file! Spawned well-used open-source tools (JHOVE, FITS)

WHAT S IN THE DRS? Quantity: 61.3 million files, 196 TB per copy Many formats Images, audio, text, digitized books, web sites, documents, biomedical image stacks, email and soon video But primarily digitized images and text Image 63% Text 35% Container 2% Audio 0% Document 0% Other 0%

WHO OWNS THE CONTENT? DRS Content by Owner 160 140 TB 120 100 80 60 40 20 0 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 Year Art Museum Business Design Divinity Education FAS Government HU Archives Law Medical Other Radcliffe Does not include content from the Harvard-Google project

IMLS-funded DRS Audit (1 resident through 5/16) Planning for Next-Generation DRS Storage Project LARGE PARALLEL PROJECTS Migration to Next-Generation DRS Storage Project Arcadia-funded Easier DRS Deposits Project (1 term position through 11/16) Arcadia-funded Long-Term Preservation Project (2 term positions through 11/16) DRS2 Metadata Migration Project (1 term position through 3/17) Planning for Format Migrations Project Format Migration Project Ongoing Enhancements, Upgrades and Project Support (2+ developers + systems and user support + management) 6/2016 We are here 12/2016

Prepared for a plan Mostrequested eventual for easier & IMLS-funded DRS Audit certification as a more formats (1 resident through 5/16) trustworthy efficient supported repository; have deposit by the DRS Migration to Next-Generation DRS Planning for Next-Generation DRS direction Storage for workflows; for Project future more Storage preservation Project improvements content and access; Arcadia-funded Easier DRS Deposits Project (1 term preserved position through more 11/16) content preserved Arcadia-funded Long-Term Preservation Project (2 term positions through 11/16) DRS2 Metadata Migration Project (1 term position through 3/17) Refreshed storage with greater geographic distribution; less preservation risk All DRS content described in consistent, highquality, standard ways to support preservation planning Planning for Format Migrations Project Format Migration All DRS Project audio and image files in accessible, modern formats for our users; framework in place for Ongoing Enhancements, Upgrades and Project Support future format migrations (2+ developers + systems and user support + management) 6/2016 We are here 12/2016

BUILDING ONTO THE DRS Additional workflows into the DRS Additional access from the DRS Related infrastructure Support for additional formats

ADDITIONAL WORKFLOWS INTO THE DRS imports from third-party services and vendors common workflows and tools for processing and depositing born-digital collections web-based deposits simpler deposit packages DRS automated transfer from Harvard systems deposit service

ADDITIONAL ACCESS FROM THE DRS researcher platforms for analysis, annotation, collection-building aggregators and peer institutions DRS new forms of delivery for sensitive content, emulated content, packaged content online exhibits, digital library usage data, content health metrics

RELATED INFRASTRUCTURE short-term storage for sensitive records DRS collaborative storage solutions What is needed: Harvard policies, strategies and guidelines for when to use which Transfer mechanisms between them short- and mediumterm storage solutions domain-specific repositories

SUPPORT FOR ADDITIONAL FORMATS DRS software databases spreadsheets presentations social media CAD/CAM long tail of formats in collections complex art datasets computer games camera RAW images image sequences vector graphics apps and executables