Digital Preservation at NARA

Digital Preservation at NARA: Policy, Records, Technology
Leslie Johnston, Director of Digital Preservation
US National Archives and Records Administration (NARA)
ARMA, April 18, 2018

Policy

Managing Government Records Directive (2012):
- All email managed electronically by end of 2016
- All permanent electronic records managed electronically by end of 2019

Bulletins (https://www.archives.gov/records-mgmt/bulletins):
- 2015-04: Metadata Guidance for the Transfer of Permanent Electronic Records
- 2015-03: Guidance on Managing Digital Identity Authentication Records
- 2015-02: Guidance on Managing Electronic Messages
- 2014-06: Guidance on Managing Email
- 2014-04: Revised Format Guidance for the Transfer of Permanent Electronic Records
- 2014-02: Guidance on Managing Social Media Records
- 2013-03: Guidance for Agency Employees on the Management of Federal Records, Including Email Accounts and the Protection of Federal Records from Unauthorized Removal
- 2012-02: Guidance on Managing Content on Shared Drives
- 2010-05: Guidance on Managing Records in Cloud Computing Environments

Digital Preservation Strategy

NARA published its first Digital Preservation Strategy in June 2017 to guide its operations.
https://www.archives.gov/preservation/electronic-records.html

The strategy outlines the specific approaches that NARA will use in its digital preservation efforts, and specifically addresses:
- Infrastructure
- Data Integrity
- Format and Media Sustainability
- Information Security

Record Content

Domain: Records of the US Federal Government

History:
- Electronic records have been covered by our records law since 1943 ("other documentary materials, regardless of physical form or characteristics")
- NARA started collecting electronic records in 1970

Process: Records are selected for preservation either through one of several federal regulations or through the records scheduling process.

Permanent records:
- Document individual rights;
- Document actions of officials (provide government accountability); and/or
- Document the national experience.

Technology

Phase 1: Tape, 1970 - present
- Classified records still managed on tape

Phase 2: Electronic Records Archives (ERA), 2008 - present
- Approximately 550 TB
- Three separate environments aligned with record regulations, plus the National Archives Catalog
- Online support for records management transactions
- Preservation repository, PREMIS metadata catalog

Phase 3: ERA 2.0, 2018 - future
- Cloud-based scalability for processing and storage
- Flexible processing environment for running diverse tools
- Improved search and access in preservation repository

How NARA Organized its Digital Preservation Planning

Policy
- Goal: Policies and processes are in place to preserve all electronic records transferred to NARA for permanent retention and preservation, as well as digitized surrogate files.
- Example Policies: Digital Preservation policy for NARA. Policy on the preservation of digital surrogate master files.
- Example Gaps: Policy and SOPs for the assessment of file formats in the holdings and the triggers for format transformations.

Infrastructure
- Goal: NARA has adequate storage, network capacity, systems, and tools for the ingest, processing, active file management, and preservation of its digital holdings.
- Examples: Managed, replicated preservation storage. Tools for the characterization and validation of file formats.
- Example Gaps: Hardware and software to read an increasing variety of legacy storage media upon which records are transferred. Tools for file format preservation transformations.

Staffing
- Goal: NARA has sufficient staff with appropriate training to do the work of digital preservation, so that electronic records are properly and actively managed through their lifecycle.
- Examples: Archivists responsible for the ingest, processing, and description of electronic records. IT specialists who research technologies and develop or modify open source tools for the processing of electronic records.
- Example Gaps: Ongoing training in community best practices and technologies. Archivists who analyze file formats in the holdings and run system preservation operations.

Digital Preservation Assumptions
- Electronic records received should conform to the NARA Transfer Guidance for file formats.
- Master preservation files are retained.
- All files must have recorded fixities.
- All actions taken on files must be recorded and tracked.
- Public use copies of files are created.
- At this time, preservation format transformations are not performed but are planned.
- Regular audits must be performed.

Current Digital Preservation Activities
- File Format Transfer Guidance for agencies to ensure that records are transferred to NARA in sustainable formats.
- Digital Preservation Strategy and Digital Preservation advisory group to guide internal operations, including SOPs and File Format Preservation Action Plans.
- Ingest of files from agency media (drives, optical media) and network transfers of files directly from agencies, including:
  - Checking for fixities and assigning fixities if none came with the records
  - Running file format validation checks
  - Creating manifests and logs of all ingest actions
- Audits of media in the collection:
  - Annual sample of media
  - 10 year migration of media
- Ongoing monitoring of the systems and infrastructure:
  - Monitoring of system and storage status
  - Monitoring of the holdings files preserved using those systems
  - Regular emergency system backup restoration tests
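The fixity step in the ingest list above can be illustrated with a short sketch. This is not NARA's actual tooling: the function names, the choice of SHA-256, the manifest layout, and the status labels ("verified", "assigned", "mismatch") are all assumptions made for illustration. The sketch checks a supplied checksum where one came with the records, assigns one where none did, and records the outcome in a manifest.

```python
from __future__ import annotations

import hashlib
import json
import tempfile
from datetime import datetime, timezone
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute a SHA-256 fixity value, reading the file in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(transfer_dir: Path, supplied: dict[str, str]) -> dict:
    """Check or assign fixities for every file under transfer_dir.

    `supplied` is a hypothetical mapping of relative path -> SHA-256 value
    that arrived with the transfer; it may be empty.
    """
    entries = []
    for path in sorted(p for p in transfer_dir.rglob("*") if p.is_file()):
        rel = path.relative_to(transfer_dir).as_posix()
        actual = sha256_of(path)
        expected = supplied.get(rel)
        if expected is None:
            status = "assigned"   # no fixity came with the record
        elif expected.lower() == actual:
            status = "verified"   # supplied fixity matches the file
        else:
            status = "mismatch"   # file differs from what was sent
        entries.append({"path": rel, "sha256": actual, "status": status})
    return {
        "created": datetime.now(timezone.utc).isoformat(),
        "files": entries,
    }


if __name__ == "__main__":
    # Small demonstration using a temporary directory as the "transfer".
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "memo.txt").write_bytes(b"example record")
        manifest = build_manifest(root, supplied={})
        print(json.dumps(manifest, indent=2))
```

Writing the manifest out as JSON gives a simple log of the ingest action; a later audit could rerun `sha256_of` on a sample of files and compare the results against the stored values.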

Case Study: Email

Email is an obvious area of records management emphasis, but we lack a systematic approach to receive, process, and make emails available for public access.
- Growing scale of email as a record type
- Need to determine best technical approach
  - Similar but not identical business/functional needs across Presidential, Congressional, and Federal records and the systems that currently hold them
  - How to ensure the availability of messages (including attached files)
- Need to assess staff resources for reviewing emails
  - How to identify and deploy capabilities in ERA 2.0 to eventually consolidate all systems
  - How to reduce dependence on human reviewers

What Recent Preservation Technology Trends Should We be Watching?
- Web archiving is not new, but there is greater public awareness of web content as potentially valuable but often transitory, and its use in Federal records management and preservation is increasing. There are exciting new tools for indexing and playback of web archives.
- Format obsolescence is always an issue for digital preservation. There have long been tools to identify and characterize file formats, such as JHOVE and DROID, but more organizations internationally are starting to actively share the risk assessments they use to gauge the sustainability of formats and decide what preservation actions to take.
- There is new recognition that digital items have a context: they are not always files sitting in directories on desktops and servers. They live in systems and in complex web applications, and our interactions with them are based on algorithms that control how search results are ranked (or shown to us at all). In some cases results are so personalized that those algorithms are actually mediating what information comes our way.
- There is increased interest in the preservation of software. The Library of Congress has a symposium on architectural design files and software preservation. The Software Preservation Network and Software Heritage are testing the boundaries of public policy and legal frameworks to preserve code.
- The issues of working with vintage hardware and media are getting more attention: the National Archives of Australia was interviewed about its practices.
- Video preservation continues to be a critical topic. Born-digital media files only 4 years old can be at risk without stewardship. As is email.
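To make the idea of format identification concrete, here is a minimal sketch of signature-based identification, where the leading bytes of a file are compared against known "magic numbers". It is a simplification and does not reproduce how JHOVE or DROID work internally; the `SIGNATURES` table and the `identify` function are hypothetical and cover only a handful of well-known formats.

```python
# Illustrative signature table: (leading bytes, format label).
# Real tools rely on much larger, curated signature registries.
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"\xff\xd8\xff", "JPEG"),
    (b"PK\x03\x04", "ZIP container"),  # also used by DOCX, XLSX, EPUB
]


def identify(data: bytes) -> str:
    """Return a format label based on the first bytes of `data`."""
    for magic, label in SIGNATURES:
        if data.startswith(magic):
            return label
    return "unknown"


if __name__ == "__main__":
    print(identify(b"%PDF-1.7\n..."))        # PDF
    print(identify(b"plain text content"))   # unknown
```

Identification of this kind only says what a file claims to be. Validation, that is, checking that the file actually conforms to the format's specification, is a separate and more involved step.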

Thoughts and Questions?

Leslie Johnston
Director of Digital Preservation
National Archives and Records Administration
leslie.johnston@nara.gov