Digital Preservation: Preservation Strategies

Digital Preservation: Preservation Strategies
Dr. Jagdish Arora, Director, INFLIBNET Centre (jarora@inflibnet.ac.in)

Digital Preservation Strategies: Short-term Strategies
- Bit-stream Copying
- Refreshing
- Replication
- Technology Preservation or Computer Museum
- Backwards Compatibility and Version Migration

Digital Preservation Strategies: Medium- to Long-term Strategies
- Migration
- Viewers and Migration at the Point of Access
- Canonicalization
- Emulation
- Investment Strategies
- Restricting Range of Formats and Standards
- Reliance on Standards
- Data Abstraction and Structuring
- Encapsulation
- Software Re-engineering
- Universal Virtual Computer

Digital Preservation Strategies: Alternative Strategies
- Analogue Backups
- Digital Archaeology or Data Recovery
- Combinations

Digital Preservation Strategies: Short-term Strategies

Short-term Strategies: Bit-stream Copying
- Commonly known as "backing up data": making an exact duplicate of a digital object.
- Storing the copies at a remote site guards against disastrous events.
- A minimum maintenance strategy, appropriate even for the most lightly valued, ephemeral data.
- Not a long-term preservation strategy.
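As a rough illustration of bit-stream copying, the sketch below duplicates a file and confirms that the copy is bit-identical by comparing checksums. The function names `sha256_of` and `bitstream_copy` are made up for this example, and SHA-256 is an assumed choice of checksum, not something the slides prescribe.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def bitstream_copy(source: Path, target: Path) -> str:
    """Copy source to target and verify the copy is bit-identical.

    Hypothetical helper for illustration; returns the shared digest.
    """
    expected = sha256_of(source)
    shutil.copyfile(source, target)
    if sha256_of(target) != expected:
        raise IOError(f"copy of {source} is not bit-identical")
    return expected
```

The returned digest can be stored alongside the copy so that later audits can detect silent corruption.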

Refreshing
- Copying digital information from one long-term storage medium to another with no change in the bit-stream.
- Addresses both the decay and the obsolescence of storage media.
- Does not address the obsolescence of encoding and formatting schemes.
- Longevity of the media does not guarantee that the hardware and software required to read the stored format will remain available.
- Problems of backward compatibility and interoperability are a serious threat to the longevity of digital information.

Replication
- Replication is a term that covers several digital preservation strategies.
- Bit-stream copying is a form of replication.
- LOCKSS (Lots of Copies Keeps Stuff Safe) is a consortial form of replication, while peer-to-peer data trading is an open, free-market form of replication.
- The objective is to enhance the longevity of digital documents while maintaining their authenticity and integrity through copying and the use of multiple storage locations.
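One way replication can protect integrity is to compare the copies held at different locations and repair any copy that disagrees with the majority. The sketch below is a simplified illustration only: it is not the actual LOCKSS protocol, and the site names and the functions `audit_replicas` and `repair` are assumptions made for this example.

```python
import hashlib
from collections import Counter


def audit_replicas(replicas: dict[str, bytes]) -> tuple[str, list[str]]:
    """Return the majority digest and the sites whose copy disagrees with it."""
    digests = {site: hashlib.sha256(data).hexdigest()
               for site, data in replicas.items()}
    majority_digest, votes = Counter(digests.values()).most_common(1)[0]
    if votes <= len(replicas) / 2:
        # Without a strict majority we cannot tell which copy is authentic.
        raise ValueError("no majority among replicas; manual review needed")
    damaged = sorted(site for site, d in digests.items() if d != majority_digest)
    return majority_digest, damaged


def repair(replicas: dict[str, bytes]) -> dict[str, bytes]:
    """Replace each disagreeing copy with a copy that matches the majority."""
    majority_digest, damaged = audit_replicas(replicas)
    good = next(data for data in replicas.values()
                if hashlib.sha256(data).hexdigest() == majority_digest)
    return {site: (good if site in damaged else data)
            for site, data in replicas.items()}
```

With three copies, a single corrupted site is outvoted and overwritten; with only two disagreeing copies the audit refuses to guess.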

Technology Preservation
- Also called the "computer museum" solution.
- Relies on preserving the computer, operating systems, original application software, media drives, etc.
- Applicable to neglected digital objects.
- Assumes that the media have not decayed beyond readability.
- Limitation: no obsolete technology can be kept functional indefinitely.
- Requires a considerable investment in equipment and personnel.

Backward Compatibility & Version Migration
- Current versions of software can interpret and present digital material created with previous versions of the same software.
- The version migration option converts documents into the current format.
- Applications such as MS Word, Excel and Access allow files in previous versions of their formats to be transformed and resaved in a new version.
- This option may not be available for all types of objects.
- Migration is likely to introduce unwanted changes to a document incrementally if it is repeated over many generations.

Digital Preservation Strategies: Medium- to Long-term Strategies

Migration
- Periodic transfer of digital materials from one hardware / software configuration to another, or from one generation of computer technology to a subsequent generation.
- Migration may include conversion of data to avoid obsolescence not only of the physical storage medium, but also of the encoding and format of the data.
- Digital objects will have to be constantly migrated and converted to new formats, computing devices, storage media and software to ensure they are not left behind on obsolete systems.
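A very small instance of migration that changes the encoding of the data, not just its storage medium, is converting text from a legacy character encoding to UTF-8. The sketch below assumes that case; the function `migrate_text` and the fields of its event record are hypothetical, chosen only to show that a migration should document what was changed.

```python
import hashlib
from datetime import datetime, timezone


def migrate_text(data: bytes, source_encoding: str,
                 target_encoding: str = "utf-8") -> tuple[bytes, dict]:
    """Re-encode text and return the migrated bytes plus a migration record.

    Hypothetical helper: decode() or encode() raises if the data cannot be
    represented, so a lossy conversion fails loudly instead of silently.
    """
    text = data.decode(source_encoding)
    migrated = text.encode(target_encoding)
    event = {
        "source_encoding": source_encoding,
        "target_encoding": target_encoding,
        "source_sha256": hashlib.sha256(data).hexdigest(),
        "target_sha256": hashlib.sha256(migrated).hexdigest(),
        "migrated_at": datetime.now(timezone.utc).isoformat(),
    }
    return migrated, event
```

Keeping the digests of both versions in the record lets a later reader confirm which original a migrated file was derived from.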

Viewers and Migration at the Point of Access
- An alternative to recurring migration.
- Involves the use of viewers: software tools that provide accessibility at the time of access, using the original data stream.
- Example: the "migration on request" approach, developed in the CEDARS and CAMiLEON projects, which uses a software tool to record the method of access.
- Example: Google's Google Docs.
- Limitations:
  i) viewers may not be available for all formats;
  ii) viewers may represent some, but not all, aspects of the original;
  iii) the gap between the original format and the prevailing technologies at the time of access may be too wide to bridge.
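The idea of converting only at the time of access, while the stored bit-stream stays untouched, can be sketched as a registry of viewers keyed by format. Everything named here (the `VIEWERS` registry, the format labels, `render_on_request`) is hypothetical and is not taken from the CEDARS or CAMiLEON tools.

```python
from typing import Callable

# Hypothetical registry mapping a format label to a viewer function.
VIEWERS: dict[str, Callable[[bytes], str]] = {}


def viewer(format_name: str):
    """Decorator that registers a viewer for one format."""
    def register(func: Callable[[bytes], str]) -> Callable[[bytes], str]:
        VIEWERS[format_name] = func
        return func
    return register


@viewer("latin1-text")
def view_latin1(data: bytes) -> str:
    return data.decode("latin-1")


@viewer("csv-semicolon")
def view_semicolon_csv(data: bytes) -> str:
    rows = [line.split(";") for line in data.decode("utf-8").splitlines()]
    return "\n".join(" | ".join(row) for row in rows)


def render_on_request(data: bytes, format_name: str) -> str:
    """Render the original bytes at access time; the stored data is not modified."""
    try:
        convert = VIEWERS[format_name]
    except KeyError:
        # Mirrors the first limitation: no viewer exists for this format.
        raise LookupError(f"no viewer available for format {format_name!r}") from None
    return convert(data)
```

Only the viewers need to be updated as technology changes; the archived originals are never rewritten.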

Canonicalization
- A technique designed to determine whether the essential characteristics of a document have been maintained through conversion from one format to another.
- Canonicalization relies on the creation of a representation of a digital object that conveys all its key aspects.
- Once created, this form can be used to algorithmically verify that a converted file has not lost any of its essence.
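For plain text, a canonical form might normalize Unicode composition, line endings and whitespace, so that two files that differ only in those incidental details produce the same digest. The sketch below assumes that these are the "key aspects" worth keeping, which is a choice made for this example; real canonical forms are format-specific.

```python
import hashlib
import unicodedata


def canonical_text(text: str) -> str:
    """Reduce text to an assumed canonical form.

    Applies NFC normalization, converts line endings to LF, collapses runs
    of whitespace within each line, and drops blank lines.
    """
    normalized = unicodedata.normalize("NFC", text)
    normalized = normalized.replace("\r\n", "\n").replace("\r", "\n")
    lines = (" ".join(line.split()) for line in normalized.split("\n"))
    return "\n".join(line for line in lines if line)


def canonical_digest(text: str) -> str:
    """SHA-256 digest of the canonical form."""
    return hashlib.sha256(canonical_text(text).encode("utf-8")).hexdigest()


def same_essence(original: str, converted: str) -> bool:
    """True if both texts reduce to the same canonical form."""
    return canonical_digest(original) == canonical_digest(converted)
```

A converted file that fails this comparison has lost or altered content that the canonical form was defined to protect.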

Emulation
- Emulation uses a special type of software, called an emulator, to translate instructions from the original software so that they execute on new platforms.
- Eliminates the need to keep old hardware working.
- Requires the creation of emulator programs that translate code and instructions from one computing environment so that they can be properly executed in another.
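To make the translation step concrete, the sketch below emulates an invented stack machine: the "old" program is kept as it is, and the emulator interprets each of its instructions on the current platform. The instruction set (`PUSH`, `ADD`, `MUL`, `PRINT`, `HALT`) is hypothetical and far simpler than any real emulated hardware.

```python
def emulate(program: list[tuple]) -> list[int]:
    """Run a program for a hypothetical stack machine and return its output."""
    stack: list[int] = []
    output: list[int] = []
    pc = 0  # program counter of the emulated machine
    while pc < len(program):
        op, *args = program[pc]
        pc += 1
        if op == "PUSH":
            stack.append(args[0])
        elif op == "ADD":
            b, a = stack.pop(), stack.pop()
            stack.append(a + b)
        elif op == "MUL":
            b, a = stack.pop(), stack.pop()
            stack.append(a * b)
        elif op == "PRINT":
            output.append(stack.pop())
        elif op == "HALT":
            break
        else:
            raise ValueError(f"unknown instruction: {op}")
    return output


# The preserved program is never modified; only the emulator is ported.
legacy_program = [("PUSH", 6), ("PUSH", 7), ("MUL",), ("PRINT",), ("HALT",)]
```

Calling `emulate(legacy_program)` returns `[42]`. Moving to a new platform means rewriting `emulate`, not the preserved programs.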

Software Re-engineering
- Software re-engineering offers a number of strategies for transforming software as technologies change:
  - adjusting and re-compiling the source code for a new platform;
  - re-coding the software from scratch, or re-coding it in another programming language;
  - translating compiled binary instructions for one platform directly into binary instructions for another platform.
- Requires source code.
- Porting to other platforms requires considerable time and effort.

Universal Virtual Computer
- A form of emulation.
- Requires the development of a computer program, independent of existing hardware or software, that could simulate the basic architecture of every computer.
- Users could create and save digital files using software of their choice, but all files would be backed up in a form that the universal computer can read.
- Reading a file in the future would require only a single emulation layer between the universal virtual computer and the computer of that time.

Digital Preservation Strategies: Investment Strategies

Restricting Range of Formats and Standards
- Preservation programmes may decide to store data only in a limited range of formats and standards.
- All digital objects of a particular type within an archival repository can be converted into a single chosen file format that is thought to embody the best overall compromise amongst characteristics such as functionality, longevity, and preservability.
- For example, most textual and graphical information can be converted into PDF format.
- The strategy does not solve the access problem caused by the obsolescence of formats and standards.

Reliance on Standards
- Advocates the use of well-recognized standards and the discarding of proprietary or less-supported standards.
- Backward compatibility for an older format is likely to be maintained if the format is widely used as a standard.
- For example, if JPEG2000 becomes a widely adopted standard, the sheer volume of users will guarantee that software to encode, decode, and render JPEG2000 images will be upgraded to meet the demands of new operating systems, CPUs, etc.

Data Abstraction and Structuring
- Data abstraction involves analyzing and tagging data so that the functions, relationships and structure of specific elements can be described.
- Using data abstraction, the representation of content can be liberated from specific software applications and achieved using different applications as technology changes.
- The technique requires extensive development of tools and methods for analysis and processing in order to correctly represent and tag each type of data.

Encapsulation
- A technique of grouping together a digital object and the metadata necessary to provide access to that object.
- The grouping lessens the possibility that any critical component necessary to decode and render a digital object will be lost.
- Appropriate types of metadata to encapsulate with a digital object include reference, representation, provenance, fixity and context information.
- Encapsulation is considered a key element of emulation.
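A minimal sketch of encapsulation is a single container holding both the object and its metadata. The example below uses a ZIP archive with a `metadata.json` file; the layout, the file names and the functions `encapsulate` and `open_package` are assumptions for illustration and do not follow any particular packaging standard. The five metadata categories are the ones listed above.

```python
import hashlib
import io
import json
import zipfile

# Categories the caller must supply; fixity is computed from the payload.
REQUIRED = {"reference", "representation", "provenance", "context"}


def encapsulate(name: str, payload: bytes, metadata: dict) -> bytes:
    """Bundle a digital object with its metadata into one ZIP package."""
    missing = REQUIRED - metadata.keys()
    if missing:
        raise ValueError(f"missing metadata: {sorted(missing)}")
    record = dict(metadata)
    record["fixity"] = {"algorithm": "sha256",
                        "digest": hashlib.sha256(payload).hexdigest()}
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as package:
        package.writestr(f"data/{name}", payload)
        package.writestr("metadata.json",
                         json.dumps(record, indent=2, sort_keys=True))
    return buffer.getvalue()


def open_package(package_bytes: bytes, name: str) -> tuple[bytes, dict]:
    """Extract the object and its metadata, checking fixity on the way out."""
    with zipfile.ZipFile(io.BytesIO(package_bytes)) as package:
        payload = package.read(f"data/{name}")
        record = json.loads(package.read("metadata.json"))
    if hashlib.sha256(payload).hexdigest() != record["fixity"]["digest"]:
        raise ValueError("fixity check failed")
    return payload, record
```

Because the representation information travels with the object, a future reader of the package knows what is needed to decode it.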

Digital Preservation Strategies: Alternative Strategies

Digital Archaeology
- Rescues content from damaged media or from damaged hardware and software.
- An emergency recovery strategy that involves specialized techniques to recover data from media that have become unreadable through physical damage or hardware failure.
- Usually carried out by data recovery companies.
- Given enough resources, readable bit-streams can often be recovered even from heavily damaged media (especially magnetic media).

Analogue Backups
- Involves converting digital objects into analogue form, e.g., by taking high-quality printouts or creating microfilm.
- An analogue copy of a digital object can, in some respects, preserve its content and protect it from obsolescence.
- The technique makes sense for documents whose contents merit the highest level of redundancy and protection from loss.

Conclusion
- No single digital preservation strategy can serve as a practical solution to the problem of technological obsolescence for digital materials.
- No single solution is appropriate for all data types, situations, or institutions.
- It is, therefore, reasonable for preservation programmes to look for multiple strategies, especially if they are responsible for a range of materials over extended periods.

Conclusion
- The long-term strategies being practiced for digital preservation include:
  - use of standards for data encoding, structuring and description;
  - emulation of obsolete software or hardware;
  - migration of data from one operating technology to another.
- None of them has proven itself against unknown threats over centuries of change.
- Most of these strategies are, however, being used in the management of data, and it is likely that combinations of them will continue to be researched and proposed for large-scale, long-term preservation.
