Towards an analytical evaluation of preservation strategies

Presentation for the ERPANET Workshop, 10-11 May 2004, Vienna
Carl Rauch and Andreas Rauber
Department for Software Technology & Interactive Systems, Vienna University of Technology

Motivation
- We have: collections with different file formats and preservation requirements
- We have: myriads of potential preservation approaches (various converters, emulators, metadata schemes, ...)
- We need: a way to decide which one to pick, rather than non-transparent gut-feeling decisions

Outline
- Introduction
- Utility Analysis
- Set objectives
- Evaluate alternatives
- Define preferences and decide
- Summary

Selecting a preservation strategy

Problem
- Several different preservation strategies exist, and no single one outperforms the others in all circumstances
- Different file collections have different requirements
- Strategies and tools change and develop steadily

Requirements
- Strategies that meet very different requirements
- Means to make strategies comparable
- Measures that are equally applicable to new preservation strategies

Solutions
- A generic framework that can be easily applied to specific environments
- A decision support system that clearly ranks the possible preservation solutions

Utility Analysis
- Developed in the 1970s
- Applied mainly to infrastructure projects, such as dams, bridges, and neighbourhoods
- Easily extensible
- Adapted here to fit the preservation requirements

Utility Analysis procedure
1. Define project objectives
2. Assign effects to the objectives
3. Define alternatives
4. Measure the alternatives' performance
5. Transform measured values
6. Weight the objectives
7. Aggregate partial and total values
8. Rank the alternatives
Source: Hanusch et al.

Define project objectives
Collection preservation
- File characteristics
  - Appearance, e.g. characters, sound, video, ...
  - Structure, e.g. caption, tag description
  - Behaviour, e.g. search, links, user inputs
- Process characteristics
  - Originality, e.g. traceability of changes
  - Stability, e.g. supplier independence
  - Scalability, e.g. data increase
  - Usability, e.g. complexity, functionality
- Costs
  - Technical, e.g. hardware, software, per file
  - Personnel, e.g. maintenance

Implemented objective tree
File characteristics
- Appearance
  - Characters: size, special characters
  - Paragraph: separation
  - Picture: inclusion
- Structure
  - Footnotes
  - Page: page numbering, page borders, page break
- Behaviour
  - Word functionality

Assign effects to objectives
Each objective i is assigned one of two kinds of effect:
- Measurable effects: expressed in units, for example mm, EUR per year, or seconds per file ingest
- Subjective evaluation: scored by subjective impression, necessary where no measurable criterion can be found, for example paragraph formatting or numbering of chapters. An extreme form is a simple yes/no decision.

Definition of alternatives
- Migration & Standardisation
  - Migrate documents to Adobe PDF
  - Migrate documents to OpenOffice.org
  - Migrate documents to PostScript
  - Migrate documents to a newer version of MS Word
- Emulation & Encapsulation: encapsulate digital objects
- Computer Museum: try to preserve the hardware environment
- Digital Tablet: try to construct a digital tablet
- No change to the strategy: do not adapt the strategy
- No preservation effort: do not take care of preservation

Alternatives evaluation
Measure the alternatives' performance, using either:
- Original files
- Files from a testbed

| Criterion | Newer MS Word version | OpenOffice.org Writer | PDF 5.0 | No changes at all |
| --- | --- | --- | --- | --- |
| Page borders | 0 mm | +3 mm | 0 mm | 0 mm |
| Ingest: sec. per file | 10 sec | 10 sec | 15 sec | 0 sec |
| Software costs per year | 50 | 0 | 0 | 0 |
| Numbering of chapters | 3 | N.A. | 5 | 5 |
| Paragraph formatting | 4 | 2 | 5 | 5 |

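Transform measured values
Define the transformation table (5 is the best score, N.A. means not acceptable):

| Criterion | 5 | 4 | 3 | 2 | 1 | N.A. |
| --- | --- | --- | --- | --- | --- | --- |
| Page borders | ±0 mm | ±1 mm | ±2 mm | ±3 mm | ±4 mm | > 4 mm |
| Ingest: sec. per file | 0-5 sec | 5-10 sec | 10-15 sec | 15-25 sec | 25-40 sec | > 40 sec |
| Software costs per year | 0 | 1-30 | 31-50 | 51-70 | 71-100 | > 100 |
| Numbering of chapters | 5 | 4 | 3 | 2 | 1 | N.A. |
| Paragraph formatting | 5 | 4 | 3 | 2 | 1 | N.A. |

The two subjective criteria are already scored on the 1 to 5 scale and carry over unchanged.

Transform the results to make them comparable:

| Criterion | Newer MS Word version | OpenOffice.org Writer | PDF 5.0 | No changes at all |
| --- | --- | --- | --- | --- |
| Page borders | 5 | 2 | 5 | 5 |
| Ingest: sec. per file | 4 | 4 | 3 | 5 |
| Software costs per year | 3 | 5 | 5 | 5 |
| Numbering of chapters | 3 | N.A. | 5 | 5 |
| Paragraph formatting | 4 | 2 | 5 | 5 |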
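The transformation of the three measurable criteria can be sketched in Python as a threshold lookup. This is only an illustration, not the authors' implementation: the dictionary keys and the function name `transform` are made up here, and treating each interval's upper bound as inclusive is an assumption inferred from the example results (10 sec maps to 4, 15 sec maps to 3).

```python
from bisect import bisect_left

# Inclusive upper bounds for the scores 5, 4, 3, 2, 1, read from the slide's
# transformation table. Anything above the last bound is N.A. (None).
# The key names are hypothetical labels chosen for this sketch.
THRESHOLDS = {
    "page_borders_mm": [0, 1, 2, 3, 4],          # absolute deviation in mm
    "ingest_sec_per_file": [5, 10, 15, 25, 40],
    "software_costs_per_year": [0, 30, 50, 70, 100],
}

def transform(criterion, measured):
    """Map a measured value to a score from 5 (best) to 1, or None for N.A."""
    bounds = THRESHOLDS[criterion]
    # Index of the first bound that is >= the measured value.
    index = bisect_left(bounds, abs(measured))
    return 5 - index if index < len(bounds) else None

# Measured values from the evaluation slide.
MEASURED = {
    "Newer MS Word version": {"page_borders_mm": 0, "ingest_sec_per_file": 10, "software_costs_per_year": 50},
    "OpenOffice.org Writer": {"page_borders_mm": 3, "ingest_sec_per_file": 10, "software_costs_per_year": 0},
    "PDF 5.0": {"page_borders_mm": 0, "ingest_sec_per_file": 15, "software_costs_per_year": 0},
    "No changes at all": {"page_borders_mm": 0, "ingest_sec_per_file": 0, "software_costs_per_year": 0},
}

transformed = {
    alternative: {c: transform(c, v) for c, v in values.items()}
    for alternative, values in MEASURED.items()
}
```

With these bounds the sketch yields the same scores as the slide for the page borders, ingest time, and software costs rows.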

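Weighting
Weights in the objective tree:
- File characteristics: 0.6
  - Appearance: 0.4
  - Structure: 0.3
  - Behaviour: 0.3
- Process characteristics: 0.3
- Costs: 0.1

The weights on each level sum to 1, for example $\sum_j w_{1,j} = 1$ on the first level.

Final weight of each leaf (product of the weights along its path):
- Appearance: 0.6 × 0.4 = 0.24
- Structure: 0.6 × 0.3 = 0.18
- Behaviour: 0.6 × 0.3 = 0.18

$\sum (\text{leaf weights}) = 1$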
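As a rough sketch of how the final leaf weights follow from the tree, the Python below multiplies the weights along each path and checks that siblings sum to 1. The nested-dictionary layout and the function name `leaf_weights` are assumptions made for illustration. Reading the top-level weights as 0.3 for process characteristics and 0.1 for costs is also an assumption, and both are kept as leaves because the slide gives no sub-weights for them.

```python
import math

# Each node maps to (weight, children); a leaf has an empty children dict.
TREE = {
    "File characteristics": (0.6, {
        "Appearance": (0.4, {}),
        "Structure": (0.3, {}),
        "Behaviour": (0.3, {}),
    }),
    "Process characteristics": (0.3, {}),  # assumed reading of the slide
    "Costs": (0.1, {}),                    # assumed reading of the slide
}

def leaf_weights(nodes, parent_weight=1.0, prefix=""):
    """Return {leaf path: final weight}, the product of weights on the path."""
    sibling_sum = sum(weight for weight, _ in nodes.values())
    if not math.isclose(sibling_sum, 1.0):
        raise ValueError(f"weights below '{prefix or 'root'}' sum to {sibling_sum}, not 1")
    result = {}
    for name, (weight, children) in nodes.items():
        path = f"{prefix}/{name}" if prefix else name
        final = parent_weight * weight
        if children:
            result.update(leaf_weights(children, final, path))
        else:
            result[path] = final
    return result

weights = leaf_weights(TREE)
for path, weight in weights.items():
    print(f"{path}: {weight:.2f}")
```

Because the sibling weights sum to 1 on every level, the final leaf weights also sum to 1, which is the property stated on the slide.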

Aggregating part values
- Part value per objective: leaf weight × transformed value
- Total value per alternative: sum of all part values of a strategy
- Totals are also computed for alternatives that are not acceptable
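A minimal Python sketch of this aggregation follows. The transformed values are the ones from the example table, but the leaf weights are hypothetical, since the slides give no weights for these five criteria. Counting an N.A. value as a part value of 0 while flagging the alternative as not acceptable is likewise an assumption of this sketch.

```python
# Hypothetical leaf weights for the five example criteria (they sum to 1).
LEAF_WEIGHTS = {
    "Page borders": 0.2,
    "Ingest: sec. per file": 0.1,
    "Software costs per year": 0.2,
    "Numbering of chapters": 0.2,
    "Paragraph formatting": 0.3,
}

CRITERIA = list(LEAF_WEIGHTS)

# Transformed values from the example table; None stands for N.A.
TRANSFORMED = {
    "Newer MS Word version": dict(zip(CRITERIA, [5, 4, 3, 3, 4])),
    "OpenOffice.org Writer": dict(zip(CRITERIA, [2, 4, 5, None, 2])),
    "PDF 5.0": dict(zip(CRITERIA, [5, 3, 5, 5, 5])),
    "No changes at all": dict(zip(CRITERIA, [5, 5, 5, 5, 5])),
}

def aggregate(values, weights):
    """Return (total value, acceptable) for one alternative."""
    # Part value per objective = leaf weight x transformed value.
    # Assumption: an N.A. value contributes 0 to the total.
    part_values = {
        criterion: weights[criterion] * (value if value is not None else 0)
        for criterion, value in values.items()
    }
    acceptable = all(value is not None for value in values.values())
    return sum(part_values.values()), acceptable

totals = {alt: aggregate(values, LEAF_WEIGHTS) for alt, values in TRANSFORMED.items()}
for alt, (total, acceptable) in totals.items():
    print(f"{alt}: {total:.2f}" + ("" if acceptable else " (not acceptable)"))
```

Keeping the acceptability flag next to the total means an alternative with an N.A. value can still be reported with its total, as the slide asks.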

Final Ranking
- The alternatives are ranked according to their total values; alternatives that are not acceptable are ranked last
- A final sensitivity analysis considers non-measurable influences on the decision, such as expertise in a specific alternative or a good relationship with a supplier
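The ranking rule can be sketched in a few lines of Python. The alternatives, totals, and names below (`Result`, `rank`) are hypothetical and were chosen so that an unacceptable alternative has the highest total, which shows that it is still ranked last.

```python
from typing import NamedTuple

class Result(NamedTuple):
    name: str
    total: float       # total value of the alternative
    acceptable: bool   # False if any objective was rated N.A.

def rank(results):
    """Sort by total value (descending), with unacceptable alternatives last."""
    return sorted(results, key=lambda r: (not r.acceptable, -r.total))

# Hypothetical totals, for illustration only.
results = [
    Result("Alternative A", 3.8, True),
    Result("Alternative B", 4.9, False),
    Result("Alternative C", 4.1, True),
]

ranking = [r.name for r in rank(results)]
print(ranking)
```

The non-measurable influences named in the sensitivity analysis are not modelled here; they remain a judgement made on top of the computed ranking.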

Summary
- The composition of an objective tree depends strongly on the collection's requirements
- Different solutions vary mainly in the objective tree composition and the objectives' weights
- A few standard objective trees may evolve for specific scenarios

We now have:
- A powerful tool for making accountable preservation decisions
- A transparent decision process

Next steps
- Build and evaluate various objective trees for different preservation settings
- Specifically, create an exhaustive listing of file format characteristics
- Develop a user interface for defining objectives
- Build a decision support system