SharePoint Archival Storage Strategies & Technologies January 2009 Porter-Roth Associates 1

Bud Porter-Roth
Porter-Roth Associates
415-381-6217
budpr@erms.com
http://www.erms.com

Agenda
- Introduction
- The Preservation Problem
- SharePoint
- Recommendations

Introduction
- Basic Need
- Disaster Recovery
- Compliance

Define Archival Storage
- Archival storage is a requirement to store a physical or digital document for longer than 15 years
- Archivists consider 100 years to be the baseline
- How will you store a document or library in SharePoint for 100 years? (Will SharePoint still be around in 100 years? Will the 2109 version be able to read the 2007 version?)

Drivers for Archival Storage
- Compliance
  - Length of patient's life + 2 years
  - SEC 17a: length of account + 6 years
- Legal
  - Legal hold documents: 1 year, end of case + ??
  - Some case documents stored permanently
- Business Operations
  - Company records: life of company + ??
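Retention rules of the form "trigger event + N years" reduce to simple date arithmetic. The following is a minimal sketch, assuming the trigger date is known; the function name `retention_end` and the example date are hypothetical, and real retention schedules should come from your records management specialist.

```python
from datetime import date


def retention_end(trigger: date, years: int) -> date:
    """Return the date `years` after `trigger`.

    A trigger of Feb 29 falls back to Feb 28 when the target year
    is not a leap year.
    """
    try:
        return trigger.replace(year=trigger.year + years)
    except ValueError:
        # Only raised for Feb 29 landing in a non-leap year.
        return trigger.replace(year=trigger.year + years, day=28)


# Hypothetical example: an account closed on 2009-01-15 under a "+ 6 years" rule.
account_closed = date(2009, 1, 15)
print(retention_end(account_closed, 6))  # 2015-01-15
```

Open-ended drivers such as "life of company + ??" have no trigger date yet, so they cannot be computed this way until the event occurs.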

The Technical Preservation Problem
The problem is actually two separate, largely unrelated issues:
- Hardware and software to store and read documents become obsolete
  - Hardware, OS, applications, drives (tape, disk, etc.)
- The format that the documents are in becomes obsolete, or changes such that early versions are not readable
  - WordStar, VisiCalc, Word, PDF, XML, PDF/A, XPS

The Management Preservation Problem
- Management may not be committed to long-term archival strategies as a necessary business procedure
- Management may be unwilling to fund a long-term archival project ("I'll be gone in 4 years")
- The problem is quite complex and requires a commitment to understanding the business needs over a long period of time ("I'll be gone in 4 years")
- SharePoint, Lotus, eRoom, etc. encourage unregulated document sites and sprawl, which makes it even harder to grasp the overall problem and potential costs

The Preservation Problem
The problem in brief:
- Software formats change and become unsupported
- Software formats fall out of favor over time and disappear
- Hardware drives change and become unsupported
- Storage media change over time and become obsolete
  - Floppy disks (5¼", 3½")
  - Optical disks (WORM, CD, DVD)
  - Tape (many flavors)
  - Portable storage media like the Memory Stick in use today
For digital documents, all of the above means there is a strong chance that you will be forced to convert or migrate from one format or medium to another, and that for the foreseeable future this will be a continuing process ($$$).

The Preservation Problem - Format
- TIFF (Tagged Image File Format; owned by Adobe, no plans for enhancement), usually with ITU Group 4 compression
- JPEG (Joint Photographic Experts Group)
- GIF (Graphics Interchange Format)
- PNG (Portable Network Graphics)
- Native file formats (Word, Excel, WP, etc.), also known as "born digital" documents
- PDF, PDF/A, PDF/X, Microsoft XPS
- Many other proprietary electronic formats (especially proprietary compression algorithms)
- Paper
- Film (various types = various readers)
- Audio (various types)
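Before deciding what to migrate, it helps to know which formats a library actually holds, and file extensions are not always reliable. The sketch below identifies several of the formats listed above by their leading "magic" bytes. The function names and the signature table are illustrative, the table is far from complete, and a ZIP or OLE2 signature only identifies the container, not the specific application format inside it.

```python
# Leading byte signatures for some common document formats.
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"II*\x00", "TIFF (little-endian)"),
    (b"MM\x00*", "TIFF (big-endian)"),
    (b"\xff\xd8\xff", "JPEG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"\xffWPC", "WordPerfect"),
    (b"PK\x03\x04", "ZIP container (e.g. XPS or Office Open XML)"),
    (b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1", "OLE2 compound file (e.g. legacy Word/Excel)"),
]


def sniff_format(header: bytes) -> str:
    """Return a format name based on the first bytes of a file."""
    for magic, name in SIGNATURES:
        if header.startswith(magic):
            return name
    return "unknown"


def sniff_file(path: str) -> str:
    """Read the first 16 bytes of `path` and identify the format."""
    with open(path, "rb") as fh:
        return sniff_format(fh.read(16))
```

Running `sniff_file` over an exported library gives a rough inventory of formats; anything reported as "unknown" is a candidate for closer inspection.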

The Preservation Problem
What is the best option for preserving electronic documents over archival time spans? (Disregarding the hardware storage issues)
- TIFF?
  - A digital picture of your page
  - Widely adopted standard for document imaging
  - Not human readable without the software
  - No programmatic access to underlying text without OCR
- XML?
  - A format description of the page (a style sheet)
  - Good for describing logical structure, but not appearance
  - Many incompatible domain-specific schemas
- Native format (e.g., MS Word)?
  - Several ubiquitous, but closed, proprietary formats
  - Can you spell WordPerfect?
- PDF? PDF/A? Microsoft XPS?

Desirable Properties of a Format
- Device independence: can be reliably and consistently rendered without regard to the hardware/software platform
- Self-contained: contains all resources necessary for rendering
- Self-documenting: contains its own description
- Transparency: amenable to direct analysis with basic tools

Adobe PDF and PDF/A
- PDF is a ubiquitous open format for electronic documents
  - Proprietary, but with a publicly available specification
  - Companies other than Adobe make PDF products
  - Has gained widespread use throughout the world
- Many statutory, regulatory, and institutional policies mandate the retention of PDF-based documents over multiple generations of technology
- The feature-rich nature of PDF can complicate preservation efforts (hence PDF/A)

PDF/A
PDF/A is intended to address three primary issues:
- Define a file format that preserves the static visual appearance of electronic documents over time
- Provide a framework for recording metadata about electronic documents
- Provide a framework for defining the logical structure and semantic properties of electronic documents
PDF/A is an ISO standard: ISO 19005-1:2005

PDF/A
PDF/A constraints include:
- Audio and video content are forbidden
- JavaScript and executable file launches are prohibited
- All fonts must be embedded and must also be legally embeddable for unlimited, universal rendering
- Color spaces must be specified in a device-independent manner
- Encryption is disallowed
- Use of standards-based metadata is mandated
- And others
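Some of the constraints above leave visible traces in a PDF's raw bytes, which allows a rough pre-screen before a file is sent to a real validator. The sketch below is a heuristic only, not a PDF/A validator: it looks for a few PDF name tokens associated with forbidden features and for the `pdfaid:part` identifier that PDF/A files carry in their XMP metadata. The function name `prescreen` and the marker table are assumptions for illustration; markers inside compressed streams are missed, false positives are possible, and font embedding and color spaces are not checked at all.

```python
import re

# PDF name tokens that suggest features PDF/A forbids (heuristic, incomplete).
FORBIDDEN_MARKERS = {
    b"/Encrypt": "encryption dictionary",
    b"/JavaScript": "JavaScript action",
    b"/Launch": "launch action",
    b"/Movie": "embedded movie",
    b"/Sound": "embedded sound",
}

# Matches the XMP PDF/A identifier in element form (<pdfaid:part>1</pdfaid:part>)
# or attribute form (pdfaid:part="1").
PDFA_CLAIM = re.compile(rb"pdfaid:part\s*(?:=\s*[\"'](\d)[\"']|>\s*(\d)\s*<)")


def prescreen(data: bytes) -> dict:
    """Report the claimed PDF/A part (if any) and any suspect feature markers."""
    findings = [label for marker, label in FORBIDDEN_MARKERS.items() if marker in data]
    match = PDFA_CLAIM.search(data)
    claimed_part = int(match.group(1) or match.group(2)) if match else None
    return {"claimed_pdfa_part": claimed_part, "suspect_features": findings}
```

A file that claims PDF/A conformance yet shows suspect features, or one that makes no claim at all, is worth passing to a dedicated validation tool rather than trusting this check.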

PDF/A
- Nevertheless, PDF/A may not be the last preservation format you will use or need
- However, proper application of PDF/A should result in reliable, predictable, and unambiguous access to the full information content of electronic documents

Microsoft XPS
- XPS is an abbreviation for the XML Paper Specification
- The XML Paper Specification describes electronic paper in a way that can be read by hardware, read by software, and read by people
- The XML Paper Specification itself is platform independent, openly published, and available royalty-free
- Microsoft has integrated XPS-based technologies into the Microsoft Windows Vista operating system and the 2007 Microsoft Office system
- http://www.microsoft.com/whdc/xps/default.mspx

Recap
- Hardware: disk drives
- Software: programs
- Format that the documents are in: the extension (.wp, .doc, ...)

But wait, we have many
How many systems do you have that produce and store potential archival data?
- SharePoint
- SAP
- Documentum, FileNet, OpenText, etc.
- Lotus
- eRoom

SharePoint Archiving
- Documents are normally stored on magnetic disk (various configurations)
- Backups are not typically document-specific but back up an entire farm or site collection
- Restoring a library means restoring a complete site collection to get to the library
- Backups are typically bound to a site collection and the libraries within that site collection
- Backup tapes may be recycled every xx days
- Can you depend on this combination for long-term archival storage and compliance?

SharePoint Archiving
- Archival documents need to be locked such that they cannot be changed or deleted
- Archival documents may need to be separated from the original library to an archival library
- Magnetic disk/backup tape (regular cycle) may not be adequate
- May need a special backup routine for archival storage documents

SharePoint Archiving
- If published and backed up to a unique archival library, ensure that document metadata remains intact
- If moved to a new storage medium or location, ensure that metadata remains intact
- If restored to a SharePoint library, ensure that metadata does not change
- Ensure that documents are in a long-term format (but this is a complex issue and you should work with your records management specialist)
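One way to check that a document and its metadata survived a move or restore is to record a fingerprint of both beforehand and compare afterwards. This is a minimal sketch using SHA-256 from the Python standard library; the function names and the metadata fields shown are hypothetical, and how you export metadata from a SharePoint library is outside its scope.

```python
import hashlib
import json


def fingerprint(content: bytes, metadata: dict) -> dict:
    """Hash the document bytes and a canonical form of its metadata."""
    # Sorting keys makes the hash independent of field order.
    canonical = json.dumps(metadata, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return {
        "content_sha256": hashlib.sha256(content).hexdigest(),
        "metadata_sha256": hashlib.sha256(canonical).hexdigest(),
    }


def changed_parts(before: dict, after: dict) -> list:
    """Return the fingerprint names that differ; an empty list means intact."""
    return [key for key in before if before[key] != after.get(key)]


# Hypothetical example: fingerprint before archiving, compare after restore.
original = fingerprint(b"document bytes", {"Title": "Contract", "Author": "J. Smith"})
restored = fingerprint(b"document bytes", {"Title": "Contract", "Author": "System Account"})
print(changed_parts(original, restored))  # ['metadata_sha256']
```

Storing the fingerprints alongside the archival copy means the same comparison can be repeated after every later migration.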

Recommendations
- This is still a wild frontier, with no certain outcome or single standard
  - "The good thing about standards is that there are so many of them."
- When in doubt about long-term storage of vital documents, paper or film is still a good answer (depending on volume)
- Beware of new technologies, even ones that are standards
  - TIFF, JPEG, PDF, PDF/A are recommended
  - XPS needs to be more universally adopted
- The weight of in-place document formats means that change will be very slow, and may stall altogether unless a dramatic, out-of-the-blue technology appears (by analogy, think of PDF versus XPS)

Recommendations
- Consider establishing a separate group that is specifically chartered with archival storage
- Stakeholders should include Information Management, IT, Legal, HR, and other departments with archival storage requirements
- Positions should be staffed permanently and defined by role (role-based), not tied to individuals such as "Jane Smith" or "Joe Blow"

Conclusion & Questions
Finally! Questions?