Hi, I'm Jody DeRidder, and I'd like to tell you about a recent NHPRC-funded project in which we developed a cheap, fast model for getting large manuscript collections online.

Like other delivery methods that leverage the EAD for access to digitized content, we depend upon the information in the finding aid for search and discovery. The better the finding aid, the better the access. While our grant-funded project included descriptions down to the folder level, only the series-level descriptions are absolutely necessary. We have all this great metadata, created by the archivists, providing context for delivery. Why not use it? Most of us do not have the resources to get large manuscript collections online any other way.

Here's the information we had at hand, which could be transformed into item-level records automatically. For others who want to use our system, we developed configuration files where you would enter the information on the left side for each collection; the information on the right side comes from the file names. This is actually quite a bit of metadata. By creating minimal MODS records, we enable access to discrete items and pave the way for later remediation, possibly by harnessing crowdsourcing technologies.

Here's an example of how we encoded information in the file names, to avoid spreadsheets or hand-created metadata for each item. The file name starts with a letter so we can use the identifier as an XML id attribute. The first segment of our file names indicates the repository source and type of content. The second segment is the collection number, echoing the one used by the archivists. The third segment has three parts: box, folder, and sequence of the item. If there's a fourth segment to the identifier, that means there's more than one page, and this segment holds the page sequence. In our archive and our web directories, the location of the file can be determined by replacing underscores with slashes; this helps us automate our work and leverage the file system to organize our content. Our delivery system, Acumen, infers the relationships between the files by using these file name segments.
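The naming scheme just described can be sketched in a few lines of Python. This is only an illustration, not the project's own code: the sample file name, the digit widths inside the third segment (three for box, two for folder, two for item), and the names `parse_identifier` and `storage_path` are all assumptions made for the example.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed layout: source_collection_boxfolderitem[_page].
# The widths 3/2/2 inside the third segment are hypothetical.
PATTERN = re.compile(
    r"^(?P<source>[A-Za-z][A-Za-z0-9]*)"                 # must start with a letter
    r"_(?P<collection>\d+)"                              # collection number
    r"_(?P<box>\d{3})(?P<folder>\d{2})(?P<item>\d{2})"   # box, folder, item sequence
    r"(?:_(?P<page>\d+))?$"                              # optional page sequence
)


@dataclass
class ItemId:
    source: str
    collection: str
    box: int
    folder: int
    item: int
    page: Optional[int]  # None for single-page items


def _stem(name: str) -> str:
    """Drop a trailing file extension, if there is one."""
    return name.rsplit(".", 1)[0]


def parse_identifier(name: str) -> ItemId:
    """Split a file name into the segments described in the talk."""
    match = PATTERN.match(_stem(name))
    if match is None:
        raise ValueError(f"file name does not follow the scheme: {name}")
    page = match.group("page")
    return ItemId(
        source=match.group("source"),
        collection=match.group("collection"),
        box=int(match.group("box")),
        folder=int(match.group("folder")),
        item=int(match.group("item")),
        page=int(page) if page is not None else None,
    )


def storage_path(name: str) -> str:
    """Derive the directory location by replacing underscores with slashes."""
    return _stem(name).replace("_", "/")
```

With a made-up name such as `u0003_0000252_0010203_0004.tif`, the parser would report box 1, folder 2, item 3, page 4, and the storage location would be `u0003/0000252/0010203/0004`.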

This is a segment of the template we provide with our open source software. The values with percent signs are filled in with the information provided in the configuration file for the collection. The capitalized segments are filled in from the file names, though the item link is actually a combination of the two. To use our software, you make a collection-specific copy of the basic config file, add in information about your specific collection and display requirements, add the path to your files, and run the scripts. The software will generate the MODS, link the content into the EAD, create HTML display files to meet your specifications, generate derivatives, and move content into your web directories.
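To make the two kinds of placeholders concrete, here is a minimal Python sketch of that substitution. The template text, the placeholder names (`%collection_title%`, `%base_url%`, `BOX`, `FOLDER`, `ITEM`, `IDENTIFIER`), and the function `fill_template` are hypothetical stand-ins; the actual template shipped with the software may look quite different.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical template fragment: %name% values come from the collection
# config file, CAPITALIZED tokens come from the file name.
TEMPLATE = """<mods xmlns="http://www.loc.gov/mods/v3">
  <titleInfo><title>%collection_title%, Box BOX, Folder FOLDER, Item ITEM</title></titleInfo>
  <identifier type="local">IDENTIFIER</identifier>
  <location><url>%base_url%/IDENTIFIER</url></location>
</mods>"""


def fill_template(template: str, config: dict, file_values: dict) -> str:
    """Fill %config% placeholders first, then whole-word capitalized tokens."""
    # Config values are XML-escaped because they are typed in by hand.
    filled = re.sub(r"%(\w+)%", lambda m: escape(config[m.group(1)]), template)
    # Match only the capitalized tokens we have values for, as whole words.
    tokens = re.compile(r"\b(" + "|".join(map(re.escape, file_values)) + r")\b")
    return tokens.sub(lambda m: escape(file_values[m.group(1)]), filled)


config = {
    "collection_title": "Smith & Jones Papers",   # made-up collection
    "base_url": "https://example.org/items",      # made-up web location
}
file_values = {
    "BOX": "1",
    "FOLDER": "2",
    "ITEM": "3",
    "IDENTIFIER": "u0003_0000252_0010203",        # made-up file name stem
}
record = fill_template(TEMPLATE, config, file_values)
```

The `<url>` line shows the point about the item link: its first half comes from the configuration and its second half from the file name.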

Here's an example of our own MODS display in Acumen. Clicking on the thumbnails provides large image access with zoom and pan capabilities. However, we recognize that many people already have their own method for delivering EADs to the web. So for those who don't want to use our open source Acumen software, we developed HTML templates for both items and folders, which make use of your configuration specifications for logos, color, display, and more.

Here's a sample of an item using our HTML template. As you can see, the metadata appears in the upper portion of the page, and you can see at a glance how many pages the item has and access them via the navigation bar at the top. This display also has pan and zoom capabilities. Of course, the logo and information are all provided by you, so the actual display would be specific to your institution.

If you don't want to link a list of items into the EAD, you have the option of linking folders instead. This is preferable particularly if you do not have folder-level description. Here is a sample display of the contents of a folder that was linked into the EAD. Clicking on any of the items brings you to the item page we just saw. Again, known metadata is displayed at the top, with a return link to the EAD online to provide context.

Our focus was on streamlining and automation. The more we can automate our work, the cheaper it is. We use scripts for almost everything: checking the file names, verifying the EAD is linkable, generating the JPEGs, linking the content, and moving the files. Oh, and work can be completed in batches. For large manuscript collections, you want to make content available online as soon as you have it digitized, right? We support batch processing of segments of the collection at a time. Digitize a box or two, get it online, and move on to the next boxes. No problem. We recently analyzed the cost of delivering content this way; not counting hardware, software, supervision, and overhead, it came to less than 80 cents per page. That's less than a third of the cost of our usual method of digitization, where we hand-create item-level metadata. Clearly, this is potentially a great solution for large manuscript collections that may otherwise never see the light of day online. We thank the NHPRC for funding this project.
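As a rough idea of what a file name check on one batch might look like, here is a small Python sketch. It is not the project's script: the `.tif` extension, the rule that page sequences start at 1 with no gaps, and the name `check_batch` are assumptions for illustration.

```python
import re
from collections import defaultdict

# Assumed pattern: letter-initial source, collection number, item segment,
# optional page segment, and a .tif extension (the extension is a guess).
NAME_RE = re.compile(r"^[A-Za-z][A-Za-z0-9]*_\d+_\d+(?:_\d+)?\.tif$")


def check_batch(filenames):
    """Return (badly named files, items whose page sequence has gaps)."""
    bad = [name for name in filenames if not NAME_RE.match(name)]
    pages = defaultdict(list)
    for name in filenames:
        if name in bad:
            continue
        parts = name[: -len(".tif")].split("_")
        if len(parts) == 4:  # a fourth segment means a multi-page item
            pages["_".join(parts[:3])].append(int(parts[3]))
    # Assumption: pages of an item are numbered 1..n with nothing missing.
    gaps = sorted(
        item for item, seq in pages.items()
        if sorted(seq) != list(range(1, len(seq) + 1))
    )
    return bad, gaps


batch = [
    "u0003_0000252_0010203_0001.tif",
    "u0003_0000252_0010203_0003.tif",  # page 2 is missing
    "u0003_0000252_0010204.tif",
    "0003_bad.tif",                    # starts with a digit
]
bad_names, items_with_gaps = check_batch(batch)
```

Running a check like this on each box as it comes off the scanner would let problems surface before any MODS or derivatives are generated.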

Here's a link to a recent article about our work processes; links to our wiki, project site, and display; and also a link to our open source delivery system, Acumen, which automatically indexes content in web directories anywhere and handles multiple levels of granularity, materials, and XML metadata simultaneously. Please feel free to contact me with any questions. Thank you!