Allan Bell, University of British Columbia Alex Garnett, Simon Fraser University Carla Graebner, Simon Fraser University Geoff Harder, University of

Similar documents
University of British Columbia Library. Persistent Digital Collections Implementation Plan. Final project report Summary version

Introduction to. Digital Curation Workshop. March 14, 2013 SFU Wosk Centre for Dialogue Vancouver, BC

COPPUL Digital Preservation Network: Update for ACCOLEDS. Corey Davis COPPUL DPN Manager

Digital Preservation Platform

UVic Libraries digital preservation framework Digital Preservation Working Group 29 March 2017

Research Data Management & Preservation: A Library Perspective

PREMIS in Archivematica

The Canadian Information Network for Research in the Social Sciences and Humanities.

Assessment of product against OAIS compliance requirements

Agenda. Bibliography

Helping Journals to Upgrade Data Publications for Reusable Research

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Microsoft SharePoint Server 2013 Plan, Configure & Manage

DIGITAL ARCHIVES & PRESERVATION SYSTEMS

Digital Preservation Efforts at UNLV Libraries

The Choice For A Long Term Digital Preservation System or why the IISH favored Archivematica

Assessment of product against OAIS compliance requirements

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

Advanced Solutions of Microsoft SharePoint 2013

Advanced Solutions of Microsoft SharePoint Server 2013

NEW YORK PUBLIC LIBRARY

MASAS. Overview & Backgrounder Document. Consultation Package. CanOps

QUADREAL S PURPOSE To create living and working environments that enhance the lives of the people and communities we serve.

Susan Thomas, Project Manager. An overview of the project. Wellcome Library, 10 October

Woodson Research Center Digital Preservation Policy

Evolving the digital library for digital scholarship enablement

Content Creation & Dissemination Team Recommendations for Annual Goals October 17, 2014

Data Management Checklist

Managing stakeholders and (creating) their expectations

The Evolution of Public-Private Partnerships in Canada

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

Preservation and Access of Digital Audiovisual Assets at the Guggenheim

PRESERVING DIGITAL OBJECTS

Shared Service and Common Purpose Digital Preservation as Infrastructure. Meg 20 March 2018

Building a Digital Archives at the City of Vancouver

Trusted Digital Repositories. A systems approach to determining trustworthiness using DRAMBORA

DRS Update. HL Digital Preservation Services & Library Technology Services Created 2/2017, Updated 4/2017

Scientific Research Data Management Policy

Architecture and Standards Development Lifecycle

Building for the Future

Applying Archival Science to Digital Curation: Advocacy for the Archivist s Role in Implementing and Managing Trusted Digital Repositories

Launching the. Data Curation Network NDS/MBDH 2018

Systems Interoperability and Collaborative Development for Web Archiving

RDM, a view from Vancouver

Working with Islandora

JISC WORK PACKAGE: (Project Plan Appendix B, Version 2 )

Digital Preservation Workshop

The Data Curation Profiles Toolkit: Interview Worksheet

Preservation at scale

Data Curation Profile Food Technology and Processing / Food Preservation

Woolwich CHC adopts the CIW as a strategic planning tool. A tool to shift the conversation. Using the CIW to advance community health and wellbeing

CANARIE Mandate Renewal Proposal

Design Build Services - Service Description-v7

Data Curation Handbook Steps

Preservation Planning in the OAIS Model

The OAIS Reference Model: current implementations

Importance of cultural heritage:

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Digital Preservation Workshop

ISAO SO Product Outline

Response to Wood Buffalo Wildfire KPMG Report. Alberta Municipal Affairs

Preservation. Policy number: PP th March Table of Contents

22 September Urban Modelling. using Open Public. Data. Börkur Sigurbjörnsson Data

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

DRS Policy Guide. Management of DRS operations is the responsibility of staff in Library Technology Services (LTS).

Edinburgh DataShare: Tackling research data in a DSpace institutional repository

ISO Self-Assessment at the British Library. Caylin Smith Repository

Earthquake Preparedness

Digital Preservation: From Theory to Practice

Evaluating and Improving Cybersecurity Capabilities of the Electricity Critical Infrastructure

Electronic Records Archives: Philadelphia Federal Executive Board

Common approaches to management. Presented at the annual conference of the Archives Association of British Columbia, Victoria, B.C.

Protecting Future Access Now Models for Preserving Locally Created Content

Its All About The Metadata

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

PRESERVING DIGITAL OBJECTS

CREATING DIGITAL REPOSITORIES PRESENTED BY CHAMA MPUNDU MFULA CHIEF LIBRARIAN NATIONAL ASSEMBLY OF ZAMBIA

RADAR Introduction and Basic Concepts. Matthias Razum

Introduction to Digital Preservation. Danielle Mericle University of Oregon

Click to edit Master title style

Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

30 April 2012 Comprehensive Exam #3 Jewel H. Ward

Security and Privacy Governance Program Guidelines

Edward M. Corrado and Sandy Card: ELUNA 2011

Interdisciplinary Processes at the Digital Repository of Ireland

Data Protection. Practical Strategies for Getting it Right. Jamie Ross Data Security Day June 8, 2016

An overview of the OAIS and Representation Information

Response to Industry Canada Consultation Developing a Digital Research Infrastructure Strategy

DHS S&T supports National Level Exercise 2011 using SUMMIT

Building a Dutch National Research Infrastructure IRODS UGM 2017

Facilities Master Plan Toronto Public Library Board Consultation

Update on the Government of Canada s Information Technology Transformation Plan

2017 Inpatient Telemedicine Study

2017 Resource Allocations Competition Results

Construction Technologies and State Government Support

New Zealand Government IBM Infrastructure as a Service

Data Curation Profile Human Genomics

Stewarding NOAA s Data: How NCEI Allocates Stewardship Resources

Simplifying IT through Virtualization

13.f Toronto Catholic District School Board's IT Strategic Review - Draft Executive Summary (Refer 8b)

Transcription:

Allan Bell, University of British Columbia Alex Garnett, Simon Fraser University Carla Graebner, Simon Fraser University Geoff Harder, University of Alberta

About us History of collaboration Common infrastructure Our projects Barn raising - the farm of Alex. McLaren, near Pense. Ref Number R-B12988, Item Date 1912. Saskatchewan Archives.

Barn raising From Wikipedia, the free encyclopedia http://en.wikipedia.org/wiki/barn_raising Laying in the lumber and the materials Making the plans Clearing the ground Tradesmen are hired Topping out Identifying local resources Project planning Laying the groundwork Deploy the talent Topping out

Informal Communities of practice / common interest SLAIS Data Camp, CODE4LIB BC Professional associations Geography Formal Canadian Association of Research Libraries Council of Pacific and Prairie University Libraries British Columbia Research Libraries Group

Unique competencies / contributions / expertise COPPUL Digital Preservation Working Group COPPUL PLN Canadian Government Information PLN BCRLG Government Documents Digitization ABACUS/Dspace migrating to Dataverse Data Curation?

Identify local expertise Identify local expertise Develop partnerships Choose the vendor/developer Communicate objectives, parameters and time frame

Environmental Scan Institutional priorities Existing resources Potential partnerships Open Source solutions Community oriented Responsive to funded projects Made available to the community under creative commons licensing upon completion Collaborative continuum

Archivematica Simon Fraser University Library: Research Data Repository [SFU RaDaR] Simon Fraser University Archives University of British Columbia: Digital preservation program University of Alberta :Canadian Polar Data Network University of Saskatchewan Islandora Simon Fraser University Library RaDar University of Alberta University of Saskatchewan

SFU RaDaR To undertake a Proof of Concept project which will identify the technological infrastructure, policies and procedures, and costs associated with the development of a Research Data Management Repository for research data to ensure its security and integrity over time

Two separate but related customizations: - Automated Ingest of multiple files via a scriptable workflow API. - A Storage API to allow Archivematica to use non-filesystem storage (e.g. NFS mounts or Fedora Commons) for objects.

- Archivematica is designed to be actively used by depositors, curators, and archivists. - This doesn t necessarily scale for us -- we want to use it at the tail (preservation) end of existing repository services for electronic theses and local research data. - These objects are homogeneous and can receive the same normalization treatment.

- We re configuring Archivematica to use a local LOCKSS network for storage. - Archivematica is very good at file-level and metadata preservation, but it doesn t do much for long-term bit-level preservation (beyond checksums). - Our use of LOCKSS (which stores multiple copies of each file and verifies them against one another) with the new Storage API makes up for this.

LOCKSS -O- MATIC

Digital Preservation Program New and legacy born digital archival material circle, DSpace-based repository Digitized collections in CONTENTdm Websites Research data COPPUL Archivematica as a Service and EduCloud Centre for Hip Health and Mobility Research Data Management Pilot Research Data Management Training Research Engagement and Consultation Hub: REaCH-IT

circle (Dspace) Archivematica receives submissions from DSpace Also have Archivematica to DSpace workflow

CONTENTdm Master files uploaded to Archivematica Archivematica produces access versions and pushes to CONTENTdm

RBSC/UA born-digital acquisition workflow

Council of Prairie and Pacific University Libraries (COPPUL)- Digital Preservation Working Group Univ of Victoria (gold) Univ of Saskatchewan (gold) Vancouver Island Univ (silver) Univ of Lethbridge (bronze) Mt. Royal Univ (bronze)

Create spreadsheet templates Consent Project X Lead PI: Professor Charles Francis Xavier Primary Contact: Dr. Jean Grey Project Duration: 1960 to present Data Collection Data Processing Final Outcome Medical imaging device 1 Medical imaging device 2 Medical imaging device 3 Motion sensor Physical measures Questionnaires Raw files Store on network drive Raw files Perform batch processing Store on network drive Wear logs Paper packages Securely store paper file packages Analyze images, and determine exclusion/inclusion Analyzed images and standard analysis spreadsheet Batch report Enter data into spreadsheets, check and clean (exclusion/ inclusion) data Enter data into spreadsheets, check and clean (exclusion/ inclusion) data Device 1: perform custom analysis Exported spreadsheet Device 2: perform custom analysis Exported spreadsheet Device 3: perform custom analysis Exported spreadsheet Spreadsheet Final spreadsheet Clean data Final spreadsheet Clean data Final spreadsheet Clean data Store on network drive Upload into Database Spreadsheet Updated Database Export and Analysis Files containing responses to data requests Record and respond to data requests Regularly create participant reports Participant reports Project Details: Longitudinal study. Number of participants unknown. Participants generally suffer from some form of more or less benevolent mutation. Data is collected in various locations around the world, but mainly focused on the Eastern United States. For each Instrument: - Name/ID/Room/Location - Internal/external data source? - Activity performed - Settings - Operator(s) - Security - Use Frequency/Duration For each Outcome: - Types of Files/Documents - Number and Size of Files - Storage Location - Description/Organization - Security and Backup - Data/File Sharing - Need for Confidentiality, Integrity and Availability For each Processing Step: - Workstation/Laptop/ID - Software - Activity performed - Security and Backup - Team Members involved - Use Frequency/Duration

Setting the context: UBC Library s involvement in data management activities (January 2014) Data 101: introduction to research data and data management for Librarians (February 2014) Storing and working with data (March 2014) Data Summit (April/May 2014)

A centralized/distributed research IT support program is urgently needed to advance and coordinate the complex and specialized technology-intensive needs of the research community, address granting agencies requirements to maximize investments, mitigate risks and costs to the institution, and support an environment of research excellence at UBC. Research Engagement and Consultation Hub: REaCH-IT

Support for a growing research data management, curation, and preservation program - situated within a strong agenda to support digital scholarship Open and transparent toolkits Modular, not monolithic Trust what we can see, not what we ve been told Funding planning assumption: no new money!

Archivematica contract and partnership An opportunity to study the issues related to the design of Archival Information Packages (AIPs) for research data sets An opportunity for the development of consistent, shareable AIPs for curated research datasets An opportunity to better understand data formats - automated and manual normalization workflows as part of FPR development An opportunity to build further footings for ongoing and new work with regional, national and international partners like UBC and SFU Use Cases: Canadian Polar Data Network Image: Cow s head in a bucket. c1909. http://peel.library.ualberta.ca/postcards/pc003821.html

Canadian Polar Data Network RDM planning policy development packaging access preservation discovery

Format policy registry - automated and manual normalization Research data doesn t always play nice Should an AIP be kept small, pairing a single data file with its metadata? If a set of AIPs are part of a larger aggregation, for example, a "dataset", "study" or "project," should a separate AIP exist to serve as the parent of a group of related AIPs? Or should the parent information be bundled within each offspring AIP? Should a single AIP consist of all files in a dataset? Different definitions of what equates to a dataset. A related set of files serving some defined purpose; act as a unit. Should a single AIP exist for a study or a project? Is this the right scope? How important is it to be consistent in the design of AIPs? Is it acceptable to produce AIPs on the basis of all three of the above structures? How much should today's physical archival storage limitations play in determining the design of AIPs? For example, if the maximum storage block is three terabytes, how much does this contribute to the decision around smaller versus larger AIPs? The maximum block size certainly limits the size of a single AIP, but should it also contribute to an overall principle around the design of AIPs.

OAIS Reference Model Make use of the concept of Archival Information Collection (AIC) in combination with the Archival Information package (AIP) Model facilitates encapsulation of data files, metadata and any other necessary documentation in one AIC. An AIC can have many AIPs but logically is a single entity for transmission or for archival storage. If parts of an AIC are separated due to any incident, the whole structure can be re-constructed using this model Image source: www.snakeroot.net/farm/buildingtechniques.shtml

An AIC is a type of AIP that contains metadata pertaining to a class of AIPs. All AIPs that are members of a particular class will inherit the metadata of that AIC(kept in a documentation AIP). The structure of an AIC is identical to that of an AIP. The use of AICs offers the following advantages: the type and amount of information contained in an AIC is discretionary; large volumes of metadata can be accommodated easily; the information contained in an AIC is preserved in the same archive as its AIPs; certain types of metadata for an entire class of AIPs need only be stored once; a new AIP can be added without significant changes to the overall structure. Any orphaned AIP can easily be linked backed to its parent Image: Cattle Roping in Alberta. 1912. http://peel.library.ualberta.ca/postcards/pc009286.html

Image source: business.financialpost.com/2013/11/20/rob-fords-crack-scandal-is-driving-up-torontos-borrowing-costs/

Connect Archivematica up with existing CPDN workflows Connect Archivematica up with existing and new infrastructure and workflows Fedora, OpenStack, Dataverse, etc. Leverage Archivematica and ICA-AtoM for University Archives; handshake with Alfresco EDRMS Sustainability Partner investment in common technologies will help ensure our tools remain healthy and strong Growing expertise that can be shared between institutions to allow our brightest minds to do less of the same and more of what is really needed Succession planning for the data and a bridge for easier sharing and collaboration within and outside of the library

Martin Army Community Hospital "Topping Out" Ceremony http://www.flickr.com/photos/savannahcorps/8023265430/ Some rights Reserved.

Barn raising - the farm of Alex. McLaren, near Pense. Ref Number R-A4313, Item Date 1912. Saskatchewan Archives.