Long-term digital preservation of UNSWorks UNSW Library Arif Shaon, Maude Frances CAUL Community Days 2014
UNSW Australia The University of New South Wales at a Glance: https://www.unsw.edu.au/sites/default/files/documents/unsw4009_miniguide_2012_aw2_v2.pdf
UNSW Higher Degree Research UNSW HDR Statistics ~3,500 enrolled, ~4,200 Active Postgraduate Research Students: 80% PhDs / 20% Masters 60% Australian / 40% International Over 700 completions per year UNSW is research intensive university Integrate UNSW research ethos into researcher induction and development programs and collaborate with DVCA and HR on integrating researcher development with staff induction and development approaches Aim to foster good research practice requires training and infrastructure to support those practices
UNSW Library Repository Service UNSW Library has an increasingly important role in the long-term curation (management) and preservation of UNSW research materials Library Repository Service (LRS) supports this by providing Web-based repositories to UNSW academic community Research Centre School Deposit/Edit Primo Web-forms Deposit/Edit Primo Web-forms Deposit/Edit Primo Web-forms Fedora Fedora Faculty Fedora
Outline UNSWorks UNSW Library Digital only thesis collection policy UNSWorks Digital Preservation policy and procedure UNSWorks preservation architecture Future work
UNSWorks The online institutional repository for PhD and Masters by research thesis material 12000+ records; 100000+ more expected after integration with the university s Research Output System (ROS) enables discovery, accessibility and citation stores and disseminates digital preservation information to support the library s digital only thesis collection policy
UNSW Library Digital only thesis collection policy Addresses challenges of managing increasingly large print thesis collection 12000+ volumes of print thesis, growing at 480 volumes per annum Developed in collaboration with Graduate Research School and endorsed by the university s Higher Degree Research committee Effective from January 2014, subject to establishing a dependable and sustainable preservation plan and procedure.
http://www.dailytelegraph.com.au/university-centre-goes-up-in-smoke-at-kensington-in-sydney/story-e6freuy9-1225911563696
UNSW Library Digital only thesis collection policy Addresses challenges of managing increasingly large print thesis collection 12000+ volumes of print thesis, growing at 480 volumes per annum Developed in collaboration with Graduate Research School and endorsed by the Higher Degree Research committee Effective from January 2014, subject to establishing a dependable and sustainable preservation plan and procedure.
UNSWorks Digital Preservation Policy Published in 2014; aligned with UNSW Electronic Record Keeping policy Defines the Library s digital preservation responsibilities Establishes the preservation strategy adopted and specifies supported resource types Provides guidance to Library staff engaged in decision making and other activities that may affect UNSWorks Defines Adequacy of Preservation for UNSWorks Specifies a review period for the policy https://www.gs.unsw.edu.au/policy/documents/digitalpreservationpolicy.pdf
UNSWorks Digital Preservation Procedure (1) Also published in 2014 Outlines the preservation methodology; adopts the Open Archival Information System (OAIS) reference as the underlying framework Establishes the preservation workflow Specifies supported file formats Defines the main roles: depositors, UNSWorks users, UNSW Library, Committee on Research https://www.gs.unsw.edu.au/policy/documents/digitalpreservationprocedure.pdf
UNSWorks Digital Preservation Procedure (2) Resource Type Preservation Level Retention Period PhD and Masters by research theses Research data Level 1 (original files (bitstream), preservation metadata and rendering software) Level 2 (the original files (bitstream) and preservation metadata, including software metadata) Retain permanently Supported Format e.g. PDF 25 years e.g. TIFF, XML Domain specific Level 3 (compressed 25 years e.g. ZIP, TAR files and software original files (bitstream) (as supporting and preservation files for theses or metadata) https://www.gs.unsw.edu.au/policy/documents/digitalpreservationprocedure.pdf research data)
UNSWorks Digital Preservation Workflow https://www.gs.unsw.edu.au/policy/documents/digitalpreservationprocedure.pdf
UNSWorks Digital Preservation Architecture UNSWorks Preservation Storage Thesis and other publication records Ingest UNSWorks Records (Fedora) Software registry (Fedora) File format registry (Bigdata RDF triple store) OAI-PMH UNSWorks Discovery Service (Primo) UNSWorks Preservation Ontology (PREMIS 2.2) File format identification (DROID) Preservation Processes Format conversion Event capture PRONOM Technical Registry
UNSWorks Digital Preservation Processes Automatic file format identification and preservation metadata capture using DROID underpinned by a custom RDF ontology based on PREMIS Data Dictionary v2.2 Integrated with the UK National Archive s technical registry PRONOM to enhance the semantic quality of file format metadata enhanced file format metadata are stored in BigData a highly scalable, Open Source RDF triple store Automatic conversion of files to suitable preservation formats as stipulated by the UNSWorks digital preservation policy Integration with a Fedora-based software repository to record information about software needed to render electronic files
UNSWorks DPP Information dissemination (Primo under development)
Future work Expand format conversion support and further develop software registry Extend UNSWorks preservation policy and procedure to support other Library repositories Alignment with ISO standard for Audit and certification of trustworthy digital repositories (ISO 16363) Preservation of digitised theses
Questions? a.shaon@unsw.edu.au