Survey of Research Data Management Practices at the University of Pretoria

Undertaken by the Department of Library Services in order to improve research practices at the University
Unisa Library Open Access Event, 25 October 2011
Dr Heila Pienaar, UP Library Services, University of Pretoria, South Africa
http://www.ais.up.ac.za/profile/heila_pienaar/index.htm

Content
- Context: UP Library e-strategy
- Data curation / management (definitions)
- Data management concepts, process
- Levels of research data management
- Rationale for the Library's involvement
- Survey research methodology
- Findings
- Top requirements for services
- National initiatives
- Recommendations
- Further actions
- Anecdotes from the survey!

Anecdotes from the survey
- Department of Chemistry: Data is very vulnerable because of the tendency of chemicals to burn.
- Faculty of Engineering: When students graduate they download their data onto DVDs etc. and give it to the supervisor in a paper file. No backups.
- Faculty of Natural and Agricultural Sciences: A student was hijacked and all data stolen, plus the lab book. Data could not be recovered.

e-information Strategy & projects 2011
- Library web
- e-research: VRE; Research Data management; Advocacy
- e-learning
- e-resources
- Repositories
- Mobile services
- Integration with UP e-learning & e-research
- Open Scholarship
- Web / Library 2
- Digitisation & Preservation (incl preparing info for mobile use)

Data & data curation / management (definitions)
A relatively new discipline with many different definitions.
- Research data: Research data, unlike other types of information, is collected, observed, or created, for purposes of analysis to produce original research results. http://www.ed.ac.uk/is/data-management
- Data curation: the curation of records or measurements of information ("data"). Those scientific measurements or records ("data") are further distinguished from the computer science meaning of data to refer to any type of digitally encoded information. http://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=1027&context=lib_dean
- Digital curation: the selection, preservation, maintenance, collection and archiving of digital assets. http://en.wikipedia.org/wiki/digital_curation

Blue Ribbon Task Force on Sustainable Digital Preservation and Access
The Task Force's view on research data:
There is a remarkable growth of data-intensive research in all knowledge domains. In most fields, there is high recognition of the benefits of preserving research data for various purposes and lengths of time. But there are few robust systems for making decisions about what to preserve; and there is often a lack of coordination of roles, responsibilities, and funding sources among those best positioned to preserve data (researchers) and the preservation infrastructure (curation and archiving services) that should support them. Research and education institutions, professional societies, archives, researchers, and the funding agencies that support data creation all have leading roles to play in creating sustainable preservation strategies.
(http://brtf.sdsc.edu/biblio/brtf_final_report.pdf)

Terminology (Blue Ribbon Task Force)
The terminology for digital materials and preservation processes varies among stakeholder communities. As a rule, members of the scientific community refer to digital materials as data; further, they refer to activities that enable use and long-term accessibility as curation and archiving, which, taken together, are called stewardship. In cultural domains and the humanities, digital materials are more often referred to as content, and the activities that ensure their long-term availability are called preservation and access.

Research data management is not only:
- Data archiving, or
- Data backups

Data Management concepts, process
- Data Ownership: This pertains to who has the legal rights to the data and who retains the data after the project is completed.
- Data Collection: This pertains to collecting project data in a consistent, systematic manner (i.e., reliability) and establishing an ongoing system for evaluating and recording changes to the project protocol (i.e., validity).
- Data Storage: This concerns the amount of data that should be stored: enough so that project results can be reconstructed.
- Data Protection: This relates to protecting written and electronic data from physical damage and protecting data integrity, including damage from tampering or theft.
- Data Retention: This refers to the length of time one needs to keep the project data according to the sponsor's or funder's guidelines. It also includes secure destruction of data.
- Data Analysis: This pertains to how raw data are chosen, evaluated, and interpreted into meaningful and significant conclusions that other researchers and the public can understand and use.
- Data Sharing: This concerns how project data and research results are disseminated to other researchers and the general public, and when data should not be shared.
- Data Reporting: This pertains to the publication of conclusive findings, both positive and negative, after the project is completed.
(Steneck, 2004) http://ori.dhhs.gov/education/products/clinicaltools/data.pdf

Why manage research data?
Data management is one of the essential areas of responsible conduct of research. Before starting a new research project, researchers and/or research teams must address issues related to data management. By managing your data you will:
- Meet funding body grant requirements.
- Ensure research integrity and replication.
- Ensure research data and records are accurate, complete, authentic and reliable.
- Increase your research efficiency.
- Save time and resources in the long run.
- Enhance data security and minimise the risk of data loss.
- Prevent duplication of effort by enabling others to use your data.
- Comply with practices conducted in industry and commerce.
http://www.ed.ac.uk/is/data-management

Levels of research data management
- International, e.g. World Data Centre on Climate
- National, e.g. Very Large Database Initiative (DST: CHPC, Meraka, CSIR); NeDICC (Network of Distributed Data & Information Curation Centres) Initiative
- Campus, e.g. repositories for open access

Rationale for the Library's involvement
Thus with the experience gained from traditional cataloguing, indexing and organizational skills coupled to those acquired in developing, establishing and maintaining institutional repositories, the time is ripe for academic librarians to explore their role as data curators.
- Data Curation and Libraries: Short-Term Developments, Long-Term Prospects. http://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=1027&context=lib_dean
- A new role for academic librarians: data curation. http://www.era.lib.ed.ac.uk/handle/1842/3207

Resources needed
People / expertise / systems involved in data management and sharing may include:
- project director designing research (incl. research data management)
- research staff collecting, processing and analysing data
- external / internal contractors involved in data collection, data entry, processing or analysis
- support staff managing and administering research and research funding
- institutional IT services staff providing & advising on formats, repositories, data storage, data archiving and back-up services
- external data centres or web services archives who facilitate data sharing
- meta-data editors: data description, annotation and contextual information

Survey research methodology
- Fifty-two interviews were conducted by 15 information specialists from the relevant Faculty Libraries over the period October 2009 to March 2010.
- Each Faculty's Research Committee was requested by the Vice Principal: Research and Postgraduate Studies to identify up to three researchers to take part in the survey.
- Each researcher also identified at least one postgraduate student who could participate in the survey.
- The information specialists received formal training in interview techniques.
- Interviews were conducted according to a semi-structured interview framework.
- Limitation: Results cannot be generalised to researchers not included in this study.

Distribution of interview respondents
No.  Faculty                                                  Academic staff  Post-graduate students  Total
1    Theology                                                 2               2                       4
2    Humanities                                               4               4                       8
3    Education                                                3               1                       4
4    Law                                                      5               -                       5
5    Economic & Management Sciences                           8               ?                       8
6    Health Sciences                                          3               3                       6
7    Veterinary Sciences                                      3               3                       6
8    Natural and Agricultural Sciences                        5               2                       7
9    Engineering, Built Environment & Information Technology  2               2                       4
     Total                                                    35              17                      52
Note: some post-graduate students are also academic staff.
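The totals in the distribution above can be cross-checked with a few lines of Python. This is only an illustrative sketch: the dictionary `counts` and the function `column_totals` are names introduced here, and the "-" (Law) and "?" (Economic & Management Sciences) entries are assumed to mean zero post-graduate respondents, which is the only reading under which the stated totals add up.

```python
# Respondents per faculty as (academic staff, post-graduate students).
# Assumption: "-" (Law) and "?" (Economic & Management Sciences) are
# treated as 0 post-graduate students.
counts = {
    "Theology": (2, 2),
    "Humanities": (4, 4),
    "Education": (3, 1),
    "Law": (5, 0),
    "Economic & Management Sciences": (8, 0),
    "Health Sciences": (3, 3),
    "Veterinary Sciences": (3, 3),
    "Natural and Agricultural Sciences": (5, 2),
    "Engineering, Built Environment & Information Technology": (2, 2),
}


def column_totals(rows):
    """Return (staff total, student total, grand total) for the table."""
    staff = sum(s for s, _ in rows.values())
    students = sum(p for _, p in rows.values())
    return staff, students, staff + students


print(column_totals(counts))  # (35, 17, 52)
```

The result matches the 35 academic staff, 17 post-graduate students and 52 interviews reported in the survey.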

Findings
The general trends of the findings are given under six major categories: funding, data collection, processing of data, publishing, support, and top requirements for services.

Funding
This part of the interview was included in order to better understand how researchers think about data at the early stage of applying for funding and how aware they are of their funders' requirements in terms of data sharing and archiving.
General trend: It depends on the funding agency's proposal requirements, and in most cases there is no need for data management or data sharing plans.

Data Collection
This aspect was discussed in order to learn about the different ways in which data are collected and captured, the different types of formats and sizes, as well as the usefulness of these data to others.
General trend: UP researchers make use of a wide variety of data collection methods and use both primary and secondary data. Both soft and hard data collection methods are used by all the Faculties. Data sets are often small.

Processing of data
In this portion of the interview the aim was to understand how researchers store data securely.
General trend: Ad hoc storage of data, both on paper and electronically, is the norm. A few servers are available for data storage, but in general the onus rests on the individual or department to decide how and where data is stored.

Publishing
Data publication. This part of the interview was included to see how researchers publish their data, if they do, and to explore the reasons behind not publishing data at all.
General trend: In general raw data is not published for other researchers to use, and it is also not seen as necessary to do so.

Support
This section of the interview was designed to learn about the support researchers receive to manage their data and where they turn for help when they encounter problems.
General trend: Support for research as an activity is good throughout the university (faculty, departments, research support). But there is a lack of support with regard to the storage of data (physical and electronic).

Top requirements for services
At the end of the interview, interviewees discussed their challenges and concerns in managing their data and were asked to suggest services that could help them do their work more effectively.
General trend: The top requirement is a central UP server or repository that is easy to use with good security. There is also a need for physical storage space. The biggest worry of academic staff is lack of sufficient time and lack of support for research by the UP Executive.

National initiatives
- Very Large Data Base (VLDB) mandated to the Centre for High Performance Computing (CHPC) by DST
- UP Library and the VLDB organised a Library Directors' workshop to help identify research data management needs of SA universities

Recommendations
- It can be safely said that research data management does not exist in any formal manner (with the exception of one or two departments) at the University of Pretoria.
- The Very Large Database initiative from the Department of Science and Technology should be investigated to see if it would support UP's research data management needs.
- A formal staff position of research data manager is needed whether UP makes use of an external or internal system / repository or not. Such a position is necessary to drive the research data management endeavour.

Acknowledgement
This survey and report structure is based to a large extent on the Findings of the scoping study interviews and the research data management workshop.
Scoping digital repository services for research data management. A Project of the Office of the Director of IT, www.ict.ox.ac.uk/odit/projects/digitalrepository/
by Luis Martinez-Uribe (luis.martinezuribe@oerc.ox.ac.uk), Digital Repositories Research Co-ordinator, University of Oxford, UK

Further actions
- The Library has allocated a staff position for a UP research data manager.
- Presented the report at the UP Senate Research Committee.
- Requested by Prof Robin Crewe, previous Vice Principal: Research and Postgraduate Studies, to identify a technical solution.
- Decided at the first meeting between Library, IT & Research support to evaluate the maturity of UP to manage research data.

Example of maturity model (draft)
* Based on the CobiT framework generic maturity model: http://www.ee.kth.se/php/modules/publications/reports/2007/ir-ee-ics_2007_026.pdf
** Monash University Library Research Data Planning Checklist: http://www.researchdata.monash.edu/resources/datahdrchecklist.doc

Further actions (continued)
- Proposal on the implementation of research data management at UP accepted in September 2011 by the UP Research Computing Committee.
- Role description and advert for UP Research Data Manager to be finalised (appointment in 2012).
- Pilot study 1: Research data to be included as part of a Department's electronic theses & dissertations.
- Pilot study 2: Urgent request by the Faculty of Health Sciences to support their research data management process.

The end, for now Comments, Questions???