Research Data Management and Institutional Repositories


Research Data Management and Institutional Repositories
2014 LIS Research Symposium, UNISA
Dr Lucia Lötter
Social science that makes a difference
24 July 2014

Presentation overview
  Data and Research Data Management (RDM)
  Importance of research data and research data management
  Supporting RDM
  Repositories in a data ecosystem
  Institutional Repositories and RDM
  Requirements for research data repositories
    Nature of research data
    The management of research data as digital objects
  Examples
  Closing remarks

What is research data?
"Research data, unlike other types of information, is collected, observed, or created for purposes of analysis to produce original research results."
Edinburgh University 2010
Digital objects

Importance of research data & RDM
"Well-managed data in digital form have great potential to be searched, accessed, mined and reused. Data may be examined to validate research results; it could be consulted by researchers of related interest and save time and resources in data re-collecting; data may even be re-purposed to answer questions unrelated to the context in which it was first generated or gathered. The value of data grows significantly as the data form more accessible collections. The awareness of the 'data deluge' phenomenon, the potential impact of data reuse, and the desire to maximize the return on investment of research funding all led to increasing amounts of discussion on research data management."
Wong 2009

Research Data Management
"Data management is the process of controlling the information generated during a research project. Managing data is an integral part of the research process. How data is managed depends on the types of data involved, how data is collected and stored, and how it is used - throughout the research lifecycle."
University of Leicester 2014

Supporting RDM
  Data re-use services
  Research Data Management IT Platforms
  Data curation services
  RDM services
  Policies
  Resources
  Capacity

Research Data Management Platforms
[Diagram. Groups: PRIMARY RESEARCH (data creators), SECONDARY DATA USE (data users), DATA CURATION (data curators). Activity labels: proposal writing & project start-up; data management planning; data creation; research in-process (organising, storing, collaboration); data analysis; prepare for deposit of data; ingest; managing & preparing data; preserving data; preservation; discovery of data for re-use; using data; sharing data; dissemination. Platform labels: login, transfer, storage, interfaces, tools, research information.]

Repositories in a data ecosystem
Types of repositories
  Institutional repositories
  Discipline (subject repositories / data centres)
  Research / Community / Reference data collections
  Metadata repositories / aggregators
Implementation models
  One or multiple custodians
  Different data pathways
    Data producer -> Analysis (researcher)
    Data producer -> IR
    Data producer -> IR -> Discipline repository
    Data producer -> Discipline repository -> Metadata repository
National Science Board 2005; Beagrie, Chruszcz et al. 2008; Wilson, Martinez-Uribe et al. 2011

Repositories in a data ecosystem
The curation continuum
Levels of curation (Rusbridge, 2010)
  High (high levels of expertise, where subject specialists are involved during the ingest phase of data archiving, adding and cleaning descriptive metadata)
  Low (greater degree of automation; minimal manual intervention)
Information continuum
Wilson, Martinez-Uribe et al. 2011; Treloar, Harboe-Ree 2008

Institutional Repositories
"An IR is a set of services and technologies that provide the means to collect, manage, provide access to, disseminate, and preserve digital materials produced at an institution."
Shreeves, Cragin 2008
Research outputs
Research data?

IRs and research data

IRs and research data Wong 2009; Shreeves, Cragin 2008, Salo 2010; Choudhury 2008

Questions to answer
Who are the users of the services and stakeholders?
  What do they need / require?
What is the nature of the content?
  Are data sets unique digital objects with unique requirements?
  Do data sets require a unique set of services?
  How is the content produced and used?
How will the platform fit into the bigger data ecosystem?
What is achievable within the context of the organisation?
  Other platforms and services in the organisation
  Commitment of organisation (financial and human resources)
  Readiness of the organisation

Data-related requirements
Nature of research data
  1. Heterogeneity
  2. Contextual dependence
  3. Transient nature
  4. Digital nature
Standards & protocols
RDM services
  Curation services
  Re-use services
  Data repository services
Jacobs, Thomas et al. 2008; Wong 2009; Witt 2008; Plale, McDonald et al. 2013; Salo 2010; Taylor 2013; Palmer 2008; Weber 2011

Heterogeneity
  Research methodologies
  Practices of researchers
  Kinds of data, sizes, formats and composition
  Data value
  Raw, aggregated data
Flexible and highly scalable
Clarity on what the repository caters for
Taylor 2013

Contextual dependence
  Collection composition
  Linking items (data files, contextual documents, outputs)
  Data with outputs or outputs with data
  Organise into project-based collections (a "bag")
http://curation.hsrc.ac.za/dataset-278.phtml
Weber 2011

Transient nature
  Snapshots vs live data
  Versioning of data sets
"digital preservation systems designed to steward only final, unchanging materials can only fail faced with real-world datasets and data-use practices." Salo 2010
  Authority management (Author IDs)
  Persistent identifiers (DOIs)
  Citation standards
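One way to picture "snapshots vs live data" together with persistent identifiers and citation standards: each published snapshot of a data set is frozen, gets its own version number and its own DOI, and can therefore be cited unambiguously. The sketch below is illustrative only. The class name, field names, DOIs and repository name are all hypothetical, and the citation pattern is only loosely modelled on the creator / year / title / version / publisher / identifier form commonly recommended for data citation.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)  # frozen: a published snapshot never changes
class DatasetSnapshot:
    creators: tuple
    year: int
    title: str
    version: str
    publisher: str
    doi: str  # hypothetical DOI, one per snapshot

    def citation(self) -> str:
        """Build a citation string that pins the exact version used."""
        names = "; ".join(self.creators)
        return (
            f"{names} ({self.year}). {self.title}. "
            f"Version {self.version}. {self.publisher}. "
            f"https://doi.org/{self.doi}"
        )


# First published snapshot of a (hypothetical) live data set.
v1 = DatasetSnapshot(
    creators=("Example, A.", "Sample, B."),
    year=2014,
    title="Household Survey",
    version="1.0",
    publisher="Example Repository",
    doi="10.5555/example.v1",
)

# A later snapshot is a new object with a new version and a new DOI;
# the earlier snapshot stays citable exactly as it was.
v2 = replace(v1, year=2015, version="2.0", doi="10.5555/example.v2")

print(v1.citation())
print(v2.citation())
```

Keeping snapshots immutable and issuing a new identifier per version is one design choice among several; some repositories instead use a single DOI that resolves to a landing page listing all versions.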

Digital object management
Data submission (deposit)
  Ethics requirements for human subjects
    Consent to share
    Anonymisation
  Ownership issues
  Self-archiving
  Automation
Taylor 2013

Digital object management
Access (consistent with tools and processes of the research community)
  Discovery services (browsing, searching, OAI-PMH)
  Metadata
    Discovery, determine relevance, make data useable, provenance
    Compliance with recognized standards of the community
  Dissemination formats
    Ways to serve and use data
  Usage terms and conditions
  Access management
  Usage statistics

Digital object management
Preservation
  Preservation management
    Registries
    Retention period, de-accessioning, destruction
  Strategies to support file formats and their long-term usability
    Archival formats
    Format migration
  Storage and storage management
    Multiple copies, multi-media, multi-site
    Back-up
    Disaster recovery
    Security
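Keeping multiple copies on multiple sites only helps if the copies can be shown to be identical to what was deposited. A common way to do this is a fixity check: record a checksum at ingest and recompute it for every copy later. The sketch below assumes SHA-256 and local file paths; the function names are hypothetical and not taken from any particular repository platform.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so large data files do not fill memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_copies(expected_digest, copies):
    """Compare each stored copy against the digest recorded at ingest.

    Returns a dict mapping each path (as a string) to True when the
    copy still matches and False when it has changed or been damaged.
    """
    return {str(path): sha256_of(path) == expected_digest for path in copies}
```

A scheduled job could call `check_copies` with the digest stored in the preservation metadata and the paths of each replica, and flag any copy reported as `False` for repair from a good copy.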

Standards and protocols
Interoperability
  Metadata (machine readable in appropriate standard)
Various standards and protocols
  Trusted Digital Repository (TRAC, ISO 16363) (http://www.iso.org/iso/home/store/catalogue_tc/catalogue_detail.htm?csnumber=56510)
  Open Archival Information System reference model (OAIS, ISO 14721:2012) (http://www.iso.org/iso/catalogue_detail.htm?csnumber=57284)
  Open Archives Initiative Object Reuse and Exchange (OAI-ORE) (http://www.openarchives.org/ore/)
  Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) (http://www.openarchives.org/pmh/)
  Simple Web-service Offering Repository Deposit (SWORD) (http://swordapp.org/)
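To make the OAI-PMH entry concrete: a harvester (for example a metadata aggregator) asks a repository for records with plain HTTP requests built from a verb and a few arguments. The sketch below only builds the request URL for the `ListRecords` verb. The base URL is a hypothetical placeholder and the helper name is an assumption; the verb and argument names (`metadataPrefix`, `set`, `from`, `resumptionToken`) are those defined by OAI-PMH 2.0.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None,
                     from_date=None, resumption_token=None):
    """Build an OAI-PMH ListRecords request URL.

    base_url is the repository's OAI-PMH endpoint (hypothetical here).
    """
    if resumption_token:
        # When continuing a partial list, the token is the only
        # argument allowed besides the verb.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec      # restrict to one collection
        if from_date:
            params["from"] = from_date    # only records changed since this date
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint; a real repository publishes its own base URL.
print(list_records_url("https://repository.example.org/oai",
                       from_date="2014-07-24"))
```

The `oai_dc` prefix requests simple Dublin Core, which every OAI-PMH repository is required to support; richer, community-specific formats can be requested where the repository offers them.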

Examples
  Flexible Storage and Metadata Architectures
  De-coupling Ingest, Storage and Use
Witt 2008; Taylor 2013; Beagrie, Chruszcz et al. 2008

Closing remarks
  Not just a "one size fits all", one-vendor solution (install and go)
  RDM services for small data are labour intensive (various roles and responsibilities)
  Don't see IRs in isolation
  Clear vision of aims
  Investigate and experiment
  Collaborate
The verdict?

References
BEAGRIE, N., CHRUSZCZ, J. and LAVOIE, B., 2008. Keeping research data safe: A cost model and guidance for UK universities. Final Report to JISC.
BUSINESS DICTIONARY, What is data management? definition and meaning. Available: http://www.businessdictionary.com/definition/data-management.html [7/23/2014, 2014].
CHOUDHURY, G.S., 2008. Case study in data curation at Johns Hopkins University. Library Trends, 57(2), pp. 211-220.
EDINBURGH UNIVERSITY, 2010. Edinburgh University Data Library Research Data Management Handbook. Edinburgh University.
JACOBS, N., THOMAS, A. and MCGREGOR, A., 2008. Institutional repositories in the UK: the JISC approach. Library Trends, 57(2), pp. 124-141.
LORD, P., MACDONALD, A., LYON, L. and GIARETTA, D., 2004. From data deluge to data curation, Proceedings of the UK e-science All Hands meeting 2004, pp. 371-357.
MIX, K. and TAYLOR JR, L., VLA Paraprofessional Forum Twentieth Annual Conference.
NATIONAL SCIENCE BOARD, 2005. Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century. National Science Foundation.
PALMER, C.L., TEFFEAU, L.C. and NEWTON, M.P., 2008. Strategies for institutional repository development: a case study of three evolving initiatives. Library Trends, 57(2), pp. 142-167.
PITTMAN, D., 2010. NSF revamps data-sharing policy. Chemical & Engineering News, 88(39), pp. 46-47.
PLALE, B., MCDONALD, R.H., CHANDRASEKAR, K., KOUPER, I., KONKIEL, S., HEDSTROM, M.L., MYERS, J. and KUMAR, P., 2013. SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science. International Journal of Digital Curation, 8(2), pp. 172-180.
SALO, D., 2010. Retooling libraries for the data challenge. Ariadne, 64.
SHREEVES, S.L. and CRAGIN, M.H., 2008. Introduction: Institutional repositories: Current state and future. Library Trends, 57(2), pp. 89-97.
TAYLOR, L., 2013. Plugging the big data gap in DSpace using SWORD and Globus. United Kingdom: University of Exeter.
TRELOAR, A. and HARBOE-REE, C., 2008. Data management and the curation continuum: how the Monash experience is informing repository relationships. Proceedings of VALA 2008.
UNIVERSITY OF LEICESTER, What is research data. University of Leicester. Available: http://www2.le.ac.uk/services/researchdata/rdm/what-is-rdm/research-data [7/23/2014, 2014].
WEBER, B., 2011. The RUresearch Data Portal Providing Customized Access for Specific Types of Data and Primary Users. United States of America: Rutgers University Libraries.
WILSON, J.A., MARTINEZ-URIBE, L., FRASER, M.A. and JEFFREYS, P., 2011. An institutional approach to developing research data management infrastructure. International Journal of Digital Curation, 6(2), pp. 274-287.
WITT, M., 2008. Institutional repositories and research data curation in a distributed environment. Library Trends, 57(2), pp. 191-201.

Thank you
Lucia Lötter
llotter@hsrc.ac.za
www.hsrc.ac.za