Introduction to DataStage (David Shotton, PI, JISC UMF DataFlow Project)


1 DataFlow VIDaaS Launch Event, Saïd Business School, Oxford University, 2 March 2012
The JISC UMF DataFlow Project: Introduction to DataStage
David Shotton (PI, JISC UMF DataFlow Project), Image BioInformatics Research Group, Department of Zoology, University of Oxford, UK
david.shotton@zoo.ox.ac.uk
David Shotton, 2012. Published under the Creative Commons Attribution-Noncommercial-Share Alike 3.0 Licence

2 And the winning platform is...
At Queen Mary College, there is a JISC MRD Project entitled "Sustainable Management of Digital Music Research Data". After carefully reviewing several data management systems last December, including Fedora Commons, DataVerse and DSpace, they concluded:
"On paper, DataFlow is a winner: it meets (almost) all our requirements, especially because of DataStage, something other platforms don't offer. DataStage would be particularly appreciated, because it would make the integration of the system in the research workflow much less disruptive. Sadly, the availability of DataFlow software will come too late to be useful for our short project (October 2011 to March 2012)."
Well, now the DataFlow software systems, DataStage and DataBank, are available, and we hope they will meet the needs of many of you here.

3 Why don't researchers publish data?
Three pressures presently prevent researchers from publishing their data:
- Information overload and pressure of work
  - With twenty new papers each week, a researcher can never catch up: there is just too much new scientific information being produced now
  - Have to run to stand still, with no time for fringe activities like data curation
- Departmental pressure for financial viability, determined by the REF
  - pressure to win grants and to publish in high impact journals
  - negligible incentives and academic reward in terms of peer esteem, tenure or promotion for data publication activities
- Cognitive overhead and skill barriers to best-practice data management
  - metadata concepts are foreign to most biomedical researchers
  - large amount of effort involved in preparing data for publication
[From evidence submitted 5 August 2011 to the Royal Society's Science as a Public Enterprise policy study]

4 Easing the pain of data archiving and publication

5 Making data management as simple as possible - the principle of sheer curation
Create a data management infrastructure that:
- works with you rather than against you
- accommodates the data management tools with which you are already familiar (e.g. spreadsheets)
- provides services that are of immediate benefit in your day-to-day activities (e.g. shared file access)
- makes data management, data publication and data archiving activities sufficiently lightweight, intuitive and transparent that they are easily achieved, without imposing a significant cognitive overhead
By achieving this, we can bridge the gap between laboratory and repository

6 Managing data using a two-tier infrastructure. Tier One: DataStage
- Researchers can save files to a secure private DataStage file store
- This is purely for their own benefit
- Just a file store: does not pose a cognitive overhead (sheer curation)
- Requires no software installation on the researchers' computers
- Designed for deployment at the research group level, locally or on a cloud
- Primary access is as a mapped network drive, Drive D:, on each computer
- You save files to DataStage just as you would to your local hard drive
- No restrictions or limitations of file type: use whatever you normally use
- Web access allows users to browse files within DataStage
Advantages over a cheap hard drive from PC World under your desk:
- Regular nightly automated backup: no need to remember to do so
- Private, shared and collaborative areas, with controlled group access
- Additional Web interface to DataStage, using the same user credentials
- Can invite overseas colleagues to access your files, via password control

7 Managing data using a two-tier infrastructure. Spanning the tiers: DataStage to DataBank
- The special Web submission interface permits researchers to select and package data files for publication and long-term repository archiving
  - Easy to do
  - When the researcher is ready
  - Minimal metadata requirement, to encourage usage
- The selected files are put in a special directory, with optional sub-directories
- The files are accompanied by simple metadata stored as an RDF manifest
- It is possible to represent data files stored elsewhere using URIs: useful for large data files that already have stable storage locations
- Packaging uses the BagIt file packaging specification from the California Digital Library
- The resulting files are then zipped into a single object for transmission to DataBank, the institutional data repository
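The packaging steps on this slide can be illustrated with a minimal Python sketch. It is not the DataStage implementation: the function names, the `manifest.rdf` file name, the Dublin Core property names and the choice of SHA-256 checksums are all assumptions made for illustration. Only the general BagIt layout (a `bagit.txt` declaration, a `data/` payload directory and a checksum manifest) follows the BagIt specification.

```python
import hashlib
import zipfile
from pathlib import Path
from xml.sax.saxutils import escape

# Declaration file required at the root of every bag (version is an assumption).
BAGIT_DECLARATION = "BagIt-Version: 0.97\nTag-File-Character-Encoding: UTF-8\n"


def make_bag(files, bag_dir, metadata):
    """Copy `files` into a BagIt-style directory and write its tag files."""
    bag_dir = Path(bag_dir)
    data_dir = bag_dir / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    manifest_lines = []
    for src in map(Path, files):
        payload = src.read_bytes()
        (data_dir / src.name).write_bytes(payload)
        digest = hashlib.sha256(payload).hexdigest()
        manifest_lines.append(f"{digest}  data/{src.name}")

    (bag_dir / "bagit.txt").write_text(BAGIT_DECLARATION, encoding="utf-8")
    (bag_dir / "manifest-sha256.txt").write_text(
        "\n".join(manifest_lines) + "\n", encoding="utf-8")

    # Hypothetical RDF manifest: one Dublin Core property per metadata field.
    properties = "".join(
        f"    <dcterms:{name}>{escape(value)}</dcterms:{name}>\n"
        for name, value in metadata.items())
    rdf = (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"\n'
        '         xmlns:dcterms="http://purl.org/dc/terms/">\n'
        '  <rdf:Description rdf:about="">\n'
        f"{properties}"
        "  </rdf:Description>\n"
        "</rdf:RDF>\n")
    (bag_dir / "manifest.rdf").write_text(rdf, encoding="utf-8")
    return bag_dir


def zip_bag(bag_dir, zip_path):
    """Zip the whole bag directory into a single file for transmission."""
    bag_dir = Path(bag_dir)
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(bag_dir.rglob("*")):
            if path.is_file():
                arcname = path.relative_to(bag_dir.parent).as_posix()
                archive.write(path, arcname)
    return Path(zip_path)
```

A caller would run `make_bag([...], "bag", {"title": "...", "creator": "..."})` and then `zip_bag("bag", "bag.zip")`. Keeping the checksum manifest inside the package lets the receiving repository confirm that each file arrived intact.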

8 Managing data using a two-tier infrastructure. Tier Two: DataBank
- DataBank is a scalable data repository designed for institutional deployment
- Developed by the Bodleian Library, with a track record in preservation
- Cloud-deployable
- Easy for researcher to update a revised dataset if required
- Data packages normally published under a CCZero Open Data Waiver
- Confidential data packages can be kept in a separate dark repository
- Data packages assigned DOIs, making them citable (for academic credit)
- Optional user-defined embargo period to permit journal article publication
Upon receipt of a DataStage data package, DataBank
- unzips the data package to give access to the files
- mints a DOI for the data package, and registers it with DataCite
- displays the RDF manifest metadata, and enriches it (e.g. with the DOI)
- indexes the metadata, and provides a search and browse interface
DataBank is, in actuality, just an interface layer over a generic object store, as Neil will explain later this morning

9 DataFlow software services - summary
[Diagram labels: Researchers; DataStage file system; Zipped BagIt Data Package with RDF metadata manifest; Researchers, other users; DataBank repository]

10 The DataStage / DataBank Beta Launch
The DataFlow Project has involved
- taking our initial working DataStage and DataBank prototypes
- undertaking a complete code review, rewriting where necessary
- improving the user interfaces
- preparing the software for deployment in two forms
  - as a Virtual Machine to run in a VMware environment
  - as a Debian Package to install on the Ubuntu operating system
- writing documentation to describe the installation and functionality
Beta releases v0.1 of these DataStage and DataBank services are now available
- can be run locally or on a cloud
- installation easy and customizable (e.g. your name & logo)
- enable research groups and institutions to provide their members with zero-cost data management solutions (apart from hosting costs)
- cloud provision can expand and shrink with requirements
- no need to build and staff your own local data centre

11 Acknowledgements
... thanks to the JISC UMF for funding, and acknowledgement of the excellent work of my DataFlow colleagues:
- Bhavana Ananda, Katherine Fletcher, Graham Klyne (IBRG)
- Ian Chard, Neil Jefferies, Anusha Ranganathan (Bodleian Library)
- Alex Dutton, Joseph Talbot (OU Computing Service)
- Gabriel Hanganu, Sander van der Waal (OSS Watch)
- Ross Gardler (Open Directive LLP)
- Neil Caithness, Matteo Turilli, David Wallom (Oxford e-Research Centre)
- Richard Jones, Ben O'Steen (Cottage Labs)
- Stephanie Taylor (Critical Eye Communications)
- Matthew Barker, Tom Ellis, Alex Hartwig (Canonical Ltd)

12 ... time for a user endorsement
- Chris Holland, Department of Zoology
... and a DataStage demo
- Graham Klyne, architect of the original DataStage prototype
- Bhavana Ananda, current DataStage developer

13 New for Beta Release v0.2, early April 2012
- Integration of SWORD v2 repository submission protocol
  - DataStage data packages can be submitted to any SWORD-compliant repository (e.g. the Dryad Data Repository)
  - DataBank will be able to ingest data packages from any SWORD client
- DataBank, as well as DataStage, will by then have Debian packaging for ease of deployment onto Ubuntu Linux hosts
- Re-inclusion of WebDAV, to permit users to read and write via Web access
- Deployment will be tested on a wider range of cloud hosting environments
  - for both VMware virtual machine and Debian package installation
  - including the Eduserv academic cloud
- User interface improvement and additional functionality on the basis of existing plans and user feedback
Leading to a fully-featured release (Version 1.0) in May 2012
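To make the SWORD submission step more concrete, the sketch below assembles the HTTP headers a client might send with a SWORD v2 binary deposit of a zipped data package. It is an illustration under stated assumptions rather than DataStage's own client code: the function name is hypothetical, the `SimpleZip` packaging URI is only one of the identifiers a repository may accept, and sending the MD5 checksum as a hex digest should be checked against the target repository.

```python
import hashlib


def build_deposit_headers(package_bytes, filename, in_progress=False):
    """Return HTTP headers for a SWORD v2 binary deposit of a zipped package."""
    return {
        "Content-Type": "application/zip",
        "Content-Disposition": f"attachment; filename={filename}",
        # Assumed packaging identifier; a repository may expect a different URI.
        "Packaging": "http://purl.org/net/sword/package/SimpleZip",
        # Checksum lets the server detect a corrupted upload (hex form assumed).
        "Content-MD5": hashlib.md5(package_bytes).hexdigest(),
        # "true" tells the server that more content will follow before completion.
        "In-Progress": "true" if in_progress else "false",
    }
```

The deposit itself would be an HTTP POST of the zip file's bytes, with these headers and suitable credentials, to the collection URI advertised by the repository's service document.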

14 DataFlow services summary, adding SWORD
[Diagram labels: Researchers; DataStage file system; Zipped BagIt Data Package with RDF metadata manifest; Researchers, other users; SWORD deposit protocol; DataBank repository]

15 The conventional research data lifecycle
[Diagram labels: Hypothesis formulation and project design; Scholarly publications: conference papers and journal articles; Publication activities; Institutional repositories; Research plan; Research results and conclusions; Experimentation and data creation; Data selection and interpretation; Raw data in research notebooks and live PC files; Research datasets abandoned on local hard drives or CD-ROMs]

16 The DataFlow-enhanced research data lifecycle
[Diagram labels: Dissemination; Open data on Web; Hypothesis formulation and project design; Research plan; Scholarly publications: conference papers and journal articles; Publication activities; Research results and conclusions; DataBank repository; Archived datasets; Preservation; Experimentation and data creation; Raw data in research notebooks and live PC files; Data selection and interpretation; DataStage filestore; Private yet sharable; Management]

17 So what have we got in DataStage?
- Just a file store, appearing as a mapped drive: easy to use
- Customizable access controls to suit different types of groups
- Does not require software installation on user's computer
  - Uses standard software components found on every client machine
  - Cross-platform: Windows, Mac or Linux
- DataStage server hosted on Ubuntu Linux system
- Deployable locally, or on a cloud
- FREE, apart from hosting costs
- Has Web access, permitting Web apps to be built on top
  - For example, for data packaging and SWORD repository submission
  - Other Web apps possible...
- Can be used for other things than just storing datasets

19 Wider applications of DataStage
Escaping the Ivory Tower
- Applications in commerce
- Applications in education

20 Adding a security app
[Diagram labels: Data Packaging; Security wrapper; DataStage kernel; SWORD deposit protocol; DataBank or other SWORD repository]
- Time-stamp each data file using irrevocable method
- Encrypt each data file using, for example, the OpenPGP standard
- Create a data package of time-stamped encrypted files
- Compute the UNF (Universal Numeric Fingerprint) for the data package, so one can later ensure that it has not been altered
Applications:
- Experimental data security for patent application, e.g. pharmaceuticals
- Secure storage of financial data: many commercial companies
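The fingerprinting idea can be sketched in a few lines of Python. This is a deliberately simplified stand-in, not the proposed security app: it uses a plain SHA-256 digest instead of the UNF algorithm, records the local UTC time rather than an irrevocable third-party time-stamp, and omits OpenPGP encryption entirely. All function names are hypothetical.

```python
import hashlib
from datetime import datetime, timezone


def fingerprint_package(files):
    """Return one SHA-256 hex digest covering every file name and its content.

    `files` maps file names to their bytes. SHA-256 stands in for UNF here.
    """
    outer = hashlib.sha256()
    for name in sorted(files):  # sorted, so the result ignores insertion order
        inner = hashlib.sha256(files[name]).hexdigest()
        outer.update(f"{name}\0{inner}\n".encode("utf-8"))
    return outer.hexdigest()


def seal_package(files, now=None):
    """Record when the package was sealed and what its fingerprint was."""
    now = now or datetime.now(timezone.utc)  # placeholder for a trusted time-stamp
    return {"sealed_at": now.isoformat(), "fingerprint": fingerprint_package(files)}


def is_unaltered(files, record):
    """Check a package against the fingerprint stored when it was sealed."""
    return fingerprint_package(files) == record["fingerprint"]
```

Storing the sealed record separately from the package means that a later change to any file, or the addition or removal of a file, shows up as a fingerprint mismatch.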

21 Raspberry Pi computer
- Designed by David Braben of the Raspberry Pi Foundation in Cambridge
- First released on 29 February 2012
- Size of a credit card, and cost ~£25 for a configured system
- Intended to stimulate the teaching of basic computer science in schools

22 Raspberry Pi computer schematic
- Ethernet port, two USB ports, HDMI monitor socket
- 700 MHz ARM processor running Linux
- Programmable in Python, C, BBC Basic
- 256 MB RAM (about eight thousand times the capacity of the BBC Micro Model B)
- Storage on SD card (16 GB card costs about £10)
- Samba file sharing permits connection to external drives

23 Pi Store (aka DataStage) for classroom data integration
- One Pi Store for each class
- A cloud-based data integration solution
- Each pupil has a private directory to store stuff
- Accessible from school or from home
- The teacher has access to all pupils' folders, for example to permit marking homework

24 DataStage folders
- Typically a researcher will use his private folder for daily work
- The research group leader can read files in that folder
- Files placed in the Shared folder can also be read by other group members, and those placed in the Collaborative folder can be written and read by all
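The folder rules above amount to a small access-control table, which the following Python sketch spells out. It models the rules as stated on the slide, not DataStage's actual permission mechanism. The names are hypothetical, and reading "all" as "all members of the research group" is an assumption.

```python
from enum import Enum


class Area(Enum):
    PRIVATE = "private"
    SHARED = "shared"
    COLLABORATIVE = "collaborative"


def can_read(user, owner, area, leader, members):
    """Return True if `user` may read files in `owner`'s folder of type `area`."""
    if user == owner or user == leader:
        return True  # the owner and the group leader can read every area
    if user not in members:
        return False  # assumption: people outside the group see nothing
    return area in (Area.SHARED, Area.COLLABORATIVE)


def can_write(user, owner, area, members):
    """Return True if `user` may write files in `owner`'s folder of type `area`."""
    if user == owner:
        return True
    return area is Area.COLLABORATIVE and user in members
```

For example, with `members = {"ann", "bob", "lee"}` and `lee` as group leader, `bob` can read `ann`'s Shared folder but can write only to her Collaborative folder, while `lee` can read her Private folder without being able to change it.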

25 DataStage metadata are limited
- Intentionally, DataStage metadata are limited to author, title, identifier, date and description
- This is to encourage researchers to submit datasets to their repository, bearing in mind Graham's concept of curation by addition
- Additional rich metadata can be included in a separate metadata file as part of the entire data package, in XML or RDF format
- DataBank can recognize such a file and index the metadata, extracting elements for inclusion in the RDF manifest
Separately from the DataFlow Project, we have been developing a minimal metadata information model for describing a research investigation and the various research outputs (papers, datasets, protocols, workflows, etc.) that may result from the investigation
- Tanya Gray has encoded this as an XML model, and can dynamically create from that model a Web form in which to enter such metadata
- Such rich metadata can form part of a DataStage data package

26 MIIDI data model - Minimal information for an Infectious Disease Investigation

27 The MIIDI input form for Research Investigation information

28 The MIIDI input form for Journal Article information

29 MIIRO data model - Minimal information for Investigations and Research Outputs


More information

Exactly Quickstart Guide Version

Exactly Quickstart Guide Version 253 36th Street Suite C309 #22 Brooklyn, NY 11232 http://weareavp.com 917.475.9630 info@weareavp.com Exactly Quickstart Guide Version 0.1.6 2018-04-25 Contact information AVP http://www.weareavp.com/ GitHub

More information

Research Data Management: lessons learned - and still to learn

Research Data Management: lessons learned - and still to learn Research Data Management: lessons learned - and still to learn SWITCH Research Data Management (RDM) Workshop, 15. Dezember 2014 Dr., ETH-Bibliothek, ETH Zürich 15.12.2014 1 Overview Digital Curation Office

More information

DaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder

DaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder DaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder Thursday 25 th October, 2012 James A J Wilson James.wilson@oucs.ox.ac.uk Damaro DataBank Oxford s in-development data archive Intended

More information

DATA SHARING FOR BETTER SCIENCE

DATA SHARING FOR BETTER SCIENCE DATA SHARING FOR BETTER SCIENCE THE DATAVERSE PROJECT Mercè Crosas, Institute for Quantitative Social Science, Harvard University @mercecrosas MAX PLANCK INSTITUTE FOR RADIOASTRONOMY, SEPTEMBER 12, 2017

More information

TREENO ELECTRONIC DOCUMENT MANAGEMENT. Administration Guide

TREENO ELECTRONIC DOCUMENT MANAGEMENT. Administration Guide TREENO ELECTRONIC DOCUMENT MANAGEMENT Administration Guide February 2012 Contents Introduction... 8 About This Guide... 9 About Treeno... 9 Managing Security... 10 Treeno Security Overview... 10 Administrator

More information

Helping Journals to Upgrade Data Publications for Reusable Research

Helping Journals to Upgrade Data Publications for Reusable Research Helping Journals to Upgrade Data Publications for Reusable Research Sonia Barbosa (Project Manager) Eleni Castro (Project Coordinator) Ins9tute for Quan9ta9ve Social Science (IQSS) Harvard University @thedataorg

More information

The International Journal of Digital Curation Volume 8, Issue

The International Journal of Digital Curation Volume 8, Issue doi:10.2218/ijdc.v8i2.273 Here, KAPTUR This! 68 Here, KAPTUR This! Identifying and Selecting the Infrastructure Required to Support the Curation and Preservation of Visual Arts Research Data Leigh Garrett,

More information

Deliverable D7.3 Data Management Plan (M30)

Deliverable D7.3 Data Management Plan (M30) DREAM: Deferred Restructuring of Experience in Autonomous Machines H2020-FETPROACT-2014 Deliverable D7.3 Data Management Plan (M30) Due date of deliverable: 30th, June, 2017 Actual submission date: 2nd,

More information

Data management Backgrounds and steps to implementation; A pragmatic approach.

Data management Backgrounds and steps to implementation; A pragmatic approach. Data management Backgrounds and steps to implementation; A pragmatic approach. Research and data management through the years Find the differences 2 Research and data management through the years Find

More information

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA MATTHEW WOOLLARD.. ECONOMIC AND SOCIAL DATA SERVICE UNIVERSITY OF ESSEX... METADATA AND PERSISTENT IDENTIFIERS FOR SOCIAL AND ECONOMIC DATA,

More information

Familiarity with data types, data structures, as well as standard program design, development, and debugging techniques.

Familiarity with data types, data structures, as well as standard program design, development, and debugging techniques. EE 472 Lab 1 (Individual) Introduction to C and the Lab Environment University of Washington - Department of Electrical Engineering Introduction: This lab has two main purposes. The first is to introduce

More information

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Shigeo Sugimoto Research Center for Knowledge Communities Graduate School of Library, Information

More information

VMware Infrastructure 3 Primer Update 2 and later for ESX Server 3.5, ESX Server 3i version 3.5, VirtualCenter 2.5

VMware Infrastructure 3 Primer Update 2 and later for ESX Server 3.5, ESX Server 3i version 3.5, VirtualCenter 2.5 Update 2 and later for ESX Server 3.5, ESX Server 3i version 3.5, VirtualCenter 2.5 VMware Infrastructure 3 Primer Revision: 20090313 Item: EN-000021-02 You can find the most up-to-date technical documentation

More information

Course Syllabus: Linux Essentials

Course Syllabus: Linux Essentials Course Syllabus: Linux Essentials Instructor: Jay Hanks Email: jayhhanks@gmail.com Phone: Office: (740) 364-2299 Courseware Course #: Hours: Meeting Days & Times: Location TestOut Linux Pro 4.1 LPI Linux

More information

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites Access IT Training 2003 Google indexed 3,3 billion of pages http://searchenginewatch.com/3071371 2005 Google s index contains 8,1 billion of websites http://blog.searchenginewatch.com/050517-075657 Estimated

More information

Born digital Hull: early steps and lessons learnt (so far) Simon Wilson, Digital Archivist (AIMS Project)

Born digital Hull: early steps and lessons learnt (so far) Simon Wilson, Digital Archivist (AIMS Project) Born digital archives @ Hull: early steps and lessons learnt (so far) Simon Wilson, Digital Archivist (AIMS Project) outline What is the AIMS Project? Steps taken @ Hull Steps still to take Questions A

More information

Course Syllabus: Linux Essentials

Course Syllabus: Linux Essentials Course Syllabus: Linux Essentials Instructor: Roger Elliott Email: rlelliott@c-tec.edu Phone: Office: (740) 364-2299 Cell: (740) 814-7504 Course text Course #: Hours: Meeting Days & Times: Location Linux

More information

Facilitate Open Science Training for European Research

Facilitate Open Science Training for European Research Facilitate Open Science Training for European Research Open access and research data management: Horizon 2020 and beyond University College Cork, April 14 th & 15 th 2015 Using existing institutional repository

More information

Fitness Manager V4 Install Guide

Fitness Manager V4 Install Guide Fitness Manager V4 Install Guide Table of Contents 1 Welcome to V4...3 License Agreement...4 Copyright...4 2. Minimum System Requirements...5 3. Navigating the Install...6 4. Installing V4 on the Server...7

More information

University of British Columbia Library. Persistent Digital Collections Implementation Plan. Final project report Summary version

University of British Columbia Library. Persistent Digital Collections Implementation Plan. Final project report Summary version University of British Columbia Library Persistent Digital Collections Implementation Plan Final project report Summary version May 16, 2012 Prepared by 1. Introduction In 2011 Artefactual Systems Inc.

More information

Project Transfer: Five Years Later ER&L 2012

Project Transfer: Five Years Later ER&L 2012 Project Transfer: Five Years Later ER&L 2012 Nancy Beals (Wayne State University Libraries) Jennifer Bazeley (Miami University Library) Overview Context Journal Transfers and Consequences What and How

More information

Integration of Agilent OpenLAB CDS EZChrom Edition with OpenLAB ECM Compliance with 21 CFR Part 11

Integration of Agilent OpenLAB CDS EZChrom Edition with OpenLAB ECM Compliance with 21 CFR Part 11 OpenLAB CDS Integration of Agilent OpenLAB CDS EZChrom Edition with OpenLAB ECM Compliance with 21 CFR Part 11 Technical Note Introduction Part 11 in Title 21 of the Code of Federal Regulations includes

More information

The Data Management Plan: Putting policy into practice Suzanne Clarke Director, Information Resources

The Data Management Plan: Putting policy into practice Suzanne Clarke Director, Information Resources The Data Management Plan: Putting policy into practice Suzanne Clarke Director, Information Resources August 2008 Monash environment High level interest DVC (Research, Prof Edwina Cornish) E-Research Centre

More information

Infrastructure for the UK

Infrastructure for the UK 07/07/2014 Building a Cohesive Repository Infrastructure for the UK Balviar Notay Senior Manager for Repository Shared Services Bringing together key repository services to deliver a connected national

More information

Subtlenoise: sonification of distributed computing operations

Subtlenoise: sonification of distributed computing operations Journal of Physics: Conference Series PAPER OPEN ACCESS Subtlenoise: sonification of distributed computing operations To cite this article: P A Love 2015 J. Phys.: Conf. Ser. 664 062034 View the article

More information

Persistent identifiers, long-term access and the DiVA preservation strategy

Persistent identifiers, long-term access and the DiVA preservation strategy Persistent identifiers, long-term access and the DiVA preservation strategy Eva Müller Electronic Publishing Centre Uppsala University Library, http://publications.uu.se/epcentre/ 1 Outline DiVA project

More information

Reproducibility and FAIR Data in the Earth and Space Sciences

Reproducibility and FAIR Data in the Earth and Space Sciences Reproducibility and FAIR Data in the Earth and Space Sciences December 2017 Brooks Hanson Sr. VP, Publications, American Geophysical Union bhanson@agu.org Earth and Space Science is Essential for Society

More information

File Services. File Services at a Glance

File Services. File Services at a Glance File Services High-performance workgroup and Internet file sharing for Mac, Windows, and Linux clients. Features Native file services for Mac, Windows, and Linux clients Comprehensive file services using

More information

Doc Ref: eup HRIS User Manual [02 Dec 2015] v 1.3

Doc Ref: eup HRIS User Manual [02 Dec 2015] v 1.3 1 Copyright 2015 by the University of the Philippines System All rights reserved. No part of this publication may be reproduced, distributed, or transmitted in any form or by any means, including photocopying,

More information

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 0 Welcome to the Pure International Conference Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 1 Mendeley Data Use Synergies with Pure to Showcase Additional Research Outputs Nikhil Joshi Solutions

More information

DataSTORRE Deposit Guide

DataSTORRE Deposit Guide DataSTORRE Deposit Guide Introduction DataStorre is an online digital repository of multi-disciplinary research datasets produced at the University of Stirling. University of Stirling researchers who have

More information

DigitalHub Getting started: Submitting items

DigitalHub Getting started: Submitting items DigitalHub Getting started: Submitting items This guide will take you through the process of uploading and labeling a file in DigitalHub. Logging Into DigitalHub 1. Identify an item you would like to deposit

More information

OPENAIRE FP7 POST-GRANT OPEN ACCESS PILOT

OPENAIRE FP7 POST-GRANT OPEN ACCESS PILOT OPENAIRE FP7 POST-GRANT OPEN ACCESS PILOT Alternative Funding Bid No 10. Hungarian Educational Research Journal (HERJ) Presenter: Laura Morvai University of Debrecen University and National Library Managing

More information

Adding Research Datasets to the UWA Research Repository

Adding Research Datasets to the UWA Research Repository University Library Adding Research Datasets to the UWA Research Repository Guide to Researchers What does UWA mean by Research Datasets? Research Data is defined as facts, observations or experiences on

More information

OPPORTUNITY FOR PLACEMENT AHRC Knowledge Exchange Fellow FuseBox/Wired Sussex

OPPORTUNITY FOR PLACEMENT AHRC Knowledge Exchange Fellow FuseBox/Wired Sussex OPPORTUNITY FOR PLACEMENT AHRC Knowledge Exchange Fellow FuseBox/Wired Sussex The AHRC in partnership with Wired Sussex is looking to recruit a Knowledge Exchange Fellow to be based at the FuseBox, an

More information

Jisc Research Data Shared Service

Jisc Research Data Shared Service Arpri 2017 Jisc Research Data Shared Service John Kaye Senior Co-Design Manager, Research Data ORCiD 0000-0002-4400-4252 #JiscRDM Who we are Jisc Research Data Services Context RDSS Context and Vision

More information

Universidad de Extremadura 22 October Elsevier s approach to Open Access and latest developments. Edward Wedel-Larsen. Research Solutions

Universidad de Extremadura 22 October Elsevier s approach to Open Access and latest developments. Edward Wedel-Larsen. Research Solutions TITLE OF PRESENTATION Universidad de Extremadura 22 October 2014 Elsevier s approach to Open Access and latest developments Edward Wedel-Larsen Research Solutions October 2014 Open access developments

More information

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology Making research data repositories visible and discoverable Robert Ulrich Karlsruhe Institute of Technology Outline Background Mission Schema, Icons, Quality and Workflow Interface Growth Cooperations Experiences

More information

haplo-services.com

haplo-services.com haplo-services.com 020 7100 1155 Haplo specialises in powerful, customisable software that supports academic institutions, helping them to control and manage large volumes of research information easily

More information

Science Panel Discussion presentation: "A Data Sharing Story"

Science Panel Discussion presentation: A Data Sharing Story University of Massachusetts Medical School escholarship@umms University of Massachusetts and New England Area Librarian e-science Symposium 2012 e-science Symposium Apr 4th, 10:45 AM - 11:15 AM Science

More information

Advancing code and data publication and peer review. Erika Pastrana, PhD Executive Editor, Nature Journals ALPSP_Sept 2018

Advancing code and data publication and peer review. Erika Pastrana, PhD Executive Editor, Nature Journals ALPSP_Sept 2018 Advancing code and data publication and peer review Erika Pastrana, PhD Executive Editor, Nature Journals ALPSP_Sept 2018 2 Code related initiatives at Nature Research 1.1 Nature journals have been industry

More information

Scoping and Developing Institutional Data Services: the Data Libraries of 2020

Scoping and Developing Institutional Data Services: the Data Libraries of 2020 Scoping and Developing Institutional Data Services: the Data Libraries of 2020 IASSIST Conference Friday 29 May 2009 Luis Martinez Uribe Luis.Martinez-Uribe@oerc.ox.ac.uk Digital Repositories Research

More information