Data Curation Profile Food Technology and Processing / Food Preservation


Profile Author: Sonia Wade Lorenz & Lisa Zilinski
Author's Institution:
Contact: Lisa Zilinski, lzilinski@usf.edu
Researcher(s) Interviewed: [name withheld], Professor of Food Science
Researcher's Institution:
Date of Creation: January 19, 2012
Date of Last Update: August 23, 2012
Version of the Tool: Data Curation Profile Toolkit 1.0
Version of the Content: Profile Content 1.2
Discipline / Sub-Discipline: Food Technology and Processing / Food Preservation

Purpose: Data Curation Profiles are designed to capture requirements for specific data generated by a single scientist or scholar as articulated by the scientist himself or herself. They are also intended to enable librarians and others to make informed decisions in working with data of this form, from this research area or sub-discipline.

Context: Data Curation Profiles employ a standardized set of fields to enable comparison; however, they are designed to be flexible enough for use in any domain or discipline. A profile is based on the reported needs and preferences for these data. Profiles are derived from several kinds of information, including interview and document data, disciplinary materials, and standards documentation.

Sources of Information used for the profile: An initial interview conducted on January 9, 2012; a worksheet completed by the scientist as part of the interviews; a recording of the interview on January 9, 2012; and a visit to the scientist's lab on January 11, 2012.

Scope Notes: The scope of individual profiles will vary, based on the author's and participating researcher's background, experiences, and knowledge, as well as the materials available for analysis.

Editorial Notes: Any modifications of this document will be subject to version control, and annotations require a minimum of creator name, date, and identification of related source documents.

Author's Notes: The Food Science and Nutrition data curation profile is based on the analysis of an interview completed with the scientist working in this research area or sub-discipline. Some sub-sections of the profile may be left blank; this occurs when there was no relevant data in the interview or available documents used to construct this profile.

Permissions: The researcher has reviewed the completed data profile and confirmed the accuracy of the information. The researcher gives consent to allow the profile to be published on Purdue's website for completed data profiles.

URL: http://datacurationprofiles.org

Licensing: This work is licensed under a Creative Commons Attribution 3.0 Unported License (CC BY 3.0).
http://dx.doi.org/10.5703/1288284315003

Brief summary of data curation needs

The scientist is conducting a series of experiments on food preservation for the Department of Defense. While the data is valuable, it cannot be publicly shared without permission; privacy and security are high priorities. The data is currently stored in multiple locations, including the lab computers, the scientist's home, and an external hard drive. The scientist also maintains a strict schedule for backing up the material. The scientist is interested in securely storing the data in a digital asset management system that could be accessed remotely; however, security and privacy are her main priorities.

Overview of the research

Research area focus: The scientist describes this project as having one global goal with three objectives. Globally, the team is creating a tool to allow the Department of Defense to ship MREs (Meals Ready to Eat) with the expiration information included. MREs undergo extreme changes in climate; therefore, an expiration tool is required.

Objective 1: Create a handheld RFID (Radio Frequency Identification Device) that will measure the time and temperature profile of the MRE.
Objective 2: Evaluate quality/shelf life of MREs in adverse conditions by (A) evaluating color, composition and texture of food and (B) evaluating taste of food.
Objective 3: Evaluate quality/shelf life of fresh fruits and vegetables delivered to troops.

The scientist participating in the data profile is focusing on Objective 2 part A exclusively. For Objective 2 part A, the scientist will subject seven different types of MREs to five different storage temperatures: 44, 84, 100, 120, and 140 °F. Samples of the MREs are tested incrementally throughout the experiment. The tests measure color, texture and composition. The composition testing measures the following: pH levels, acidity, soluble solids content, sugars, ascorbic acid and fat oxidation.
With the information from all three objectives, the scientists will be able to create a tool to determine the quality and remaining shelf life of food delivered to soldiers.

Intended audiences: Other researchers working with MREs may be interested in the data. The findings could be applied to commercial food products. Civilian food manufacturers, packing companies, distributors and retailers may be interested in the findings. However, per the scientist's agreement with the Department of Defense, the data may not be made publicly available.

Funding sources: The research is primarily funded by the Department of Defense. The University of South Florida in Lakeland also contributes to the project by providing the facility within which the research is conducted.
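The Objective 2 part A design described under Research area focus (seven MRE types crossed with five storage temperatures) can be enumerated as a small grid. The following is a minimal sketch only: the profile does not name the seven MRE types, so the labels `MRE-1` to `MRE-7` are placeholders, and the Celsius conversion is added purely for illustration.

```python
from itertools import product

# Storage temperatures listed in the profile, in degrees Fahrenheit.
STORAGE_TEMPS_F = [44, 84, 100, 120, 140]

# Placeholder labels: the profile does not name the seven MRE types.
MRE_TYPES = [f"MRE-{i}" for i in range(1, 8)]


def fahrenheit_to_celsius(temp_f):
    """Convert a temperature from degrees Fahrenheit to degrees Celsius."""
    return (temp_f - 32) * 5 / 9


def build_design(mre_types, temps_f):
    """Return one record per (MRE type, storage temperature) combination."""
    return [
        {
            "mre_type": mre,
            "temp_f": temp,
            "temp_c": round(fahrenheit_to_celsius(temp), 1),
        }
        for mre, temp in product(mre_types, temps_f)
    ]


design = build_design(MRE_TYPES, STORAGE_TEMPS_F)
print(len(design))  # 7 types x 5 temperatures = 35 combinations
```

Each of the 35 combinations is then sampled repeatedly over time, which is what produces the large number of exported files reported in the profile.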

Data kinds and stages

Data narrative: In order to create a tool that will allow MREs to be shipped with the expiration information included, expiration data must first be generated and collected (Objective 2A). The scientist will subject seven different types of MREs to five different storage temperatures: 44, 84, 100, 120, and 140 °F. Samples of the MREs will be tested incrementally throughout the experiment until the MRE is no longer fit for consumption. The tests will measure color, texture and composition. The composition testing will measure the following: pH levels, acidity, soluble solids content, sugars, ascorbic acid and fat oxidation.

To obtain this data, the scientist utilizes several machines: a Texture Analyzer, the Minolta Chroma Meter, High Performance Liquid Chromatography (HPLC), a titrator, a pH meter and a refractometer. The results are automatically exported from these machines into Excel files. Once testing ends, the scientist will review and analyze the data. In August of 2012, the scientist plans to import the Excel files into SAS for the statistical analysis. In December of 2012, the scientist plans to use Sigma Plot to generate the necessary graphs and tables for her research. In February of 2013, the scientist expects that the project will be complete and that she will be able to write the finalized reports and articles to summarize her findings.

The scientist has a rigorous method for saving and backing up her files to be certain nothing is lost. Each night the files are saved on the computer's hard drive and an external hard drive. At the end of each month the files are backed up on a separate hard drive and on a different computer housed at a separate location.

The data table

Primary Data

Data stage: Raw data
  Output: Photos and data generated from the Texture Analyzer and Minolta Chroma Meter
  # of files / typical size: Unspecified number of files; sizes in MB
  Format: JPG; Exponent, Utility 4 software; pictures
  Other / Notes: Data is available on analysis machines and photos are taken to record the food's appearance.

Data stage: Raw data (export)
  Output: Excel files
  # of files / typical size: 700 files / 40 KB
  Format: Excel
  Other / Notes: Data is automatically exported from the machines to Excel.

Data stage: Statistical analysis
  Output: Data is organized by feature (temperature, replicates, etc.)
  # of files / typical size: Unknown file size
  Format: SAS
  Other / Notes: The scientist will compile Excel data and execute statistical analysis using SAS.

Data stage: Graphs / tables generated
  Output: Finalized graphs and tables
  # of files / typical size: Unknown: 500+ graphs; unknown file size
  Format: Sigma Plot
  Other / Notes: Analyzed data is imported from Excel into Sigma Plot to create graphs.

Target data for sharing
  Output: Graphs and tables with an article
  Format: Graphs, pictures, tables & MS Word
  Other / Notes: The final report will include previously generated graphs and tables as well as a written document/article.

The scientist is able to publish articles about her research, but the data cannot be publicly shared without permission due to the privacy restrictions required by her funding agencies.

Value of the data: The scientist advises that the data could hold value for other researchers doing work on MREs and that her findings could be applicable to commercial foods. The concepts learned in this research could be applied to civilian food retailers, distributors, and food preservation.

Contextual narrative: The scientist emphasized that even though others could benefit from the data, the regulations required by the Department of Defense prevent the data from being made public without permission.

Intellectual property context and information

Data owner(s): The Department of Defense is the sole owner of the data according to the scientist.

Stakeholders: The scientist's institution provides the facility within which the scientist conducts her research. The scientist also advised that she has research assistants working with her on this project. However, the Department of Defense has sole ownership of this project, as it is treated in the same manner as a work for hire.

Terms of use: The scientist states that her immediate collaborators may have access to the data but the data cannot be made publicly available without permission.

Attribution: Because the data cannot be publicly shared without permission, attribution is not relevant.

Organization and description of data (incl. metadata)

Overview of data organization and description (metadata): First, the data is populated by each machine and is automatically organized in Excel sheets. These Excel sheets describe the features of the food products, including date, temperature, color, pH levels, acidity, soluble solids content, sugars, ascorbic acid and fat oxidation. Toward the end of the project, the scientist will examine the Excel data using SAS and import the analyzed data into Sigma Plot to create graphs. This data set is well organized and well protected, but the scientist did express interest in being able to access and search the data from offsite locations.
Assuming all required security measures could be met, the scientist advised that it would be very helpful if both she and her team could access the data sets remotely. Metadata is not currently used with this dataset. However, if the data were stored in a database that could be accessed remotely, metadata could be employed to make the dataset easier to search.
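To illustrate how metadata could make the dataset easier to search, the sketch below attaches a small descriptive record to each exported Excel file and filters on it. This is an assumption-laden illustration, not the scientist's system: the `FileRecord` fields, the file names, the MRE group labels and the dates are all hypothetical, chosen only to mirror the features the profile says the Excel sheets describe.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class FileRecord:
    """Descriptive metadata for one exported Excel file (hypothetical fields)."""
    filename: str
    mre_group: str
    storage_temp_f: int
    test_date: date
    measurement: str  # for example "color", "texture" or "pH"


def search(records, **criteria):
    """Return the records whose fields equal every given criterion."""
    return [
        record for record in records
        if all(getattr(record, name) == value for name, value in criteria.items())
    ]


# Made-up example records; names and dates are illustrative only.
records = [
    FileRecord("a.xlsx", "MRE-1", 100, date(2012, 3, 5), "color"),
    FileRecord("b.xlsx", "MRE-1", 140, date(2012, 3, 5), "texture"),
    FileRecord("c.xlsx", "MRE-2", 100, date(2012, 4, 2), "color"),
]

hits = search(records, storage_temp_f=100, measurement="color")
print([record.filename for record in hits])
```

In a repository, the same fields would typically be captured at upload time so that team members can filter by temperature, sample group or date without opening each file.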

Formal standards used: No formal metadata standards, ontologies, or controlled vocabularies have been applied to this data. The scientist organizes her data by date.

Locally developed standards: The scientist organizes the Excel sheets in chronological order and by MRE sample groups. With this structure, she can see how each MRE sample is impacted over time.

Crosswalks:

Documentation of data organization/description: The scientist did not mention any existing papers or documents that detailed standard operating procedures. The Excel spreadsheets detail all of her data automatically.

Ingest / Transfer

The scientist indicated that because the bulk of her data is already in common formats (Excel), she did not anticipate needing any unique functionality from the repository. The scientist expects that the repository would automatically be able to read basic Excel and Word formats. However, the ability to personally submit the data in single and batch uploads to the repository is a high priority for the scientist. Because of privacy concerns, the scientist requires complete control over the data, including when and how the data is uploaded to the repository.

Sharing & Access

Willingness / Motivations to share: The scientist indicates that this project was commissioned by the Department of Defense and, as a result, the dataset cannot be made public without permission.

Embargo:

Access control: The scientist requires complete control over the data to ensure that she maintains the privacy guidelines set forth by the Department of Defense. She is interested in tiered permissions because they would enable her team to have remote access to the data. The scientist also believes that tiered permissions would enable her team to work more efficiently. However, she must maintain primary control, and access to the data cannot be extended outside her research team without permission.
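The tiered permissions described under Access control can be sketched as an ordered set of roles with a minimum role per action. This is a hedged illustration under stated assumptions: the tier names, the action names and the rule that only the owner may upload or grant access are hypothetical choices meant to reflect the requirement that the scientist keeps primary control; they are not features of any particular repository.

```python
from enum import IntEnum


class Tier(IntEnum):
    """Hypothetical access tiers, ordered from least to most privileged."""
    EXTERNAL = 0     # anyone outside the research team
    TEAM_MEMBER = 1  # research assistants and immediate collaborators
    OWNER = 2        # the scientist, who keeps primary control


# Minimum tier assumed for each action (hypothetical action names).
REQUIRED_TIER = {
    "read": Tier.TEAM_MEMBER,
    "upload": Tier.OWNER,
    "grant_access": Tier.OWNER,
}


def is_allowed(tier, action):
    """Allow an action only if the user's tier meets the required minimum."""
    required = REQUIRED_TIER.get(action)
    if required is None:
        return False  # unknown actions are denied by default
    return tier >= required


print(is_allowed(Tier.TEAM_MEMBER, "read"))    # team members can read remotely
print(is_allowed(Tier.TEAM_MEMBER, "upload"))  # only the owner uploads
```

Denying unknown actions by default keeps the sketch on the conservative side, which matches the priority the profile gives to privacy and security.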

Secondary (Mirror) site: The scientist indicates that a secondary/mirror site at a separate geographic location is a medium priority.

Discovery

Discovery is not important to the scientist because the data cannot be shared without permission.

Tools

To obtain this data, the scientist utilizes several machines: a Texture Analyzer, the Minolta Chroma Meter, High Performance Liquid Chromatography (HPLC), a titrator, a pH meter and a refractometer. The results are automatically exported from these machines into Excel files. While no additional tools would be necessary to read or use the Excel files, a data profile would aid significantly in interpreting the files and giving them context.

Linking / Interoperability

Linking/Interoperability is not important to the scientist.

Measuring Impact

Usage statistics & other identified metrics: Usage statistics and other identified metrics are not important to the scientist because the data cannot be shared without permission.

Gathering information about users:

Data Management

Security / Back-ups: Currently, the data is kept on the lab's computer hard drive and an external hard drive (these are updated manually with each new dataset collection). At the end of each week, the lab computer and external hard drive are backed up by the main lab computer. At the end of each month, the main lab computer is backed up to a final external hard drive kept at a secondary location. While the scientist is confident in this method, she is interested in being able to use a data repository as an offsite means for backing up her data.
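The three-level backup routine described under Security / Back-ups can be expressed as a small scheduling helper that lists which steps fall due on a given day. This is a minimal sketch with labeled assumptions: the profile says only "end of each week" and "end of each month", so treating Friday as the end of the week and the last calendar day as the end of the month are assumptions, as are the `new_data` flag and the wording of each step.

```python
import calendar
from datetime import date


def backups_due(day, new_data=True, week_end=calendar.FRIDAY):
    """List the backup steps that fall due on a given calendar day.

    Assumes the week ends on `week_end` (Friday by default) and the month
    ends on its last calendar day; the profile does not specify either.
    """
    due = []
    if new_data:
        # Updated manually with each new dataset collection.
        due.append("copy new data to lab computer and external drive")
    if day.weekday() == week_end:
        due.append("back up lab computer and external drive to main lab computer")
    last_day = calendar.monthrange(day.year, day.month)[1]
    if day.day == last_day:
        due.append("back up main lab computer to external drive at secondary location")
    return due


for step in backups_due(date(2012, 8, 31)):
    print(step)
```

A repository used as an offsite backup, as the scientist suggests, would slot in as an additional step alongside the monthly copy.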

Secondary storage sites: A secondary storage site and a secondary storage site at a different geographic location are medium priorities for the scientist.

Version control: Version control is a high priority for the scientist.

Preservation

Duration of preservation: The scientist indicated that her data should be stored for 5 years.

Data provenance: Documentation of any and all changes made to the data over time is a medium priority for the scientist.

Data audits: The ability to audit the dataset is a medium priority for the scientist.

Format migration: The ability to migrate the dataset into new formats over time is not a priority for the scientist.

Personnel

Primary data contact: [name withheld]

Data stewards: Lisa Zilinski, Business Librarian; Sonia Wade Lorenz, Library Assistant

Other contacts: