Chain of Data Creation. Data Creation. Lab Notebook. Plan Your Experiment, Experiment With your Plan


Chain of Data Creation

Data Creation
1. Preparation
2. Creation of Metadata
3. Acquisition
4. Building a Permanent Record
5. Data Management
6. Storage
7. Data Sharing

Lab Notebook
Plan Your Experiment, Experiment With Your Plan
Record of hypotheses
Record of protocols
Second brain
medicine.umich.edu

Questions to Ask About Your Data Collection Activity
Meta-Data
What am I measuring?
When am I measuring it?
How am I measuring it?
What are the tools I am using?
What about the lab/field environment do I need to know?
Is my protocol reproducible?

What is Metadata?
Metadata is data reporting:
WHO created the data?
WHAT is the content of the data?
WHEN were the data created?
WHERE is it geographically?
HOW were the data developed?
WHY were the data developed?

Collection of data objects
Description of whole collection
Descriptors of each data object
Photo by Michelle Chang. All Rights Reserved
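The WHO/WHAT/WHEN/WHERE/HOW/WHY questions above can be kept as a small structured record next to the data. The following is a minimal sketch, not a standard: the class name `DatasetMetadata`, the helper `missing_fields`, and all the sample values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, asdict


@dataclass
class DatasetMetadata:
    """One field per question from the slide (names are illustrative)."""
    who: str    # WHO created the data?
    what: str   # WHAT is the content of the data?
    when: str   # WHEN were the data created?
    where: str  # WHERE is it geographically?
    how: str    # HOW were the data developed?
    why: str    # WHY were the data developed?


def missing_fields(record):
    """Return the names of questions that were left blank."""
    return [name for name, value in asdict(record).items() if not value.strip()]


# Hypothetical example record with one unanswered question.
record = DatasetMetadata(
    who="A. Researcher (hypothetical)",
    what="Stream water temperature",
    when="2016-03-01",
    where="",
    how="Handheld probe",
    why="Baseline monitoring",
)

print(missing_fields(record))  # ['where']
```

A check like this can run before data are archived, so unanswered questions are caught while the person who knows the answer is still available.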

Metadata in Real Life
Metadata is all around
Author(s): Boullosa, Carmen.
Title(s): They're cows, we're pigs / by Carmen Boullosa
Place: New York : Grove Press, 1997.
Physical Descr: viii, 180 p ; 22 cm.
Subject(s): Pirates Caribbean Area Fiction.
Format: Fiction
CC image by Mskadu on Flickr
CC image by USDAgov on Flickr

Information Entropy
(Figure from Michener et al 1997: DATA DETAILS plotted against TIME, starting at the time of data development)
Specific details about problems with individual items or specific dates are lost relatively rapidly
General details about datasets are lost through time
Accident or technology change may make data unusable
Retirement or career change makes access to "mental storage" difficult or unlikely
Loss of data developer leads to loss of remaining information

Information Entropy: Data Management via Metadata
(Figure: DATA DETAILS plotted against TIME)
Sound information management, including metadata development, can arrest the loss of dataset detail.
Maintenance & Update
Data Accountability
Discovery & Re-use
Data Liability

Cellphone Metadata
http://www.zeit.de/datenschutz/malte-spitz-data-retention

EXIF Metadata

What Meta-Data Do You Need?
Descriptive metadata describes a resource for purposes such as discovery and identification
Administrative metadata provides information to help manage a resource, such as when and how it was created
Rights management metadata deals with intellectual property rights
Preservation metadata contains information needed to archive and preserve a resource
Understanding Metadata: niso.org

Structured Metadata
Dublin Core Example
Title = Metadata Demystified
Creator = Brand, Amy
Creator = Daly, Frank
Creator = Meyers, Barbara
Subject = metadata
Description = Presents an overview of metadata conventions in publishing.
Publisher = NISO Press
Publisher = The Sheridan Press
Date = 2003-07
Type = Text
Format = application/pdf
Identifier = http://www.niso.org/standards/resources/metadata_demystified.pdf
Language = en

Structured Metadata
TAGS
<mcp:parametername>
  <mcp:dp_term>
    <mcp:term>
      <gco:characterstring>!le_lower_right_y_ll</gco:characterstring>
    </mcp:term>
    <mcp:type>
      <mcp:dp_typecode codelist="http://schemas.aodn.org.au/mcp-2.0/schema/resources/codelist/gmxcodelists.xml#dp_typecode" codelistvalue="shortname">shortname</mcp:dp_typecode>
    </mcp:type>
    <mcp:usedindataset>
      <gco:boolean>true</gco:boolean>
    </mcp:usedindataset>
  </mcp:dp_term>
</mcp:parametername>
Understanding Metadata: niso.org
KNB

A Simple Typology of Data
Numerical
  Continuous
  Integers
Ordinal
Controlled Vocabulary
  Certain defined words with defined meanings
  Has a reference dictionary
Dates/Times
  Many formats
  POSIX
Raw Text

What is a Metadata Standard?
A Standard provides a structure to describe data with:
Common terms to allow consistency between records
Common definitions for easier interpretation
Common language for ease of communication
Common structure to quickly locate information
In search and retrieval, standards provide:
Documentation structure in a reliable and predictable format for computer interpretation
A uniform summary description of the dataset
CC image by ccarlstead on Flickr

Other Media
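The Dublin Core example above is a list of element/value pairs in which an element such as Creator may repeat. As a hedged illustration of what "structured" buys you, the sketch below serializes those same pairs to XML with Python's standard library and reads them back. The wrapper element name `metadata` and the helper names `to_dublin_core_xml` and `values_for` are assumptions made for this example; only the namespace URI is the published Dublin Core elements namespace.

```python
import xml.etree.ElementTree as ET

# Published namespace for the 15 simple Dublin Core elements.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)

# The record from the slide, as (element, value) pairs; elements may repeat.
RECORD = [
    ("title", "Metadata Demystified"),
    ("creator", "Brand, Amy"),
    ("creator", "Daly, Frank"),
    ("creator", "Meyers, Barbara"),
    ("subject", "metadata"),
    ("publisher", "NISO Press"),
    ("publisher", "The Sheridan Press"),
    ("date", "2003-07"),
    ("type", "Text"),
    ("format", "application/pdf"),
    ("language", "en"),
]


def to_dublin_core_xml(fields):
    """Serialize (element, value) pairs under a wrapper element (name assumed)."""
    root = ET.Element("metadata")
    for name, value in fields:
        element = ET.SubElement(root, f"{{{DC_NS}}}{name}")
        element.text = value
    return ET.tostring(root, encoding="unicode")


def values_for(xml_text, name):
    """Return every value recorded for one Dublin Core element."""
    root = ET.fromstring(xml_text)
    return [e.text for e in root.findall(f"{{{DC_NS}}}{name}")]


xml_text = to_dublin_core_xml(RECORD)
print(values_for(xml_text, "creator"))
# ['Brand, Amy', 'Daly, Frank', 'Meyers, Barbara']
```

Because every record uses the same element names, a program can pull all creators or all dates out of many records without guessing at each author's layout, which is the point of the "common structure" item in the standards slide.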

Metadata Standards
http://www.dcc.ac.uk/resources/subject-areas/biology
Britishlibrary.co.uk

Case Study 1

Case Study 2

Case Study 3

Data Collection
beespotter.org

Creating a Good Data Gathering Sheet
How easy is it to read?
Are column and row definitions clear?
Is there metadata?
How similar is it to your digital data entry form?

After the Collection
Preserve original data
Create digital archive of raw data
Implement robust storage strategy
Quality Control

Can you use it at 4am?
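One way to act on "preserve original data" is to record a checksum for every raw file when the archive is created, then re-check later to see whether anything was altered. This is only a sketch under assumptions: the function names `sha256_of`, `build_manifest`, and `changed_files` are hypothetical, SHA-256 is one reasonable choice of hash rather than a requirement from the slides, and the demo uses a throwaway temporary folder.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Hash a file in chunks so large raw files do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(folder):
    """Map each file's path (relative to folder) to its SHA-256 checksum."""
    folder = Path(folder)
    return {
        path.relative_to(folder).as_posix(): sha256_of(path)
        for path in sorted(folder.rglob("*"))
        if path.is_file()
    }


def changed_files(folder, manifest):
    """List files that are missing or whose contents differ from the manifest."""
    current = build_manifest(folder)
    return sorted(name for name in manifest if current.get(name) != manifest[name])


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as raw_dir:
        sheet = Path(raw_dir) / "site_counts.csv"  # hypothetical raw data file
        sheet.write_text("site,date,count\nA,2016-03-01,4\n")
        manifest = build_manifest(raw_dir)

        sheet.write_text("site,date,count\nA,2016-03-01,40\n")  # accidental edit
        print(changed_files(raw_dir, manifest))  # ['site_counts.csv']
```

Storing the manifest alongside the archive (and in a second location) lets anyone confirm later that the raw data are still the data that were collected.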

Scanning

Storage: Physical
DO NOT LET THIS BE YOU

Storage: Physical

Storage: The Cloud

Data Sharing

Distribution: Data Discovery
The descriptive content of the metadata file can be used to identify, assess, and access available data resources.
IDENTIFY: keywords, geographic location, time period, attributes
ASSESS: use constraints, access constraints, data quality, availability/pricing
ACCESS: online access, order process, contacts
blog.veritythink.com

Things to Consider when Data Sharing
1. Is what you did understandable?
2. How do you want your work credited?
3. Will your data sharing service be around in 50 years?

Why Share Data?
One scientist can only do so much
More data = more Power
Science must be reproducible
Who paid for this data collection?
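The IDENTIFY step of data discovery amounts to filtering metadata records by keyword and time period. The sketch below shows that idea on a tiny in-memory catalog. Everything in it is hypothetical: the `CATALOG` entries, the field names, and the `discover` function are invented for illustration and do not describe any real repository's search interface.

```python
from datetime import date

# Hypothetical metadata records; a real catalog would hold many more fields.
CATALOG = [
    {
        "title": "Bee sightings (example)",
        "keywords": ["bees", "pollinators"],
        "start": date(2014, 5, 1),
        "end": date(2014, 9, 30),
        "access": "online",
    },
    {
        "title": "Stream temperature (example)",
        "keywords": ["water", "temperature"],
        "start": date(2015, 1, 1),
        "end": date(2015, 12, 31),
        "access": "contact owner",
    },
]


def discover(catalog, keyword=None, start=None, end=None):
    """Return records matching a keyword and overlapping a time period."""
    results = []
    for record in catalog:
        keywords = [k.lower() for k in record["keywords"]]
        if keyword is not None and keyword.lower() not in keywords:
            continue
        # Skip records that end before the requested window starts.
        if start is not None and record["end"] < start:
            continue
        # Skip records that start after the requested window ends.
        if end is not None and record["start"] > end:
            continue
        results.append(record)
    return results


hits = discover(CATALOG, keyword="Bees", start=date(2014, 6, 1), end=date(2014, 6, 30))
print([r["title"] for r in hits])  # ['Bee sightings (example)']
```

The ASSESS and ACCESS steps then read the remaining fields of each hit, such as the `access` value here, which is why those details need to be in the metadata file in the first place.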

Examples
https://www.dataone.org/
http://blast.ncbi.nlm.nih.gov/blast.cgi
http://datadryad.org/
http://www.oceandataportal.org/

Backlash?
"A second concern held by some is that a new class of research person will emerge: people who had nothing to do with the design and execution of the study but use another group's data for their own ends, possibly stealing from the research productivity planned by the data gatherers, or even use the data to try to disprove what the original investigators had posited. There is concern among some front-line researchers that the system will be taken over by what some researchers have characterized as 'research parasites.'"

Should all Data be Open?

Let's Look at Data & Metadata Examples
https://biol355.github.io/datasets.html