Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE

Size: px
Start display at page:

Download "Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE"

Transcription

1 Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE eurocris September 9, 2013

2 Outline Data Challenges Metadata Solu=on DataONE addressing the Data Challenge Enabling Scien=fic Discovery Example 2 2

3 Data Management Challenges Entropy Discovery Heterogeneity 3

4 Data Entropy Time of publica4on Specific details Informa4on Content Accident General details Re4rement or career change Death Time (Michener et al. 1997) 4

5 Data Discovery 5

6 Data Heterogeneity Syntax (format) Schema (model) Seman=cs (meaning) Jones et al

7 Research and data life cycle integra=on Plan Analyze Collect Integrate Assure Discover Describe Preserve Many itera=ons of data to final data product 7

8 Open Science Movement 8

9 Metadata as a Solu=on Descrip=ve Subject maver of content Structural Content types and avributes Administra=ve Who, when, how content created Defines who can access, how it can be used IDENTIFY keywords geographic location time period attributes ASSESS use constraints access constraints data quality availability/pricing ACCESS online access order process contacts 9

10 Dataset Metadata Has Value to ALL Data developers Data users Metadata helps Organiza4ons 10

11 Proper Dataset Cura=on Enables Data Reuse 11

12 DataONE addressing the Data Challenge Provide universal access to data about life on earth and the environment 1. Building community 2. Developing sustainable data discovery and interoperability solu=ons 3. Enabling science through tools and services Plan Analyze Collect Integrate Assure Discover Describe Preserve 12

13 Discovery via DataONE Three major components for a flexible, scalable, sustainable network Coordina4ng Nodes retain complete metadata catalog indexing for search network- wide services ensure content availability (preserva=on) replica=on services 13

14 Discovery via DataONE Three major components for a flexible, scalable, sustainable network Coordina4ng Nodes retain complete metadata Member Nodes catalog diverse ins=tu=ons indexing serve local for community search network- wide provide resources services for managing ensure content their data availability (preserva=on) retain copies of data replica=on services 14

15 Discovery via DataONE Three major components for a flexible, scalable, sustainable network Coordina4ng Nodes retain complete metadata Member Nodes catalog diverse ins=tu=ons indexing serve local for community search network- wide provide resources services for managing ensure content their data availability (preserva=on) retain copies of data replica=on services 15

16 Discovery via DataONE Three major components for a flexible, scalable, sustainable network Coordina4ng Nodes retain complete metadata Member Nodes catalog diverse ins=tu=ons Inves4gator indexing for Toolkit serve local community search network- wide provide resources services for managing ensure content their data availability (preserva=on) retain copies of data replica=on services 16

17 Enable Data Discovery ORNL DAAC FGDC, ISO, DIF, FGDC USGS CSAS KNB PISCO SANParks ESA ONEShare UC MerriV CLO/AKN FGDC, ISO, FGDC EML, ISO EML EML, FGDC EML EML EML EML Extract and Align Metadata Augment Metadata Internal Metadata Index Search API LTER EML 17

18 Community Listening Understanding Engaging 18

19 Listening stakeholder surveys persona and scenario development scien9sts library s & librarians data managers cyberinfrastructure development usability tes9ng external assessments / surveys 19

20 Scien=sts want to share data Use other researchers datasets if easily accessible Willing to share data across a broad group of researchers 84% 81% Appropriate to create new datasets from shared data 76% Currently share all of their data 6% 676 Metadata standards DIF DwC DC EML FGDC Open GIS ISO My Lab none 20

21 Understanding 2/3 rd report that organiza=onal help and support is lacking 21

22 The Long Tail Science Volume Specialized repositories (e.g. GenBank, PDB) Orphan data Most of the bytes are at the high end, but most of the datasets are at the low end Jim Gray Rank frequency of datatype (B. Heidorn) 22

23 Intercept researchers where they already work 23

24 Check for best prac=ces Create metadata Connect to ONEShare Data & Metadata (EML) 24

25 Enable Data Discovery Ease of good search key for users Metadata quality key enabler Scaling search is hard Access search API Promote quality metadata Enable search refinement Develop & curate metadata Promote quality metadata Index & replicate metadata Fast, robust search API Validate metadata Seman=cs research collabora=on Drive usability improvements 25

26 Preserve Data and Metadata Support common metadata standards Use checksums for invariance Replicate metadata across all CNs Replicate data across par=cipa=ng MNs Assemble data packages Curate data & metadata Establish replica=on policies Promote quality metadata Manage system metadata Replicate and preserve metadata Enforce checksum consistency Broker & maintain data replica=on 26

27 Enabling Scien=fic Discovery Example 27

28 Data Intensive Science in Biodiversity Research Data Collec9on Organiza9on Valida9on Preserva9on, & Access Data Explora9on, Visualiza9on, And Analysis Kelling et al The full data life cycle from data cura=on to scien=fic learning engage scien=sts, land- managers, policy makers, students, educators, and the public

29 Joining Disparate Data

30 High Performance Compu4ng and Climate Change: Environmental Cues of Migra=on Teragrid SuperCompu=ng

31 Climate Change: Environmental Cues of Migra=on Spring Arrival Dates Expected Spring Arrival Dates with leaf out two weeks early

32 Enabling Scien=fic Discovery Diverse bird observa=ons and environmental data from 300,00 loca=ons in the US integrated and analyzed using High Performance Compu=ng Resources Model results Occurrence of Indigo Bun4ng (2008) Land Cover Meteorology MODIS Remote sensing data Spa=o- Temporal Exploratory Model iden=fies factors affec=ng paverns of migra=on Jan Apr Jun Sep Dec Examine paverns of migra=on Infer how climate change may affect bird migra=on 32

33 Thank you Rebecca Koskela, Execu=ve Director 33

DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson

DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson DataONE Cyberinfrastructure Ma# Jones Dave Vieglais Bruce Wilson Foremost a Federa9on Member Nodes (MNs) Heart of the federa9on Harness the power of local cura9on Coordina9ng Nodes (CNs) Services to link

More information

Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012

Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012 : Some Lessons Learned Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012 Acknowledgement and collaborators DataONE http://www.dataone.org/ Cal Dig. Lib. http://www.cdlib.org/

More information

DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences

DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences William K. Michener 1,2, Rebecca Koskela 1,2, Matthew B. Jones 2,3, Robert B. Cook 2,4, Mike Frame 2,5, Bruce Wilson

More information

Commi&ng to Data Quality

Commi&ng to Data Quality Commi&ng to Data Quality Ann Green Digital Lifecycle Research & Consul;ng NADDI Vancouver 2014 outline Data Quality Building the DDI ShiGs Crisis of Quality & Loss of Data Commi&ng to Data Quality Data

More information

DataONE: Open Persistent Access to Earth Observational Data

DataONE: Open Persistent Access to Earth Observational Data Open Persistent Access to al Robert J. Sandusky, UIC University of Illinois at Chicago The Net Partners Update: ONE and the Conservancy December 14, 2009 Outline NSF s Net Program ONE Introduction Motivating

More information

International Multidisciplinary Metadata Workshop 18 January Rebecca Koskela Arctic Region Supercomputing Center

International Multidisciplinary Metadata Workshop 18 January Rebecca Koskela Arctic Region Supercomputing Center Metadata: A Means to Manage Ecological Data International Multidisciplinary Metadata Workshop 18 January 2007 Rebecca Koskela Arctic Region Supercomputing Center Why Should You Create Metadata? Data Entropy

More information

The OpenAIRE Infrastructure

The OpenAIRE Infrastructure The OpenAIRE Infrastructure EC Policy on Open Access and the OpenAIRE Ini:a:ve EGI Scien2fic Publica2ons Repository Workshop Pasquale Pagano CNR - ISTI Courtesy by Donatella Castelli, Yannis Ionnadis,

More information

Site# Date H20 Temperature Conductance Turbidity KRS Sep KRS Aug KRS Aug

Site# Date H20 Temperature Conductance Turbidity KRS Sep KRS Aug KRS Aug ID ASR_Number Sample_Number QC_Code Analysis_Request_No External_Sample_Number Start_Date 1 1383 892 1 08-Aug-2002 2 1383 902 1 08-Aug-2002 3 1383 912 1 08-Aug-2002 Site# Date H20 Temperature Conductance

More information

CIS : Computational Reproducibility

CIS : Computational Reproducibility CIS 602-01: Computational Reproducibility Virtual Machines Dr. David Koop Figure 2. The MODIS grid, with highlighted tiles (red) of spatial extent for California (green), with citation. Computational Data

More information

Key cyberinfrastructure elements implemented as RESTful webservices

Key cyberinfrastructure elements implemented as RESTful webservices Key cyberinfrastructure elements implemented as RESTful webservices Investigator Toolkit Web Interface Analysis, Visualization Data Management Client Libraries Java Python Command Line Member Nodes Service

More information

Making Research Data Public: Why, What, and How. Fall 2016

Making Research Data Public: Why, What, and How. Fall 2016 Making Research Data Public: Why, What, and How Fall 2016 Research Data Service (RDS) The Research Data Service provides the Illinois research community with exper:se, tools, and infrastructure to manage

More information

IMPLEMENTING THE WASCAL DATA INFRASTRUCTURE (WADI)

IMPLEMENTING THE WASCAL DATA INFRASTRUCTURE (WADI) IMPLEMENTING THE WASCAL DATA INFRASTRUCTURE (WADI) Ralf Kunkel, Antonio Rogmann, Jürgen Sorg, Huaping Wang Helmholtz Open Science Webinare zu Forschungsdaten, 2015-03- 11 What is WASCAL? West African Science

More information

DataDryad.org and the interoperability continuum.

DataDryad.org and the interoperability continuum. DataDryad.org and the interoperability continuum. Repositories and Interoperability 2nd National Data Service Consortium Workshop (NDS2) October 24, 2014 Jane Greenberg Professor, College of Computing

More information

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION Plato Smith, Ph.D., Data Management Librarian DataONE Member Node Special Topics Discussion June 8, 2017, 2pm - 2:30 pm ASSESSING

More information

Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience. Chau Chin Lin Taiwan Forestry Research Institute

Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience. Chau Chin Lin Taiwan Forestry Research Institute Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience Chau Chin Lin Taiwan Forestry Research Institute Persons to Thank First for The Following Presentation Dr. Hen-biau King

More information

Data publication and discovery with Globus

Data publication and discovery with Globus Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,

More information

Indiana University Research Technology and the Research Data Alliance

Indiana University Research Technology and the Research Data Alliance Indiana University Research Technology and the Research Data Alliance Rob Quick Manager High Throughput Computing Operations Officer - OSG and SWAMP Board Member - RDA Organizational Assembly RDA Mission

More information

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang WP4: Data Forum Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang Motivation INTERACT research stations generate data and metadata Long term monitoring Short term process studies External

More information

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation INSPIRE 2010, KRAKOW Dr. Arif Shaon, Dr. Andrew Woolf (e-science, Science and Technology Facilities Council, UK) 3

More information

aginfra: High Performance Compu8ng einfrastructure for Agriculture

aginfra: High Performance Compu8ng einfrastructure for Agriculture aginfra: High Performance Compu8ng einfrastructure for Agriculture Antun Balaz Ins,tute of Physics Belgrade What is aginfra? A 3- years project, co- funded by the European Union, developing data infrastructure

More information

Data Portal and Integra.on in JAMSTEC

Data Portal and Integra.on in JAMSTEC Data Portal and Integra.on in JAMSTEC Yasunori Hanafusa Data Research Center for Marine-Earth Sciences (DrC) Agency for Marine-Earth Science and Technology (JAMSTEC) 1 Overview of Data Management in JAMSTEC

More information

Enabling Interaction and Quality in a Distributed Data DRIS

Enabling Interaction and Quality in a Distributed Data DRIS Purdue University Purdue e-pubs Libraries Research Publications 5-11-2006 Enabling Interaction and Quality in a Distributed Data DRIS D. Scott Brandt Purdue University, techman@purdue.edu James L. Mullins

More information

System Modeling Environment

System Modeling Environment System Modeling Environment Requirements, Architecture and Implementa

More information

Data Curation Profile Human Genomics

Data Curation Profile Human Genomics Data Curation Profile Human Genomics Profile Author Profile Author Institution Name Contact J. Carlson N. Brown Purdue University J. Carlson, jrcarlso@purdue.edu Date of Creation October 27, 2009 Date

More information

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group Sessions 3/4: Member Node Breakouts John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group Schedule 1:00-2:20 and 2:40-4:00 Member Node Breakouts Member Node Overview and Process Overview Documentation

More information

Data Management Tools. Lizzy Rolando, Georgia Tech Aaron Trehub, Auburn University August 6, 2013

Data Management Tools. Lizzy Rolando, Georgia Tech Aaron Trehub, Auburn University August 6, 2013 Data Management Tools Lizzy Rolando, Georgia Tech Aaron Trehub, Auburn University August 6, 2013 A brief history of how we got here The march of data, 3000 BC 2010 AD 2011-2013 Etc. Kipling on data management

More information

Outline. In Situ Data Triage and Visualiza8on

Outline. In Situ Data Triage and Visualiza8on In Situ Data Triage and Visualiza8on Kwan- Liu Ma University of California at Davis Outline In situ data triage and visualiza8on: Issues and strategies Case study: An earthquake simula8on Case study: A

More information

Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University

Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University Dataverse 4.0 & Beyond ì Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University 2 Data Science Team Data Cura/on & Stewardship Informa/on Scien/sts Researchers Sta/s/cal Innova/on

More information

The What, Why, Who and How of Where: Building a Portal for Geospatial Data. Alan Darnell Director, Scholars Portal

The What, Why, Who and How of Where: Building a Portal for Geospatial Data. Alan Darnell Director, Scholars Portal The What, Why, Who and How of Where: Building a Portal for Geospatial Data Alan Darnell Director, Scholars Portal What? Scholars GeoPortal Beta release Fall 2011 Production release March 2012 OLITA Award

More information

IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns. 6/25/14 Archive Analy3cs Solu3ons 1

IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns. 6/25/14 Archive Analy3cs Solu3ons 1 IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns 6/25/14 Archive Analy3cs Solu3ons 1 Credits Archive Analy3cs Solu3ons is presen3ng an archive system that embodies best prac3ce for long- term, high integrity

More information

Chain of Data Creation. Data Creation. Lab Notebook. Plan Your Experiment, Experiment With your Plan

Chain of Data Creation. Data Creation. Lab Notebook. Plan Your Experiment, Experiment With your Plan Chain of Data Creation Data Creation 1. Preparation 2. Creation of Metadata 3. Acquisition 4. Building a Permanent Record 5. Data Management 6. Storage 7. Data Sharing Lab Notebook Plan Your Experiment,

More information

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center Robert Cook, DAAC Scientist Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN cookrb@ornl.gov

More information

NARCCAP: North American Regional Climate Change Assessment Program. Seth McGinnis, NCAR

NARCCAP: North American Regional Climate Change Assessment Program. Seth McGinnis, NCAR NARCCAP: North American Regional Climate Change Assessment Program Seth McGinnis, NCAR mcginnis@ucar.edu NARCCAP: North American Regional Climate Change Assessment Program Nest highresolution regional

More information

EOSC Services & Architecture: the EOSC-hub approach Tiziana Ferrari, Project Coordinator, EGI Founda?on

EOSC Services & Architecture: the EOSC-hub approach Tiziana Ferrari, Project Coordinator, EGI Founda?on EOSC Services & Architecture: the EOSC-hub approach Tiziana Ferrari, Project Coordinator, EGI Founda?on eosc-hub.eu @EOSC_eu EOSC-hub receives funding from the European Union s Horizon 2020 research and

More information

April 17, Ronald Layne Manager, Data Quality and Data Governance

April 17, Ronald Layne Manager, Data Quality and Data Governance Ensuring the highest quality data is delivered throughout the university providing valuable information serving individual and organizational need April 17, 2015 Ronald Layne Manager, Data Quality and

More information

When the Need for an Ins/tu/onal Repository Gives Rise to a Federa/on

When the Need for an Ins/tu/onal Repository Gives Rise to a Federa/on When the Need for an Ins/tu/onal Repository Gives Rise to a Federa/on Lisa Schmidt lschmidt@msu.edu Michigan Academic Library Council March 18, 2011 Overview Ins?tu?onal Background Why an Ins?tu?onal Repository?

More information

Cloud Data Management System (CDMS)

Cloud Data Management System (CDMS) Cloud Management System (CMS) Wiqar Chaudry Solu9ons Engineer Senior Advisor CMS Overview he OpenStack cloud data management system features a canonical data modeling framework designed to broker context

More information

Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implementation

Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implementation Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implementation NFAIS, 23 July 2014 Laura Dawson Product Manager, Identifier Services, Bowker Laura.Dawson@bowker.com ISNI 0000 0004 1029

More information

From Open Data to Data- Intensive Science through CERIF

From Open Data to Data- Intensive Science through CERIF From Open Data to Data- Intensive Science through CERIF Keith G Jeffery a, Anne Asserson b, Nikos Houssos c, Valerie Brasse d, Brigitte Jörg e a Keith G Jeffery Consultants, Shrivenham, SN6 8AH, U, b University

More information

Data Curation Profile Plant Genetics / Corn Breeding

Data Curation Profile Plant Genetics / Corn Breeding Profile Author Author s Institution Contact Researcher(s) Interviewed Researcher s Institution Katherine Chiang Cornell University Library ksc3@cornell.edu Withheld Cornell University Date of Creation

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

SHARING GEOGRAPHIC INFORMATION ON THE INTERNET ICIMOD S METADATA/DATA SERVER SYSTEM USING ARCIMS

SHARING GEOGRAPHIC INFORMATION ON THE INTERNET ICIMOD S METADATA/DATA SERVER SYSTEM USING ARCIMS SHARING GEOGRAPHIC INFORMATION ON THE INTERNET ICIMOD S METADATA/DATA SERVER SYSTEM USING ARCIMS Sushil Pandey* Birendra Bajracharya** *International Centre for Integrated Mountain Development (ICIMOD)

More information

Data is the new Oil (Ann Winblad)

Data is the new Oil (Ann Winblad) Data is the new Oil (Ann Winblad) Keith G Jeffery keith.jeffery@keithgjefferyconsultants.co.uk 20140415-16 JRC Workshop Big Open Data Keith G Jeffery 1 Data is the New Oil Like oil has been, data is Abundant

More information

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI SEAD Data Services Jim Myers(myersjd@umich.edu), Best Practices in Data Infrastructure Workshop Cooperative agreement #OCI0940824 SEAD: Sustainable Environment - Actionable Data An NSF DataNet project

More information

Open-Source Based Solutions for Processing, Preserving, and Presenting Oral Histories

Open-Source Based Solutions for Processing, Preserving, and Presenting Oral Histories Western Washington University From the SelectedWorks of Mark I. Greenberg April 2, 2011 Open-Source Based Solutions for Processing, Preserving, and Presenting Oral Histories Mark I. Greenberg, University

More information

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel National Center for Supercomputing Applications University of Illinois

More information

Introduction to SDIs (Spatial Data Infrastructure)

Introduction to SDIs (Spatial Data Infrastructure) www.grid.unep.ch Regional training workshop on geographical information system for energy planning Introduction to SDIs (Spatial Data Infrastructure) Dakar, 12 August 2014 Gregory Giuliani Andrea de Bono,

More information

NextData System of Systems Infrastructure (ND-SoS-Ina)

NextData System of Systems Infrastructure (ND-SoS-Ina) NextData System of Systems Infrastructure (ND-SoS-Ina) DELIVERABLE D2.3 (CINECA, CNR-IIA) - Web Portal Architecture DELIVERABLE D4.1 (CINECA, CNR-IIA) - Test Infrastructure Document identifier: D2.3 D4.1

More information

Introduc)on to Data Management. Elizabeth Wickes Chris1e Wiley

Introduc)on to Data Management. Elizabeth Wickes Chris1e Wiley Introduc)on to Data Management Elizabeth Wickes Chris1e Wiley Data Management Workshop Series Introduc)on to Data Management February 16 th 10AM 11AM Documenta)on and Organiza)on for Data and Processes

More information

Digital Cura+on Planning at Michigan State University

Digital Cura+on Planning at Michigan State University Digital Cura+on Planning at Michigan State University Lisa Schmidt, Electronic Records Archivist Michigan State University Archives & Historical Collec+ons January 17, 2010 Overview Michigan State University

More information

Building a Materials Data Facility (MDF)

Building a Materials Data Facility (MDF) Building a Materials Data Facility (MDF) Ben Blaiszik (blaiszik@uchicago.edu) Ian Foster (foster@uchicago.edu) Steve Tuecke, Kyle Chard, Rachana Ananthakrishnan, Jim Pruyne (UC) Kandace Turner-Jones, John

More information

Dagmar Triebel, Peter Grobe, Anton Güntsch, Gregor Hagedorn, Joachim Holstein, Carola Söhngen, Claus Weiland, Tanja Weibulat.

Dagmar Triebel, Peter Grobe, Anton Güntsch, Gregor Hagedorn, Joachim Holstein, Carola Söhngen, Claus Weiland, Tanja Weibulat. How to organize, process and archive collection and occurrence data using GFBio services provided by Germany s major natural history and culture collection data repositories, Peter Grobe, Anton Güntsch,

More information

The NCAR Community Data Portal

The NCAR Community Data Portal The NCAR Community Data Portal http://cdp.ucar.edu/ QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this

More information

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future AGU Fall Meeting 2016 IN31-G The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future Clare Richards, Ben Evans, Lesley Wyborn, Jingbo

More information

Research Elsevier

Research Elsevier Research Data @ Elsevier From generation through sharing and publishing to discovery IJsbrand Jan Aalbersberg SVP Journal and Data Solutions NDS, Boulder - June 12, 2014 Contributors: Anita de Waard Hylke

More information

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation, Integration Alan Blatecky Director OCI 1 1 Framing the

More information

Fundamentals of Data Infrastructures

Fundamentals of Data Infrastructures Fundamentals of Data Infrastructures Dublin, March 2014 Welcome & Introduction Adam Carter EPCC, The University of Edinburgh Training Coordinator, EUDAT Timetable 09:00 Registration & Coffee 09:15 Welcome

More information

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018 Globus Platform Services for Data Publication Greg Nawrocki greg@globus.org University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018 Outline Globus Overview Globus Data Publication v1 Lessons

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

EUDAT & AAI. Daan Broeder MPI for Psycholinguistics

EUDAT & AAI. Daan Broeder MPI for Psycholinguistics EUDAT & AAI Daan Broeder MPI for Psycholinguistics Initially six research communities on Board EPOS: European Plate Observatory System CLARIN: Common Language Resources and Technology Infrastructure ENES:

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

Big Data infrastructure and tools in libraries

Big Data infrastructure and tools in libraries Line Pouchard, PhD Purdue University Libraries Research Data Group Big Data infrastructure and tools in libraries 08/10/2016 DATA IN LIBRARIES: THE BIG PICTURE IFLA/ UNIVERSITY OF CHICAGO BIG DATA: A VERY

More information

Earth Science Community view on Digital Repositories

Earth Science Community view on Digital Repositories Ground European Network for Earth Science Interoperations Digital Repository Dissemination and Exploitation of GRids in Earth science Earth Science Community view on Digital Repositories Luigi FUSCO -

More information

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography Christopher Crosby, San Diego Supercomputer Center J Ramon Arrowsmith, Arizona State University Chaitan

More information

UX & Usability Strategies and Website Assessments. Candice Kail, Web Services Librarian

UX & Usability Strategies and Website Assessments. Candice Kail, Web Services Librarian UX & Usability Strategies and Website Assessments Candice Kail, Web Services Librarian Usage Data We have been collec9ng Google Analy9cs Data since we migrated our content to our current Web CMS, AEM/CQ,

More information

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University Advancing a Services Oriented Architecture for Sharing Hydrologic Data Jeffery S. Horsburgh Utah Water Research Laboratory Utah State University D.G. Tarboton, D.R. Maidment, I. Zaslavsky, D.P. Ames, J.L.

More information

DataONE. Promoting Data Stewardship Through Best Practices

DataONE. Promoting Data Stewardship Through Best Practices DataONE Promoting Data Stewardship Through Best Practices Carly Strasser 1,2, Robert Cook 1,3, William Michener 1,4, Amber Budden 1,4, Rebecca Koskela 1,4 1 DataONE 2 University of California Santa Barbara

More information

Linked Open Data and Semantic Technologies for Research in Agriculture and Forestry

Linked Open Data and Semantic Technologies for Research in Agriculture and Forestry Linked Open and Semantic Technologies for Research in Agriculture and Forestry Platform Linked Nederland 2 April 2015 Rob Lokers, Alterra, Wageningen UR Contents related challenges in agricultural (and

More information

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd EUDAT Data Services & Tools for Researchers and Communities Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd CSC IT CENTER FOR SCIENCE! Founded in 1971 as a technical support

More information

Introduc)on to Data Management. Chris'e Wiley Sarah C. Williams Heidi Imker

Introduc)on to Data Management. Chris'e Wiley Sarah C. Williams Heidi Imker Introduc)on to Data Management Chris'e Wiley Sarah C. Williams Heidi Imker Data Management Workshop Series Introduc)on to Data Management Feb 10 th 4PM 5PM Apr 6 th 1PM 2PM Documenta)on and Organiza)on

More information

Introduction to Grid Computing

Introduction to Grid Computing Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able

More information

Leveraging Tools and Components from OODT and Apache within Climate Science and the Earth System Grid Federa9on

Leveraging Tools and Components from OODT and Apache within Climate Science and the Earth System Grid Federa9on Leveraging Tools and Components from OODT and Apache within Climate Science and the Earth System Grid Federa9on Luca Cinquini, Dan Crichton, Chris Ma2mann NASA Jet Propulsion Laboratory, California Ins9tute

More information

Distributed Systems INF Michael Welzl

Distributed Systems INF Michael Welzl Distributed Systems INF 3190 Michael Welzl What is a distributed system (DS)? Many defini8ons [Coulouris & Emmerich] A distributed system consists of hardware and sodware components located in a network

More information

Software + Services for Data Storage, Management, Discovery, and Re-Use

Software + Services for Data Storage, Management, Discovery, and Re-Use Software + Services for Data Storage, Management, Discovery, and Re-Use CODATA 22 Conference Stellenbosch, South Africa 25 October 2010 Alex D. Wade Director Scholarly Communication Microsoft External

More information

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Paving the Rocky Road Toward Open and FAIR in the Field Sciences Paving the Rocky Road Toward Open and FAIR Kerstin Lehnert Lamont-Doherty Earth Observatory, Columbia University IEDA (Interdisciplinary Earth Data Alliance), www.iedadata.org IGSN e.v., www.igsn.org Field

More information

Tyco Data Set Integration Project

Tyco Data Set Integration Project Tyco Data Set Integration Project Design Team Naif Al-Mulhim, Samuel Bar, Sunny Gupta, Rebecca Payne Design Advisor Prof. Abe Zeid Project Sponsor Edward Jones Abstract Many Tyco products rely on dynamic

More information

Robin Wilson Director. Digital Identifiers Metadata Services

Robin Wilson Director. Digital Identifiers Metadata Services Robin Wilson Director Digital Identifiers Metadata Services Report Digital Object Identifiers for Publishing and the e-learning Community CONTEXT elearning the the Publishing Challenge elearning the the

More information

Metadata of geographic information

Metadata of geographic information Metadata of geographic information Kai Koistinen Management of environmental data and information 4.10.2017 Topics Metadata of geographic information What is metadata? Metadata standards and recommendations

More information

Enabling Scalable Data Analysis for Large Computa9onal Structural Biology Datasets on Distributed Memory Systems

Enabling Scalable Data Analysis for Large Computa9onal Structural Biology Datasets on Distributed Memory Systems Enabling Scalable Data Analysis for Large Computa9onal Structural Biology Datasets on Distributed Memory Systems Michela Taufer Global Compu9ng Laboratory Computer and Informa9on Sciences University of

More information

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group The Common Framework for Earth Observation Data US Group on Earth Observations Data Management Working Group Agenda USGEO and BEDI background Concise summary of recommended CFEOD standards today Full document

More information

Jisc Research Data Shared Service

Jisc Research Data Shared Service Arpri 2017 Jisc Research Data Shared Service John Kaye Senior Co-Design Manager, Research Data ORCiD 0000-0002-4400-4252 #JiscRDM Who we are Jisc Research Data Services Context RDSS Context and Vision

More information

Update on the TDL Metadata Working Group s activities for

Update on the TDL Metadata Working Group s activities for Update on the TDL Metadata Working Group s activities for 2009-2010 Provide Texas Digital Library (TDL) with general metadata expertise. In particular, the Metadata Working Group will address the following

More information

Engaging and Connecting Faculty:

Engaging and Connecting Faculty: Engaging and Connecting Faculty: Research Discovery, Access, Re-use, and Archiving Janet McCue and Jon Corson-Rikert Albert R. Mann Library Cornell University CNI Spring 2007 Task Force Meeting April 16,

More information

Toward the Development of a Comprehensive Data & Information Management System for THORPEX

Toward the Development of a Comprehensive Data & Information Management System for THORPEX Toward the Development of a Comprehensive Data & Information Management System for THORPEX Mohan Ramamurthy, Unidata Steve Williams, JOSS Jose Meitin, JOSS Karyn Sawyer, JOSS UCAR Office of Programs Boulder,

More information

Submitted to: Dr. Sunnie Chung. Presented by: Sonal Deshmukh Jay Upadhyay

Submitted to: Dr. Sunnie Chung. Presented by: Sonal Deshmukh Jay Upadhyay Submitted to: Dr. Sunnie Chung Presented by: Sonal Deshmukh Jay Upadhyay Submitted to: Dr. Sunny Chung Presented by: Sonal Deshmukh Jay Upadhyay What is Apache Survey shows huge popularity spike for Apache

More information

Advancing the fourth paradigm of research: Assimilating repositories into active research phases

Advancing the fourth paradigm of research: Assimilating repositories into active research phases Title Here Advancing the fourth paradigm of research: Assimilating repositories into active research phases Tyler Walters Dean, University Libraries, Virginia Tech SPARC Conference, Kansas City, March

More information

Wade Sheldon. Georgia Coastal Ecosystems LTER University of Georgia CUAHSI Virtual Workshop Field Data Management Solutions

Wade Sheldon. Georgia Coastal Ecosystems LTER University of Georgia   CUAHSI Virtual Workshop Field Data Management Solutions Wade Sheldon Georgia Coastal Ecosystems LTER University of Georgia email: sheldon@uga.edu CUAHSI Virtual Workshop Field Data Management Solutions 01-Oct-2014 Georgia Coastal Ecosystems LTER started in

More information

5/23/18. Atomized individual items vs. Organized collec=ons (1/2) Atomized individual items vs. Organized collec=ons (2/2)

5/23/18. Atomized individual items vs. Organized collec=ons (1/2) Atomized individual items vs. Organized collec=ons (2/2) Archival Prac+ce involves Cura+on; Trying to minimize the impact of ruling narra+ves- Archival Prac+ce involves Cura+on; Trying to minimize the impact of ruling narra+ves Howard Besser Moving Image Archiving

More information

Using XML-encoded Metadata as a Basis for Advanced Information Systems for Ecological Research

Using XML-encoded Metadata as a Basis for Advanced Information Systems for Ecological Research Using XML-encoded Metadata as a Basis for Advanced Information Systems for Ecological Research Peter H. MCCARTNEY Center for Environmental Studies Arizona State University Tempe, AZ 85282, USA And Matthew

More information

The C3S Climate Data Store and its upcoming use by CAMS

The C3S Climate Data Store and its upcoming use by CAMS Atmosphere The C3S Climate Data Store and its upcoming use by CAMS Miha Razinger, ECMWF thanks to Angel Alos, Baudouin Raoult, Cedric Bergeron and the CDS contractors Atmosphere What are C3S and CDS? The

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Jeremy York, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Noncommercial Share Alike 3.0 License: http://creativecommons.org/licenses/by-nc-sa/3.0/

More information

WORLD. Patrick Combes Senior Solu3on Architect for Life Sciences at EMC/Isilon

WORLD. Patrick Combes Senior Solu3on Architect for Life Sciences at EMC/Isilon ISILON @GLOBUS WORLD Patrick Combes Senior Solu3on Architect for Life Sciences at EMC/Isilon patrick.combes@isilon.com Support Contact: Educa3on Services Isilon Overview Cluster of nodes, easily managed

More information

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data GeoDaRRs: What is the existing landscape and what gaps exist in that landscape for data producers and users? 7 August

More information

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library Live poll at: https://pollev.com/ cyndyparr196 Problems with Public Ag Data Government Website

More information

SEXTANT 1. Purpose of the Application

SEXTANT 1. Purpose of the Application SEXTANT 1. Purpose of the Application Sextant has been used in the domains of Earth Observation and Environment by presenting its browsing and visualization capabilities using a number of link geospatial

More information

Crea%ng and U%lizing Linked Open Sta%s%cal Data for the Development of Advanced Analy%cs Services E. Kalampokis, A. Karamanou, A. Nikolov, P.

Crea%ng and U%lizing Linked Open Sta%s%cal Data for the Development of Advanced Analy%cs Services E. Kalampokis, A. Karamanou, A. Nikolov, P. Crea%ng and U%lizing Linked Open Sta%s%cal Data for the Development of Advanced Analy%cs Services E. Kalampokis, A. Karamanou, A. Nikolov, P. Haase, R. Cyganiak, B. Roberts, P. Hermans, E. Tambouris, K.

More information

4 th Working Group on Geospatial Information

4 th Working Group on Geospatial Information 4 th Working Group on Geospatial Information Session 5: Contributing to the Work of the Custodian Agencies United Nations Headquarters December 6-8, 2017 Argyro Kavvada, NASA - BAH & EO4SDG Exec. Sec.

More information

Hawaii Energy and Environmental Technologies (HEET) Initiative

Hawaii Energy and Environmental Technologies (HEET) Initiative Hawaii Energy and Environmental Technologies (HEET) Initiative Office of Naval Research Grant Award Number N0014-11-1-0391 Task 8. ENERGY-NEUTRAL ENERGY TEST PLATFORMS 8.3 Advanced Database Research, Development

More information

COSC 310: So*ware Engineering. Dr. Bowen Hui University of Bri>sh Columbia Okanagan

COSC 310: So*ware Engineering. Dr. Bowen Hui University of Bri>sh Columbia Okanagan COSC 310: So*ware Engineering Dr. Bowen Hui University of Bri>sh Columbia Okanagan 1 Admin A2 is up Don t forget to keep doing peer evalua>ons Deadline can be extended but shortens A3 >meframe Labs This

More information

Data Partnerships to Improve Health Frequently Asked Questions. Glossary...9

Data Partnerships to Improve Health Frequently Asked Questions. Glossary...9 FAQ s Data Partnerships to Improve Health Frequently Asked Questions BENEFITS OF PARTICIPATING... 1 USING THE NETWORK.... 2 SECURING THE DATA AND NETWORK.... 3 PROTECTING PRIVACY.... 4 CREATING METADATA...

More information