DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson

Size: px
Start display at page:

Download "DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson"

Transcription

1 DataONE Cyberinfrastructure Ma# Jones Dave Vieglais Bruce Wilson

2 Foremost a Federa9on Member Nodes (MNs) Heart of the federa9on Harness the power of local cura9on Coordina9ng Nodes (CNs) Services to link Member Nodes Inves9gator Toolkit (ITK) Tools for the whole data lifecycle Interoperability 2

3 Requirements for DataONE Scalable Usable by people and agents Resilient to technical and ins9tu9onal change Adap9ve to evolving standards Inclusive of exis9ng communi9es and tools Cognizant of sociological drivers Informed by prior and current work

4 Why a Federa9on? Diverse Federa9on == Resilience Failover for temporary outages Insurance against project/ins9tu9onal failure Diverse Federa9on == Scalability Storage increases with Member Nodes Incremental costs to each MN to replicate Distributes sustainability costs 4

5 Member Nodes Authorita9ve members of the Federa9on Curate their own data holdings Provide unique iden,fiers for each object Ensure availability, quality, and reliability Replicate holdings for other MNs Provide access and access control Log and report accesses to objects Engage with DataONE community Deploy a DataONE- compa9ble sovware system 5

6 Implementa9on Tiers Tier 1 Supports publicly readable content without authen9ca9on or more specific access control rules. Tier 2 Tier 1 plus access control support Tier 3 Tier 2 plus ability to add content through the DataONE service interfaces and provides full support for interac9on with DataONE Inves9gator Toolkit applica9ons and plugins. Tier 4 Support the full set of DataONE APIs and can operate as replica9on targets, accep9ng content from compa9ble (technical and policy) Member Nodes and fully suppor9ng the DataONE content access control rules. 6

7 Characterizing Member Nodes Diverse Contributors Data Types Ecological Environmental Demographic Social/Legal/Economic 45 Individual inves9gators 30 Field sta9ons and networks Government agencies 15 Non- profit partnerships 0 Scien9fic Socie9es Synthesis centers MB 7 60 < Data Sizes % >200

8 Characterizing Member Nodes Diverse Contributors Data Types Ecological Environmental Demographic Social/Legal/Economic 45 Individual inves9gators 30 Field sta9ons and networks Government agencies 15 Non- profit partnerships 0 Scien9fic Socie9es Synthesis centers MB 7 60 < Data Sizes % >200

9 Coordina9ng Nodes Provide coordina9ng services Search and Discovery Preserva9on monitoring Object tracking and replica management User iden9ty management Logging and monitoring Op9mized High availability Performance Scalability

10 The Inves9gator Toolkit Inves9gator Toolkit Web Interface Analysis, Visualiza9on Data Management Client Libraries Java Python Command Line Discovery tools Data Management tools Analysis and modeling tools Cita9on and publica9on tools

11 Data Lifecycle Collect Analyze Assure Integrate Describe Discover Deposit Preserve

12 Data Lifecycle Collect Analyze Assure Integrate Describe Morpho Discover Deposit Preserve

13 Iden9fy objects Goal: Uniquely iden9fy data or metadata objects Support the several iden9fier types widely used Iden9fiers assigned by Member Nodes Uniqueness ensured by Coordina9ng Nodes Resolu9on through Coordina9ng Nodes GUID {3F2504E0-4 LSID PURL

14 Iden9fy people Iden9ty provider selected by the user Member nodes define access rules Rules propagated by Coordina9ng Nodes Iden9ty and access control consistent across en9re infrastructure

15 Deposit Data and Metadata Proxy KNB Native Generic <meta> Science metadata EML, FGDC, DC, ISO, DIF, System metadata Globally unique IDs for data & metadata (DOI, GUID, Hdl, ) Checksums of objects Object policies

16 Preserve Data and Metadata Metadata mirrored at Coordina9ng Nodes Data replicated between Member Nodes CNs manage copies Checksums recorded and verified Promote quality metadata Coordina9ng Nodes

17 Discover Content

18 Integrate and Analyze water temperature (bottom, 10m ADCP)!!!! Graphs and derived data can be archived in DataONE Temperature degrees C !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! 01:00 05:00 09:00 13:00 17:00 Time!!!!!!!!!!!!! 16

19 Analysis and Visualiza9on Diverse bird observa9ons and environmental data from 300,000 loca9ons in the US integrated and analyzed using High Performance Compu9ng Resources Model results Occurrence of Swainson s Hawk Land Cover Meteorology MODIS Remote sensing data Spa9o- Temporal Exploratory Model iden9fies factors affec9ng pa#erns of migra9on Jan Apr Jun Sep Dec Examine pa#erns of migra9on Infer how climate change may affect bird migra9on Slide from S. Kelling

20 DataONE System Overview

21 DataONE System Overview

22 DataONE Ac9vi9es Through Year 2 Deploy core infrastructure suppor9ng four fundamental services: Persistent, unique iden9fiers Bit- level preserva9on Search and retrieval Federated iden9ty Along with: Build out and deployment of Member Nodes Add ITK func9onality Test, test, test Ramp up R&D on addi9onal features

23 SoVware Delivered at Public Release Inves9gator Toolkit SoUware SearchPortal R Client Morpho Zotero Fuse FS Excel Mendeley Client Libraries Java Python Command Line DataONE Service Programming Interface (SPI) Member Node SoUware Metacat Dryad GMN CUASHI MerriZ Coordina9ng Node SoUware Service Interfaces Resolu9on Replica9on Iden9fiers Preserva9on Object Store Registra9on Discovery Catalog Monitor Index

24 DataONE Ac9vi9es: Years 3-5 Data sub- selng, transforma9on Visualiza9on Workflow support Seman9c search Seman9c data integra9on Computa9onal, or specialized nodes Inves9gator Toolkit expansion DMP-Tool

25 Cyberinfrastructure Outline CI Architecture, Requirements, and Design Member Nodes Coordina9ng Nodes Inves9gator Toolkit Demonstra9ons

26 Demonstra9ons Collect Analyze Assure Integrate Describe Discover Deposit Preserve 23

27 Demonstra9ons Collect Analyze Assure Integrate Describe Morpho Discover Deposit Preserve 23

28 Demonstra9ons Collect Analyze Assure Integrate Describe Morpho Discover Deposit Preserve 23

29 Demonstra9ons Collect Analyze Assure Integrate Describe Morpho Discover Deposit Preserve 23

30 Demonstra9ons Collect Analyze Assure Integrate Describe Morpho Discover Deposit Preserve 23

31 Demonstra9ons Collect Analyze Assure Integrate Describe Morpho Discover Deposit Preserve 23

32 Describing and deposit with Morpho 24

33 Data discovery 25

34 File system access 26

35 R plugin demonstra9on 27

36 Value of DataONE Discovery and access: Enabling discovery and universal access to data about life on earth from around the world Data integra9on and synthesis: Providing transforma9onal tools that enable cross- culng research Educa9on and training: Providing essen9al skills (e.g., data management training, best prac9ces) for scien9fic enquiry Building community: Combining exper9se and resources across diverse communi9es to collec9vely educate, advocate, and support stewardship of scien9fic data Data Sharing: Providing incen9ves and infrastructure for sharing data from federally funded researchers 28

Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE

Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE eurocris September 9, 2013 Outline Data Challenges Metadata Solu=on DataONE addressing the Data Challenge Enabling Scien=fic Discovery

More information

Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012

Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012 : Some Lessons Learned Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012 Acknowledgement and collaborators DataONE http://www.dataone.org/ Cal Dig. Lib. http://www.cdlib.org/

More information

DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences

DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences William K. Michener 1,2, Rebecca Koskela 1,2, Matthew B. Jones 2,3, Robert B. Cook 2,4, Mike Frame 2,5, Bruce Wilson

More information

Key cyberinfrastructure elements implemented as RESTful webservices

Key cyberinfrastructure elements implemented as RESTful webservices Key cyberinfrastructure elements implemented as RESTful webservices Investigator Toolkit Web Interface Analysis, Visualization Data Management Client Libraries Java Python Command Line Member Nodes Service

More information

Commi&ng to Data Quality

Commi&ng to Data Quality Commi&ng to Data Quality Ann Green Digital Lifecycle Research & Consul;ng NADDI Vancouver 2014 outline Data Quality Building the DDI ShiGs Crisis of Quality & Loss of Data Commi&ng to Data Quality Data

More information

EOSC Services & Architecture: the EOSC-hub approach Tiziana Ferrari, Project Coordinator, EGI Founda?on

EOSC Services & Architecture: the EOSC-hub approach Tiziana Ferrari, Project Coordinator, EGI Founda?on EOSC Services & Architecture: the EOSC-hub approach Tiziana Ferrari, Project Coordinator, EGI Founda?on eosc-hub.eu @EOSC_eu EOSC-hub receives funding from the European Union s Horizon 2020 research and

More information

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group Sessions 3/4: Member Node Breakouts John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group Schedule 1:00-2:20 and 2:40-4:00 Member Node Breakouts Member Node Overview and Process Overview Documentation

More information

DataONE: Open Persistent Access to Earth Observational Data

DataONE: Open Persistent Access to Earth Observational Data Open Persistent Access to al Robert J. Sandusky, UIC University of Illinois at Chicago The Net Partners Update: ONE and the Conservancy December 14, 2009 Outline NSF s Net Program ONE Introduction Motivating

More information

IMPLEMENTING THE WASCAL DATA INFRASTRUCTURE (WADI)

IMPLEMENTING THE WASCAL DATA INFRASTRUCTURE (WADI) IMPLEMENTING THE WASCAL DATA INFRASTRUCTURE (WADI) Ralf Kunkel, Antonio Rogmann, Jürgen Sorg, Huaping Wang Helmholtz Open Science Webinare zu Forschungsdaten, 2015-03- 11 What is WASCAL? West African Science

More information

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library Live poll at: https://pollev.com/ cyndyparr196 Problems with Public Ag Data Government Website

More information

The OpenAIRE Infrastructure

The OpenAIRE Infrastructure The OpenAIRE Infrastructure EC Policy on Open Access and the OpenAIRE Ini:a:ve EGI Scien2fic Publica2ons Repository Workshop Pasquale Pagano CNR - ISTI Courtesy by Donatella Castelli, Yannis Ionnadis,

More information

Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implementation

Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implementation Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implementation NFAIS, 23 July 2014 Laura Dawson Product Manager, Identifier Services, Bowker Laura.Dawson@bowker.com ISNI 0000 0004 1029

More information

IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns. 6/25/14 Archive Analy3cs Solu3ons 1

IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns. 6/25/14 Archive Analy3cs Solu3ons 1 IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns 6/25/14 Archive Analy3cs Solu3ons 1 Credits Archive Analy3cs Solu3ons is presen3ng an archive system that embodies best prac3ce for long- term, high integrity

More information

This is an important a-empt (one of the first in the US) to rebalance the rela8onships between publishers, academic ins8tu8ons, funders, and

This is an important a-empt (one of the first in the US) to rebalance the rela8onships between publishers, academic ins8tu8ons, funders, and This is an important a-empt (one of the first in the US) to rebalance the rela8onships between publishers, academic ins8tu8ons, funders, and individual researchers in workflows surrounding the publishing

More information

Creating a Digital Preservation Network with Shared Stewardship and Cost

Creating a Digital Preservation Network with Shared Stewardship and Cost Creating a Digital Preservation Network with Shared Stewardship and Cost The National Digital Information Infrastructure and Preservation Program Experience NDIIPP Investments Preservation Network Partnerships

More information

System Modeling Environment

System Modeling Environment System Modeling Environment Requirements, Architecture and Implementa

More information

Making Research Data Public: Why, What, and How. Fall 2016

Making Research Data Public: Why, What, and How. Fall 2016 Making Research Data Public: Why, What, and How Fall 2016 Research Data Service (RDS) The Research Data Service provides the Illinois research community with exper:se, tools, and infrastructure to manage

More information

The Office for Outer Space Affairs bringing space- based tools and applica:ons at the heart of the 2030 Agenda for Sustainable Development

The Office for Outer Space Affairs bringing space- based tools and applica:ons at the heart of the 2030 Agenda for Sustainable Development The Office for Outer Space Affairs bringing space- based tools and applica:ons at the heart of the 2030 Agenda for Sustainable Development SIMONETTA DI PIPPO, DIRECTOR United Nations Office for Outer Space

More information

Leveraging Tools and Components from OODT and Apache within Climate Science and the Earth System Grid Federa9on

Leveraging Tools and Components from OODT and Apache within Climate Science and the Earth System Grid Federa9on Leveraging Tools and Components from OODT and Apache within Climate Science and the Earth System Grid Federa9on Luca Cinquini, Dan Crichton, Chris Ma2mann NASA Jet Propulsion Laboratory, California Ins9tute

More information

NARCCAP: North American Regional Climate Change Assessment Program. Seth McGinnis, NCAR

NARCCAP: North American Regional Climate Change Assessment Program. Seth McGinnis, NCAR NARCCAP: North American Regional Climate Change Assessment Program Seth McGinnis, NCAR mcginnis@ucar.edu NARCCAP: North American Regional Climate Change Assessment Program Nest highresolution regional

More information

Engaging Employees and Customers with Video. The Benefits of Corporate Webcas3ng

Engaging Employees and Customers with Video. The Benefits of Corporate Webcas3ng Engaging Employees and Customers with Video The Benefits of Corporate Webcas3ng Agenda Introduc9on UnityLivestream Teradek Wowza Workflow Produc9on Streaming Delivery Case Studies Demo - Live Solu9on -

More information

Data Archival and Dissemination Tools to Support Your Research, Management, and Education

Data Archival and Dissemination Tools to Support Your Research, Management, and Education Data Archival and Dissemination Tools to Support Your Research, Management, and Education LIZA BRAZIL CUAHSI PRODUCT MANAGER Shout Out: Upcoming Cyberseminars April 13: Liza Brazil, CUAHSI: Data Archiving

More information

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development Jeremy Fischer Indiana University 9 September 2014 Citation: Fischer, J.L. 2014. ACCI Recommendations on Long Term

More information

DataONE. Promoting Data Stewardship Through Best Practices

DataONE. Promoting Data Stewardship Through Best Practices DataONE Promoting Data Stewardship Through Best Practices Carly Strasser 1,2, Robert Cook 1,3, William Michener 1,4, Amber Budden 1,4, Rebecca Koskela 1,4 1 DataONE 2 University of California Santa Barbara

More information

National Science and Technology Council. Interagency Working Group on Digital Data

National Science and Technology Council. Interagency Working Group on Digital Data National Science and Technology Council Interagency Working Group on Digital Data 1 Interagency Working Group White House Executive Office of the President Office of Science and Technology Policy National

More information

Digital Cura+on Planning at Michigan State University

Digital Cura+on Planning at Michigan State University Digital Cura+on Planning at Michigan State University Lisa Schmidt, Electronic Records Archivist Michigan State University Archives & Historical Collec+ons January 17, 2010 Overview Michigan State University

More information

Con$nuous Audi$ng and Risk Management in Cloud Compu$ng

Con$nuous Audi$ng and Risk Management in Cloud Compu$ng Con$nuous Audi$ng and Risk Management in Cloud Compu$ng Marcus Spies Chair of Knowledge Management LMU University of Munich Scien$fic / Technical Director of EU Integrated Research Project MUSING Cloud

More information

Data Management Tools. Lizzy Rolando, Georgia Tech Aaron Trehub, Auburn University August 6, 2013

Data Management Tools. Lizzy Rolando, Georgia Tech Aaron Trehub, Auburn University August 6, 2013 Data Management Tools Lizzy Rolando, Georgia Tech Aaron Trehub, Auburn University August 6, 2013 A brief history of how we got here The march of data, 3000 BC 2010 AD 2011-2013 Etc. Kipling on data management

More information

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data GeoDaRRs: What is the existing landscape and what gaps exist in that landscape for data producers and users? 7 August

More information

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI SEAD Data Services Jim Myers(myersjd@umich.edu), Best Practices in Data Infrastructure Workshop Cooperative agreement #OCI0940824 SEAD: Sustainable Environment - Actionable Data An NSF DataNet project

More information

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang

WP4: Data Forum. Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang WP4: Data Forum Øystein Godøy, Boris Radosavljević, Boris Biskaborn, Anna Irrgang Motivation INTERACT research stations generate data and metadata Long term monitoring Short term process studies External

More information

Outline. In Situ Data Triage and Visualiza8on

Outline. In Situ Data Triage and Visualiza8on In Situ Data Triage and Visualiza8on Kwan- Liu Ma University of California at Davis Outline In situ data triage and visualiza8on: Issues and strategies Case study: An earthquake simula8on Case study: A

More information

Data Portal and Integra.on in JAMSTEC

Data Portal and Integra.on in JAMSTEC Data Portal and Integra.on in JAMSTEC Yasunori Hanafusa Data Research Center for Marine-Earth Sciences (DrC) Agency for Marine-Earth Science and Technology (JAMSTEC) 1 Overview of Data Management in JAMSTEC

More information

Introduction to Securing Critical Infrastructure

Introduction to Securing Critical Infrastructure Her kan tekst skrives Her kan tekst skrives Introduction to Securing Critical Infrastructure Her kan tekst skrives Keith Frederick CISSP, CAP, CRISC, Author securenok.com Topics A)acks on the Oil and Gas

More information

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Paving the Rocky Road Toward Open and FAIR in the Field Sciences Paving the Rocky Road Toward Open and FAIR Kerstin Lehnert Lamont-Doherty Earth Observatory, Columbia University IEDA (Interdisciplinary Earth Data Alliance), www.iedadata.org IGSN e.v., www.igsn.org Field

More information

Indiana University Research Technology and the Research Data Alliance

Indiana University Research Technology and the Research Data Alliance Indiana University Research Technology and the Research Data Alliance Rob Quick Manager High Throughput Computing Operations Officer - OSG and SWAMP Board Member - RDA Organizational Assembly RDA Mission

More information

Robin Wilson Director. Digital Identifiers Metadata Services

Robin Wilson Director. Digital Identifiers Metadata Services Robin Wilson Director Digital Identifiers Metadata Services Report Digital Object Identifiers for Publishing and the e-learning Community CONTEXT elearning the the Publishing Challenge elearning the the

More information

aginfra: High Performance Compu8ng einfrastructure for Agriculture

aginfra: High Performance Compu8ng einfrastructure for Agriculture aginfra: High Performance Compu8ng einfrastructure for Agriculture Antun Balaz Ins,tute of Physics Belgrade What is aginfra? A 3- years project, co- funded by the European Union, developing data infrastructure

More information

EUDAT & AAI. Daan Broeder MPI for Psycholinguistics

EUDAT & AAI. Daan Broeder MPI for Psycholinguistics EUDAT & AAI Daan Broeder MPI for Psycholinguistics Initially six research communities on Board EPOS: European Plate Observatory System CLARIN: Common Language Resources and Technology Infrastructure ENES:

More information

How to use Water Data to Produce Knowledge: Data Sharing with the CUAHSI Water Data Center

How to use Water Data to Produce Knowledge: Data Sharing with the CUAHSI Water Data Center How to use Water Data to Produce Knowledge: Data Sharing with the CUAHSI Water Data Center Jon Pollak The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) August 20,

More information

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard University @mercecrosas mercecrosas.com Open Research Cloud, May 11, 2017 Best Practices

More information

Western Michigan University

Western Michigan University CS-6030 Cloud compu;ng Google App engine Sepideh Mohammadi Summer II 2017 Western Michigan University content Categories of cloud compu;ng Google cloud plaborm Google App Engine Storage technologies Datastore

More information

Site# Date H20 Temperature Conductance Turbidity KRS Sep KRS Aug KRS Aug

Site# Date H20 Temperature Conductance Turbidity KRS Sep KRS Aug KRS Aug ID ASR_Number Sample_Number QC_Code Analysis_Request_No External_Sample_Number Start_Date 1 1383 892 1 08-Aug-2002 2 1383 902 1 08-Aug-2002 3 1383 912 1 08-Aug-2002 Site# Date H20 Temperature Conductance

More information

Dagmar Triebel, Peter Grobe, Anton Güntsch, Gregor Hagedorn, Joachim Holstein, Carola Söhngen, Claus Weiland, Tanja Weibulat.

Dagmar Triebel, Peter Grobe, Anton Güntsch, Gregor Hagedorn, Joachim Holstein, Carola Söhngen, Claus Weiland, Tanja Weibulat. How to organize, process and archive collection and occurrence data using GFBio services provided by Germany s major natural history and culture collection data repositories, Peter Grobe, Anton Güntsch,

More information

MPI Performance Analysis Trace Analyzer and Collector

MPI Performance Analysis Trace Analyzer and Collector MPI Performance Analysis Trace Analyzer and Collector Berk ONAT İTÜ Bilişim Enstitüsü 19 Haziran 2012 Outline MPI Performance Analyzing Defini6ons: Profiling Defini6ons: Tracing Intel Trace Analyzer Lab:

More information

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 0 Welcome to the Pure International Conference Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 1 Mendeley Data Use Synergies with Pure to Showcase Additional Research Outputs Nikhil Joshi Solutions

More information

Big Data infrastructure and tools in libraries

Big Data infrastructure and tools in libraries Line Pouchard, PhD Purdue University Libraries Research Data Group Big Data infrastructure and tools in libraries 08/10/2016 DATA IN LIBRARIES: THE BIG PICTURE IFLA/ UNIVERSITY OF CHICAGO BIG DATA: A VERY

More information

NetCDF and Related Interna/onal Standards

NetCDF and Related Interna/onal Standards NetCDF and Related Interna/onal Standards Ben Domenico October 2012 Outline Brief historical context Unidata/partners have established a solid founda/on: Standard data access interfaces enable other Earth

More information

Submitted to: Dr. Sunnie Chung. Presented by: Sonal Deshmukh Jay Upadhyay

Submitted to: Dr. Sunnie Chung. Presented by: Sonal Deshmukh Jay Upadhyay Submitted to: Dr. Sunnie Chung Presented by: Sonal Deshmukh Jay Upadhyay Submitted to: Dr. Sunny Chung Presented by: Sonal Deshmukh Jay Upadhyay What is Apache Survey shows huge popularity spike for Apache

More information

Quality Assured (QA) data

Quality Assured (QA) data Quality Assured (QA) data Towards DOI quality of data generated at the UFZ Mark Frenzel (Ecologist) & Thomas Schnicke (IT) DataCite / Helmholtz Open Science Workshop Leipzig, 12.01.2016 QA + DOI: Best

More information

5/23/18. Atomized individual items vs. Organized collec=ons (1/2) Atomized individual items vs. Organized collec=ons (2/2)

5/23/18. Atomized individual items vs. Organized collec=ons (1/2) Atomized individual items vs. Organized collec=ons (2/2) Archival Prac+ce involves Cura+on; Trying to minimize the impact of ruling narra+ves- Archival Prac+ce involves Cura+on; Trying to minimize the impact of ruling narra+ves Howard Besser Moving Image Archiving

More information

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography Christopher Crosby, San Diego Supercomputer Center J Ramon Arrowsmith, Arizona State University Chaitan

More information

Engaging and Connecting Faculty:

Engaging and Connecting Faculty: Engaging and Connecting Faculty: Research Discovery, Access, Re-use, and Archiving Janet McCue and Jon Corson-Rikert Albert R. Mann Library Cornell University CNI Spring 2007 Task Force Meeting April 16,

More information

DRS Policy Guide. Management of DRS operations is the responsibility of staff in Library Technology Services (LTS).

DRS Policy Guide. Management of DRS operations is the responsibility of staff in Library Technology Services (LTS). Harvard University Library Office for Information Systems DRS Policy Guide This Guide defines the policies associated with the Harvard Library Digital Repository Service (DRS) and is intended for Harvard

More information

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms Grid Computing 1 Resource sharing Elements of Grid Computing - Computers, data, storage, sensors, networks, - Sharing always conditional: issues of trust, policy, negotiation, payment, Coordinated problem

More information

WE HAVE SOME GREAT EARLY ADOPTERS

WE HAVE SOME GREAT EARLY ADOPTERS WE HAVE SOME GREAT EARLY ADOPTERS 1 2 3 Data cita%on at Springer Nature journals key events 1998 : Accession codes required for various data types at Nature journals and marked up in ar:cles (= data referencing

More information

Striving for efficiency

Striving for efficiency Ron Dekker Director CESSDA Striving for efficiency Realise the social data part of EOSC How to Get the Maximum from Research Data Prerequisites and Outcomes University of Tartu, 29 May 2018 Trends 1.Growing

More information

Composing, Reproducing, and Sharing Simula5ons

Composing, Reproducing, and Sharing Simula5ons Composing, Reproducing, and Sharing Simula5ons Daniel Mosse {mosse,childers}@cs.pi

More information

REsources linkage for E-scIence - RENKEI -

REsources linkage for E-scIence - RENKEI - REsources linkage for E-scIence - - hlp://www.e- sciren.org/ REsources linkage for E- science () is a research and development project for new middleware technologies to enable e- science communi?es. ""

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Martin Hellmich Slides adapted from Damien Lecarpentier DCH-RP workshop, Manchester, 10 April 2013 Research Infrastructures Research Infrastructure

More information

User Community Driven Development in Trust and Identity Services

User Community Driven Development in Trust and Identity Services User Community Driven Development in Trust and Identity Services Ann Harding, SWITCH Internet2 Global Summit 27 April 2015 Washington DCs Agenda Trust and Iden.ty Landscape GÉANT Research Community Engagement

More information

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center Robert Cook, DAAC Scientist Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN cookrb@ornl.gov

More information

Cloud Computing WSU Dr. Bahman Javadi. School of Computing, Engineering and Mathematics

Cloud Computing WSU Dr. Bahman Javadi. School of Computing, Engineering and Mathematics Cloud Computing Research @ WSU Dr. Bahman Javadi School of Computing, Engineering and Mathematics Research Team and Research Interests Team 4 Academic Staff 5 PhD Students 1 Master Student Resource Scheduling

More information

Reproducibility and FAIR Data in the Earth and Space Sciences

Reproducibility and FAIR Data in the Earth and Space Sciences Reproducibility and FAIR Data in the Earth and Space Sciences December 2017 Brooks Hanson Sr. VP, Publications, American Geophysical Union bhanson@agu.org Earth and Space Science is Essential for Society

More information

A Model-Driven Approach to Situations: Situation Modeling and Rule-Based Situation Detection

A Model-Driven Approach to Situations: Situation Modeling and Rule-Based Situation Detection A Model-Driven Approach to Situations: Situation Modeling and Rule-Based Situation Detection Patrícia Dockhorn Costa Izon Thomas Mielke Isaac Pereira João Paulo A. Almeida jpalmeida@ieee.org http://nemo.inf.ufes.br

More information

An introduc/on to Sir0i

An introduc/on to Sir0i Authen4ca4on and Authorisa4on for Research and Collabora4on An introduc/on to Sir0i Addressing Federated Security Incident Response Hannah Short CERN hannah.short@cern.ch TF-CSIRT May, 2016 Agenda Federated

More information

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository Robert R. Downs and Robert S. Chen Center for International Earth Science Information

More information

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:

More information

Open-Source Based Solutions for Processing, Preserving, and Presenting Oral Histories

Open-Source Based Solutions for Processing, Preserving, and Presenting Oral Histories Western Washington University From the SelectedWorks of Mark I. Greenberg April 2, 2011 Open-Source Based Solutions for Processing, Preserving, and Presenting Oral Histories Mark I. Greenberg, University

More information

WORLD. Patrick Combes Senior Solu3on Architect for Life Sciences at EMC/Isilon

WORLD. Patrick Combes Senior Solu3on Architect for Life Sciences at EMC/Isilon ISILON @GLOBUS WORLD Patrick Combes Senior Solu3on Architect for Life Sciences at EMC/Isilon patrick.combes@isilon.com Support Contact: Educa3on Services Isilon Overview Cluster of nodes, easily managed

More information

When the Need for an Ins/tu/onal Repository Gives Rise to a Federa/on

When the Need for an Ins/tu/onal Repository Gives Rise to a Federa/on When the Need for an Ins/tu/onal Repository Gives Rise to a Federa/on Lisa Schmidt lschmidt@msu.edu Michigan Academic Library Council March 18, 2011 Overview Ins?tu?onal Background Why an Ins?tu?onal Repository?

More information

Wendy Thomas Minnesota Population Center NADDI 2014

Wendy Thomas Minnesota Population Center NADDI 2014 Wendy Thomas Minnesota Population Center NADDI 2014 Coverage Problem statement Why are there problems with interoperability with external search, storage and delivery systems Minnesota Population Center

More information

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago Assembly Annota3on Modeling Design Big Data, Big Compute, Big Interac3on Machines for Future Biology Rick Stevens stevens@anl.gov Argonne Na3onal Laboratory The University of Chicago There are no solved

More information

Fundamentals of Federated Iden0ty Infrastructure

Fundamentals of Federated Iden0ty Infrastructure Fundamentals of Federated Iden0ty Infrastructure Sal D Agos0no IDmachines LLC Federate fed er ate Verb past tense: federated; past participle: federated ˈfedəәˌrāt/ 1. (with reference to a number of states

More information

Increase Engagement in Educa0on with Video Streaming. How The University of Maine Changed Their Learning Experience with Wowza

Increase Engagement in Educa0on with Video Streaming. How The University of Maine Changed Their Learning Experience with Wowza Increase Engagement in Educa0on with Video How The University of Maine Changed Their Learning Experience with Wowza Agenda Introduc0ons The University of Maine BioMedia Lab Synapse LCMS (Learning Content

More information

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations October 29, 2014 Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations XSEDE for HPC Users What is XSEDE? XSEDE mo/va/on and goals XSEDE Resources XSEDE for HPC Users: Before

More information

The Storage Networking Industry Association (SNIA) Data Preservation and Metadata Projects. Bob Rogers, Application Matrix

The Storage Networking Industry Association (SNIA) Data Preservation and Metadata Projects. Bob Rogers, Application Matrix The Storage Networking Industry Association (SNIA) Data Preservation and Metadata Projects Bob Rogers, Application Matrix Overview The Self Contained Information Retention Format Rationale & Objectives

More information

ACCESS Health Indonesia. ACCESS Global Mee.ng February 10-13, 2014 Goa, India

ACCESS Health Indonesia. ACCESS Global Mee.ng February 10-13, 2014 Goa, India ACCESS Health Indonesia ACCESS Global Mee.ng February 10-13, 2014 Goa, India 1 CONTENTS 1. ACCESS Health Interna.onal 2. Sustainable ehealth Ecosystem 3. 4. 5. 6. 7. ACCESS Mission and Sustainable ehealth

More information

Collaborative data-driven science. Collaborative data-driven science

Collaborative data-driven science. Collaborative data-driven science Alex Szalay ! Started with the SDSS SkyServer! Built very quickly in 2001! Goal: instant access to rich content! Idea: bring the analysis to the data! Interac

More information

Fundamentals of Data Infrastructures

Fundamentals of Data Infrastructures Fundamentals of Data Infrastructures Dublin, March 2014 Welcome & Introduction Adam Carter EPCC, The University of Edinburgh Training Coordinator, EUDAT Timetable 09:00 Registration & Coffee 09:15 Welcome

More information

Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University

Dataverse 4.0 & Beyond. Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University Dataverse 4.0 & Beyond ì Eleni Castro > Ins/tute for Quan/ta/ve Social Science (IQSS), Harvard University 2 Data Science Team Data Cura/on & Stewardship Informa/on Scien/sts Researchers Sta/s/cal Innova/on

More information

Brown University Libraries Technology Plan

Brown University Libraries Technology Plan Brown University Libraries Technology Plan 2009-2011 Technology Vision Brown University Library creates, develops, promotes, and uses technology to further the Library s mission and strategic directions

More information

Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences

Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences Vicki L. Ferrini, Kerstin A. Lehnert, Suzanne M. Carbotte, and Leslie Hsu Lamont-Doherty Earth Observatory What is

More information

Agenda. About ECRIN Overview of ECRIN Ac4vi4es Increasing value

Agenda. About ECRIN Overview of ECRIN Ac4vi4es Increasing value Agenda About ECRIN Overview of ECRIN Ac4vi4es Increasing value ECRIN Overview A non- profit organisa4on with the legal status of European Research Infrastructure Consor4um (ERIC) Mission: support the conduct

More information

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Microsoft SharePoint Server 2013 Plan, Configure & Manage Microsoft SharePoint Server 2013 Plan, Configure & Manage Course 20331-20332B 5 Days Instructor-led, Hands on Course Information This five day instructor-led course omits the overlap and redundancy that

More information

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG Interoperable Cloud Storage with the CDMI Standard Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA.

More information

Building Open Source IoT Ecosystems. November 2017

Building Open Source IoT Ecosystems. November 2017 Building Open Source IoT Ecosystems November 2017 Jim White, Dell Distinguished Engineer & Senior Software Architect james_white2@dell.com Dell Project Fuse Architect EdgeX Foundry Technical Steering Committee

More information

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project Introduction to GT3 The Globus Project Argonne National Laboratory USC Information Sciences Institute Copyright (C) 2003 University of Chicago and The University of Southern California. All Rights Reserved.

More information

Direc>ons in Distributed Compu>ng

Direc>ons in Distributed Compu>ng Direc>ons in Distributed Compu>ng Robert Shimp Group Vice President August 23, 2016 Copyright 2016 Oracle and/or its affiliates. All rights reserved. Safe Harbor Statement The following is intended to outline

More information

From Continuous Integration To Continuous Delivery With Jenkins

From Continuous Integration To Continuous Delivery With Jenkins From Continuous Integration To Continuous Delivery With Cyrille Le Clerc, Solution Architect, CloudBees About Me @cyrilleleclerc CTO Solu9on Architect Open Source Cyrille Le Clerc DevOps, Infra as Code,

More information

Enterprise Risk Management (ERM) and Cybersecurity. Na9onal Science Founda9on March 14, 2018

Enterprise Risk Management (ERM) and Cybersecurity. Na9onal Science Founda9on March 14, 2018 Enterprise Risk Management (ERM) and Cybersecurity Na9onal Science Founda9on March 14, 2018 Agenda Guiding Principles for Implementing ERM at NSF (Based on COSO) NSF s ERM Framework ERM Cybersecurity Risk

More information

Making Data Count Promo0ng Open Data Through Usage and Impact Tracking #mdc

Making Data Count Promo0ng Open Data Through Usage and Impact Tracking #mdc CNI Spring 2017 Membership Mee;ng Albuquerque, May 3-4, 2017 Making Data Count Promo0ng Open Data Through Usage and Impact Tracking #mdc Stephen Abrams UC Cura0on Center California Digital Library @slabrams

More information

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION Plato Smith, Ph.D., Data Management Librarian DataONE Member Node Special Topics Discussion June 8, 2017, 2pm - 2:30 pm ASSESSING

More information

Review of the DCMI Abstract Model

Review of the DCMI Abstract Model Review of the DCMI Abstract Model Thomas Baker, DCMI Joint Mee>ng of the DCMI Architecture Forum and W3C Library Linked Data Incubator Group 22 October 2010 DRAFT SLIDES 2010-10- 06 Early 2000s DC straddling

More information

ArcGIS 9.2 Works as a Complete System

ArcGIS 9.2 Works as a Complete System ArcGIS 9.2 Works as a Complete System A New Way to Manage and Disseminate Geographic Knowledge Author/Serve/Use Maps Data Models Globes Metadata Use Desktop Explorer Web Map Viewer Mobile Open APIs Enterprise

More information

Research Data Management & Preservation: A Library Perspective

Research Data Management & Preservation: A Library Perspective Research Data Management & Preservation: A Library Perspective Brian Owen, Associate University Librarian Library Technology Services & Special Collections, Simon Fraser University Library LIBRARIES &

More information

DSpace Fedora. Eprints Greenstone. Handle System

DSpace Fedora. Eprints Greenstone. Handle System Enabling Inter-repository repository Access Management between irods and Fedora Bing Zhu, Uni. of California: San Diego Richard Marciano Reagan Moore University of North Carolina at Chapel Hill May 18,

More information

Software + Services for Data Storage, Management, Discovery, and Re-Use

Software + Services for Data Storage, Management, Discovery, and Re-Use Software + Services for Data Storage, Management, Discovery, and Re-Use CODATA 22 Conference Stellenbosch, South Africa 25 October 2010 Alex D. Wade Director Scholarly Communication Microsoft External

More information

CIS : Computational Reproducibility

CIS : Computational Reproducibility CIS 602-01: Computational Reproducibility Virtual Machines Dr. David Koop Figure 2. The MODIS grid, with highlighted tiles (red) of spatial extent for California (green), with citation. Computational Data

More information

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation INSPIRE 2010, KRAKOW Dr. Arif Shaon, Dr. Andrew Woolf (e-science, Science and Technology Facilities Council, UK) 3

More information