Indiana University Research Technology and the Research Data Alliance

Similar documents
Global Data Sharing The Research Data Alliance

RDA in a nutshell. Warsaw, 28 January Leif Laksonen/RDA Europe 3 CC BY-SA 4.0

The RDA Structure and Governance

RDA in a nutshell. June CC BY-SA 4.0

The Research Data Alliance Creating the culture and technology for an international data infrastructure

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

Legal Interoperability of Research Data: Principles and Implementation Guidelines

Globally Interoperable IoT Identification and Data Processing

The Research Data Alliance (RDA): building global data connections CC BY-SA 4.0

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

EUDAT Common data infrastructure

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Sharing research data globally and supporting re-use - Playing your part Leif Laaksonen /National Archive of Finland

Community and Impact Building the Research Data Alliance

Improving a Trustworthy Data Repository with ISO 16363

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

DATA SHARING FOR BETTER SCIENCE

Assessing the FAIRness of Datasets in Trustworthy Digital Repositories: a 5 star scale

Towards FAIRness: some reflections from an Earth Science perspective

Striving for efficiency

DOIs for Research Data

Caring for research data and what about software? Peter Doorn, director DANS

Science Europe Consultation on Research Data Management

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

EUDAT. Towards a pan-european Collaborative Data Infrastructure

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

Progress towards the EOSC

GEOSS Data Management Principles: Importance and Implementation

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Federated Identity Management for Research Collaborations. Bob Jones IT dept CERN 29 October 2013

European Open Science Cloud Implementation roadmap: translating the vision into practice. September 2018

Data Management at NIST

Ensuring Proper Storage for Earth Science Data: The USGS Process to Certify Trusted Digital Repositories

EUDAT- Towards a Global Collaborative Data Infrastructure

Collaborations and Partnerships. John Broome CODATA-International

Conferenza GARR Moving Forward to deliver an «EOSC in Practice» by 2018

The Changing Role of Data Stewardship in Creating Trustworthy, Transdisciplinary High Performance Data Platforms for the Future

The EUDAT Collaborative Data Infrastructure

EGI federated e-infrastructure, a building block for the Open Science Commons

Data Curation Handbook Steps

Fair data and open data: differences and consequences

Business Model for Global Platform for Big Data for Official Statistics in support of the 2030 Agenda for Sustainable Development

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

Enabling Collaboration for Digital Preservation

Applying Archival Science to Digital Curation: Advocacy for the Archivist s Role in Implementing and Managing Trusted Digital Repositories

towards a federated infrastructure enabling integrated life science research

Implementation of the CoreTrustSeal

Digital repositories as research infrastructure: a UK perspective

Why CERIF? Keith G Jeffery Scientific Coordinator ERCIM Anne Assserson eurocris. Keith G Jeffery SDSVoc Workshop Amsterdam

Implementing the RDA Data Citation Recommendations for Long Tail Research Data. Stefan Pröll

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

Digital Preservation Policy. Principles of digital preservation at the Data Archive for the Social Sciences

Reproducibility and FAIR Data in the Earth and Space Sciences

Open Science Commons: A Participatory Model for the Open Science Cloud

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director

Supporting European e-infrastructure service providers to join the European Open Science Cloud

UAE National Space Policy Agenda Item 11; LSC April By: Space Policy and Regulations Directory

ESFRI WORKSHOP ON RIs AND EOSC

Federal STI Managers Group Presentation to Board on Research Data and Information Ellen Herbst Director, NTIS CENDI Chair

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

Data Discovery - Introduction

DataONE: Open Persistent Access to Earth Observational Data

Horizon 2020 and the Open Research Data pilot. Sarah Jones Digital Curation Centre, Glasgow

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

European digital repository certification: the way forward

Call for Participation in AIP-6

State of the Art in Data Citation

DSA WDS Partnership Working Group Catalogue of Common Requirements

EUDAT Towards a Collaborative Data Infrastructure

B2FIND and Metadata Quality

e-infrastructure for the 21st Century - one year later

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

National Research Data Cloud

How to share research data

Federal-State Connections: Opportunities for Coordination and Collaboration

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

Networking European Digital Repositories

re3data.org Registry of Research Data Repositories Peter Schirmbacher Humboldt-Universität zu Berlin ETD Hong Kong, September 25.

The Virtual Observatory and the IVOA

Data Governance Central to Data Management Success

Overview of the European Open Science Cloud. Jan Wiebelitz e-irg support programme

GSCB-Workshop on Ground Segment Evolution Strategy

Dataverse and DataTags

ESTABLISHMENT OF AN OFFICE OF FORENSIC SCIENCES AND A FORENSIC SCIENCE BOARD WITHIN THE DEPARTMENT OF JUSTICE

EUDAT. Towards a pan-european Collaborative Data Infrastructure

OUR VISION To be a global leader of computing research in identified areas that will bring positive impact to the lives of citizens and society.

TEXT MINING: THE NEXT DATA FRONTIER

Making Open Data work for Europe

strategy IT Str a 2020 tegy

Electronic Records Archives: Philadelphia Federal Executive Board

The Arab ICT Organization

Resolution adopted by the General Assembly. [on the report of the Second Committee (A/64/417)]

Transcription:

Indiana University Research Technology and the Research Data Alliance Rob Quick Manager High Throughput Computing Operations Officer - OSG and SWAMP Board Member - RDA Organizational Assembly

RDA Mission and History RDA Structure RDA Outputs IU RT Organizational Goals

The Research Data Alliance builds the social and technical bridges that enable open sharing of data. The RDA vision is researchers and innovators openly sharing data across technologies, disciplines, and countries to address the grand challenges of society.

FAIR Open Data Principles Findable Accessible Interoperable Re-usable

A brief history of IU and the RDA 1st Plenary Meeting - March, 2013 - Launch EC, NSF, NIST, and ANDS (Australia) IU SoIC - Beth Plale - Technical Advisory Board 4th Plenary Meeting - September, 2014 - IU Representative attended the OAB 6th Plenary Meeting - September, 2015 - IU officially an organizational member Currently 3000 members from 102 countries

RDA Structure Council - Oversight Secretariat Staff - Administration Technical Advisory Board - Development, maintenance and evolution of the Technical Roadmap Organizational Advisory Board - Provides organizational advise to the council

Structure

Interest Groups Agriculture Data Active Data Management Plans Big Data Development of cloud computing capacity in world research Digital Practices in History and Ethnography Long Tail of Research Data Marine Data Harmonization Metabolomics Data Interoperability RDA/WDS Certification of Digital Repositories RDA/WDS Publishing Data Cost Recovery for Data Centers Biodiversity Data Integration Brokering Chemistry Research Data Data Fabric Domain Repositories Education and Training on handling of research data ELIXIR Bridging Force Engagement Metadata National Data Services PID Preservation e- Infrastructure Repository Platforms for Research Data Reproducibility Research data needs of the Photon and Neutron Science Community Data for Development Data Foundations and Terminology Ethics and Social Aspects Federated Identity Management Quality of Urban Life RDA/CODATA Legal Interoperability Research Data Provenance Service Management Data in Context Data Rescue Geospatial Libraries for Research Data RDA/CODATA Materials Data, Infrastructure, and Interoperability Structural Biology Vocabulary Services

Interest Groups Agriculture Data Active Data Management Plans Big Data Development of cloud computing capacity in world research Digital Practices in History and Ethnography Long Tail of Research Data Marine Data Harmonization Metabolomics Data Interoperability RDA/WDS Certification of Digital Repositories RDA/WDS Publishing Data Cost Recovery for Data Centers Biodiversity Data Integration Brokering Chemistry Research Data Data Fabric Domain Repositories Education and Training on handling of research data ELIXIR Bridging Force Engagement Metadata National Data Services PID Preservation e- Infrastructure Repository Platforms for Research Data Reproducibility Research data needs of the Photon and Neutron Science Community Data for Development Data Foundations and Terminology Ethics and Social Aspects Federated Identity Management Quality of Urban Life RDA/CODATA Legal Interoperability Research Data Provenance Service Management Data in Context Data Rescue Geospatial Libraries for Research Data RDA/CODATA Materials Data, Infrastructure, and Interoperability Structural Biology Vocabulary Services

Interest Groups Agriculture Data Active Data Management Plans Big Data Development of cloud computing capacity in world research Digital Practices in History and Ethnography Long Tail of Research Data Marine Data Harmonization Metabolomics Data Interoperability RDA/WDS Certification of Digital Repositories RDA/WDS Publishing Data Cost Recovery for Data Centers Biodiversity Data Integration Brokering Chemistry Research Data Data Fabric Domain Repositories Education and Training on handling of research data ELIXIR Bridging Force Engagement Metadata National Data Services PID Preservation e- Infrastructure Repository Platforms for Research Data Reproducibility Research data needs of the Photon and Neutron Science Community Data for Development Data Foundations and Terminology Ethics and Social Aspects Federated Identity Management Quality of Urban Life RDA/CODATA Legal Interoperability Research Data Provenance Service Management Data in Context Data Rescue Geospatial Libraries for Research Data RDA/CODATA Materials Data, Infrastructure, and Interoperability Structural Biology Vocabulary Services

Interest Groups Agriculture Data Active Data Management Plans Big Data Development of cloud computing capacity in world research Digital Practices in History and Ethnography Long Tail of Research Data Marine Data Harmonization Metabolomics Data Interoperability RDA/WDS Certification of Digital Repositories RDA/WDS Publishing Data Cost Recovery for Data Centers Biodiversity Data Integration Brokering Chemistry Research Data Data Fabric Domain Repositories Education and Training on handling of research data ELIXIR Bridging Force Engagement Metadata National Data Services PID Preservation e- Infrastructure Repository Platforms for Research Data Reproducibility Research data needs of the Photon and Neutron Science Community Data for Development Data Foundations and Terminology Ethics and Social Aspects Federated Identity Management Quality of Urban Life RDA/CODATA Legal Interoperability Research Data Provenance Service Management Data in Context Data Rescue Geospatial Libraries for Research Data RDA/CODATA Materials Data, Infrastructure, and Interoperability Structural Biology Vocabulary Services

Working Groups Array Database Brokering Governance Data Citation Data Description Registry Interoperability Data Foundations and Terminology Data Type Registries Metadata Standards Catalog Metadata Standards Directory PID Information Types Practical Policy QoS-DataLC Definitions RDA/CODATA Summer Schools in Data Science and Cloud Computing in the Developing World RDA/WDS Publishing Data Bibliometrics RDA/WDS Publishing Data Services RDA/WDS Publishing Data Workflows Repository Audit and Certification DSA-WDS Partnership Research Data Collections The BioSharing Registry Wheat Data Interoperability Security and Trust

Working Group Outputs

Data Foundations and Terminology Working Group No common language between data communities At some level of abstraction, management and organization of data is independent of content 21 data models and 120 interviews Defined a number of simple definitions and http://smw-rda.esc.rzg.mpg.de/index.php/ Main_Page

Data Type Registries Working Group Researchers can gather data from a wide variety of sources However, if the data is not typed it is often hard use and often discarded A data type registry allows data producers to record the implicit details of their data http://typeregistry.org

PID Information Types Working Group Several providers of Persistent Identifiers (PIDs) Thus serval different access methods (aka APIs) Checksums should be provided independently of the service provider Come up with a core set of information types Provide a common API Available as a prototype - documentation at https:// www.rd-alliance.org/group/pid-information-types-wg.html

Practical Policy Working Group Data repositories require highly automated, safe, and documented management strategies With increased complexity published data procedures and policies are required to build trust in operations Collect typical policies, evaluate them, and provide best practice solutions Policy Workbook being created by DATASET and EUDAT https://www.rd-alliance.org/group/practical-policy-wg.html

Scalable Dynamic Data Citation Working Group Existing tools and data collections were not developed considering long term sustainability Big data sets are not used in their entirety and data sets are not static making a need for data identification and citation mechanisms that identify arbitrary subsets Devise a precise, machine actionable identification of arbitrary sub selections of data at a given point RDA Recommendations for Data Citation of Evolving Data - https://rd-alliance.org/rda-wgdc-recommendationsvers-sep-24-2015.html

Data Description Registry Interoperability Working Group Data repositories and registries are fragmented across institutions, countries and research domains Creating a series of bi-lateral information exchange projects Open, extensible, and flexible cross-platform research data discovery software solutions RD-Switchboard - uses research graph technology to link datasets - work is ongoing

Metadata Standards Directory A common challenge with research datasets is inconsistent approaches to in applying metadata or metadata schemes Create a metadata standard directory for users not automated tools and lay the foundation for a machine understandable catalogue of metadata standards http://rd-alliance.github.io/metadata-directory/

Wheat Data Interoperability Working Group To produce an adequate amount of wheat, worldwide yield increase must go from 1% to 1.6% a year To tackle this many researchers are doing experiments in fields and laboratories Data standard harmonization is a priority in promoting research Recommendations at http://ist.blogs.inra.fr/wdi/ phenotypes/ and https://rd-alliance.org/groups/wheatdata-interoperability-wg.html

RDA/WDS Publishing Data Bibliometrics Working Group Statistical analysis of articles (bibliometrics) is essential to obtain measures of the quality and impact of research products Currently, it is not easy to apply these indicators to research outputs beyond academic journals Systems for bibliometrics do not scale and do not cope well with research data Data citation is getting better but still not ingrained in research communities We need a mechanism for evaluating the impact/use of datasets

Repository Audit and Certification DSA/WDS Partnership Working Group Certification standards and accreditation have been developed by DSA and WDS among others Both offer a basic certification standard for trustworthy digital repositories DSA focus on Humanities and Social Sciences, WDS focus on Earth and Space Science Plan to produce a harmonized catalog of criteria for certification of repositories along with a common set of procedures for repository assessment Proposal of a joint DSA-WDS Certification Procedure expected in Spring 2016

RDA/WDS Publishing Data Services Working Group No overarching standard to service to combine links between data sources and literature Bi-directional links provide visibility to both the publications and data along with context Building a universal interlinking service between data and scientific literature Prototype portal at http://dliservice.researchinfrastructures.eu/ full fledged demo tool scheduled for Spring of 2016

RDA/WDS Publishing Data Workflows Working Group Researchers are encouraged to publish research data for reuse but there are insufficient incentives An analysis of data publishing models and solutions was needed to establish a solid future of data publishing workflows http://zenodo.org/record/20308#.vlx63cqtrr8 More analysis to come

IU RT Goals Promote data sharing and exchange, interoperability, use and re-use, discovery and analysis, stewardship and preservation Knowledge sharing Leads on important data projects and research opportunities Interactions with funding agencies and insight to what these funding agencies expect in Big Data proposals Add value to IU Researcher Data!

What does IU RT want/need from RDA? While there is participation in RDA from SoIC and Libraries, the organizational member is IU RT What would add value to the data we collect, analyze, curate, share? I m prepared to sing carols, so I d suggest providing feedback now