EUDAT. Towards a pan-european Collaborative Data Infrastructure

Similar documents
EUDAT. Towards a pan-european Collaborative Data Infrastructure - A Nordic Perspective? -

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

EUDAT - Open Data Services for Research

EUDAT- Towards a Global Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure

The EUDAT Collaborative Data Infrastructure

I data set della ricerca ed il progetto EUDAT

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

European Collaborative Data Infrastructure EUDAT - Training on EUDAT Principles -

Using EUDAT services to replicate, store, share, and find cultural heritage data

EUDAT. Towards a Collaborative Data Infrastructure. Ari Lukkarinen CSC-IT Center for Science, Finland NORDUnet 2012 Oslo, 18 August 2012

EUDAT Training 2 nd EUDAT Conference, Rome October 28 th Introduction, Vision and Architecture. Giuseppe Fiameni CINECA Rob Baxter EPCC EUDAT members

Data management and discovery

EUDAT & AAI. Daan Broeder MPI for Psycholinguistics

EUDAT & SeaDataCloud

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

Data Discovery - Introduction

EUDAT Towards a Collaborative Data Infrastructure

EUDAT Common data infrastructure

EUDAT and Cloud Services

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

EUDAT. Towards a pan-european Collaborative Data Infrastructure. KNMI Workshop, Utrecht, Netherlands

European digital repository certification: the way forward

e-infrastructures in Horizon 2020 e-infrastructures for data and computing

Federated Identity Management for Research Collaborations. Bob Jones IT dept CERN 29 October 2013

Research Data Management & Preservation: A Library Perspective

DCH-RP Trust-Building Report

USE CASES IN SEISMOLOGY. Alberto Michelini INGV

dcache: challenges and opportunities when growing into new communities Paul Millar on behalf of the dcache team

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud

NorStore. a national infrastructure for scientific data. Andreas O Jaunsen UNINETT Sigma as

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Data Staging and Data Movement with EUDAT. Course Introduction Helsinki 10 th -12 th September, Course Timetable TODAY

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Horizon 2020 and the Open Research Data pilot. Sarah Jones Digital Curation Centre, Glasgow

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France

e-infrastructures in FP7 INFO DAY - Paris

e-infrastructure: objectives and strategy in FP7

Key Elements of Global Data Infrastructures

Striving for efficiency

Towards FAIRness: some reflections from an Earth Science perspective

BUILDING A NEW DIGITAL LIBRARY FOR THE NATIONAL LIBRARY OF AUSTRALIA

Ways to improve Global Data Sharing and Re-use

Data Staging: Moving large amounts of data around, and moving it close to compute resources

EOSC Services & Architecture: the EOSC-hub approach Tiziana Ferrari, Project Coordinator, EGI Founda?on

Progress towards the EOSC

OpenAIRE Open Knowledge Infrastructure for Europe

EGI federated e-infrastructure, a building block for the Open Science Commons

GEOSS Data Management Principles: Importance and Implementation

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

Digital Single Market Technologies and Public Service Modernisation Package -DSM. Grazyna Wojcieszko DG CONNECT

OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

DCH-RP and PREFORMA Two case studies on the digital preservation of cultural heritage

Science Europe Consultation on Research Data Management

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

ASTRONOMY & PARTICLE PHYSICS CLUSTER

Data publication and discovery with Globus

AARC. Christos Kanellopoulos AARC Architecture WP Leader GRNET. Authentication and Authorisation for Research and Collaboration

Persistent Identifier the data publishing perspective. Sünje Dallmeier-Tiessen, CERN 1

Making Open Data work for Europe

Assessing the FAIRness of Datasets in Trustworthy Digital Repositories: a 5 star scale

The Photon and Neutron Data Initiative PaN-data

Digital Curators: Who, What, & How

Data Staging: Moving large amounts of data around, and moving it close to compute resources

DEVELOPING, ENABLING, AND SUPPORTING DATA AND REPOSITORY CERTIFICATION

Europe and its Open Science Cloud: the Italian perspective. Luciano Gaido Plan-E meeting, Poznan, April

irods workflows for the data management in the EUDAT pan-european infrastructure

Indiana University Research Technology and the Research Data Alliance

European Open Science Cloud Implementation roadmap: translating the vision into practice. September 2018

European Open Science Cloud

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

Richard Marciano Alexandra Chassanoff David Pcolar Bing Zhu Chien-Yi Hu. March 24, 2010

OpenAIRE From Pilot to Service

Managing very large Multimedia Archives and their Integration into Federations

PASIG Directions & Issues

Scientific Data Curation and the Grid

BPMN Processes for machine-actionable DMPs

META-SHARE: An Open Resource Exchange Infrastructure for Stimulating Research and Innovation

Ensuring Proper Storage for Earth Science Data: The USGS Process to Certify Trusted Digital Repositories

An overview of the OAIS and Representation Information

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

PID System for eresearch

Records Management and Interoperability Initiatives and Experiences in the EU and Estonia

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

Towards a joint service catalogue for e-infrastructure services

e-research Infrastructures for e-science Axel Berg SARA national HPC & e-science support center RAMIRI, June 15, 2011

ODC and future EIDA/ EPOS-S plans within EUDAT2020. Luca Trani and the EIDA Team Acknowledgements to SURFsara and the B2SAFE team

Research Elsevier

Poland - e-infrastructure ecosystem and relation to EOSC

Guide to e-infrastructure Requirements for European Research Infrastructures

Applied Interoperability in Digital Preservation: Solutions from the E-ARK Project

EU projects for ehealth new ehealth activities in EU

INTAROS Integrated Arctic Observation System

e-infrastructures in FP7: Call 7 (WP 2010)

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

University of British Columbia Library. Persistent Digital Collections Implementation Plan. Final project report Summary version

Transcription:

EUDAT Towards a pan-european Collaborative Data Infrastructure Martin Hellmich Slides adapted from Damien Lecarpentier DCH-RP workshop, Manchester, 10 April 2013

Research Infrastructures Research Infrastructure trends: Internationalisation Diversification Increasingly relying on ICT Data deluge is a common challenge European Ris: Around 500 100 billion investment middle age 19th century 20th century 21st century 2

Exponential growth Data trends Zettabytes Exabytes Petabytes Terabytes Gigabytes Increasing complexity and variety Where to store it? How to find it? How to make the most of it? How to ensure interoperability? 3

Data Curation Trust Collaborative Data Infrastructure -A framework for the future? - Data Generators Users User functionalities, data capture & transfer, virtual research environments Community Support Services Data discovery & navigation, workflow generation, annotation, interpretability Common Data Services Persistent storage, identification, authenticity, workflow execution, mining

5

Data Centers and Communities 6

Five research communities on Board EPOS: European Plate Observatory System CLARIN: Common Language Resources and Technology Infrastructure ENES: Service for Climate Modelling in Europe LifeWatch: Biodoversity Data and Observatories VPH: The Virtual Physiological Human All share common challenges: Reference models and architectures Persistent data identifiers Metadata management Distributed data sources Data interoperability 7

Communities Data Centers

Building Blocks of the CDI EUDAT Portal Integrated APIs and harmonized access to EUDAT facilities Metadata Catalogue Aggregated EUDAT metadata domain. Data inventory Data Staging Safe Replication Simple Store Dynamic replication to HPC workspace for processing Data curation and access optimization Researcher data store (simple upload, share and access) AAI Network of trust among authentication and authorization actors

Infrastructure first pilots ENES VPH CLARIN EUDAT service provider Lifewatch Community service provider Safe Replication Data staging EPOS

Principles where we want to be (1) 1: Data deposited with the EUDAT CDI will be preserved in perpetuity Costs will (probably) have to fall on data producers (or their funders) at time of data deposit 2: Access to data in the EUDAT CDI is free at the point of use Charging users for access or use creates a barrier to use that runs counter to the principles of open access

Principles where we want to be (2) 3: Data are best curated in their own communities Data producers are central to discussions about its long term preservation 4: EUDAT will operate as a federation of community-facing repositories and back-office hosting providers EUDAT must operate as a partnership model with a distributed infrastructure hosting common services

Principles where we want to be (3) 5: EUDAT services and infrastructure must be a suitable target for Trustworthy Digital Repository TDR outsourcing (cf. datasealofapproval.org) opens the door for existing TDRs to join the EUDAT federation 6: EUDAT will not assert ownership of any data that it holds

Work plan Moving the services to a production environment Integrating new partners to EUDAT (in particular research communities) Working groups, pilots, observers and associate partners Collaborating with other initiatives European e-infrastructures: EGI, PRACE, DANTE, HELIX NEBULA, SCIDIPS-ES, TERENA, etc. Global initiatives: RDA, CODATA, etc Defining EUDAT s path to sustainability Cost and funding models Governance 14

Your Questions How to submit feedback? EUDAT User Forums, service contact lists http://eudat.eu/2nd-eudat-user-forum -> Fact sheets We welcome Use Cases from DCH-RP! We want to know how you want to be involved D7.2.1: Managing data curation and long-term preservation in a federated environment (not avail. yet) Contact: Damien Lecarpentier damien.lecarpentier@csc.fi 15

eudat-info@postit.csc.fi 16

Hierarchy of data needs Value creation, though openness and sharing Combining data Sharing data Combining data in imaginative new ways to solve problems Sharing data across communities relies on more basic data needs being met first Improving data usability and reusability Keeping data safe and accessible Storing and archiving data Metadata catalogue, persistent identifiers Data security, data standards, data curation Data archive, data storage facilities