EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

Similar documents
EUDAT. Towards a pan-european Collaborative Data Infrastructure

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

EUDAT Training 2 nd EUDAT Conference, Rome October 28 th Introduction, Vision and Architecture. Giuseppe Fiameni CINECA Rob Baxter EPCC EUDAT members

EUDAT. Towards a pan-european Collaborative Data Infrastructure - A Nordic Perspective? -

EUDAT and Cloud Services

EUDAT- Towards a Global Collaborative Data Infrastructure

Data management and discovery

EUDAT - Open Data Services for Research

The EUDAT Collaborative Data Infrastructure

I data set della ricerca ed il progetto EUDAT

EUDAT Common data infrastructure

EUDAT. Towards a Collaborative Data Infrastructure. Ari Lukkarinen CSC-IT Center for Science, Finland NORDUnet 2012 Oslo, 18 August 2012

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

EUDAT & SeaDataCloud

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Data Staging and Data Movement with EUDAT. Course Introduction Helsinki 10 th -12 th September, Course Timetable TODAY

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

European Collaborative Data Infrastructure EUDAT - Training on EUDAT Principles -

EUDAT & AAI. Daan Broeder MPI for Psycholinguistics

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

Data Discovery - Introduction

Using EUDAT services to replicate, store, share, and find cultural heritage data

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director

NorStore. a national infrastructure for scientific data. Andreas O Jaunsen UNINETT Sigma as

EGI federated e-infrastructure, a building block for the Open Science Commons

EUDAT Towards a Collaborative Data Infrastructure

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France

PID System for eresearch

Data publication and discovery with Globus

e-infrastructures in FP7 INFO DAY - Paris

USE CASES IN SEISMOLOGY. Alberto Michelini INGV

ODC and future EIDA/ EPOS-S plans within EUDAT2020. Luca Trani and the EIDA Team Acknowledgements to SURFsara and the B2SAFE team

e-infrastructure: objectives and strategy in FP7

irods workflows for the data management in the EUDAT pan-european infrastructure

Open Science Commons: A Participatory Model for the Open Science Cloud

Striving for efficiency

Petaflop Computing in the European HPC Ecosystem

Research Data Management & Preservation: A Library Perspective

Developing a social science data platform. Ron Dekker Director CESSDA

Towards a joint service catalogue for e-infrastructure services

European Open Science Cloud

EUDAT. Towards a pan-european Collaborative Data Infrastructure. KNMI Workshop, Utrecht, Netherlands

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

DCH-RP Trust-Building Report

Remote Workflow Enactment using Docker and the Generic Execution Framework in EUDAT

Persistent Identifiers for Audiovisual Archives and Cultural Heritage

Federated Identity Management for Research Collaborations. Bob Jones IT dept CERN 29 October 2013

Research Infrastructures and Horizon 2020

eresearch UCT Jason van Rooyen, PhD eresearch Analyst

GÉANT Services Supporting International Networking and Collaboration

Fundamentals of Data Infrastructures

Building a Dutch National Research Infrastructure IRODS UGM 2017

Europe and its Open Science Cloud: the Italian perspective. Luciano Gaido Plan-E meeting, Poznan, April

CLARIN s central infrastructure. Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna

The National Digital Library Finna Among Digital Research Infrastructures in Finland

WEB-BASED COLLECTION MANAGEMENT FOR LIBRARIES

Webinar Annotate data in the EUDAT CDI

Survey of Research Data Management Practices at the University of Pretoria

Cloud28+ Compliance in Cross Border Business

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud

Reporting on EOSC matters. Onur Temizsoylu TÜBİTAK ULAKBİM

Poland - e-infrastructure ecosystem and relation to EOSC

ASTRONOMY & PARTICLE PHYSICS CLUSTER

Greek e-infrastructures Short report

Certification. F. Genova (thanks to I. Dillo and Hervé L Hours)

Technical documentation. D2.4 KPI Specification

Towards FAIRness: some reflections from an Earth Science perspective

The Photon and Neutron Data Initiative PaN-data

Big Data infrastructure and tools in libraries

Version 11

Scientific Data Curation and the Grid

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

CARARE: project overview

An overview of the OAIS and Representation Information

B2SAFE metadata management

Digitising European industry

A national approach for storage scale-out scenarios based on irods

EGI Strategy Enabling collaborative data- and compute-intensive science

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

Federated Services and Data Management in PRACE

Electronic Records Archives: Philadelphia Federal Executive Board

Digital Single Market Technologies and Public Service Modernisation Package -DSM. Grazyna Wojcieszko DG CONNECT

Introduction

The challenges of (non-)openness:

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

Global Data Sharing The Research Data Alliance

ehealth Ministerial Conference 2013 Dublin May 2013 Irish Presidency Declaration

An Introduction to Digital Preservation

European Open Science Cloud Implementation roadmap: translating the vision into practice. September 2018

e-infrastructures in Horizon 2020 e-infrastructures for data and computing

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

The European Cloud Initiative and High Performance Computing (HPC) Teratec 2016

Writing a Data Management Plan A guide for the perplexed

Progress towards the EOSC

Long-term digital preservation of UNSWorks

Transcription:

EUDAT Data Services & Tools for Researchers and Communities Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

CSC IT CENTER FOR SCIENCE! Founded in 1971 as a technical support unit for Univac 1108! Connected Finland to the Internet in 1988! Reorganized as a company, CSC Scientific Computing Ltd. in 1993! All shares to the Ministry of Education and Culture of Finland in 1997! Operates on a non-profit principle! Facilities in Espoo, close to Otaniemi campus (of 15,000 students and 16,000 technology professionals) and Kajaani! Staff >250! Turnover 2013 31.2 million euros

Research, Where Is It Going? Research Infrastructure trends: Internationalisation Diversification Increasingly relying on on ICT Data deluge is a common challenge European RIs: Around 500 100 billion investment middle age 19th century 20th century 21st century 3

Complex Collaborations Complex Workflows Complex workflows encompassing experimenta4on, simula4on, analysis and publica4on! Data is the asset

ExponenAal growth Data Deluge Ze@abytes Exabytes Petabytes Terabytes Gigabytes Increasing complexity and variety Where to store it? How to find it? How to make the most of it? 5

Synergies If there are hundreds of Research Infrastructures, how many different data management systems can we sustain? 6 6

Common and Collaborative Data Infrastructure - A framework for the future? - Data Generators Users User functionalities, data capture & transfer, virtual research environments A SURFBOARD FOR RIDING THE WAVE TOWARDS A FOUR COUNTRY ACTION PROGRAMME ON RESEARCH DATA Data Curation Trust Community Support Services Data discovery & navigation, workflow generation, annotation, interpretability Common Data Services Persistent storage, identification, authenticity, workflow execution, mining

Consortium 8

9

Metadata Catalogue Aggregated EUDAT metadata domain. Data inventory Data Staging Safe Replica/on Simple Store Dynamic replication to HPC workspace for processing Selected Services Data curation and access optimization Researcher data store (simple upload, share and access) PID Identity Integrity Authenticity LocaAons AAI Network of trust among authentication and authorization actors New services to come EUDAT Box dropbox- like service easy sharing local synching Seman/c Anno checking & referencing Dynamic Data immediate handling

Safe Replication Service Robust, safe and highly available data replication service for small- and medium- sized repositories To guard against data loss in long-term archiving and preservation To optimize access for user from different regions To bring data closer to powerful computers for compute-intensive analysis PIDs Policy rules EUDAT CDI Domain of registered data http://eudat.eu/safe-replication eudat-safereplication@postit.csc.fi 11

Data Staging Service Support researchers in transferring large data collections from EUDAT storage to HPC facilities Reliable, efficient, and easy-to-use tools to manage data transfers Provide the means to reingest computational results back into the EUDAT infrastructure EUDAT CDI Domain of registered data PRACE HPC HPC http://eudat.eu/datastaging eudat-datastaging@postit.csc.fi 12

Simple Store Service Allow registered users to upload long tail data into the EUDAT store Enable sharing objects and collections with other researchers Utilise other EUDAT services to provide reliability and data retention Simple upload Simple metadata PID registraaon EUDAT CDI Domain of registered data http://eudat.eu/simplestore eudat-simplestore@postit.csc.fi 13

Metadata Service Easily find collections of scientific data generated either by various communities or via EUDAT services Access those data collections through the given references in the metadata to the relevant data stores Europeana of scientific data EUDAT CDI Domain of registered data http://eudat.eu/metadata eudat-metadata@postit.csc.fi 14

www.eudat.eu 15

EUDAT Site B PID Community Store irods GridFTP EUDAT Site A PID gridw p Data Managers EUDAT Site C OAI- PMH PID CiAzen scienasts Customised store For research communiaes and CiAzen ScienAsts h@p h@p Researchers OAI- PMH

Sustainable Community data sites General data centres Independent and sustainable centers working within a common framework to develop shared services & policies EUDAT is about providing solutions in a federated environment Partnerships between legal entities relying on OLAs and SLAs

What EUDAT Can Offer to a (Virtual) Research Community Additional storage capacities located at selected centers in Europe to keep pace with an accelerated generation of data Based on clear service offererings and SLAs Interoperability with European computing e- Infrastructures: HPC (PRACE) and Federated Cloud for Data Analysis (EGI) Open Data Sharing platform tailored for VRC stakeholders (researchers, citizen scientists, ) Dissemination and discoverability of the data through specific solutions to access the data and metadata catalogue 18

European General e-infrastructure Open Data Services Open PublicaAon and Discovery Data CompuAng High- performance CompuAng Federated Cloud for Data Analysis Federated Cloud Service Marketplace 19

eudat-info@postit.csc.fi 20

Acknowledgments European Commission: Riding the Wave, http://cordis.europa.eu/fp7/ict/e-infrastructure/ docs/hlg-sdi-report.pdf Knowledge Exchange: Surfboard for Riding the Wave, http://www.knowledge-exchange.info/ 21