Europe and its Open Science Cloud: the Italian perspective
Luciano Gaido (gaido@to.infn.it)
Plan-E meeting, Poznan, April 27 2017
Background
Italy has long-standing expertise and experience in the management of different types of e-infrastructures, dealing with:
- Network
- High Throughput Computing (HTC)
- High Performance Computing (HPC)
- Distributed data
This has been carried out in the frame of various national, European and international projects and initiatives.
The main actors are CINECA, GARR and INFN, but a number of other national research institutions are involved as well.
The GARR network
- More than 15,000 km of GARR-owned fibers
  - ~9,000 km of backbone
  - ~6,000 km of access links
- About 1000 user sites interconnected
- > 1 Tbps aggregated access capacity
- > 2 Tbps total backbone capacity
- 2x100 Gbps IP capacity to GÉANT
- Cross-border fibers with ARNES (Slovenia), SWITCH (Switzerland)
- > 100 Gbps to General Internet and Internet Exchanges in Italy
- NOC and engineering are in-house, in Rome
DATA, HPC & HTC Centres
- HPC: CINECA
- HTC: INFN, ReCaS, ENEA, GARR, etc.
All sites are connected to the GARR network with optical fibres and multiple 10 Gb/s links.
ICT & Cloud Infrastructure @ GARR
- 5 sites for a total of 8448 virtual CPUs and 10 PB of storage
- Federated Cloud, OpenStack based
Computing & Data infrastructure @ INFN (+ReCaS)
- TORINO: 2500 cores (27 kHS06), 2500 TB disk, 10 Gb/s network connectivity
- PISA: 12000 cores (125 kHS06), 2000 TB disk, 20 Gb/s network connectivity
- ROMA: 3172 cores (32 kHS06), 2160 TB disk, 10 Gb/s network connectivity
- MILANO: 2448 cores (23 kHS06), 1850 TB disk, 10 Gb/s network connectivity
- PADOVA/LEGNARO: 5200 cores (55 kHS06), 3000 TB disk, 20 Gb/s network connectivity
- CNAF/BOLOGNA: 21250 cores (221 kHS06), 22765 TB disk, 42000 TB tape, 80 Gb/s network connectivity
- BARI (INFN and UNIBA): 13000 cores (130 kHS06), 5000 TB disk, 20 Gb/s network connectivity
- COSENZA: 3500 cores (35 kHS06), 900 TB disk, 10 Gb/s network connectivity
- FRASCATI: 2000 cores (20 kHS06), 1350 TB disk, 10 Gb/s network connectivity
- NAPOLI (INFN and UNINA): 8440 cores (69 kHS06), 2805 TB disk, 20 Gb/s network connectivity
- CATANIA: 3000 cores (30 kHS06), 1500 TB disk, 20 Gb/s network connectivity
Resources, Research and Projects @ INFN
INFN Computing and Data infrastructure:
- Total cores: 76500 (765 kHS06)
- Disk Space: 46 PB
- Tape Space: 42 PB
Support to physics:
- LHC (ALICE, ATLAS, CMS, LHCb)
- AMS, BELLE II, KM3NET, OPERA
- ICARUS, VIRGO, ARGO, MAGIC
- Borexino, Xenon100 and more.
Main INFN projects related to Computing:
- INDIGO-DataCloud
- PRISMA
- EGI-Engage
- ExaNeST
- HNSCICloud
- Open City Platform (OCP)
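The headline totals can be cross-checked against the per-site figures given for the INFN (+ReCaS) sites. The following is a minimal sketch of that arithmetic; the `SITES` list and the `totals` helper are illustrative names, not part of the original slides, and 1 PB is assumed to equal 1000 TB. The per-site values sum to 76510 cores, 767 kHS06 and 45830 TB of disk, which agrees with the rounded totals of 76500 cores, 765 kHS06 and 46 PB.

```python
# Per-site figures copied from the INFN (+ReCaS) site list:
# (site, cores, kHS06, disk space in TB)
SITES = [
    ("TORINO", 2500, 27, 2500),
    ("PISA", 12000, 125, 2000),
    ("ROMA", 3172, 32, 2160),
    ("MILANO", 2448, 23, 1850),
    ("PADOVA/LEGNARO", 5200, 55, 3000),
    ("CNAF/BOLOGNA", 21250, 221, 22765),
    ("BARI", 13000, 130, 5000),
    ("COSENZA", 3500, 35, 900),
    ("FRASCATI", 2000, 20, 1350),
    ("NAPOLI", 8440, 69, 2805),
    ("CATANIA", 3000, 30, 1500),
]


def totals(sites):
    """Return (total cores, total kHS06, total disk in TB) over all sites."""
    cores = sum(site[1] for site in sites)
    khs06 = sum(site[2] for site in sites)
    disk_tb = sum(site[3] for site in sites)
    return cores, khs06, disk_tb


total_cores, total_khs06, total_disk_tb = totals(SITES)

# Assumption: decimal units, 1 PB = 1000 TB.
total_disk_pb = total_disk_tb / 1000

print(f"cores: {total_cores}, kHS06: {total_khs06}, disk: {total_disk_pb:.2f} PB")
```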
Current projects and synergies/1
Italy's position in leading-edge projects and activities at European and international level:
- PRACE:
  - Building a world-class pan-European High Performance Computing (HPC) service
  - CINECA is the Italian partner, hosting a Tier-0 centre
- EUDAT2020 (H2020 EINFRA-2014-1 call):
  - Sharing and preserving data across borders and disciplines
  - CINECA and INGV are the Italian partners
- WLCG/EGI infrastructures:
  - INFN is one of the main providers of data and computing resources for Grid and Cloud (EGI FedCloud) infrastructures
Current projects and synergies/2
- EGI-Engage (H2020 EINFRA-2014-1 call):
  - Accelerate the implementation of the Open Science Commons by expanding the capabilities of a European backbone of federated services for compute, storage, data, communication, knowledge and expertise, complementing community-specific capabilities
  - INFN is one of the main contributors and leads a Joint Research Unit (with INAF and INGV); CNR is also contributing
- OpenAIRE2020 (H2020 EINFRA-2014-1 call):
  - Promote open scholarship and substantially improve discoverability and reusability of research publications and data
  - CNR and CINECA are the Italian partners
Current projects and synergies/3
- INDIGO-DataCloud (H2020 EINFRA-2014-1 call):
  - Aims at developing a data and computing platform to fill the existing gaps in PaaS and SaaS services, deployable on multiple hardware platforms and provisioned over hybrid (private or public) e-infrastructures
  - INFN is the project coordinator; many Italian partners are involved: CNR, INAF, INGV, Reply, CIRMMP, CMCC, ICCU
- IPCEI-HPC-BDA (Important Project of Common European Interest on HPC and Big Data Enabled Applications):
  - Ensure European industrial sovereignty on key HPC technologies (necessary in terms of safety and security)
  - Support the development of new usages of HPC by industry
  - Grant access to world-class HPC facilities for public and private research
  - CNR, CINECA, ENEA, GARR and INFN are the Italian partners
- And many other projects...
Towards the EOSC/1
- EOSCpilot project (H2020 INFRADEV-04-2016 call):
  - A first step towards the development of the European Open Science Cloud
  - Main goals:
    - design and trial a stakeholder-driven governance framework
    - develop demonstrators of integrated services and infrastructures in a number of scientific domains, showcasing interoperability and its benefits
  - CNR and INFN are actively participating; INFN coordinates the interoperability pilots task
Towards the EOSC/2
- EOSC-hub proposal (H2020 EINFRA-12-2017):
  - Joint EGI, EUDAT and INDIGO-DataCloud proposal
  - Main goals:
    - consolidate digital infrastructures by expanding capacities and capabilities, improving discoverability, access, interoperability and sharing across research communities and countries
    - extend access to integrated compute, storage, data and software to new user groups, including higher education and industry, increasing the user base
  - Aims at providing the first EOSC implementation, through the EOSC-hub service catalogue
  - INFN is the second main partner; CINECA, CIRMMP, CMCC, CNR and INGV are participating
Plans at national level/1
- The successful collaboration among the main Italian actors, in the framework of both national and European projects, led to the creation of the Italian Grid Infrastructure
- There is no national Cloud infrastructure yet
- However, almost all research institutions are involved in projects related to Cloud or have built/are building their own Cloud infrastructure
- The National Programme for the Research Infrastructures (PNIR) for 2014-2020 has selected 56 Research Infrastructures (out of 97) to be sustained by various national funding streams, which should be complemented by European funds
Plans at national level/2
- The goal is to build a federated and interoperable national Cloud infrastructure based on:
  - the available tools (developed by INDIGO-DataCloud and other projects)
  - the existing collaboration and synergies among the main Italian Research Institutions and Agencies
- This is for the benefit of the Italian Research Communities (ranging from well-structured ones up to the Long Tail of Science) and of their participation in international projects and initiatives (including ESFRIs)
- It will also be an important contribution to the shaping of the European Open Science Cloud