De BiG Grid e-infrastructuur digitaal onderzoek verbonden

Similar documents
Grid: data delen op wereldschaal

Grid Computing: dealing with GB/s dataflows

Grid Computing: dealing with GB/s dataflows

e-research Infrastructures for e-science Axel Berg SARA national HPC & e-science support center RAMIRI, June 15, 2011

Grid Computing a new tool for science

Computing grids, a tool for international collaboration and against digital divide Guy Wormser Director of CNRS Institut des Grilles (CNRS, France)

Travelling securely on the Grid to the origin of the Universe

Moving e-infrastructure into a new era the FP7 challenge

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN)

CERN openlab II. CERN openlab and. Sverre Jarp CERN openlab CTO 16 September 2008

The LHC Computing Grid. Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008

IEPSAS-Kosice: experiences in running LCG site

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Netherlands Institute for Radio Astronomy. May 18th, 2009 Hanno Holties

EGI: Linking digital resources across Eastern Europe for European science and innovation

Accelerating Throughput from the LHC to the World

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

The LHC Computing Grid

Storage and I/O requirements of the LHC experiments

CERN and Scientific Computing

The Grid: Processing the Data from the World s Largest Scientific Machine

Insight: that s for NSA Decision making: that s for Google, Facebook. so they find the best way to push out adds and products

CineGrid GRID & Networking

Big Data - Some Words BIG DATA 8/31/2017. Introduction

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

CSCS CERN videoconference CFD applications

Overview. About CERN 2 / 11

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

The grid for LHC Data Analysis

CC-IN2P3: A High Performance Data Center for Research

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

High Performance Computing Data Management. Philippe Trautmann BDM High Performance Computing Global Research

The Grid. Processing the Data from the World s Largest Scientific Machine II Brazilian LHC Computing Workshop

Virtualizing a Batch. University Grid Center

High Performance Computing from an EU perspective

Distributed e-infrastructures for data intensive science

High-Energy Physics Data-Storage Challenges

Physics Computing at CERN. Helge Meinhard CERN, IT Department OpenLab Student Lecture 27 July 2010

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

DT-ICT : Big data solutions for energy

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

BiG Grid HPC Cloud Beta

Chris Dwan - Bioteam

Connecting the e-infrastructure chain

PROJECT FINAL REPORT. Tel: Fax:

Irish and European Grid Projects

Grid Computing at the IIHE

Grid computing: yesterday, today and tomorrow?

Towards a Strategy for Data Sciences at UW

EGEE and Interoperation

HealthGrids: In Search for Sustainable Solutions

High-Performance Computing Europe s place in a Global Race

Figure 1: cstcdie Grid Site architecture

e-infrastructure: objectives and strategy in FP7

VLBI Participation in Next Generation Network Development

APAN Global Collaboration Linking the World with Light

The European DataGRID Production Testbed

Data Centres in the Virtual Observatory Age

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Fact sheet AMSTERDAM DATA CENTRE CAMPUS. Connect, transact and grow

SURFsara Data Services

Some Reflections on Advanced Geocomputations and the Data Deluge

Connectivity Services, Autobahn and New Services

Gigabyte Bandwidth Enables Global Co-Laboratories

Učešće srpskih naučnika u CERN-u. Dušan Vudragović Laboratorija za primenu računara u nauci Institut za fiziku, Beograd, Srbija

Data Intensive Science Impact on Networks

CSD3 The Cambridge Service for Data Driven Discovery. A New National HPC Service for Data Intensive science

Deep Storage for Exponential Data. Nathan Thompson CEO, Spectra Logic

High Performance Computing Course Notes Grid Computing I

SCTE Event. Metro Network Alternatives 5/22/2013

Commercial Partnerships for Exploration: Making Small Lunar Missions Viable

SARA Overview. Walter Lioen Group Leader Supercomputing & Senior Consultant

NRENs as Enablers of eresearch

Scientific Data Curation and the Grid

EU Research Infra Integration: a vision from the BSC. Josep M. Martorell, PhD Associate Director

Buy don t Build. Use don t Manage.

National R&E Networks: Engines for innovation in research

2017 Resource Allocations Competition Results

Operating the Distributed NDGF Tier-1

Introduction to National Supercomputing Centre in Guangzhou and Opportunities for International Collaboration

Renovating your storage infrastructure for Cloud era

Acknowledgements. Beyond DBMSs. Presentation Outline

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

Introduction to Grid Computing

Astronomy and Big Data challenge. Sandra Jaque, CTO, REUNA

The creation of a Tier-1 Data Center for the ALICE experiment in the UNAM. Lukas Nellen ICN-UNAM

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

Big Computing and the Mitchell Institute for Fundamental Physics and Astronomy. David Toback

CESGA: General description of the institution. FINIS TERRAE: New HPC Supercomputer for RECETGA: Galicia s Science & Technology Network

Journey Towards Science DMZ. Suhaimi Napis Technical Advisory Committee (Research Computing) MYREN-X Universiti Putra Malaysia

Fact sheet COPENHAGEN DATA CENTRE CAMPUS. Connect, transact and grow

Ivane Javakhishvili Tbilisi State University High Energy Physics Institute HEPI TSU

Pan-European Grid einfrastructure for LHC Experiments at CERN - SCL's Activities in EGEE

THE CURRENT STATE OF DATA CENTER ENERGY EFFICIENCY IN EUROPE. White Paper

Challenges of Big Data Movement in support of the ESA Copernicus program and global research collaborations

Programmable Information Highway (with no Traffic Jams)

Advancing European R&E through collaboration

Pre-Commercial Procurement project - HNSciCloud. 20 January 2015 Bob Jones, CERN

Transcription:

Graphics: Real Time Monitor, Gidon Moont, Imperial College London, see http://gridportal.hep.ph.ic.ac.uk/rtm/ De BiG Grid e-infrastructuur digitaal onderzoek verbonden David Groep, Nikhef KennisKring Amsterdam David Groep, NIKHEF 28 october 2009 (abbreviated version)

e-infrastructure for Research World Wide Web (1990) sharing information Grid (1997) sharing computers and storage Clouds (2007) commoditizing the Grid more than one place on earth more than one computer more than one science! more than

And Why Do We Need It? Enhanced Science needs more and more computations and Collected data in science and industry grows exponentially The Bible 5 MByte Your own digital photographs 5 MByte/image Bio-informatics databases 500 GByte each Refereed journal papers 1 TByte/yr Satellite world imagery 5 TByte/yr Large Synoptic Survey Telescope 30 Tbyte/day Internet Archive 1996-2002 100 Tbyte Web downloads for Google indexing 4 PByte/yr Large Hadron Collider physics 20 PByte/yr Astronomy tomorrow: SKA 365 PByte/yr 1 Petabyte = 1 000 000 000 Megabyte

Balloon (30 Km) Computing for Sub-Atomic Physics Example: the Large Hadron Collider looking at the fundamental forces of nature CD stack with 1 year LHC data! (~ 20 Km) 27 km circumference Located at CERN, Geneva, CH Concorde (15 Km) ~ 20 000 000 Gigabyte per year ~ 60 000 modern PC-style computers Mt. Blanc (4.8 Km)

Work regardless of geographical location, interact with colleagues, share and access data Beyond the Web: Grids for e-science Grid enabling distributed collaboration Software enables resource sharing Community building and collaboration Access resources at any place Move your work around the world, or in to the Cloud Scientific instruments, libraries and experiments provide huge amounts of data Graphic: Federico.Carminati@cern.ch

Wide-area In-Silico Docking On Malaria 47 sites 15 countries 3000 CPUs 12 TByte disk over 46 million ligands virtually docked on malaria and H5N1 avian flu viruses in less than a month

image sources: EGEE ESR VO, EGEE BioMedical VO, EU FP5 CrossGrid project, Nikhef Science and Corporate Grids Big science is not alone Medical imaging Aerospace modelling air flow and stress Finance rapid what-if analyses (oops!) Climate modelling Flood prediction But although the parallelism is convenient, managing complexity in a large-scale environment is not and cooling and power constraints limit the data centre the grid could help do the work in the greenest place

Making the Grid the persistent e-infrastructure Different Communities build Different Grids

Finance Pharma Aerospace Cinema Large clusters Dedicated or Virtual Private Networks Enterprise Grids the Cloud Amazon Google SalesForce.com... Commoditized services Available to SMEs and even consumers Different name, same concept! Backup-as-a-Service Software-as-a-Service Infrastructure-as-a-Service

Contributed Volunteer Computing Many applications fit a client-server model it does not matter where the computer or data is and if you have mainly compute tasks and little data, even idle home PCs can contribute compute power although network bandwidth is limited Pioneered ~ 1996 by SETI@home and distributed.net BOINC: generic middleware for volunteer grids: 2005 go to boinc.berkeley.edu for information and links to projects

Lab A Conveniently Parallel Computing Find ligands from the bowl that match the molecule! I can try all of them in parallel! Computer centre B Grid Site Grid Site Grid Site Grid Site Does it fit? scientific and research cluster grid computing

Nikhef (NDPF) 2550 processor cores 390 000 GByte disk 3x10000 Mbps networks SARA (GINA+LISA) ~2900 processor cores 850 000 GByte disk 1 500 000 GByte tape 4x 10 000 Mbps networks Philips Research Ehv 416 processor cores 126 000 GByte disk 1 000 Mbps networks RUG-CIT (Grid) > 200 processor cores 34 000 GByte disk 10 000 Mbps networks

there s always fibre within 2 miles from you where ever you are in the Netherlands it s just that last mile to your home that s missing and a business model for your telecom provider Enabling the Grid the Network LHC Optical Private Network 10 000 Mbps dedicated global networks

Applications beyond Big Science or: can the Grid help me?

Image sources: VL-e Consortium Partners Virtual Laboratory for e-science Data integration for genomics, proteomics, etc. analysis Timo Breit et al. Swammerdam Institute of Life Sciences Avian Alert and FlySafe Willem Bouten et al. UvA Institute for Biodiversity Ecosystem Dynamics, IBED in ESA, SARA collaboration Molecular Cell Biology and 3D Electron Microscopy Medical Imaging and fmri Silvia Olabarriaga et al. AMC and UvA IvI Bram Koster et al. LUMC Microscopic Imaging group

Grid Infrastructures Work! Number of active VOs in EU since 2004 260 distinct communities in Europe Compute usage since 2004 by VO >12 million hrs/mo! www.biggrid.nl data: EGEE monitoring, RAL and CESGA, http://goc.grid-support.ac.uk/gridsite/accounting/ over 35 science communities hosted in NL

Energizing the Data Centre the next challenge Floor space is no longer the limiting factor in data centres, its energy! Rating of the power socket dominates the cost of in commercial housing We probably have the largest number, world-wide, of the energy efficient L5520 Intel CPUs at the Amsterdam Science Park Our BiG Grid tenders drive vendors to offer the latest in Green IT hardware Energy consumption is as important as investment price And maybe... Grid computing, and fast networks, can help move energy-intensive calculations to where energy is plentiful... and it could even follow the sun It s important to be where the network is better: be right on top!... and we are powered by green electricity from renewable sources...

http://www.vl-e.nl/ http://www.biggrid.nl/ http://www.nikhef.nl/grid/