Future of Grid Computing

Similar documents
An e-voyage showing Single Sign-on via Shibboleth in Action

Grid Computing (M) Richard Sinnott

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Introduction to Grid Computing

The LHC Computing Grid

e-infrastructures in FP7 INFO DAY - Paris

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Enabling Grids for E-sciencE. EGEE security pitch. Olle Mulmo. EGEE Chief Security Architect KTH, Sweden. INFSO-RI

einfrastructures Concertation Event

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

e-infrastructure: objectives and strategy in FP7

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

Grids and Security. Ian Neilson Grid Deployment Group CERN. TF-CSIRT London 27 Jan

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Moving e-infrastructure into a new era the FP7 challenge

Andrea Sciabà CERN, Switzerland

Review of OSI Report Developing the UK s e-infrastructure for Science and Innovation

Workflow, Planning and Performance Information, information, information Dr Andrew Stephen M c Gough

An Open Grid Services Architecture for Mobile Network Operators

The LHC Computing Grid

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Grid Scheduling Architectures with Globus

EGEE and Interoperation

<Insert Picture Here> Enterprise Data Management using Grid Technology

High Performance Computing from an EU perspective

Travelling securely on the Grid to the origin of the Universe

Juliusz Pukacki OGF25 - Grid technologies in e-health Catania, 2-6 March 2009

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi

Managing CAE Simulation Workloads in Cluster Environments

ELFms industrialisation plans

Research and Design Application Platform of Service Grid Based on WSRF

Globus GTK and Grid Services

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

e-science Gap Analysis

The EGEE-III Project Towards Sustainable e-infrastructures

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

Presentation Title. Grid Computing Project Officer / Research Assistance. InfoComm Development Center (idec) & Department of Communication

Operating the Distributed NDGF Tier-1

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

glite Grid Services Overview

Deposited on: 10 September 2009

CS6703 GRID AND CLOUD COMPUTING. Question Bank Unit-I. Introduction

the steps that IS Services should take to ensure that this document is aligned with the SNH s KIMS and SNH s Change Requirement;

EUDAT- Towards a Global Collaborative Data Infrastructure

The SweGrid Accounting System

GÉANT Community Programme

"Charting the Course... Certified Information Systems Auditor (CISA) Course Summary

Service Level Agreement Metrics

RESEARCH DATA DEPOT AT PURDUE UNIVERSITY

Service Oriented Architectures Visions Concepts Reality

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

Supporting Security-Oriented, Collaborative nanocmos Electronics Research

Access the power of Grid with Eclipse

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

Advanced Grid Technologies, Services & Systems: Research Priorities and Objectives of WP

Distributed Data Management with Storage Resource Broker in the UK

CSD3 The Cambridge Service for Data Driven Discovery. A New National HPC Service for Data Intensive science

Geoffrey Fox Community Grids Laboratory Indiana University

The Virtual Observatory and the IVOA

DIMETRA X CORE DATA SHEET DIMETRA X CORE

Computing as a Service

ATLAS NorduGrid related activities

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

APAC: A National Research Infrastructure Program

Tutorial 1: Introduction to Globus Toolkit. John Watt, National e-science Centre

Transformational Projects to Remain Globally Competitive. Dr Mary Davies, University Librarian & Director (Information Management)

Multiple Broker Support by Grid Portals* Extended Abstract

CHAPTER 2 LITERATURE REVIEW AND BACKGROUND

On the EGI Operational Level Agreement Framework

Deploying virtualisation in a production grid

A VO-friendly, Community-based Authorization Framework

The Social Grid. Leveraging the Power of the Web and Focusing on Development Simplicity

S.No QUESTIONS COMPETENCE LEVEL UNIT -1 PART A 1. Illustrate the evolutionary trend towards parallel distributed and cloud computing.

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Distributed Repository for Biomedical Applications

- C3Grid Stephan Kindermann, DKRZ. Martina Stockhause, MPI-M C3-Team

The Canadian CyberSKA Project

The Grid: Processing the Data from the World s Largest Scientific Machine

Grid Services and the Globus Toolkit

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE

Trust and Certification: the case for Trustworthy Digital Repositories. RDA Europe webinar, 14 February 2017 Ingrid Dillo, DANS, The Netherlands

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

Computing grids, a tool for international collaboration and against digital divide Guy Wormser Director of CNRS Institut des Grilles (CNRS, France)

Conducting a Self-Assessment of a Long-Term Archive for Interdisciplinary Scientific Data as a Trustworthy Digital Repository

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

Database Assessment for PDMS

HPC SERVICE PROVISION FOR THE UK

Introduction to National Supercomputing Centre in Guangzhou and Opportunities for International Collaboration

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft. Presented by Manfred Alef Contributions of Jos van Wezel, Andreas Heiss

European Grid Infrastructure

Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services. Patrick Wendel Imperial College, London

Experiences of Applying Advanced Grid Authorisation Infrastructures

Garuda : The National Grid Computing Initiative Of India. Natraj A.C, CDAC Knowledge Park, Bangalore.

Distribution and web services

N. Marusov, I. Semenov

An Experience in Accessing Grid Computing from Mobile Device with GridLab Mobile Services

Potential for Technology Innovation within the Internet2 Community: A Five-Year View

Transcription:

Future of Grid Computing Dr Richard Sinnott http://csperkins.org/teaching/2004-2005/gc5/

Future of Grid Computing Overview Classifications of Grid Computing Future and challenges of different classifications Technology and standards gaps OGSA WSRF WS-? Security Key application domains for the future Funding and industry

Classifications of Grid Computing Compute Grids Run multiple jobs with distributed compute and data resources Data Grids Grids focused on finding, managing and generally dealing with large distributed data resources Complexity Grids Hybrid combination of Data and Compute Grid Campus Grids Grid supporting University community computing Enterprise Grids Grid supporting a company s enterprise infrastructure Semantic Grids Integration of Grid and Semantic Web meta-data and ontology technologies Lightweight Grids Grid designed for rapid deployment and minimum maintenance Collaboration Grids Grid supporting collaborative tools like the Access Grid, whiteboard and shared application. Autonomic Grids Fault tolerant and self-healing Grids Others possible too, e.g. P2P Grids,

Classifications of Grid Computing Compute Grid Data Grid Complexity Grid Campus Grid Enterprise Grid Semantic Grid Peer-to-peer Grid Lightweight Grid Collaboration Grid Autonomic Grid

Compute Grid Future Compute Grid Run multiple jobs with distributed compute and data resources Aspects of Compute Grids well supported, e.g. Condor Future application domains Physics (monte carlo simulations, statistical physics ) Engineering (simulations, optimising designs, ) Life sciences (high throughput BLASTing, ) Future of Compute Grids likely to include Robust middleware for» information services» resource brokering services» Meta-job scheduling» fault tolerance, dependability» security» on heterogeneous infrastructure

Data Grid Future Data Grid Grids focused on finding, managing and generally dealing with large data resources Application domains Particle physics» LHC collider Astrophysics» Worldwide virtual observatories Life sciences are likely to form own data grids in the near future» Issue of developing domain data is the primary driver for Grids in the future? Future of Data Grids likely to include robust and stable middleware for accessing and integrating scientific data enhanced services for replication (e.g. faster as per PP domain experiences) advanced and stable meta-data catalogues (per application domain) fast and robust data movement services closer integration with security services data curation and provenance services» more understanding of longevity of data

Complexity Grid Future Complexity Grids Hybrid combination of Data and Compute Grid (= Grid vision?) Much to be done to realise these Grids right now Scientific need and agreements (e.g. on naming) coupled with advanced software engineering Note could be argued that can build complexity Grids right now But not without considerable cost» In future should be able to dynamically (and quickly!) establish VOs using both Data and Compute Grid technologies

Campus Grid Future To co-ordinate resources and people across campus Likely to involve various faculties At Glasgow it will involve ALL faculties Heterogeneous resources Co-ordination of people critical Why? Sharing resources/people is only way forward Cost benefits Resilience of resources, people Being organised is extremely beneficial to the university E-Science = big business Quick and conservative estimate of 13.5M+ monies TO GLASGOW from e-science

Campus Grid at Glasgow Consolidation of resources Story started with building around ScotGrid Providing shared Grid resource for wide variety of scientists inside/outside Glasgow HEP, CS, BRC, EEE,» Target shares established» Non-contributing groups encouraged Hardware 59 IBM X Series 330 dual 1 GHz Pentium III with 2GB memory 2 IBM X Series 340 dual 1 GHz Pentium III with 2GB memory 3 IBM X Series 340 dual 1 GHz Pentium III with 2GB memory and 100 + 1000 Mbit/s ethernet 1TB disk LTO/Ultrium Tape Library Cisco ethernet switches New.. IBM X Series 370 PIII Xeon with 32 x 512 MB RAM 5TB FastT500 disk 70 x 73.4 GB IBM FC Hot-Swap HDD edikt 28 IBM blades dual 2.4 GHz Xeon with 1.5GB memory edikt 6 IBM X Series 335 dual 2.4 GHz Xeon with 1.5GB memory CDF 10 Dell PowerEdge 2650 2.4 GHz Xeon with 1.5GB memory CDF 7.5TB Raid disk ScotGrid [ Disk ~15TB CPU ~ 255 1GHz ] Over 1 million CPU hours completed (June 2004) Over 100,000 jobs completed Includes time out for major rebuilds Typically running at ~90% usage

Glasgow e-science Infrastructure Future Plans But not enough Computer Services second HPC facility (128 processor) (being procured) University SAN (50TB 25TB mirrored at separate locations across campus) (being procured) ~ 850k investment» Expected usage first quarter 2005 Access to campus wide resources Physics and astronomy training lab condor pools NeSC training lab condor pool Computer services??? EEE compute clusters and larger SMP machines??? others??? NGS proposals for Scottish Grid Service infrastructure SBRN (and other) equipment funds

Glasgow e-science People Plans E-Science Strategy/Business Plan Research Computing Director (new position) E-Science lectureship (new position) E-Science applications co-ordinator (moi) Underwriting of existing staff contracts Future running costs To be funded through contributions from university wide e-science activities and university Strategic Investment Funds university wide engagement and support of e-science

Glasgow e-science Organisational Aspects Essential to nurture relationships across university at ALL levels Social issues almost as important for Grid success as technical issues Steering Committee Fac X Fac X Management Board Fac X User Groups Technical Board Fac X

Enterprise Grid Future Enterprise Grid Grid supporting a company s enterprise infrastructure Major corporations? Need to establish dynamic virtual organisations? More organised (???) Trust of open source Grid solutions? Private Networks and Intranets GSK Have full time staff for keeping up to date copies of ALL bio-data sets SMEs more likely How many need access to HPC facilities? More likely need tailored access to numerous data sets, or mgt of internal operations, e.g. FirstDIG project Maturity of Grid software Needs to evolve into mature product before adoption likely Security!!!!!

Semantic Grid Future Semantic Grid Integration of Grid and Semantic Web metadata and ontology technologies extension of current Grid in which information and services are given well-defined meaning, better enabling computers and people to work in cooperation Semantics are key to virtualisation and abstraction in the Grid WHY NEEDED?

The Semantic Grid Problem mygrid Wish to reuse Data Services Knowledge Software Combechem Anticipated use Unanticipated use Taken from De Roure/Goble talk

Semantic Grid Future Numerous activities on-going in this area Development of ontologies (per application domain) Languages used to capture semantics Prototype toolsets www.semanticgrid.org

Lightweight Grid Future Lightweight Grids Simple solutions that work with minimal complexity for installation and usage on a variety of infrastructures/os (see Chin/Coveney paper) Needed if more heavyweight Grids easier to install/use? Essential for current would be Grid adopters No major re-engineering to get a piece of code to run on the Grid EGEE working towards this glite LCG-1 LCG-2 Globus 2 based glite-1 glite-2 Web services based

Collaboration Grid Future Grid supporting collaborative tools like Access Grid, whiteboards etc Crucial to support e-science not just about connecting machines/data sources Useful for virtual F2F meetings/seminars, but Should be linked to rest of Grid infrastructure also allow scientists to view same simulations, models and decide what to do in real time aka computational steering Trialled in projects like RealityGrid but still not mainstream Security issues of supporting this also

Autonomic Grid Future Autonomic Grids Fault tolerant and self-healing Grids Yet another possible Grid of the (distant?) future Once we know how to build fault tolerant, dependable Grids Then look towards automating the processes by which a Grid can manage itself Look towards optimised Grid solutions, e.g. for load balancing as first step towards autonomic computing Profiling of Grids to user demands, e.g. data replication based upon general usage

Future of OGSA Framework Portals Applications Problem Solving Environments Higher Level Services Job Submission Registry Scheduling Brokering Accounting Usage/Mon. Workflow Provenance Management Logging Data Transport Grid Service Infrastructure Security Infrastructure Hosting Environments, OS, Compute, Data and Storage Resources Data Integration Relational DB OGSA-DAI Data Access XML DB Semistructured

Future of OGSA Technologies and standards evolving Move from OGSI to Web Service Resource Framework (WSRF) Preserves OGSI functionality Lifetime, properties, notification, error types, Separates service from resource Service is static and stateless Resource is dynamic and stateful Builds on WS-Addressing Is WS-Interoperability standards compliant

Justification (?) for WSRF Borrowed from Malcolm Atkinson talk Borrowed from Malcolm Atkinson talk

Timelines for Grid Computing Plus EGEE OMII NGS Borrowed from Malcolm Atkinson talk

Future of Grid Security Security Grid should not weaken existing security infrastructures Scaling up Grid and security to masses needed Overcome issues with PKI take away X.509 certificates from users?» e.g. more widespread usage of MyProxy Need AAA and best practice guidelines in how to establish Grid security Need better tools to set up and manage security infrastructures (experiences from Problem set 2) What to do in case of incidents?» SC2004 and FBI session!

Future of Grid Security ctd AAA projects (DyVOSE) ScotGrid GU Condor pool Other (known!) Grid resources Education VO policies PERMIS based authorisation Authorisation checks Authorisation decisions

Future of Grid Security ctd ScotGrid Glasgow Condor pool Blue Dwarf Edinburgh Dynamically established VO resources/users Delegated VO policies Glasgow Education VO policies Shibboleth Edinburgh Education VO policies PERMIS based Authorisation checks/decisions

Future of Grid Security ctd Local certificate allocation e.g. to students when matriculating Used for authentication and authorisation Local monitoring and policy enforcement Across UK academia that this model together with Shibboleth for security aspects of inter-site sharing of resources will be taken up Shibboleth: A word which was made the criterion by which to distinguish the Ephraimites from the Gileadites. The Ephraimites, not being able to pronounce sh, called the word sibboleth. Hence, the criterion, test, or watchword of a party; a party cry or pet phrase. Will be seeing more Shibbleth origin, target, WAYF services

Future Key Directions for e-science Physics production level services for particle physics in 2007 astrophysics, gravitational waves,... Chemistry Molecular simulations, structure visualisations,... Engineering Mechanical, civil, electronic, aerospace,... Computing science (users and developers) Grid development, usage of Grid, e.g. for information retrieval experiments,... Social sciences Data everywhere, information retrieval,... Environmental sciences Earth modelling, pollution control,...... Life sciences biggest driver for future? Data data everywhere, linkage to clinical, medical, genomic,... Who sees it, who can change it,... What does is the significance of this data in comparison to...

Bioinformatics Grid Needs Workflow / Virtual Organisation Needs WSDL descriptions, Semantic grid, OGSA_DAI/DAIT, IBM DiscoveryLink, Grid engineering (scheduling, resource reservation, workflow enactment, ) National Data curation centre BioInf community, Database schemas, UDDI repositories, BioInf portals, Single sign on authentication, Granularity of authorisation

Future Funding for Grids/e-Science e-science/grid happening and influencing all research / faculties at Glasgow DTI call for 100M out on 29 th November Looking towards aligning the following technology themes: design, simulation & modelling pervasive computing nanotechnology imaging smart manufacturing knowledge transfer networks April 2005 call for 150M pounds Government SR allocated 115M for e-science Spread across ALL funding councils PPARC, EPSRC, MRC, BBSRC, NERC, ESRC, SHEFC,... Plus 150M for each of the three Scottish ITIs TechMedia, Life Sciences, Energy Plus European monies Framework programmes,...

Future of Grid Computing Conclusions e-science/grid has been big business for university It will in some shape or form continue to be so Grid has numerous open research areas that have still to be addressed Is it novel? Perhaps from the applications side...