The European DataGRID Production Testbed

Franck Bonnassieux
CNRS/UREC, ENS-Lyon, France
DataGrid Network Work Package Manager
Franck.Bonnassieux@ens-lyon.fr

Presentation outline
- General DataGrid project status
  - Numbers and assets
  - Testbeds and Applications
  - Quality and validation
  - Summary and last project year
- Network activities
  - Monitoring
  - Transports and Services
  - High Speed Transfers
  - QoS
  - NetworkCost Suite
- Perspectives

19-22 May 2003, The European DataGRID Production Testbed

DataGrid in Numbers
People
  - >350 registered users
  - 12 Virtual Organisations
  - 16 Certificate Authorities
  - >200 people trained
  - 278 man-years of effort (100 years funded)
Testbeds
  - >15 regular sites
  - >40 000 jobs submitted
  - >1000 CPUs
  - >5 TeraBytes disk
  - 3 Mass Storage Systems
Software
  - 50 use cases
  - 18 software releases
  - >300K lines of code
Scientific applications
  - 5 Earth Obs institutes
  - 9 bio-informatics apps
  - 6 HEP experiments

Current Project Status
- EDG currently provides a set of middleware services:
  - Job & Data Management
  - Grid & Network monitoring
  - Security, Authentication & Authorization tools
  - Fabric Management
- EDG release 1 is currently deployed to the EDG testbeds
  - ~15 sites in the application testbed, actively used by application groups
  - Core sites: CERN (CH), RAL (UK), NIKHEF (NL), CNAF (I), CC-Lyon (F)
- EDG software is also deployed at a total of ~40 sites via CrossGrid, DataTAG and national grid projects
- Many applications have been ported to EDG testbeds and are actively being used
- Intense middleware development is ongoing

DataGrid Assets
- Testbeds available throughout the year
  - Have gone further than any other project in providing a continuous, large-scale grid facility
- Innovative middleware
  - Resource Broker
  - Replica Location Service (joint development with Globus) and layered data management tools (Replica Manager & Optimizer)
  - R-GMA Information and Monitoring System
  - Automated configuration and installation tools
  - Access to diverse mass storage systems
  - VOMS security model
- Distributed team of people across Europe that can work together effectively to produce concrete results
- Application groups are an integral part of the project, contributing to all aspects of the work

Testbeds
- Application Testbed: End-user Applications
  - Software: stable, certified release (EDG 1.4.3)
- Certification Testbed: Extended, Detailed Testing
  - Software: release candidate
  - Collaboration with Testing Group/LCG
- Development Testbed: Integration & Evaluation of SW
  - Software: alpha & beta release
  - Active use; 5 sites involved
- Development Machines: Testing of Middleware in Isolation
  - Software: development release
  - Under control of middleware work packages

Application Testbed Resources

Site            Country  CPUs     Storage
CC-IN2P3*       FR        620      192 GB
CERN*           CH        138     1321 GB
CNAF*           IT         48     1300 GB
Ecole Poly.     FR          6      220 GB
Imperial Coll.  UK         92      450 GB
Liverpool       UK          2       10 GB
Manchester      UK          9       15 GB
NIKHEF*         NL        142      433 GB
Oxford          UK          1       30 GB
Padova          IT         11      666 GB
RAL*            UK          6      332 GB
SARA            NL          0   10000+ GB
TOTAL           5        1075    14969 GB

* also Dev. TB
+ 200 TB including tape

Since Last Year:
- Improved software (EDG 1.4.3)
- Doubled sites. More waiting: Australia, Taiwan, USA (U. Wisc.), UK sites, INFN, French sites, CrossGrid
- Significantly more CPU/Storage

Hidden Infrastructure:
- MDS Hierarchy
- Resource Brokers
- User Interfaces
- VO Replica Catalogs
- VO Membership Servers
- Certification Authorities

Refocus on quality objectives
Year 1 - Focus on:
  - Quality of the deliverables
    - Deliverable procedure
    - Document management
    - Project monitoring and reporting
  - Software infrastructure
    - Software release procedure
    - Central repository
    - Bug reporting and tracking
    - Standards and tools
Year 2 - Focus on:
  - Quality of the software production
    - Stability of the system
    - User support
    - Software distribution and Testbed infrastructure
  - Supported by the Project Quality Statement
Year 3 - Focus on:
  - Global provisioning of Quality of Services (QoS)

Test and Validation process
- Unit Test (WP specific machines): WPs add unit tested code to the CVS repository; individual WP tests
- Build (Build system): run nightly build & auto. tests; produces a tagged package
- Integration (Development Testbed, ~15 CPUs): Integration Team runs overall release tests; tagged release candidates, then a tagged release selected for certification
- Certification (Certification Testbed, ~40 CPUs): grid certification by the Test Group and application certification by the Apps. Representatives; certified release candidates, then a certified release selected for deployment
- Production (Production Testbed, ~1000 CPUs): certified public release for use by apps. and users
- Bugzilla anomaly reports go back to the WPs, which fix problems
- Office hours; 24x7 (**)
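The promotion path on this slide can be read as a small state machine. The sketch below is only an illustration under assumptions: the stage names and testbed sizes come from the slide, while the `Release` class, the `promote` function and the sample release tag are hypothetical and not part of any EDG tool.

```python
from dataclasses import dataclass, field

# Stage order and environments as listed on the slide.
STAGES = ["Unit Test", "Build", "Integration", "Certification", "Production"]
ENVIRONMENT = {
    "Unit Test": "WP specific machines",
    "Build": "Build system",
    "Integration": "Development Testbed (~15 CPUs)",
    "Certification": "Certification Testbed (~40 CPUs)",
    "Production": "Production Testbed (~1000 CPUs)",
}


@dataclass
class Release:
    """Hypothetical record of one release moving through the stages."""
    tag: str
    stage: str = "Unit Test"
    anomalies: list = field(default_factory=list)


def promote(release, tests_passed, report=""):
    """Advance one stage if tests pass; otherwise log an anomaly.

    The anomaly list stands in for a Bugzilla report that the work
    packages would have to fix before the release can move on.
    """
    if not tests_passed:
        release.anomalies.append((release.stage, report))
        return release.stage
    index = STAGES.index(release.stage)
    if index < len(STAGES) - 1:
        release.stage = STAGES[index + 1]
    return release.stage


rel = Release("candidate-1")          # hypothetical tag
promote(rel, True)                    # Unit Test -> Build
promote(rel, True)                    # Build -> Integration
promote(rel, False, "job submission fails")  # stays in Integration
print(rel.stage, "on", ENVIRONMENT[rel.stage])
```

A failed test leaves the release where it is, which mirrors the "fix problems" loop back to the work packages.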

A few statistics
Since mid-November 2002

Virtual Org.   SE Data (Gb)
ATLAS              3258.300
CMS                 388.934
DØ                  148.000
Earth Obs.           83.000
Biomedical            8.186
LHCb                  7.400
Integ. Team           2.800
WP6                   0.311
Alice                 0.156
BaBar                 0.001

Virtual Org.   # jobs   CPU hrs
CMS             12869     33841
ATLAS            6930     11583
Local Users      2151      8906
WP6              1627       973
LHCb              810       444
EarthOb          1462       365
Biomedical       6821       195
Alice            1819       136
Tutorial         1651         2
Iteam            7207         1
BaBar               1         0
DØ                  1         0
Totals          43349     56445
Failed           3159

General Status Summary
- Successful deployment of middleware for use by real applications
  - Periodic releases
  - Testbed available throughout the year
- Applications heavily involved in all phases of the project
  - Many applications ported to EDG testbed
  - Extensive testing and usage
  - Feedback to drive the project development
- Improved international network support
  - Many upgrades within the NREN area
  - Strong collaboration with GEANT is key to success
- Active participation in international standards bodies (GGF etc.)
- High-level coordination with related Grid projects
- Open source license developed and adopted
- Major dissemination success with tutorials and road-shows

Related Grid Projects
- Through links with sister projects, there is the potential for a truly global scientific applications grid
- Demonstrated at IST2002 and SC2002 in November

Overview of planned activities for 2003
- More software releases
  - Release EDG 2.0 to be deployed on the application testbed in May 2003
  - Subsequent updates expected, based on application feedback and availability of new middleware modules
- Further improve testing and verification
  - Would like to go even further, but resources are already fully stretched
- Applications
  - More HEP experiments, EO projects and bio-informatics applications will use EDG facilities
  - Expand on task force initiatives to provide active support for applications
- Extend cooperation and coordination with related grid projects
  - Explore migration paths for EDG software to Open Grid Services Architecture
- More dissemination activities
  - Participation at many events already planned
  - Further sessions of the tutorial road-show

WP7 (Network): Generalities
- Planning for provisioning of infrastructure for testbed operation
  - D7.1 (M9) [Report]: Report on Network infrastructure for Testbed-1
  - D7.4 (M36) [Report]: Final report on network infrastructure and services
- Network and Transport Services
  - D7.3 (M9) [Report]: Network Services: requirements, deployment and use in testbeds
- Network and Grid traffic monitoring
  - D7.2 (M12) [Prototype]: Demonstration and deployment of monitoring tools
- Grid Security
  - D7.5 (M15) [Report]: Security requirements and report on first project release
  - D7.6 (M25) [Report]: Security Design Report
  - D7.7 (M36) [Report]: Final security report

WP7 Network Monitoring
- Standard and ad-hoc developed tools to measure network metrics:
  - One Way Delay => RIPE boxes
  - Round Trip Delay => PingER
  - Packet Loss => PingER
  - TCP throughput => IperfER
  - UDP throughput => UDPmon
  - Jitter => UDPmon
  - Router traffic => NetLoad Agent
- PCP (Probe Coordination Protocol, developed by WP7) schedules all active network measurements and avoids conflicts
- A dedicated MDS schema and R-GMA infrastructure store all network metrics and GridFTP logging
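The slide says only that PCP schedules active measurements and avoids conflicts; it does not describe the protocol. To illustrate the idea, the sketch below uses a simple greedy slot assignment, which is an assumption and not PCP's actual algorithm: two probes that share a host are never placed in the same time slot. The `schedule` function and the probe list are hypothetical.

```python
from collections import defaultdict


def schedule(measurements):
    """Assign each (tool, src, dst) probe to the earliest free slot.

    A slot is free for a probe when neither of its endpoints is
    already taking part in another probe in that slot.
    """
    busy = defaultdict(set)  # slot index -> hosts already in use
    plan = []
    for tool, src, dst in measurements:
        slot = 0
        while src in busy[slot] or dst in busy[slot]:
            slot += 1
        busy[slot].update((src, dst))
        plan.append((slot, tool, src, dst))
    return plan


# Hypothetical probe requests between sites named in the slides.
probes = [
    ("IperfER", "CERN", "RAL"),
    ("UDPmon", "CERN", "NIKHEF"),
    ("PingER", "RAL", "NIKHEF"),
    ("IperfER", "IN2P3", "CNAF"),
]

for slot, tool, src, dst in schedule(probes):
    print(f"slot {slot}: {tool} {src} -> {dst}")
```

Probes with disjoint endpoints share a slot, so the fourth probe runs alongside the first while the second and third are pushed to later slots.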

WP7 Network Monitoring Architecture
Visualization
  - Network managers: Web, RTPL, MapCenter
  - Replica Managers & resource brokers: NetworkCost
Processing
  - Forecaster
Collect and Storage
  - Info Services: MDS & R-GMA
  - R-GMA Archive
  - Raw
  - Distributed Data Collector
Measure
  - PingER, PCP, IPerf, UDPmon, GridFTP

WP7 Monitoring: Visualization Tools
- MapCenter
- TopoGrid
- rtpl

WP7 Transport and Services
Close technical collaboration with DANTE on:
- High Throughput transfer
  - Parallel streams
  - High-speed & Scalable TCP
  - More than 350 Mbit/s single stream between DataGRID Storage Elements (CERN and SARA)
- LBE and IP Premium

WP7 Services: NetworkCost functionality

getnetworkcost
FileSize = 10 GB
Results = time to transfer (sec.)

         CERN    RAL     NIKHEF  IN2P3   CNAF
CERN             6.7     10.53   3.11    5.31
RAL      7.46            2.44    7.12    4.35
NIKHEF   11.13   3.25            11.86   2.66
IN2P3    5.03    10.38   6.24            7.08
CNAF     4.5     6.53    4.04    13.08
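A replica manager could consult such a matrix to decide where to fetch a file from. The sketch below is illustrative only. The numbers are copied from the slide, but the slide does not say which axis is the source, so the code assumes that rows are the sending site and columns the receiving site. `cheapest_source` is a hypothetical helper, not the real getnetworkcost interface.

```python
# Transfer times in seconds, copied from the slide's matrix.
# Assumption: COST[row_site][column_site], with the row as the source.
COST = {
    "CERN":   {"RAL": 6.7,   "NIKHEF": 10.53, "IN2P3": 3.11,  "CNAF": 5.31},
    "RAL":    {"CERN": 7.46, "NIKHEF": 2.44,  "IN2P3": 7.12,  "CNAF": 4.35},
    "NIKHEF": {"CERN": 11.13, "RAL": 3.25,    "IN2P3": 11.86, "CNAF": 2.66},
    "IN2P3":  {"CERN": 5.03, "RAL": 10.38,    "NIKHEF": 6.24, "CNAF": 7.08},
    "CNAF":   {"CERN": 4.5,  "RAL": 6.53,     "NIKHEF": 4.04, "IN2P3": 13.08},
}


def cheapest_source(destination, replica_sites):
    """Return (site, seconds) for the replica with the lowest cost.

    Replicas already located at the destination are skipped because
    the matrix has no diagonal entries.
    """
    candidates = {
        site: COST[site][destination]
        for site in replica_sites
        if site != destination
    }
    if not candidates:
        raise ValueError("no remote replica to choose from")
    return min(candidates.items(), key=lambda item: item[1])


# Replicas at three sites, job running at CNAF.
print(cheapest_source("CNAF", ["CERN", "RAL", "NIKHEF"]))
```

Under the stated row-as-source assumption, the lookup picks NIKHEF for a transfer to CNAF, since 2.66 is the smallest entry in the CNAF column among the three candidates.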

WP7 Services: NetworkCost Suite
- getnetworkcost functions assist replica managers and resource brokering
- Based on various back-ends for flexibility:
  - CGI and Globus MDS back-ends in release 1
  - R-GMA back-end in release 2
  - Web Services back-end also under development
- Based on regular TCP throughput measurement (release 1)
- Parameters to be added for enhanced precision:
  - GridFTP logging information: the more the grid is used, the more precise the results
  - Historical data stored in R-GMA Archiver
  - Other network metrics (RTT, jitter)
  - Forecasting methods will also be tested
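A cost based on measured TCP throughput can be approximated by dividing the file size by the throughput. The sketch below shows that arithmetic only. It is not the EDG implementation: the function names, the use of a plain mean over recent samples and the sample values are all assumptions made for illustration.

```python
from statistics import mean


def transfer_time_seconds(file_size_bytes, throughput_mbit_s):
    """Estimate transfer time as size in bits over throughput in bit/s."""
    if throughput_mbit_s <= 0:
        raise ValueError("throughput must be positive")
    return file_size_bytes * 8 / (throughput_mbit_s * 1_000_000)


def estimate_costs(file_size_bytes, samples_by_site):
    """Map each site to an estimated time, using the mean of its samples."""
    return {
        site: transfer_time_seconds(file_size_bytes, mean(samples))
        for site, samples in samples_by_site.items()
    }


# Hypothetical TCP throughput samples in Mbit/s, not measured values.
samples = {
    "RAL": [80.0, 120.0],
    "NIKHEF": [350.0, 250.0],
}

file_size = 1_000_000_000  # 1 GB, decimal
for site, seconds in estimate_costs(file_size, samples).items():
    print(f"{site}: {seconds:.1f} s")
```

Replacing the plain mean with a forecast over archived history, or blending in GridFTP transfer logs, would be the natural place to add the extra parameters the slide lists.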

WP7 Summary & perspectives
Major accomplishments
- Follow-up of network infrastructure evolutions (GEANT and NRENs)
- Close technical collaboration with DANTE: tests and proof of network QoS benefits
  - Less than Best Effort for bulk transfers
  - IP Premium for more interactive applications
  - Achievement of high throughput transfers between EDG sites
- Deployment of Network Monitoring Infrastructure
  - Installation of network sensors on main EDG sites and storage of metrics in Globus MDS
  - Delivery of first release of NetworkCost function, built upon this infrastructure
Major goals for next year
- Deployment of R-GMA Archives to store all historical network metrics
- Enhancement of monitoring and of the NetworkCost functions suite (GridFTP logging, RTT, jitter, scheduling of measurements)
- Continue close collaboration with DANTE on network QoS and performance to:
  - Understand the behavior of the GEANT backbone
  - Learn the benefits of QoS deployment

General DataGrid Project Perspectives
- Third year activities will build on the assets from the first two years of the project (European-wide testbeds, software and highly motivated groups)
- Third year of the project will be at least as stimulating and challenging as the first two years
  - Advances are planned for all aspects of the EDG middleware and testbeds
  - Providing more functionality, computing resources and higher levels of service
- The project is following the development of OGSA and sees it as the future for grids
- Established relationships with related projects will ensure that DataGrid developments will live on after the project has run to completion
- DataGrid partners will participate in a proposal (EGEE www.cern.ch/egee-ei) of the EU FP6 to further develop the production aspects of the project