e-science Infrastructure and Applications in Taiwan Eric Yen and Simon C. Lin ASGC, Taiwan Apr. 2008

Similar documents
Integration of an Asian NGI with European counterparts

The LHC Computing Grid

The LHC Computing Grid. Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008

Grid Interoperation and Regional Collaboration

The grid for LHC Data Analysis

The EGEE-III Project Towards Sustainable e-infrastructures

First Experience with LCG. Board of Sponsors 3 rd April 2009

Travelling securely on the Grid to the origin of the Universe

Pan-European Grid einfrastructure for LHC Experiments at CERN - SCL's Activities in EGEE

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN)

Constant monitoring of multi-site network connectivity at the Tokyo Tier2 center

Deployment of e-science Infrastructure and Applications for PNC

Operating the Distributed NDGF Tier-1

Bob Jones. EGEE and glite are registered trademarks. egee EGEE-III INFSO-RI

Presentation Title. Grid Computing Project Officer / Research Assistance. InfoComm Development Center (idec) & Department of Communication

Compact Muon Solenoid: Cyberinfrastructure Solutions. Ken Bloom UNL Cyberinfrastructure Workshop -- August 15, 2005

Andrea Sciabà CERN, Switzerland

Grid Security Policy

Grid Computing a new tool for science

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi

SLATE. Services Layer at the Edge. First Meeting of the National Research Platform Montana State University August 7-8, 2017

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland

The LHC Computing Grid

Grid and Cloud Activities in KISTI

Grid Scheduling Architectures with Globus

Interconnect EGEE and CNGRID e-infrastructures

Data Transfers Between LHC Grid Sites Dorian Kcira

150 million sensors deliver data. 40 million times per second

Grids and Security. Ian Neilson Grid Deployment Group CERN. TF-CSIRT London 27 Jan

The CHAIN-REDS Project

Distributed e-infrastructures for data intensive science

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

EGEE and Interoperation

Moving e-infrastructure into a new era the FP7 challenge

Garuda : The National Grid Computing Initiative Of India. Natraj A.C, CDAC Knowledge Park, Bangalore.

ALICE Grid Activities in US

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

Computing grids, a tool for international collaboration and against digital divide Guy Wormser Director of CNRS Institut des Grilles (CNRS, France)

Sustainability Issues of Asia Pacific Region

Grid Computing Activities at KIT

Big Computing and the Mitchell Institute for Fundamental Physics and Astronomy. David Toback

High Throughput WAN Data Transfer with Hadoop-based Storage

Challenges and Evolution of the LHC Production Grid. April 13, 2011 Ian Fisk

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

European Grid Infrastructure

The Grid: Processing the Data from the World s Largest Scientific Machine

AMGA metadata catalogue system

einfrastructures Concertation Event

Grid Computing for Bioinformatics: An Implementation of a User-Friendly Web Portal for ASTI's In Silico Laboratory

Challenges of the LHC Computing Grid by the CMS experiment

The INFN Tier1. 1. INFN-CNAF, Italy

Computing for LHC in Germany

CERN and Scientific Computing

CHIPP Phoenix Cluster Inauguration

Status of KISTI Tier2 Center for ALICE

The Grid. Processing the Data from the World s Largest Scientific Machine II Brazilian LHC Computing Workshop

High Performance Computing Course Notes Grid Computing I

I Service Challenge e l'implementazione dell'architettura a Tier in WLCG per il calcolo nell'era LHC

Grid Challenges and Experience

MAIN THEME Artificial Intelligence, Architecture and Applications

High-density Grid storage system optimization at ASGC. Shu-Ting Liao ASGC Operation team ISGC 2011

Ganga The Job Submission Tool. WeiLong Ueng

Journey Towards Science DMZ. Suhaimi Napis Technical Advisory Committee (Research Computing) MYREN-X Universiti Putra Malaysia

Introduction to Grid Infrastructures

The European DataGRID Production Testbed

ELFms industrialisation plans

HEP Grid Activities in China

CMS Grid Computing at TAMU Performance, Monitoring and Current Status of the Brazos Cluster

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

UNICORE Globus: Interoperability of Grid Infrastructures

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

arxiv: v1 [physics.ins-det] 1 Oct 2009

Regional e-infrastructures

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

A Simulation Model for Large Scale Distributed Systems

IEPSAS-Kosice: experiences in running LCG site

The LHC computing model and its evolution. Dr Bob Jones CERN

N. Marusov, I. Semenov

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

e-infrastructure: objectives and strategy in FP7

From raw data to new fundamental particles: The data management lifecycle at the Large Hadron Collider

CESGA: General description of the institution. FINIS TERRAE: New HPC Supercomputer for RECETGA: Galicia s Science & Technology Network

Workload Management. Stefano Lacaprara. CMS Physics Week, FNAL, 12/16 April Department of Physics INFN and University of Padova

DESY at the LHC. Klaus Mőnig. On behalf of the ATLAS, CMS and the Grid/Tier2 communities

The EPIKH, GILDA and GISELA Projects

e-sens Nordic & Baltic Area Meeting Stockholm April 23rd 2013

I Tier-3 di CMS-Italia: stato e prospettive. Hassen Riahi Claudio Grandi Workshop CCR GRID 2011

Progress Report on the Utilisation of the ASEAN Foundation Funds for Projects (As of 31 March 2003 )

Project Progress Report

Government of Canada IPv6 Adoption Strategy. IEEE International Conference on Communications (ICC 12) June 14 th, 2012

EGI Operations and Best Practices

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

Promoting Open Standards for Digital Repository. case study examples and challenges

The CMS experiment workflows on StoRM based storage at Tier-1 and Tier-2 centers

HKIX Updates at APAN 44

HPC Progress and Response to the National Cyber-Infrastructure

Standards capability in Central America and lessons learned from training programs Sungnam CHOI

Transcription:

e-science Infrastructure and Applications in Taiwan Eric Yen and Simon C. Lin ASGC, Taiwan Apr. 2008 1

Outline Driving by WLCG -- Infrastructure, Reliability and Scalability Customized services and Application Extension Core Technology Building Interoperation Facilitating Regional and Global Collaboration Summary 2

ASGC and TWGrid Asia Pacific Regional Operation Center Worldwide Grid Infrastructure Grid Application Platform Avian Flu Drug Discovery Large Hadron Collider (LHC) 3

ASGC Profile Operational from the deployment of LCG0 since 2002, and Takes the Tier-1 Center responsibility from 2005 We support the ATLAS and CMS experiments at the same time in WLCG ATLAS: Institute of Physics, Academia Sinica CMS: National Taiwan University and National Central University Federated Taiwan Tier-2 center -- Taiwan Analysis Facility (TAF) is also collocated in ASGC Leader of EGEE e-science Asia Federation ASGC is not just for WLCG, but also acting as the national center of Grid infrastructure and e-science research and application in Taiwan Providing Asia Pacific Regional Operation Center (APROC) services to Asia Pacific WLCG/EGEE sites 4

What Do We Deliver? e-infrastructure Operation 21+ sites across 8+ countries in Asia Pacific Region > 3,500 Cores and >2 PB storages Continuous monitoring of grid services & automated site configuration management Middleware R&D Production quality MW distributed under friendly open source license model Application integration User Support: Managed process from first contact to production usage Training Expertise in grid-enabling applications online helpdesk Dissemination: attracting more collaborations Interoperability: expanding geographical reach and interoperability with collaborating e-infrastructures 5

6

Network Connectivity and Quality Monitoring Try to have real-time monitoring of site-to-site connectivity quality, including it s latency, data throughput, etc. Optimize the site-to-site routing 7

Enabling Grids for E-sciencE Collaborating e-infrastructures TWGRID & EUAsiaGrid Production = Reliable, sustainable, with commitments to quality of service Potential for linking ~80 countries INFSO-RI-508833 ISGC2007 8

x 9

Transfer Rate CMS CCRC08 ASGC Inbound Rate April.4 April 7 [ MB/s ] T0 95.13 CNAF - FNAL 17.48 FZK 35.61 IN2P3 16.89 PIC 20.12 RAL 53.90 ASGC Outbound Rate April 1 April 7 [ MB/s ] T1_CERN - CNAF - FNAL 14.5 FZK 18.16 IN2P3 9.27 PIC 7.12 RAL 23.09 Taiwan 2.01 KNU 38.73 Pakistan - I/TIFR 2.48 Transfer Quality Taiwan 10.92 KNU 16.44 Pakistan - I/TIFR - UCSD 3.38 Bari 1.50 Estonia 8.94 RWTH 5.36 DESY 9.31 CSCS 5.53 UCSD 4.41 Bari 0.03 Caltech - DESY - Estonia 5.11 Florida - MIT 1.45 Pisa 2.01 RWTH 12.68 Nebraska 5.68 Spain - Total 143.59 Nebraska 0.10 Pisa 6.48 Total 299.68 10

Atlas T0-T1 transfer

ASGC Resource Level Date CPU (ksi2k) Disk (TB) Tape (TB) 2006 636 360 800 2007 2300 1100 800 2008 3400 1500 1300 12

CPU Utilization Statistics max ~19K jobs/day, 41.5K CPU hours/day ~ 3K job slots available from late Feb. 08 6 active VOs 179.6K jobs and 712.68K CPU hours in March 2008 13

Throughput of Data Storage System @ ASGC Figure 1 : Inbound can reached 6.52 Gbps, Outbound can reached 3.7 Gbps in Mar. 2008 Fig. 1 Figure 2 : the average tape writing.( drives rate is ~200MB. (8 reading rate ~150MB/s Fig. 2

Asia Pacific Regional Operations Center Mission Provide deployment support facilitating Grid expansion Maximize the availability of Grid services Supports EGEE sites in Asia Pacific since April 2005 21 production sites in 8 countries Over 3,500 CPU Cores and >2PB in 2008 Runs ASGCCA Certification Authority since 2003 Middleware installation support Production resource center certification Operations Support Monitoring Diagnosis and troubleshooting Problem tracking Security

TWGrid Introduction Consortium Initiated and hosted by ASGC in 2002 Objectives Gateway to the Global e-infrastructure & e-science Applications Providing Asia Pacific Regional Operation Services Fostering e-science Applications collaboratively in AP Dissemination & Outreach Taiwan Grid/e-Science portal Providing the access point to the services and demonstrate the activities and achievements Integration of Grid Resources of Taiwan VO of general Grid applications in Taiwan NTCU 16

EGEE Asia Federation is Extending the glite Infrastructure, currently led by ASGC Engaging more user communities to join worldwide e- Science collaboration Building regional e-infrastructure and e-science application Conducting and supporting a production e- Infrastructure Working together to provide better user support Conducting more business and industry cooperations for new business model and opportunity 17

Core Technology 18

Grid Application Platform (GAP) A light-weight framework for developing problem solving applications on the Grid Enabling Grids for E-sciencE Developing customizable problem solving applications Presentation Tier web portal command-line Job Management User Management Proxy management Visualization Configuration & Plug-ins Application Tier Autodock Application Blast Application <interface> Application extends Shell Application More Applications contains Docking Command Visualization Command <interface> Command depends extends RunScript Command More Commands Components Portable application package: light-weight client-side package for managing jobs and running applications Virtual Queuing System: high-level meta-schedule with application specific resource matching Local System Agent: uniform interface for adapting heterogeneous computing environments Supported computing environments Single Server Computing Cluster: PBS, Grid: LCG, glite Seamless access to Grid applications Distributed system architecture powered by VQS client APIs Features Service-oriented architecture Portable, intuitive and application specific user interface Integrated proxy delegation and automatic proxy renewal with MyProxy server Multi-user environment with historical job archiving and grid proxy management Uniform interface integrating a variety of computing environments ranged from single workstation to world-wide Grid Dynamic resource allocation based on application specification Full Java implementation Workflow support EGEE-II INFSO-RI-031688

SRM-SRB Development Objectives: make SRM the common interfaces for grid storages, and be interoperable among those storages. Features Flexible file/space type supported: volatile, durable and permanent Disk usage status checking is available space reservation functions Progress Implementation of discovery, permission, directory, space functions are all finished. transfer function will be done in April. Endpoint ready for testing Testbed: httpg://fct01.grid.sinica.edu.tw:8443/axis/services/srm SRM Preproduction: httpg://tap02.grid.sinica.edu.tw:8443/axis/services/srm SRB 20

e-science Applications in Taiwan High Energy Physics: WLCG, CDF, Belle Bioinformatics: mpiblast-g2 Biomedicine: Distributing AutoDock tasks on the Grid using DIANE, BioPortal Digital Archive: Data Grid for Digital Archive Long-term preservation Atmospheric Science Earth Sciences: SeisGrid, GeoGrid for data management and hazards mitigation Ecology Research and Monitoring: EcoGrid Humanity and Social Sciences General HPC Services Environment and Biodiversity Informatics Astronomy: ALMA, PanStar e-science Application Development Platform 21

Bio-Portal and Virtual Screening Services x 22

Virtual Screening Service with GAP A standalone GUI Application One-click job submission Submit the docking job to the Grid with just one click Visualize your job status View the best conformation of a simulation Generate the histogram with a given energy threshold 23

DataGrid for Long Term Preservation of NDAP

25

26

27

Collaboration of NCeSS and ASGC on e-social Sciences Comparative study of the current development and adoption of e- Infrastructure in e-(social) sciences in Taiwan and the UK, by mapping e-social science in the areas of digital archives and geoscience. Establish a long-running programme of collaboration internationally Idea is to understand and widen uptake of e-infrastructure Drawing on science and technology studies Early adopters - followers - late adopters (Not character types) Mutual shaping Socio-technical alignment Path dependencies - lock-in Uneven distribution of costs & benefits User-designer relations Designing interventions: Based on understanding of drivers / barriers / enablers / alignment / beaten paths Social-Economic Applications will be another focus

EUAsiaGrid Identify and engage scientific communities which can benefit from the use of state-of-art Grid technologies; Disseminate EGEE middleware in Asian countries by means of public events and written and multimedia material; Provide training resources and organise training events for potential and actual Grid users; Support the scientific applications and create a human network of scientific communities by building on and leveraging the e-science Grid infrastructure. 29

Training Dissemination & Outreach Target Audience: Site Admin, User, Application Developer, and General Public Incubation Program Grid Camp and Station Program e-science application and industrial program Symposium/Conference/Workshop promoting EGEE values, ASGC services etc., and to engage more collaboration Project coordination, learning, and sharing and interactions by hosting events in Taiwan. Evaluation & Follow-ups Program would be commenced 30

Dissemination & Outreach(2) International Symposium on Grid Computing (ISGC) from 2002 TWGRID Web Portal (http://www.twgrid.org) Grid Tutorial, Workshop & User Training: ~700 participants in past 10 events Publication Grid Café / Chinese (http://gridcafe.web.cern.ch/gridcafe/) Event Date Attendant Venue China Grid LCG Training 16-18 May 2004 40 Beijing, China ISGC 2004 Tutorial 26 July 2004 50 AS, Taiwan Grid Workshop 16-18 Aug. 2004 50 Shang-Dong, China NTHU 22-23 Dec. 2004 110 Shin-Chu, Taiwan NCKU 9-10 Mar. 2005 80 Tainan, Taiwan ISGC 2005 Tutorial 25 Apr. 2005 80 AS, Taiwan Tung-Hai Univ. June 2005 100 Tai-chung, Taiwan EGEE Workshop Aug. 2005 80 20th APAN, Taiwan EGEE Administrator Workshop Mar. 2006 40 AS, Taiwan EGEE Tutorial and ISGC 06 1 May, 2006 73 AS, Taiwan EGEE Tutorial with APAN 23 26 Jan. 2007 30 Manila, Philippine EGEE Tutorial with ISGC 07 26 Mar. 2007 90 AS, Taiwan EGEE Tutorial with GridAsia2007 8 Jun. 2007 20 Singapore GridCamp 28.Oct.2007 97 AS, Taiwan Do Son Grid School Nov. 2007 80 HCMC, Vietnam MIMOS Grid Tutorial Dec. 2007 30 Malaysia 31

Summary Practitioner for the new generation infrastructure and international collaboration. Reliability is the first course. Scalability is totally another issue. Customized and advanced functionality upon user requirements is a long-term lesson. Asia Pacific Region is of virtuous potential to adopt the e- Infrastructure : More and more Asia countries will deploy Grid system and take part in the e-science world From Asia Federation to EUAsiaGrid, we are widening the uptake of e-science, by the close collaboration regionally and internationally 32

Data Federation for Biodiversity and Environment Informatics 33

engage the regional/domestic application communities to EGEE and the world --> EGEE, OSG, and others 34