The Cyberinfrastructure Laboratory: Middleware Overview

Similar documents
The Center for Computational Research: High-End Computing & Visualization and Grid Computing Russ Miller

Molecular Structure Determination, Grid Computing, and the Center for Computational Research

Molecular Structure Determination on the Grid

An Overview of CSNY, the Cyberinstitute of the State of New York at buffalo

Shaking-and-Baking on a Grid

BnP on the Grid Russ Miller 1,2,3, Mark Green 1,2, Charles M. Weeks 3

Shake-and-Bake, Grid Computing, and Visualization

High-Performance Computing & High-End Visualization in Buffalo

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

High-Performance Computing, Computational and Data Grids

The Center for Computational Research & Grid Computing

Shake-and-Bake on the grid

The Center for Computational Research

Parallel Computing in Bioinformatics and Computational Biology Wiley Book Series on Parallel and Distributed Computing

EGEE and Interoperation

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

Layered Architecture

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

Introduction to Grid Computing

Grid Scheduling Architectures with Globus

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Introduction. Software Trends. Topics for Discussion. Grid Technology. GridForce:

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

The Center for Computational Research

Grid Challenges and Experience

Production Grids. Outline

Molecular structure determination on a computational and data grid

The LHC Computing Grid

By Ian Foster. Zhifeng Yun

Grid Architectural Models

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

An Experience in Accessing Grid Computing from Mobile Device with GridLab Mobile Services

MONTE CARLO SIMULATION FOR RADIOTHERAPY IN A DISTRIBUTED COMPUTING ENVIRONMENT

UNICORE Globus: Interoperability of Grid Infrastructures

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

Michigan Grid Research and Infrastructure Development (MGRID)

Garuda : The National Grid Computing Initiative Of India. Natraj A.C, CDAC Knowledge Park, Bangalore.

REU Site: Bio-Grid Initiatives for Interdisciplinary Research and Education Chun-Hsi Huang, University of Connecticut. EduHPC 2015, Austin TX

Moving e-infrastructure into a new era the FP7 challenge

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Introduction to Grid Infrastructures

Access the power of Grid with Eclipse

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

SLATE. Services Layer at the Edge. First Meeting of the National Research Platform Montana State University August 7-8, 2017

NMI Component Testing Guidelines Pertaining to: NMI Release 3 (Release date: April 28, 2003) May 2, 2003

Storage Resource Sharing with CASTOR.

Interconnect EGEE and CNGRID e-infrastructures

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN)

N. Marusov, I. Semenov

Pan-European Grid einfrastructure for LHC Experiments at CERN - SCL's Activities in EGEE

Usage of LDAP in Globus

JOB SUBMISSION ON GRID

The EGEE-III Project Towards Sustainable e-infrastructures

Grid Technologies & Applications: Architecture & Achievements

Managing MPICH-G2 Jobs with WebCom-G

ISTITUTO NAZIONALE DI FISICA NUCLEARE

Leveraging the InCommon Federation to access the NSF TeraGrid

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Grids and Security. Ian Neilson Grid Deployment Group CERN. TF-CSIRT London 27 Jan

Hungarian Supercomputing Grid 1

Getting the Most Out of SnB

ALICE Grid Activities in US

The Grid: Feng Shui for the Terminally Rectilinear

Cornell Red Cloud: Campus-based Hybrid Cloud. Steven Lee Cornell University Center for Advanced Computing

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

The University of Michigan Center for Advanced Computing

glite Grid Services Overview

High Performance Computing Course Notes Grid Computing I

Scientific Computing with UNICORE

The Grid: Processing the Data from the World s Largest Scientific Machine

The Beta Grid: A National Infrastructure for Computer Systems Research

NUSGRID a computational grid at NUS

ATLAS NorduGrid related activities

Introduction to Grid Technology

Middleware of Taiwan UniGrid

Heterogeneous Grid Computing: Issues and Early Benchmarks

Grid Interoperation and Regional Collaboration

A Distributed Media Service System Based on Globus Data-Management Technologies1

XSEDE Software and Services Table For Service Providers and Campus Bridging

Grid Compute Resources and Job Management

CHIPP Phoenix Cluster Inauguration

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments *

Implementing GRID interoperability

DiPerF: automated DIstributed PERformance testing Framework

Day 9: Introduction to CHTC

NMI Component Testing Guidelines Pertaining to: NMI Release 1 (released May 7, 2002)

S i m p l i f y i n g A d m i n i s t r a t i o n a n d M a n a g e m e n t P r o c e s s e s i n t h e P o l i s h N a t i o n a l C l u s t e r

GridMonitor: Integration of Large Scale Facility Fabric Monitoring with Meta Data Service in Grid Environment

Improving Network Infrastructure to Enable Large Scale Scientific Data Flows and Collaboration (Award # ) Klara Jelinkova Joseph Ghobrial

Computational Mini-Grid Research at Clemson University

Scheduling Large Parametric Modelling Experiments on a Distributed Meta-computer

e-science for High-Energy Physics in Korea

Clouds: An Opportunity for Scientific Applications?

The LHC Computing Grid. Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008

Globus and Grids. Jennifer M. Schopf Argonne National Lab

A Data-Aware Resource Broker for Data Grids

Knowledge Discovery Services and Tools on Grids

Grid portal solutions: a comparison of GridPortlets and OGCE

Transcription:

The Cyberinfrastructure Laboratory: Middleware Overview Russ Miller Director, oratory Dept of Comp Sci & Eng Hauptman-Woodward Med Res Inst Executive Director, NYSGrid NSF, NIH, DOE, NIMA, NYS, HP www.cse.buffalo.edu/faculty/miller/ci/ www.nysgrid.org

Grid Computing

Grid Computing Overview Data Acquisition Advanced Visualization Analysis Imaging Instruments LHC Computational Resources Large-Scale Databases Coordinate Computing Resources, People, Instruments in Dynamic Geographically-Distributed Multi-Institutional Environment Treat Computing Resources like Commodities Compute cycles, data storage, instruments Human communication environments No Central Control; No Trust

ACDC-Grid Collaborations I High-Performance Networking Infrastructure Grid3+ Collaboration ivdgl Member Only External Member Open Science Grid Organizational Committee Blueprint Committee Security Working Group Data Working Group GRASE VO Grid-Lite: Campus Grid HP Labs Collaboration Innovative Laboratory Prototype Dell Collaboration

ACDC-Grid Collaborations II Grass Roots NYS Grid (pre-nysgrid.org) SUNY-Albany SUNY-Binghamton SUNY-Buffalo SUNY-Geneseo Canisius College Columbia Hauptman-Woodward Inst. Niagara University GRASE VO: Grid Resources for Advanced Science and Engineering Virtual Organization (Non-Physics Research) Structural Biology Groundwater Modeling Earthquake Engineering Computational Chemistry GIS/BioHazards

Middleware Intermediate Software Layer between Application Codes and Grid Resources Required for applications, users, and resource providers to operate effectively in a manner transparent to the user Security; Resource Management; Data Access; Policies; Accounting; Globus; Condor Checks availability of Resources CPUs; Storage; Networking; Render Farms; etc. Scheduling / Workload Management System Resource Broker Evaluates Job and Breaks Up/Submits

NSF Middleware Initiative (NMI) Develop, improve, and deploy a suite of reusable software components for use in national-scale cyberinfrastructure. APST, Condor, CPM, DataCutter, DataCutter STORM, Globus Toolkit, GPT, Gridconfig, GridPort, GridSolve, GSI OpenSSH, Inca, KX.509/KCA, Look, MPICH-G2, MyProxy, Network Weather Service, OpenSAML, PERMIS, PyGlobus, Shibboleth, SRB Client, UberFTP, and WebISO (Web Initial Sign-on).

Grid Issues High-Throughput Computing Transparent Integration of Data, Computing, Sensors/Devices, Networking Heterogeneous Resources Standards (Grid, Data) Major User Communities High-Energy Physics and Astrophysics Medicine and Biological Sciences Earth Sciences Public Funding Still Critical Grids are in their Infancy

Major Grid Initiatives EGEE: Enabling Grids for E-SciencE (European Commision) Initial Focus on CERN (5PB of Data/Year) High-Energy Physics and Life Sciences Expanded Focus Includes Virtually All Scientific Domains 200 Institutions; 40 Countries 20K+ CPUs; 5PB; 25,000 jobs per day! OSG (DOE, NSF) High-Throughput Distributed Facility Open & Heterogeneous Biology, Computer Science, Astrophysics, LHC 57 Compute Sites; 11 Storage Sites; 10K CPUS; 6PB TeraGrid (NSF) Integrates High-End Resources High-Performance (Dedicated) Networks 9 Sites; 100TF & 15PB 100+ Databases Available

Open Science Grid Courtesy of Paul Avery EGEE Focus on CERN (5EB of Data/Year) OSG TeraGrid

NSF Integrated Cyberinfrastructure NSF Integrated CI NSF Director Arden L. Bement: leadership in cyberinfrastructure may determine America's continued ability to innovate and thus our ability to compete successfully in the global arena.

ACDC-Grid Cyberinfrastructure Lightweight Grid Monitor (Dashboard) Predictive Scheduler Define quality of service estimates of job completion, by better estimating job runtimes by profiling users. Dynamic Resource Allocation Develop automated procedures for dynamic computational resource allocation. High-Performance Grid-Enabled Data Repositories Develop automated procedures for dynamic data repository creation and deletion. Integrated Data Grid Automated Data File Migration based on profiling users. Grid Portal

ACDC-Grid System Architecture Grid Portal Applications and Administration Grid Collaborations Internal Gateway Portal External Gateway Portal Data Grid Dynamic Resource Allocation Predictive Scheduler Computational Grid Resource Monitoring Storage Elements Campus Grid High Perfomance Platforms Storage Area Network Network Attached Storage MultiStore Platform Resources Condor Flock 1..N Distributed Memory Shared Memory Data Grid Infrastructure Campus Grid Infrastructure

ACDC-Grid Monitoring: The ACDC-Grid DASHBOARD http://osg.ccr.buffalo.edu

ACDC Monitor http://osg.ccr.buffalo.edu/operations-dashboard.php?grids=3&vos=10

Predictive Scheduler Build profiles based on statistical analysis of logs of past jobs Per User/Group Per Resource Use these profiles to predict runtimes of new jobs Make use of these predictions to determine Resources to be utilized Availability of Backfill

ACDC-Grid Dynamic Resource Allocation at SC03 with Grid3 Small number (40) of CPUs were dedicated at night An additional 400 CPUs were dynamically allocated during the day No human intervention was required Grid applications were able to utilize the resources and surpassed the Grid3 goals

ACDC-Grid Dynamic Resource Allocation Node scratch space (120 GB) 4 node Dell 2650 PVFS server (1096 GB) Joplin Configuration Diagram Dell 6650 4-way (ACDC) Dell 2650 backup front-end 292 Dell 2650 production nodes Dell 6650 4-way (GRID) Dell 6650 4-way front-end Dell 6650 4-way (EAGLES) 1 node Dell 2650 NFS server (342 GB) GigE and Myrinet connection GigE connection 73 GB hard drive

ACDC-Grid Administration

ACDC-Grid Data Grid Browser view of miller group files published by user rappleye

ACDC-Grid Data Grid Functionality Basic file management functions are accessible via a platform-independent web interface. User-friendly menus/interface. File Upload/Download to/from the Data Grid Portal. Simple Web-based file editor. Efficient search utility. Logical display of files (user/ group/ public). Ability to logically display files based on metadata (file name, size, modification date, etc.)

Grid Services and Applications Applications ACDC-Grid Computational Resources Globus Toolkit MPI Shake-and-Bake MPI-IO Apache High-level Services and Tools C, C++, Fortran, PHP MySQL NWS globusrun Oracle Metacomputing Directory Service Core Services Globus Security Interface GRAM GASS ACDC-Grid Data Resources Condor Stork MPI Local Services RedHat Linux WINNT LSF PBS Maui Scheduler TCP UDP Irix Solaris Adapted from Ian Foster and Carl Kesselman

Grid-Enabling Application Templates (GATs) Structural Biology SnB and BnP for Molecular Structure Determination/Phasing Groundwater Modeling Ostrich: Optimization and Parameter Estimation Tool POMGL: Princeton Ocean Model Great Lakes for Hydrodynamic Circulation Split: Modeling Groundwater Flow with Analytic Element Method Earthquake Engineering EADR: Evolutionary Aseismic Design and Retrofit; Passive Energy Dissipation System for Designing Earthquake Resilient Structures Computational Chemistry Q-Chem: Quantum Chemistry Package Geographic Information Systems & BioHazards Titan: Computational Modeling of Hazardous Geophysical Mass Flows

Grid Enabled SnB Required Layered Grid Services Grid-enabled Application Layer Shake and Bake application Apache web server MySQL database High-level Service Layer Globus, NWS, PHP, Fortran, and C Core Service Layer Metacomputing Directory Service, Globus Security Interface, GRAM, GASS Local Service Layer Condor, MPI, PBS, Maui, WINNT, IRIX, Solaris, RedHat Linux

Required Grid Services Application Layer Shake-and-Bake Apache web server MySQL database High-level Services Globus, PHP, Fortran, C Core Services Metacomputing Directory Service, Globus Security Interface, GRAM, GASS Local Services Condor, MPI, PBS, Maui, WINNT, IRIX, Solaris, RedHat Linux Grid Implementation as a Layered Set of Services GlobusView DUROC Nexus Gloperf Condor LSF MPI Easy Metacomputing Directory Service Applications High-level Services and Tools MPI NQE MPI-IO Core Services Globus Security Interface CC++ Local Services Nimrod/G Heartbeat Monitor AIX Testbed Status TCP Irix globusrun GRAM GASS UDP Solaris

Grid Enabled SnB Execution User defines Grid-enabled SnB job using Grid Portal or SnB supplies location of data files from Data Grid supplies SnB mode of operation Grid Portal assembles required SnB data and supporting files, execution scripts, database tables. determines available ACDC-Grid resources. ACDC-Grid job management includes: automatic determination of appropriate execution times, number of trials, and number/location of processors, logging/status of concurrently executing resource jobs, & automatic incorporation of SnB trial results into the molecular structure database.

Startup Screen for ACDC-Grid Job Submission

Instructions and Description for Running a Job on ACDC-Grid

Software Package Selection

Full Structure / Substructure Template Selection

Default Parameters Based on Template

Default Parameters (cont d)

Generating Reflections (Drear)

Invariant Generation

SnB Setup

SnB Setup (cont d)

SnB Review (Grid job ID: 447)

Graphical Representation of Intermediate Job Status

Histogram of Completed Trial Structures

Status of Jobs

Heterogeneous Back-End Interactive Collaboratory User starts up default image of structure.

Molecule scaled, rotated, and labeled.

Acknowledgments Mark Green Cathy Ruby Amin Ghadersohi Naimesh Shah Steve Gallo Jason Rappleye Jon Bednasz Sam Guercio Martins Innus Cynthia Cornelius George DeTitta Herb Hauptman Charles Weeks Steve Potter Alan Rabideau Igor Janckovic Michael Sheridan Abani Patra Matt Jones NSF ITR NSF CRI NSF MRI NYS CCR

www.cse.buffalo.edu/faculty/miller