Anatomy of the BIRN The Biomedical Informatics Research Network

Size: px
Start display at page:

Download "Anatomy of the BIRN The Biomedical Informatics Research Network"

Transcription

1 Anatomy of the BIRN The Biomedical Informatics Research Network Dr. Philip Papadopoulos University of California San Diego and the San Diego Supercomputer Center BIRN Coordinating Center Co-Investigator NBCR Co-investigator

2 The BIRN Collaboratory Today Enabling collaborative research at 28 research institutions comprised of 37 research groups.

3 What is BIRN A Physical and Geographically-Distributed test bed for a Biomedical Knowledge Infrastructure Creation/Support of Federated Bioscience Databases Data Integration to Enable New Queries among Disparate Data Sources Interoperable Domain Analysis Tools Scalable, Extensible, Replicable Driven by Research needs Pull, not Technology Push A Domain-Specific Grid ~30 Institutions

4 Major Components 3 Distributed Test beds (Scientific Drivers) Mouse, Morph, Function BIRN IT Coordinating Center (@ UCSD) Distributed Hardware Well-defined, Evolving Software Stack on All Machines OS, Middleware, Applications Centralized/BIRN-Wide Services Grid Glue BIRN Portal Information/News/Application Starting Point People-to-people communication, organization Outreach, Training

5 Scientific/Research Goals Drive the BIRN Infrastructure BIRN is Pushing on the How of Cyberinfrastructure NBCR and BIRN are intimately related on a variety of levels sharing increases as you get closer to the IT Infrastructure

6 Why Biomedical Informatics & Research Bio-Complexity Discovery and Systems Research Approaches Complement Hypothesis-based Research Integrative, Multidisciplinary Team Approach Adapted for Complex Queries Versus Focused Approach for Hypothesis-driven Research Team Approach more Dependent on Advanced Technologies and Instrumentation Which Generate Large Data Sets Information Management is a Critical Capability for Biomedical Research in 21st Century and Beyond

7 Data Integrated Across Scales Enable new understanding of the brain by linking data about macroscopic brain function to its molecular and cellular underpinnings Federate Distributed Multiscale Brain Map Data Accommodate associated Large Scale Computational Challenges Provide Infrastructure for Construction of more Accurate Models and more Realistic Simulations of Brain Activity

8 Challenges of Large and Distributed Data Dr. Art Toga (UCLA) was one of the first to articulate the magnitude of the challenge of human brain data - Large-Scale data points and comparisons among 100s to 1000s is needed

9 BIRN Testbeds - Overview Morphometry BIRN Brain Structure in AD, MCI, Depression Function BIRN Activation Differences in Schizophrenia Mouse BIRN High Resolution Imaging and Animal Models of Human Diseases BIRN-CC Coordinating Center for Cyberinfrastructure (Extending to others Groups)

10 MIRIAD Distributed Analysis Deidentified Data from Duke Retrospective Archive Loaded in BIRN Data Grid UCLA LONI Pipeline Register Probabilistic Anatomy Atlas to Subjects Lobar Analysis BWH/MIT 3D Slicer Image Analysis and Segmentation UCSD Clusters Use Teragrid for very large runs Statistical Analysis Detailed Clinical Database Multi-Institutional Research in the Analysis of Depression (MIRIAD) UCLA AIR Registration and Lobar Analysis Duke Archives 1 BWH Probabilistic Atlas (one time transfer) 2 3 Duke Clinical Analysis BWH Intensity Normalization and EM Segmentation UCSD Cluster Computing MIRIAD Data Flow 1) Retrospective data upload from Duke 2) Lobar analysis and Registration of Atlas to Subjects 3) Anatomical Segmentation 4) Comparison to Clinical History 4

11 MIRIAD Project: Accomplishments Improved computational capabilities Segmentation Duke BIRN-MIRIAD Item (semi-automated) (fully-automated) # of tissue classes 3 (Fig1) 23 (Fig2) Time for 200 brains 400 hours 1 hour Time for 200 lobe & 250 hours all lobes (Fig3) and 27 regional analysis regions included above 1 2 3

12 fbirn Multi-Site Data Normalization Reference Anatomical Scan fmri Scans from 10 Different Sites Same Subject, Registered, Same Slice Notice the obvious differences in image across different scanners.

13 Mouse BIRN - Dimensional Scales of Analysis UCLA (LONI) U. Tenn Duke (CIVM) Cal Tech (Beckman) UCSD (NCMIR)

14 Autosegmentation- Morph- Mouse BIRN

15 An Exercise for the Reader There exists a large body of useful middleware It s assembly, hardening and extension into a useful system is left as an exercise to the reader The BIRN is the reader

16 The IT Underwear BIRN Science Teams Challenge Every Facet of Current Grid Technology Large Data Storage/Movement Integration of Data across Sites Network capacity, real transfer rates Computational Resources All-to-All Data Comparisons Fluid Deformations, Automated Tissue Segmentation Security, Integrity, Provenance (History) of Data and Transformations Ease of Use

17 BIRN Evolution: JSR-168 Portlets gridsphere (GSI-aware) Portlet Container User-Defined Layouts and Capability Integrate both BIRN and non-birn capabilities

18 We Began with Standard Hardware Jumpstarted BIRN for functionality Software footprint managed by the BIRN-CC Integration of domain tools, middleware, OS, updates, and more Expansion/upgrade of existing sites have more generic (and less expensive) hardware e.g. Opteron/EM64T, Multi-vendor HW costs $100K/Site four years ago, now $10K-$15K

19 Racks are Replicated at Each Major BIRN Site 20+ Distinct Installations, 100 s of Individual Machines

20 Software Problem in a Nutshell Data & Network CPU Security Enable Analysis of Distributed Biomedical Data in a National-Scale Production Facility Data Sets are Large Data Sets are Many Enable New Queries that Integrate Multiple Sources Specialized Application Codes (from Test Beds) need to work on BIRN-accessible Data Some Analysis Pipelines Require Significant Computation Privacy, Patient Anonymity Required Institutional Ownership of Originals Easily Replicate Entire Software Stack (Including Centralized Services) for other Groups

21 Major System Components Collaborating Groups of Biomedical Researchers Complete Workflows Application Portal Command/Batch Access Domain Application Tools Distributed Data (Collections) Distributed Data (file system) Computation/Analysis Facilities Identity/Login Management Data Integration Mechanisms Authorization and Role Definition Integrated SW Distribution Overall Operations Note: Similar Structure as Many Other Grids

22 Specific Implementations Mouse, Function, Morphometry (+ New Areas and Users ) Pegasus, Kepler, and others BIRN Portal Command/Batch Access E.g. AFNI, Air, 3DSlicer, LONI,.. GSI-Based. GAMA + MyProxy BIRN Data Integration Storage Resource Broker (SRB) AFS (file system) (SSHFS usermount) Condor,Globus: Local clusters + Teragrid SRB for Access Control to Data Semi-Annual BIRN SW Distribution (April/Oct) BIRN-CC Note: Similar Structure as Many Other Grids

23 Global Elements Data location Storage Resource Broker Meta data catalog Data-type aggregation, cross-correlation, integration BIRN Data Mediator Identity Management Use Grid Security Infrastructure (GSI) Public Key System Integrated Grid Accounts Management Architecture (GAMA) from SDSC for ease-of-use and Single Sign On Portal Services Based on GridSphere Dedicated Compute Cluster (32 nodes) + ability to use larger non-dedicated Condor pool

24 More Global Elements Administrative Subset Grid Certificate Authority Condor Collector Integration of computing elements in Runnable/Accessible state for compute jobs Dynamic system nucleated from the BIRN Dedicated Compute Cluster Can integrate (Using Condor-G) with larger infrastructures like Teragrid Network Operations Center (NOC) Styled functions Performance Analysis VPN Services for secure remote administration and data collection Hardware/Network health monitoring BIRN Software Version Control (CVS) Server Rocks Central for Network-based system installation YUM Server (Interim Security Updates) Mail-list services, Video Teleconference Multipoint (MCU)

25 Reality: Substantial Hidden Infrastructure needed to function BIRN Integrated Services (Existence is made as Transparent as Possible)

26 Example: BIRN Login

27 Login Behind the Scenes Login Portlet (standalone functionality that can be rendered by the Portal itself) is active when user gives username/password 1. Secure request is sent to BIRN GAMA Server GAMA Validates user name/password and returns a time-limited GSI proxy credential 2. GridSphere intializes the User Session Loads user preferences for layout of session Holds proxy credential in per-session memory for use to authenticate the user to other grid services 3. Portlets that require user identification can use GSI proxy (single Sign-on)

28 Data is The #1 issue Generic Data issues Key Middleware Issues Authenticated/Authorized Sharing Location and (High-performance) Access when using Distributed Resources Auditing Use. Insuring Data Integrity Replication BIRN-specific Mediation Of Different Data Representations Semantics of the data is key Some Specific meta data: Patient with identity protection MRI Pulse Sequence, Magnet Description

29 Practical Whole System Management The BIRN Software Stack is Large, Complex and Evolutionary Others need to be able to replicate ALL OF BIRN without spending man years of effort The Physical Hardware is Cheap Especially relative to the cost of a person Throwing System Administrators at the problem (the Microsoft/IBM/DBA Model) is not the answer All deployed components of BIRN are Appliances BIRN Uses the Rocks Clustering Toolkit at the Glue to define all System Configurations

30 Replicability Deployed as complete environment Site Racks and Centralized Services Others should be able to replicate all of BIRN Goal is all of BIRN is represented as complete software-only solution BIRN began with Specific HW for sites and has now relaxes that to multi-vendor We use Rocks as a key packaging/deployment vehicle Open Source (Enterprise Linux) Deployed on over 1000 clusters worldwide BIRN needs have substantially affected its design

31 Basic Software Stack for Cluster/Distributed System

32 Common to Every Networked System

33 Rolls Break Apart Software Stacks into Logical and Re-combinable Components Rocks Base HPC Web Server BIRN Base Free- Surfer BIRN Extensions

34 Software Complexity is Reality BIRN-Specific Rolls Rocks-developed Rolls

35 BIRN Software Releases BIRN Rolls Integrates All Software Defines Multiple Appliances Grid POP Compute Servers Application Servers Metadata Catalog Grid Certificate Server Portal Servers Network Analysis Systems Database Servers Goal: A single person should be able to Build all of BIRN The Entire Structure must be Replicable

36 Supporting and Evolving the Deployed Infrastructure MONITORING ASSESSING TECHNICAL OPTIONS FOR BIRN TO BUILD WORKING SYSTEMS SCALABLE SOFTWARE DISTRIBUTION

37 BIRN Now Provides High Performance Connectivity Between Distributed Resources (Computation and Data Storage) JHU Utilizing Teragrid Resources Pulling Data from SRB Secure Access to Large Volumes of Distributed Data Seamless Access to Distributed High Performance Computing Resources Condor is a key abstraction Frameworks (Standards, APIs, Services) and Software for Integration and Interoperation An Evolving Software and Service Structure to meet the needs of Biomedical Researchers

38 Summary BIRN Infrastructure is Driven By the Needs of Key Test beds BIRN-CC is Integrating, Extending, and Evolving an end-to-end Infrastructure Built to enable new interactions Build rapidly, deploy, monitor and test to uncover the real problems The CC is not alone in this endeavor and we actively work with other groups to get to best of breed IT Challenges are formidable (and satisfying)

The Biomedical Informatics Research Network:

The Biomedical Informatics Research Network: BRIITE - IT SUPPORT FOR MULTI-INSTITUTION COLLABORATIVE RESEARCH November 4, 2005 The Biomedical Informatics Research Network: Experiences with Cyberinfrastructure in a Biomedical Research Community Jeffrey

More information

Distributed Repository for Biomedical Applications

Distributed Repository for Biomedical Applications Distributed Repository for Biomedical Applications L. Corradi, I. Porro, A. Schenone, M. Fato University of Genoa Dept. Computer Communication and System Sciences (DIST) BIOLAB Contact: ivan.porro@unige.it

More information

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms Grid Computing 1 Resource sharing Elements of Grid Computing - Computers, data, storage, sensors, networks, - Sharing always conditional: issues of trust, policy, negotiation, payment, Coordinated problem

More information

Juliusz Pukacki OGF25 - Grid technologies in e-health Catania, 2-6 March 2009

Juliusz Pukacki OGF25 - Grid technologies in e-health Catania, 2-6 March 2009 Grid Technologies for Cancer Research in the ACGT Project Juliusz Pukacki (pukacki@man.poznan.pl) OGF25 - Grid technologies in e-health Catania, 2-6 March 2009 Outline ACGT project ACGT architecture Layers

More information

Distributed Data Management with Storage Resource Broker in the UK

Distributed Data Management with Storage Resource Broker in the UK Distributed Data Management with Storage Resource Broker in the UK Michael Doherty, Lisa Blanshard, Ananta Manandhar, Rik Tyer, Kerstin Kleese @ CCLRC, UK Abstract The Storage Resource Broker (SRB) is

More information

Introduction to Grid Computing

Introduction to Grid Computing Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able

More information

UCLA Grid Portal (UGP) A Globus Incubator Project

UCLA Grid Portal (UGP) A Globus Incubator Project UCLA Grid Portal (UGP) A Globus Incubator Project OGF 2007 Documentation at: http://www.ucgrid.org Prakashan Korambath & Joan Slottow Research Computing Technologies UCLA Academic Technology Services UGP

More information

High Performance Computing Course Notes Grid Computing I

High Performance Computing Course Notes Grid Computing I High Performance Computing Course Notes 2008-2009 2009 Grid Computing I Resource Demands Even as computer power, data storage, and communication continue to improve exponentially, resource capacities are

More information

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi GRIDS INTRODUCTION TO GRID INFRASTRUCTURES Fabrizio Gagliardi Dr. Fabrizio Gagliardi is the leader of the EU DataGrid project and designated director of the proposed EGEE (Enabling Grids for E-science

More information

UGP and the UC Grid Portals

UGP and the UC Grid Portals UGP and the UC Grid Portals OGF 2007 Documentation at: http://www.ucgrid.org Prakashan Korambath & Joan Slottow Research Computing Technologies UCLA UGP (UCLA Grid Portal) Joins computational clusters

More information

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment Paul Watry Univ. of Liverpool, NaCTeM pwatry@liverpool.ac.uk Ray Larson Univ. of California, Berkeley

More information

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments *

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments * Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments * Joesph JaJa joseph@ Mike Smorul toaster@ Fritz McCall fmccall@ Yang Wang wpwy@ Institute

More information

Mitigating Risk of Data Loss in Preservation Environments

Mitigating Risk of Data Loss in Preservation Environments Storage Resource Broker Mitigating Risk of Data Loss in Preservation Environments Reagan W. Moore San Diego Supercomputer Center Joseph JaJa University of Maryland Robert Chadduck National Archives and

More information

UNIT IV PROGRAMMING MODEL. Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus

UNIT IV PROGRAMMING MODEL. Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus UNIT IV PROGRAMMING MODEL Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus Globus: One of the most influential Grid middleware projects is the Globus

More information

Mouse BIRN Data Integration. Maryann Martone Mouse All Hands Meeting

Mouse BIRN Data Integration. Maryann Martone Mouse All Hands Meeting Mouse BIRN Data Integration Maryann Martone 2005 Mouse All Hands Meeting Specific Aims Specific Aim 1: Data Access and Management Continue development of multi-scale databases along existing lines extending

More information

Introduction to The Storage Resource Broker

Introduction to The Storage Resource Broker http://www.nesc.ac.uk/training http://www.ngs.ac.uk Introduction to The Storage Resource Broker http://www.pparc.ac.uk/ http://www.eu-egee.org/ Policy for re-use This presentation can be re-used for academic

More information

IRODS: the Integrated Rule- Oriented Data-Management System

IRODS: the Integrated Rule- Oriented Data-Management System IRODS: the Integrated Rule- Oriented Data-Management System Wayne Schroeder, Paul Tooby Data Intensive Cyber Environments Team (DICE) DICE Center, University of North Carolina at Chapel Hill; Institute

More information

SimPortal. Overview. Frank McKenna. What is SimpPortal Simple Example of Job Submission. UC Berkeley. OpenSees Parallel Workshop Berkeley, CA

SimPortal. Overview. Frank McKenna. What is SimpPortal Simple Example of Job Submission. UC Berkeley. OpenSees Parallel Workshop Berkeley, CA SimPortal Frank McKenna UC Berkeley OpenSees Parallel Workshop Berkeley, CA Overview What is SimpPortal Simple Example of Job Submission George E. Brown, Jr. Network for Earthquake Engineering Simulation

More information

DSpace Fedora. Eprints Greenstone. Handle System

DSpace Fedora. Eprints Greenstone. Handle System Enabling Inter-repository repository Access Management between irods and Fedora Bing Zhu, Uni. of California: San Diego Richard Marciano Reagan Moore University of North Carolina at Chapel Hill May 18,

More information

Grid Architectural Models

Grid Architectural Models Grid Architectural Models Computational Grids - A computational Grid aggregates the processing power from a distributed collection of systems - This type of Grid is primarily composed of low powered computers

More information

THE GLOBUS PROJECT. White Paper. GridFTP. Universal Data Transfer for the Grid

THE GLOBUS PROJECT. White Paper. GridFTP. Universal Data Transfer for the Grid THE GLOBUS PROJECT White Paper GridFTP Universal Data Transfer for the Grid WHITE PAPER GridFTP Universal Data Transfer for the Grid September 5, 2000 Copyright 2000, The University of Chicago and The

More information

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager The University of Oxford campus grid, expansion and integrating new partners Dr. David Wallom Technical Manager Outline Overview of OxGrid Self designed components Users Resources, adding new local or

More information

A Simple Mass Storage System for the SRB Data Grid

A Simple Mass Storage System for the SRB Data Grid A Simple Mass Storage System for the SRB Data Grid Michael Wan, Arcot Rajasekar, Reagan Moore, Phil Andrews San Diego Supercomputer Center SDSC/UCSD/NPACI Outline Motivations for implementing a Mass Storage

More information

InfoBrief. Platform ROCKS Enterprise Edition Dell Cluster Software Offering. Key Points

InfoBrief. Platform ROCKS Enterprise Edition Dell Cluster Software Offering. Key Points InfoBrief Platform ROCKS Enterprise Edition Dell Cluster Software Offering Key Points High Performance Computing Clusters (HPCC) offer a cost effective, scalable solution for demanding, compute intensive

More information

Sriram Krishnan

Sriram Krishnan A Web Services Based Architecture for Biomedical Applications Sriram Krishnan sriram@sdsc.edu Goals Enabling integration across multi-scale biomedical applications Leveraging geographically distributed,

More information

NextData System of Systems Infrastructure (ND-SoS-Ina)

NextData System of Systems Infrastructure (ND-SoS-Ina) NextData System of Systems Infrastructure (ND-SoS-Ina) DELIVERABLE D2.3 (CINECA, CNR-IIA) - Web Portal Architecture DELIVERABLE D4.1 (CINECA, CNR-IIA) - Test Infrastructure Document identifier: D2.3 D4.1

More information

ACET s e-research Activities

ACET s e-research Activities 18 June 2008 1 Computing Resources 2 Computing Resources Scientific discovery and advancement of science through advanced computing Main Research Areas Computational Science Middleware Technologies for

More information

DATA MANAGEMENT SYSTEMS FOR SCIENTIFIC APPLICATIONS

DATA MANAGEMENT SYSTEMS FOR SCIENTIFIC APPLICATIONS DATA MANAGEMENT SYSTEMS FOR SCIENTIFIC APPLICATIONS Reagan W. Moore San Diego Supercomputer Center San Diego, CA, USA Abstract Scientific applications now have data management requirements that extend

More information

Digital Curation and Preservation: Defining the Research Agenda for the Next Decade

Digital Curation and Preservation: Defining the Research Agenda for the Next Decade Storage Resource Broker Digital Curation and Preservation: Defining the Research Agenda for the Next Decade Reagan W. Moore moore@sdsc.edu http://www.sdsc.edu/srb Background NARA research prototype persistent

More information

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography Christopher Crosby, San Diego Supercomputer Center J Ramon Arrowsmith, Arizona State University Chaitan

More information

The Canadian CyberSKA Project

The Canadian CyberSKA Project The Canadian CyberSKA Project A. G. Willis (on behalf of the CyberSKA Project Team) National Research Council of Canada Herzberg Institute of Astrophysics Dominion Radio Astrophysical Observatory May 24,

More information

The NeuroLOG Platform Federating multi-centric neuroscience resources

The NeuroLOG Platform Federating multi-centric neuroscience resources Software technologies for integration of process and data in medical imaging The Platform Federating multi-centric neuroscience resources Johan MONTAGNAT Franck MICHEL Vilnius, Apr. 13 th 2011 ANR-06-TLOG-024

More information

Managing Petabytes of data with irods. Jean-Yves Nief CC-IN2P3 France

Managing Petabytes of data with irods. Jean-Yves Nief CC-IN2P3 France Managing Petabytes of data with irods Jean-Yves Nief CC-IN2P3 France Talk overview Data management context. Some data management goals: Storage virtualization. Virtualization of the data management policy.

More information

The International Journal of Digital Curation Issue 1, Volume

The International Journal of Digital Curation Issue 1, Volume Towards a Theory of Digital Preservation 63 Towards a Theory of Digital Preservation Reagan Moore, San Diego Supercomputer Center June 2008 Abstract A preservation environment manages communication from

More information

A High-Level Distributed Execution Framework for Scientific Workflows

A High-Level Distributed Execution Framework for Scientific Workflows A High-Level Distributed Execution Framework for Scientific Workflows Jianwu Wang 1, Ilkay Altintas 1, Chad Berkley 2, Lucas Gilbert 1, Matthew B. Jones 2 1 San Diego Supercomputer Center, UCSD, U.S.A.

More information

EGEE and Interoperation

EGEE and Interoperation EGEE and Interoperation Laurence Field CERN-IT-GD ISGC 2008 www.eu-egee.org EGEE and glite are registered trademarks Overview The grid problem definition GLite and EGEE The interoperability problem The

More information

Neuro-imaging Informatics:

Neuro-imaging Informatics: Neuro-imaging Informatics: Neil Killeen The Centre for Neuroscience University of Melbourne, Victoria 3010, Australia Talk Outline Neuro-Informatics/Neuro-imaging Informatics Components of System and Workflow

More information

DataONE: Open Persistent Access to Earth Observational Data

DataONE: Open Persistent Access to Earth Observational Data Open Persistent Access to al Robert J. Sandusky, UIC University of Illinois at Chicago The Net Partners Update: ONE and the Conservancy December 14, 2009 Outline NSF s Net Program ONE Introduction Motivating

More information

A Simplified Access to Grid Resources for Virtual Research Communities

A Simplified Access to Grid Resources for Virtual Research Communities Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA A Simplified Access to Grid Resources for Virtual Research Communities Roberto BARBERA (1-3), Marco FARGETTA (3,*) and Riccardo ROTONDO (2) (1) Department

More information

An End-to-End Web Services-based Infrastructure for Biomedical Applications

An End-to-End Web Services-based Infrastructure for Biomedical Applications An End-to-End Web Services-based Infrastructure for Biomedical Applications Sriram Krishnan *, Kim K. Baldridge, Jerry P. Greenberg, Brent Stearn and Karan Bhatia * sriram@sdsc.edu Modeling and Analysis

More information

Knowledge-based Grids

Knowledge-based Grids Knowledge-based Grids Reagan Moore San Diego Supercomputer Center (http://www.npaci.edu/dice/) Data Intensive Computing Environment Chaitan Baru Walter Crescenzi Amarnath Gupta Bertram Ludaescher Richard

More information

Introducing IBM WebSphere CloudBurst Appliance and IBM WebSphere Application Server Hypervisor Edition

Introducing IBM WebSphere CloudBurst Appliance and IBM WebSphere Application Server Hypervisor Edition Introducing IBM WebSphere CloudBurst Appliance and IBM WebSphere Application Server Hypervisor Edition Reduced cost and time-to-value; increased correctness and agility R.Vinoth Industry Solution Architect

More information

Report. Middleware Proxy: A Request-Driven Messaging Broker For High Volume Data Distribution

Report. Middleware Proxy: A Request-Driven Messaging Broker For High Volume Data Distribution CERN-ACC-2013-0237 Wojciech.Sliwinski@cern.ch Report Middleware Proxy: A Request-Driven Messaging Broker For High Volume Data Distribution W. Sliwinski, I. Yastrebov, A. Dworak CERN, Geneva, Switzerland

More information

Potential for Technology Innovation within the Internet2 Community: A Five-Year View

Potential for Technology Innovation within the Internet2 Community: A Five-Year View Potential for Technology Innovation within the Internet2 Community: A Five-Year View Steve Corbató Managing Director, Technology Direction & Development Industry Strategy Council meeting DTW Westin 17

More information

THE WIDE AREA GRID. Architecture

THE WIDE AREA GRID. Architecture THE WIDE AREA GRID Architecture Context The Wide Area Grid concept was discussed during several WGISS meetings The idea was to imagine and experiment an infrastructure that could be used by agencies to

More information

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research Dr Paul Calleja Director of Research Computing University of Cambridge Global leader in science & technology

More information

A Tale of Two Grids. Two Very Successful Grids

A Tale of Two Grids. Two Very Successful Grids A Tale of Two Grids Dr. Philip Papadopoulos Program Director, Grid and Cluster Computing San Diego Supercomputer Center University of California, San Diego phil@sdsc.edu April 2004 http://www.pragma-grid.net

More information

Kepler and Grid Systems -- Early Efforts --

Kepler and Grid Systems -- Early Efforts -- Distributed Computing in Kepler Lead, Scientific Workflow Automation Technologies Laboratory San Diego Supercomputer Center, (Joint work with Matthew Jones) 6th Biennial Ptolemy Miniconference Berkeley,

More information

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION The process of planning and executing SQL Server migrations can be complex and risk-prone. This is a case where the right approach and

More information

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Advanced School in High Performance and GRID Computing November Introduction to Grid computing. 1967-14 Advanced School in High Performance and GRID Computing 3-14 November 2008 Introduction to Grid computing. TAFFONI Giuliano Osservatorio Astronomico di Trieste/INAF Via G.B. Tiepolo 11 34131 Trieste

More information

South African Science Gateways

South African Science Gateways Co-ordination & Harmonisation of Advanced e-infrastructures for Research and Education Data Sharing Research Infrastructures Grant Agreement n. 306819 South African Science Gateways Bruce Becker, Coordinator,

More information

Virtual Organizations in Academic Settings

Virtual Organizations in Academic Settings Virtual Organizations in Academic Settings Alan Sill Senior Scientist, Texas Internet Grid for Research and Education and Adjunct Professor of Physics Texas Tech University Dec. 6, 2006 Internet2 Fall

More information

CEER Cyber-Physical Testbeds (a generational leap)

CEER Cyber-Physical Testbeds (a generational leap) CEER Cyber-Physical Testbeds (a generational leap) CEER: Cyber-Physical Experimentation (testbed operation support) DATA ASSETS CLOUD Customer TESTBED PEOPLE PROVISION LOCAL SCIENCE Other Testbeds Testbed

More information

Genomics on Cisco Metacloud + SwiftStack

Genomics on Cisco Metacloud + SwiftStack Genomics on Cisco Metacloud + SwiftStack Technology is a large component of driving discovery in both research and providing timely answers for clinical treatments. Advances in genomic sequencing have

More information

Grid Scheduling Architectures with Globus

Grid Scheduling Architectures with Globus Grid Scheduling Architectures with Workshop on Scheduling WS 07 Cetraro, Italy July 28, 2007 Ignacio Martin Llorente Distributed Systems Architecture Group Universidad Complutense de Madrid 1/38 Contents

More information

Introduction to Grid Technology

Introduction to Grid Technology Introduction to Grid Technology B.Ramamurthy 1 Arthur C Clarke s Laws (two of many) Any sufficiently advanced technology is indistinguishable from magic." "The only way of discovering the limits of the

More information

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

Grid Computing Middleware. Definitions & functions Middleware components Globus glite Seminar Review 1 Topics Grid Computing Middleware Grid Resource Management Grid Computing Security Applications of SOA and Web Services Semantic Grid Grid & E-Science Grid Economics Cloud Computing 2 Grid

More information

MONTE CARLO SIMULATION FOR RADIOTHERAPY IN A DISTRIBUTED COMPUTING ENVIRONMENT

MONTE CARLO SIMULATION FOR RADIOTHERAPY IN A DISTRIBUTED COMPUTING ENVIRONMENT The Monte Carlo Method: Versatility Unbounded in a Dynamic Computing World Chattanooga, Tennessee, April 17-21, 2005, on CD-ROM, American Nuclear Society, LaGrange Park, IL (2005) MONTE CARLO SIMULATION

More information

Nancy Wilkins-Diehr San Diego Supercomputer Center (SDSC) University of California at San Diego

Nancy Wilkins-Diehr San Diego Supercomputer Center (SDSC) University of California at San Diego SimpleGrid Toolkit: Enabling Efficient Learning and Development of TeraGrid Science Gateway Shaowen Wang Yan Liu CyberInfrastructure and Geospatial Information Laboratory (CIGI) National Center for Supercomputing

More information

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN Application of Virtualization Technologies & CernVM Benedikt Hegner CERN Virtualization Use Cases Worker Node Virtualization Software Testing Training Platform Software Deployment }Covered today Server

More information

The Portal Aspect of the LSST Science Platform. Gregory Dubois-Felsmann Caltech/IPAC. LSST2017 August 16, 2017

The Portal Aspect of the LSST Science Platform. Gregory Dubois-Felsmann Caltech/IPAC. LSST2017 August 16, 2017 The Portal Aspect of the LSST Science Platform Gregory Dubois-Felsmann Caltech/IPAC LSST2017 August 16, 2017 1 Purpose of the LSST Science Platform (LSP) Enable access to the LSST data products Enable

More information

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Why the Grid? Science is becoming increasingly digital and needs to deal with increasing amounts of

More information

BEYOND AUTHENTICATION IDENTITY AND ACCESS MANAGEMENT FOR THE MODERN ENTERPRISE

BEYOND AUTHENTICATION IDENTITY AND ACCESS MANAGEMENT FOR THE MODERN ENTERPRISE BEYOND AUTHENTICATION IDENTITY AND ACCESS MANAGEMENT FOR THE MODERN ENTERPRISE OUR ORGANISATION AND SPECIALIST SKILLS Focused on delivery, integration and managed services around Identity and Access Management.

More information

Data Intensive processing with irods and the middleware CiGri for the Whisper project Xavier Briand

Data Intensive processing with irods and the middleware CiGri for the Whisper project Xavier Briand and the middleware CiGri for the Whisper project Use Case of Data-Intensive processing with irods Collaboration between: IT part of Whisper: Sofware development, computation () Platform Ciment: IT infrastructure

More information

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT. Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies

More information

a.k.a. Introducing the IBM MQ Appliance

a.k.a. Introducing the IBM MQ Appliance Understanding MQ Deployment Choices and Use Cases a.k.a. Introducing the IBM MQ Appliance Morag Hughson hughson@uk.ibm.com Session # 17060 Introduction Introducing IBM MQ Appliance The scalability and

More information

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development Jeremy Fischer Indiana University 9 September 2014 Citation: Fischer, J.L. 2014. ACCI Recommendations on Long Term

More information

Internet of Things (IoT) CSE237A

Internet of Things (IoT) CSE237A Internet of Things (IoT) CSE237A Class Overview What ve covered until now: All material that will be on exam! Where we are going today: IoT & exam review Due today: Article on IoT HW3 at 11:59pm; upload.pdf

More information

LONI IMAGE & DATA ARCHIVE USER MANUAL

LONI IMAGE & DATA ARCHIVE USER MANUAL LONI IMAGE & DATA ARCHIVE USER MANUAL Laboratory of Neuro Imaging Dr. Arthur W. Toga, Director April, 2017 LONI Image & Data Archive INTRODUCTION The LONI Image & Data Archive (IDA) is a user-friendly

More information

XSEDE s Campus Bridging Project Jim Ferguson National Institute for Computational Sciences

XSEDE s Campus Bridging Project Jim Ferguson National Institute for Computational Sciences January 3, 2016 XSEDE s Campus Bridging Project Jim Ferguson National Institute for Computational Sciences jwf@utk.edu What is XSEDE? extreme Science and Engineering Discovery Environment $121M project

More information

N. Marusov, I. Semenov

N. Marusov, I. Semenov GRID TECHNOLOGY FOR CONTROLLED FUSION: CONCEPTION OF THE UNIFIED CYBERSPACE AND ITER DATA MANAGEMENT N. Marusov, I. Semenov Project Center ITER (ITER Russian Domestic Agency N.Marusov@ITERRF.RU) Challenges

More information

HIGH PERFORMANCE COMPUTING (PLATFORMS) SECURITY AND OPERATIONS

HIGH PERFORMANCE COMPUTING (PLATFORMS) SECURITY AND OPERATIONS HIGH PERFORMANCE COMPUTING (PLATFORMS) SECURITY AND OPERATIONS AT PITT Kim F. Wong Center for Research Computing SAC-PA, June 22, 2017 Our service The mission of the Center for Research Computing is to

More information

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation, Integration Alan Blatecky Director OCI 1 1 Framing the

More information

Web Services Based Instrument Monitoring and Control

Web Services Based Instrument Monitoring and Control Web Services Based Instrument Monitoring and Control Peter Turner, 1 Ian M. Atkinson, 2 Douglas du Boulay, 1 Cameron Huddlestone-Holmes, 2 Tristan King, 2 Romain Quilici, 1 Mathew Wyatt, 2 Donald F. McMullen,

More information

Cisco Unified Computing System Delivering on Cisco's Unified Computing Vision

Cisco Unified Computing System Delivering on Cisco's Unified Computing Vision Cisco Unified Computing System Delivering on Cisco's Unified Computing Vision At-A-Glance Unified Computing Realized Today, IT organizations assemble their data center environments from individual components.

More information

Graphical System Design

Graphical System Design Graphical System Design Nancy Dib Marketing Manager 21 st Century Challenges Engineering Grand Challenges, NAE Advance health informatics Engineer the tools of scientific discovery Reverse-engineer the

More information

Georgia State University Cyberinfrastructure Plan

Georgia State University Cyberinfrastructure Plan Georgia State University Cyberinfrastructure Plan Summary Building relationships with a wide ecosystem of partners, technology, and researchers are important for GSU to expand its innovative improvements

More information

Flexible HPC for Bio-informatics. Peter Clapham

Flexible HPC for Bio-informatics. Peter Clapham Flexible HPC for Bio-informatics Peter Clapham Overview Overview of the Sanger Institute How our data flow works today New scientific demands Private cloud deployment Transitional and future challenges

More information

How to build Scientific Gateways with Vine Toolkit and Liferay/GridSphere framework

How to build Scientific Gateways with Vine Toolkit and Liferay/GridSphere framework How to build Scientific Gateways with Vine Toolkit and Liferay/GridSphere framework Piotr Dziubecki, Piotr Grabowski, Michał Krysiński, Tomasz Kuczyński, Dawid Szejnfeld, Dominik Tarnawczyk, Gosia Wolniewicz

More information

Data Virtualization Implementation Methodology and Best Practices

Data Virtualization Implementation Methodology and Best Practices White Paper Data Virtualization Implementation Methodology and Best Practices INTRODUCTION Cisco s proven Data Virtualization Implementation Methodology and Best Practices is compiled from our successful

More information

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute Pegasus Workflow Management System Gideon Juve USC Informa3on Sciences Ins3tute Scientific Workflows Orchestrate complex, multi-stage scientific computations Often expressed as directed acyclic graphs

More information

MediGRID Grid Computing for Medicine and Life Sciences

MediGRID Grid Computing for Medicine and Life Sciences MediGRID Grid Computing for Medicine and Life Sciences Anette Weisbecker, Fraunhofer IAO, Stuttgart Otto Rienhoff, Georg-August-Universität, Göttingen International Symposium on Grid Computing Taipei,

More information

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4 An Overview of Grid Computing Workshop Day 1 : August 05 2004 (Thursday) An overview of Globus Toolkit 2.4 By CDAC Experts Contact :vcvrao@cdacindia.com; betatest@cdacindia.com URL : http://www.cs.umn.edu/~vcvrao

More information

Virtualization of Workflows for Data Intensive Computation

Virtualization of Workflows for Data Intensive Computation Virtualization of Workflows for Data Intensive Computation Sreekanth Pothanis (1,2), Arcot Rajasekar (3,4), Reagan Moore (3,4). 1 Center for Computation and Technology, Louisiana State University, Baton

More information

Knowledge Discovery Services and Tools on Grids

Knowledge Discovery Services and Tools on Grids Knowledge Discovery Services and Tools on Grids DOMENICO TALIA DEIS University of Calabria ITALY talia@deis.unical.it Symposium ISMIS 2003, Maebashi City, Japan, Oct. 29, 2003 OUTLINE Introduction Grid

More information

Technical Overview. Access control lists define the users, groups, and roles that can access content as well as the operations that can be performed.

Technical Overview. Access control lists define the users, groups, and roles that can access content as well as the operations that can be performed. Technical Overview Technical Overview Standards based Architecture Scalable Secure Entirely Web Based Browser Independent Document Format independent LDAP integration Distributed Architecture Multiple

More information

Scaling a Global File System to the Greatest Possible Extent, Performance, Capacity, and Number of Users

Scaling a Global File System to the Greatest Possible Extent, Performance, Capacity, and Number of Users Scaling a Global File System to the Greatest Possible Extent, Performance, Capacity, and Number of Users Phil Andrews, Bryan Banister, Patricia Kovatch, Chris Jordan San Diego Supercomputer Center University

More information

Cloud Computing. Up until now

Cloud Computing. Up until now Cloud Computing Lecture 4 and 5 Grid: 2012-2013 Introduction. Up until now Definition of Cloud Computing. Grid Computing: Schedulers: Condor SGE 1 Summary Core Grid: Toolkit Condor-G Grid: Conceptual Architecture

More information

Managing the Evolution of Dataflows with VisTrails

Managing the Evolution of Dataflows with VisTrails Managing the Evolution of Dataflows with VisTrails Juliana Freire http://www.cs.utah.edu/~juliana University of Utah Joint work with: Steven P. Callahan, Emanuele Santos, Carlos E. Scheidegger, Claudio

More information

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science T. Maeno, K. De, A. Klimentov, P. Nilsson, D. Oleynik, S. Panitkin, A. Petrosyan, J. Schovancova, A. Vaniachine,

More information

GAMA: Grid Account Management Architecture

GAMA: Grid Account Management Architecture GAMA: Grid Account Management Architecture Karan Bhatia, Sandeep Chandra, Kurt Mueller San Diego Supercomputer Center {karan,chandras,kurt}@sdsc.edu Abstract Security is a critical component of grid systems

More information

THEBES: THE GRID MIDDLEWARE PROJECT Project Overview, Status Report and Roadmap

THEBES: THE GRID MIDDLEWARE PROJECT Project Overview, Status Report and Roadmap THEBES: THE GRID MIDDLEWARE PROJECT Project Overview, Status Report and Roadmap Arnie Miles Georgetown University adm35@georgetown.edu http://thebes.arc.georgetown.edu The Thebes middleware project was

More information

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets Jeffrey Brown, Lesley Curtis, and Rich Platt June 13, 2014 Previously The NIH Collaboratory:

More information

An Experience in Accessing Grid Computing from Mobile Device with GridLab Mobile Services

An Experience in Accessing Grid Computing from Mobile Device with GridLab Mobile Services An Experience in Accessing Grid Computing from Mobile Device with GridLab Mobile Services Riri Fitri Sari, Rene Paulus Department of Electrical Engineering, Faculty of Engineering University of Indonesia

More information

iscsi Technology: A Convergence of Networking and Storage

iscsi Technology: A Convergence of Networking and Storage HP Industry Standard Servers April 2003 iscsi Technology: A Convergence of Networking and Storage technology brief TC030402TB Table of Contents Abstract... 2 Introduction... 2 The Changing Storage Environment...

More information

Issues Regarding fmri Imaging Workflow and DICOM

Issues Regarding fmri Imaging Workflow and DICOM Issues Regarding fmri Imaging Workflow and DICOM Lawrence Tarbox, Ph.D. Fred Prior, Ph.D Mallinckrodt Institute of Radiology Washington University in St. Louis What is fmri fmri is used to localize functions

More information

UNICORE Globus: Interoperability of Grid Infrastructures

UNICORE Globus: Interoperability of Grid Infrastructures UNICORE : Interoperability of Grid Infrastructures Michael Rambadt Philipp Wieder Central Institute for Applied Mathematics (ZAM) Research Centre Juelich D 52425 Juelich, Germany Phone: +49 2461 612057

More information

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Unleash Your Data Center s Hidden Power September 16, 2014 Molly Rector CMO, EVP Product Management & WW Marketing

More information

Grid Portal Architectures for Scientific Applications

Grid Portal Architectures for Scientific Applications Grid Portal Architectures for Scientific Applications M. P. Thomas 1, J. Burruss 2, L. Cinquini 3, G. Fox 4, D. Gannon 5, L. Gilbert 6, G. von Laszewski 7, K. Jackson 8, D. Middleton 3, R. Moore 6, M.

More information

Introduction to Grid Infrastructures

Introduction to Grid Infrastructures Introduction to Grid Infrastructures Stefano Cozzini 1 and Alessandro Costantini 2 1 CNR-INFM DEMOCRITOS National Simulation Center, Trieste, Italy 2 Department of Chemistry, Università di Perugia, Perugia,

More information