The NeuroLOG Platform Federating multi-centric neuroscience resources

Similar documents
NeuroLOG WP1 Sharing Data & Metadata

Toward ontology-based federated systems for sharing medical images: lessons from the NeuroLOG experience Bernard Gibaud

Multiple Sclerosis Brain MRI Segmentation Workflow deployment on the EGEE grid

NeuroBase: An Information System for Managing Distributed Knowledge and Data Bases in Neuroimaging. Christian BARILLOT DR CNRS

Specification of the NeuroLOG architecture components

Software composition for scientific workflows

Neurobase: Sharing data and image processing tools in neuroimaging

Juliusz Pukacki OGF25 - Grid technologies in e-health Catania, 2-6 March 2009

NeuroLOG Client User Guide

Manifold Learning: Applications in Neuroimaging

Joint Tumor Segmentation and Dense Deformable Registration of Brain MR Images

Semantic SOA - Realization of the Adaptive Services Grid

Engineering challenges in mhealth systems Octav Chipara

Identität und Autorisierung als Grundlage für sichere Web-Services. Dr. Hannes P. Lubich IT Security Strategist

Distributed Repository for Biomedical Applications

NeuroLOG: a community-driven middleware design

ETL is No Longer King, Long Live SDD

NeuroQLab A Software Assistant for Neurosurgical Planning and Quantitative Image Analysis

Integrative Informatics

Improving Collaborations in Neuroscientist Community

Grid-wide neuroimaging data federation in the context of the NeuroLOG project

IMPROVING COLLABORATIONS IN NEUROSCIENTIST COMMUNITY

Characterizing semantic service parameters with Role concepts to infer domain-specific knowledge at runtime

Learning-based Neuroimage Registration

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

Managing CDISC version changes: how & when to implement? Presented by Lauren Shinaberry, Project Manager Business & Decision Life Sciences

Army Data Services Layer (ADSL) Data Mediation Providing Data Interoperability and Understanding in a

University of British Columbia Library. Persistent Digital Collections Implementation Plan. Final project report Summary version

Realizing the Army Net-Centric Data Strategy (ANCDS) in a Service Oriented Architecture (SOA)

Software Engineering

Implementing the Army Net Centric Data Strategy in a Service Oriented Environment

VISION Virtualized Storage Services Foundation for the Future Internet

The ICT for Health Perspective

better images mean better results

We recommend you cite the published version. The publisher s URL is:

Development of a Large-scale Neuroimages and Clinical Variables Data Atlas in the neugrid4you (N4U) project

Web Services Composition: Mashups Driven Orchestration Definition

Global Reference Architecture: Overview of National Standards. Michael Jacobson, SEARCH Diane Graski, NCSC Oct. 3, 2013 Arizona ewarrants

Panel 1 Service Platform and Network Infrastructure for Ubiquitous Services

Chapter 17 Web Services Additional Topics

Conception of Information Systems Lecture 1: Basics

NeuroLOG Security Policy proposal

Amigo Symposium 28 February 2008

Huntington s Disease and Vertex Pharmaceuticals

Chapter Outline. Chapter 2 Distributed Information Systems Architecture. Layers of an information system. Design strategies.

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

My Health, My Data (and other related projects) Yannis Ioannidis ATHENA Research Center & University of Athens

Extending SOA Infrastructure for Semantic Interoperability

GRID COMPUTING IN MEDICAL APPLICATIONS

Organizing and Managing Grassroots Enterprise Mashup Environments. Doctorial Thesis, 24 th June, Volker Hoyer

Grid Platform for Medical Federated Queries Supporting Semantic and Visual Annotations

Preprocessing, Management, and Analysis of Mass Spectrometry Proteomics Data

Enterprise Architecture Deployment Options. Mark Causley Sandy Milliken Sue Martin

Introduction to Federation Server

A Vision for Bigger Biomedical Data: Integration of REDCap with Other Data Sources

EXABEAM HELPS PROTECT INFORMATION SYSTEMS

Using standards to make sense of data

Secure Enterprise Access to Support Collaboration on Clinical Research

Executive Summary for deliverable D6.1: Definition of the PFS services (requirements, initial design)

DICOM Research Applications - life at the fringe of reality

The GEMOC Initiative On the Globalization of Modeling Languages

DRAGEN Bio-IT Platform Enabling the Global Genomic Infrastructure

Six Sigma in the datacenter drives a zero-defects culture

High Performance Computing Course Notes Course Administration

Data Integration and Data Warehousing Database Integration Overview

An Integrated e-science Analysis Base for Computational Neuroscience Experiments and Analysis

Concurrent Visualization of and Mapping between 2D and 3D Medical Images for Disease Pattern Analysis

Ramnish Singh IT Advisor Microsoft Corporation Session Code:

Towards a Long Term Research Agenda for Digital Library Research. Yannis Ioannidis University of Athens

An Information Sharing Platform Prototype for Hadron Therapy

Designing an institutional research data management infrastructure for the life sciences

ITU-T Y Next generation network evolution phase 1 Overview

Automatic Registration-Based Segmentation for Neonatal Brains Using ANTs and Atropos

Medical Image Registration

Design A Database Schema For A Hospital

Outline. The MammoGrid project Meta-Data and domain ontology (flexibility) Meta-Data, services and grid (openness) Query negotiator Outlook

Issues Regarding fmri Imaging Workflow and DICOM

RiskSense Attack Surface Validation for IoT Systems

Medicaid: Beyond the Silos Series Health and Housing Integration August 7, Arizona Environment

From gridified scripts to workflows: the FSL Feat case

CHIEF INFORMATION OFFICER

Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry

Fujitsu World Tour 2018

Certified Professional in Enterprise Management (CPEM) Exam Preparation Boot Camp

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018

Solving the Enterprise Data Dilemma

Sources in Neuroimaging: The NeuroBase Project

High Performance Computing Course Notes HPC Fundamentals

Web Services. Lecture I. Valdas Rapševičius Vilnius University Faculty of Mathematics and Informatics

IBM Cloud Security for the Cloud. Amr Ismail Security Solutions Sales Leader Middle East & Pakistan

Pooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit

Medical Image Registration by Maximization of Mutual Information

Medical Image Segmentation

Neuroimaging Analysis using Grid Aware Planning and Optimisation Techniques

Click to edit Master title style

The MDPHnet Distributed Querying for Public Health Surveillance

Reducing Consumer Uncertainty

Wang Jian, He Keqing, SKLSE, Wuhan University, China

Archiving. Services. Optimize the management of information by defining a lifecycle strategy for data. Archiving. ediscovery. Data Loss Prevention

MARS: Multiple Atlases Robust Segmentation

Transcription:

Software technologies for integration of process and data in medical imaging The Platform Federating multi-centric neuroscience resources Johan MONTAGNAT Franck MICHEL Vilnius, Apr. 13 th 2011 ANR-06-TLOG-024 http://neurolog.polytech.unice.fr

Neurosciences requirements Major challenge for this century population aging, brain disorders growth, brain function understanding... Large medical image databases Statistical studies Population-specific atlases of the brain Data intensive procedures Heterogeneous data sets Different acquisition conditions, centers Several imaging modalities Associated clinical information Complex data analysis procedures Specific to some modalities, acquisition parameters Minutes to hours of computation time each Chained into application pipelines (workflows) Sensitive data Stringent access control requirements ANR-06-TLOG-024 2

Collaborative approach Sharing Computing algorithms and resources Research (populations studies, models design, validation, statistics) Complex analysis algorithms & pipelines (compute intensive image processing, time constraints...) Data Procedures Processing tools Computing power ANR-06-TLOG-024 3

Brain atrophy measure workflow Detection of the longitudinal brain volume change is an issue of central relevance in neuroimaging. Early diagnosis for neurodegenerative diseases (e.g. Alzheimer's). Reduction of costs in clinical trials, increasing of the power in longitudinal studies. ANR-06-TLOG-024 4

Inputs: longitudinal study Baseline image (T0) Other time point mages (T0 + 6 months, T0 + 12 months...) ANR-06-TLOG-024 5

Image normalization Space alignment (registration) Intensity alignment ANR-06-TLOG-024 6

Parameters extraction Mask Brain extraction 1044901-10294 -0.009 1044901-10484 -0.010 Quantitative parameters (atrophy measurement For Alzheimer's disease diagnosis) Deformation field computation ANR-06-TLOG-024 7

Generic infrastructure limitations The grid provides a foundational layer for distributed, intensive computing Distributed files, large number of computing tasks Gap between grid infrastructures and medical environment Low level foundational middlewares Complex requirements from the health community Need for neuroradiological data integration Domain-specific data representation, mediation for existing databases Legacy neuroscience computing environments Bridging local and grid resources Neurology data analysis pipelines Need to integrate neuro-data analysis codes and procedures Access control and privacy The foundational security layer needs to be refined with adapted security policies ANR-06-TLOG-024 8

objectives Enable the sharing of resources: Data & knowledge representation Ontologies + relational schema Neuroradiological data & associated metadata Distributed on neuroscience centers + EGI grid resources Integration of heterogeneous data stores Image analysis tools Bundled, relocatable, remote invocation Application pipelines Sites computing resources Four pathologies Multiple Sclerosis, brain strokes, brain tumors, Alzheimer's disease ANR-06-TLOG-024 9

Middleware design ANR-06-TLOG-024 10

Software architecture ANR-06-TLOG-024 11

Platform deployment 5 sites connected 4 collaborating hosiptals Pitié Salpétrière (Paris) Michalon (Grenoble) CHU Rennes Antoine Lacassagne (Nice) 7 academic partners I3S, IRISA, GIN, MIS, IFR49, INRIA Sophia, LRI 2 companies SAP, Visioscopie ANR-06-TLOG-024 12

Data management layer Provide a seamless access to heterogeneous distributed data Heterogeneous data (modality, clinical context ) Heterogeneous legacy database providers & schemas Heterogeneous file systems, resource storage units (local, grid) Need to provide: Federated view of the metadata Common access to physical files While enforcing strong constraints: Each partner site should keep control of access to their data Keep autonomous data management on each site: weak coupling Legacy data stores should not be altered Ensure secure access to sensitive data and metadata ANR-06-TLOG-024 13

Ontology Common relational schema Federated relational schema Variables Study Instrument Scores Examination Assessment Subject MR Protocols Dataset ANR-06-TLOG-024 14

Data management layer Approach Derived the ontology into relational Federated Schema A dynamic mediation & federation interface maps local database schemas to the federated schema A file transfer interface makes files available to the end-user or processing tools Come up with a global federated view that hides data distribution and heterogeneity from the end-user ANR-06-TLOG-024 15

Sharing image analysis tools Generic Application Service Wrapper (GASW) Service wrapper to non instrumented code Tool packaging in re-locatable self-contained executable units Expose tools as web services, standard invocation interface Handle data transfer Remote execution capability on the EGI grid Tools discovery through the federated view Executable remotely by any authorized user ANR-06-TLOG-024 16

Enabling processing pipelines MOTEUR workflow engine Generic workflow design and execution Support for different interfaces to processors Data and processing parallelism Handles stand-alone (client) and client-server deployment ANR-06-TLOG-024 17

Distributed data access control Multiple credentials per user Grid certificates (delivered by grid authority) Middleware certificates (delivered by site authority) Databases credential (SQL 92) Health professional smartcards Single sign-on enforced security policy Individuals identification Distributed security administration No central point of control Sites keep access control over all their data Adapts to heterogeneous site security policies ANR-06-TLOG-024 18

Distributed data access control Each site data access policy prevails for the data items the site owns rule : { StudyA ; read ; } ANR-06-TLOG-024 19

Results collaborating platform for multi-centric studies Integrate heterogeneous & distributed legacy data sets Share image analysis tools, distribute invocation Build complex experiment pipelines Distributed access control with prevailing local policies Advanced functionality High level ontology-based data representation EGI Grid interface, large-scale distributed processing The grid for neuroscientists Transparent access to grid resources Compliance with legacy environments ANR-06-TLOG-024 20

Limitations Semantic validation not yet integrated At processing tool annotation time: check compatibility of inputs/outputs with DataSet Processing class constraints At w/f design time: user-assisted composition checks compatibility of inputs/outputs of composed services At run time: check validity of actual inputs for each service Produced semantic data is still limited Developments on-going to provide richer semantic description of produced datasets using reasoning EGI interface still limited Integration work on-going to complete remote invocation and retrieval of results from storage elements ANR-06-TLOG-024 21

Global picture PS2 S3 Q S 4 Workflow Manager ANR-06-TLOG-024 22

Thank you ANR-06-TLOG-024 23