GPU Data Mining in Neuroimaging Genomics

Similar documents
Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry

MARS: Multiple Atlases Robust Segmentation

NIH Public Access Author Manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2015 January 01.

MARS: Multiple Atlases Robust Segmentation

Efficient Computation of Radial Distribution Function on GPUs

Dimension Reduction for Big Data Analysis. Dan Shen. Department of Mathematics & Statistics University of South Florida.

The Anatomical Equivalence Class Formulation and its Application to Shape-based Computational Neuroanatomy

Sharing Biomedical Research Data. Michael F. Huerta, Ph.D. Associate Director National Library of Medicine, NIH

Learning-based Neuroimage Registration

GPU-Based Implementation of a Computational Model of Cerebral Cortex Folding

Processing math: 100% Intensity Normalization

Graphical Models, Bayesian Method, Sampling, and Variational Inference

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS

Sparse Screening for Exact Data Reduction

Applications of admixture models

ASAP_2.0 (Automatic Software for ASL Processing) USER S MANUAL

Introduction to Design Optimization: Search Methods

Blood vessel tracking in retinal images

The organization of the human cerebral cortex estimated by intrinsic functional connectivity

Click on "+" button Select your VCF data files (see #Input Formats->1 above) Remove file from files list:

Semi-supervised Hierarchical Multimodal Feature and Sample Selection for Alzheimer s Disease Diagnosis

Large scale Imaging on Current Many- Core Platforms

Adaptive-Mesh-Refinement Hydrodynamic GPU Computation in Astrophysics

Medical Image Registration by Maximization of Mutual Information

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

A probabilistic logic incorporating posteriors of hierarchic graphical models

Genomics. Nolan C. Kane

BGGN-213: FOUNDATIONS OF BIOINFORMATICS (Lecture 14)

Power Estimates for Voxel-Based Genetic Association Studies using Diffusion Imaging

NIH Public Access Author Manuscript Conf Proc (Midwest Symp Circuits Syst). Author manuscript; available in PMC 2015 August 01.

VIDEO SEARCHING AND BROWSING USING VIEWFINDER

PhD: a web database application for phenotype data management

Lecture Overview. Sequence search & alignment. Searching sequence databases. Sequence Alignment & Search. Goals: Motivations:

Performance Evaluation of the TINA Medical Image Segmentation Algorithm on Brainweb Simulated Images

Statistical Data Mining

LECTURE 20: SWARM INTELLIGENCE 6 / ANT COLONY OPTIMIZATION 2

Sequence Alignment & Search

A New GPU-Based Level Set Method for Medical Image Segmentation

GPU ACCELERATED SELF-JOIN FOR THE DISTANCE SIMILARITY METRIC

Monte-Carlo Analysis of Object Reentry in Earth s Atmosphere Based on Taguchi Method

Neuroimage Processing

Orifice Flow Meter

Introduction to Computer Graphics with WebGL

Correction for multiple comparisons. Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh

Immersive Out-of-Core Visualization of Large-Size and Long-Timescale Molecular Dynamics Trajectories

K-Means Segmentation of Alzheimer s Disease In Pet Scan Datasets An Implementation

for Images A Bayesian Deformation Model

Mass Spec Data Post-Processing Software. ClinProTools. Wayne Xu, Ph.D. Supercomputing Institute Phone: Help:

1 Introduction Motivation and Aims Functional Imaging Computational Neuroanatomy... 12

Application of MCNP Code in Shielding Design for Radioactive Sources

Lattice QCD on Graphics Processing Units?

General Purpose GPU Computing in Partial Wave Analysis

Histograms. h(r k ) = n k. p(r k )= n k /NM. Histogram: number of times intensity level rk appears in the image

Computer Graphics and Visualization. What is computer graphics?

Exact Sampling for Hardy- Weinberg Equilibrium

Manifold Learning: Applications in Neuroimaging

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011

Automatic Subcortical Segmentation Using a Contextual Model

Random Forest in Genomic Selection

Acknowledgments. High Performance Cone-Beam CT of Acute Traumatic Brain Injury

GLIRT: Groupwise and Longitudinal Image Registration Toolbox

SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data

9/30/2018. Hierarchical Markov decision processes. Outline. Difficulties when modeling

A Work-Efficient GPU Algorithm for Level Set Segmentation. Mike Roberts Jeff Packer Mario Costa Sousa Joseph Ross Mitchell

17th Annual Meeting of the Organization for Human Brain Mapping (HBM)

DRAGEN Bio-IT Platform Enabling the Global Genomic Infrastructure

Progressive Graph-Based Transductive Learning for Multi-modal Classification of Brain Disorder Disease

Joe Wingbermuehle, (A paper written under the guidance of Prof. Raj Jain)

called Hadoop Distribution file System (HDFS). HDFS is designed to run on clusters of commodity hardware and is capable of handling large files. A fil

Mathematical computations with GPUs

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 26, NO. 10, OCTOBER

better images mean better results

An approach to calculate and visualize intraoperative scattered radiation exposure

Neuron Crawler: An Automatic Tracing Algorithm for Very Large Neuron Images

Introduction to GE Microarray data analysis Practical Course MolBio 2012

MRI Segmentation MIDAS, 2007, 2010

Using eportfolios to Assess PhD Competencies. a bit about me

Towards Interactive Global Illumination Effects via Sequential Monte Carlo Adaptation. Carson Brownlee Peter S. Shirley Steven G.

SURFACE RECONSTRUCTION OF EX-VIVO HUMAN V1 THROUGH IDENTIFICATION OF THE STRIA OF GENNARI USING MRI AT 7T

Rapid k-d tree construction for sparse volume data

A MULTI-GPU COMPUTE SOLUTION FOR OPTIMIZED GENOMIC SELECTION ANALYSIS. A Thesis. presented to. the Faculty of California Polytechnic State University

Methods for data preprocessing

mritc: A Package for MRI Tissue Classification

Detecting Anatomical Landmarks from Limited Medical Imaging Data using Two-Stage Task-Oriented Deep Neural Networks

Bayesian Spatiotemporal Modeling with Hierarchical Spatial Priors for fmri

Medical Volume Segmentation using Bank of Gabor Filters

CircosVCF workshop, TAU, 9/11/2017

Prostate Detection Using Principal Component Analysis

Randomized algorithms have several advantages over deterministic ones. We discuss them here:

Parametric Response Surface Models for Analysis of Multi-Site fmri Data

Genetic Algorithms. PHY 604: Computational Methods in Physics and Astrophysics II

FHIR Genomics Use Case: Reporting HLA Genotyping Consensus-Sequence-Block & Observation-genetics-Sequence Cardinality

Step-by-Step Guide to Advanced Genetic Analysis

Isointense infant brain MRI segmentation with a dilated convolutional neural network Moeskops, P.; Pluim, J.P.W.

Getting Started. April Strand Life Sciences, Inc All rights reserved.

Data mining for neuroimaging data. John Ashburner

fmri Image Preprocessing

FCUDA: Enabling Efficient Compilation of CUDA Kernels onto

STUDYING OPENMP WITH VAMPIR & SCORE-P

COMPARATIVE DEEP LEARNING FOR CONTENT- BASED MEDICAL IMAGE RETRIEVAL

Transcription:

GPU Data Mining in Neuroimaging Genomics Bob Zigon Beckman Coulter Indianapolis, Indiana May 10, 2017 1 / 20

Outline Background ANOVA for Voxels and SNPs VEGAS for Voxels and Genes High Speed GPU Monte-Carlo Simulator 2 / 20

What is Neuroimaging Genomics? 1 Neuroimaging Genomics is the fusion of brain imaging and genotyping data. 2 Study the influence of genetic variation on brain structure and function. 3 / 20

MRI and Sequencing data MRI instrument MRI data Genotyping instrument Genotyping data 4 / 20

Problem Definition Develop an interactive tool for studying Alzheimer s Disease by coupling a 3D brain explorer with a genome explorer. Prior Art Our Goal 120 ROI s 1,000,000 voxels 20,000 SNP s 1,000,000 SNP s 5 / 20

Problem Definition Brain with 120 Regions of Interest Brain with 1,000,000 voxels 6 / 20

How We Do It The UI

How We Do It The UI Brain Explorer

How We Do It The UI SNP Explorer Brain Explorer

How We Do It The UI SNP Explorer Brain Explorer Heat Map 7 / 20

ANOVA ANOVA - Analysis of Variance Understand the relationship between the gray matter density from the MRI and the SNP genotype. Did the combination happen by chance or not? 8 / 20

ANOVA ANOVA - Analysis of Variance Understand the relationship between the gray matter density from the MRI and the SNP genotype. Did the combination happen by chance or not? Computational complexity O(N v N j N s ) N v - number of voxels N j - number of subjects N s - number of SNPS 8 / 20

ANOVA Voxels SNPs Subjects v 11 v 1M v 21 v 2M..... v N1 v NM Subjects s 11 s 1M s 21 s 2M..... s K1 s KM ANOVA Voxels SNPs vs 11 vs 1M vs 21 vs 2M..... vs N1 vs NM 9 / 20

VEGAS VEGAS - VErsatile Gene based Association Study Understand the relationship between the gray matter density from the MRI and the collective effect of multiple SNPs within a gene. Did the combination happen by chance or not? 10 / 20

VEGAS VEGAS - VErsatile Gene based Association Study Understand the relationship between the gray matter density from the MRI and the collective effect of multiple SNPs within a gene. Did the combination happen by chance or not? Computational complexity O( N v N j N s + N v N g N i ) }{{}}{{} ANOVA component Monte-Carlo component N v - number of voxels N g - number of genes N i - number of Monte-Carlo iterations (10 2, 10 3, 10 4, 10 5, or 10 6 ) 10 / 20

VEGAS Voxels Subjects v 11 v 1M v 21 v 2M..... v N1 v NM ANOVA Voxels SNPs vs 11 vs 1M vs 21 vs 2M..... VEGAS Voxels Genes vg 11 vg 1G vg 21 vg 2G..... SNPs Subjects s 11 s 1M s 21 s 2M..... vs N1 vs NM Monte Carlo Simulation vg N1 vg NG s K1 s KM 11 / 20

Video Video of the Interactive Neuroimaging Genomic Browser 12 / 20

Lessons Learned How do you build a high speed Monte-Carlo Simulator for an N dimensional problem on a GPU? 13 / 20

Lessons Learned One Dimensional for i = 1 to K do Choose X from N(0,1) Y = F (X ) Make decision about Y 14 / 20

Lessons Learned One Dimensional for i = 1 to K do Choose X from N(0,1) Y = F (X ) Make decision about Y N-Dimensional for i = 1 to K do Choose n values from N(0,1) giving X n Y n = F (X n ) Make decision about Y n 14 / 20

Lessons Learned First Attempt at N-Dimensional foreach voxel V in parallel do foreach gene G in parallel do for i = 1 to K do Choose n values from N(0,1) giving X n Y n = F (X n ) Make decision about Y n 15 / 20

Lessons Learned First Attempt at N-Dimensional foreach voxel V in parallel do foreach gene G in parallel do for i = 1 to K do Choose n values from N(0,1) giving X n Y n = F (X n ) Make decision about Y n Slow Fast Theoretical Memory Bandwidth (gb/sec) 20 320 GFLOPS 20 10,000 15 / 20

Lessons Learned - Solution N-Dimensional Ah-Ha In parallel, generate n K values from N(0,1) giving X n K Y n K = F n n X n K In parallel, decide about Y n K 16 / 20

Lessons Learned Slow N-Dimensional foreach voxel V in parallel do foreach gene G in parallel do for i = 1 to K do n values from N(0,1) Y n = F (X n ) Make decision about Y n Fast N-Dimensional foreach voxel V sequentially do foreach gene G sequentially do In parallel, generate nk values Y n K = F n n X n K In parallel, decide about Y n K X X 17 / 20

Lessons Learned Slow N-Dimensional foreach voxel V in parallel do foreach gene G in parallel do for i = 1 to K do n values from N(0,1) Y n = F (X n ) Make decision about Y n Fast N-Dimensional foreach voxel V sequentially do foreach gene G sequentially do In parallel, generate nk values Y n K = F n n X n K In parallel, decide about Y n K X X 17 / 20

Lessons Learned Slow N-Dimensional foreach voxel V in parallel do foreach gene G in parallel do for i = 1 to K do n values from N(0,1) Y n = F (X n ) Make decision about Y n Fast N-Dimensional foreach voxel V sequentially do foreach gene G sequentially do In parallel, generate nk values Y n K = F n n X n K In parallel, decide about Y n K X X Slow Fast Theoretical Memory Bandwidth (gb/sec) 20 155 320 GFLOPS 20 2,000 10,000 800X improvement! 17 / 20

Vegas Results Time in seconds 10 4 10 3 10 2 10 1 1-CPU 4-CPU GeForce Titan X 10 0 70x70 80x80 90x90 100x100 V-Voxels by G-Genes Figure 1: Execution times for 1 VEGAS run with K=10,000 Monte-Carlo iterations 18 / 20

Acknowledgements 1 Professor Li Shen, Dept. of Radiology and Imaging Sciences, IU School of Medicine, NIH R01 EB022574, R01 LM011360, U01 AG024904, and IUPUI ITDP Program. 2 Professor Shiaofen Fang, Computer Science Dept. Chair, Indiana University-Purdue University of Indianapolis 3 Professor Mohammad Al Hasan, Associate Professor Computer Science, Indiana University-Purdue University of Indianapolis 4 Huang Li, PhD Candidate, Computer Science, Indiana University-Purdue University of Indianapolis 19 / 20

Thank you robert.zigon@beckman.com 20 / 20

21 / 20