Reconstructing visual experiences from brain activity evoked by natural movies

Similar documents
Pixels to Voxels: Modeling Visual Representation in the Human Brain

Nonparametric sparse hierarchical models describe V1 fmri responses to natural images

arxiv: v1 [q-bio.nc] 24 Nov 2014

A spatial constrained multi target regression model for human brain activity prediction

Bayesian Reconstruction of Natural Images from Human Brain Activity

arxiv: v1 [q-bio.qm] 5 Mar 2015

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

Attention modulates spatial priority maps in human occipital, parietal, and frontal cortex

Topographic Mapping with fmri

Supplementary Figure 1. Decoding results broken down for different ROIs

CS/NEUR125 Brains, Minds, and Machines. Due: Wednesday, April 5

Bayesian Inference in fmri Will Penny

Temporal and Cross-Subject Probabilistic Models for fmri Prediction Tasks

INDEPENDENT COMPONENT ANALYSIS APPLIED TO fmri DATA: A GENERATIVE MODEL FOR VALIDATING RESULTS

CS 229 Project report: Extracting vital signs from video

Seeking Interpretable Models for High Dimensional Data

Cognitive States Detection in fmri Data Analysis using incremental PCA

A Semantic Shared Response Model

Multi-voxel pattern analysis: Decoding Mental States from fmri Activity Patterns

PIXELS TO VOXELS: MODELING VISUAL REPRESENTATION IN THE HUMAN BRAIN

Stability. Bin Yu Statistics and EECS, University of California-Berkeley. SAMSI Opening Workshop on Massive Data, Sept, 2012

Modeling Visual Cortex V4 in Naturalistic Conditions with Invari. Representations

A SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS

Graphical Models, Bayesian Method, Sampling, and Variational Inference

Coordinate Transformations in Parietal Cortex. Computational Models of Neural Systems Lecture 7.1. David S. Touretzky November, 2013

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

CS 229 Final Project Report Learning to Decode Cognitive States of Rat using Functional Magnetic Resonance Imaging Time Series

Independent Component Analysis of fmri Data

HST.583 Functional Magnetic Resonance Imaging: Data Acquisition and Analysis Fall 2006

Multivariate Pattern Classification. Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems

Scalable Video Coding

Introduction to Neuroimaging Janaina Mourao-Miranda

Bayesian Spatiotemporal Modeling with Hierarchical Spatial Priors for fmri

How Many Humans Does it Take to Judge Video Quality?

Representational similarity analysis. Dr Ian Charest,

FMRI data: Independent Component Analysis (GIFT) & Connectivity Analysis (FNC)

Naïve Bayes, Gaussian Distributions, Practical Applications

Supplementary Methods

Applied Statistics and Machine Learning

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

arxiv: v1 [q-bio.nc] 9 Jun 2016

Optimizing deep video representation to match brain activity

Multivariate pattern classification

Supplementary Data. in residuals voxel time-series exhibiting high variance, for example, large sinuses.

Supplementary Figure 1

Cross-orientation suppression in human visual cortex

Simultaneous Estimation of Response Fields and Impulse Response Functions PhD Thesis Proposal

Face Recognition by Combining Kernel Associative Memory and Gabor Transforms

CP Generalize Concepts in Abstract Multi-dimensional Image Model Component Semantics. David Clunie.

No-reference perceptual quality metric for H.264/AVC encoded video. Maria Paula Queluz

Bayes Classifiers and Generative Methods

arxiv: v1 [stat.ap] 14 Apr 2011

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

Voxel selection algorithms for fmri

Analysis and Synthesis of Texture

Texture. Outline. Image representations: spatial and frequency Fourier transform Frequency filtering Oriented pyramids Texture representation

Estimating image bases for visual image reconstruction from human brain activity

Artifacts and Textured Region Detection

MultiVariate Bayesian (MVB) decoding of brain images

The organization of the human cerebral cortex estimated by intrinsic functional connectivity

Face Recognition & Detection Using Image Processing

HST.583 Functional Magnetic Resonance Imaging: Data Acquisition and Analysis Fall 2008

Pattern Recognition for Neuroimaging Data

CSE / 60537: Biometrics

Multivariate analyses & decoding

K-Means Clustering. Sargur Srihari

A Reduced-Dimension fmri! Shared Response Model

Bayesian Segmentation and Motion Estimation in Video Sequences using a Markov-Potts Model

Pyramid Coding and Subband Coding

A fast and efficient method for compressing fmri data sets

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

Tutorial BOLD Module

Aiyar, Mani Laxman. Keywords: MPEG4, H.264, HEVC, HDTV, DVB, FIR.

CMPT 365 Multimedia Systems. Media Compression - Video

Temporal Modeling and Missing Data Estimation for MODIS Vegetation data

Directional Tuning in Single Unit Activity from the Monkey Motor Cortex

Super-Resolution from Image Sequences A Review

Computer Vision 2. SS 18 Dr. Benjamin Guthier Professur für Bildverarbeitung. Computer Vision 2 Dr. Benjamin Guthier

Statistical Analysis of MRI Data

Elliptical Head Tracker using Intensity Gradients and Texture Histograms

THE SHUFFLE ESTIMATOR FOR EXPLAINABLE VARIANCE IN FMRI EXPERIMENTS. Department of Statistics, UC Berkeley

Pyramid Coding and Subband Coding

Locating ego-centers in depth for hippocampal place cells

Statistical Analysis of Neuroimaging Data. Phebe Kemmer BIOS 516 Sept 24, 2015

Extensions of One-Dimensional Gray-level Nonlinear Image Processing Filters to Three-Dimensional Color Space

MULTIVARIATE ANALYSES WITH fmri DATA

CS229 Project: Classification of Motor Tasks Based on Functional Neuroimaging

Time-Frequency Method Based Activation Detection in Functional MRI Time-Series

CSC 578 Neural Networks and Deep Learning

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Temporal Feature Selection for fmri Analysis

Feature Selection for fmri Classification

Filter Select ion Model for Generating Visual Mot ion Signals

Deep Learning. Deep Learning. Practical Application Automatically Adding Sounds To Silent Movies

Spatiotemporal Gabor filters: a new method for dynamic texture recognition

Saliency Extraction for Gaze-Contingent Displays

Efficient Visual Coding: From Retina To V2

Predicting Eyes Fixations in Movie Videos: Visual Saliency Experiments on a New Eye-Tracking Database

Short Survey on Static Hand Gesture Recognition

Transcription:

Reconstructing visual experiences from brain activity evoked by natural movies Shinji Nishimoto, An T. Vu, Thomas Naselaris, Yuval Benjamini, Bin Yu, and Jack L. Gallant, Current Biology, 2011 -Yi Gao, PhD candidate in Neuroscience 11/19/2018

Brain activity Functional magnetic resonance imaging(fmri) measures brain activity by detecting changes associated with blood flow Unit: voxel The activity of neurons in your brain constantly changes as you are involved in different activities https://vimeo.com/143768892 by Dr. Mark Lescroart

Part 1: Encoding Part 2: Decoding f(x)? ffffffff f(x)

Part 1: Encoding Recorded BOLD signals from three human subjects while they viewed a series of color natural movies (20 * 20 at 15 Hz) Two separate data sets were obtained from each subjects Training data: BOLD signals evoked by 7,200 s of color natural movies Data fit to a separate encoding model for each voxel located in posterior and ventral occipitotemporal visual cortex Test data: BOLD signals evoked by 540 s of color natural movies Data used to assess accuracy of encoding model and as targets for movie reconstruction

Motion-energy encoding model A two-stage process: Motion-energy filters Temporal hemodynamic response filters

Step 1: Motion-energy filters Spatially down-sample all movie frames to 96x96 pixels Convert RGB to CIE color space: luminance info maintained, color info discarded Luminance patterns pass through a bank of three-dimensional spatiotemporal Gabor wavelet filters

Gabor Wavelet filters Each filter is constructed by multiplying a three-dimensional spatiotemporal (2 dimensions for space, 1 dimension for time) sinusoid by a three-dimensional spatiotemporal Gaussian envelope Contains 6,555 separate three-dimensional Gabor filters Filters occur at six spatial frequencies (0, 2, 4, 8, 16, and 32 cycles/image), three temporal frequency filters (0, 2, and 4 Hz), and eight directions (0, 45,,135 degrees) NIshimoto & Gallant, 2011

Step 1: Motion-energy filters Each filter occurs at two quadratic phases (0 and 90 degrees) to facilitate motionenergy computation Output of each quadrature pair of filters is squared and summed to yield local motion-energy measurements(adelson & Bergen, 1985)

Step 1: Motion-energy filters Compression: log transformation

Step 1: Motion-energy filters Temporally down-sampled from 15 Hz to the sampling rate used to measure BOLD signals (1 Hz) Z-score transformation (mean = 0, standard deviation = 1)

Step 2: Hemodynamic response filters

Step 2: Hemodynamic response filters To capture temporal delays of BOLD signals in the model, vector s is constructed by concatenating motion-energy filtered stimulus vectors at various temporal delays Temporal window of hemodynamic response filters was 3-6 s before BOLD signal(4 time samples) L1-regularized least squares regression procedure was used to obtain the linear weights

Part 1: Encoding

Validity of motion-energy encoding model Comparison of retinotopic angle maps estimated using (top) the motion-energy encoding model and (bottom) conventional multi-focal mapping on a flattened cortical map

Movie identification accuracy Can the model correctly associate an observed BOLD signal pattern with the specific stimulus that evoked it? Motion-energy encoding model identified the specific movie stimulus that evoked an observed BOLD signal 95% of the time Identification accuracy was >75% for all three subjects

Motion-energy encoding model Valid Temporal information Spatial information Might provide good reconstructions of natural movies from brain activity measurements

Part 2: Decoding

A Bayesian approach to reconstruct movies from evoked BOLD signals Use BOLD signals measured from a set of voxels to recreate a picture of an unknown stimulus Observed signals Unknown movie?

A Bayesian approach to reconstruct movies from evoked BOLD signals Prior: Sampled natural movie (~18 million 1-s movie clips from YouTube) Posterior: 1. Each clip in the sampled prior is processed using the motion-energy encoding models fit to each voxel 2. Predicted signals are compared to measured signals evoked by an unknown stimulus Posterior rank: Likelihood of the observed response given the clip

Maximum a posteriori (MAP) and Averaged high posterior (AHP) reconstructions

Quantifying reconstruction quality Reconstruction accuracy was significantly higher than chance (p <.0001)

Summary Developed an encoding model that predicts BOLD signals in early visual areas with high accuracy. By using this model in a Bayesian framework, the authors were the first to reconstuct natural movies from human brain activity.