Bias-Variance Tradeos Analysis Using Uniform CR Bound. Mohammad Usman, Alfred O. Hero, Jerey A. Fessler and W. L. Rogers. University of Michigan

Similar documents
M. Usman, A.O. Hero and J.A. Fessler. University of Michigan. parameter =[ 1 ; :::; n ] T given an observation of a vector

M. Usman, A.O. Hero and J.A. Fessler University of Michigan. where,

A Weighted Least Squares PET Image Reconstruction Method Using Iterative Coordinate Descent Algorithms

Optimum Array Processing

Iterative SPECT reconstruction with 3D detector response

MAXIMUM a posteriori (MAP) or penalized ML image

Unmatched Projector/Backprojector Pairs in an Iterative Reconstruction Algorithm

STATISTICAL positron emission tomography (PET) image

Low-Dose Dual-Energy CT for PET Attenuation Correction with Statistical Sinogram Restoration

0.1. alpha(n): learning rate

Projection Space Maximum A Posterior Method for Low Photon Counts PET Image Reconstruction

THE methods of maximum likelihood (ML) and maximum

J. Weston, A. Gammerman, M. Stitson, V. Vapnik, V. Vovk, C. Watkins. Technical Report. February 5, 1998

Estimating 3D Respiratory Motion from Orbiting Views

Splitting-Based Statistical X-Ray CT Image Reconstruction with Blind Gain Correction

4.12 Generalization. In back-propagation learning, as many training examples as possible are typically used.

Algebraic Iterative Methods for Computed Tomography

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

1 INTRODUCTION The LMS adaptive algorithm is the most popular algorithm for adaptive ltering because of its simplicity and robustness. However, its ma

Contents. I Basics 1. Copyright by SIAM. Unauthorized reproduction of this article is prohibited.

Tomographic Algorithm for Industrial Plasmas

Consistency in Tomographic Reconstruction by Iterative Methods

Single and Multipinhole Collimator Design Evaluation Method for Small Animal SPECT

PET Image Reconstruction using Anatomical Information through Mutual Information Based Priors

Calibrating a Structured Light System Dr Alan M. McIvor Robert J. Valkenburg Machine Vision Team, Industrial Research Limited P.O. Box 2225, Auckland

SPECT QA and QC. Bruce McBride St. Vincent s Hospital Sydney.

Reconstruction from Projections

PET image reconstruction algorithms based on maximum

Emission Computed Tomography Notes

STATISTICAL image reconstruction methods for X-ray

FAST KVP-SWITCHING DUAL ENERGY CT FOR PET ATTENUATION CORRECTION

BMVC 1996 doi: /c.10.41

Constructing System Matrices for SPECT Simulations and Reconstructions

Supervised vs. Unsupervised Learning

Issues with Curve Detection Grouping (e.g., the Canny hysteresis thresholding procedure) Model tting They can be performed sequentially or simultaneou

Unconstrained Optimization

TENTH WORLD CONGRESS ON THE THEORY OF MACHINES AND MECHANISMS Oulu, Finland, June 20{24, 1999 THE EFFECT OF DATA-SET CARDINALITY ON THE DESIGN AND STR

AN ALGORITHM FOR BLIND RESTORATION OF BLURRED AND NOISY IMAGES

CoE4TN4 Image Processing. Chapter 5 Image Restoration and Reconstruction

Column-Action Methods in Image Reconstruction

IN THIS PAPER we consider the solution of ill-posed

(RMSE). Reconstructions showed that modeling the incremental blur improved the resolution of the attenuation map and quantitative accuracy.

Two Dimensional Microwave Imaging Using a Divide and Unite Algorithm

Introduction to Digital Image Processing

THE general task in emission computed tomography (CT)

The performance of xed block size fractal coding schemes for this model were investigated by calculating the distortion for each member of an ensemble

PARALLEL COMPUTATION OF THE SINGULAR VALUE DECOMPOSITION ON TREE ARCHITECTURES

SPECT reconstruction

Classifier C-Net. 2D Projected Images of 3D Objects. 2D Projected Images of 3D Objects. Model I. Model II

Implementation and evaluation of a fully 3D OS-MLEM reconstruction algorithm accounting for the PSF of the PET imaging system

Axial block coordinate descent (ABCD) algorithm for X-ray CT image reconstruction

Simplified statistical image reconstruction algorithm for polyenergetic X-ray CT. y i Poisson I i (E)e R } where, b i

The maximum reachable astrometric precision. The Cramér-Rao Limit

Iterative Methods for Solving Linear Problems

Some elements for Matlab programming

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION

Limited view X-ray CT for dimensional analysis

Supporting Information for Azimuthal Polarization Filtering for Accurate, Precise, and Robust Single-Molecule Localization Microscopy

Modeling and Incorporation of System Response Functions in 3D Whole Body PET

I How does the formulation (5) serve the purpose of the composite parameterization

Adaptive algebraic reconstruction technique

Sparse Reconstruction / Compressive Sensing

A Study of Medical Image Analysis System

Index. aliasing artifacts and noise in CT images, 200 measurement of projection data, nondiffracting

Robust Signal-Structure Reconstruction

Workshop on Quantitative SPECT and PET Brain Studies January, 2013 PUCRS, Porto Alegre, Brasil Corrections in SPECT and PET

NON-COLLIMATED SCATTERED RADIATION TOMOGRAPHY

Computer Vision I. Announcements. Fourier Tansform. Efficient Implementation. Edge and Corner Detection. CSE252A Lecture 13.

MAPI Computer Vision. Multiple View Geometry

Resolution and Noise Properties of MAP Reconstruction for Fully 3-D PET

Off-the-Grid Compressive Imaging: Recovery of Piecewise Constant Images from Few Fourier Samples

Improved smoothing spline regression by combining estimates of dierent smoothness

An efficient algorithm for sparse PCA

Neural Networks. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

Fast and accurate automated cell boundary determination for fluorescence microscopy

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

Engineered Diffusers Intensity vs Irradiance

Diffuse Optical Tomography, Inverse Problems, and Optimization. Mary Katherine Huffman. Undergraduate Research Fall 2011 Spring 2012

Transformation of curve. a. reflect the portion of the curve that is below the x-axis about the x-axis

Algebraic Iterative Methods for Computed Tomography

Efficient Minimization of New Quadric Metric for Simplifying Meshes with Appearance Attributes

WE consider the gate-sizing problem, that is, the problem


EMISSION tomography, which includes positron emission

1 Algorithms for Control and Image Reconstruction

Programming, numerics and optimization

Advanced Techniques for Mobile Robotics Graph-based SLAM using Least Squares. Wolfram Burgard, Cyrill Stachniss, Kai Arras, Maren Bennewitz

Contents. 1 Introduction Background Organization Features... 7

WEINER FILTER AND SUB-BLOCK DECOMPOSITION BASED IMAGE RESTORATION FOR MEDICAL APPLICATIONS

Experimental Data and Training

Superiorized polyenergetic reconstruction algorithm for reduction of metal artifacts in CT images


ATTENUATION correction is required for quantitatively

Fast Algorithm for Matrix-Vector Multiply of Asymmetric Multilevel Block-Toeplitz Matrices

would be included in is small: to be exact. Thus with probability1, the same partition n+1 n+1 would be produced regardless of whether p is in the inp

Orthogonal Matching Pursuit: Recursive Function Approximation with Applications to Wavelet. Y. C. Pati R. Rezaiifar and P. S.

Assignment 2. Classification and Regression using Linear Networks, Multilayer Perceptron Networks, and Radial Basis Functions

Multi-slice CT Image Reconstruction Jiang Hsieh, Ph.D.

Parameterization. Michael S. Floater. November 10, 2011

Online Calibration of Path Loss Exponent in Wireless Sensor Networks

Transcription:

Bias-Variance Tradeos Analysis Using Uniform CR Bound Mohammad Usman, Alfred O. Hero, Jerey A. Fessler and W. L. Rogers University of Michigan ABSTRACT We quantify fundamental bias-variance tradeos for the image reconstruction problem in radio-pharmaceutical tomography using Cramer-Rao (CR) bound analysis. The image reconstruction problem is very often biased and the classical or the unbiased CR bound on the mean square error performance of the estimator can not be used. We use a recently developed `uniform' CR bound which applies to biased estimators whose bias gradient satises a user specied length constraint. We demonstrate the use of the `uniform' CR bound for a simple SPECT system using several dierent examples. I. INTRODUCTION As in any parametric estimation problem, in image reconstruction there always exists a tradeo between bias and variance of the reconstructed image. For example in penalized maximum likelihood reconstruction a smoothness penalty reduces the variance at the expense of introducing bias. This paper provides an estimator independent fundamental tradeo inherent in emission tomography. In [, ] classical CR bound was applied to unbiased image reconstruction in emission tomography. Very often in tomographic imaging problems an unbiased image reconstruction with a smoothness constraint may results in large variance. For biased image reconstruction classical biased CR bound can be used []. However the biased CR bound is not very useful in that it is only applicable to the estimators with a given bias gradient r b(). Therefore it is unable to give a meaningful comparison of dierent biased estimators that may have acceptable bias but different bias gradients. Hero [] gives a `uniform' CR bound that is applicable to all the estimators whose bias gradient length is less than a prespecied threshold, permitting a meaningful comparison for dierent biased estimators to be performed. For images, the calculation of the CR bound involves inversion of a huge n n Fisher Information Matrix (FIM), where n is the total number of image pixels. Direct inversion algorithms require O(n ) ops and can become computationally intractable. The system matrix in im- This work was supported in part by National Science Foundation under grant BCS- and a Government of Pakistan Postgraduate Fellowship. age reconstruction problem is in general sparse. Here, we give an inversion algorithm based on conjugate gradient approach that exploits the sparsity of the system matrix and can compute only a few columns of the inverse of FIM corresponding to pixels of interest. We apply the methods of [] to a simple SPECT system for several imaging examples. The examples include determining the optimal collimator opening as a function of the tradeo between resolution and sensitivity; deriving biasvariance tradeos curves for dierent angular sampling; and performance comparison of dierent unbiased and biased image reconstruction algorithms using the uniform CR bound. A. System Description The system used in this study is shown in Figure, called the SPRINT II system []. SPRINT II was designed specically for brain imaging and consists of a ring of detectors and a ring of collimators. During the experiment the ring of collimators are rotated through small steps for adequate spatial sampling. The system parameters are Source Collimator Ring Ring Detector Collimator Opening Figure : The SPRINT II system: Not drawn to scale. given in the appendix and unless otherwise specied are those used in the simulations. II. CR BOUND Given the projection data, an n-pixel image and a probability density f Y (y; ), the objective is to estimate the intensity in a small Region Of Interest (ROI) ROI = [ ; ; :::; q ] T. In the sequel we specialize to the case when the ROI consists of a single pixel. A function ^ (y) is a parameter estimator based on the observation Y = y. ^ is unbiased if the bias b () = E[ ^? ] is equal to zero.

The Cramer-Rao lower bound on the variance of unbiased parameter estimator ^ is given by the upper-left q q block of the inverse of an n n, symmetric, positive semidenite Fisher information matrix (FIM) F Y : where, F Y () =?E [r T var(^ ) e T F? Y () e ; () ln f Y (Y ; ) r ln f Y (Y ; )]; and r denotes the (row) gradient vector [ ; ; :::; n ] and e = [; ; :::; ] T. While the unbiased CR bound () is known to be asymptotically achievable for large imaging times, in practice, most of the popular image reconstruction algorithms are biased and the unbiased CR bound is inapplicable. A. Motivating Example In Figure the unbiased CR bound, for a high intensity pixel at the top of the image in Figure, is plotted as a function of the collimator rotations. Also plotted is the variance of the weighted least square algorithm for the same pixel. The variance of the weighted least square algorithm (section C.) is clearly less than the CR lower bound at all points. This is due to the fact that the weighted least square estimator is biased. The unbiased CR bound can not be used for performance analysis in this case. SQRT(variance)... Performance of weighted least square algorithm Unbiased CR Bound Weighted Least Square Algorithm Number of collimator rotations Figure : The regularized weighted least square estimator shows lower variance than unbiased CR bound at all the points. III. BIASED CR BOUND For a biased estimator ^ the following form of the biased CR bound is well known []: var( ^ ) [r m ()] F? Y () [r m ()] T ; () where r m () = r b () + e is an n element row vector of the gradient of the mean E ( ^ ) = m (). A. Uniform CR Bound The application of the biased CR bound () is very restricted due to the fact that it is only applicable to estimators with a given bias gradient r b (). In [] Hero gives a `uniform' CR bound on the variance of a single parameter for non-singular F Y. This bound is applicable to all biased estimators whose length kr b ()k of the bias gradient r b () is less than a small prespecied threshold: krb ()k < : Then we have the following theorem []. (-a) Theorem Let be an estimator with bias b () whose n-element bias gradient vector r b = d T satises (-a). Then the variance of satises: where B(; ) is equal to: var( ^ ) B(; ); (-b) B(; ) = [e + d min ] T F? Y [e + d min ]: (-c) Here e = [; ; :::; ] T is an n-element unit vector and: d min =?[I + F Y ]? e ; (-d) where is given by the unique non-negative solution of the following equation involving the monotone decreasing, convex function g() [; ]: g() = d T min d min = : In Theorem, d min dened in (-d) is an optimal bias gradient in the sense that it minimizes the biased CR bound () over all vectors r b (). Normalized uniform CR bound: SQRT(variance)......... UNACHIEVABLE REGION Uniform CR bound ACHIEVABLE REGION......... Figure : The Uniform CR bound Figure shows a typical bias-variance tradeo curve. The curve is drawn for the same pixel as in Figure. The region above and including the curve is the so called `achievable' region where all the realizable estimators exist. Note that if an estimator lies on the curve then lower variance can only be bought at the price of increased bias and vice versa. At = the variance goes to zero. This corresponds to the trivial case ^ =Constant.

B. Inversion Algorithm Equation (-d) in the calculation of uniform CR bound requires computation of the rst column of [I + F Y ]? for each. Often we want to compute the bound at several dierent values of and therefore it is important to have a computationally ecient algorithm to compute (-d). Conjugate gradient is a recursive method to solve the linear system of equation C x = b. If we substitute b = [; ; ; :::; ] T then it is easy to recognize that the solution to such a system of equations will be the rst column of C?. In general we can get the inverse of the m-th column of C if we choose b m = and b j = : j = m. The following algorithm is applicable to only symmetric, positive denite matrices C []. Algorithm INITIALIZATIONS: u = ; b = [; ; ; :::; ] T ; r = b FOR i := UNTIL C u - b < tolerance DO IF i = DO END DO ELSE END IF p i = r i i = < r i ; r i > < r i? ; r i? > p i = r i + i p i? i = < r i ; r i > < p i ; A p i > u i+ = u i + i p i r i+ = b? C u i+ The iterations are terminated when the error j C u? b j is less than a user specied tolerance. The convergence rate of the conjugate gradient depends on the condition number of C and its eigenvalue distribution. One advantage in using the conjugate gradient algorithm is that the algorithm is guaranteed to converge in maximum n iterations. Each iteration of the algorithm involves a matrix-vector multiplication A p i, requiring O(n ) ops and therefore the algorithm can take O(n ) ops to converge. For SPECT the system matrix in general is sparse (typically less than %) and, if the sparsity of A is taken into account, then the total number of ops can be reduced signicantly. In relation to the recursive algorithm for the unbiased CR bound given in [], a disadvantage of this algorithm is its non-monotonic convergence. This means that if stopped at iteration k < n then the quantity calculated by the algorithm might not be a valid lower bound on var( ^ ). This, however, may not be a problem if we let the algorithm run until a desired accuracy is achieved. IV. APPLICATIONS The object used in all the following simulations has two point sources in a uniform intensity background (gure ). In all cases the pixel of interest was the high intensity pixel at the top of the image. Noise due to scatter was neglected, however the eect of attenuation was considered. It can be shown that for emission computed tomography, the Fisher information matrix has the form F Y = A T [diag()]? A where A is a d n weight matrix; d = total number of detectors, and = A. Therefore F Y is an n n symmetric positive semi-denite matrix (F Y ). Here we will only consider non-singular F Y (> ). The system used in this Figure : The phantom used in the simulations. The pixel of interest is the pixel at the top of the phantom. The image dimensions are x pixels. study is the SPRINT II system as described before. A. Spatial Sampling Study Normalized uniform bound: SQRT(variance).... Figure : The eect of angular sampling on the unbiased CR bound. Curves denote lower bound for,,, and rotation increments of the collimator.

In Figure the bias-variance curves are displayed for varying degrees of spatial sampling. Spatial sampling was varied by rotating the collimator ring through dierent number of steps. The time for each step was kept constant. Note the monotonic nature of the curves: they do not intersect each other due to the fact that the projections from the lower sampled image is a subset of those of the higher sampled image. Also it can be seen that beyond a sampling rate of increasing sampling rate does not reduce the bound signicantly. An interesting point to note is that the unbiased CR bound (zero bias gradient) for rotations is greater than the bound at a bias gradient length of : for only rotations. This means that a biased algorithm with lower sampling can perform better than an unbiased algorithm with higher sampling. For a time normalized case, when the total time for each curve is kept constant, the curves may intersect each other (Figure ). They, however, display the same trend as above. Normalized uniform bound: SQRT(variance).... Figure : The eect of angular sampling on the unbiased CR bound. Time normalized case. B. Optimal System Design Spatial resolution and sensitivity are the two most important criteria for tomographic system design. These criteria are coupled, higher resolution can only be bought at the price of lower sensitivity. We demonstrate the use of the uniform CR bound (-b) by determining an aperture opening that optimizes the tradeo between resolution and sensitivity. For these simulations collimator rotations were used and the aperture opening was varied from very narrow (ray width =. pixels) to very wide (ray width = pixels). The total imaging time was adjusted so that the total number of detected counts are the same for all cases hence smaller exposure time for wider openings. Figure shows a sharp minimum over all when the width of the ray is approximately one pixel. Therefore for the object and the ROI studied, this one pixel aperture width is `universally' optimal for estimation, irrespective of the bias of Normalized Uniform CR bound: variance.... Width of the ray: pixels.... delta Figure : Optimal system design using the CR bound to study the eect of resolution and sensitivity tradeos the estimator. C. Performance Comparison Here we will compare two least square reconstruction algorithms by placing them on the bias-variance plane and comparing them to the uniform CR lower bound. In the following simulations we used the standard SPRINT II system with collimator rotations. The least square estimator can be motivated by the following representation of y: y = A + n Where n is the Poisson distributed noise and in SPECT it is generated by photon scatter. C. Unweighted Linear Least Square Estimator The unweighted Linear Least Square Estimator () is given by: ^ = (A T A + I)? A T y () where is a smoothing or regularization parameter to counteract any ill-conditioning of A. It can be easily shown that the bias vector b() = E(^? ) is : b() = I? (A T A + I)? A T A Therefore the bias gradient is the n n matrix: r b() = I? (A T A + I)? A T A The covariance of the ^ is: cov(^ ) = (A T A + I)? A T A (A T A + I)? () Note that the bias gradient and bias are very simply related. In particular the bias gradient is equal to the bias for

Normalized uniform CR bound:sqrt(var) Performance of and WLSE Uniform CR bound and: * -Ideal WLSE......... V. CONCLUSIONS We have demonstrated, with the help of several examples, that the commonly used unbiased CR bound is not sucient by itself for performance analysis and the uniform CR bound should be used to get a true overall picture. The image reconstruction problem is very often ill-posed due to the insucient radial sampling resulting in a singular FIM. Many of the results that are presented in this paper have been extended to the case of singular FIM but the space limitation does not permit us to include them here. For details please see []. A System Specications Ring Detector Figure : Linear Least Square Estimator a point source intensity = [; ; :::; ] T. The bias gradient r b of ^ is simply the rst row of r b. The length of the bias gradient r b is a function of the smoothing parameter, increasing from as increases from. In Figure the uniform CR bound (-b) is plotted as a function of the length of r b. Also plotted is the variance of ^, obtained from the (; ) element of the covariance matrix (). Note that as the smoothing parameter gets large the variance of approaches the CR bound but for small there is a large dierence. C. Weighted Least Squares Estimator The weighted least squares estimator (WLSE) is: ^ W LSE = (A T A + I)? A T y where is the regularization parameter is an nn weight matrix. Note that when = I, ^ is simply the unweighted least square estimator (). It is easy to show that the bias gradient is: r b() = I? (A T A + I)? A T A and the covariance is: cov(^ W LSE ) = (A T A + I)? A T A (A T A + I)? For = diag( i ), where = A, the variance of ^ W LSE is plotted in Figure. Observe that for this choice of weights the WLSE estimator achieves the bound exactly for all values of as shown in []. The WLSE implemented with = diag( i ) is not physically realizable since i depends on the unknown image. However, by replacing the mean by the sample mean ^, an estimated WLSE: ^ W LSE = (A T ^ A + I)? A T ^ y; ^ = diag( ) has ^i been found to have similar variance and bias for large number of detected counts []. Radius of the detector ring cms Number of detectors Radius of the collimator ring cms Number of collimator slits (uniformly spaced) Slit Width. mm REFERENCES [] W.L. Rogers, N.H. Clinthorne, L.Shao, P.Chiao, J.Stamos, and K.F.Koral SPRINT II, A Second Generation Single Photon Ring Tomograph. IEEE Transactions on Medical Imaging, vol.:, pp. -,. [] Alfred O. Hero A Cramer-Rao Type Lower Bound for Essentially Unbiased parameter Estimation, Technical Report, Lincoln Laboratory, MIT. [] Harry L. van Trees Detection, Estimation and Modulation Theory, (Part I), John Wiley and Sons.. [] W. H. Press, S. A. Teukolsky, W. T. Vetterling and B.P. Flannery Numerical Recipes in FORTRAN, The Art of Scientic Computing, (Second Edition), Cambridge University Press,. [] M. Usman, A. O. Hero and J. A. Fessler Uniform CR Bound for Ill-conditioned Systems In preparation [] A. O. Hero, J. A. Fessler and W. L. Rogers A Fast Recursive Algorithm for Computing CR-Type Bounds for Image Reconstruction Problems, IEEE Symposium on Nuclear Science and Medical Imaging Conference, October, Orlando, FA. [] J. A. Fessler and A. O. Hero Cramer-Rao Bounds for Biased Estimators in Image Reconstruction, MWSCS, August, Detriot, MI. [] J. A. Fessler Penalized Weighted Least-Squares Image Reconstruction for Positron Emission Tomography, Accepted by IEEE Transactions on Medical Imaging,. [] M. Usman, A. O. Hero and W. L. Rogers Performance Gain Analysis for Adding Vertex View to a Standard SPECT, MWSCS, August, Detroit, MI.