Predictive Interpolation for Registration

Similar documents
Introduction to Image Super-resolution. Presenter: Kevin Su

Coding of Coefficients of two-dimensional non-separable Adaptive Wiener Interpolation Filter

Performance study on point target detection using super-resolution reconstruction

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

MOTION COMPENSATION IN BLOCK DCT CODING BASED ON PERSPECTIVE WARPING

Overview: motion-compensated coding

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

New feature preserving noise removal algorithm based on thediscrete cosine transform and the a prior knowledge of pixel type.

Local Image Registration: An Adaptive Filtering Framework

The Elimination of Correlation Errors in PIV Processing

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines

Announcements. Image Matching! Source & Destination Images. Image Transformation 2/ 3/ 16. Compare a big image to a small image

Pixels. Orientation π. θ π/2 φ. x (i) A (i, j) height. (x, y) y(j)

Novel Iterative Back Projection Approach

Overview: motion estimation. Differential motion estimation

Compression of Light Field Images using Projective 2-D Warping method and Block matching

Fingerprint Mosaicking by Rolling with Sliding

Super-Resolution on Moving Objects and Background

Single Pass Connected Components Analysis

Hardware Implementation of DCT Based Image Compression on Hexagonal Sampled Grid Using ARM

Image Mosaicing with Motion Segmentation from Video

Terrain correction. Backward geocoding. Terrain correction and ortho-rectification. Why geometric terrain correction? Rüdiger Gens

University of Technology Building & Construction Department / Remote Sensing & GIS lecture

Keywords: vectorization, satellite images, interpolation, Spline, zooming

Model Assessment and Selection. Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer

Scaling and Power Spectra of Natural Images

Motion estimation for video compression

Obtaining Feature Correspondences

CMPT 365 Multimedia Systems. Media Compression - Video

Reconstruction PSNR [db]

DIFFERENTIAL IMAGE COMPRESSION BASED ON ADAPTIVE PREDICTION

Image Warping: A Review. Prof. George Wolberg Dept. of Computer Science City College of New York

Motion Estimation. There are three main types (or applications) of motion estimation:

A Real-time Algorithm for Atmospheric Turbulence Correction

INTELLIGENT AND OPTIMAL NORMALIZED CORRELATION FOR HIGH- SPEED PATTERN MATCHING. Abstract

Locally Weighted Least Squares Regression for Image Denoising, Reconstruction and Up-sampling

Aiyar, Mani Laxman. Keywords: MPEG4, H.264, HEVC, HDTV, DVB, FIR.

Error Equalisation for Sparse Image Mosaic Construction

Fast image warping using adaptive partial matching

CoE4TN3 Medical Image Processing

A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation

Super-resolution Image Reconstuction Performance

Color Characterization and Calibration of an External Display

ADS40 Calibration & Verification Process. Udo Tempelmann*, Ludger Hinsken**, Utz Recke*

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK

Minimizing Noise and Bias in 3D DIC. Correlated Solutions, Inc.

Research Article Block-Matching Translational and Rotational Motion Compensated Prediction Using Interpolated Reference Frame

NEW CONCEPT FOR JOINT DISPARITY ESTIMATION AND SEGMENTATION FOR REAL-TIME VIDEO PROCESSING

IMAGE COMPRESSION USING HYBRID QUANTIZATION METHOD IN JPEG

Image Compression Algorithm and JPEG Standard

1. Estimation equations for strip transect sampling, using notation consistent with that used to

Efficient Block Matching Algorithm for Motion Estimation

Region Weighted Satellite Super-resolution Technology

AREA BASED STEREO IMAGE MATCHING TECHNIQUE USING HAUSDORFF DISTANCE AND TEXTURE ANALYSIS

IMAGE RECONSTRUCTION WITH SUPER RESOLUTION

Determination of Geometric Distortion in STIS Images

BLOCK MATCHING-BASED MOTION COMPENSATION WITH ARBITRARY ACCURACY USING ADAPTIVE INTERPOLATION FILTERS

Detection of small-size synthetic references in scenes acquired with a pixelated device

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

A FAST METHOD FOR SCALING COLOR IMAGES

Effects of Weight Approximation Methods on Performance of Digital Beamforming Using Least Mean Squares Algorithm

INVESTIGATIONS OF CROSS-CORRELATION AND EUCLIDEAN DISTANCE TARGET MATCHING TECHNIQUES IN THE MPEF ENVIRONMENT. Greg And Ken Holmlund # ABSTRACT

Noise filtering for television receivers with reduced memory

Multicomponent f-x seismic random noise attenuation via vector autoregressive operators

1 More configuration model

Chain Coding Streamed Images through Crack Run-Length Encoding

Hierarchical Matching Techiques for Automatic Image Mosaicing

A Robust Wipe Detection Algorithm

An Adaptive Cross Search Algorithm for Block Matching Motion Estimation

Enhancing DubaiSat-1 Satellite Imagery Using a Single Image Super-Resolution

Interactive comment on Quantification and mitigation of the impact of scene inhomogeneity on Sentinel-4 UVN UV-VIS retrievals by S. Noël et al.

Rate Distortion Optimization in Video Compression

Interactive comment on Quantification and mitigation of the impact of scene inhomogeneity on Sentinel-4 UVN UV-VIS retrievals by S. Noël et al.

Pictures at an Exhibition

Fluid flow modelling with seismic cluster analysis

ELEC Dr Reji Mathew Electrical Engineering UNSW

LECTURE VIII: BASIC VIDEO COMPRESSION TECHNIQUE DR. OUIEM BCHIR

CONTENT ADAPTIVE SCREEN IMAGE SCALING

Yui-Lam CHAN and Wan-Chi SIU

The optimum design of a moving PM-type linear motor for resonance operating refrigerant compressor

Interpolation is a basic tool used extensively in tasks such as zooming, shrinking, rotating, and geometric corrections.

REMOVAL OF THE EFFECT OF COMPTON SCATTERING IN 3-D WHOLE BODY POSITRON EMISSION TOMOGRAPHY BY MONTE CARLO

Chapter 18. Geometric Operations

EECS 556 Image Processing W 09

ALMA Memo 386 ALMA+ACA Simulation Tool J. Pety, F. Gueth, S. Guilloteau IRAM, Institut de Radio Astronomie Millimétrique 300 rue de la Piscine, F-3840

A Background Subtraction Based Video Object Detecting and Tracking Method

Low Bitrate Video Communications

Fast Optical Flow Using Cross Correlation and Shortest-Path Techniques

International ejournals

Modeling and Estimation of FPN Components in CMOS Image Sensors

Adaptive subregion variable shape block motion compensated prediction

Introduction to Digital Image Processing

Performance of Error Normalized Step Size LMS and NLMS Algorithms: A Comparative Study

Mesh Based Interpolative Coding (MBIC)

An Accurate Method for Skew Determination in Document Images

GEOG 4110/5100 Advanced Remote Sensing Lecture 4

Directionally Selective Fractional Wavelet Transform Using a 2-D Non-Separable Unbalanced Lifting Structure

SPM8 for Basic and Clinical Investigators. Preprocessing. fmri Preprocessing

Transcription:

Predictive Interpolation for Registration D.G. Bailey Institute of Information Sciences and Technology, Massey University, Private bag 11222, Palmerston North D.G.Bailey@massey.ac.nz Abstract Predictive interpolation is a new registration method that uses the bilinear interpolation equation to predict the pixel values of one image from those in another. The method requires that the images already be registered to the nearest pixel, which is not a serious limitation since hierarchical methods may be used to perform the initial registration. Registration accuracy of better than 5% of the width of a pixel can be expected between pairs of images, and this may be improved significantly if multiple images are to be registered to one another. Keywords: interpolation, registration, linear prediction. 1. Introduction Image fusion involves the combining of data from a number of image sources in such a manner that the output image contains more information than that contained in any of the individual images. The image sources are often different in their properties or characteristics. An important use of image fusion is to trade temporal resolution for spatial resolution, by combining several low-resolution images to construct a high-resolution image [1-3]. The pixels of an image provide a set of samples of the world. The sampling density, or the spacing between the pixels, limits the achievable resolution. However if a series of images is captured, each with the samples in slightly different locations, the combined sample density is higher that that of any single image. For the ensemble to contain more information than any one of its constituent images, a single image must not contain all of the information. This implies that the sample frequency for each individual image must be below the Nyquist rate, and the images be subject to aliasing [3]. The process of constructing a higher resolution image untangles the aliased information, so that the output image contains more information than that available from any of the individual input images. An important step within this reconstruction process is to register the low-resolution images with one another so that they may be combined. A range of different image registration methods can be used, and the performance of these was reviewed at last year's conference [4]. Four types of pixel based registration methods were compared: correlation methods; difference methods; phase methods; and predictive interpolation. High level, segmentation based methods were not discussed or compared. It can be shown that the correlation and difference methods are closely related [5], and they gave similar performance for sub-pixel registration accuracy [4]. The predictive interpolation method was a new method, that is computationally efficient, gives very good registration accuracy in the presence of aliasing, and degrades relatively gracefully in the presence of moderate noise. The purpose of this paper is to present a more detailed description of this new method, and to give an overview of some of its properties. 240

2. Using interpolation for registration To give sub-pixel accuracy, some form of interpolation is required. Consider at this stage, determining the offset between a pair of images. Also, assume that the images have already been registered to the nearest pixel, so all that is required is to refine the accuracy to within a pixel. This interpolation may be approached in one of two ways. The first is to interpolate one (or both) images to be registered and use standard pixel-level interpolation methods (correlation or difference methods) to evaluate the fit. There are at least two problems with this approach. The first is the computational overhead of performing the image interpolation at each offset tested. The second is that since interpolation is a form of low pass filtering, there may be a loss of some fine detail in the interpolated image. For this reason, both images may be interpolated by the same amount but in opposite directions so the filter characteristics do not affect the accuracy. A second problem with interpolation is that since the images contain aliased information, the interpolation filter further distorts this information. Subsequent resampling of the interpolated images will not recover the aliased information. The second approach to interpolation is to use pixel level registration, and evaluate the fit on an integer grid. This fit surface is then interpolated to estimate the position of optimum fit. An appropriate surface model must be selected to accurately find the best fit [4]. 2.1 Predictive interpolation for registration The predictive interpolation method turns the problem around. Rather than interpolate and then evaluate the best fit, the interpolation method attempts to predict the pixel values of one image from the other, and from that to infer what the offset is. f (x, y +1) v f (x +1,y + 1) f (x + u, y + v) Consider bilinear interpolation of an image f f (x, y) u f (x +1,y) within an integer spaced grid as shown in figure 1. Figure 1: Interpolation on an integer grid. The value of the interpolated point, f (x + u, y + v) is given by [6] f (x + u, y + v) = (1 u)(1 v) f ( x, y) + (1 u)vf ( x, y +1) + u(1 v) f (x +1,y) + uvf ( x +1, y +1) (1) Given that we have a pair of images, f and g, that have already been registered to the nearest pixel, we can use this relation to estimate the offset between images. The idea here is to try and predict the pixel values of one image from the other using a linear predictor: g(x,y) = A 00 f (x,y) + A 01 f ( x, y +1) + A 10 f (x +1,y) + A 11 f (x + 1, y +1) (2) where the A ij are the linear prediction coefficients. Further, from comparing equations (1) and (2), we want to apply the following contraint: A 00 + A 01 + A 10 + A 11 = 1 (3) Substituting (3) into (2) gives: g(x,y) f (x,y) = A 01 ( f (x,y + 1) f (x,y)) ( ) ( ) + A 10 f (x + 1, y) f (x, y) + A 11 f ( x +1, y +1) f (x,y) (4) 241

Least squares minimisation can then be used over the whole image to give the values of the coefficients that minimise the prediction error between the two images. By equating equations (1) and (2) we have A 01 = v uv A 10 = u uv A 11 = uv (5) which can be solved to give the sub-pixel offset between the two images as u = A 10 + A 11 v = A 01 + A 11 (6) Therefore using bilinear interpolation filter as a prediction equation, we are able to obtain a direct estimate of the sub-pixel offset between two images. 2.2 Other considerations In practise, simple application of least squares over equation (4) is not particularly effective because featureless regions within the images would give meaningless prediction coefficients, dominated by noise. This may be overcome by weighting the individual instances of equation (4) by the local variance. In this way regions which are relatively featureless are given little weight, while those regions that have more edge activity are given more weight. One limitation of using predictive interpolation is that it requires the two images to be registered to the nearest pixel before it can be applied. There are two ways that this limitation may be overcome. The first is through scanning, where a range of different pixel level offsets is tried. The pixel level offset that has the minimum prediction error will also provide the sub-pixel offset. The second approach is to use a hierarchical method to determine the offset at coarser scales. With this approach, a series of increasingly lower resolution images can be created by averaging 2x2 blocks from the next higher resolution. Sub-pixel registration on the lowresolution images would indicate the nearest pixel-level registration for use at higher levels. The procedure described in the previous section is computationally efficient when compared with the more conventional correlation and difference methods for sub-pixel registration. 3 Registration accuracy To evaluate the accuracy of registration, it is necessary to work with a set of images where the offsets are known in advance. This was accomplished by taking a high-resolution image, and sub-sampling it to produce a series of low-resolution images with predetermined offsets. Predictive interpolation was then used to measure the offsets, and these results were compared with the known offsets to determine the registration error. In the tests here, the original images were 512x512 pixels. These were filtered using a 3x3 linear filter and then sub-sampled by a factor of 2 by discarding every second pixel in the horizontal and vertical directions. Selecting the filter weights controlled the sub-pixel offset of each low-resolution image. This allowed a set of 256x256 images to be created that were offset by multiples of 5% of a pixel in both the horizontal and vertical directions. The sample images that were used are shown in figure 2. The images were chosen for their varying level of detail, from a fully structured image (A), two general images (B and C) with varying levels of detail, a high resolution text image (D) and a low resolution text image (E) with inadequate resolution to be able to read the text. The final image is expected to present a considerable challenge because the fine detail within the image will result in considerable aliasing. 242

A B C D E Figure 2: The sample low resolution images. 3.1 Registration error To determine the registration error, each pairwise combination of images was taken, and the offset between the images was predicted using the central 100x100 pixels from the images. The RMS error between the predicted offsets and the true offsets was used as a measure of registration performance. The average registration errors are listed in table 1 for the sample images used. Image A B C D E Error 3.48% 4.56% 4.28% 4.59% 8.65% Table 1: RMS registration error for the different images as a fraction of a pixel. Preliminary investigations [4] indicated that there might be a systematic bias in the errors. To check for the presence of such a bias, the horizontal and vertical components of the error were plotted as a function of the known offsets between the images. With horizontal and vertical offsets in steps of 5% of a pixel, this gave a clear indication of the form of the errors. Two components of the errors were determined: the systematic bias (shown in figure 3) and the random component (shown in figure 4). 243

Error mean 0.08 0.06 0.04 0.02 0-0.02-0.04-0.06-0.08 A B C D E 0 0.2 0.4 0.6 0.8 1 Offset Figure 3: Non-random bias component. Error mean as a function of offset. Both scales are in pixels. Error standard deviation 0.06 0.05 0.04 0.03 0.02 0.01 A B C D E 0 0 0.2 0.4 0.6 0.8 1 Offset Figure 4: Random error component. Standard deviation as a function of offset. Both scales are in pixels. 3.2 Effect of window size To investigate the effects of the window size on the two error components, the experiment was repeated, varying the size of the window used for registration from 10x10 pixels up to 160x160 pixels. The systematic bias component did not vary significantly with window sizes greater than 20x20. This implies that the systematic bias is not introduced by the windowing, but by the method itself. Unfortunately, the bias appears to be image dependent (at least to some extent as illustrated in figure 3), so cannot be removed without knowing in advance what its level is. For small window sizes, the random component of the error decreased with increasing window size, as would be expected. However, for window sizes above 60x60 to 100x100 (depending on the image), the reduction in error was negligible. For small window sizes, the error is distributed uniformly as a function of offset, with a slight peak at about 0.5 pixel. As the window size is increased, the double peaked nature of the error, as seen in figure 4, becomes pronounced. The error at 0.5 pixel offset reduces significantly while the peaks at 0.2 and 0.8 pixel offsets do not decrease significantly with windows sizes larger than 100x100 pixels. This implies that the limit of the technique is being reached and the variability remaining is inherent in the technique. 3.3 Enforcing consistency The results presented so far in this paper are based on pairwise registrations. If more than two images are to be registered relative to one another, it is possible to reduce the errors between the pairs of images by enforcing consistency. If there are N images being registered then there are 1 2 N(N 1) separate registration measurements. Enforcing consistency determines the N 1 independent offsets in such a way that the error with respect to the measurements is minimised. If M ij is the measured offset between image i and image j, then the minimised offsets with respect to the first image are given by M i1 = 1 N i1 N M j1 + M ij M ji N (7) j= 2 j =1 j= i+ 1 If the registration errors are independent and normally distributed, then the average error will be 2 reduced by a factor of N. Results presented at last year's conference [4] indicated that the average error was reduced by significantly more than this factor. The reason for this is the particular pattern of bias measured in section 3.1. In [4] the low resolution images were offset in the horizontal and vertical directions by exactly 1/3 and 2/3 pixels. In enforcing consistency, the 244

biases cancelled exactly, leaving only the random component of the error. As a consequence, the error improved significantly better than was expected from enforcing consistency. A more realistic study to investigate the effects of enforcing consistency would be to create a set of images with random (but known) relative offsets, and measure the reduction in average error that results from enforcing consistency. The results shown in table 2 are the average improvements resulting from 10 runs on each image with a 100x100 window. Image A B C D E Expected improvement 4 images 0.7190 0.7140 0.7160 0.7117 0.7109 0.7071 8 images 0.6992 0.5059 0.5069 0.5154 0.5439 0.5000 16 images 0.6352 0.3642 0.3605 0.4152 0.4302 0.3536 Table 2: The reduction in average error resulting from enforcing consistency. In these experiments, the improvement in accuracy as a result of enforcing consistency was almost to the expected level. The notable exception to this was image A. It is surmised that because there are relatively few edges in this image that gathering more data does not improve the accuracy significantly. 4 Summary and Conclusions Predictive interpolation is an efficient method for determining the relative offset between pairs of images. The accuracy of the method between pairs of images is typically 5% of a pixel or better. When more than two images are being registered relative to one another, the average accuracy may be further improved by enforcing consistency within the offsets of an image set. There are two components to the error: a systematic bias component, and a random component. The bias component is independent of window size above 20x20, and the random component is independent of window size above 100x100 pixels. Therefore, unless there is noise present, the optimum window size for matching is approximately 100x100 pixels. A flaw has been identified in the previous experiments [4] that resulted in a better than expected improvement from enforcing consistency. Enforcing consistency over multiple images with random offsets gives the expected improvement in accuracy. References [1] R.C. Hardie, T.R. Tuinstra, J. Bognar, K.J. Bernard, and E. Armstrong: High resolution image reconstruction from digital video with global and non-global scene motion. Proc. IEEE International Conference On Image Processing, Vol I (October 1997) 153-156. [2] M.C. Hong, M.G. Kang, A.K. Katsaggelos: An iterative weighted regularized algorithm for improving the resolution of video sequences. Proc. IEEE International Conference On Image Processing, Vol II (October 1997) 474-477. [3] D.G. Bailey: Super-resolution of bar codes. Proc SPIE Vol 3521, Machine Vision Systems for Inspection and Metrology VII (November 1998) 204-213 [4] D.G. Bailey and T.H. Lill: Image Registration Methods for Resolution Improvement, Proceedings of Image and Vision Computing NZ (August 1999) 91-96 [5] A.K. Jain: Fundamentals of Image Processing. Prentice Hall, Englewood Cliffs, New Jersey (1989) 400-407. [6] G. Wolberg: Digital Image Warping. IEEE Computer Society, Los Alamitos, CA (1990) 58-59. 245