Guided Image Super-Resolution: A New Technique for Photogeometric Super-Resolution in Hybrid 3-D Range Imaging

Similar documents
Guided Image Super-Resolution: A New Technique for Photogeometric Super-Resolution in Hybrid 3-D Range Imaging

Gradient-Based Differential Approach for Patient Motion Compensation in 2D/3D Overlay

Quality Guided Image Denoising for Low-Cost Fundus Imaging

Multi-Sensor Super-Resolution for Hybrid Range Imaging with Application to 3-D Endoscopy and Open Surgery

EFFICIENT PERCEPTUAL, SELECTIVE,

GPU-Accelerated Time-of-Flight Super-Resolution for Image-Guided Surgery

Super-Resolution (SR) image re-construction is the process of combining the information from multiple

Bilevel Sparse Coding

High Performance GPU-Based Preprocessing for Time-of-Flight Imaging in Medical Applications

Multi-Frame Super-Resolution with Quality Self-Assessment for Retinal Fundus Videos

Blind Image Deblurring Using Dark Channel Prior

Locally Weighted Least Squares Regression for Image Denoising, Reconstruction and Up-sampling

Geometry-Based Optic Disk Tracking in Retinal Fundus Videos

Adaptive Multiple-Frame Image Super- Resolution Based on U-Curve

Super Resolution Using Graph-cut

Introduction to Image Super-resolution. Presenter: Kevin Su

Super resolution: an overview

Combined Bilateral Filter for Enhanced Real-Time Upsampling of Depth Images

Robust Video Super-Resolution with Registration Efficiency Adaptation

Super-Resolution from Image Sequences A Review

Super-Resolution for Hyperspectral Remote Sensing Images with Parallel Level Sets

Structured light 3D reconstruction

Mesh from Depth Images Using GR 2 T

Iterative CT Reconstruction Using Curvelet-Based Regularization

ToF/RGB Sensor Fusion for Augmented 3-D Endoscopy using a Fully Automatic Calibration Scheme

Depth-Layer-Based Patient Motion Compensation for the Overlay of 3D Volumes onto X-Ray Sequences

Notes 9: Optical Flow

Super-resolution on Text Image Sequences

Digital Image Restoration

Sparse Principal Axes Statistical Deformation Models for Respiration Analysis and Classification

Multi-Input Cardiac Image Super-Resolution using Convolutional Neural Networks

One Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models

Advanced phase retrieval: maximum likelihood technique with sparse regularization of phase and amplitude

Subpixel accurate refinement of disparity maps using stereo correspondences

SUPPLEMENTARY MATERIAL

What have we leaned so far?

EE795: Computer Vision and Intelligent Systems

Computer Vision 2. SS 18 Dr. Benjamin Guthier Professur für Bildverarbeitung. Computer Vision 2 Dr. Benjamin Guthier

Dense Tracking and Mapping for Autonomous Quadrocopters. Jürgen Sturm

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Fast Guided Global Interpolation for Depth and. Yu Li, Dongbo Min, Minh N. Do, Jiangbo Lu

IN MANY applications, it is desirable that the acquisition

BIL Computer Vision Apr 16, 2014

G Practical Magnetic Resonance Imaging II Sackler Institute of Biomedical Sciences New York University School of Medicine. Compressed Sensing

Improvement and Evaluation of a Time-of-Flight-based Patient Positioning System

LidarBoost: Depth Superresolution for ToF 3D Shape Scanning

Tri-modal Human Body Segmentation

A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri

Computer Vision II Lecture 4

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

High Quality Shape from a Single RGB-D Image under Uncalibrated Natural Illumination

Single-Image Super-Resolution Using Multihypothesis Prediction

DEEP LEARNING OF COMPRESSED SENSING OPERATORS WITH STRUCTURAL SIMILARITY (SSIM) LOSS

A Novel Multi-Frame Color Images Super-Resolution Framework based on Deep Convolutional Neural Network. Zhe Li, Shu Li, Jianmin Wang and Hongyang Wang

Motion Tracking and Event Understanding in Video Sequences

Motion Estimation (II) Ce Liu Microsoft Research New England

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

Image de-fencing using RGB-D data

Depth Sensors Kinect V2 A. Fornaser

Computer Vision I - Basics of Image Processing Part 1

CALIBRATION BETWEEN DEPTH AND COLOR SENSORS FOR COMMODITY DEPTH CAMERAS. Cha Zhang and Zhengyou Zhang

Accurate Image Registration for MAP Image Super-Resolution

A primal-dual framework for mixtures of regularizers

Markov Random Fields and Gibbs Sampling for Image Denoising

Colored Point Cloud Registration Revisited Supplementary Material

EN1610 Image Understanding Lab # 4: Corners, Interest Points, Hough Transform

Automatic No-Reference Quality Assessment for Retinal Fundus Images Using Vessel Segmentation

Performance study on point target detection using super-resolution reconstruction

Reliability Based Cross Trilateral Filtering for Depth Map Refinement

Ping Tan. Simon Fraser University

Superresolution software manual

Image Restoration with Deep Generative Models

Edge-Preserving MRI Super Resolution Using a High Frequency Regularization Technique

Total Variation Regularization Method for 3D Rotational Coronary Angiography

Enhancing DubaiSat-1 Satellite Imagery Using a Single Image Super-Resolution

Multi-task Multi-modal Models for Collective Anomaly Detection

Bidirectional Recurrent Convolutional Networks for Video Super-Resolution

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Short Survey on Static Hand Gesture Recognition

Feature Tracking and Optical Flow

Multi-Frame Image Restoration by a Variational Bayesian Method for Motion-Blur-Free Multi-Exposure Photography

Normalized cuts and image segmentation

Motion Estimation using Block Overlap Minimization

Markov Networks in Computer Vision

Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation

ECE276A: Sensing & Estimation in Robotics Lecture 11: Simultaneous Localization and Mapping using a Particle Filter

SEMI-BLIND IMAGE RESTORATION USING A LOCAL NEURAL APPROACH

Segmentation: Clustering, Graph Cut and EM

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

The SIFT (Scale Invariant Feature

3D Imaging from Video and Planar Radiography

Introduction à la vision artificielle X

Markov Networks in Computer Vision. Sargur Srihari

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

Locally Adaptive Learning for Translation-Variant MRF Image Priors

Total Variation Regularization Method for 3-D Rotational Coronary Angiography

Image Reconstruction from Videos Distorted by Atmospheric Turbulence

When Sparsity Meets Low-Rankness: Transform Learning With Non-Local Low-Rank Constraint For Image Restoration

Efficient Imaging Algorithms on Many-Core Platforms

Model-based Visual Tracking:

Transcription:

Guided Image Super-Resolution: A New Technique for Photogeometric Super-Resolution in Hybrid 3-D Range Imaging Florin C. Ghesu 1, Thomas Köhler 1,2, Sven Haase 1, Joachim Hornegger 1,2 04.09.2014 1 Pattern Recognition Lab 2 Erlangen Graduate School in Advanced Optical Technologies (SAOT)

Outline Introduction Proposed Guided Super-Resolution Bayesian Modeling Modeling the Image Formation Process Numerical Optimization Experiments and Results Simulated Data Real Data Summary and Conclusion 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 2

Introduction

Single-Sensor Super-Resolution Reconstruct high-resolution image from multiple low-resolution frames Exploit subpixel motion present in low-resolution image sequence Conventional algorithms only applicable to single modality (sensor) Single-sensor super-resolution Low-resolution 2 Super-resolved 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 4

Multi-Sensor Super-Resolution Main question: Can super-resolution do a better job if we consider data from multiple sensors? Yes, if we model dependencies/correlations between the sensors Applications: Hybrid range imaging: 3-D range data augmented with photometric information Multispectral imaging Other hybrid imaging setups, e. g. PET/CT or PET/MR Multi-sensor super-resolution 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 5

Related Work (in Hybrid Range Imaging) Single-sensor super-resolution applied to range images 1 2 Adopt techniques to range images originally introduced for color images Limitation: does not exploit complementary photometric information Multi-sensor super-resolution for range images guided by photometric data Guidance for motion estimation in presence of highly undersampled range data 3 Adaptive regularization driven by color images 4 Limitation: requires high-quality photometric information Our contribution: New regularization technique to guide range super-resolution by photometric data Super-resolved photometric data as by-product (photogeometric super-resolution) 1 S. Schuon et al., (2008), High-quality scanning using time-of-flight depth superresolution, CVPR 2008 2 S. Schuon et al., (2009), LidarBoost: Depth superresolution for ToF 3D shape scanning, CVPR 2009 3 T. Köhler et al., (2013), ToF Meets RGB: Novel Multi-Sensor Super-Resolution for Hybrid 3-D Endoscopy, MICCAI 2013 4 J. Park et al., (2010), High quality depth map upsampling for 3D-TOF cameras, ICCV 2011 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 6

Proposed Guided Super-Resolution

Bayesian Modeling of Multi-Sensor Super-Resolution Single-sensor super-resolution: Given: sequence of low-resolution frames y = ( y (1)... y (K)) Consider data of one modality (range images) Maximum a posteriori (MAP) estimation to reconstruct the most probable high-resolution image x: ˆx = arg max x = arg max x p(x y) p(y x)p(x) (1) y, x 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 8

Bayesian Modeling of Multi-Sensor Super-Resolution Multi-sensor super-resolution with independent channels: Low-resolution range (y) and photometric data (p) High-resolution range (x) and photometric data (q) MAP estimation: ˆx, ˆq = arg max x,q = arg max x,q p(x, q y, p) p(y x)p(p q) }{{} data likelihood p(x)p(q) }{{} prior Single-sensor super-resolution applied to each channel (2) p, q y, x 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 9

Bayesian Modeling of Multi-Sensor Super-Resolution Multi-sensor super-resolution with dependent channels: The sensors see the same scene Extend the MAP estimation: ˆx, ˆq = arg max p(x, q y, p) x,q = arg max p(y, p x, q) p(x, q) (3) x,q Joint density for both modalities to model prior: p, q p(x, q) = p(x)p(q x) }{{} dependencies How to model p(y, p x, q), p(x) and p(q x)? (4) 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 10 y, x

Modeling the Image Formation Process Mathematical model M to describe formation of k-th low-resolution frame (y (k) and p (k) ) from high-resolution image (x and q) M x : x y (k) M q : q p (k) (range data) (photometric data) Generative model for range and photometric data: ( ) ( ) y (k) (x γ (k) p (k) m W (k) ) y 0 = 0 η (k) m W (k) + q p ( γ (k) a 1 η (k) a 1 ) (5) W (k) y γ (k) m η (k) m and W (k) p and γ (k) a and η (k) a (system matrices) model subpixel motion, blur and subsampling models out-of-plane motion for range data models additive/multiplicative photometric differences 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 11

Joint Energy Minimization Log-likelihood function: (ˆx, ˆq) = arg min x,q ˆx, ˆq = arg min { log p(x, q y, p)} x,q = arg min { log (p(y, p x, q)p(x)p(q x))} (6) x,q Formulation as unconstrained energy minimization problem: F data (x, q) }{{} Data likelihood: p(y,p x,q) + R smooth (x, q) + R correlate (x, q) }{{} (7) Prior: p(x)p(p x) We use photometric data to guide range data as modeled by R correlate (x, q) Joint optimization performed in cyclic coordinate descent scheme 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 12

Anatomy of the Objective Function { } (ˆx, ˆq) = arg min F data (x, q) + R smooth (x, q) + R correlate (x, q) x,q Data fidelity term for range and photometric data: F data (x, q) = KN y i=1 β y,i r y,i (x) 2 + KN p i=1 β p,i r p,i (q) 2, (8) Residual error to measure data fidelity: r (k) y r (k) p = y (k) γ (k) = p (k) η (k) m W (k) y m W (k) p x γ (k) a 1 q η (k) a 1. β y,i and β p,i are confidence maps (estimated in our optimization algorithm) (9) 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 13

Anatomy of the Objective Function { } (ˆx, ˆq) = arg min F data (x, q) + R smooth (x, q) + R correlate (x, q) x,q Smoothness regularization for range and photometric data: R smooth (x 1,..., x n ) = λ x R(x) + λ q R(q) (10) Edge preserving regularization weighted by λ x 0 and λ q 0 We use the bilateral total variation (BTV) 5 : R(z) = P P α i + j z S i vs j h z 1 (11) i= P j= P Calculates BTV for local neighborhood of radius P with weighting factor α where S i v and S j h models vertical/horizontal shifts of z 5 S. Farsiu et al., Fast and Robust Multi-Frame Super-Resolution, IEEE TIP, 2004 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 14

Anatomy of the Objective Function { } (ˆx, ˆq) = arg min F data (x, q) + R smooth (x, q) + R correlate (x, q) x,q Interdependence regularization between both modalities: R correlate (x, q) = λ c x Aq b 2 2 (12) Local (patch-wise) linear correlation model defined by guided filtering 6 A (diagonal matrix) and b are guided filter coefficients (estimated in our optimization algorithm) λ c 0 indicates how strong photometric data guides the range data 6 K. He et al., Guided Image Filtering, IEEE PAMI, 2013 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 15

Numerical Optimization Update Confidence Maps Photometric Super-Resolution Guided Filtering Range Super-Resolution Goto next iteration We employ iteratively re-weighted least squares (IRLS) optimization to reconstruct super-resolved range and photometric data Iteration sequence: let (x (t), q (t) ) be the estimates at iteration t Guided filter coefficients (interdependence regularization) and confidence maps (data fidelity term) are iteratively updated 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 16

Numerical Optimization Update Confidence Maps Photometric Super-Resolution Guided Filtering Range Super-Resolution Goto next iteration Step 1 (confidence maps): derive from the residual error for (x (t), q (t) ) β (t) 1 if r (t) y,i y,i = ɛ y ɛ y β (t) 1 if r (t) p,i otherwise p,i = ɛ p ɛ p otherwise r (t) y,i r (t) p,i (13) Assign smaller confidence to observation with higher residual ɛ y and ɛ p are estimated from the median absolute deviation of the residual 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 17

Numerical Optimization Update Confidence Maps Photometric Super-Resolution Guided Filtering Range Super-Resolution Goto next iteration Step 2 (photometric super-resolution): update q (t 1) to q (t) for fixed x q (t) = arg min q {F data (x, q) + R smooth (x, q)} x=x (t 1) (14) Interdependence regularization not used (photometric data guides range data but not vice versa) Convex optimization problem solved by scaled conjugate gradient method 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 18

Numerical Optimization Update Confidence Maps Photometric Super-Resolution Guided Filtering Range Super-Resolution Goto next iteration Step 3 (guided filtering): compute filter coefficients A (t) and b (t) 7 i ω k q i x i E ωk (q)e ωk (x) 1 ω k à k,k = (15) Var ωk (q) + ɛ b k = E ωk (x) à k,k E ωk (q) (16) Filter parameters: ω k (kernel size) and ɛ (regularization parameter) 7 K. He et al., Guided Image Filtering, IEEE PAMI, 2013 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 19

Numerical Optimization Update Confidence Maps Photometric Super-Resolution Guided Filtering Range Super-Resolution Goto next iteration Step 4 (range super-resolution): update x (t 1) to x (t) for fixed q x (t) = arg min x {F data (x, q) + R smooth (x, q) + R correlate (x, q)} q=q (t) (17) Use filter coefficients A (t) and b (t) for interdependence regularization Convex optimization problem solved by scaled conjugate gradient method 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 20

Experiments and Results

Experiments and Results Experiments: Simulated data with known ground truth Real datasets (Microsoft s Kinect) Compared methods: MAP super-resolution with L 2 norm model (applied to each modality separately) Robust super-resolution based on L 1 norm model (applied to each modality separately) 8 Proposed method for photogeometric super-resolution Motion estimation: optical flow on photometric data (employed for all super-resolution methods) 9 8 S. Schuon et al., (2008), High-quality scanning using time-of-flight depth superresolution, CVPR 2008 9 T. Köhler et al., (2013), ToF Meets RGB: Novel Multi-Sensor Super-Resolution for Hybrid 3-D Endoscopy, MICCAI 2013 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 22

Simulated Data: Experimental Setup Low-resolution range Ground truth Ground truth range/photometric data simulated with 640 480 px 10 Simulation of Gaussian PSF and subsampling (factor: 4) to generate low-resolution data 4 datasets: 10 image sequences (K = 31 low-resolution frames per sequence) 10 using the range imaging toolkit (RITK): http://www5.cs.fau.de/research/software/range-imaging-toolkit-ritk/ 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 23

Simulated Data: Results Evaluation for range data: MAP (L 2 norm) MAP (L 1 norm) Proposed 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 24

Simulated Data: Results MAP (L 2 norm) MAP (L 1 norm) Proposed Ground truth 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 25

Simulated Data: Results Evaluation for photometric data: Low-resolution Guided approach Ground truth 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 26

Simulated Data: Results Evaluation for photometric data: Low-resolution Guided approach Ground truth 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 27

Simulated Data: Results Evaluation of peak-signal-to-noise ratio (PSNR) and structural similarity (SSIM) for range and photometric data: All results averaged over n = 10 test sequences Sequence Low-resolution MAP - L 1 MAP - L 2 Proposed Bunny-1 32.78 (0.96) 34.10 (0.96) 34.05 (0.97) 35.01 (0.98) Range Bunny-2 31.29 (0.94) 32.84 (0.95) 33.22 (0.97) 33.34 (0.98) Dragon-1 24.63 (0.57) 27.68 (0.72) 28.71 (0.84) 30.00 (0.91) Dragon-2 27.14 (0.75) 29.09 (0.84) 29.76 (0.93) 30.80 (0.95) Photom. Bunny-1 28.48 (0.79) 29.82 (0.87) 29.79 (0.87) 29.79 (0.88) Bunny-2 30.05 (0.81) 31.35 (0.86) 31.42 (0.86) 31.43 (0.86) Dragon-1 23.34 (0.65) 24.25 (0.72) 24.24 (0.71) 24.27 (0.72) Dragon-2 24.65 (0.66) 25.60 (0.72) 25.51 (0.70) 25.60 (0.72) Improved PSNR/SSIM for range data and competitive results for photometric data (photometric data guides range data, not vice versa) 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 28

Simulated Data: Results Experimental analysis of the convergence: PSNR vs. IRLS iteration number 30 31 PSNR [db] 29.5 PSNR [db] 30.5 29 5 10 15 Iteration number 30 5 10 15 Iteration number Dragon-1 dataset Dragon-2 dataset Initialization: MAP super-resolution based on L 2 norm Typically converged after 10 IRLS iterations 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 29

Real Data: Experimental Setup Microsoft s Kinect: Photometric data Range data Acquisition of real data using Microsoft s Kinect (640 480 px, 30 fps) Subpixel motion due to small shaking of the device Datasets with sequences of K = 31 frames (magnification factor: 4) 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 30

Real Data: Results Evaluation for range data: Low-resolution MAP (L 2 norm) MAP (L 1 norm) Proposed 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 31

Real Data: Results Evaluation for photometric data: Low-resolution MAP (L 2 norm) MAP (L 1 norm) Proposed 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 32

Real Data: Results Super-resolved 3-D mesh with color overlay: MAP (L 2 norm) MAP (L 1 norm) Proposed Invalid pixels (due to occlusions) reconstructed by all methods Improved reconstruction of edges by our method 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 33

Summary and Conclusion

Summary and Conclusion Novel interdependence regularization to guide range super-resolution by photometric data Photogeometric resolution enhancement: super-resolve range and photometric data in a joint framework Robust image reconstruction based on IRLS optimization Outlook: Adaption/generalization for other sensors and hybrid imaging setups, e. g. Time-of-Flight imaging (range + amplitude data) RGB-D imaging to handle multiple color channels Multispectral imaging 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 35

Supplementary Material A super-resolution toolbox (Matlab & MEX/C++) and datasets used for our experiments are available on our webpage: http://www5.cs.fau.de/research/data Thank you very much for your attention! 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 36

Acknowledgments Thank you very much for the support of this work 04.09.2014 T. Köhler Pattern Recognition Lab, SAOT Guided Image Super-Resolution 37