Deconvolution Networks

Similar documents
Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs

Convolutional Neural Network based Medical Imaging Segmentation: Recent Progress and Challenges. Jiaxing Tan

Lecture 7: Semantic Segmentation

RGBd Image Semantic Labelling for Urban Driving Scenes via a DCNN

Convolutional Networks in Scene Labelling

MoonRiver: Deep Neural Network in C++

3D Convolutional Neural Networks for Landing Zone Detection from LiDAR

RGBD Occlusion Detection via Deep Convolutional Neural Networks

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs Supplementary Material

3 Object Detection. BVM 2018 Tutorial: Advanced Deep Learning Methods. Paul F. Jaeger, Division of Medical Image Computing

Deep Models for 3D Reconstruction

Detecting cars in aerial photographs with a hierarchy of deconvolution nets

Deep Learning with Tensorflow AlexNet

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

Fully Convolutional Networks for Semantic Segmentation

Presentation Outline. Semantic Segmentation. Overview. Presentation Outline CNN. Learning Deconvolution Network for Semantic Segmentation 6/6/16

3D Shape Analysis with Multi-view Convolutional Networks. Evangelos Kalogerakis

6. Convolutional Neural Networks

Deconvolutions in Convolutional Neural Networks

JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA

Advanced Video Analysis & Imaging

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

Structured Prediction using Convolutional Neural Networks

April 4-7, 2016 Silicon Valley

INTRODUCTION TO DEEP LEARNING

NVIDIA FOR DEEP LEARNING. Bill Veenhuis

08 An Introduction to Dense Continuous Robotic Mapping

ECE 5470 Classification, Machine Learning, and Neural Network Review

Using Machine Learning for Classification of Cancer Cells

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University

Video Object Segmentation using Deep Learning

Martian lava field, NASA, Wikipedia

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides

Capsule Networks. Eric Mintun

Machine Learning 13. week

CS6220: DATA MINING TECHNIQUES

Predicting ground-level scene Layout from Aerial imagery. Muhammad Hasan Maqbool

CIS680: Vision & Learning Assignment 2.b: RPN, Faster R-CNN and Mask R-CNN Due: Nov. 21, 2018 at 11:59 pm

Deep Learning Benchmarks Mumtaz Vauhkonen, Quaizar Vohra, Saurabh Madaan Collaboration with Adam Coates, Stanford Unviersity

INF 5860 Machine learning for image classification. Lecture 11: Visualization Anne Solberg April 4, 2018

SHIV SHAKTI International Journal in Multidisciplinary and Academic Research (SSIJMAR) Vol. 7, No. 2, April 2018 (ISSN )

Object Localization, Segmentation, Classification, and Pose Estimation in 3D Images using Deep Learning

ADVANCED IMAGE PROCESSING METHODS FOR ULTRASONIC NDE RESEARCH C. H. Chen, University of Massachusetts Dartmouth, N.

LEARNING TO INFER GRAPHICS PROGRAMS FROM HAND DRAWN IMAGES

Estimating Human Pose in Images. Navraj Singh December 11, 2009

SSD: Single Shot MultiBox Detector. Author: Wei Liu et al. Presenter: Siyu Jiang

Keras: Handwritten Digit Recognition using MNIST Dataset

Lung Tumor Segmentation via Fully Convolutional Neural Networks

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018

Scene Text Recognition for Augmented Reality. Sagar G V Adviser: Prof. Bharadwaj Amrutur Indian Institute Of Science

Dynamic Routing Between Capsules. Yiting Ethan Li, Haakon Hukkelaas, and Kaushik Ram Ramasamy

Deep Learning on Graphs

Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks

Dynamic Routing Between Capsules

Keras: Handwritten Digit Recognition using MNIST Dataset

Extracting and Composing Robust Features with Denoising Autoencoders

Backpropagation + Deep Learning

Spatial Localization and Detection. Lecture 8-1

Recurrent Convolutional Neural Networks for Scene Labeling

CSE 559A: Computer Vision

Perceiving the 3D World from Images and Videos. Yu Xiang Postdoctoral Researcher University of Washington

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017

Computer Vision: Making machines see

CS 223B Computer Vision Problem Set 3

Is Bigger CNN Better? Samer Hijazi on behalf of IPG CTO Group Embedded Neural Networks Summit (enns2016) San Jose Feb. 9th

Deep learning for dense per-pixel prediction. Chunhua Shen The University of Adelaide, Australia

Automatic Thoracic CT Image Segmentation using Deep Convolutional Neural Networks. Xiao Han, Ph.D.

Alternatives to Direct Supervision

Depth Estimation from a Single Image Using a Deep Neural Network Milestone Report

An ICA based Approach for Complex Color Scene Text Binarization

Polytechnic University of Tirana

Reconstructing Images of Bar Codes for Construction Site Object Recognition 1

CIS581: Computer Vision and Computational Photography Project 4, Part B: Convolutional Neural Networks (CNNs) Due: Dec.11, 2017 at 11:59 pm

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon

Tutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY

Computational Foundations of Cognitive Science

Ryerson University CP8208. Soft Computing and Machine Intelligence. Naive Road-Detection using CNNS. Authors: Sarah Asiri - Domenic Curro

Adaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited

A performance comparison of Deep Learning frameworks on KNL

Classification of objects from Video Data (Group 30)

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION

ConvolutionalNN's... ConvNet's... deep learnig

Akarsh Pokkunuru EECS Department Contractive Auto-Encoders: Explicit Invariance During Feature Extraction

CEA LIST s participation to the Scalable Concept Image Annotation task of ImageCLEF 2015

PIXELS TO VOXELS: MODELING VISUAL REPRESENTATION IN THE HUMAN BRAIN

Tunnel Effect in CNNs: Image Reconstruction From Max-Switch Locations

SEMANTIC segmentation has a wide array of applications

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. By Joa õ Carreira and Andrew Zisserman Presenter: Zhisheng Huang 03/02/2018

Generative Modeling with Convolutional Neural Networks. Denis Dus Data Scientist at InData Labs

Deep Learning for Computer Vision II

Enhao Gong, PhD Candidate, Electrical Engineering, Stanford University Dr. John Pauly, Professor in Electrical Engineering, Stanford University Dr.

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Charles R. Qi* Hao Su* Kaichun Mo Leonidas J. Guibas

CS 231A Computer Vision (Fall 2012) Problem Set 3

Learning Feature Hierarchies for Object Recognition

The exam is closed book, closed notes except your one-page (two-sided) cheat sheet.

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling

EE-559 Deep learning Networks for semantic segmentation

Transcription:

Deconvolution Networks Johan Brynolfsson Mathematical Statistics Centre for Mathematical Sciences Lund University December 6th 2016 1 / 27

Deconvolution Neural Networks 2 / 27

Image Deconvolution True Deconvolution Using neural network to deconvolve blurring in an image. For the unblurring to be effective, large convolutional kernels must be used. This however is both hard to optimise and expensive computationally. L. Xu et al presents a method in A Deep Convolutional Neural Network for Image Deconvolution. 3 / 27

Image Deconvolution A simple model is used to model the blurring y = x k (1) The image can be reconstructed using convolution x = F 1 (F (y) /F (k)) = F 1 (1/F (k)) y (2) A Wiener deconvolution is proposed ( { 1 x = F 1 F(k) 2 }) F (k) F(k) 2 + 1 y = k y (3) SNR 4 / 27

x = k y (4) For the unblurring to be effective, large convolutional kernels must be used. This however is both hard to optimise and expensive computationally, even with the model. By introducing the noise term k becomes compact with finite support. The kernel separability is achieved using singular value decomposition, making the problem 1D x = k y = ( ) s j u j vj T y (5) j 5 / 27

Image Deconvolution x = k y = j ( ) s j u j vj T y (6) 6 / 27

Image deconvolution using Neural Networks Image Deconvolution 7 / 27

Image Deconvolution 8 / 27

Deconvolution Networks for Semantic Segmentation 9 / 27

Instead of only wanting a class prediction we also want a localisation of where in the image the object(s) is(are). One way to solve this is to produce 5 values - image class + two bounding box coordinates. 10 / 27

Semantic Segmentation 11 / 27

Very naive approach: Pixel by pixel classification by sending separate patches into neural network. Non-robust and noisy 12 / 27

Less naive approach proposed by Sermanet et al in Integrated Recognition, Localisation and Detection using Convolutional Networks Pixel by pixel classification by sending separate patches into neural network. Repeat using multiple different size of boxes 13 / 27

Farabet et al proposes a multi-scale setup in Learning Hierarchical Features for Scene Labeling, together with an image section segmentation 14 / 27

Deconvolution Disliked name, but seems to have have stuck Also called transpose convolution, fractional strided convolution, or backwards convolution. 15 / 27

Long et al proposes deconvolution in Fully Convolutional Networks for Semantic Segmentation 16 / 27

Unpooling Used to sample up the image between deconvolution First proposed to be used naively, by up-sampling each pixel to e.g. 2x2 pixels. 17 / 27

Unpooling Noh et al proposes in an improved unpooling in Learning Deconvolution Network for Semantic Segmentation In each max-pooling layer, the coordinates of max-value is stored. In a corresponding unpooling layer, values from a previous layer are entered into stored coordinated, setting the rest of the pixels to zero. By doing this, localisation information is included in the unpooling. 18 / 27

Deconvolution Networks Noh et al proposes a VGG 16-layer net, with classification layer removed The structure is reversed, with deconvolution and unpooling, to produce a semantic segmentation 19 / 27

An example of deconvolution/unpooling 20 / 27

Neural network deconvolution for auto-encoding The deconvolution network may also be used for auto encoding, where the image itself is given as ground-truth https://github.com/mikesj-public/convolutional_autoencoder/ tree/master 21 / 27

22 / 27

23 / 27

24 / 27

Physicians spend 1h / scan segmenting! Huge amount of data ( 500x500x250 = 62 500 000 voxels ) Noisy data Poor resolution Arbitrary intensities Tumors, hernias, prosthetics, tatoos etc. 25 / 27

A successful solution is to extract data around each voxel Use multiple networks Cure segmentation using shape awareness 26 / 27

HOMEWORK (One of the following is sufficient) Install and test the deconvolution network for semantic segmentation ( caffe ) https://github.com/hyeonwoonoh/deconvnet Install and test deconvolution for auto-encoding and compare with fully connected auto-encoding for the mnist dataset ( theano ) https://github.com/mikesj-public/convolutional_ autoencoder/tree/master 27 / 27