RGBd Image Semantic Labelling for Urban Driving Scenes via a DCNN
|
|
- Claire Farmer
- 5 years ago
- Views:
Transcription
1 RGBd Image Semantic Labelling for Urban Driving Scenes via a DCNN Jason Bolito, Research School of Computer Science, ANU Supervisors: Yiran Zhong & Hongdong Li
2 2 Outline 1. Motivation and Background 2. Proposed Method 3. Implementation, Experiment and Results 4. Conclusion and Future Work
3 Motivation Semantic Segmentation Understanding road scenes. Useful for autonomous cars and drones. Source: cityscape datasets. 3
4 4 Semantic Segmentation vs. Object Recognition Object Recognition Semantic Segmentation Person Source: cityscapes-datasets.com Road Person Vegetation Motorcycle
5 5 What we want from our method Leverage both 3D and colour information. Attain more accurate and robust semantic segmentation.
6 6 Background RGB Semantic Labelling Earlier days: CRFs (low level vision cues). Recently: Deep Neural Nets.
7 7 Background Fully Convolutional Networks Source: FCNs for semantic segmentation by J. Long et al. Pixels to pixels approach. Builds on VGG16. (encoder) Upsampling using deconvolution to get label map. (decoder)
8 8 Background Deconvolution Networks Source: Learning Deconvolution Network for Semantic Segmentation by H. Noh et al. Expands VGG16. (encoder) Uses unpooling + deconv to get label map. (decoder)
9 9 Background SegNet Source: SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation by V. Badrinarayanan et al. Similar encoder-decoder structure. Removes fully connected layers. Prioritises memory efficiency.
10 10 Background RGBd Semantic Labelling HHA representation (Saurabh et all, 2014). Hard mutex constraints (Deng et al 2015). LSTM-F (Li et al, 2016). Fusenet (Hazirbas et al, 2016).
11 11 Background (cont d) Presented methods use depth as a channel. Depth used as generic information. 3D structure not considered/learned.
12 12 Proposed Method Ideas Use depth to partially reconstruct 3D scene. Use 3D convolution to capture structure. Apply encoder-decoder design to achieve rich segmentation maps.
13 13 Proposed Method S3D Feature maps Encoder Decoder Conv3D + ReLU Deconv3D + ReLU Conv3D (2x stride) + ReLU Softmax
14 14 S3D building blocks Input Layer Input RGB image I is voxelised via disparity map D: I 3D (z, x, y, c) := I(x, y, c), for z = bd(x, y)c 0, otherwise 2.5D reconstruction of environment., Points at infinity have disparity 0.
15 S3D building blocks Encoder Feature extraction via 3D convolution: F out (z, x, y, c out )= X F in (z + k, x + i, y + j, c in )K cout (k, i, j, c in ) k,i,j,c in Each 3x3x3 filter is a learnable template. High response = input matches template. 3D structure = 3D input + 3D templates 15
16 16 S3D building blocks Encoder (cont d) Non-linear activation function: ReLU(x) = max(0, x) Good gradients for backprop. Learnable downsampling = strided 3D convolution.
17 17 S3D building blocks Decoder 3D deconvolution = inverse of 3D convolution. F out (z + k, x + i, y + j, c out )+= X c in F in (z, x, y, c in )K cout (k, i, j, c in ) Already implemented as backwards Conv3D pass. Learnable upsampling = strided 3D deconvolution.
18 18 S3D building blocks Decoder (cont d) Skip layers (top down modulation) Shallow = Low level features Conv3D... Helps with convergence and refines features... Deconv3D... Deep = High level knowledge
19 19 S3D building blocks Inference Use softmax to get probability cube: ˆP(z, x, y, c) := exp(f(z, x, y, c)) P c 0 2Classes exp(f(z, x, y, c0 )) Argmax over classes to get 3D labels: ˆL 3D (z, x, y) :=argmax c2classes ˆP(z, x, y, c). Project using D to get 2D labels: ˆL(x, y) :=ˆL 3D (bd(x, y)c, x, y)
20 20 Implementation Implemented using a deep-learning facade API and TensorFlow.
21 21 Experiment and Results Dataset: Cityscapes (urban scene dataset) Splits: 2795 training / 500 test images over 50 cities. GPU: Nvidia GeForce Titan X Pascal.
22 22 Experiment and Results (cont d) Image size: 128x64x128 Iterations: Around 30k Results: (State of Art has miou = 80.1%) G miou C G test miou test C test Learning feature extraction takes a while. Can we de better?
23 23 Experiment and Results (cont d) Trick: Let pre-trained 2D DCNN do feature extraction. Use S3D on extracted features. Method G miou C G test miou test C test time/it (s) S3D-ResNet S3D-ResNet S3D-ResNet Not SoA but matches DeepLab (71.4%)! Depth accuracy/efficiency trade-off!
24 24 Conclusion and Future Work Presented a DNN solution for semantic segmentation. Solution fully utilises 3D structure. Achieves good results especially when used on pre-extracted features. Good results achieved without any extra goodies! (CRFs, data augmentation, ) There is plenty of room for improvement!
25 25 Conclusion and Future Work (cont d) Need to push S3D to the limit. Can be done with post-processing, balancing, upsampling, What happens when we generalise one of the other architectures to 3D?
26 26 Questions? Thank You!
Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta
Encoder-Decoder Networks for Semantic Segmentation Sachin Mehta Outline > Overview of Semantic Segmentation > Encoder-Decoder Networks > Results What is Semantic Segmentation? Input: RGB Image Output:
More informationDeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs Zhipeng Yan, Moyuan Huang, Hao Jiang 5/1/2017 1 Outline Background semantic segmentation Objective,
More informationDeep learning for dense per-pixel prediction. Chunhua Shen The University of Adelaide, Australia
Deep learning for dense per-pixel prediction Chunhua Shen The University of Adelaide, Australia Image understanding Classification error Convolution Neural Networks 0.3 0.2 0.1 Image Classification [Krizhevsky
More informationPlaces Challenge 2017
Places Challenge 2017 Scene Parsing Task CASIA_IVA_JD Jun Fu, Jing Liu, Longteng Guo, Haijie Tian, Fei Liu, Hanqing Lu Yong Li, Yongjun Bao, Weipeng Yan National Laboratory of Pattern Recognition, Institute
More informationPresentation Outline. Semantic Segmentation. Overview. Presentation Outline CNN. Learning Deconvolution Network for Semantic Segmentation 6/6/16
6/6/16 Learning Deconvolution Network for Semantic Segmentation Hyeonwoo Noh, Seunghoon Hong,Bohyung Han Department of Computer Science and Engineering, POSTECH, Korea Shai Rozenberg 6/6/2016 1 2 Semantic
More informationA MULTI-RESOLUTION FUSION MODEL INCORPORATING COLOR AND ELEVATION FOR SEMANTIC SEGMENTATION
A MULTI-RESOLUTION FUSION MODEL INCORPORATING COLOR AND ELEVATION FOR SEMANTIC SEGMENTATION Wenkai Zhang a, b, Hai Huang c, *, Matthias Schmitz c, Xian Sun a, Hongqi Wang a, Helmut Mayer c a Key Laboratory
More informationLecture 7: Semantic Segmentation
Semantic Segmentation CSED703R: Deep Learning for Visual Recognition (207F) Segmenting images based on its semantic notion Lecture 7: Semantic Segmentation Bohyung Han Computer Vision Lab. bhhanpostech.ac.kr
More informationMulti-View 3D Object Detection Network for Autonomous Driving
Multi-View 3D Object Detection Network for Autonomous Driving Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia CVPR 2017 (Spotlight) Presented By: Jason Ku Overview Motivation Dataset Network Architecture
More informationDeconvolution Networks
Deconvolution Networks Johan Brynolfsson Mathematical Statistics Centre for Mathematical Sciences Lund University December 6th 2016 1 / 27 Deconvolution Neural Networks 2 / 27 Image Deconvolution True
More informationAdvanced Video Analysis & Imaging
Advanced Video Analysis & Imaging (5LSH0), Module 09B Machine Learning with Convolutional Neural Networks (CNNs) - Workout Farhad G. Zanjani, Clint Sebastian, Egor Bondarev, Peter H.N. de With ( p.h.n.de.with@tue.nl
More informationSEMANTIC segmentation has a wide array of applications
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 39, NO. 12, DECEMBER 2017 2481 SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation Vijay Badrinarayanan,
More informationDeconvolutions in Convolutional Neural Networks
Overview Deconvolutions in Convolutional Neural Networks Bohyung Han bhhan@postech.ac.kr Computer Vision Lab. Convolutional Neural Networks (CNNs) Deconvolutions in CNNs Applications Network visualization
More informationFully Convolutional Networks for Semantic Segmentation
Fully Convolutional Networks for Semantic Segmentation Jonathan Long* Evan Shelhamer* Trevor Darrell UC Berkeley Chaim Ginzburg for Deep Learning seminar 1 Semantic Segmentation Define a pixel-wise labeling
More informationSemantic Segmentation
Semantic Segmentation UCLA:https://goo.gl/images/I0VTi2 OUTLINE Semantic Segmentation Why? Paper to talk about: Fully Convolutional Networks for Semantic Segmentation. J. Long, E. Shelhamer, and T. Darrell,
More informationIn-Place Activated BatchNorm for Memory- Optimized Training of DNNs
In-Place Activated BatchNorm for Memory- Optimized Training of DNNs Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder Mapillary Research Paper: https://arxiv.org/abs/1712.02616 Code: https://github.com/mapillary/inplace_abn
More informationDeep Multimodal Fusion Networks for Semantic Segmentation
Clemson University TigerPrints All Theses Theses 8-2017 Deep Multimodal Fusion Networks for Semantic Segmentation Jesse Tetreault Clemson University Follow this and additional works at: https://tigerprints.clemson.edu/all_theses
More informationPOINT CLOUD DEEP LEARNING
POINT CLOUD DEEP LEARNING Innfarn Yoo, 3/29/28 / 57 Introduction AGENDA Previous Work Method Result Conclusion 2 / 57 INTRODUCTION 3 / 57 2D OBJECT CLASSIFICATION Deep Learning for 2D Object Classification
More informationDeep Learning with Tensorflow AlexNet
Machine Learning and Computer Vision Group Deep Learning with Tensorflow http://cvml.ist.ac.at/courses/dlwt_w17/ AlexNet Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton, "Imagenet classification
More informationLinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation Abhishek Chaurasia School of Electrical and Computer Engineering Purdue University West Lafayette, USA Email: aabhish@purdue.edu
More informationLearning Fully Dense Neural Networks for Image Semantic Segmentation
Learning Fully Dense Neural Networks for Image Semantic Segmentation Mingmin Zhen 1, Jinglu Wang 2, Lei Zhou 1, Tian Fang 3, Long Quan 1 1 Hong Kong University of Science and Technology, 2 Microsoft Research
More informationLearning Deep Structured Models for Semantic Segmentation. Guosheng Lin
Learning Deep Structured Models for Semantic Segmentation Guosheng Lin Semantic Segmentation Outline Exploring Context with Deep Structured Models Guosheng Lin, Chunhua Shen, Ian Reid, Anton van dan Hengel;
More informationExtend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network
Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network Liwen Zheng, Canmiao Fu, Yong Zhao * School of Electronic and Computer Engineering, Shenzhen Graduate School of
More informationDeep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon
Deep Learning For Video Classification Presented by Natalie Carlebach & Gil Sharon Overview Of Presentation Motivation Challenges of video classification Common datasets 4 different methods presented in
More informationINTRODUCTION TO DEEP LEARNING
INTRODUCTION TO DEEP LEARNING CONTENTS Introduction to deep learning Contents 1. Examples 2. Machine learning 3. Neural networks 4. Deep learning 5. Convolutional neural networks 6. Conclusion 7. Additional
More informationMask R-CNN. By Kaiming He, Georgia Gkioxari, Piotr Dollar and Ross Girshick Presented By Aditya Sanghi
Mask R-CNN By Kaiming He, Georgia Gkioxari, Piotr Dollar and Ross Girshick Presented By Aditya Sanghi Types of Computer Vision Tasks http://cs231n.stanford.edu/ Semantic vs Instance Segmentation Image
More informationDeep Fully Convolutional Networks with Random Data Augmentation for Enhanced Generalization in Road Detection
Deep Fully Convolutional Networks with Random Data Augmentation for Enhanced Generalization in Road Detection Jesu s Mun oz-bulnes, Carlos Fernandez, Ignacio Parra, David Ferna ndez-llorca, Miguel A. Sotelo
More informationStructured Prediction using Convolutional Neural Networks
Overview Structured Prediction using Convolutional Neural Networks Bohyung Han bhhan@postech.ac.kr Computer Vision Lab. Convolutional Neural Networks (CNNs) Structured predictions for low level computer
More informationFully Convolutional Network for Depth Estimation and Semantic Segmentation
Fully Convolutional Network for Depth Estimation and Semantic Segmentation Yokila Arora ICME Stanford University yarora@stanford.edu Ishan Patil Department of Electrical Engineering Stanford University
More informationCOMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017
COMP9444 Neural Networks and Deep Learning 7. Image Processing COMP9444 17s2 Image Processing 1 Outline Image Datasets and Tasks Convolution in Detail AlexNet Weight Initialization Batch Normalization
More informationFuzzy Set Theory in Computer Vision: Example 3, Part II
Fuzzy Set Theory in Computer Vision: Example 3, Part II Derek T. Anderson and James M. Keller FUZZ-IEEE, July 2017 Overview Resource; CS231n: Convolutional Neural Networks for Visual Recognition https://github.com/tuanavu/stanford-
More informationMask R-CNN. Kaiming He, Georgia, Gkioxari, Piotr Dollar, Ross Girshick Presenters: Xiaokang Wang, Mengyao Shi Feb. 13, 2018
Mask R-CNN Kaiming He, Georgia, Gkioxari, Piotr Dollar, Ross Girshick Presenters: Xiaokang Wang, Mengyao Shi Feb. 13, 2018 1 Common computer vision tasks Image Classification: one label is generated for
More informationComputer Vision: Making machines see
Computer Vision: Making machines see Roberto Cipolla Department of Engineering http://www.eng.cam.ac.uk/~cipolla/people.html http://www.toshiba.eu/eu/cambridge-research- Laboratory/ Vision: what is where
More informationLearning Depth from Single Images with Deep Neural Network Embedding Focal Length
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length Lei He, Guanghui Wang (Senior Member, IEEE) and Zhanyi Hu arxiv:1803.10039v1 [cs.cv] 27 Mar 2018 Abstract Learning depth
More informationSpeeding up Semantic Segmentation for Autonomous Driving
Speeding up Semantic Segmentation for Autonomous Driving Michael Treml 1, José Arjona-Medina 1, Thomas Unterthiner 1, Rupesh Durgesh 2, Felix Friedmann 2, Peter Schuberth 2, Andreas Mayr 1, Martin Heusel
More informationRefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation
: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation Guosheng Lin 1 Anton Milan 2 Chunhua Shen 2,3 Ian Reid 2,3 1 Nanyang Technological University 2 University of Adelaide 3 Australian
More informationTeam G-RMI: Google Research & Machine Intelligence
Team G-RMI: Google Research & Machine Intelligence Alireza Fathi (alirezafathi@google.com) Nori Kanazawa, Kai Yang, George Papandreou, Tyler Zhu, Jonathan Huang, Vivek Rathod, Chen Sun, Kevin Murphy, et
More informationarxiv: v1 [cs.cv] 7 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation arxiv:1606.02147v1 [cs.cv] 7 Jun 2016 Adam Paszke Faculty of Mathematics, Informatics and Mechanics University of Warsaw, Poland
More informationJOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA
JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS Zhao Chen Machine Learning Intern, NVIDIA ABOUT ME 5th year PhD student in physics @ Stanford by day, deep learning computer vision scientist
More informationRGBD Occlusion Detection via Deep Convolutional Neural Networks
1 RGBD Occlusion Detection via Deep Convolutional Neural Networks Soumik Sarkar 1,2, Vivek Venugopalan 1, Kishore Reddy 1, Michael Giering 1, Julian Ryde 3, Navdeep Jaitly 4,5 1 United Technologies Research
More informationUAV Navigation above Roads Using Convolutional Neural Networks
UAV Navigation above Roads Using Convolutional Neural Networks Thomas Ayoul Toby Buckley Felix Crevier tayoul@stanford.edu tobyb@stanford.edu fcrevier@stanford.edu Abstract In this project a UAV is used
More informationECE 5470 Classification, Machine Learning, and Neural Network Review
ECE 5470 Classification, Machine Learning, and Neural Network Review Due December 1. Solution set Instructions: These questions are to be answered on this document which should be submitted to blackboard
More informationFlow-Based Video Recognition
Flow-Based Video Recognition Jifeng Dai Visual Computing Group, Microsoft Research Asia Joint work with Xizhou Zhu*, Yuwen Xiong*, Yujie Wang*, Lu Yuan and Yichen Wei (* interns) Talk pipeline Introduction
More informationRefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation
: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation Guosheng Lin 1,2, Anton Milan 1, Chunhua Shen 1,2, Ian Reid 1,2 1 The University of Adelaide, 2 Australian Centre for Robotic
More informationObject Detection. CS698N Final Project Presentation AKSHAT AGARWAL SIDDHARTH TANWAR
Object Detection CS698N Final Project Presentation AKSHAT AGARWAL SIDDHARTH TANWAR Problem Description Arguably the most important part of perception Long term goals for object recognition: Generalization
More informationRDFNet: RGB-D Multi-level Residual Feature Fusion for Indoor Semantic Segmentation
RDFNet: RGB-D Multi-level Residual Feature Fusion for Indoor Semantic Segmentation Seong-Jin Park POSTECH Ki-Sang Hong POSTECH {windray,hongks,leesy}@postech.ac.kr Seungyong Lee POSTECH Abstract In multi-class
More informationOctree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs Supplementary Material
Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs Supplementary Material Peak memory usage, GB 10 1 0.1 0.01 OGN Quadratic Dense Cubic Iteration time, s 10
More informationInception and Residual Networks. Hantao Zhang. Deep Learning with Python.
Inception and Residual Networks Hantao Zhang Deep Learning with Python https://en.wikipedia.org/wiki/residual_neural_network Deep Neural Network Progress from Large Scale Visual Recognition Challenge (ILSVRC)
More information3D Object Recognition and Scene Understanding from RGB-D Videos. Yu Xiang Postdoctoral Researcher University of Washington
3D Object Recognition and Scene Understanding from RGB-D Videos Yu Xiang Postdoctoral Researcher University of Washington 1 2 Act in the 3D World Sensing & Understanding Acting Intelligent System 3D World
More informationDense Image Labeling Using Deep Convolutional Neural Networks
Dense Image Labeling Using Deep Convolutional Neural Networks Md Amirul Islam, Neil Bruce, Yang Wang Department of Computer Science University of Manitoba Winnipeg, MB {amirul, bruce, ywang}@cs.umanitoba.ca
More informationEfficient Segmentation-Aided Text Detection For Intelligent Robots
Efficient Segmentation-Aided Text Detection For Intelligent Robots Junting Zhang, Yuewei Na, Siyang Li, C.-C. Jay Kuo University of Southern California Outline Problem Definition and Motivation Related
More informationHand-Object Interaction Detection with Fully Convolutional Networks
Hand-Object Interaction Detection with Fully Convolutional Networks Matthias Schröder Helge Ritter Neuroinformatics Group, Bielefeld University {maschroe,helge}@techfak.uni-bielefeld.de Abstract Detecting
More informationScene Text Recognition for Augmented Reality. Sagar G V Adviser: Prof. Bharadwaj Amrutur Indian Institute Of Science
Scene Text Recognition for Augmented Reality Sagar G V Adviser: Prof. Bharadwaj Amrutur Indian Institute Of Science Outline Research area and motivation Finding text in natural scenes Prior art Improving
More informationBeyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks
Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre To cite this version: Nicolas Audebert, Bertrand Le Saux, Sébastien
More informationSemantic Segmentation of RGBD Videos with Recurrent Fully Convolutional Neural Networks
Semantic Segmentation of RGBD Videos with Recurrent Fully Convolutional Neural Networks Ekrem Emre Yurdakul, Yücel Yemez Computer Engineering Department, Koç University Istanbul, Turkey {ekyurdakul,yyemez}@ku.edu.tr
More informationDeep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.
Visualizing and Understanding Convolutional Networks Christopher Pennsylvania State University February 23, 2015 Some Slide Information taken from Pierre Sermanet (Google) presentation on and Computer
More informationMoonRiver: Deep Neural Network in C++
MoonRiver: Deep Neural Network in C++ Chung-Yi Weng Computer Science & Engineering University of Washington chungyi@cs.washington.edu Abstract Artificial intelligence resurges with its dramatic improvement
More informationVisual Perception for Autonomous Driving on the NVIDIA DrivePX2 and using SYNTHIA
Visual Perception for Autonomous Driving on the NVIDIA DrivePX2 and using SYNTHIA Dr. Juan C. Moure Dr. Antonio Espinosa http://grupsderecerca.uab.cat/hpca4se/en/content/gpu http://adas.cvc.uab.es/elektra/
More informationREGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION
REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION Kingsley Kuan 1, Gaurav Manek 1, Jie Lin 1, Yuan Fang 1, Vijay Chandrasekhar 1,2 Institute for Infocomm Research, A*STAR, Singapore 1 Nanyang Technological
More informationNvidia Jetson TX2 and its Software Toolset. João Fernandes 2017/2018
Nvidia Jetson TX2 and its Software Toolset João Fernandes 2017/2018 In this presentation Nvidia Jetson TX2: Hardware Nvidia Jetson TX2: Software Machine Learning: Neural Networks Convolutional Neural Networks
More informationSemi-supervised Semantic Segmentation using Generative Adversarial Networks
Vision, Graphics and Interactive Systems, Spring Semester 2017 Semi-supervised Semantic Segmentation using Generative Adversarial Networks Master s Thesis Troels Høg Peter Jensen June 8, 2017 Department
More informationTwo-Stream Convolutional Networks for Action Recognition in Videos
Two-Stream Convolutional Networks for Action Recognition in Videos Karen Simonyan Andrew Zisserman Cemil Zalluhoğlu Introduction Aim Extend deep Convolution Networks to action recognition in video. Motivation
More informationarxiv: v2 [cs.cv] 21 Dec 2017
Classification with an edge: improving semantic image segmentation with boundary detection arxiv:1612.01337v2 [cs.cv] 21 Dec 2017 D. Marmanis a,c, K. Schindler b, J. D. Wegner b, S. Galliani b, M. Datcu
More informationDepth-aware CNN for RGB-D Segmentation
Depth-aware CNN for RGB-D Segmentation Weiyue Wang [0000 0002 8114 8271] and Ulrich Neumann University of Southern California, Los Angeles, California {weiyuewa,uneumann}@usc.edu Abstract. Convolutional
More information3D Shape Analysis with Multi-view Convolutional Networks. Evangelos Kalogerakis
3D Shape Analysis with Multi-view Convolutional Networks Evangelos Kalogerakis 3D model repositories [3D Warehouse - video] 3D geometry acquisition [KinectFusion - video] 3D shapes come in various flavors
More informationDynamic Routing Between Capsules
Report Explainable Machine Learning Dynamic Routing Between Capsules Author: Michael Dorkenwald Supervisor: Dr. Ullrich Köthe 28. Juni 2018 Inhaltsverzeichnis 1 Introduction 2 2 Motivation 2 3 CapusleNet
More informationAutomatic Thoracic CT Image Segmentation using Deep Convolutional Neural Networks. Xiao Han, Ph.D.
Automatic Thoracic CT Image Segmentation using Deep Convolutional Neural Networks Xiao Han, Ph.D. Outline Background Brief Introduction to DCNN Method Results 2 Focus where it matters Structure Segmentation
More informationLSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University
LSTM and its variants for visual recognition Xiaodan Liang xdliang328@gmail.com Sun Yat-sen University Outline Context Modelling with CNN LSTM and its Variants LSTM Architecture Variants Application in
More informationMergeNet: A Deep Net Architecture for Small Obstacle Discovery
MergeNet: A Deep Net Architecture for Small Obstacle Discovery Krishnam Gupta 1, Syed Ashar Javed 2, Vineet Gandhi 2 and K. Madhava Krishna 2, Abstract We present here, a novel network architecture called
More informationCNN Basics. Chongruo Wu
CNN Basics Chongruo Wu Overview 1. 2. 3. Forward: compute the output of each layer Back propagation: compute gradient Updating: update the parameters with computed gradient Agenda 1. Forward Conv, Fully
More informationConvolutional Neural Network based Medical Imaging Segmentation: Recent Progress and Challenges. Jiaxing Tan
Convolutional Neural Network based Medical Imaging Segmentation: Recent Progress and Challenges Jiaxing Tan Road Map Introduction CNN based Models Encoder-Decoder based Models GAN Based Models Some Challenges
More informationCS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018
CS 1674: Intro to Computer Vision Object Recognition Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018 Different Flavors of Object Recognition Semantic Segmentation Classification + Localization
More informationDetecting cars in aerial photographs with a hierarchy of deconvolution nets
Detecting cars in aerial photographs with a hierarchy of deconvolution nets Satyaki Chakraborty Daniel Maturana Sebastian Scherer CMU-RI-TR-16-60 November 2016 Robotics Institute Carnegie Mellon University
More informationDeep Back-Projection Networks For Super-Resolution Supplementary Material
Deep Back-Projection Networks For Super-Resolution Supplementary Material Muhammad Haris 1, Greg Shakhnarovich 2, and Norimichi Ukita 1, 1 Toyota Technological Institute, Japan 2 Toyota Technological Institute
More informationTutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY
Tutorial on Keras CAP 6412 - ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY Deep learning packages TensorFlow Google PyTorch Facebook AI research Keras Francois Chollet (now at Google) Chainer Company
More informationarxiv: v1 [cs.cv] 2 Aug 2018
arxiv:1808.00897v1 [cs.cv] 2 Aug 2018 BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation Changqian Yu 1[0000 0002 4488 4157], Jingbo Wang 2[0000 0001 9700 6262], Chao Peng 3[0000
More informationEE-559 Deep learning Networks for semantic segmentation
EE-559 Deep learning 7.4. Networks for semantic segmentation François Fleuret https://fleuret.org/ee559/ Mon Feb 8 3:35:5 UTC 209 ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE The historical approach to image
More informationDeep Learning for Computer Vision with MATLAB By Jon Cherrie
Deep Learning for Computer Vision with MATLAB By Jon Cherrie 2015 The MathWorks, Inc. 1 Deep learning is getting a lot of attention "Dahl and his colleagues won $22,000 with a deeplearning system. 'We
More informationarxiv: v1 [cs.cv] 14 Dec 2016
Detect, Replace, Refine: Deep Structured Prediction For Pixel Wise Labeling arxiv:1612.04770v1 [cs.cv] 14 Dec 2016 Spyros Gidaris University Paris-Est, LIGM Ecole des Ponts ParisTech spyros.gidaris@imagine.enpc.fr
More informationDepth Estimation from Single Image Using CNN-Residual Network
Depth Estimation from Single Image Using CNN-Residual Network Xiaobai Ma maxiaoba@stanford.edu Zhenglin Geng zhenglin@stanford.edu Zhi Bie zhib@stanford.edu Abstract In this project, we tackle the problem
More informationINF 5860 Machine learning for image classification. Lecture 11: Visualization Anne Solberg April 4, 2018
INF 5860 Machine learning for image classification Lecture 11: Visualization Anne Solberg April 4, 2018 Reading material The lecture is based on papers: Deep Dream: https://research.googleblog.com/2015/06/inceptionism-goingdeeper-into-neural.html
More informationPerceiving the 3D World from Images and Videos. Yu Xiang Postdoctoral Researcher University of Washington
Perceiving the 3D World from Images and Videos Yu Xiang Postdoctoral Researcher University of Washington 1 2 Act in the 3D World Sensing & Understanding Acting Intelligent System 3D World 3 Understand
More informationPaper Motivation. Fixed geometric structures of CNN models. CNNs are inherently limited to model geometric transformations
Paper Motivation Fixed geometric structures of CNN models CNNs are inherently limited to model geometric transformations Higher-level features combine lower-level features at fixed positions as a weighted
More informationFuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture
FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture Caner Hazirbas, Lingni Ma, Csaba Domoos, and Daniel Cremers Technical University of Munich, Germany {hazirbas,lingni,domoos,cremers}@cs.tum.edu
More informationYOLO9000: Better, Faster, Stronger
YOLO9000: Better, Faster, Stronger Date: January 24, 2018 Prepared by Haris Khan (University of Toronto) Haris Khan CSC2548: Machine Learning in Computer Vision 1 Overview 1. Motivation for one-shot object
More informationForeground Segmentation for Anomaly Detection in Surveillance Videos Using Deep Residual Networks
Foreground Segmentation for Anomaly Detection in Surveillance Videos Using Deep Residual Networks Lucas P. Cinelli, Lucas A. Thomaz, Allan F. da Silva, Eduardo A. B. da Silva and Sergio L. Netto Abstract
More informationConditional Random Fields as Recurrent Neural Networks
BIL722 - Deep Learning for Computer Vision Conditional Random Fields as Recurrent Neural Networks S. Zheng, S. Jayasumana, B. Romera-Paredes V. Vineet, Z. Su, D. Du, C. Huang, P.H.S. Torr Introduction
More informationMartian lava field, NASA, Wikipedia
Martian lava field, NASA, Wikipedia Old Man of the Mountain, Franconia, New Hampshire Pareidolia http://smrt.ccel.ca/203/2/6/pareidolia/ Reddit for more : ) https://www.reddit.com/r/pareidolia/top/ Pareidolia
More informationSemi-Parallel Deep Neural Network (SPDNN) Hybrid Architecture, First Application on Depth from Monocular Camera
1 Semi-Parallel Deep Neural Network (SPDNN) Hybrid rchitecture, First pplication on Depth from Monocular Camera Shabab Bazrafkan 1, Student, IEEE, Hossein Javidnia 1, Student, IEEE, Joseph Lemley 2, Student,
More informationArbitrary Style Transfer in Real-Time with Adaptive Instance Normalization. Presented by: Karen Lucknavalai and Alexandr Kuznetsov
Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization Presented by: Karen Lucknavalai and Alexandr Kuznetsov Example Style Content Result Motivation Transforming content of an image
More informationConvolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech
Convolutional Neural Networks Computer Vision Jia-Bin Huang, Virginia Tech Today s class Overview Convolutional Neural Network (CNN) Training CNN Understanding and Visualizing CNN Image Categorization:
More informationKnow your data - many types of networks
Architectures Know your data - many types of networks Fixed length representation Variable length representation Online video sequences, or samples of different sizes Images Specific architectures for
More informationCSE 559A: Computer Vision
CSE 559A: Computer Vision Fall 2018: T-R: 11:30-1pm @ Lopata 101 Instructor: Ayan Chakrabarti (ayan@wustl.edu). Course Staff: Zhihao Xia, Charlie Wu, Han Liu http://www.cse.wustl.edu/~ayan/courses/cse559a/
More informationCapsule Networks. Eric Mintun
Capsule Networks Eric Mintun Motivation An improvement* to regular Convolutional Neural Networks. Two goals: Replace max-pooling operation with something more intuitive. Keep more info about an activated
More informationDeep Learning in Visual Recognition. Thanks Da Zhang for the slides
Deep Learning in Visual Recognition Thanks Da Zhang for the slides Deep Learning is Everywhere 2 Roadmap Introduction Convolutional Neural Network Application Image Classification Object Detection Object
More informationResidual Networks And Attention Models. cs273b Recitation 11/11/2016. Anna Shcherbina
Residual Networks And Attention Models cs273b Recitation 11/11/2016 Anna Shcherbina Introduction to ResNets Introduced in 2015 by Microsoft Research Deep Residual Learning for Image Recognition (He, Zhang,
More informationCS 1674: Intro to Computer Vision. Neural Networks. Prof. Adriana Kovashka University of Pittsburgh November 16, 2016
CS 1674: Intro to Computer Vision Neural Networks Prof. Adriana Kovashka University of Pittsburgh November 16, 2016 Announcements Please watch the videos I sent you, if you haven t yet (that s your reading)
More informationDeep learning in MATLAB From Concept to CUDA Code
Deep learning in MATLAB From Concept to CUDA Code Roy Fahn Applications Engineer Systematics royf@systematics.co.il 03-7660111 Ram Kokku Principal Engineer MathWorks ram.kokku@mathworks.com 2017 The MathWorks,
More informationDeep Learning and Its Applications
Convolutional Neural Network and Its Application in Image Recognition Oct 28, 2016 Outline 1 A Motivating Example 2 The Convolutional Neural Network (CNN) Model 3 Training the CNN Model 4 Issues and Recent
More informationPredicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture David Eigen, Rob Fergus
Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture David Eigen, Rob Fergus Presented by: Rex Ying and Charles Qi Input: A Single RGB Image Estimate
More informationSynscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet.
Synscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet 7D Labs VINNOVA https://7dlabs.com Photo-realistic image synthesis
More information