Deep Learning for Remote Sensing

Size: px
Start display at page:

Download "Deep Learning for Remote Sensing"

Transcription

1 1 ENPC Data Science Week Deep Learning for Remote Sensing Alexandre Boulch

2 2 ONERA Research, Innovation, expertise and long-term vision for industry, French government and Europe

3 3 Materials Optics Aerodynamics Flight dynamics Propulsion Electromagnetism Information Processing

4 4 Materials Optics Electromagnetism Aerodynamics Flight dynamics Propulsion Information Processing Autonomous UAVS Image and Video processing Data Fusion Remote Sensing

5 Remote Sensing Remote Sensing Obtaining information about objects without contact Earth Observation Gathering information about Earth via Remote Sensors aerial spatial ground 5

6 Remote Sensing 6 Massive data Satellites constellations covering the earth. e.g. Sentinel (ESA) Applications examples Urban area analysis

7 Remote Sensing 7 Massive data Satellites constellations covering the earth. e.g. Sentinel (ESA) Applications examples Urban area analysis Biomass estimation

8 Remote Sensing 8 Massive data Satellites constellations covering the earth. e.g. Sentinel (ESA) Applications examples Urban area analysis Biomass estimation Oil spread detection MNT estimation Building deformation from space Need for robust, automatic and fast processing

9 9 Objective INPUT DATA SEMANTIC MAP Registered aerial or spatial images 1 label per pixel Heterogeneous sources: - RGB - Hyperspectral - LIDAR - SAR... E.g. Tree, Building, Roads,... Machine Learning

10 10 Classification problem Labels: Car Dog House Plane Toy Game Ball Cat Road Tree Lego Ski Food Pillow Sun Flower... Images + Labels

11 11 Classification problem Labels: SKI HOUSE BALL CAT COMET Images + Labels PLANE Annotated images Car Dog House Plane Toy Game Ball Cat Road Tree Lego Ski Food Pillow Sun Flower...

12 12 Classification framework Direct classification is difficult Feature Classification extraction Human Image Annotated image Same object, different colors Image space : - High dimension (Image space) - No structure of the image - Not robust to changes Example

13 13 Classification framework Feature extraction Classification Human Image Annotated image Same object, different colors Similar after transformation Gradient and threshold Example

14 14 Classification framework Feature extraction Classification Human Image Annotated image Class 2 Class 1 Class 3

15 15 How to do it? Expert features Feature extraction Classification Human Image Emprical rules Emprical thresholds or SVM Water is blue + blue level > 200 green & red < 100 Annotated image

16 Designing features is difficult Binary classification Orange Not orange 16

17 Designing features is difficult What is a good and generic description of oranges? Binary classification Orange Not orange 17

18 18 Designing features is difficult What is a good and generic description of oranges? Round Binary classification Orange Not orange

19 19 Designing features is difficult What is a good and generic description of oranges? Round Binary classification Orange Not orange

20 20 Designing features is difficult What is a good and generic description of oranges? Round Color: orange Binary classification Orange Not orange

21 21 Designing features is difficult What is a good and generic description of oranges? Round Color: orange No line Binary classification Orange Not orange

22 22 Designing features is difficult What is a good and generic description of oranges? Round Color: orange No line Binary classification Orange Not orange

23 23 How to do it? Complex features Feature extraction Classification Human Image Annotated image Complex features: - generic features - human design SVM (support vector machines) Bike Histograms of Oriented Gradients (HOGs)

24 24 How to do it? Deep Neural Networks Feature extraction Classification Human Image Annotated image Convolutions (First layers) Fully connected layers Output Input

25 25 Deep neural networks Output Input Two step algorithm: FORWARD: predict the output given a input BACKWARD: compute the gradient of the loss, update the parameters of the network (back propagation) A lot of parameters to optimize Efficient computing using GPUs State of the art

26 26 First convolution layer Gabor filters [Daugman 1985] AlexNet first convolution layer [Krizhevsky 2012]

27 Neural networks are not new s, Artificial Neural Networks Fukushima, Neocognitron : A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position, Biological Cybernetics 36-4, 1980 Late 1980 s, Backpropagation algorithm applied to deep neural networks LeCun et al., Backpropagation Applied to Handwritten Zip Code Recognition, Neural Computation, 1, pp , s, GPU implementation, renewed interest in Deep Learning Ciresan et al., Deep Big Simple Neural Nets for Handwritten Digit Recognition, Neural Computation, 22, pp , 2010 Since 2010: Highly multiclass object recognition (ImageNet challenges), optical flow, segmentation...

28 28 What changed? Internet Data Deep Learning Computing power

29 What about Remote Sensing? 29 More data available Sentinel Landsat Free data ESA NSA / JPL Platforms PEPS Google Earth Engine

30 30 Application examples Optical Urban semantics RADAR Tree Species recognition 3D data Urban semantics

31 31 Tree species recognition Objective Semantic labeling of forest areas up to the species level Remningstorp (Sweden) Challenge Doing better than human eye.

32 Tree species recognition with SAR data Radar penetrates vegetation, it reflects on object with size similar to wave length. Radar data gives information on the structure of the forest. 32

33 33 Tree species recognition Generated fom open data: Dept. of Forest Resource Management, Swedish University of Agricultural Sciences 7 classes: Water No tree Trees: Birch Oak Pine Spruce Misc.

34 Tree species recognition 34 SAR data Band P Polarized Vegetation penetration HH HV VV channels RGB Color composition

35 35 Tree species recognition Ground Truth Histograms and SVM LeNet Small neural network AlexNet Large neural network

36 36 Urban semantics Car park optimization Road mapping Urban extension

37 37 Using a classification framework Inputs Multi source data Color composition 3 channels Superpixels or Regular grid Set of small images (one per super pixel) Classification Process Semantic Map One label per superpixel

38 Using segmentation networks 38 Dense prediction Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks Nicolas Audebert, B. Le Saux, Sébastien Lefèvre ACCV 2016

39 39 3D data semantics using segmentation networks Pick snap shots of the Point cloud Project back from 2d predictions to point cloud First place on the Semantic 3D dataset

40 40 On going projects at ONERA Big data for Remote Sensing w3.onera.fr/medusa DeLTA Deep learning for aerospatial delta-onera.github.io

41 Intern and PhD positions Internship opportunities From October to January PhD positions 2017 A joint geometrical and semantic approach to reconstructing digital model (ENPC / ONERA) Remote sensing images registration using a deep framework (ONERA) Deep learning for multi-temporal activity analysis in remote sensing (ONERA / ENST) All opportunities on 41

42 Conclusion 42 We chose it because we deal with huge amounts of data. Besides, it sounds really cool. Larry Page - Google RS in Information Processing team: Alexandre Boulch, Nicolas Audebert, Guillaume Brigot, Fabrice Janez, Elise Koeniguer, Bertrand Le Saux

Multi-modal, multi-temporal data analysis for urban remote sensing

Multi-modal, multi-temporal data analysis for urban remote sensing Multi-modal, multi-temporal data analysis for urban remote sensing Xiaoxiang Zhu www.sipeo.bgu.tum.de 10.10.2017 Uncontrolled Growth in Mumbai Slums Leads to Massive Fires and Floods So2Sat: Big Data for

More information

Spatial Localization and Detection. Lecture 8-1

Spatial Localization and Detection. Lecture 8-1 Lecture 8: Spatial Localization and Detection Lecture 8-1 Administrative - Project Proposals were due on Saturday Homework 2 due Friday 2/5 Homework 1 grades out this week Midterm will be in-class on Wednesday

More information

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech Convolutional Neural Networks Computer Vision Jia-Bin Huang, Virginia Tech Today s class Overview Convolutional Neural Network (CNN) Training CNN Understanding and Visualizing CNN Image Categorization:

More information

Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey

Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey Evangelos MALTEZOS, Charalabos IOANNIDIS, Anastasios DOULAMIS and Nikolaos DOULAMIS Laboratory of Photogrammetry, School of Rural

More information

An Exploration of Computer Vision Techniques for Bird Species Classification

An Exploration of Computer Vision Techniques for Bird Species Classification An Exploration of Computer Vision Techniques for Bird Species Classification Anne L. Alter, Karen M. Wang December 15, 2017 Abstract Bird classification, a fine-grained categorization task, is a complex

More information

Counting Passenger Vehicles from Satellite Imagery

Counting Passenger Vehicles from Satellite Imagery Counting Passenger Vehicles from Satellite Imagery Not everything that can be counted counts, and not everything that counts can be counted NVIDIA GPU Technology Conference 02 Nov 2017 KEVIN GREEN MACHINE

More information

INTRODUCTION TO DEEP LEARNING

INTRODUCTION TO DEEP LEARNING INTRODUCTION TO DEEP LEARNING CONTENTS Introduction to deep learning Contents 1. Examples 2. Machine learning 3. Neural networks 4. Deep learning 5. Convolutional neural networks 6. Conclusion 7. Additional

More information

Object Recognition II

Object Recognition II Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based

More information

DEEP NEURAL NETWORKS FOR OBJECT DETECTION

DEEP NEURAL NETWORKS FOR OBJECT DETECTION DEEP NEURAL NETWORKS FOR OBJECT DETECTION Sergey Nikolenko Steklov Institute of Mathematics at St. Petersburg October 21, 2017, St. Petersburg, Russia Outline Bird s eye overview of deep learning Convolutional

More information

Deep Learning for Computer Vision II

Deep Learning for Computer Vision II IIIT Hyderabad Deep Learning for Computer Vision II C. V. Jawahar Paradigm Shift Feature Extraction (SIFT, HoG, ) Part Models / Encoding Classifier Sparrow Feature Learning Classifier Sparrow L 1 L 2 L

More information

Project 3 Q&A. Jonathan Krause

Project 3 Q&A. Jonathan Krause Project 3 Q&A Jonathan Krause 1 Outline R-CNN Review Error metrics Code Overview Project 3 Report Project 3 Presentations 2 Outline R-CNN Review Error metrics Code Overview Project 3 Report Project 3 Presentations

More information

Does the Brain do Inverse Graphics?

Does the Brain do Inverse Graphics? Does the Brain do Inverse Graphics? Geoffrey Hinton, Alex Krizhevsky, Navdeep Jaitly, Tijmen Tieleman & Yichuan Tang Department of Computer Science University of Toronto How to learn many layers of features

More information

Image Classification pipeline. Lecture 2-1

Image Classification pipeline. Lecture 2-1 Lecture 2: Image Classification pipeline Lecture 2-1 Administrative: Piazza For questions about midterm, poster session, projects, etc, use Piazza! SCPD students: Use your @stanford.edu address to register

More information

Synscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet.

Synscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet. Synscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet 7D Labs VINNOVA https://7dlabs.com Photo-realistic image synthesis

More information

Remote Sensing Introduction to the course

Remote Sensing Introduction to the course Remote Sensing Introduction to the course Remote Sensing (Prof. L. Biagi) Exploitation of remotely assessed data for information retrieval Data: Digital images of the Earth, obtained by sensors recording

More information

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /10/2017

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /10/2017 3/0/207 Neural Networks Emily Fox University of Washington March 0, 207 Slides adapted from Ali Farhadi (via Carlos Guestrin and Luke Zettlemoyer) Single-layer neural network 3/0/207 Perceptron as a neural

More information

Machine Learning. Deep Learning. Eric Xing (and Pengtao Xie) , Fall Lecture 8, October 6, Eric CMU,

Machine Learning. Deep Learning. Eric Xing (and Pengtao Xie) , Fall Lecture 8, October 6, Eric CMU, Machine Learning 10-701, Fall 2015 Deep Learning Eric Xing (and Pengtao Xie) Lecture 8, October 6, 2015 Eric Xing @ CMU, 2015 1 A perennial challenge in computer vision: feature engineering SIFT Spin image

More information

Image Classification pipeline. Lecture 2-1

Image Classification pipeline. Lecture 2-1 Lecture 2: Image Classification pipeline Lecture 2-1 Administrative: Piazza For questions about midterm, poster session, projects, etc, use Piazza! SCPD students: Use your @stanford.edu address to register

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Computer Vision Lecture 16 Deep Learning for Object Categorization 14.01.2016 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar registration period

More information

Classification of objects from Video Data (Group 30)

Classification of objects from Video Data (Group 30) Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time

More information

Multisensoral UAV-Based Reference Measurements for Forestry Applications

Multisensoral UAV-Based Reference Measurements for Forestry Applications Multisensoral UAV-Based Reference Measurements for Forestry Applications Research Manager D.Sc. Anttoni Jaakkola Centre of Excellence in Laser Scanning Research 2 Outline UAV applications Reference level

More information

CPSC340. State-of-the-art Neural Networks. Nando de Freitas November, 2012 University of British Columbia

CPSC340. State-of-the-art Neural Networks. Nando de Freitas November, 2012 University of British Columbia CPSC340 State-of-the-art Neural Networks Nando de Freitas November, 2012 University of British Columbia Outline of the lecture This lecture provides an overview of two state-of-the-art neural networks:

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Announcements Computer Vision Lecture 16 Deep Learning Applications 11.01.2017 Seminar registration period starts on Friday We will offer a lab course in the summer semester Deep Robot Learning Topic:

More information

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Computer Vision Lecture 16 Deep Learning Applications 11.01.2017 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar registration period starts

More information

Real-time Object Detection CS 229 Course Project

Real-time Object Detection CS 229 Course Project Real-time Object Detection CS 229 Course Project Zibo Gong 1, Tianchang He 1, and Ziyi Yang 1 1 Department of Electrical Engineering, Stanford University December 17, 2016 Abstract Objection detection

More information

Convolutional Neural Networks

Convolutional Neural Networks Lecturer: Barnabas Poczos Introduction to Machine Learning (Lecture Notes) Convolutional Neural Networks Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal publications.

More information

Rich feature hierarchies for accurate object detection and semantic segmentation

Rich feature hierarchies for accurate object detection and semantic segmentation Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik Presented by Pandian Raju and Jialin Wu Last class SGD for Document

More information

Application of Deep Learning Techniques in Satellite Telemetry Analysis.

Application of Deep Learning Techniques in Satellite Telemetry Analysis. Application of Deep Learning Techniques in Satellite Telemetry Analysis. Greg Adamski, Member of Technical Staff L3 Technologies Telemetry and RF Products Julian Spencer Jones, Spacecraft Engineer Telenor

More information

CROP MAPPING WITH SENTINEL-2 JULY 2017, SPAIN

CROP MAPPING WITH SENTINEL-2 JULY 2017, SPAIN _p TRAINING KIT LAND01 CROP MAPPING WITH SENTINEL-2 JULY 2017, SPAIN Table of Contents 1 Introduction to RUS... 3 2 Crop mapping background... 3 3 Training... 3 3.1 Data used... 3 3.2 Software in RUS environment...

More information

Yiqi Yan. May 10, 2017

Yiqi Yan. May 10, 2017 Yiqi Yan May 10, 2017 P a r t I F u n d a m e n t a l B a c k g r o u n d s Convolution Single Filter Multiple Filters 3 Convolution: case study, 2 filters 4 Convolution: receptive field receptive field

More information

Deep Convolutional Neural Networks. Nov. 20th, 2015 Bruce Draper

Deep Convolutional Neural Networks. Nov. 20th, 2015 Bruce Draper Deep Convolutional Neural Networks Nov. 20th, 2015 Bruce Draper Background: Fully-connected single layer neural networks Feed-forward classification Trained through back-propagation Example Computer Vision

More information

Object Detection. Part1. Presenter: Dae-Yong

Object Detection. Part1. Presenter: Dae-Yong Object Part1 Presenter: Dae-Yong Contents 1. What is an Object? 2. Traditional Object Detector 3. Deep Learning-based Object Detector What is an Object? Subset of Object Recognition What is an Object?

More information

The Feature Analyst Extension for ERDAS IMAGINE

The Feature Analyst Extension for ERDAS IMAGINE The Feature Analyst Extension for ERDAS IMAGINE Automated Feature Extraction Software for GIS Database Maintenance We put the information in GIS SM A Visual Learning Systems, Inc. White Paper September

More information

Forest Structure Estimation in the Canadian Boreal forest

Forest Structure Estimation in the Canadian Boreal forest Forest Structure Estimation in the Canadian Boreal forest Michael L. Benson Leland E.Pierce Kathleen M. Bergen Kamal Sarabandi Kailai Zhang Caitlin E. Ryan The University of Michigan, Radiation Lab & School

More information

Rotation Invariance Neural Network

Rotation Invariance Neural Network Rotation Invariance Neural Network Shiyuan Li Abstract Rotation invariance and translate invariance have great values in image recognition. In this paper, we bring a new architecture in convolutional neural

More information

List of Accepted Papers for ICVGIP 2018

List of Accepted Papers for ICVGIP 2018 List of Accepted Papers for ICVGIP 2018 Paper ID ACM Article Title 3 1 PredGAN - A deep multi-scale video prediction framework for anomaly detection in videos 7 2 Handwritten Essay Grading on Mobiles using

More information

Dynamic Routing Between Capsules

Dynamic Routing Between Capsules Report Explainable Machine Learning Dynamic Routing Between Capsules Author: Michael Dorkenwald Supervisor: Dr. Ullrich Köthe 28. Juni 2018 Inhaltsverzeichnis 1 Introduction 2 2 Motivation 2 3 CapusleNet

More information

Synthetic Aperture Radar (SAR) Polarimetry for Wetland Mapping & Change Detection

Synthetic Aperture Radar (SAR) Polarimetry for Wetland Mapping & Change Detection Synthetic Aperture Radar (SAR) Polarimetry for Wetland Mapping & Change Detection Jennifer M. Corcoran, M.S. Remote Sensing & Geospatial Analysis Laboratory Natural Resource Science & Management PhD Program

More information

Fully Convolutional Networks for Semantic Segmentation

Fully Convolutional Networks for Semantic Segmentation Fully Convolutional Networks for Semantic Segmentation Jonathan Long* Evan Shelhamer* Trevor Darrell UC Berkeley Chaim Ginzburg for Deep Learning seminar 1 Semantic Segmentation Define a pixel-wise labeling

More information

Structured Prediction using Convolutional Neural Networks

Structured Prediction using Convolutional Neural Networks Overview Structured Prediction using Convolutional Neural Networks Bohyung Han bhhan@postech.ac.kr Computer Vision Lab. Convolutional Neural Networks (CNNs) Structured predictions for low level computer

More information

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides Deep Learning in Visual Recognition Thanks Da Zhang for the slides Deep Learning is Everywhere 2 Roadmap Introduction Convolutional Neural Network Application Image Classification Object Detection Object

More information

Anisotropic Methods for Image Registration

Anisotropic Methods for Image Registration Anisotropic Methods for Image Registration James Murphy November 24, 2014 () November 24, 2014 1 / 37 Goals of Talk 1 Explain the problem of image registration. 2 Discuss existing methods for image registration,

More information

EnsoMOSAIC. Kopterit metsäninventointidatan keruualustoina

EnsoMOSAIC. Kopterit metsäninventointidatan keruualustoina EnsoMOSAIC Kopterit metsäninventointidatan keruualustoina 20.4.2017 Company introduction MosaicMill founded in 2009 EnsoMOSAIC technology since 1994 Main businesses EnsoMOSAIC forestry solutions EnsoMOSAIC

More information

S7348: Deep Learning in Ford's Autonomous Vehicles. Bryan Goodman Argo AI 9 May 2017

S7348: Deep Learning in Ford's Autonomous Vehicles. Bryan Goodman Argo AI 9 May 2017 S7348: Deep Learning in Ford's Autonomous Vehicles Bryan Goodman Argo AI 9 May 2017 1 Ford s 12 Year History in Autonomous Driving Today: examples from Stereo image processing Object detection Using RNN

More information

Back propagation Algorithm:

Back propagation Algorithm: Network Neural: A neural network is a class of computing system. They are created from very simple processing nodes formed into a network. They are inspired by the way that biological systems such as the

More information

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017 COMP9444 Neural Networks and Deep Learning 7. Image Processing COMP9444 17s2 Image Processing 1 Outline Image Datasets and Tasks Convolution in Detail AlexNet Weight Initialization Batch Normalization

More information

From Multi-sensor Data to 3D Reconstruction of Earth Surface: Innovative, Powerful Methods for Geoscience and Other Applications

From Multi-sensor Data to 3D Reconstruction of Earth Surface: Innovative, Powerful Methods for Geoscience and Other Applications From Multi-sensor Data to 3D Reconstruction of Earth Surface: Innovative, Powerful Methods for Geoscience and Other Applications Bea Csatho, Toni Schenk*, Taehun Yoon* and Michael Sheridan, Department

More information

SVM Segment Video Machine. Jiaming Song Yankai Zhang

SVM Segment Video Machine. Jiaming Song Yankai Zhang SVM Segment Video Machine Jiaming Song Yankai Zhang Introduction Background When watching a video online, users might need: Detailed video description information Removal of repeating openings and endings

More information

OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE

OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE Wenju He, Marc Jäger, and Olaf Hellwich Berlin University of Technology FR3-1, Franklinstr. 28, 10587 Berlin, Germany {wenjuhe, jaeger,

More information

Ryerson University CP8208. Soft Computing and Machine Intelligence. Naive Road-Detection using CNNS. Authors: Sarah Asiri - Domenic Curro

Ryerson University CP8208. Soft Computing and Machine Intelligence. Naive Road-Detection using CNNS. Authors: Sarah Asiri - Domenic Curro Ryerson University CP8208 Soft Computing and Machine Intelligence Naive Road-Detection using CNNS Authors: Sarah Asiri - Domenic Curro April 24 2016 Contents 1 Abstract 2 2 Introduction 2 3 Motivation

More information

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract

More information

Classifying Depositional Environments in Satellite Images

Classifying Depositional Environments in Satellite Images Classifying Depositional Environments in Satellite Images Alex Miltenberger and Rayan Kanfar Department of Geophysics School of Earth, Energy, and Environmental Sciences Stanford University 1 Introduction

More information

Deep Learning for Robust Normal Estimation in Unstructured Point Clouds. Alexandre Boulch. Renaud Marlet

Deep Learning for Robust Normal Estimation in Unstructured Point Clouds. Alexandre Boulch. Renaud Marlet Deep Learning for Robust Normal Estimation in Unstructured Point Clouds Alexandre Boulch Renaud Marlet Normal estimation in point clouds Normal: 3D normalized vector At each point: local orientation of

More information

Building the Software 2.0 Stack. Andrej Karpathy May 10, 2018

Building the Software 2.0 Stack. Andrej Karpathy May 10, 2018 Building the Software 2.0 Stack Andrej Karpathy May 10, 2018 1M years ago Engineering: approach by decomposition AWS stack 1. Identify a problem 2. Break down a big problem to smaller problems 3.

More information

ENVI ANALYTICS ANSWERS YOU CAN TRUST

ENVI ANALYTICS ANSWERS YOU CAN TRUST ENVI ANALYTICS ANSWERS YOU CAN TRUST HarrisGeospatial.com Since its launch in 1991, ENVI has enabled users to leverage remotely sensed data to better understand our complex world. Over the years, Harris

More information

Two-Stream Convolutional Networks for Action Recognition in Videos

Two-Stream Convolutional Networks for Action Recognition in Videos Two-Stream Convolutional Networks for Action Recognition in Videos Karen Simonyan Andrew Zisserman Cemil Zalluhoğlu Introduction Aim Extend deep Convolution Networks to action recognition in video. Motivation

More information

Deep learning for dense per-pixel prediction. Chunhua Shen The University of Adelaide, Australia

Deep learning for dense per-pixel prediction. Chunhua Shen The University of Adelaide, Australia Deep learning for dense per-pixel prediction Chunhua Shen The University of Adelaide, Australia Image understanding Classification error Convolution Neural Networks 0.3 0.2 0.1 Image Classification [Krizhevsky

More information

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract

More information

Machine Learning and Sensor Fusion for Precision Farming. Solmaz Hajmohammadi, Christopher Schardt, Noah Fahlgren, Arash Abbasi, Stefan Paulus

Machine Learning and Sensor Fusion for Precision Farming. Solmaz Hajmohammadi, Christopher Schardt, Noah Fahlgren, Arash Abbasi, Stefan Paulus Machine Learning and Sensor Fusion for Precision Farming Solmaz Hajmohammadi, Christopher Schardt, Noah Fahlgren, Arash Abbasi, Stefan Paulus Food Insecurity Increase in population 2.8 Billion more people

More information

THE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM

THE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM THE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM Kuo-Hsin Tu ( 塗國星 ), Chiou-Shann Fuh ( 傅楸善 ) Dept. of Computer Science and Information Engineering, National Taiwan University, Taiwan E-mail: p04922004@csie.ntu.edu.tw,

More information

MASTER THESIS. A Local - Global Approach to Semantic Segmentation in Aerial Images

MASTER THESIS. A Local - Global Approach to Semantic Segmentation in Aerial Images University Politehnica of Bucharest Automatic Control and Computers Faculty, Computer Science and Engineering Department arxiv:1607.05620v1 [cs.cv] 19 Jul 2016 MASTER THESIS A Local - Global Approach to

More information

Towards Large-Scale Semantic Representations for Actionable Exploitation. Prof. Trevor Darrell UC Berkeley

Towards Large-Scale Semantic Representations for Actionable Exploitation. Prof. Trevor Darrell UC Berkeley Towards Large-Scale Semantic Representations for Actionable Exploitation Prof. Trevor Darrell UC Berkeley traditional surveillance sensor emerging crowd sensor Desired capabilities: spatio-temporal reconstruction

More information

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University LSTM and its variants for visual recognition Xiaodan Liang xdliang328@gmail.com Sun Yat-sen University Outline Context Modelling with CNN LSTM and its Variants LSTM Architecture Variants Application in

More information

Extended Dictionary Learning : Convolutional and Multiple Feature Spaces

Extended Dictionary Learning : Convolutional and Multiple Feature Spaces Extended Dictionary Learning : Convolutional and Multiple Feature Spaces Konstantina Fotiadou, Greg Tsagkatakis & Panagiotis Tsakalides kfot@ics.forth.gr, greg@ics.forth.gr, tsakalid@ics.forth.gr ICS-

More information

(Deep) Learning for Robot Perception and Navigation. Wolfram Burgard

(Deep) Learning for Robot Perception and Navigation. Wolfram Burgard (Deep) Learning for Robot Perception and Navigation Wolfram Burgard Deep Learning for Robot Perception (and Navigation) Lifeng Bo, Claas Bollen, Thomas Brox, Andreas Eitel, Dieter Fox, Gabriel L. Oliveira,

More information

YOLO9000: Better, Faster, Stronger

YOLO9000: Better, Faster, Stronger YOLO9000: Better, Faster, Stronger Date: January 24, 2018 Prepared by Haris Khan (University of Toronto) Haris Khan CSC2548: Machine Learning in Computer Vision 1 Overview 1. Motivation for one-shot object

More information

On the usability of deep networks for object-based image analysis

On the usability of deep networks for object-based image analysis On the usability of deep networks for object-based image analysis Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre To cite this version: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre. On the

More information

Artificial Neural Networks. Introduction to Computational Neuroscience Ardi Tampuu

Artificial Neural Networks. Introduction to Computational Neuroscience Ardi Tampuu Artificial Neural Networks Introduction to Computational Neuroscience Ardi Tampuu 7.0.206 Artificial neural network NB! Inspired by biology, not based on biology! Applications Automatic speech recognition

More information

EO Monitoring Solutions

EO Monitoring Solutions EO Monitoring Solutions Florin SERBAN, Managing Director TERRASIGNA GEO-CRADLE Regional Workshop in Romania May 9 th 2017, Bucharest Copyright TERRASIGNA 2017 Content Monitoring services based on satellite

More information

Measuring the World: Designing Robust Vehicle Localization for Autonomous Driving. Frank Schuster, Dr. Martin Haueis

Measuring the World: Designing Robust Vehicle Localization for Autonomous Driving. Frank Schuster, Dr. Martin Haueis Measuring the World: Designing Robust Vehicle Localization for Autonomous Driving Frank Schuster, Dr. Martin Haueis Agenda Motivation: Why measure the world for autonomous driving? Map Content: What do

More information

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager Characterization and Benchmarking of Deep Learning Natalia Vassilieva, PhD Sr. Research Manager Deep learning applications Vision Speech Text Other Search & information extraction Security/Video surveillance

More information

DEEP NEURAL NETWORKS CHANGING THE AUTONOMOUS VEHICLE LANDSCAPE. Dennis Lui August 2017

DEEP NEURAL NETWORKS CHANGING THE AUTONOMOUS VEHICLE LANDSCAPE. Dennis Lui August 2017 DEEP NEURAL NETWORKS CHANGING THE AUTONOMOUS VEHICLE LANDSCAPE Dennis Lui August 2017 THE RISE OF GPU COMPUTING APPLICATIONS 10 7 10 6 GPU-Computing perf 1.5X per year 1000X by 2025 ALGORITHMS 10 5 1.1X

More information

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet Classification with Deep Convolutional Neural Networks ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky Ilya Sutskever Geoffrey Hinton University of Toronto Canada Paper with same name to appear in NIPS 2012 Main idea Architecture

More information

arxiv: v1 [cs.cv] 20 Dec 2016

arxiv: v1 [cs.cv] 20 Dec 2016 End-to-End Pedestrian Collision Warning System based on a Convolutional Neural Network with Semantic Segmentation arxiv:1612.06558v1 [cs.cv] 20 Dec 2016 Heechul Jung heechul@dgist.ac.kr Min-Kook Choi mkchoi@dgist.ac.kr

More information

Deep Learning Workshop. Nov. 20, 2015 Andrew Fishberg, Rowan Zellers

Deep Learning Workshop. Nov. 20, 2015 Andrew Fishberg, Rowan Zellers Deep Learning Workshop Nov. 20, 2015 Andrew Fishberg, Rowan Zellers Why deep learning? The ImageNet Challenge Goal: image classification with 1000 categories Top 5 error rate of 15%. Krizhevsky, Alex,

More information

Object Localization, Segmentation, Classification, and Pose Estimation in 3D Images using Deep Learning

Object Localization, Segmentation, Classification, and Pose Estimation in 3D Images using Deep Learning Allan Zelener Dissertation Proposal December 12 th 2016 Object Localization, Segmentation, Classification, and Pose Estimation in 3D Images using Deep Learning Overview 1. Introduction to 3D Object Identification

More information

TorontoCity: Seeing the World with a Million Eyes

TorontoCity: Seeing the World with a Million Eyes TorontoCity: Seeing the World with a Million Eyes Authors Shenlong Wang, Min Bai, Gellert Mattyus, Hang Chu, Wenjie Luo, Bin Yang Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun * Project Completed

More information

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601 Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601 Introduction Face ID is complicated by alterations to an individual s appearance Beard,

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

Machine Learning 13. week

Machine Learning 13. week Machine Learning 13. week Deep Learning Convolutional Neural Network Recurrent Neural Network 1 Why Deep Learning is so Popular? 1. Increase in the amount of data Thanks to the Internet, huge amount of

More information

! References: ! Computer eyesight gets a lot more accurate, NY Times. ! Stanford CS 231n. ! Christopher Olah s blog. ! Take ECS 174!

! References: ! Computer eyesight gets a lot more accurate, NY Times. ! Stanford CS 231n. ! Christopher Olah s blog. ! Take ECS 174! Exams ECS 189 WEB PROGRAMMING! If you are satisfied with your scores on the two midterms, you can skip the final! As soon as your Photobooth and midterm are graded, I can give you your course grade (so

More information

ON THE USABILITY OF DEEP NETWORKS FOR OBJECT-BASED IMAGE ANALYSIS

ON THE USABILITY OF DEEP NETWORKS FOR OBJECT-BASED IMAGE ANALYSIS ON THE USABILITY OF DEEP NETWORKS FOR OBJECT-BASED IMAGE ANALYSIS Nicolas Audebert a, b, Bertrand Le Saux a, Sébastien Lefèvre b a ONERA, The French Aerospace Lab, F-976 Palaiseau, France (nicolas.audebert,bertrand.le_saux)@onera.fr

More information

RGBD Occlusion Detection via Deep Convolutional Neural Networks

RGBD Occlusion Detection via Deep Convolutional Neural Networks 1 RGBD Occlusion Detection via Deep Convolutional Neural Networks Soumik Sarkar 1,2, Vivek Venugopalan 1, Kishore Reddy 1, Michael Giering 1, Julian Ryde 3, Navdeep Jaitly 4,5 1 United Technologies Research

More information

Know your data - many types of networks

Know your data - many types of networks Architectures Know your data - many types of networks Fixed length representation Variable length representation Online video sequences, or samples of different sizes Images Specific architectures for

More information

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem Many slides from Derek Hoiem Attributes Computer Vision James Hays Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed. Human-in-the-loop recognition:

More information

A Deep Learning Framework for Authorship Classification of Paintings

A Deep Learning Framework for Authorship Classification of Paintings A Deep Learning Framework for Authorship Classification of Paintings Kai-Lung Hua ( 花凱龍 ) Dept. of Computer Science and Information Engineering National Taiwan University of Science and Technology Taipei,

More information

Object Detection. CS698N Final Project Presentation AKSHAT AGARWAL SIDDHARTH TANWAR

Object Detection. CS698N Final Project Presentation AKSHAT AGARWAL SIDDHARTH TANWAR Object Detection CS698N Final Project Presentation AKSHAT AGARWAL SIDDHARTH TANWAR Problem Description Arguably the most important part of perception Long term goals for object recognition: Generalization

More information

Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks

Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre To cite this version: Nicolas Audebert, Bertrand Le Saux, Sébastien

More information

IMAGINE Objective. The Future of Feature Extraction, Update & Change Mapping

IMAGINE Objective. The Future of Feature Extraction, Update & Change Mapping IMAGINE ive The Future of Feature Extraction, Update & Change Mapping IMAGINE ive provides object based multi-scale image classification and feature extraction capabilities to reliably build and maintain

More information

A THREE LAYERED MODEL TO PERFORM CHARACTER RECOGNITION FOR NOISY IMAGES

A THREE LAYERED MODEL TO PERFORM CHARACTER RECOGNITION FOR NOISY IMAGES INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONSAND ROBOTICS ISSN 2320-7345 A THREE LAYERED MODEL TO PERFORM CHARACTER RECOGNITION FOR NOISY IMAGES 1 Neha, 2 Anil Saroliya, 3 Varun Sharma 1,

More information

Deep Learning with Tensorflow AlexNet

Deep Learning with Tensorflow   AlexNet Machine Learning and Computer Vision Group Deep Learning with Tensorflow http://cvml.ist.ac.at/courses/dlwt_w17/ AlexNet Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton, "Imagenet classification

More information

Cloud-based Large Scale Video Analysis

Cloud-based Large Scale Video Analysis Cloud-based Large Scale Video Analysis Marcos Nieto Principal Researcher Vicomtech-IK4 Joachim Kreikemeier Manager V-Drive Valeo Schalter und Sensoren GmbH INDEX 1. Cloud-LSVA project 2. ADAS validation

More information

Thematic Mapping with Remote Sensing Satellite Networks

Thematic Mapping with Remote Sensing Satellite Networks Thematic Mapping with Remote Sensing Satellite Networks College of Engineering and Computer Science The Australian National University outline satellite networks implications for analytical methods candidate

More information

Fuzzy Set Theory in Computer Vision: Example 3, Part II

Fuzzy Set Theory in Computer Vision: Example 3, Part II Fuzzy Set Theory in Computer Vision: Example 3, Part II Derek T. Anderson and James M. Keller FUZZ-IEEE, July 2017 Overview Resource; CS231n: Convolutional Neural Networks for Visual Recognition https://github.com/tuanavu/stanford-

More information

A MULTI-RESOLUTION FUSION MODEL INCORPORATING COLOR AND ELEVATION FOR SEMANTIC SEGMENTATION

A MULTI-RESOLUTION FUSION MODEL INCORPORATING COLOR AND ELEVATION FOR SEMANTIC SEGMENTATION A MULTI-RESOLUTION FUSION MODEL INCORPORATING COLOR AND ELEVATION FOR SEMANTIC SEGMENTATION Wenkai Zhang a, b, Hai Huang c, *, Matthias Schmitz c, Xian Sun a, Hongqi Wang a, Helmut Mayer c a Key Laboratory

More information

DEEP LEARNING TO DIVERSIFY BELIEF NETWORKS FOR REMOTE SENSING IMAGE CLASSIFICATION

DEEP LEARNING TO DIVERSIFY BELIEF NETWORKS FOR REMOTE SENSING IMAGE CLASSIFICATION DEEP LEARNING TO DIVERSIFY BELIEF NETWORKS FOR REMOTE SENSING IMAGE CLASSIFICATION S.Dhanalakshmi #1 #PG Scholar, Department of Computer Science, Dr.Sivanthi Aditanar college of Engineering, Tiruchendur

More information

Semantic RGB-D Perception for Cognitive Robots

Semantic RGB-D Perception for Cognitive Robots Semantic RGB-D Perception for Cognitive Robots Sven Behnke Computer Science Institute VI Autonomous Intelligent Systems Our Domestic Service Robots Dynamaid Cosero Size: 100-180 cm, weight: 30-35 kg 36

More information

Apparel Classifier and Recommender using Deep Learning

Apparel Classifier and Recommender using Deep Learning Apparel Classifier and Recommender using Deep Learning Live Demo at: http://saurabhg.me/projects/tag-that-apparel Saurabh Gupta sag043@ucsd.edu Siddhartha Agarwal siagarwa@ucsd.edu Apoorve Dave a1dave@ucsd.edu

More information

CSE 559A: Computer Vision

CSE 559A: Computer Vision CSE 559A: Computer Vision Fall 2018: T-R: 11:30-1pm @ Lopata 101 Instructor: Ayan Chakrabarti (ayan@wustl.edu). Course Staff: Zhihao Xia, Charlie Wu, Han Liu http://www.cse.wustl.edu/~ayan/courses/cse559a/

More information