Urban 3D Challenge & Future Directions

Similar documents
Open Source Geospatial Tools to Enable Large Scale 3D Scene Modeling

Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey

TorontoCity: Seeing the World with a Million Eyes

Deep learning for dense per-pixel prediction. Chunhua Shen The University of Adelaide, Australia

Lecture 7: Semantic Segmentation

End-to-End Learning of Polygons for Remote Sensing Image Classification

Spatial Localization and Detection. Lecture 8-1

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Object Detection on Self-Driving Cars in China. Lingyun Li

Hide-and-Seek: Forcing a network to be Meticulous for Weakly-supervised Object and Action Localization

Flow-Based Video Recognition

What s New in Imagery in ArcGIS. Presented by: Christopher Patterson Date: October 18, 2017

Scalability for Large Photogrammetry Projects

Object Geolocation from Crowdsourced Street Level Imagery

International D&I Programs SUMMER OLYMPICS SECURITY PACKAGE

R-FCN: Object Detection with Really - Friggin Convolutional Networks

INTEGRATION OF TREE DATABASE DERIVED FROM SATELLITE IMAGERY AND LIDAR POINT CLOUD DATA

Depth from Stereo. Dominic Cheng February 7, 2018

Learning to generate 3D shapes

IDENTIFICATION OF NUCLEAR ACTIVITIES USING SATELLITE IMAGING

Multi-modal, multi-temporal data analysis for urban remote sensing

Object Detection. CS698N Final Project Presentation AKSHAT AGARWAL SIDDHARTH TANWAR

Collaborative Mapping with Streetlevel Images in the Wild. Yubin Kuang Co-founder and Computer Vision Lead

VALIDATION OF A NEW 30 METER GROUND SAMPLED GLOBAL DEM USING ICESAT LIDARA ELEVATION REFERENCE DATA

Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta

BUILDING EXTRACTION FROM REMOTE SENSING DATA USING FULLY CONVOLUTIONAL NETWORKS

TRAINING COURSE CATALOG

Instance-aware Semantic Segmentation via Multi-task Network Cascades

CAP 6412 Advanced Computer Vision

Large-Scale Point Cloud Classification Benchmark

Automated Extraction of Buildings from Aerial LiDAR Point Cloud and Digital Imaging Datasets for 3D Cadastre - Preliminary Results

BUILDING DETECTION IN VERY HIGH RESOLUTION SATELLITE IMAGE USING IHS MODEL

Unwrapping of Urban Surface Models

Symmetry Detection Competition

Computer Vision Lecture 16

Computer Vision Lecture 16

Deep Incremental Scene Understanding. Federico Tombari & Christian Rupprecht Technical University of Munich, Germany

Multi-channel Deep Transfer Learning for Nuclei Segmentation in Glioblastoma Cell Tissue Images

BUILDING DETECTION AND STRUCTURE LINE EXTRACTION FROM AIRBORNE LIDAR DATA

Cell Segmentation Proposal Network For Microscopy Image Analysis

Video Object Segmentation using Deep Learning

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

A DATA DRIVEN METHOD FOR FLAT ROOF BUILDING RECONSTRUCTION FROM LiDAR POINT CLOUDS

Lecture 5: Object Detection

Computer Vision: Making machines see

Photogrammetry and 3D city modelling

2-4 April 2019 Taets Art and Event Park, Amsterdam CLICK TO KNOW MORE

Mask R-CNN. Kaiming He, Georgia, Gkioxari, Piotr Dollar, Ross Girshick Presenters: Xiaokang Wang, Mengyao Shi Feb. 13, 2018

City-Modeling. Detecting and Reconstructing Buildings from Aerial Images and LIDAR Data

Cluster-based 3D Reconstruction of Aerial Video

Visual features detection based on deep neural network in autonomous driving tasks

ECE 5470 Classification, Machine Learning, and Neural Network Review

Cascade Region Regression for Robust Object Detection

What s New in Imagery in ArcGIS. Presented by: Christopher Patterson Date: September 12, 2017

CO-REGISTERING AND NORMALIZING STEREO-BASED ELEVATION DATA TO SUPPORT BUILDING DETECTION IN VHR IMAGES

AIRBORNE GEIGER MODE LIDAR - LATEST ADVANCEMENTS IN REMOTE SENSING APPLICATIONS RANDY RHOADS

AUTOMATIC BUILDING DETECTION FROM LIDAR POINT CLOUD DATA

LEARNING TO SEGMENT MOVING OBJECTS IN VIDEOS FRAGKIADAKI ET AL. 2015

Deep Learning for Object detection & localization

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs

Small-footprint full-waveform airborne LiDAR for habitat assessment in the ChangeHabitats2 project

COMBINING HIGH SPATIAL RESOLUTION OPTICAL AND LIDAR DATA FOR OBJECT-BASED IMAGE CLASSIFICATION

Online Open World Face Recognition From Video Streams

arxiv: v1 [cs.cv] 20 Dec 2016

arxiv: v2 [cs.cv] 25 Sep 2018

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

ADVANCED TERRAIN PROCESSING: ANALYTICAL RESULTS OF FILLING VOIDS IN REMOTELY SENSED DATA TERRAIN INPAINTING

arxiv: v1 [cs.cv] 17 Apr 2019

3D Convolutional Neural Networks for Landing Zone Detection from LiDAR

EVALUATION OF WORLDVIEW-1 STEREO SCENES AND RELATED 3D PRODUCTS

Gradient of the lower bound

(Deep) Learning for Robot Perception and Navigation. Wolfram Burgard

Efficient Segmentation-Aided Text Detection For Intelligent Robots

What s New in ecognition 9.0. Christian Weise

arxiv: v1 [cs.cv] 4 Apr 2019

Robust Detection for Red Blood Cells in Thin Blood Smear Microscopy Using Deep Learning

Deep Learning for Remote Sensing

Development of Geospatial Smart Cities and Management

Classification of urban feature from unmanned aerial vehicle images using GASVM integration and multi-scale segmentation

LEARNING TO INFER GRAPHICS PROGRAMS FROM HAND DRAWN IMAGES

A Strategy and Vision for Enterprise 3D GIS. B r o o k s P a t r i c k A c c o u n t E x e c u t i v e b p a t r i c e s r i.

Face Detection and Tracking Control with Omni Car

AdaDepth: Unsupervised Content Congruent Adaptation for Depth Estimation

PARTIAL STYLE TRANSFER USING WEAKLY SUPERVISED SEMANTIC SEGMENTATION. Shin Matsuo Wataru Shimoda Keiji Yanai

Additional Remarks on Designing Category-Level Attributes for Discriminative Visual Recognition

ABSTRACT 1. INTRODUCTION

Digital photogrammetry project with very high-resolution stereo pairs acquired by DigitalGlobe, Inc. satellite Worldview-2

Streaming Hierarchical Video Segmentation. Chenliang Xu, Caiming Xiong and Jason J. Corso SUNY at Buffalo

3D Object Recognition and Scene Understanding from RGB-D Videos. Yu Xiang Postdoctoral Researcher University of Washington

INFORMATION EXTRACTION USING OPTICAL AND RADAR REMOTE SENSING DATA FUSION ABSTRACT INTRODUCTION

TrueOrtho with 3D Feature Extraction

Synscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet.

Final Report: Smart Trash Net: Waste Localization and Classification

Supplementary Material: Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos

ERDAS IMAGINE THE WORLD S MOST WIDELY-USED REMOTE SENSING SOFTWARE PACKAGE

DEFECT INSPECTION FROM SCRATCH TO PRODUCTION. Andrew Liu, Ryan Shen Deep Learning Solution Architect

Joseph Camilo 1, Rui Wang 1, Leslie M. Collins 1, Senior Member, IEEE, Kyle Bradbury 2, Member, IEEE, and Jordan M. Malof 1,Member, IEEE

Automated Procedures for Creating and Updating Digital City Models Using Very High Resolution (VHR) Satellite Imagery

Outline of Presentation. Introduction to Overwatch Geospatial Software Feature Analyst and LIDAR Analyst Software

Vexcel Imaging Ve V x e c x el e I l maging I G maging mbh www. wwwv. e v x e c x e c l e -limaging. - c imaging. om c

Transcription:

DISTRIBUTION STATEMENT A APPROVED FOR PUBLIC RELEASE; DISTRIBUTION IS UNLIMITED This work was supported by the United States Special Operations Command (USSOCOM). The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of USSOCOM or the U.S. Government. This work was supported by the Intelligence Advanced Research Projects Activity (IARPA) via contract no. 2017-17032700004. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA or the U.S. Government. Urban 3D Challenge & Future Directions 26 April 2018 Myron Brown Hirsh Goldberg Kevin Foster Andrea Leichtman Gordon Christie Sean Wang Shea Hagstrom Marc Bosch Scott Almes

USSOCOM Urban 3D Challenge (2017) Building footprint classification using satellite images and 3D data RGB Ortho Image Digital Surface Model Example Solution Ground Truth Source Data: Vricon 50cm Ortho, DSM, and DTM Products Acquired for IARPA CORE3D Solutions Assessed Using Building Outlines from Public HSIP 133 Cities Goldberg et al., Urban 3D Challenge: Building Footprint Detection Using Orthorectified Imagery and Digital Surface Models from Commercial Satellites, in Proc. SPIE Geospatial Informatics and Motion Imagery Analytics VIII, 2018 JHU Applied Physics Laboratory 23 April 2018 2

Public Challenge Data Set Jacksonville, FL Tampa, FL Richmond, VA Training and Provisional Testing Sequestered Testing City Area (km²) / Number of Tiles Number of Ground Truthed Buildings Jacksonville, Florida 117 52,675 Tampa, Florida 119 55,158 Richmond, Virginia 125 49,220 Total 361 157,053 JHU Applied Physics Laboratory 23 April 2018 3

Example 1km Tiles in Challenge Data Set Urban / Downtown Residential Commercial / Residential Mix Jacksonville, Florida, USA Tampa, Florida, USA Richmond, Virginia, USA JHU Applied Physics Laboratory 23 April 2018 4

Semi-Automated Ground Truth Production HSIP 133 Cities Building Outlines Instance Level Ground Truth Labels AnnoteGeo Software for Manual Annotation HSIP Data is available on AWS: s3://grid-uscities-data AnnoteGeo is available at: https://github.com/pubgeo/annotegeo JHU Applied Physics Laboratory 23 April 2018 5

Evaluation Metrics Each unique performer building region is assigned to at most one ground truth building region Match is declared if intersection-over-union (IoU) is > pre-defined threshold (0.45) True Positive (TP) Performer regions not matched to a GT region False Positive (FP) Unmatched ground truth building region False Negative (FN) Instance-Level Truth Sample Output Precision P = TP TP + FP Recall(R) = TP TP + FN F 1 = 2 P R P + R True Positive (TP) False Positive (FP) False Negative (FN) Final score is an average of F 1 scores across all tiles being evaluated JHU Applied Physics Laboratory 23 April 2018 6

Urban 3D Challenge on TopCoder Pre-registration: 25 Sep, 2017 Challenge Duration: 9 Oct 4 Dec, 2017 Final Results: 5 Jan, 2018 Participation - 217 registered - 54 actively participated Top 6 offered awards for open source JHU Applied Physics Laboratory 23 April 2018 7

Provisional and Sequestered Leaderboards JHU Applied Physics Laboratory 23 April 2018 8

Urban 3D Challenge Winning Approach Ensemble of Convolutional Neural Networks (CNNs) RGB Original U-Net is Shown for Illustration DSM ndsm DTM ndsm = DSM - DTM Semantic Segmentation Ensemble of CNNs with ResNet-34 Encoder 1 & U-Net Decoder 2 Watershed Algorithm 3 Instance Segmentation 1 He et al., Deep Residual Learning for Image Recognition, CVPR 2016 2 Ronneberger et al., U-Net: Convolutional Networks for Biomedical Image Segmentation, MICCAI, 2015 3 Credit: http://iacl.ece.jhu.edu/~prince/ws/ JHU Applied Physics Laboratory 23 April 2018 9

Is Urban 3D Challenge a Strong Baseline? Sequestered Test F1 Scores* Richmond Townhouse Retraining Before Retraining After Retraining San Fernando, Argentina Very Different & Not in Training Data * Top provisional score was 0.89 Sequestered testing suggests this solution generalizes well, so we re using it as a baseline for new research and industry solutions JHU Applied Physics Laboratory 23 April 2018 10

Can We Apply to Other Source Data? VRICON RGB True Ortho WorldView-2 Single Image Ortho Jacksonville, FL Tampa, FL Richmond, VA JHU Applied Physics Laboratory 23 April 2018 11

Sequestered Test Results Alternate Source Data for Training and Testing Retrained with Alternate Source Data Challenge Results Tested with Alternate Source Data (A) Vricon RGB and 3D (B) Vricon RGB and Lidar 3D (C) Single Image Ortho RGB and Lidar 3D (D) Retrained and Tested with (C) (E) Trained with (D) and Tested with Unedited Labels JHU Applied Physics Laboratory 23 April 2018 12

More Public Data to Enable Research Learn more at http://www.jhuapl.edu/pubgeo.html WorldView-2 Images to Complement the Urban 3D Challenge WV-3 Images for MVS M. Brown, H. Goldberg, K. Foster, A. Leichtman, S. Wang, S. Hagstrom, M. Bosch, and S. Almes, Large-Scale Public Lidar and Satellite Image Data Set for Urban Semantic Labeling, in Proc. SPIE Laser Radar Technology and Applications XXII, 2018 JHU Applied Physics Laboratory 23 April 2018 13

Public Data Sets and Baseline Algorithms Enabling Semantic 3D Reconstruction Multi-View Stereo 3D Reconstruction Semantic & Instance Segmentation Semantic 3D Reconstruction Notional Illustration Credit: Biljecki et al. IARPA MVS3DM Challenge (2016) USSOCOM Urban 3D Challenge (2017) IARPA CORE3D Public Data (2018) JHU Applied Physics Laboratory 23 April 2018 14