Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models

Similar documents
Painting-to-3D Model Alignment Via Discriminative Visual Elements. Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)

Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models

Object Detection by 3D Aspectlets and Occlusion Reasoning

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

What are we trying to achieve? Why are we doing this? What do we learn from past history? What will we talk about today?

FPM: Fine Pose Parts-Based Model with 3D CAD Models

arxiv: v2 [cs.cv] 18 Apr 2016

Indexing Ensembles of Exemplar-SVMs with Rejecting Taxonomies for Fast Evaluation

Part-Based Models for Object Class Recognition Part 3

arxiv: v1 [cs.cv] 13 Sep 2016

Representing 3D models with discriminative visual elements

Deformable Part Models

Course Administration

Supervised learning. y = f(x) function

arxiv: v1 [cs.cv] 10 Jul 2014

Detection III: Analyzing and Debugging Detection Methods

Painting-to-3D Model Alignment Via Discriminative Visual Elements 1

arxiv: v2 [cs.cv] 25 Aug 2014

Part Discovery from Partial Correspondence

Where was this picture painted? - Localizing paintings by alignment to 3D models

Actions and Attributes from Wholes and Parts

Painting-to-3D Model Alignment Via Discriminative Visual Elements

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014

Window based detectors

Blocks that Shout: Distinctive Parts for Scene Classification

3dRR: Representation and Recognition

Building Part-based Object Detectors via 3D Geometry

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Lecture 15 Visual recognition

EECS 442 Computer vision. 3D Object Recognition and. Scene Understanding

Modeling 3D viewpoint for part-based object recognition of rigid objects

3D Object Representations for Recognition. Yu Xiang Computational Vision and Geometry Lab Stanford University

Three-Dimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS

Spatial Localization and Detection. Lecture 8-1

Object Detection by 3D Aspectlets and Occlusion Reasoning

6.819 / 6.869: Advances in Computer Vision

Supervised learning. y = f(x) function

Category-level localization

Immediate, scalable object category detection

Learning Collections of Part Models for Object Recognition

High Level Computer Vision

Deep learning for object detection. Slides from Svetlana Lazebnik and many others

A model for full local image interpretation

Part based models for recognition. Kristen Grauman

Detecting Objects using Deformation Dictionaries

arxiv: v1 [cs.cv] 24 Feb 2014

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018

Comparison of Object Detection Algorithms on Maritime Vessels Mark Chua 1, David W. Aha 2, Bryan Auslander 3, Kalyan Gupta 3, and Brendan Morris 1

Multi-Component Models for Object Detection

Object Category Detection: Sliding Windows

Predicting Functional Regions of Objects

Training Deformable Object Models for Human Detection based on Alignment and Clustering

Patch Descriptors. CSE 455 Linda Shapiro

Development in Object Detection. Junyuan Lin May 4th

Object Detection with YOLO on Artwork Dataset

3D Object Class Detection in the Wild

Exemplar-specific Patch Features for Fine-grained Recognition

Class-Specific Object Pose Estimation and Reconstruction using 3D Part Geometry

Mid-level Deep Pattern Mining

Yiqi Yan. May 10, 2017

Patch Descriptors. EE/CSE 576 Linda Shapiro

Structured Prediction using Convolutional Neural Networks

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model

ECCV Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016

Find that! Visual Object Detection Primer

Object Detection Based on Deep Learning

Category-level Localization

Object Category Detection. Slides mostly from Derek Hoiem

Recognizing human actions in still images: a study of bag-of-features and part-based representations

3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding

Shape Anchors for Data-driven Multi-view Reconstruction

Of Gods and Goats: Weakly Supervised Learning of Figurative Art

Using k-poselets for detecting people and localizing their keypoints

CS5670: Intro to Computer Vision

Segmentation. Bottom up Segmentation Semantic Segmentation

Comparison of Object Detection Algorithms on Maritime Vessels Mark Chua 1, David W. Aha 2, Bryan Auslander 3, Kalyan Gupta 3, and Brendan Morris 1

Multiple-Person Tracking by Detection

Learning Realistic Human Actions from Movies

MINE THE FINE: FINE-GRAINED FRAGMENT DISCOVERY

Object detection with CNNs

Lecture 15: Detecting Objects by Parts

3D Pose-by-Detection of Vehicles via Discriminatively Reduced Ensembles of Correlation Filters

Hierarchical Learning for Object Detection. Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010

Representing 3D models for alignment and recognition

Unified, real-time object detection

Weakly supervised object recognition with convolutional neural networks

CNN BASED REGION PROPOSALS FOR EFFICIENT OBJECT DETECTION. Jawadul H. Bappy and Amit K. Roy-Chowdhury

Efficient Category Mining by Leveraging Instance Retrieval

Data-Driven 3D Primitives for Single Image Understanding

Undoing the Damage of Dataset Bias

Action recognition in videos

Modern Object Detection. Most slides from Ali Farhadi

Weakly Supervised Object Recognition with Convolutional Neural Networks

arxiv: v1 [cs.cv] 26 Jul 2018

Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition

From 3D Scene Geometry to Human Workspace

Transcription:

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)

Sit on the chair!

Classification CHAIR Ex: ImageNet Challenge, Pascal VOC classification.

Detection chair Ex: Pascal VOC detection.

Segmentation Ex: Pascal VOC segmentation.

Our goal

From the beginning of computer vision First PhD in computer vision, MIT 1963 Lawrence G. Roberts Machine perception of three-dimensional solids Photograph 3D model

1980s: 2D-3D Alignment [Huttenlocher and Ullman IJCV 1990] Alignment: Huttenlocher & Ullman (198 [Lowe AI 1987] (a) (b) (c) [Faugeras&Hebert 86], [Grimson&Lozano-Perez 86], igure 2.2: Object recognit ion example from t he work of Lowe (1987) [88]. (a) 3D wire-fram

See also [Rothganger et al. CVPR 2003],[Arandjelovic ánd Zisserman ICCV 2011], [Li et al. ECCV 2012], [Snavely et al. SIGGRAPH 2006] Exact Instance alignment [Philbin et al. CVPR 2007] [Sattler et al. ICCV 2011] [Baatz et al. ECCV 2012] [Lim et al. ICCV 2013]

2D category recognition [Lazebnik et al. CVPR 2006] [Krizhevsky et al. NIPS 2012] [Felzenszwalb et al.pami 2010] [Singh et al. ECCV 2012] See also : [Dalal and Triggs CVPR 2005], [Bourdev and Malik ICCV 2009], [Malisiewicz et al. ICCV 2011]...

3D category recognition 3D DPMs: [Herjati&Ramanan 12], [Pepik et al.12], [Zia et al. 13], Simplified part models: [Xiang&Savarese 12], [Del Pero et al. 13] Cuboids: [Xiao et al. 12] [Fidler et al. 12] Blocks world revisited: [Gupta et al. 12] See also: [Glasner et al. 11], [Fouhey et al. 13], [Satkin&Hebert 13], [Choi et al. 13], [Hejrati and Ramanan 14], [Savarese and Fei-Fei 07]

Why chairs? CHAPTER 1. 1. INTRODUCTION

Difficulty: style CHAPTER 1. 1. INTRODUCTION

Approach: data-driven 1394 3D models from internet

Difficulty: viewpoint CHAPTER 1. 1. INTRODUCTION

Approach: use 3D models 62 views

Style Viewpoint

Difficulty: approximate style

Difficulty: approximate style

Difficulty: approximate style

Approach: part-based model

Approach overview 3D collection Match CG->real image Render views Select parts Select the best matches

How to select discriminative parts?

How to select discriminative parts? Best exemplar-lda classifiers [Hariharan et al. 2012] [Gharbi et al 2012] [Malisiewicz et al 2011]

How to match CG to photograph? HOG Classifier Masked classifier HOG mask Implementation: exemplar-lda See also: [Shrivastava et al. 2011]

Approach: CG-to-photograph Implementation: exemplar-lda

How to compare matches? Patches Detectors Matches

How to compare matches? Patches Detectors Affine Calibration Matches with negative data See paper for details

Example I.

Example II.

Example III.

Input image DPM output Our output 3D models

Input image DPM output Our output 3D models

Test set: Results 179 images from Pascal VOC 2012 (subset of validation data) 247 annotated chair bounding boxes Baselines: 1. Exemplar LDA: [Hariharan et al. 2012], [Gharbi et al 2012], [Malisiewicz et al 2011] 2. DPM: [Felzenszwalb et al. 2010]

Subjective human evaluation Orientation quality at 25% recall Good Bad Exemplar-LDA 52% 48% Ours 90% 10%

Subjective human evaluation Style consistency at 25% recall Exact Ok Bad Exemplar-LDA 3% 31% 66% Ours 21% 64% 15%

Our related ToG 2014 paper (to be presented at SIGGRAPH 14) Painting-to-3D Model Alignment Via Discriminative Visual Elements M. Aubry, B. Russell and J. Sivic

Summary Large collection of 3D models Part score calibration Render views from 3D CG-real matching Part based matching Finding parts ata and (soon) code: http://www.di.ens.fr/willow/research/seeing3dchairs

Questions?