Painting-to-3D Model Alignment Via Discriminative Visual Elements. Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)

Size: px
Start display at page:

Download "Painting-to-3D Model Alignment Via Discriminative Visual Elements. Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)"

Transcription

1 Painting-to-3D Model Alignment Via Discriminative Visual Elements Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)

2

3

4

5 This work

6 Browsing across time

7 Inputs Goal Output Painting 3D model Approximate viewpoint

8 1980s: 2D-3D Alignment Photograph 3D model [Roberts 1963] Alignment: Huttenlocher & Ullman (198 [Huttenlocher and Ullman 1990] [Faugeras&Hebert 86], [Grimson&Lozano-Perez 86], [Lowe, 87]

9 Modern 2D-3D alignment [Snavely et al. SIGGRAPH 2006] [Sattler et al. ICCV 2011] [Philbin et al. CVPR 2007] See also [Rothganger et al. CVPR 2003], [Arandjelovic ánd Zisserman ICCV 2011], [Wu et al. 2008], [Li et al. ECCV 2012], [Lim et al. ICCV 2013], [Baatz et al. ECCV 2012]

10 Difficulty: limits of SIFT matching See also: [Chum & Matas CVPR 2006], [Schechtman & Irani, CVPR 2007], [Russell, Sivic, Ponce, Dessalles 2011], [Hauagge & Snavely CVPR 2012]

11 Modern 2D-3D alignment [Snavely et al. SIGGRAPH 2006] [Sattler et al. ICCV 2011] [Philbin et al. CVPR 2007] See also [Rothganger et al. CVPR 2003], [Arandjelovic ánd Zisserman ICCV 2011], [Wu et al. 2008], [Li et al. ECCV 2012], [Lim et al. ICCV 2013], [Baatz et al. ECCV 2012]

12 Retrieval using a global descriptor [Russell, Sivic, Ponce, Dessales, 2011] used GIST [Oliva and Torralba 2001] [Shrivastava, Malisiewicz, Gupta, Efros, 2011]

13 Discriminative part-based representation What makes Paris look like Paris? [Doersch et al. SIGGRAPH 2012] Unsupervised Discovery of Mid- Level Discriminative Patches [Singh et al. ECCV 2012] See also [Bourdev & Malik ICCV 2009], [Juneja et al. CVPR 2013], [Jain et al. CVPR 2013],

14 This work: Discriminative parts in 3D See also [Bourdev & Malik ICCV 2009], [Juneja et al. CVPR 2013], [Jain et al. CVPR 2013],

15 Alignment with a large scale vocabulary of discriminative 3D parts 1. Select parts 2. Match parts 3. Recover view

16 Step 1: Selecting parts

17 Step 1: Discarding parts from unlikely views Synthesize ~10,000 viewpoints for an architectural site See also: [Irschara et al. CVPR 2009], [Baatz et al. ECCV 2012]

18 Step 1: Selecting parts Representative

19 Step 1: Selecting discriminative parts [Dalal and Triggs 2005], [Felzenszwald et al 2010]

20 Step 1: Selecting parts Discriminative

21 Step 1: Selecting stable parts

22 Step 1: Selecting stable parts Require elements to be reliably detectable in near-by views Original view Nearby view Source Incorrect matches Target See also [Doersch et al. SIGGRAPH 2012] [Singh et al. ECCV 2012], [Juneja et al. CVPR 2013]

23 Step 1: Selecting parts Stable

24 Summary: representation of the 3D model a) Representative b) Discriminative c) Stable

25 Step 2: match across style?

26 Step 2: match across style

27 Step 2: match across style

28 Step 3: recover viewpoint

29 Step 3: recover viewpoint This can be seen as a calibration problem. We use an affine calibration (see paper for details) Calibration parameters s calib x = α. s x + β Calibrated score Uncalibrated score

30 Step 3: recover viewpoint

31 Step 3: recover viewpoint

32 Step 3: recover viewpoint Run RANSAC on the top 25 matches

33 Evaluation data Four 3D models from three sites San Marco square (PMVS) San Marco (CAD) 337 depictions Trevi fountain (CAD) Notre Dame (CAD) 85 Historical photographs 147 Paintings 45 Engravings 60 Drawings

34 Results: painting Input Aligned 3D model

35 Results: historical photograph

36 Results: painting

37 Results: drawing

38 Results: drawing

39 Results: engraving

40 Results: Watercolor

41 Recovered camera frusta

42 Failure modes: 3D symmetries

43 Failure modes: unusual viewpoints

44 Benchmark matching results Results on [Huagge and Snavely 2012] dataset MAP ( desceval ) [Hauagge and Snavely 2012] 0.58 Our matching without calibration 0.60 Our matching with calibration 0.78

45 User Study Rate the alignment as: GOOD BAD COARSE

46 Quantitative Results Good matches Exemplar-SVM [Shirivastava et al. 2011] 34 % SIFT on rendered views 40 % Ours 51 %

47 Quantitative Results Depiction style influence: Good match Coarse match No match Photographs 59% 20% 21% Paintings 53% 30% 18% Drawings 52% 29% 19% Engravings 57% 26% 17%

48 CVPR 2014 work Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models M. Aubry, D. Maturana, A. Efros, B. Russell and J. Sivic

49 Conclusion Leverage large vocabulary of discriminative 3D parts Reliably align non-photographic depictions to 3D model More results, data and soon code

50

51 Conclusion Leverage large vocabulary of discriminative 3D parts Reliably align non-photographic depictions to 3D model More results, data and soon code

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)

More information

Painting-to-3D Model Alignment Via Discriminative Visual Elements 1

Painting-to-3D Model Alignment Via Discriminative Visual Elements 1 Painting-to-3D Model Alignment Via Discriminative Visual Elements 1 Mathieu Aubry INRIA 2 / TU München and Bryan C. Russell Intel Labs and Josef Sivic INRIA 2 This paper describes a technique that can

More information

Painting-to-3D Model Alignment Via Discriminative Visual Elements

Painting-to-3D Model Alignment Via Discriminative Visual Elements Painting-to-3D Model Alignment Via Discriminative Visual Elements Mathieu Aubry, Bryan C. Russell, Josef Sivic To cite this version: Mathieu Aubry, Bryan C. Russell, Josef Sivic. Painting-to-3D Model Alignment

More information

Representing 3D models with discriminative visual elements

Representing 3D models with discriminative visual elements Representing 3D models with discriminative visual elements Mathieu Aubry June 23th 2014 Josef Sivic (INRIA-ENS) Bryan Russell (Intel) Painting-to-3D Model Alignment Via Discriminative Visual Elements M.

More information

Where was this picture painted? - Localizing paintings by alignment to 3D models

Where was this picture painted? - Localizing paintings by alignment to 3D models Where was this picture painted? - Localizing paintings by alignment to 3D models Mathieu Aubry, Bryan C. Russell, Josef Sivic To cite this version: Mathieu Aubry, Bryan C. Russell, Josef Sivic. Where was

More information

Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models

Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry, Daniel Maturana, Alexei Efros, Bryan C. Russell, Josef Sivic To cite this version: Mathieu Aubry,

More information

Photo Tourism: Exploring Photo Collections in 3D

Photo Tourism: Exploring Photo Collections in 3D Photo Tourism: Exploring Photo Collections in 3D SIGGRAPH 2006 Noah Snavely Steven M. Seitz University of Washington Richard Szeliski Microsoft Research 2006 2006 Noah Snavely Noah Snavely Reproduced with

More information

Instance-level recognition part 2

Instance-level recognition part 2 Visual Recognition and Machine Learning Summer School Paris 2011 Instance-level recognition part 2 Josef Sivic http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire d Informatique,

More information

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce Object Recognition Computer Vision Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce How many visual object categories are there? Biederman 1987 ANIMALS PLANTS OBJECTS

More information

Instance-level recognition II.

Instance-level recognition II. Reconnaissance d objets et vision artificielle 2010 Instance-level recognition II. Josef Sivic http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire d Informatique, Ecole Normale

More information

Patch Descriptors. CSE 455 Linda Shapiro

Patch Descriptors. CSE 455 Linda Shapiro Patch Descriptors CSE 455 Linda Shapiro How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar

More information

3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry

3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry 3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry Bryan C. Russell et al. SIGGRAPH Asia 2013 Presented by YoungBin Kim 2014. 01. 10 Abstract Produce annotated 3D

More information

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets

More information

Efficient Category Mining by Leveraging Instance Retrieval

Efficient Category Mining by Leveraging Instance Retrieval 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops Efficient Category Mining by Leveraging Instance Retrieval Abhinav Goel Mayank Juneja C. V. Jawahar Center for Visual Information

More information

Predicting Good Features for Image Geo-Localization Using Per-Bundle VLAD

Predicting Good Features for Image Geo-Localization Using Per-Bundle VLAD Predicting Good Features for Image Geo-Localization Using Per-Bundle VLAD Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm Department of Computer Science, University of North Carolina at Chapel Hill {hyojin,dunn,jmf}@cs.unc.edu

More information

Patch Descriptors. EE/CSE 576 Linda Shapiro

Patch Descriptors. EE/CSE 576 Linda Shapiro Patch Descriptors EE/CSE 576 Linda Shapiro 1 How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar

More information

Automatic alignment of paintings and photographs depicting a 3D scene

Automatic alignment of paintings and photographs depicting a 3D scene Automatic alignment of paintings and photographs depicting a 3D scene Bryan Russell, Josef Sivic, Jean Ponce, Helene Dessales To cite this version: Bryan Russell, Josef Sivic, Jean Ponce, Helene Dessales.

More information

Representing 3D models for alignment and recognition

Representing 3D models for alignment and recognition Representing 3D models for alignment and recognition Mathieu Aubry To cite this version: Mathieu Aubry. Representing 3D models for alignment and recognition. Computer Vision and Pattern Recognition [cs.cv].

More information

Are Buildings Only Instances? Exploration in Architectural Style Categories

Are Buildings Only Instances? Exploration in Architectural Style Categories Are Buildings Only Instances? Exploration in Architectural Style Categories Abhinav Goel abhinavgoel@students.iiit.ac.in Mayank Juneja juneja@students.iiit.ac.in Center for Visual Information Technology,

More information

Photo Tourism: Exploring Photo Collections in 3D

Photo Tourism: Exploring Photo Collections in 3D Photo Tourism: Exploring Photo Collections in 3D Noah Snavely Steven M. Seitz University of Washington Richard Szeliski Microsoft Research 15,464 37,383 76,389 2006 Noah Snavely 15,464 37,383 76,389 Reproduced

More information

Compressed local descriptors for fast image and video search in large databases

Compressed local descriptors for fast image and video search in large databases Compressed local descriptors for fast image and video search in large databases Matthijs Douze2 joint work with Hervé Jégou1, Cordelia Schmid2 and Patrick Pérez3 1: INRIA Rennes, TEXMEX team, France 2:

More information

Noah Snavely Steven M. Seitz. Richard Szeliski. University of Washington. Microsoft Research. Modified from authors slides

Noah Snavely Steven M. Seitz. Richard Szeliski. University of Washington. Microsoft Research. Modified from authors slides Photo Tourism: Exploring Photo Collections in 3D Noah Snavely Steven M. Seitz University of Washington Richard Szeliski Microsoft Research 2006 2006 Noah Snavely Noah Snavely Modified from authors slides

More information

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the

More information

Learning to Match Images in Large-Scale Collections

Learning to Match Images in Large-Scale Collections Learning to Match Images in Large-Scale Collections Song Cao and Noah Snavely Cornell University Ithaca, NY, 14853 Abstract. Many computer vision applications require computing structure and feature correspondence

More information

Part Discovery from Partial Correspondence

Part Discovery from Partial Correspondence Part Discovery from Partial Correspondence Subhransu Maji Gregory Shakhnarovich Toyota Technological Institute at Chicago, IL, USA Abstract We study the problem of part discovery when partial correspondence

More information

Query Adaptive Similarity for Large Scale Object Retrieval

Query Adaptive Similarity for Large Scale Object Retrieval Query Adaptive Similarity for Large Scale Object Retrieval Danfeng Qin Christian Wengert Luc van Gool ETH Zürich, Switzerland {qind,wengert,vangool}@vision.ee.ethz.ch Abstract Many recent object retrieval

More information

CS6670: Computer Vision

CS6670: Computer Vision CS6670: Computer Vision Noah Snavely Lecture 16: Bag-of-words models Object Bag of words Announcements Project 3: Eigenfaces due Wednesday, November 11 at 11:59pm solo project Final project presentations:

More information

Three-Dimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients

Three-Dimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients ThreeDimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients Authors: Zhile Ren, Erik B. Sudderth Presented by: Shannon Kao, Max Wang October 19, 2016 Introduction Given an

More information

Segmenting Scenes by Matching Image Composites

Segmenting Scenes by Matching Image Composites Segmenting Scenes by Matching Image Composites Bryan C. Russell 1 Alexei A. Efros 2,1 Josef Sivic 1 William T. Freeman 3 Andrew Zisserman 4,1 1 INRIA 2 Carnegie Mellon University 3 CSAIL MIT 4 University

More information

Course Administration

Course Administration Course Administration Project 2 results are online Project 3 is out today The first quiz is a week from today (don t panic!) Covers all material up to the quiz Emphasizes lecture material NOT project topics

More information

Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs

Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs Xiaowei Li, Changchang Wu, Christopher Zach, Svetlana Lazebnik, Jan-Michael Frahm 1 Motivation Target problem: organizing

More information

CS5670: Computer Vision

CS5670: Computer Vision CS5670: Computer Vision Noah Snavely Multi-view stereo Announcements Project 3 ( Autostitch ) due Monday 4/17 by 11:59pm Recommended Reading Szeliski Chapter 11.6 Multi-View Stereo: A Tutorial Furukawa

More information

Local Image Features

Local Image Features Local Image Features Ali Borji UWM Many slides from James Hayes, Derek Hoiem and Grauman&Leibe 2008 AAAI Tutorial Overview of Keypoint Matching 1. Find a set of distinctive key- points A 1 A 2 A 3 B 3

More information

What are we trying to achieve? Why are we doing this? What do we learn from past history? What will we talk about today?

What are we trying to achieve? Why are we doing this? What do we learn from past history? What will we talk about today? Introduction What are we trying to achieve? Why are we doing this? What do we learn from past history? What will we talk about today? What are we trying to achieve? Example from Scott Satkin 3D interpretation

More information

arxiv: v2 [cs.cv] 18 Apr 2016

arxiv: v2 [cs.cv] 18 Apr 2016 Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views Francisco Massa 1 Bryan C. Russell 2 Mathieu Aubry 1,3 1 École des Ponts ParisTech 2 Adobe Research 3 UC Berkeley arxiv:1512.02497v2

More information

Learning a Fine Vocabulary

Learning a Fine Vocabulary Learning a Fine Vocabulary Andrej Mikulík, Michal Perdoch, Ondřej Chum, and Jiří Matas CMP, Dept. of Cybernetics, Faculty of EE, Czech Technical University in Prague Abstract. We present a novel similarity

More information

Improving Image-Based Localization Through Increasing Correct Feature Correspondences

Improving Image-Based Localization Through Increasing Correct Feature Correspondences Improving Image-Based Localization Through Increasing Correct Feature Correspondences Guoyu Lu 1, Vincent Ly 1, Haoquan Shen 2, Abhishek Kolagunda 1, and Chandra Kambhamettu 1 1 Video/Image Modeling and

More information

Matching and Predicting Street Level Images

Matching and Predicting Street Level Images Matching and Predicting Street Level Images Biliana Kaneva 1, Josef Sivic 2, Antonio Torralba 1, Shai Avidan 3, and William T. Freeman 1 1 Massachusetts Institute of Technology {biliana,torralba,billf}@csail.mit.edu

More information

Detecting Object Instances Without Discriminative Features

Detecting Object Instances Without Discriminative Features Detecting Object Instances Without Discriminative Features Edward Hsiao June 19, 2013 Thesis Committee: Martial Hebert, Chair Alexei Efros Takeo Kanade Andrew Zisserman, University of Oxford 1 Object Instance

More information

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS Alexey Dosovitskiy, Jost Tobias Springenberg and Thomas Brox University of Freiburg Presented by: Shreyansh Daftry Visual Learning and Recognition

More information

Bag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013

Bag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013 CS4670 / 5670: Computer Vision Noah Snavely Bag-of-words models Object Bag of words Bag of Words Models Adapted from slides by Rob Fergus and Svetlana Lazebnik 1 Object Bag of words Origin 1: Texture Recognition

More information

Immediate, scalable object category detection

Immediate, scalable object category detection Immediate, scalable object category detection Yusuf Aytar Andrew Zisserman Visual Geometry Group, Department of Engineering Science, University of Oxford Abstract The objective of this work is object category

More information

Modeling 3D viewpoint for part-based object recognition of rigid objects

Modeling 3D viewpoint for part-based object recognition of rigid objects Modeling 3D viewpoint for part-based object recognition of rigid objects Joshua Schwartz Department of Computer Science Cornell University jdvs@cs.cornell.edu Abstract Part-based object models based on

More information

Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition

Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition Bangpeng Yao 1, Juan Carlos Niebles 2,3, Li Fei-Fei 1 1 Department of Computer Science, Princeton University, NJ 08540, USA

More information

Determining Patch Saliency Using Low-Level Context

Determining Patch Saliency Using Low-Level Context Determining Patch Saliency Using Low-Level Context Devi Parikh 1, C. Lawrence Zitnick 2, and Tsuhan Chen 1 1 Carnegie Mellon University, Pittsburgh, PA, USA 2 Microsoft Research, Redmond, WA, USA Abstract.

More information

Three things everyone should know to improve object retrieval. Relja Arandjelović and Andrew Zisserman (CVPR 2012)

Three things everyone should know to improve object retrieval. Relja Arandjelović and Andrew Zisserman (CVPR 2012) Three things everyone should know to improve object retrieval Relja Arandjelović and Andrew Zisserman (CVPR 2012) University of Oxford 2 nd April 2012 Large scale object retrieval Find all instances of

More information

Large-scale visual recognition The bag-of-words representation

Large-scale visual recognition The bag-of-words representation Large-scale visual recognition The bag-of-words representation Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline Bag-of-words Large or small vocabularies? Extensions for instance-level

More information

6.819 / 6.869: Advances in Computer Vision

6.819 / 6.869: Advances in Computer Vision 6.819 / 6.869: Advances in Computer Vision Image Retrieval: Retrieval: Information, images, objects, large-scale Website: http://6.869.csail.mit.edu/fa15/ Instructor: Yusuf Aytar Lecture TR 9:30AM 11:00AM

More information

Part based models for recognition. Kristen Grauman

Part based models for recognition. Kristen Grauman Part based models for recognition Kristen Grauman UT Austin Limitations of window-based models Not all objects are box-shaped Assuming specific 2d view of object Local components themselves do not necessarily

More information

Recognizing human actions in still images: a study of bag-of-features and part-based representations

Recognizing human actions in still images: a study of bag-of-features and part-based representations DELAITRE, LAPTEV, SIVIC: RECOGNIZING HUMAN ACTIONS IN STILL IMAGES. 1 Recognizing human actions in still images: a study of bag-of-features and part-based representations Vincent Delaitre 1 vincent.delaitre@ens-lyon.org

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

Indexing Ensembles of Exemplar-SVMs with Rejecting Taxonomies for Fast Evaluation

Indexing Ensembles of Exemplar-SVMs with Rejecting Taxonomies for Fast Evaluation Indexing Ensembles of Exemplar-SVMs with Rejecting Taxonomies for Fast Evaluation Federico Becattini Lorenzo Seidenari Alberto Del Bimbo University of Florence federico.becattini@unifi.it University of

More information

Geometric LDA: A Generative Model for Particular Object Discovery

Geometric LDA: A Generative Model for Particular Object Discovery Geometric LDA: A Generative Model for Particular Object Discovery James Philbin 1 Josef Sivic 2 Andrew Zisserman 1 1 Visual Geometry Group, Department of Engineering Science, University of Oxford 2 INRIA,

More information

Blocks that Shout: Distinctive Parts for Scene Classification

Blocks that Shout: Distinctive Parts for Scene Classification Blocks that Shout: Distinctive Parts for Scene Classification Mayank Juneja 1 Andrea Vedaldi 2 C. V. Jawahar 1 Andrew Zisserman 2 1 Center for Visual Information Technology, International Institute of

More information

CLSH: Cluster-based Locality-Sensitive Hashing

CLSH: Cluster-based Locality-Sensitive Hashing CLSH: Cluster-based Locality-Sensitive Hashing Xiangyang Xu Tongwei Ren Gangshan Wu Multimedia Computing Group, State Key Laboratory for Novel Software Technology, Nanjing University xiangyang.xu@smail.nju.edu.cn

More information

Methods for Representing and Recognizing 3D objects

Methods for Representing and Recognizing 3D objects Methods for Representing and Recognizing 3D objects part 1 Silvio Savarese University of Michigan at Ann Arbor Object: Building, 45º pose, 8-10 meters away Object: Person, back; 1-2 meters away Object:

More information

arxiv: v1 [cs.cv] 24 Feb 2014

arxiv: v1 [cs.cv] 24 Feb 2014 EXEMPLAR-BASED LINEAR DISCRIMINANT ANALYSIS FOR ROBUST OBJECT TRACKING Changxin Gao, Feifei Chen, Jin-Gang Yu, Rui Huang, Nong Sang arxiv:1402.5697v1 [cs.cv] 24 Feb 2014 Science and Technology on Multi-spectral

More information

Shape Anchors for Data-driven Multi-view Reconstruction

Shape Anchors for Data-driven Multi-view Reconstruction Shape Anchors for Data-driven Multi-view Reconstruction Andrew Owens MIT CSAIL Jianxiong Xiao Princeton University Antonio Torralba MIT CSAIL William Freeman MIT CSAIL andrewo@mit.edu xj@princeton.edu

More information

Large scale object/scene recognition

Large scale object/scene recognition Large scale object/scene recognition Image dataset: > 1 million images query Image search system ranked image list Each image described by approximately 2000 descriptors 2 10 9 descriptors to index! Database

More information

Beyond Bags of features Spatial information & Shape models

Beyond Bags of features Spatial information & Shape models Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features

More information

Single Image Super-resolution. Slides from Libin Geoffrey Sun and James Hays

Single Image Super-resolution. Slides from Libin Geoffrey Sun and James Hays Single Image Super-resolution Slides from Libin Geoffrey Sun and James Hays Cs129 Computational Photography James Hays, Brown, fall 2012 Types of Super-resolution Multi-image (sub-pixel registration) Single-image

More information

Self-Supervised Learning & Visual Discovery

Self-Supervised Learning & Visual Discovery CS 2770: Computer Vision Self-Supervised Learning & Visual Discovery Prof. Adriana Kovashka University of Pittsburgh April 10, 2017 Motivation So far we ve assumed access to plentiful labeled data How

More information

A data-driven approach for event prediction

A data-driven approach for event prediction A data-driven approach for event prediction Jenny Yuen and Antonio Torralba {jenny, torralba}@csail.mit.edu CSAIL MIT Abstract. When given a single static picture, humans can not only interpret the instantaneous

More information

Local Features and Bag of Words Models

Local Features and Bag of Words Models 10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering

More information

Video Google: A Text Retrieval Approach to Object Matching in Videos

Video Google: A Text Retrieval Approach to Object Matching in Videos Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic, Frederik Schaffalitzky, Andrew Zisserman Visual Geometry Group University of Oxford The vision Enable video, e.g. a feature

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

arxiv: v1 [cs.cv] 28 Sep 2018

arxiv: v1 [cs.cv] 28 Sep 2018 Extrinsic camera calibration method and its performance evaluation Jacek Komorowski 1 and Przemyslaw Rokita 2 arxiv:1809.11073v1 [cs.cv] 28 Sep 2018 1 Maria Curie Sklodowska University Lublin, Poland jacek.komorowski@gmail.com

More information

Large Scale Image Retrieval

Large Scale Image Retrieval Large Scale Image Retrieval Ondřej Chum and Jiří Matas Center for Machine Perception Czech Technical University in Prague Features Affine invariant features Efficient descriptors Corresponding regions

More information

Detection III: Analyzing and Debugging Detection Methods

Detection III: Analyzing and Debugging Detection Methods CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can

More information

Supplementary Material for Ensemble Diffusion for Retrieval

Supplementary Material for Ensemble Diffusion for Retrieval Supplementary Material for Ensemble Diffusion for Retrieval Song Bai 1, Zhichao Zhou 1, Jingdong Wang, Xiang Bai 1, Longin Jan Latecki 3, Qi Tian 4 1 Huazhong University of Science and Technology, Microsoft

More information

Image correspondences and structure from motion

Image correspondences and structure from motion Image correspondences and structure from motion http://graphics.cs.cmu.edu/courses/15-463 15-463, 15-663, 15-862 Computational Photography Fall 2017, Lecture 20 Course announcements Homework 5 posted.

More information

Visual Object Recognition

Visual Object Recognition Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe Computer Vision Laboratory ETH Zurich Chicago, 14.07.2008 & Kristen Grauman Department

More information

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10 Structure from Motion CSE 152 Lecture 10 Announcements Homework 3 is due May 9, 11:59 PM Reading: Chapter 8: Structure from Motion Optional: Multiple View Geometry in Computer Vision, 2nd edition, Hartley

More information

What is usual in unusual videos? Trajectory snippet histograms for discovering unusualness

What is usual in unusual videos? Trajectory snippet histograms for discovering unusualness 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops What is usual in unusual videos? Trajectory snippet histograms for discovering unusualness Ahmet Iscen Bilkent University Ankara,

More information

3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding

3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding 3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding Scott Satkin Google Inc satkin@googlecom Martial Hebert Carnegie Mellon University hebert@ricmuedu Abstract We present a new algorithm

More information

Registering Images to Untextured Geometry using Average Shading Gradients

Registering Images to Untextured Geometry using Average Shading Gradients Registering Images to Untextured Geometry using Average hading Gradients Tobias Plötz tefan Roth Department of Computer cience, TU Darmstadt Abstract Untextured Model Average shading gradients Many existing

More information

Shape recognition with edge-based features

Shape recognition with edge-based features Shape recognition with edge-based features K. Mikolajczyk A. Zisserman C. Schmid Dept. of Engineering Science Dept. of Engineering Science INRIA Rhône-Alpes Oxford, OX1 3PJ Oxford, OX1 3PJ 38330 Montbonnot

More information

Instance-level recognition

Instance-level recognition Instance-level recognition 1) Local invariant features 2) Matching and recognition with local features 3) Efficient visual search 4) Very large scale indexing Matching of descriptors Matching and 3D reconstruction

More information

Bundling Features for Large Scale Partial-Duplicate Web Image Search

Bundling Features for Large Scale Partial-Duplicate Web Image Search Bundling Features for Large Scale Partial-Duplicate Web Image Search Zhong Wu, Qifa Ke, Michael Isard, and Jian Sun Microsoft Research Abstract In state-of-the-art image retrieval systems, an image is

More information

Detecting Dynamic Objects with Multi-View Background Subtraction

Detecting Dynamic Objects with Multi-View Background Subtraction Detecting Dynamic Objects with Multi-View Background Subtraction Raúl Díaz, Sam Hallman, Charless C. Fowlkes Computer Science Department, University of California, Irvine {rdiazgar,shallman,fowlkes}@ics.uci.edu

More information

Data-Driven 3D Primitives for Single Image Understanding

Data-Driven 3D Primitives for Single Image Understanding 2013 IEEE International Conference on Computer Vision Data-Driven 3D Primitives for Single Image Understanding David F. Fouhey, Abhinav Gupta, Martial Hebert The Robotics Institute, Carnegie Mellon University

More information

Instance-level recognition

Instance-level recognition Instance-level recognition 1) Local invariant features 2) Matching and recognition with local features 3) Efficient visual search 4) Very large scale indexing Matching of descriptors Matching and 3D reconstruction

More information

Comparing Local Feature Descriptors in plsa-based Image Models

Comparing Local Feature Descriptors in plsa-based Image Models Comparing Local Feature Descriptors in plsa-based Image Models Eva Hörster 1,ThomasGreif 1, Rainer Lienhart 1, and Malcolm Slaney 2 1 Multimedia Computing Lab, University of Augsburg, Germany {hoerster,lienhart}@informatik.uni-augsburg.de

More information

Stable hyper-pooling and query expansion for event detection

Stable hyper-pooling and query expansion for event detection 3 IEEE International Conference on Computer Vision Stable hyper-pooling and query expansion for event detection Matthijs Douze INRIA Grenoble Jérôme Revaud INRIA Grenoble Cordelia Schmid INRIA Grenoble

More information

Joint Spectral Correspondence for Disparate Image Matching

Joint Spectral Correspondence for Disparate Image Matching 2013 IEEE Conference on Computer Vision and Pattern Recognition Joint Spectral Correspondence for Disparate Image Matching Mayank Bansal and Kostas Daniilidis GRASP Lab, University of Pennsylvania Philadelphia,

More information

SEARCH BY MOBILE IMAGE BASED ON VISUAL AND SPATIAL CONSISTENCY. Xianglong Liu, Yihua Lou, Adams Wei Yu, Bo Lang

SEARCH BY MOBILE IMAGE BASED ON VISUAL AND SPATIAL CONSISTENCY. Xianglong Liu, Yihua Lou, Adams Wei Yu, Bo Lang SEARCH BY MOBILE IMAGE BASED ON VISUAL AND SPATIAL CONSISTENCY Xianglong Liu, Yihua Lou, Adams Wei Yu, Bo Lang State Key Laboratory of Software Development Environment Beihang University, Beijing 100191,

More information

FPM: Fine Pose Parts-Based Model with 3D CAD Models

FPM: Fine Pose Parts-Based Model with 3D CAD Models FPM: Fine Pose Parts-Based Model with 3D CAD Models The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published Publisher

More information

Semantically Guided Location Recognition for Outdoors Scenes

Semantically Guided Location Recognition for Outdoors Scenes Semantically Guided Location Recognition for Outdoors Scenes Arsalan Mousavian and Jana Košecká and Jyh-Ming Lien Abstract The problem of image based localization has a long history both in robotics and

More information

Evaluation of GIST descriptors for web-scale image search

Evaluation of GIST descriptors for web-scale image search Evaluation of GIST descriptors for web-scale image search Matthijs Douze, Hervé Jégou, Sandhawalia Harsimrat, Laurent Amsaleg, Cordelia Schmid To cite this version: Matthijs Douze, Hervé Jégou, Sandhawalia

More information

Efficient Representation of Local Geometry for Large Scale Object Retrieval

Efficient Representation of Local Geometry for Large Scale Object Retrieval Efficient Representation of Local Geometry for Large Scale Object Retrieval Michal Perd och, Ondřej Chum and Jiří Matas Center for Machine Perception, Department of Cybernetics Faculty of Electrical Engineering,

More information

Learning Vocabularies over a Fine Quantization

Learning Vocabularies over a Fine Quantization International Journal of Computer Vision manuscript No. (will be inserted by the editor) Learning Vocabularies over a Fine Quantization Andrej Mikulik Michal Perdoch Ondřej Chum Jiří Matas Received: date

More information

CV as making bank. Intel buys Mobileye! $15 billion. Mobileye:

CV as making bank. Intel buys Mobileye! $15 billion. Mobileye: CV as making bank Intel buys Mobileye! $15 billion Mobileye: Spin-off from Hebrew University, Israel 450 engineers 15 million cars installed 313 car models June 2016 - Tesla left Mobileye Fatal crash car

More information

Recognition with Bag-ofWords. (Borrowing heavily from Tutorial Slides by Li Fei-fei)

Recognition with Bag-ofWords. (Borrowing heavily from Tutorial Slides by Li Fei-fei) Recognition with Bag-ofWords (Borrowing heavily from Tutorial Slides by Li Fei-fei) Recognition So far, we ve worked on recognizing edges Now, we ll work on recognizing objects We will use a bag-of-words

More information

Place Recognition and Online Learning in Dynamic Scenes with Spatio-Temporal Landmarks

Place Recognition and Online Learning in Dynamic Scenes with Spatio-Temporal Landmarks LEARNING IN DYNAMIC SCENES WITH SPATIO-TEMPORAL LANDMARKS 1 Place Recognition and Online Learning in Dynamic Scenes with Spatio-Temporal Landmarks Edward Johns and Guang-Zhong Yang ej09@imperial.ac.uk

More information

Bag-of-features. Cordelia Schmid

Bag-of-features. Cordelia Schmid Bag-of-features for category classification Cordelia Schmid Visual search Particular objects and scenes, large databases Category recognition Image classification: assigning a class label to the image

More information

Indexing local features and instance recognition May 16 th, 2017

Indexing local features and instance recognition May 16 th, 2017 Indexing local features and instance recognition May 16 th, 2017 Yong Jae Lee UC Davis Announcements PS2 due next Monday 11:59 am 2 Recap: Features and filters Transforming and describing images; textures,

More information

Fuzzy based Multiple Dictionary Bag of Words for Image Classification

Fuzzy based Multiple Dictionary Bag of Words for Image Classification Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image

More information

Visual Place Recognition with Repetitive Structures

Visual Place Recognition with Repetitive Structures 13 IEEE Conference on Computer Vision and Pattern Recognition Visual Place Recognition with Repetitive Structures Akihiko Torii Tokyo Tech Josef Sivic INRIA Tomas Pajdla CTU in Prague Masatoshi Okutomi

More information

Region Graphs for Organizing Image Collections

Region Graphs for Organizing Image Collections Region Graphs for Organizing Image Collections Alexander Ladikos 1, Edmond Boyer 2, Nassir Navab 1, and Slobodan Ilic 1 1 Chair for Computer Aided Medical Procedures, Technische Universität München 2 Perception

More information

By Suren Manvelyan,

By Suren Manvelyan, By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan,

More information