Painting-to-3D Model Alignment Via Discriminative Visual Elements. Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)

Size: px

Start display at page:

Download "Painting-to-3D Model Alignment Via Discriminative Visual Elements. Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)"

Henry Cole
5 years ago
Views:

1 Painting-to-3D Model Alignment Via Discriminative Visual Elements Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)

5 This work

6 Browsing across time

7 Inputs Goal Output Painting 3D model Approximate viewpoint

8 1980s: 2D-3D Alignment Photograph 3D model [Roberts 1963] Alignment: Huttenlocher & Ullman (198 [Huttenlocher and Ullman 1990] [Faugeras&Hebert 86], [Grimson&Lozano-Perez 86], [Lowe, 87]

9 Modern 2D-3D alignment [Snavely et al. SIGGRAPH 2006] [Sattler et al. ICCV 2011] [Philbin et al. CVPR 2007] See also [Rothganger et al. CVPR 2003], [Arandjelovic ánd Zisserman ICCV 2011], [Wu et al. 2008], [Li et al. ECCV 2012], [Lim et al. ICCV 2013], [Baatz et al. ECCV 2012]

10 Difficulty: limits of SIFT matching See also: [Chum & Matas CVPR 2006], [Schechtman & Irani, CVPR 2007], [Russell, Sivic, Ponce, Dessalles 2011], [Hauagge & Snavely CVPR 2012]

CVPR 2003], [Arandjelovic ánd Zisserman ICCV

11 Modern 2D-3D alignment [Snavely et al. SIGGRAPH 2006] [Sattler et al. ICCV 2011] [Philbin et al. CVPR 2007] See also [Rothganger et al. CVPR 2003], [Arandjelovic ánd Zisserman ICCV 2011], [Wu et al. 2008], [Li et al. ECCV 2012], [Lim et al. ICCV 2013], [Baatz et al. ECCV 2012]

12 Retrieval using a global descriptor [Russell, Sivic, Ponce, Dessales, 2011] used GIST [Oliva and Torralba 2001] [Shrivastava, Malisiewicz, Gupta, Efros, 2011]

SIGGRAPH 2012] Unsupervised Discovery of Mid- Level Discriminative

13 Discriminative part-based representation What makes Paris look like Paris? [Doersch et al. SIGGRAPH 2012] Unsupervised Discovery of Mid- Level Discriminative Patches [Singh et al. ECCV 2012] See also [Bourdev & Malik ICCV 2009], [Juneja et al. CVPR 2013], [Jain et al. CVPR 2013],

14 This work: Discriminative parts in 3D See also [Bourdev & Malik ICCV 2009], [Juneja et al. CVPR 2013], [Jain et al. CVPR 2013],

15 Alignment with a large scale vocabulary of discriminative 3D parts 1. Select parts 2. Match parts 3. Recover view

16 Step 1: Selecting parts

17 Step 1: Discarding parts from unlikely views Synthesize ~10,000 viewpoints for an architectural site See also: [Irschara et al. CVPR 2009], [Baatz et al. ECCV 2012]

18 Step 1: Selecting parts Representative

19 Step 1: Selecting discriminative parts [Dalal and Triggs 2005], [Felzenszwald et al 2010]

20 Step 1: Selecting parts Discriminative

21 Step 1: Selecting stable parts

22 Step 1: Selecting stable parts Require elements to be reliably detectable in near-by views Original view Nearby view Source Incorrect matches Target See also [Doersch et al. SIGGRAPH 2012] [Singh et al. ECCV 2012], [Juneja et al. CVPR 2013]

23 Step 1: Selecting parts Stable

24 Summary: representation of the 3D model a) Representative b) Discriminative c) Stable

25 Step 2: match across style?

26 Step 2: match across style

27 Step 2: match across style

28 Step 3: recover viewpoint

29 Step 3: recover viewpoint This can be seen as a calibration problem. We use an affine calibration (see paper for details) Calibration parameters s calib x = α. s x + β Calibrated score Uncalibrated score

30 Step 3: recover viewpoint

31 Step 3: recover viewpoint

32 Step 3: recover viewpoint Run RANSAC on the top 25 matches

33 Evaluation data Four 3D models from three sites San Marco square (PMVS) San Marco (CAD) 337 depictions Trevi fountain (CAD) Notre Dame (CAD) 85 Historical photographs 147 Paintings 45 Engravings 60 Drawings

34 Results: painting Input Aligned 3D model

35 Results: historical photograph

36 Results: painting

37 Results: drawing

38 Results: drawing

39 Results: engraving

40 Results: Watercolor

41 Recovered camera frusta

42 Failure modes: 3D symmetries

43 Failure modes: unusual viewpoints

44 Benchmark matching results Results on [Huagge and Snavely 2012] dataset MAP ( desceval ) [Hauagge and Snavely 2012] 0.58 Our matching without calibration 0.60 Our matching with calibration 0.78

45 User Study Rate the alignment as: GOOD BAD COARSE

46 Quantitative Results Good matches Exemplar-SVM [Shirivastava et al. 2011] 34 % SIFT on rendered views 40 % Ours 51 %

47 Quantitative Results Depiction style influence: Good match Coarse match No match Photographs 59% 20% 21% Paintings 53% 30% 18% Drawings 52% 29% 19% Engravings 57% 26% 17%

48 CVPR 2014 work Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models M. Aubry, D. Maturana, A. Efros, B. Russell and J. Sivic

49 Conclusion Leverage large vocabulary of discriminative 3D parts Reliably align non-photographic depictions to 3D model More results, data and soon code

51 Conclusion Leverage large vocabulary of discriminative 3D parts Reliably align non-photographic depictions to 3D model More results, data and soon code

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)