Visual Material Traits Recognizing Per-Pixel Material Context

Size: px
Start display at page:

Download "Visual Material Traits Recognizing Per-Pixel Material Context"

Transcription

1 Visual Material Traits Recognizing Per-Pixel Material Context Gabriel Schwartz Ko Nishino 1 / 22

2 2 / 22

3 Materials as Visual Context What tells us this road is unsafe? 3 / 22

4 Material Category Recognition Methods Give single predictions for the entire image Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22

5 Material Category Recognition Methods Give single predictions for the entire image Require object information object mask bounding box Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22

6 Material Category Recognition Methods Give single predictions for the entire image Require object information object mask bounding box Predict categories that are really object properties Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22

7 Material Category Recognition Methods Give single predictions for the entire image Require object information object mask bounding box Predict categories that are really object properties Object information not always available Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22

8 Intra-Class Appearance Variability Images from [7]. 5 / 22

9 Visual Material Traits: Characteristic Material Properties Image from [5]. 6 / 22

10 Visual Material Traits: Characteristic Material Properties Fuzzy Organic Smooth Material traits are locally-recognizable material properties. Image from [5]. 6 / 22

11 Visual Material Trait Appearances What material properties can we see locally? Images from [7]. 7 / 22

12 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Images from [7]. 7 / 22

13 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Images from [7]. 7 / 22

14 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth 7 / 22

15 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth Some are more challenging 7 / 22

16 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth Some are more challenging Fuzzy? Soft? 7 / 22

17 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth Some are more challenging Fuzzy? Soft? How do we represent these traits? 7 / 22

18 Learning to Represent Material Traits Learn features that model the appearance of material traits 8 / 22

19 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: 8 / 22

20 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute 8 / 22

21 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere 8 / 22

22 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere Discriminative 8 / 22

23 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere Discriminative Convolution filters may satisfy all of these properties 8 / 22

24 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere Discriminative Convolution filters may satisfy all of these properties How do we learn them? 8 / 22

25 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: min W,W 9 / 22

26 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F i=1 9 / 22

27 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches Form a sparse encoding E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F +α p 1 N i=1 N 2 E i i=1 F 9 / 22

28 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches Form a sparse encoding Have constrained magnitude E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F +α p 1 N i=1 N 2 E i +β i=1 F ( ) W 2 F + W 2 F 9 / 22

29 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches Form a sparse encoding Have constrained magnitude E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F +α p 1 N i=1 N 2 E i +β i=1 F ( ) W 2 F + W 2 F 9 / 22

30 Learned Filters Filter Responses Convolution Filters Input Images 10 / 22

31 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Describe appearances CAE cannot: Input Images 10 / 22

32 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Describe appearances CAE cannot: Input Images Color Histograms 10 / 22

33 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Input Images Describe appearances CAE cannot: Color Histograms HOG 10 / 22

34 LBP 10 / 22 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Input Images Describe appearances CAE cannot: Color Histograms HOG

35 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters 11 / 22

36 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22

37 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Train Per-Trait Classifiers Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22

38 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Train Per-Trait Classifiers Extract Features Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22

39 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Train Per-Trait Classifiers Extract Features Recognize Traits Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22

40 Material Trait Recognition Accuracy Accuracy Fuzzy 4 Shiny Smooth Soft Striped Metallic Organic Translucent Transparent Rough Liquid Woven Manmade 12 / 22

41 Patch Recognition Results Shiny Fuzzy Metallic Soft Smooth Liquid Rough Woven False Positives True Positives Shiny Fuzzy Metallic Soft Smooth Liquid Rough Woven 13 / 22

42 Per-Pixel Material Trait Maps Image from [5]. 14 / 22

43 Per-Pixel Material Trait Maps Fuzzy Organic Smooth Image from [5]. 14 / 22

44 Per-Pixel Material Trait Maps Image from [7]. 15 / 22

45 Per-Pixel Material Trait Maps Shiny Metallic Smooth Image from [7]. 15 / 22

46 What can we do with these material traits? 16 / 22

47 Material Recognition via Trait Distributions Smooth Rough Organic Soft Foliage Stone Plastic p ( trait i category j ) 17 / 22

48 Material Recognition Accuracy: Flickr Fabric Foliage Glass Leather Metal Paper Plastic Stone Water Wood Fabric Foliage Glass Leather Metal Paper Plastic Stone Water Wood Average: 49.2% [Liu et al.] (w/o obj): 42.6% (w/obj): 57.1% Images from [7]. 18 / 22

49 Material Recognition Accuracy: ImageNet Foliage Glass 0.8 Paper Plastic Stone Water Wood Foliage Glass Paper Plastic Stone Water Average: 60.5% Wood Images from [2]. 19 / 22

50 Segmentation with Material Traits Baseline NCuts Images from [5]. With Traits 20 / 22

51 Summary Material traits: may be recognized locally and accurately 21 / 22

52 Summary Material traits: may be recognized locally and accurately have distributions that encode material categories 21 / 22

53 Summary Material traits: may be recognized locally and accurately have distributions that encode material categories segment images into intuitively separate regions 21 / 22

54 Summary Material traits: may be recognized locally and accurately have distributions that encode material categories segment images into intuitively separate regions Future work: Discover new traits Improve applications 21 / 22

55 References E. H. Adelson. On Seeing Stuff: The Perception of Materials by Humans and Machines. In SPIE, pages 1 12, J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, D. Hu, L. Bo, and X. Ren. Toward Robust Material Recognition for Everyday Objects. In BMVC, pages , C. Liu, L. Sharan, E. H. Adelson, and R. Rosenholtz. Exploring Features in a Bayesian Framework for Material Recognition. In CVPR, pages , D. Martin, C. Fowlkes, D. Tal, and J. Malik. A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics. In ICCV, pages L. Sharan, C. Liu, R. Rosenholtz, and E. H. Adelson. Recognizing Materials Using Perceptually Inspired Features. International Journal of Computer Vision, L. Sharan, R. Rosenholtz, and E. Adelson. Material Perception: What Can You See in a Brief Glance? Journal of Vision, 9(8):784, / 22

Visual Material Traits: Recognizing Per-Pixel Material Context

Visual Material Traits: Recognizing Per-Pixel Material Context Visual Material Traits: Recognizing Per-Pixel Material Context Gabriel Schwartz Drexel University Ko Nishino Drexel University gbs25@drexel.edu kon@drexel.edu Abstract Information describing the materials

More information

Real-World Material Recognition for Scene Understanding

Real-World Material Recognition for Scene Understanding Real-World Material Recognition for Scene Understanding Abstract Sam Corbett-Davies Department of Computer Science Stanford University scorbett@stanford.edu In this paper we address the problem of recognizing

More information

Toward Robust Material Recognition for Everyday Objects

Toward Robust Material Recognition for Everyday Objects HU, BO, REN: EVERYDAY MATERIAL RECOGNITION 1 Toward Robust Material Recognition for Everyday Objects Diane Hu http://cseweb.ucsd.edu/~dhu/ Liefeng Bo http://www.cs.washington.edu/homes/lfb/ Xiaofeng Ren

More information

PARTIAL STYLE TRANSFER USING WEAKLY SUPERVISED SEMANTIC SEGMENTATION. Shin Matsuo Wataru Shimoda Keiji Yanai

PARTIAL STYLE TRANSFER USING WEAKLY SUPERVISED SEMANTIC SEGMENTATION. Shin Matsuo Wataru Shimoda Keiji Yanai PARTIAL STYLE TRANSFER USING WEAKLY SUPERVISED SEMANTIC SEGMENTATION Shin Matsuo Wataru Shimoda Keiji Yanai Department of Informatics, The University of Electro-Communications, Tokyo 1-5-1 Chofugaoka,

More information

Solution: filter the image, then subsample F 1 F 2. subsample blur subsample. blur

Solution: filter the image, then subsample F 1 F 2. subsample blur subsample. blur Pyramids Gaussian pre-filtering Solution: filter the image, then subsample blur F 0 subsample blur subsample * F 0 H F 1 F 1 * H F 2 { Gaussian pyramid blur F 0 subsample blur subsample * F 0 H F 1 F 1

More information

Material Recognition in the Wild with the Materials in Context Database

Material Recognition in the Wild with the Materials in Context Database Material Recognition in the Wild with the Materials in Context Database Sean Bell Paul Upchurch Noah Snavely Kavita Bala Department of Computer Science, Cornell University {sbell,paulu,snavely,kb}@cs.cornell.edu

More information

Material Recognition: Bayesian Inference or SVMs?

Material Recognition: Bayesian Inference or SVMs? Material Recognition: Bayesian Inference or SVMs? Ishrat Badami Supervised by: Michael Weinmann, Reinhard Klein Institute of Computer Graphics University of Bonn Abstract Material recognition is an important

More information

Attributes and More Crowdsourcing

Attributes and More Crowdsourcing Attributes and More Crowdsourcing Computer Vision CS 143, Brown James Hays Many slides from Derek Hoiem Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed.

More information

Supplementary Materials for Salient Object Detection: A

Supplementary Materials for Salient Object Detection: A Supplementary Materials for Salient Object Detection: A Discriminative Regional Feature Integration Approach Huaizu Jiang, Zejian Yuan, Ming-Ming Cheng, Yihong Gong Nanning Zheng, and Jingdong Wang Abstract

More information

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem Many slides from Derek Hoiem Attributes Computer Vision James Hays Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed. Human-in-the-loop recognition:

More information

Attribute learning in large-scale datasets. Olga Russakovsky and Li Fei-Fei

Attribute learning in large-scale datasets. Olga Russakovsky and Li Fei-Fei Attribute learning in large-scale datasets Olga Russakovsky and Li Fei-Fei Categorization of the visual world Berry Fruit Entity Tree Instrument Furniture Categorization of the visual world Berry Fruit

More information

Two Routes for Image to Image Translation: Rule based vs. Learning based. Minglun Gong, Memorial Univ. Collaboration with Mr.

Two Routes for Image to Image Translation: Rule based vs. Learning based. Minglun Gong, Memorial Univ. Collaboration with Mr. Two Routes for Image to Image Translation: Rule based vs. Learning based Minglun Gong, Memorial Univ. Collaboration with Mr. Zili Yi Introduction A brief history of image processing Image to Image translation

More information

Reconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling

Reconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling Reconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling Michael Maire 1,2 Stella X. Yu 3 Pietro Perona 2 1 TTI Chicago 2 California Institute of Technology 3 University of California

More information

JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA

JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS Zhao Chen Machine Learning Intern, NVIDIA ABOUT ME 5th year PhD student in physics @ Stanford by day, deep learning computer vision scientist

More information

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)

More information

Sketchable Histograms of Oriented Gradients for Object Detection

Sketchable Histograms of Oriented Gradients for Object Detection Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The

More information

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun Presented by Tushar Bansal Objective 1. Get bounding box for all objects

More information

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,

More information

Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi

Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi hrazvi@stanford.edu 1 Introduction: We present a method for discovering visual hierarchy in a set of images. Automatically grouping

More information

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213) Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding

More information

arxiv: v1 [cs.cv] 22 Apr 2015

arxiv: v1 [cs.cv] 22 Apr 2015 LOAD: Local Orientation Adaptive Descriptor for Texture and Material Classification Xianbiao Qi a,b, Guoying Zhao a, Linlin Shen b, Qingquan Li b, Matti Pietikäinen a arxiv:1504.05809v1 [cs.cv] 22 Apr

More information

(Deep) Learning for Robot Perception and Navigation. Wolfram Burgard

(Deep) Learning for Robot Perception and Navigation. Wolfram Burgard (Deep) Learning for Robot Perception and Navigation Wolfram Burgard Deep Learning for Robot Perception (and Navigation) Lifeng Bo, Claas Bollen, Thomas Brox, Andreas Eitel, Dieter Fox, Gabriel L. Oliveira,

More information

Efficient Similarity Search in Scientific Databases with Feature Signatures

Efficient Similarity Search in Scientific Databases with Feature Signatures DATA MANAGEMENT AND DATA EXPLORATION GROUP Prof. Dr. rer. nat. Thomas Seidl DATA MANAGEMENT AND DATA EXPLORATION GROUP Prof. Dr. rer. nat. Thomas Seidl Efficient Similarity Search in Scientific Databases

More information

The Caltech-UCSD Birds Dataset

The Caltech-UCSD Birds Dataset The Caltech-UCSD Birds-200-2011 Dataset Catherine Wah 1, Steve Branson 1, Peter Welinder 2, Pietro Perona 2, Serge Belongie 1 1 University of California, San Diego 2 California Institute of Technology

More information

Part Localization by Exploiting Deep Convolutional Networks

Part Localization by Exploiting Deep Convolutional Networks Part Localization by Exploiting Deep Convolutional Networks Marcel Simon, Erik Rodner, and Joachim Denzler Computer Vision Group, Friedrich Schiller University of Jena, Germany www.inf-cv.uni-jena.de Abstract.

More information

24 hours of Photo Sharing. installation by Erik Kessels

24 hours of Photo Sharing. installation by Erik Kessels 24 hours of Photo Sharing installation by Erik Kessels And sometimes Internet photos have useful labels Im2gps. Hays and Efros. CVPR 2008 But what if we want more? Image Categorization Training Images

More information

arxiv: v1 [cs.cv] 24 Aug 2016

arxiv: v1 [cs.cv] 24 Aug 2016 arxiv:1608.06985v1 [cs.cv] 24 Aug 2016 A 4D Light-Field Dataset and CNN Architectures for Material Recognition Ting-Chun Wang 1, Jun-Yan Zhu 1, Ebi Hiroaki 2, Manmohan Chandraker 2, Alexei A. Efros 1,

More information

arxiv: v1 [cs.cv] 10 Jul 2014

arxiv: v1 [cs.cv] 10 Jul 2014 ARTOS Adaptive Real-Time Object Detection System Björn Barz Erik Rodner bjoern.barz@uni-jena.de erik.rodner@uni-jena.de arxiv:1407.2721v1 [cs.cv] 10 Jul 2014 Joachim Denzler Computer Vision Group Friedrich

More information

Latest development in image feature representation and extraction

Latest development in image feature representation and extraction International Journal of Advanced Research and Development ISSN: 2455-4030, Impact Factor: RJIF 5.24 www.advancedjournal.com Volume 2; Issue 1; January 2017; Page No. 05-09 Latest development in image

More information

Unified, real-time object detection

Unified, real-time object detection Unified, real-time object detection Final Project Report, Group 02, 8 Nov 2016 Akshat Agarwal (13068), Siddharth Tanwar (13699) CS698N: Recent Advances in Computer Vision, Jul Nov 2016 Instructor: Gaurav

More information

Linear combinations of simple classifiers for the PASCAL challenge

Linear combinations of simple classifiers for the PASCAL challenge Linear combinations of simple classifiers for the PASCAL challenge Nik A. Melchior and David Lee 16 721 Advanced Perception The Robotics Institute Carnegie Mellon University Email: melchior@cmu.edu, dlee1@andrew.cmu.edu

More information

Deep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong

Deep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong Deep Learning for Face Recognition Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong Deep Learning Results on LFW Method Accuracy (%) # points # training images Huang

More information

Deformable Part Models

Deformable Part Models CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones

More information

arxiv: v2 [cs.cv] 25 Aug 2014

arxiv: v2 [cs.cv] 25 Aug 2014 ARTOS Adaptive Real-Time Object Detection System Björn Barz Erik Rodner bjoern.barz@uni-jena.de erik.rodner@uni-jena.de arxiv:1407.2721v2 [cs.cv] 25 Aug 2014 Joachim Denzler Computer Vision Group Friedrich

More information

Multi-Glance Attention Models For Image Classification

Multi-Glance Attention Models For Image Classification Multi-Glance Attention Models For Image Classification Chinmay Duvedi Stanford University Stanford, CA cduvedi@stanford.edu Pararth Shah Stanford University Stanford, CA pararth@stanford.edu Abstract We

More information

Object Recognition with Deformable Models

Object Recognition with Deformable Models Object Recognition with Deformable Models Pedro F. Felzenszwalb Department of Computer Science University of Chicago Joint work with: Dan Huttenlocher, Joshua Schwartz, David McAllester, Deva Ramanan.

More information

FIGARO, HAIR DETECTION AND SEGMENTATION IN THE WILD

FIGARO, HAIR DETECTION AND SEGMENTATION IN THE WILD FIGARO, HAIR DETECTION AND SEGMENTATION IN THE WILD Michele Svanera, Umar Riaz Muhammad, Riccardo Leonardi, and Sergio Benini Department of Information Engineering, University of Brescia, Italy Email:

More information

Fast Border Ownership Assignment with Bio-Inspired Features

Fast Border Ownership Assignment with Bio-Inspired Features Fast Border Ownership Assignment with Bio-Inspired Features Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos Computer Vision Lab and UMIACS University of Maryland College Park What is Border Ownership?

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

A Novel Representation and Pipeline for Object Detection

A Novel Representation and Pipeline for Object Detection A Novel Representation and Pipeline for Object Detection Vishakh Hegde Stanford University vishakh@stanford.edu Manik Dhar Stanford University dmanik@stanford.edu Abstract Object detection is an important

More information

One Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models

One Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models One Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models [Supplemental Materials] 1. Network Architecture b ref b ref +1 We now describe the architecture of the networks

More information

CS 2770: Computer Vision. Edges and Segments. Prof. Adriana Kovashka University of Pittsburgh February 21, 2017

CS 2770: Computer Vision. Edges and Segments. Prof. Adriana Kovashka University of Pittsburgh February 21, 2017 CS 2770: Computer Vision Edges and Segments Prof. Adriana Kovashka University of Pittsburgh February 21, 2017 Edges vs Segments Figure adapted from J. Hays Edges vs Segments Edges More low-level Don t

More information

ROTATION INVARIANT SPARSE CODING AND PCA

ROTATION INVARIANT SPARSE CODING AND PCA ROTATION INVARIANT SPARSE CODING AND PCA NATHAN PFLUEGER, RYAN TIMMONS Abstract. We attempt to encode an image in a fashion that is only weakly dependent on rotation of objects within the image, as an

More information

Detection III: Analyzing and Debugging Detection Methods

Detection III: Analyzing and Debugging Detection Methods CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can

More information

Artistic ideation based on computer vision methods

Artistic ideation based on computer vision methods Journal of Theoretical and Applied Computer Science Vol. 6, No. 2, 2012, pp. 72 78 ISSN 2299-2634 http://www.jtacs.org Artistic ideation based on computer vision methods Ferran Reverter, Pilar Rosado,

More information

Proceedings of the International MultiConference of Engineers and Computer Scientists 2018 Vol I IMECS 2018, March 14-16, 2018, Hong Kong

Proceedings of the International MultiConference of Engineers and Computer Scientists 2018 Vol I IMECS 2018, March 14-16, 2018, Hong Kong , March 14-16, 2018, Hong Kong , March 14-16, 2018, Hong Kong , March 14-16, 2018, Hong Kong , March 14-16, 2018, Hong Kong TABLE I CLASSIFICATION ACCURACY OF DIFFERENT PRE-TRAINED MODELS ON THE TEST DATA

More information

Recap from Monday. Visualizing Networks Caffe overview Slides are now online

Recap from Monday. Visualizing Networks Caffe overview Slides are now online Recap from Monday Visualizing Networks Caffe overview Slides are now online Today Edges and Regions, GPB Fast Edge Detection Using Structured Forests Zhihao Li Holistically-Nested Edge Detection Yuxin

More information

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling [DOI: 10.2197/ipsjtcva.7.99] Express Paper Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling Takayoshi Yamashita 1,a) Takaya Nakamura 1 Hiroshi Fukui 1,b) Yuji

More information

Large-scale Video Classification with Convolutional Neural Networks

Large-scale Video Classification with Convolutional Neural Networks Large-scale Video Classification with Convolutional Neural Networks Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei Note: Slide content mostly from : Bay Area

More information

Beyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University

Beyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University Beyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University Why Image Retrieval? World Wide Web: Millions of hosts Billions of images Growth of video

More information

TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK

TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK 1 Po-Jen Lai ( 賴柏任 ), 2 Chiou-Shann Fuh ( 傅楸善 ) 1 Dept. of Electrical Engineering, National Taiwan University, Taiwan 2 Dept.

More information

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)

More information

Pairwise Rotation Invariant Co-occurrence Local Binary Pattern

Pairwise Rotation Invariant Co-occurrence Local Binary Pattern Pairwise Rotation Invariant Co-occurrence Local Binary Pattern Xianbiao Qi 1, Rong Xiao 2, Jun Guo 1, and Lei Zhang 2 1 Beijing University of Posts and Telecommunications, Beijing 100876, P.R. China 2

More information

OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES

OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES Image Processing & Communication, vol. 18,no. 1, pp.31-36 DOI: 10.2478/v10248-012-0088-x 31 OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES RAFAŁ KOZIK ADAM MARCHEWKA Institute of Telecommunications, University

More information

Light Introduction. Read this article for more background information:

Light Introduction. Read this article for more background information: Light Introduction Read this article for more background information: Color Absorption Article Watch the following video on terms like Absorption, Reflection, and Transmission Video 1 Waves 1 Stephanie

More information

Category-level localization

Category-level localization Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object

More information

Photo OCR ( )

Photo OCR ( ) Photo OCR (2017-2018) Xiang Bai Huazhong University of Science and Technology Outline VALSE2018, DaLian Xiang Bai 2 Deep Direct Regression for Multi-Oriented Scene Text Detection [He et al., ICCV, 2017.]

More information

Supervised texture detection in images

Supervised texture detection in images Supervised texture detection in images Branislav Mičušík and Allan Hanbury Pattern Recognition and Image Processing Group, Institute of Computer Aided Automation, Vienna University of Technology Favoritenstraße

More information

THE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM

THE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM THE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM Kuo-Hsin Tu ( 塗國星 ), Chiou-Shann Fuh ( 傅楸善 ) Dept. of Computer Science and Information Engineering, National Taiwan University, Taiwan E-mail: p04922004@csie.ntu.edu.tw,

More information

Research of Traffic Flow Based on SVM Method. Deng-hong YIN, Jian WANG and Bo LI *

Research of Traffic Flow Based on SVM Method. Deng-hong YIN, Jian WANG and Bo LI * 2017 2nd International onference on Artificial Intelligence: Techniques and Applications (AITA 2017) ISBN: 978-1-60595-491-2 Research of Traffic Flow Based on SVM Method Deng-hong YIN, Jian WANG and Bo

More information

Conveying 3D Shape and Depth with Textured and Transparent Surfaces Victoria Interrante

Conveying 3D Shape and Depth with Textured and Transparent Surfaces Victoria Interrante Conveying 3D Shape and Depth with Textured and Transparent Surfaces Victoria Interrante In scientific visualization, there are many applications in which researchers need to achieve an integrated understanding

More information

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

COMP 551 Applied Machine Learning Lecture 16: Deep Learning COMP 551 Applied Machine Learning Lecture 16: Deep Learning Instructor: Ryan Lowe (ryan.lowe@cs.mcgill.ca) Slides mostly by: Class web page: www.cs.mcgill.ca/~hvanho2/comp551 Unless otherwise noted, all

More information

Aggregating Descriptors with Local Gaussian Metrics

Aggregating Descriptors with Local Gaussian Metrics Aggregating Descriptors with Local Gaussian Metrics Hideki Nakayama Grad. School of Information Science and Technology The University of Tokyo Tokyo, JAPAN nakayama@ci.i.u-tokyo.ac.jp Abstract Recently,

More information

Texture April 17 th, 2018

Texture April 17 th, 2018 Texture April 17 th, 2018 Yong Jae Lee UC Davis Announcements PS1 out today Due 5/2 nd, 11:59 pm start early! 2 Review: last time Edge detection: Filter for gradient Threshold gradient magnitude, thin

More information

Texture C H A P T E R 6

Texture C H A P T E R 6 C H A P T E R 6 Texture Texture is a phenomenon that is widespread, easy to recognise, and hard to define. Typically, whether an effect is referred to as texture or not depends on the scale at which it

More information

Spatial Latent Dirichlet Allocation

Spatial Latent Dirichlet Allocation Spatial Latent Dirichlet Allocation Xiaogang Wang and Eric Grimson Computer Science and Computer Science and Artificial Intelligence Lab Massachusetts Tnstitute of Technology, Cambridge, MA, 02139, USA

More information

Beyond Bags of Features

Beyond Bags of Features : for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015

More information

Learning the Ecological Statistics of Perceptual Organization

Learning the Ecological Statistics of Perceptual Organization Learning the Ecological Statistics of Perceptual Organization Charless Fowlkes work with David Martin, Xiaofeng Ren and Jitendra Malik at University of California at Berkeley 1 How do ideas from perceptual

More information

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin

More information

The attributes of objects. D.A. Forsyth, UIUC channelling Derek Hoiem, UIUC, with Ali Farhadi, Ian Endres, Gang Wang all of UIUC

The attributes of objects. D.A. Forsyth, UIUC channelling Derek Hoiem, UIUC, with Ali Farhadi, Ian Endres, Gang Wang all of UIUC The attributes of objects D.A. Forsyth, UIUC channelling Derek Hoiem, UIUC, with Ali Farhadi, Ian Endres, Gang Wang all of UIUC Obtain dataset Build features Mess around with classifiers, probability,

More information

Deep learning for object detection. Slides from Svetlana Lazebnik and many others

Deep learning for object detection. Slides from Svetlana Lazebnik and many others Deep learning for object detection Slides from Svetlana Lazebnik and many others Recent developments in object detection 80% PASCAL VOC mean0average0precision0(map) 70% 60% 50% 40% 30% 20% 10% Before deep

More information

Automatic Ranking of Images on the Web

Automatic Ranking of Images on the Web Automatic Ranking of Images on the Web HangHang Zhang Electrical Engineering Department Stanford University hhzhang@stanford.edu Zixuan Wang Electrical Engineering Department Stanford University zxwang@stanford.edu

More information

Discriminative Illumination:

Discriminative Illumination: Discriminative Illumination: Per-pixel Classification of Raw Materials based on Optimal Projections of Spectral BRDFs Jinwei Gu and Chao Liu Munsell Color Science Laboratory Chester F. Carlson Center for

More information

Lecture 6: Texture. Tuesday, Sept 18

Lecture 6: Texture. Tuesday, Sept 18 Lecture 6: Texture Tuesday, Sept 18 Graduate students Problem set 1 extension ideas Chamfer matching Hierarchy of shape prototypes, search over translations Comparisons with Hausdorff distance, L1 on

More information

Visual Object Recognition

Visual Object Recognition Visual Object Recognition Lecture 3: Descriptors Per-Erik Forssén, docent Computer Vision Laboratory Department of Electrical Engineering Linköping University 2015 2014 Per-Erik Forssén Lecture 3: Descriptors

More information

CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning

CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning Justin Chen Stanford University justinkchen@stanford.edu Abstract This paper focuses on experimenting with

More information

Detecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning

Detecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning Detecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning Matthias Perkonigg 1, Johannes Hofmanninger 1, Björn Menze 2, Marc-André Weber 3, and Georg Langs 1 1 Computational Imaging Research

More information

Multiview Feature Learning

Multiview Feature Learning Multiview Feature Learning Roland Memisevic Frankfurt, Montreal Tutorial at IPAM 2012 Roland Memisevic (Frankfurt, Montreal) Multiview Feature Learning Tutorial at IPAM 2012 1 / 163 Outline 1 Introduction

More information

Filters + linear algebra

Filters + linear algebra Filters + linear algebra Outline Efficiency (pyramids, separability, steerability) Linear algebra Bag-of-words Recall: Canny Derivative-of-Gaussian = Gaussian * [1-1] Ideal image +noise filtered Fundamental

More information

arxiv: v1 [cs.mm] 12 Jan 2016

arxiv: v1 [cs.mm] 12 Jan 2016 Learning Subclass Representations for Visually-varied Image Classification Xinchao Li, Peng Xu, Yue Shi, Martha Larson, Alan Hanjalic Multimedia Information Retrieval Lab, Delft University of Technology

More information

The SIFT (Scale Invariant Feature

The SIFT (Scale Invariant Feature The SIFT (Scale Invariant Feature Transform) Detector and Descriptor developed by David Lowe University of British Columbia Initial paper ICCV 1999 Newer journal paper IJCV 2004 Review: Matt Brown s Canonical

More information

Learning Convolutional Feature Hierarchies for Visual Recognition

Learning Convolutional Feature Hierarchies for Visual Recognition Learning Convolutional Feature Hierarchies for Visual Recognition Koray Kavukcuoglu, Pierre Sermanet, Y-Lan Boureau, Karol Gregor, Michael Mathieu, Yann LeCun Computer Science Department Courant Institute

More information

Flow Estimation. Min Bai. February 8, University of Toronto. Min Bai (UofT) Flow Estimation February 8, / 47

Flow Estimation. Min Bai. February 8, University of Toronto. Min Bai (UofT) Flow Estimation February 8, / 47 Flow Estimation Min Bai University of Toronto February 8, 2016 Min Bai (UofT) Flow Estimation February 8, 2016 1 / 47 Outline Optical Flow - Continued Min Bai (UofT) Flow Estimation February 8, 2016 2

More information

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Part II: Visual Features and Representations Liangliang Cao, IBM Watson Research Center Evolvement of Visual Features

More information

Boundaries and Sketches

Boundaries and Sketches Boundaries and Sketches Szeliski 4.2 Computer Vision James Hays Many slides from Michael Maire, Jitendra Malek Today s lecture Segmentation vs Boundary Detection Why boundaries / Grouping? Recap: Canny

More information

Places Challenge 2017

Places Challenge 2017 Places Challenge 2017 Scene Parsing Task CASIA_IVA_JD Jun Fu, Jing Liu, Longteng Guo, Haijie Tian, Fei Liu, Hanqing Lu Yong Li, Yongjun Bao, Weipeng Yan National Laboratory of Pattern Recognition, Institute

More information

Object Detection by 3D Aspectlets and Occlusion Reasoning

Object Detection by 3D Aspectlets and Occlusion Reasoning Object Detection by 3D Aspectlets and Occlusion Reasoning Yu Xiang University of Michigan Silvio Savarese Stanford University In the 4th International IEEE Workshop on 3D Representation and Recognition

More information

Traffic Sign Localization and Classification Methods: An Overview

Traffic Sign Localization and Classification Methods: An Overview Traffic Sign Localization and Classification Methods: An Overview Ivan Filković University of Zagreb Faculty of Electrical Engineering and Computing Department of Electronics, Microelectronics, Computer

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

DEEP BLIND IMAGE QUALITY ASSESSMENT

DEEP BLIND IMAGE QUALITY ASSESSMENT DEEP BLIND IMAGE QUALITY ASSESSMENT BY LEARNING SENSITIVITY MAP Jongyoo Kim, Woojae Kim and Sanghoon Lee ICASSP 2018 Deep Learning and Convolutional Neural Networks (CNNs) SOTA in computer vision & image

More information

MRFs and Segmentation with Graph Cuts

MRFs and Segmentation with Graph Cuts 02/24/10 MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class Finish up EM MRFs w ij i Segmentation with Graph Cuts j EM Algorithm: Recap

More information

Region-based Segmentation and Object Detection

Region-based Segmentation and Object Detection Region-based Segmentation and Object Detection Stephen Gould 1 Tianshi Gao 1 Daphne Koller 2 1 Department of Electrical Engineering, Stanford University 2 Department of Computer Science, Stanford University

More information

Crowdsourcing Annotations for Visual Object Detection

Crowdsourcing Annotations for Visual Object Detection Crowdsourcing Annotations for Visual Object Detection Hao Su, Jia Deng, Li Fei-Fei Computer Science Department, Stanford University Abstract A large number of images with ground truth object bounding boxes

More information

Exploiting noisy web data for largescale visual recognition

Exploiting noisy web data for largescale visual recognition Exploiting noisy web data for largescale visual recognition Lamberto Ballan University of Padova, Italy CVPRW WebVision - Jul 26, 2017 Datasets drive computer vision progress ImageNet Slide credit: O.

More information

LEARNING BOUNDARIES WITH COLOR AND DEPTH. Zhaoyin Jia, Andrew Gallagher, Tsuhan Chen

LEARNING BOUNDARIES WITH COLOR AND DEPTH. Zhaoyin Jia, Andrew Gallagher, Tsuhan Chen LEARNING BOUNDARIES WITH COLOR AND DEPTH Zhaoyin Jia, Andrew Gallagher, Tsuhan Chen School of Electrical and Computer Engineering, Cornell University ABSTRACT To enable high-level understanding of a scene,

More information

Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition

Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition Emotion Recognition In The Wild Challenge and Workshop (EmotiW 2013) Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition Mengyi Liu, Ruiping Wang, Zhiwu Huang, Shiguang Shan,

More information

Person Retrieval in Surveillance Video using Height, Color and Gender

Person Retrieval in Surveillance Video using Height, Color and Gender Person Retrieval in Surveillance Video using Height, Color and Gender Hiren Galiyawala 1, Kenil Shah 2, Vandit Gajjar 1, Mehul S. Raval 1 1 School of Engineering and Applied Science, Ahmedabad University,

More information

Tri-modal Human Body Segmentation

Tri-modal Human Body Segmentation Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4

More information

ECS 289H: Visual Recognition Fall Yong Jae Lee Department of Computer Science

ECS 289H: Visual Recognition Fall Yong Jae Lee Department of Computer Science ECS 289H: Visual Recognition Fall 2014 Yong Jae Lee Department of Computer Science Plan for today Questions? Research overview Standard supervised visual learning building Category models Annotators tree

More information

Color Local Texture Features Based Face Recognition

Color Local Texture Features Based Face Recognition Color Local Texture Features Based Face Recognition Priyanka V. Bankar Department of Electronics and Communication Engineering SKN Sinhgad College of Engineering, Korti, Pandharpur, Maharashtra, India

More information