Visual Material Traits Recognizing Per-Pixel Material Context
|
|
- Eric Warner
- 5 years ago
- Views:
Transcription
1 Visual Material Traits Recognizing Per-Pixel Material Context Gabriel Schwartz Ko Nishino 1 / 22
2 2 / 22
3 Materials as Visual Context What tells us this road is unsafe? 3 / 22
4 Material Category Recognition Methods Give single predictions for the entire image Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22
5 Material Category Recognition Methods Give single predictions for the entire image Require object information object mask bounding box Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22
6 Material Category Recognition Methods Give single predictions for the entire image Require object information object mask bounding box Predict categories that are really object properties Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22
7 Material Category Recognition Methods Give single predictions for the entire image Require object information object mask bounding box Predict categories that are really object properties Object information not always available Adelson [1] Liu et al. [4] Hu et al. [3] Sharan et al. [6] Images from [2]. 4 / 22
8 Intra-Class Appearance Variability Images from [7]. 5 / 22
9 Visual Material Traits: Characteristic Material Properties Image from [5]. 6 / 22
10 Visual Material Traits: Characteristic Material Properties Fuzzy Organic Smooth Material traits are locally-recognizable material properties. Image from [5]. 6 / 22
11 Visual Material Trait Appearances What material properties can we see locally? Images from [7]. 7 / 22
12 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Images from [7]. 7 / 22
13 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Images from [7]. 7 / 22
14 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth 7 / 22
15 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth Some are more challenging 7 / 22
16 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth Some are more challenging Fuzzy? Soft? 7 / 22
17 Visual Material Trait Appearances What material properties can we see locally? Certain properties are easy to describe Shiny Smooth Some are more challenging Fuzzy? Soft? How do we represent these traits? 7 / 22
18 Learning to Represent Material Traits Learn features that model the appearance of material traits 8 / 22
19 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: 8 / 22
20 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute 8 / 22
21 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere 8 / 22
22 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere Discriminative 8 / 22
23 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere Discriminative Convolution filters may satisfy all of these properties 8 / 22
24 Learning to Represent Material Traits Learn features that model the appearance of material traits Features should be: Fast to compute Able to be extracted anywhere Discriminative Convolution filters may satisfy all of these properties How do we learn them? 8 / 22
25 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: min W,W 9 / 22
26 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F i=1 9 / 22
27 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches Form a sparse encoding E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F +α p 1 N i=1 N 2 E i i=1 F 9 / 22
28 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches Form a sparse encoding Have constrained magnitude E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F +α p 1 N i=1 N 2 E i +β i=1 F ( ) W 2 F + W 2 F 9 / 22
29 Learning Filters for Trait Representation Convolutional Autoencoder (CAE) model for feature learning Find optimal filters (W) s.t. they: Model trait patches Form a sparse encoding Have constrained magnitude E i = h (W I i + b e ) R i = W E i + b r 1 h (x) = 1 min W,W N 0 1 N I i R i 2 F +α p 1 N i=1 N 2 E i +β i=1 F ( ) W 2 F + W 2 F 9 / 22
30 Learned Filters Filter Responses Convolution Filters Input Images 10 / 22
31 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Describe appearances CAE cannot: Input Images 10 / 22
32 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Describe appearances CAE cannot: Input Images Color Histograms 10 / 22
33 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Input Images Describe appearances CAE cannot: Color Histograms HOG 10 / 22
34 LBP 10 / 22 Learned Filters + Supplemental Nonlinear Features Filter Responses Convolution Filters Input Images Describe appearances CAE cannot: Color Histograms HOG
35 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters 11 / 22
36 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22
37 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Train Per-Trait Classifiers Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22
38 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Train Per-Trait Classifiers Extract Features Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22
39 Material Trait Recognition Process Training data: Flickr Materials Database (FMD) [7] images with trait annotations Learn Filters Select Features Train Per-Trait Classifiers Extract Features Recognize Traits Trait CAE Oriented HOG LBP Color Histograms Shiny Fuzzy Transparent (13 Material Traits ) Total Uses / 22
40 Material Trait Recognition Accuracy Accuracy Fuzzy 4 Shiny Smooth Soft Striped Metallic Organic Translucent Transparent Rough Liquid Woven Manmade 12 / 22
41 Patch Recognition Results Shiny Fuzzy Metallic Soft Smooth Liquid Rough Woven False Positives True Positives Shiny Fuzzy Metallic Soft Smooth Liquid Rough Woven 13 / 22
42 Per-Pixel Material Trait Maps Image from [5]. 14 / 22
43 Per-Pixel Material Trait Maps Fuzzy Organic Smooth Image from [5]. 14 / 22
44 Per-Pixel Material Trait Maps Image from [7]. 15 / 22
45 Per-Pixel Material Trait Maps Shiny Metallic Smooth Image from [7]. 15 / 22
46 What can we do with these material traits? 16 / 22
47 Material Recognition via Trait Distributions Smooth Rough Organic Soft Foliage Stone Plastic p ( trait i category j ) 17 / 22
48 Material Recognition Accuracy: Flickr Fabric Foliage Glass Leather Metal Paper Plastic Stone Water Wood Fabric Foliage Glass Leather Metal Paper Plastic Stone Water Wood Average: 49.2% [Liu et al.] (w/o obj): 42.6% (w/obj): 57.1% Images from [7]. 18 / 22
49 Material Recognition Accuracy: ImageNet Foliage Glass 0.8 Paper Plastic Stone Water Wood Foliage Glass Paper Plastic Stone Water Average: 60.5% Wood Images from [2]. 19 / 22
50 Segmentation with Material Traits Baseline NCuts Images from [5]. With Traits 20 / 22
51 Summary Material traits: may be recognized locally and accurately 21 / 22
52 Summary Material traits: may be recognized locally and accurately have distributions that encode material categories 21 / 22
53 Summary Material traits: may be recognized locally and accurately have distributions that encode material categories segment images into intuitively separate regions 21 / 22
54 Summary Material traits: may be recognized locally and accurately have distributions that encode material categories segment images into intuitively separate regions Future work: Discover new traits Improve applications 21 / 22
55 References E. H. Adelson. On Seeing Stuff: The Perception of Materials by Humans and Machines. In SPIE, pages 1 12, J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, D. Hu, L. Bo, and X. Ren. Toward Robust Material Recognition for Everyday Objects. In BMVC, pages , C. Liu, L. Sharan, E. H. Adelson, and R. Rosenholtz. Exploring Features in a Bayesian Framework for Material Recognition. In CVPR, pages , D. Martin, C. Fowlkes, D. Tal, and J. Malik. A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics. In ICCV, pages L. Sharan, C. Liu, R. Rosenholtz, and E. H. Adelson. Recognizing Materials Using Perceptually Inspired Features. International Journal of Computer Vision, L. Sharan, R. Rosenholtz, and E. Adelson. Material Perception: What Can You See in a Brief Glance? Journal of Vision, 9(8):784, / 22
Visual Material Traits: Recognizing Per-Pixel Material Context
Visual Material Traits: Recognizing Per-Pixel Material Context Gabriel Schwartz Drexel University Ko Nishino Drexel University gbs25@drexel.edu kon@drexel.edu Abstract Information describing the materials
More informationReal-World Material Recognition for Scene Understanding
Real-World Material Recognition for Scene Understanding Abstract Sam Corbett-Davies Department of Computer Science Stanford University scorbett@stanford.edu In this paper we address the problem of recognizing
More informationToward Robust Material Recognition for Everyday Objects
HU, BO, REN: EVERYDAY MATERIAL RECOGNITION 1 Toward Robust Material Recognition for Everyday Objects Diane Hu http://cseweb.ucsd.edu/~dhu/ Liefeng Bo http://www.cs.washington.edu/homes/lfb/ Xiaofeng Ren
More informationPARTIAL STYLE TRANSFER USING WEAKLY SUPERVISED SEMANTIC SEGMENTATION. Shin Matsuo Wataru Shimoda Keiji Yanai
PARTIAL STYLE TRANSFER USING WEAKLY SUPERVISED SEMANTIC SEGMENTATION Shin Matsuo Wataru Shimoda Keiji Yanai Department of Informatics, The University of Electro-Communications, Tokyo 1-5-1 Chofugaoka,
More informationSolution: filter the image, then subsample F 1 F 2. subsample blur subsample. blur
Pyramids Gaussian pre-filtering Solution: filter the image, then subsample blur F 0 subsample blur subsample * F 0 H F 1 F 1 * H F 2 { Gaussian pyramid blur F 0 subsample blur subsample * F 0 H F 1 F 1
More informationMaterial Recognition in the Wild with the Materials in Context Database
Material Recognition in the Wild with the Materials in Context Database Sean Bell Paul Upchurch Noah Snavely Kavita Bala Department of Computer Science, Cornell University {sbell,paulu,snavely,kb}@cs.cornell.edu
More informationMaterial Recognition: Bayesian Inference or SVMs?
Material Recognition: Bayesian Inference or SVMs? Ishrat Badami Supervised by: Michael Weinmann, Reinhard Klein Institute of Computer Graphics University of Bonn Abstract Material recognition is an important
More informationAttributes and More Crowdsourcing
Attributes and More Crowdsourcing Computer Vision CS 143, Brown James Hays Many slides from Derek Hoiem Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed.
More informationSupplementary Materials for Salient Object Detection: A
Supplementary Materials for Salient Object Detection: A Discriminative Regional Feature Integration Approach Huaizu Jiang, Zejian Yuan, Ming-Ming Cheng, Yihong Gong Nanning Zheng, and Jingdong Wang Abstract
More informationAttributes. Computer Vision. James Hays. Many slides from Derek Hoiem
Many slides from Derek Hoiem Attributes Computer Vision James Hays Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed. Human-in-the-loop recognition:
More informationAttribute learning in large-scale datasets. Olga Russakovsky and Li Fei-Fei
Attribute learning in large-scale datasets Olga Russakovsky and Li Fei-Fei Categorization of the visual world Berry Fruit Entity Tree Instrument Furniture Categorization of the visual world Berry Fruit
More informationTwo Routes for Image to Image Translation: Rule based vs. Learning based. Minglun Gong, Memorial Univ. Collaboration with Mr.
Two Routes for Image to Image Translation: Rule based vs. Learning based Minglun Gong, Memorial Univ. Collaboration with Mr. Zili Yi Introduction A brief history of image processing Image to Image translation
More informationReconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling
Reconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling Michael Maire 1,2 Stella X. Yu 3 Pietro Perona 2 1 TTI Chicago 2 California Institute of Technology 3 University of California
More informationJOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA
JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS Zhao Chen Machine Learning Intern, NVIDIA ABOUT ME 5th year PhD student in physics @ Stanford by day, deep learning computer vision scientist
More informationSeeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models
Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)
More informationSketchable Histograms of Oriented Gradients for Object Detection
Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The
More informationFaster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun Presented by Tushar Bansal Objective 1. Get bounding box for all objects
More informationLarge-Scale Traffic Sign Recognition based on Local Features and Color Segmentation
Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,
More informationDiscovering Visual Hierarchy through Unsupervised Learning Haider Razvi
Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi hrazvi@stanford.edu 1 Introduction: We present a method for discovering visual hierarchy in a set of images. Automatically grouping
More informationRecognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)
Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding
More informationarxiv: v1 [cs.cv] 22 Apr 2015
LOAD: Local Orientation Adaptive Descriptor for Texture and Material Classification Xianbiao Qi a,b, Guoying Zhao a, Linlin Shen b, Qingquan Li b, Matti Pietikäinen a arxiv:1504.05809v1 [cs.cv] 22 Apr
More information(Deep) Learning for Robot Perception and Navigation. Wolfram Burgard
(Deep) Learning for Robot Perception and Navigation Wolfram Burgard Deep Learning for Robot Perception (and Navigation) Lifeng Bo, Claas Bollen, Thomas Brox, Andreas Eitel, Dieter Fox, Gabriel L. Oliveira,
More informationEfficient Similarity Search in Scientific Databases with Feature Signatures
DATA MANAGEMENT AND DATA EXPLORATION GROUP Prof. Dr. rer. nat. Thomas Seidl DATA MANAGEMENT AND DATA EXPLORATION GROUP Prof. Dr. rer. nat. Thomas Seidl Efficient Similarity Search in Scientific Databases
More informationThe Caltech-UCSD Birds Dataset
The Caltech-UCSD Birds-200-2011 Dataset Catherine Wah 1, Steve Branson 1, Peter Welinder 2, Pietro Perona 2, Serge Belongie 1 1 University of California, San Diego 2 California Institute of Technology
More informationPart Localization by Exploiting Deep Convolutional Networks
Part Localization by Exploiting Deep Convolutional Networks Marcel Simon, Erik Rodner, and Joachim Denzler Computer Vision Group, Friedrich Schiller University of Jena, Germany www.inf-cv.uni-jena.de Abstract.
More information24 hours of Photo Sharing. installation by Erik Kessels
24 hours of Photo Sharing installation by Erik Kessels And sometimes Internet photos have useful labels Im2gps. Hays and Efros. CVPR 2008 But what if we want more? Image Categorization Training Images
More informationarxiv: v1 [cs.cv] 24 Aug 2016
arxiv:1608.06985v1 [cs.cv] 24 Aug 2016 A 4D Light-Field Dataset and CNN Architectures for Material Recognition Ting-Chun Wang 1, Jun-Yan Zhu 1, Ebi Hiroaki 2, Manmohan Chandraker 2, Alexei A. Efros 1,
More informationarxiv: v1 [cs.cv] 10 Jul 2014
ARTOS Adaptive Real-Time Object Detection System Björn Barz Erik Rodner bjoern.barz@uni-jena.de erik.rodner@uni-jena.de arxiv:1407.2721v1 [cs.cv] 10 Jul 2014 Joachim Denzler Computer Vision Group Friedrich
More informationLatest development in image feature representation and extraction
International Journal of Advanced Research and Development ISSN: 2455-4030, Impact Factor: RJIF 5.24 www.advancedjournal.com Volume 2; Issue 1; January 2017; Page No. 05-09 Latest development in image
More informationUnified, real-time object detection
Unified, real-time object detection Final Project Report, Group 02, 8 Nov 2016 Akshat Agarwal (13068), Siddharth Tanwar (13699) CS698N: Recent Advances in Computer Vision, Jul Nov 2016 Instructor: Gaurav
More informationLinear combinations of simple classifiers for the PASCAL challenge
Linear combinations of simple classifiers for the PASCAL challenge Nik A. Melchior and David Lee 16 721 Advanced Perception The Robotics Institute Carnegie Mellon University Email: melchior@cmu.edu, dlee1@andrew.cmu.edu
More informationDeep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong
Deep Learning for Face Recognition Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong Deep Learning Results on LFW Method Accuracy (%) # points # training images Huang
More informationDeformable Part Models
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones
More informationarxiv: v2 [cs.cv] 25 Aug 2014
ARTOS Adaptive Real-Time Object Detection System Björn Barz Erik Rodner bjoern.barz@uni-jena.de erik.rodner@uni-jena.de arxiv:1407.2721v2 [cs.cv] 25 Aug 2014 Joachim Denzler Computer Vision Group Friedrich
More informationMulti-Glance Attention Models For Image Classification
Multi-Glance Attention Models For Image Classification Chinmay Duvedi Stanford University Stanford, CA cduvedi@stanford.edu Pararth Shah Stanford University Stanford, CA pararth@stanford.edu Abstract We
More informationObject Recognition with Deformable Models
Object Recognition with Deformable Models Pedro F. Felzenszwalb Department of Computer Science University of Chicago Joint work with: Dan Huttenlocher, Joshua Schwartz, David McAllester, Deva Ramanan.
More informationFIGARO, HAIR DETECTION AND SEGMENTATION IN THE WILD
FIGARO, HAIR DETECTION AND SEGMENTATION IN THE WILD Michele Svanera, Umar Riaz Muhammad, Riccardo Leonardi, and Sergio Benini Department of Information Engineering, University of Brescia, Italy Email:
More informationFast Border Ownership Assignment with Bio-Inspired Features
Fast Border Ownership Assignment with Bio-Inspired Features Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos Computer Vision Lab and UMIACS University of Maryland College Park What is Border Ownership?
More informationPreviously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011
Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition
More informationA Novel Representation and Pipeline for Object Detection
A Novel Representation and Pipeline for Object Detection Vishakh Hegde Stanford University vishakh@stanford.edu Manik Dhar Stanford University dmanik@stanford.edu Abstract Object detection is an important
More informationOne Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models
One Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models [Supplemental Materials] 1. Network Architecture b ref b ref +1 We now describe the architecture of the networks
More informationCS 2770: Computer Vision. Edges and Segments. Prof. Adriana Kovashka University of Pittsburgh February 21, 2017
CS 2770: Computer Vision Edges and Segments Prof. Adriana Kovashka University of Pittsburgh February 21, 2017 Edges vs Segments Figure adapted from J. Hays Edges vs Segments Edges More low-level Don t
More informationROTATION INVARIANT SPARSE CODING AND PCA
ROTATION INVARIANT SPARSE CODING AND PCA NATHAN PFLUEGER, RYAN TIMMONS Abstract. We attempt to encode an image in a fashion that is only weakly dependent on rotation of objects within the image, as an
More informationDetection III: Analyzing and Debugging Detection Methods
CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can
More informationArtistic ideation based on computer vision methods
Journal of Theoretical and Applied Computer Science Vol. 6, No. 2, 2012, pp. 72 78 ISSN 2299-2634 http://www.jtacs.org Artistic ideation based on computer vision methods Ferran Reverter, Pilar Rosado,
More informationProceedings of the International MultiConference of Engineers and Computer Scientists 2018 Vol I IMECS 2018, March 14-16, 2018, Hong Kong
, March 14-16, 2018, Hong Kong , March 14-16, 2018, Hong Kong , March 14-16, 2018, Hong Kong , March 14-16, 2018, Hong Kong TABLE I CLASSIFICATION ACCURACY OF DIFFERENT PRE-TRAINED MODELS ON THE TEST DATA
More informationRecap from Monday. Visualizing Networks Caffe overview Slides are now online
Recap from Monday Visualizing Networks Caffe overview Slides are now online Today Edges and Regions, GPB Fast Edge Detection Using Structured Forests Zhihao Li Holistically-Nested Edge Detection Yuxin
More informationCost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling
[DOI: 10.2197/ipsjtcva.7.99] Express Paper Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling Takayoshi Yamashita 1,a) Takaya Nakamura 1 Hiroshi Fukui 1,b) Yuji
More informationLarge-scale Video Classification with Convolutional Neural Networks
Large-scale Video Classification with Convolutional Neural Networks Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei Note: Slide content mostly from : Bay Area
More informationBeyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University
Beyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University Why Image Retrieval? World Wide Web: Millions of hosts Billions of images Growth of video
More informationTRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK
TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK 1 Po-Jen Lai ( 賴柏任 ), 2 Chiou-Shann Fuh ( 傅楸善 ) 1 Dept. of Electrical Engineering, National Taiwan University, Taiwan 2 Dept.
More informationFeature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking
Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)
More informationPairwise Rotation Invariant Co-occurrence Local Binary Pattern
Pairwise Rotation Invariant Co-occurrence Local Binary Pattern Xianbiao Qi 1, Rong Xiao 2, Jun Guo 1, and Lei Zhang 2 1 Beijing University of Posts and Telecommunications, Beijing 100876, P.R. China 2
More informationOBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES
Image Processing & Communication, vol. 18,no. 1, pp.31-36 DOI: 10.2478/v10248-012-0088-x 31 OBJECT RECOGNITION ALGORITHM FOR MOBILE DEVICES RAFAŁ KOZIK ADAM MARCHEWKA Institute of Telecommunications, University
More informationLight Introduction. Read this article for more background information:
Light Introduction Read this article for more background information: Color Absorption Article Watch the following video on terms like Absorption, Reflection, and Transmission Video 1 Waves 1 Stephanie
More informationCategory-level localization
Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object
More informationPhoto OCR ( )
Photo OCR (2017-2018) Xiang Bai Huazhong University of Science and Technology Outline VALSE2018, DaLian Xiang Bai 2 Deep Direct Regression for Multi-Oriented Scene Text Detection [He et al., ICCV, 2017.]
More informationSupervised texture detection in images
Supervised texture detection in images Branislav Mičušík and Allan Hanbury Pattern Recognition and Image Processing Group, Institute of Computer Aided Automation, Vienna University of Technology Favoritenstraße
More informationTHE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM
THE SPEED-LIMIT SIGN DETECTION AND RECOGNITION SYSTEM Kuo-Hsin Tu ( 塗國星 ), Chiou-Shann Fuh ( 傅楸善 ) Dept. of Computer Science and Information Engineering, National Taiwan University, Taiwan E-mail: p04922004@csie.ntu.edu.tw,
More informationResearch of Traffic Flow Based on SVM Method. Deng-hong YIN, Jian WANG and Bo LI *
2017 2nd International onference on Artificial Intelligence: Techniques and Applications (AITA 2017) ISBN: 978-1-60595-491-2 Research of Traffic Flow Based on SVM Method Deng-hong YIN, Jian WANG and Bo
More informationConveying 3D Shape and Depth with Textured and Transparent Surfaces Victoria Interrante
Conveying 3D Shape and Depth with Textured and Transparent Surfaces Victoria Interrante In scientific visualization, there are many applications in which researchers need to achieve an integrated understanding
More informationCOMP 551 Applied Machine Learning Lecture 16: Deep Learning
COMP 551 Applied Machine Learning Lecture 16: Deep Learning Instructor: Ryan Lowe (ryan.lowe@cs.mcgill.ca) Slides mostly by: Class web page: www.cs.mcgill.ca/~hvanho2/comp551 Unless otherwise noted, all
More informationAggregating Descriptors with Local Gaussian Metrics
Aggregating Descriptors with Local Gaussian Metrics Hideki Nakayama Grad. School of Information Science and Technology The University of Tokyo Tokyo, JAPAN nakayama@ci.i.u-tokyo.ac.jp Abstract Recently,
More informationTexture April 17 th, 2018
Texture April 17 th, 2018 Yong Jae Lee UC Davis Announcements PS1 out today Due 5/2 nd, 11:59 pm start early! 2 Review: last time Edge detection: Filter for gradient Threshold gradient magnitude, thin
More informationTexture C H A P T E R 6
C H A P T E R 6 Texture Texture is a phenomenon that is widespread, easy to recognise, and hard to define. Typically, whether an effect is referred to as texture or not depends on the scale at which it
More informationSpatial Latent Dirichlet Allocation
Spatial Latent Dirichlet Allocation Xiaogang Wang and Eric Grimson Computer Science and Computer Science and Artificial Intelligence Lab Massachusetts Tnstitute of Technology, Cambridge, MA, 02139, USA
More informationBeyond Bags of Features
: for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015
More informationLearning the Ecological Statistics of Perceptual Organization
Learning the Ecological Statistics of Perceptual Organization Charless Fowlkes work with David Martin, Xiaofeng Ren and Jitendra Malik at University of California at Berkeley 1 How do ideas from perceptual
More informationDeep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks
Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin
More informationThe attributes of objects. D.A. Forsyth, UIUC channelling Derek Hoiem, UIUC, with Ali Farhadi, Ian Endres, Gang Wang all of UIUC
The attributes of objects D.A. Forsyth, UIUC channelling Derek Hoiem, UIUC, with Ali Farhadi, Ian Endres, Gang Wang all of UIUC Obtain dataset Build features Mess around with classifiers, probability,
More informationDeep learning for object detection. Slides from Svetlana Lazebnik and many others
Deep learning for object detection Slides from Svetlana Lazebnik and many others Recent developments in object detection 80% PASCAL VOC mean0average0precision0(map) 70% 60% 50% 40% 30% 20% 10% Before deep
More informationAutomatic Ranking of Images on the Web
Automatic Ranking of Images on the Web HangHang Zhang Electrical Engineering Department Stanford University hhzhang@stanford.edu Zixuan Wang Electrical Engineering Department Stanford University zxwang@stanford.edu
More informationDiscriminative Illumination:
Discriminative Illumination: Per-pixel Classification of Raw Materials based on Optimal Projections of Spectral BRDFs Jinwei Gu and Chao Liu Munsell Color Science Laboratory Chester F. Carlson Center for
More informationLecture 6: Texture. Tuesday, Sept 18
Lecture 6: Texture Tuesday, Sept 18 Graduate students Problem set 1 extension ideas Chamfer matching Hierarchy of shape prototypes, search over translations Comparisons with Hausdorff distance, L1 on
More informationVisual Object Recognition
Visual Object Recognition Lecture 3: Descriptors Per-Erik Forssén, docent Computer Vision Laboratory Department of Electrical Engineering Linköping University 2015 2014 Per-Erik Forssén Lecture 3: Descriptors
More informationCS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning
CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning Justin Chen Stanford University justinkchen@stanford.edu Abstract This paper focuses on experimenting with
More informationDetecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning
Detecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning Matthias Perkonigg 1, Johannes Hofmanninger 1, Björn Menze 2, Marc-André Weber 3, and Georg Langs 1 1 Computational Imaging Research
More informationMultiview Feature Learning
Multiview Feature Learning Roland Memisevic Frankfurt, Montreal Tutorial at IPAM 2012 Roland Memisevic (Frankfurt, Montreal) Multiview Feature Learning Tutorial at IPAM 2012 1 / 163 Outline 1 Introduction
More informationFilters + linear algebra
Filters + linear algebra Outline Efficiency (pyramids, separability, steerability) Linear algebra Bag-of-words Recall: Canny Derivative-of-Gaussian = Gaussian * [1-1] Ideal image +noise filtered Fundamental
More informationarxiv: v1 [cs.mm] 12 Jan 2016
Learning Subclass Representations for Visually-varied Image Classification Xinchao Li, Peng Xu, Yue Shi, Martha Larson, Alan Hanjalic Multimedia Information Retrieval Lab, Delft University of Technology
More informationThe SIFT (Scale Invariant Feature
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor developed by David Lowe University of British Columbia Initial paper ICCV 1999 Newer journal paper IJCV 2004 Review: Matt Brown s Canonical
More informationLearning Convolutional Feature Hierarchies for Visual Recognition
Learning Convolutional Feature Hierarchies for Visual Recognition Koray Kavukcuoglu, Pierre Sermanet, Y-Lan Boureau, Karol Gregor, Michael Mathieu, Yann LeCun Computer Science Department Courant Institute
More informationFlow Estimation. Min Bai. February 8, University of Toronto. Min Bai (UofT) Flow Estimation February 8, / 47
Flow Estimation Min Bai University of Toronto February 8, 2016 Min Bai (UofT) Flow Estimation February 8, 2016 1 / 47 Outline Optical Flow - Continued Min Bai (UofT) Flow Estimation February 8, 2016 2
More informationLearning Visual Semantics: Models, Massive Computation, and Innovative Applications
Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Part II: Visual Features and Representations Liangliang Cao, IBM Watson Research Center Evolvement of Visual Features
More informationBoundaries and Sketches
Boundaries and Sketches Szeliski 4.2 Computer Vision James Hays Many slides from Michael Maire, Jitendra Malek Today s lecture Segmentation vs Boundary Detection Why boundaries / Grouping? Recap: Canny
More informationPlaces Challenge 2017
Places Challenge 2017 Scene Parsing Task CASIA_IVA_JD Jun Fu, Jing Liu, Longteng Guo, Haijie Tian, Fei Liu, Hanqing Lu Yong Li, Yongjun Bao, Weipeng Yan National Laboratory of Pattern Recognition, Institute
More informationObject Detection by 3D Aspectlets and Occlusion Reasoning
Object Detection by 3D Aspectlets and Occlusion Reasoning Yu Xiang University of Michigan Silvio Savarese Stanford University In the 4th International IEEE Workshop on 3D Representation and Recognition
More informationTraffic Sign Localization and Classification Methods: An Overview
Traffic Sign Localization and Classification Methods: An Overview Ivan Filković University of Zagreb Faculty of Electrical Engineering and Computing Department of Electronics, Microelectronics, Computer
More informationPart-based and local feature models for generic object recognition
Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza
More informationDEEP BLIND IMAGE QUALITY ASSESSMENT
DEEP BLIND IMAGE QUALITY ASSESSMENT BY LEARNING SENSITIVITY MAP Jongyoo Kim, Woojae Kim and Sanghoon Lee ICASSP 2018 Deep Learning and Convolutional Neural Networks (CNNs) SOTA in computer vision & image
More informationMRFs and Segmentation with Graph Cuts
02/24/10 MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class Finish up EM MRFs w ij i Segmentation with Graph Cuts j EM Algorithm: Recap
More informationRegion-based Segmentation and Object Detection
Region-based Segmentation and Object Detection Stephen Gould 1 Tianshi Gao 1 Daphne Koller 2 1 Department of Electrical Engineering, Stanford University 2 Department of Computer Science, Stanford University
More informationCrowdsourcing Annotations for Visual Object Detection
Crowdsourcing Annotations for Visual Object Detection Hao Su, Jia Deng, Li Fei-Fei Computer Science Department, Stanford University Abstract A large number of images with ground truth object bounding boxes
More informationExploiting noisy web data for largescale visual recognition
Exploiting noisy web data for largescale visual recognition Lamberto Ballan University of Padova, Italy CVPRW WebVision - Jul 26, 2017 Datasets drive computer vision progress ImageNet Slide credit: O.
More informationLEARNING BOUNDARIES WITH COLOR AND DEPTH. Zhaoyin Jia, Andrew Gallagher, Tsuhan Chen
LEARNING BOUNDARIES WITH COLOR AND DEPTH Zhaoyin Jia, Andrew Gallagher, Tsuhan Chen School of Electrical and Computer Engineering, Cornell University ABSTRACT To enable high-level understanding of a scene,
More informationPartial Least Squares Regression on Grassmannian Manifold for Emotion Recognition
Emotion Recognition In The Wild Challenge and Workshop (EmotiW 2013) Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition Mengyi Liu, Ruiping Wang, Zhiwu Huang, Shiguang Shan,
More informationPerson Retrieval in Surveillance Video using Height, Color and Gender
Person Retrieval in Surveillance Video using Height, Color and Gender Hiren Galiyawala 1, Kenil Shah 2, Vandit Gajjar 1, Mehul S. Raval 1 1 School of Engineering and Applied Science, Ahmedabad University,
More informationTri-modal Human Body Segmentation
Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4
More informationECS 289H: Visual Recognition Fall Yong Jae Lee Department of Computer Science
ECS 289H: Visual Recognition Fall 2014 Yong Jae Lee Department of Computer Science Plan for today Questions? Research overview Standard supervised visual learning building Category models Annotators tree
More informationColor Local Texture Features Based Face Recognition
Color Local Texture Features Based Face Recognition Priyanka V. Bankar Department of Electronics and Communication Engineering SKN Sinhgad College of Engineering, Korti, Pandharpur, Maharashtra, India
More information