Adapting Deep Networks Across Domains, Modalities, and Tasks Judy Hoffman!

Size: px
Start display at page:

Download "Adapting Deep Networks Across Domains, Modalities, and Tasks Judy Hoffman!"

Transcription

1 Adapting Deep Networks Across Domains, Modalities, and Tasks Judy Hoffman! Eric Tzeng Saurabh Gupta Kate Saenko Trevor Darrell

2 Recent Visual Recognition Progress ImageNet Performance 100 Accuracy

3 Recent Visual Recognition Progress ImageNet Performance Accuracy Deep models

4 Deep Visual Models [LeCuN 89, 98] [Hubel and Wisel 59] [Krizhevsky 2012] [Szegedy 2014] [Simonyan 2014]

5 Deep Visual Models [LeCuN 89, 98] [Hubel and Wisel 59] [Krizhevsky 2012] [Szegedy 2014] [Simonyan 2014]

6 Domain Adaptation: Train on source adapt to target backpack chair bike Source Domain PS (X, Y ) lots of labeled data DS = {(xi, yi ), 8i 2 {1,..., N }}

7 Domain Adaptation: Train on source adapt to target backpack chair bike Source Domain PS (X, Y ) lots of labeled data DS = {(xi, yi ), 8i 2 {1,..., N }}?? bike Target Domain PT (Z, H) unlabeled or limited labels DT = {(zj,?), 8j 2 {1,..., M }}

8 Domain Adaptation: Train on source adapt to target Adapt backpack chair bike Source Domain PS (X, Y ) lots of labeled data DS = {(xi, yi ), 8i 2 {1,..., N }}?? bike Target Domain PT (Z, H) unlabeled or limited labels DT = {(zj,?), 8j 2 {1,..., M }}

9 Prior work: domain adaptation Minimizing distribution distance! Borgwardt`06, Mansour`09, Pan`09, Fernando`13 Deep model adaptation Chopra`13, Tzeng`14, Long`15, Ganin`15

10 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) [ICCV 2015]

11 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) x i [ICCV 2015]

12 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) c object classifier x i [ICCV 2015]

13 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) z j c object classifier x i [ICCV 2015]

14 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) z j Discrepency c object classifier x i [ICCV 2015]

15 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) z j c object classifier x i D domain classifier [ICCV 2015]

16 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) z j q s i =[1, 0] q t j =[0, 1] x i D domain classifier [ICCV 2015]

17 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S H(U(D),q s i )+ X z j 2T H(U(D),q t j) Cross-entropy with uniform distribution z j x i D domain classifier [ICCV 2015]

18 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S z j H(U(D),q s i )+ X z j 2T H(U(D),q t j) x i D domain classifier [ICCV 2015]

19 Adapting across domains minimize discrepancy min repr L conf (x, z, D ; repr )= X x i 2S z j H(U(D),q s i )+ X z j 2T H(U(D),q t j) c object classifier x i [ICCV 2015]

20 Adapting across domains minimize discrepancy backpack chair bike conv1 conv5 fc6 fc7 source data fc8 classification loss Source Data

21 Adapting across domains minimize discrepancy backpack chair bike conv1 conv5 fc6 fc7 source data fc8 Source Data shared shared shared shared shared classification loss backpack Target Data conv1 conv5 fc6 fc7 fc8 labeled target data?

22 Adapting across domains minimize discrepancy backpack chair bike conv1 conv5 fc6 fc7 source data fc8 backpack Source Data Target Data shared shared shared shared conv1 conv5 fc6 fc7 source data all target data fcd domain confusion loss domain classifier loss shared fc8 classification loss labeled target data?

23 Verify confusion

24 Domain Adaptation: Train on source adapt to target Adapt backpack chair bike Source Domain PS (X, Y ) lots of labeled data DS = {(xi, yi ), 8i 2 {1,..., N }}?? bike Target Domain PT (Z, H) unlabeled or limited labels DT = {(zj,?), 8j 2 {1,..., M }}

25 Domain Adaptation: Train on source adapt to target Adapt backpack chair bike Source Domain PS (X, Y ) lots of labeled data DS = {(xi, yi ), 8i 2 {1,..., N }}?? bike Target Domain PT (Z, H) unlabeled or limited labels DT = {(zj,?), 8j 2 {1,..., M }}

26 Standard supervised deep loss H(h, p) =E h [ log p] [ICCV 2015]

27 Standard supervised deep loss H(h, p) =E h [ log p] z j p j conv1 conv5 fc6 fc7 fc8 Bottle MugChair Laptop Keyboard [ICCV 2015]

28 Standard supervised deep loss H(h, p) =E h [ log p] z j p j conv1 conv5 fc6 fc7 fc8 Bottle MugChair Laptop Keyboard bottle h j Bottle MugChair Laptop Keyboard [ICCV 2015]

29 Source Softlabels Source CNN softmax high temp Bottle MugChair Laptop Keyboard Source CNN softmax high temp Bottle MugChair Laptop Keyboard + Bottle MugChair Laptop Keyboard source bottle examples Source CNN softmax high temp Bottle MugChair Laptop Keyboard

30 Standard supervised deep loss H(h, p) =E h [ log p] z j p j conv1 conv5 fc6 fc7 fc8 Bottle MugChair Laptop Keyboard bottle Bottle MugChair h j Laptop Keyboard [ICCV 2015]

31 Class correlation transfer loss H(h, p) =E h [ log p] z j p j conv1 conv5 fc6 fc7 fc8 Bottle MugChair Laptop Keyboard bottle h j Bottle MugChair Laptop Keyboard source softlabel [ICCV 2015]

32 Class correlation transfer loss backpack chair bike conv1 conv5 fc6 fc7 source data fc8 source data Source Data shared shared shared shared fcd domain confusion loss shared classification loss backpack Target Data conv1 conv5 fc6 fc7 all target data domain classifier loss fc8 labeled target data?

33 Class correlation transfer loss backpack chair bike conv1 conv5 fc6 fc7 source data fc8 source data Source Data shared shared shared shared fcd domain confusion loss shared classification loss backpack Target Data conv1 conv5 fc6 fc7 all target data labeled target data domain classifier loss fc8 softmax high temp softlabel loss Source softlabels?

34 Office dataset Experiment Adapting Visual Category Models to New Domains 9 31 categories laptop instance 2 instance 1 letter tray... webcam instance 5 keyboard dslr instance 5 all classes have source labeled examples 15 classes have target labeled examples evaluate on remaining 16 classes headphones amazon... file cabinet instance 2 instance domains Fig. 4. New dataset for investigating domain shifts in visual category recognition tasks. Images of objects from 31 categories are downloaded from the web as well as captured by a high definition and a low definition camera. [saenko`10] popular way to acquire data, as it allows for easy access to large amounts of

35 Office dataset Experiment A! W A! D D! A D! W W! A W! D Average MMDT [18] 44.6 ± ± 0.5 Source CNN 54.2 ± ± ± ± ± ± Ours: dom confusion only 55.2 ± ± ± ± ± ± Ours: soft labels only 56.8 ± ± ± ± ± ± Ours: dom confusion+soft labels 59.3 ± ± ± ± ± ± Table 2. Multi-class accuracy evaluation on the standard semi-supervised adaptation setting with the Office dataset. We evaluate on 16 Multiclass accuracy over 16 classes which lack target labels

36 Baseline soft label Baseline soft activation ring binder monitor Target test image back pack bike bike helmet bookcase bottle calculator desk chair desk lamp desktop computer file cabinet headphones keyboard laptop computer letter tray mobile phone monitor mouse mug paper notebook pen Ours soft label Our soft activation phone printer projector punchers ring binder ruler scissors speaker stapler tape dispenser trash can back pack bike bike helmet bookcase bottle calculator desk chair desk lamp desktop computer file cabinet headphones keyboard laptop computer letter tray mobile phone monitor mouse mug paper notebook pen phone printer projector punchers ring binder ruler scissors speaker stapler tape dispenser trash can

37 Source soft labels back pack bike bike helmet bookcase bottle calculator desk chair desk lamp desktop computer file cabinet headphones keyboard laptop computer letter tray mobile phone

38 Cross-dataset Experiment Setup Source: ImageNet! Target: Caltech256! 40 categories! Evaluate adaptation performance with 0,1,3,5 target labeled examples per class [tommasi`14]

39 ImageNet adapted to Caltech Multi-class Multiclass Accuracy Source+Target CNN Ours: softlabels only Ours: dom confusion+softlabels Number Number Labeled of Target labeled Examples target examples per Category [ICCV 2015]

40 Summary: simultaneous transfer across domains and tasks Domain confusion aligns the distributions Softlabels transfer class correlations Paper presented in poster session Wednesday 12/16 4B

41 Discrepancy due to modality shift Lots of data to train models RGB System uses model

42 Label space discrepancy lamp night-stand lamp bed bed pillow night-stand Current output System uses model [NIPS 2014, CVPR 2015]

43 Label space discrepancy lamp night-stand lamp bed bed pillow night-stand Desired output System uses model [NIPS 2014, CVPR 2015]

44 Adapting Deep Visual Models Adapting across domains Adapting across tasks lamp bed pillow night-stand Adapting across modalities Error bounds on adapted deep models Generally applicable to adaptation with deep learning in AI

45 Thank you.

Learning Transferable Features with Deep Adaptation Networks

Learning Transferable Features with Deep Adaptation Networks Learning Transferable Features with Deep Adaptation Networks Mingsheng Long, Yue Cao, Jianmin Wang, Michael I. Jordan Presented by Changyou Chen October 30, 2015 1 Changyou Chen Learning Transferable Features

More information

arxiv: v2 [cs.cv] 18 Feb 2014

arxiv: v2 [cs.cv] 18 Feb 2014 One-Shot Adaptation of Supervised Deep Convolutional Models arxiv:1312.6204v2 [cs.cv] 18 Feb 2014 Judy Hoffman, Eric Tzeng, Jeff Donahue UC Berkeley, EECS & ICSI {jhoffman,etzeng,jdonahue}@eecs.berkeley.edu

More information

arxiv: v1 [cs.cv] 6 Jul 2016

arxiv: v1 [cs.cv] 6 Jul 2016 arxiv:607.079v [cs.cv] 6 Jul 206 Deep CORAL: Correlation Alignment for Deep Domain Adaptation Baochen Sun and Kate Saenko University of Massachusetts Lowell, Boston University Abstract. Deep neural networks

More information

Deep Model Adaptation using Domain Adversarial Training

Deep Model Adaptation using Domain Adversarial Training Deep Model Adaptation using Domain Adversarial Training Victor Lempitsky, joint work with Yaroslav Ganin Skolkovo Institute of Science and Technology ( Skoltech ) Moscow region, Russia Deep supervised

More information

Unsupervised Domain Adaptation by Backpropagation. Chih-Hui Ho, Xingyu Gu, Yuan Qi

Unsupervised Domain Adaptation by Backpropagation. Chih-Hui Ho, Xingyu Gu, Yuan Qi Unsupervised Domain Adaptation by Backpropagation Chih-Hui Ho, Xingyu Gu, Yuan Qi Problems Deep network: requires massive labeled training data. Labeled data: Available sometimes: Image recognition Speech

More information

DeCAF: a Deep Convolutional Activation Feature for Generic Visual Recognition

DeCAF: a Deep Convolutional Activation Feature for Generic Visual Recognition DeCAF: a Deep Convolutional Activation Feature for Generic Visual Recognition ECS 289G 10/06/2016 Authors: Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng and Trevor Darrell

More information

Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task

Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task Kyunghee Kim Stanford University 353 Serra Mall Stanford, CA 94305 kyunghee.kim@stanford.edu Abstract We use a

More information

Learning with Side Information through Modality Hallucination

Learning with Side Information through Modality Hallucination Learning with Side Information through Modality Hallucination Judy Hoffman Saurabh Gupta Trevor Darrell EECS Department, UC Berkeley {jhoffman, sgupta, trevor}@eecs.berkeley.edu Abstract We present a modality

More information

arxiv: v1 [cs.ai] 8 Nov 2016

arxiv: v1 [cs.ai] 8 Nov 2016 Cognitive Discriminative Mappings for Rapid Learning Wen-Chieh Fang Yi-ting Chiang arxiv:1611.02512v1 [cs.ai] 8 Nov 2016 Abstract Humans can learn concepts or recognize items from just a handful of examples,

More information

Lightweight Unsupervised Domain Adaptation by Convolutional Filter Reconstruction

Lightweight Unsupervised Domain Adaptation by Convolutional Filter Reconstruction Lightweight Unsupervised Domain Adaptation by Convolutional Filter Reconstruction Rahaf Aljundi, Tinne Tuytelaars KU Leuven, ESAT-PSI - iminds, Belgium Abstract. Recently proposed domain adaptation methods

More information

Unsupervised domain adaptation of deep object detectors

Unsupervised domain adaptation of deep object detectors Unsupervised domain adaptation of deep object detectors Debjeet Majumdar 1 and Vinay P. Namboodiri2 Indian Institute of Technology, Kanpur - Computer Science and Engineering Kalyanpur, Kanpur, Uttar Pradesh

More information

Deep learning for object detection. Slides from Svetlana Lazebnik and many others

Deep learning for object detection. Slides from Svetlana Lazebnik and many others Deep learning for object detection Slides from Svetlana Lazebnik and many others Recent developments in object detection 80% PASCAL VOC mean0average0precision0(map) 70% 60% 50% 40% 30% 20% 10% Before deep

More information

Rich feature hierarchies for accurate object detection and semantic segmentation

Rich feature hierarchies for accurate object detection and semantic segmentation Rich feature hierarchies for accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik Presented by Pandian Raju and Jialin Wu Last class SGD for Document

More information

Deep Learning for Computer Vision II

Deep Learning for Computer Vision II IIIT Hyderabad Deep Learning for Computer Vision II C. V. Jawahar Paradigm Shift Feature Extraction (SIFT, HoG, ) Part Models / Encoding Classifier Sparrow Feature Learning Classifier Sparrow L 1 L 2 L

More information

Structured Prediction using Convolutional Neural Networks

Structured Prediction using Convolutional Neural Networks Overview Structured Prediction using Convolutional Neural Networks Bohyung Han bhhan@postech.ac.kr Computer Vision Lab. Convolutional Neural Networks (CNNs) Structured predictions for low level computer

More information

Geodesic Flow Kernel for Unsupervised Domain Adaptation

Geodesic Flow Kernel for Unsupervised Domain Adaptation Geodesic Flow Kernel for Unsupervised Domain Adaptation Boqing Gong University of Southern California Joint work with Yuan Shi, Fei Sha, and Kristen Grauman 1 Motivation TRAIN TEST Mismatch between different

More information

Digital Literacy. Identify types of computers, how they process information, and the purpose and function of different hardware components

Digital Literacy. Identify types of computers, how they process information, and the purpose and function of different hardware components Digital Literacy Identify types of computers, how they process information, and the purpose and function of different hardware components Computer Basics 1.01 Types of Computers Input and Output Devices

More information

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION Kingsley Kuan 1, Gaurav Manek 1, Jie Lin 1, Yuan Fang 1, Vijay Chandrasekhar 1,2 Institute for Infocomm Research, A*STAR, Singapore 1 Nanyang Technological

More information

Learning with Side Information through Modality Hallucination

Learning with Side Information through Modality Hallucination Master Seminar Report for Recent Trends in 3D Computer Vision Learning with Side Information through Modality Hallucination Judy Hoffman, Saurabh Gupta, Trevor Darrell. CVPR 2016 Nan Yang Supervisor: Benjamin

More information

RECOGNIZING HETEROGENEOUS CROSS-DOMAIN DATA VIA GENERALIZED JOINT DISTRIBUTION ADAPTATION

RECOGNIZING HETEROGENEOUS CROSS-DOMAIN DATA VIA GENERALIZED JOINT DISTRIBUTION ADAPTATION RECOGNIZING HETEROGENEOUS CROSS-DOMAIN DATA VIA GENERALIZED JOINT DISTRIBUTION ADAPTATION Yuan-Ting Hsieh, Shih-Yen Tao, Yao-Hung Hubert Tsai 2, Yi-Ren Yeh 3, Yu-Chiang Frank Wang 2 Department of Electrical

More information

By: Computer Education. Introduction Level One

By:   Computer Education. Introduction Level One Computer Education Introduction Level One What is a Computer? A computer is an electronic machine which can take in data, stores and processed the data, and then produces information Data is information

More information

arxiv: v2 [cs.cv] 17 Nov 2016 Abstract

arxiv: v2 [cs.cv] 17 Nov 2016 Abstract Beyond Sharing Weights for Deep Domain Adaptation Artem Rozantsev Mathieu Salzmann Pascal Fua Computer Vision Laboratory, École Polytechnique Fédérale de Lausanne Lausanne, Switzerland {firstname.lastname}@epfl.ch

More information

Real-time Object Detection CS 229 Course Project

Real-time Object Detection CS 229 Course Project Real-time Object Detection CS 229 Course Project Zibo Gong 1, Tianchang He 1, and Ziyi Yang 1 1 Department of Electrical Engineering, Stanford University December 17, 2016 Abstract Objection detection

More information

Volumetric and Multi-View CNNs for Object Classification on 3D Data Supplementary Material

Volumetric and Multi-View CNNs for Object Classification on 3D Data Supplementary Material Volumetric and Multi-View CNNs for Object Classification on 3D Data Supplementary Material Charles R. Qi Hao Su Matthias Nießner Angela Dai Mengyuan Yan Leonidas J. Guibas Stanford University 1. Details

More information

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin

More information

A Deep Learning Framework for Authorship Classification of Paintings

A Deep Learning Framework for Authorship Classification of Paintings A Deep Learning Framework for Authorship Classification of Paintings Kai-Lung Hua ( 花凱龍 ) Dept. of Computer Science and Information Engineering National Taiwan University of Science and Technology Taipei,

More information

arxiv: v1 [cs.cv] 20 Dec 2016

arxiv: v1 [cs.cv] 20 Dec 2016 End-to-End Pedestrian Collision Warning System based on a Convolutional Neural Network with Semantic Segmentation arxiv:1612.06558v1 [cs.cv] 20 Dec 2016 Heechul Jung heechul@dgist.ac.kr Min-Kook Choi mkchoi@dgist.ac.kr

More information

Channel Locality Block: A Variant of Squeeze-and-Excitation

Channel Locality Block: A Variant of Squeeze-and-Excitation Channel Locality Block: A Variant of Squeeze-and-Excitation 1 st Huayu Li Northern Arizona University Flagstaff, United State Northern Arizona University hl459@nau.edu arxiv:1901.01493v1 [cs.lg] 6 Jan

More information

Object detection with CNNs

Object detection with CNNs Object detection with CNNs 80% PASCAL VOC mean0average0precision0(map) 70% 60% 50% 40% 30% 20% 10% Before CNNs After CNNs 0% 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 year Region proposals

More information

Unsupervised Cross-Domain Deep Image Generation

Unsupervised Cross-Domain Deep Image Generation Unsupervised Cross-Domain Deep Image Generation Yaniv Taigman, Adam Polyak, Lior Wolf Facebook AI Research (FAIR) Tel Aviv Supervised Learning; {Xi, yi} àf Face Recognition (DeepFace / FAIR) Kaiming et

More information

Object Detection Based on Deep Learning

Object Detection Based on Deep Learning Object Detection Based on Deep Learning Yurii Pashchenko AI Ukraine 2016, Kharkiv, 2016 Image classification (mostly what you ve seen) http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf

More information

CS 2750: Machine Learning. Neural Networks. Prof. Adriana Kovashka University of Pittsburgh April 13, 2016

CS 2750: Machine Learning. Neural Networks. Prof. Adriana Kovashka University of Pittsburgh April 13, 2016 CS 2750: Machine Learning Neural Networks Prof. Adriana Kovashka University of Pittsburgh April 13, 2016 Plan for today Neural network definition and examples Training neural networks (backprop) Convolutional

More information

Deep Neural Networks:

Deep Neural Networks: Deep Neural Networks: Part II Convolutional Neural Network (CNN) Yuan-Kai Wang, 2016 Web site of this course: http://pattern-recognition.weebly.com source: CNN for ImageClassification, by S. Lazebnik,

More information

An Exploration of Computer Vision Techniques for Bird Species Classification

An Exploration of Computer Vision Techniques for Bird Species Classification An Exploration of Computer Vision Techniques for Bird Species Classification Anne L. Alter, Karen M. Wang December 15, 2017 Abstract Bird classification, a fine-grained categorization task, is a complex

More information

Subspace Alignment Based Domain Adaptation for RCNN Detector

Subspace Alignment Based Domain Adaptation for RCNN Detector RAJ, NAMBOODIRI, TUYTELAARS: ADAPTING RCNN DETECTOR 1 Subspace Alignment Based Domain Adaptation for RCNN Detector Anant Raj anantraj@iitk.ac.in Vinay P. Namboodiri vinaypn@iitk.ac.in Tinne Tuytelaars

More information

arxiv: v2 [cs.cv] 28 Mar 2018

arxiv: v2 [cs.cv] 28 Mar 2018 Importance Weighted Adversarial Nets for Partial Domain Adaptation Jing Zhang, Zewei Ding, Wanqing Li, Philip Ogunbona Advanced Multimedia Research Lab, University of Wollongong, Australia jz60@uowmail.edu.au,

More information

AttentionNet for Accurate Localization and Detection of Objects. (To appear in ICCV 2015)

AttentionNet for Accurate Localization and Detection of Objects. (To appear in ICCV 2015) AttentionNet for Accurate Localization and Detection of Objects. (To appear in ICCV 2015) Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony Paek, In So Kweon. State-of-the-art frameworks for object

More information

Landmarks-based Kernelized Subspace Alignment for Unsupervised Domain Adaptation

Landmarks-based Kernelized Subspace Alignment for Unsupervised Domain Adaptation Landmarks-based Kernelized Subspace Alignment for Unsupervised Domain Adaptation Rahaf Aljundi, Re mi Emonet, Damien Muselet and Marc Sebban rahaf.aljundi@gmail.com remi.emonet, damien.muselet, marc.sebban

More information

Learning Partially Shared Dictionaries for Domain Adaptation

Learning Partially Shared Dictionaries for Domain Adaptation Learning Partially Shared Dictionaries for Domain Adaptation Viresh Ranjan 1 Gaurav Harit 2 C. V. Jawahar 1 1 CVIT, IIIT Hyderabad, India 2 IIT Jodhpur, India Abstract. Real world applicability of many

More information

Unified Deep Supervised Domain Adaptation and Generalization

Unified Deep Supervised Domain Adaptation and Generalization Unified Deep Supervised Domain Adaptation and Generalization Saeid Motiian Marco Piccirilli Donald A. Adjeroh Gianfranco Doretto West Virginia University Morgantown, WV 26508 {samotiian, mpiccir1, daadjeroh,

More information

Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification

Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification Xiaodong Yang, Pavlo Molchanov, Jan Kautz INTELLIGENT VIDEO ANALYTICS Surveillance event detection Human-computer interaction

More information

Return of the Devil in the Details: Delving Deep into Convolutional Nets

Return of the Devil in the Details: Delving Deep into Convolutional Nets Return of the Devil in the Details: Delving Deep into Convolutional Nets Ken Chatfield - Karen Simonyan - Andrea Vedaldi - Andrew Zisserman University of Oxford The Devil is still in the Details 2011 2014

More information

Domain Adapta,on in a deep learning context. Tinne Tuytelaars

Domain Adapta,on in a deep learning context. Tinne Tuytelaars Domain Adapta,on in a deep learning context Tinne Tuytelaars Work in collabora,on with T. Tommasi, N. Patricia, B. Caputo, T. Tuytelaars "A Deeper Look at Dataset Bias" GCPR 2015 A. Raj, V. Namboodiri

More information

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

Deep Learning for Computer Vision with MATLAB By Jon Cherrie Deep Learning for Computer Vision with MATLAB By Jon Cherrie 2015 The MathWorks, Inc. 1 Deep learning is getting a lot of attention "Dahl and his colleagues won $22,000 with a deeplearning system. 'We

More information

Open Set Domain Adaptation

Open Set Domain Adaptation Open Set Domain Adaptation Pau Panareda Busto 1,2 1 Airbus Group Innovations Munich, Germany pau.panareda-busto@airbus.com Juergen Gall 2 2 Computer Vision Group University of Bonn, Germany gall@iai.uni-bonn.de

More information

MULTI-LEVEL 3D CONVOLUTIONAL NEURAL NETWORK FOR OBJECT RECOGNITION SAMBIT GHADAI XIAN LEE ADITYA BALU SOUMIK SARKAR ADARSH KRISHNAMURTHY

MULTI-LEVEL 3D CONVOLUTIONAL NEURAL NETWORK FOR OBJECT RECOGNITION SAMBIT GHADAI XIAN LEE ADITYA BALU SOUMIK SARKAR ADARSH KRISHNAMURTHY MULTI-LEVEL 3D CONVOLUTIONAL NEURAL NETWORK FOR OBJECT RECOGNITION SAMBIT GHADAI XIAN LEE ADITYA BALU SOUMIK SARKAR ADARSH KRISHNAMURTHY Outline Object Recognition Multi-Level Volumetric Representations

More information

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech Convolutional Neural Networks Computer Vision Jia-Bin Huang, Virginia Tech Today s class Overview Convolutional Neural Network (CNN) Training CNN Understanding and Visualizing CNN Image Categorization:

More information

3D model classification using convolutional neural network

3D model classification using convolutional neural network 3D model classification using convolutional neural network JunYoung Gwak Stanford jgwak@cs.stanford.edu Abstract Our goal is to classify 3D models directly using convolutional neural network. Most of existing

More information

Lot Item Name Description Quantity " Display Case Dell Optiplex 790 contains tower, keyboard, mouse, & monitor Computer Table

Lot Item Name Description Quantity  Display Case Dell Optiplex 790 contains tower, keyboard, mouse, & monitor Computer Table Lot Item Name Description Quantity 101 48" Display Case 6 102 Dell Optiplex 790 contains tower, keyboard, mouse, & monitor 123 103 Computer Table Black Legs 68 104 Classroom Table 18" x 60" Black Legs

More information

arxiv: v2 [cs.cv] 23 May 2016

arxiv: v2 [cs.cv] 23 May 2016 Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition arxiv:1605.06217v2 [cs.cv] 23 May 2016 Xiao Liu Jiang Wang Shilei Wen Errui Ding Yuanqing Lin Baidu Research

More information

REVISITING BATCH NORMALIZATION FOR PRACTICAL DOMAIN ADAPTATION

REVISITING BATCH NORMALIZATION FOR PRACTICAL DOMAIN ADAPTATION REVISITING BATCH NORMALIZATION FOR PRACTICAL DOMAIN ADAPTATION Yanghao Li, Naiyan Wang, Jianping Shi, Jiaying Liu, Xiaodi Hou Institute of Computer Science and Technology, Peking University TuSimple SenseTime

More information

Weakly Supervised Object Recognition with Convolutional Neural Networks

Weakly Supervised Object Recognition with Convolutional Neural Networks GDR-ISIS, Paris March 20, 2015 Weakly Supervised Object Recognition with Convolutional Neural Networks Ivan Laptev ivan.laptev@inria.fr WILLOW, INRIA/ENS/CNRS, Paris Joint work with: Maxime Oquab Leon

More information

Fuzzy Set Theory in Computer Vision: Example 3

Fuzzy Set Theory in Computer Vision: Example 3 Fuzzy Set Theory in Computer Vision: Example 3 Derek T. Anderson and James M. Keller FUZZ-IEEE, July 2017 Overview Purpose of these slides are to make you aware of a few of the different CNN architectures

More information

Partial Adversarial Domain Adaptation

Partial Adversarial Domain Adaptation Partial Adversarial Domain Adaptation Zhangjie Cao, Lijia Ma, Mingsheng Long( ), and Jianmin Wang School of Software, Tsinghua University, China National Engineering Laboratory for Big Data Software Beijing

More information

A Novel Representation and Pipeline for Object Detection

A Novel Representation and Pipeline for Object Detection A Novel Representation and Pipeline for Object Detection Vishakh Hegde Stanford University vishakh@stanford.edu Manik Dhar Stanford University dmanik@stanford.edu Abstract Object detection is an important

More information

Efficient Convolutional Network Learning using Parametric Log based Dual-Tree Wavelet ScatterNet

Efficient Convolutional Network Learning using Parametric Log based Dual-Tree Wavelet ScatterNet Efficient Convolutional Network Learning using Parametric Log based Dual-Tree Wavelet ScatterNet Amarjot Singh, Nick Kingsbury Signal Processing Group, Department of Engineering, University of Cambridge,

More information

Spatial Localization and Detection. Lecture 8-1

Spatial Localization and Detection. Lecture 8-1 Lecture 8: Spatial Localization and Detection Lecture 8-1 Administrative - Project Proposals were due on Saturday Homework 2 due Friday 2/5 Homework 1 grades out this week Midterm will be in-class on Wednesday

More information

Part Localization by Exploiting Deep Convolutional Networks

Part Localization by Exploiting Deep Convolutional Networks Part Localization by Exploiting Deep Convolutional Networks Marcel Simon, Erik Rodner, and Joachim Denzler Computer Vision Group, Friedrich Schiller University of Jena, Germany www.inf-cv.uni-jena.de Abstract.

More information

Image and Video Understanding

Image and Video Understanding Image and Video Understanding 2VO 70.095 WS Christoph Feichtenhofer, Axel Pinz Slide credits: Many thanks to all the great computer vision researchers on which this presentation relies on. Most material

More information

OMG: ORTHOGONAL METHOD OF GROUPING WITH APPLICATION OF K-SHOT LEARNING

OMG: ORTHOGONAL METHOD OF GROUPING WITH APPLICATION OF K-SHOT LEARNING OMG: ORTHOGONAL METHOD OF GROUPING WITH APPLICATION OF K-SHOT LEARNING Haoqi Fan Yu Zhang Kris M. Kitani The Robotics Institute, School of Computer Science Carnegie Mellon University Pittsburgh, P.A.,

More information

In Defense of Fully Connected Layers in Visual Representation Transfer

In Defense of Fully Connected Layers in Visual Representation Transfer In Defense of Fully Connected Layers in Visual Representation Transfer Chen-Lin Zhang, Jian-Hao Luo, Xiu-Shen Wei, Jianxin Wu National Key Laboratory for Novel Software Technology, Nanjing University,

More information

Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection

Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection Gao Xu, Yongming Zhang, Qixing Zhang*, Gaohua Lin, Jinjun Wang State Key Laboratory of Fire Science, University

More information

Yiqi Yan. May 10, 2017

Yiqi Yan. May 10, 2017 Yiqi Yan May 10, 2017 P a r t I F u n d a m e n t a l B a c k g r o u n d s Convolution Single Filter Multiple Filters 3 Convolution: case study, 2 filters 4 Convolution: receptive field receptive field

More information

Cross Modal Distillation for Supervision Transfer

Cross Modal Distillation for Supervision Transfer Cross Modal Distillation for Supervision Transfer Saurabh Gupta Judy Hoffman Jitendra Malik University of California, Berkeley {sgupta, jhoffman, malik}@eecs.berkeley.edu Abstract In this work we propose

More information

Cascade Region Regression for Robust Object Detection

Cascade Region Regression for Robust Object Detection Large Scale Visual Recognition Challenge 2015 (ILSVRC2015) Cascade Region Regression for Robust Object Detection Jiankang Deng, Shaoli Huang, Jing Yang, Hui Shuai, Zhengbo Yu, Zongguang Lu, Qiang Ma, Yali

More information

3D Object Classification using Shape Distributions and Deep Learning

3D Object Classification using Shape Distributions and Deep Learning 3D Object Classification using Shape Distributions and Deep Learning Melvin Low Stanford University mwlow@cs.stanford.edu Abstract This paper shows that the Absolute Angle shape distribution (AAD) feature

More information

RECURRENT NEURAL NETWORKS

RECURRENT NEURAL NETWORKS RECURRENT NEURAL NETWORKS Methods Traditional Deep-Learning based Non-machine Learning Machine-Learning based method Supervised SVM MLP CNN RNN (LSTM) Localizati on GPS, SLAM Self Driving Perception Pedestrian

More information

Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation Maximization

Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation Maximization Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation Maximization Guoliang Kang 1, Liang Zheng 1,2, Yan Yan 1, and Yi Yang 1 1 CAI, University of Technology

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Announcements Computer Vision Lecture 16 Deep Learning Applications 11.01.2017 Seminar registration period starts on Friday We will offer a lab course in the summer semester Deep Robot Learning Topic:

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Computer Vision Lecture 16 Deep Learning Applications 11.01.2017 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar registration period starts

More information

LSDA: Large Scale Detection through Adaptation

LSDA: Large Scale Detection through Adaptation LSDA: Large Scale Detection through Adaptation Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Ronghang Hu, Jeff Donahue, EECS, UC Berkeley, EE, Tsinghua University {jhoffman, sguada, tzeng, jdonahue}@eecs.berkeley.edu

More information

Domain Separation Networks

Domain Separation Networks Domain Separation Networks Konstantinos Bousmalis Google Brain Mountain View, CA konstantinos@google.com George Trigeorgis Imperial College London London, UK g.trigeorgis@imperial.ac.uk arxiv:1608.06019v1

More information

Unsupervised Domain Adaptation by Domain Invariant Projection

Unsupervised Domain Adaptation by Domain Invariant Projection 2013 IEEE International Conference on Computer Vision Unsupervised Domain Adaptation by Domain Invariant Projection Mahsa Baktashmotlagh 1,3, Mehrtash T. Harandi 2,3, Brian C. Lovell 1, and Mathieu Salzmann

More information

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc.

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc. The Hilbert Problems of Computer Vision Jitendra Malik UC Berkeley & Google, Inc. This talk The computational power of the human brain Research is the art of the soluble Hilbert problems, circa 2004 Hilbert

More information

Transfer Learning. Style Transfer in Deep Learning

Transfer Learning. Style Transfer in Deep Learning Transfer Learning & Style Transfer in Deep Learning 4-DEC-2016 Gal Barzilai, Ram Machlev Deep Learning Seminar School of Electrical Engineering Tel Aviv University Part 1: Transfer Learning in Deep Learning

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Computer Vision Lecture 16 Deep Learning for Object Categorization 14.01.2016 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar registration period

More information

arxiv: v2 [cs.ro] 26 Oct 2018

arxiv: v2 [cs.ro] 26 Oct 2018 Adapting control policies from simulation to reality using a pairwise loss Ulrich Viereck, Xingchao Peng, Kate Saenko, and Robert Platt arxiv:1807.10413v2 [cs.ro] 26 Oct 2018 Abstract This paper proposes

More information

Software is a computer that tells the computer how to perform particular.

Software is a computer that tells the computer how to perform particular. Name: A is a device that accepts, processes, stores, and produces, all according to a series of stored instructions. _ includes the physical electronic and mechanical devices that process the data. Peripheral

More information

Information Technology, the Internet, and You. Information Technology, the Internet, and You 1-2

Information Technology, the Internet, and You. Information Technology, the Internet, and You 1-2 Explain the five parts of an information system: people, procedures, software, hardware, and data. Distinguish between system software and application software. Discuss the three kinds of system software

More information

arxiv: v2 [cs.cv] 1 Oct 2014

arxiv: v2 [cs.cv] 1 Oct 2014 Deformable Part Models are Convolutional Neural Networks Tech report Ross Girshick Forrest Iandola Trevor Darrell Jitendra Malik UC Berkeley {rbg,forresti,trevor,malik}@eecsberkeleyedu arxiv:14095403v2

More information

Bilinear Models for Fine-Grained Visual Recognition

Bilinear Models for Fine-Grained Visual Recognition Bilinear Models for Fine-Grained Visual Recognition Subhransu Maji College of Information and Computer Sciences University of Massachusetts, Amherst Fine-grained visual recognition Example: distinguish

More information

VisDA: A Synthetic-to-Real Benchmark for Visual Domain Adaptation

VisDA: A Synthetic-to-Real Benchmark for Visual Domain Adaptation VisDA: A Synthetic-to-Real Benchmark for Visual Domain Adaptation Xingchao Peng 1, Ben Usman 1, Neela Kaushik 1, Dequan Wang 2, Judy Hoffman 2, and Kate Saenko 1 xpeng,usmn,nkaushik,saenko@bu.edu, jhoffman,dqwang@eecs.berkeley.edu

More information

Direct Multi-Scale Dual-Stream Network for Pedestrian Detection Sang-Il Jung and Ki-Sang Hong Image Information Processing Lab.

Direct Multi-Scale Dual-Stream Network for Pedestrian Detection Sang-Il Jung and Ki-Sang Hong Image Information Processing Lab. [ICIP 2017] Direct Multi-Scale Dual-Stream Network for Pedestrian Detection Sang-Il Jung and Ki-Sang Hong Image Information Processing Lab., POSTECH Pedestrian Detection Goal To draw bounding boxes that

More information

DeepIM: Deep Iterative Matching for 6D Pose Estimation - Supplementary Material

DeepIM: Deep Iterative Matching for 6D Pose Estimation - Supplementary Material DeepIM: Deep Iterative Matching for 6D Pose Estimation - Supplementary Material Yi Li 1, Gu Wang 1, Xiangyang Ji 1, Yu Xiang 2, and Dieter Fox 2 1 Tsinghua University, BNRist 2 University of Washington

More information

Visual Place Recognition in Changing Environments with Time-Invariant Image Patch Descriptors

Visual Place Recognition in Changing Environments with Time-Invariant Image Patch Descriptors Visual Place Recognition in Changing Environments with Time-Invariant Image Patch Descriptors Boris Ivanovic Stanford University borisi@cs.stanford.edu Abstract Feature descriptors for images are a mature

More information

3D Object Recognition and Scene Understanding. Yu Xiang University of Washington

3D Object Recognition and Scene Understanding. Yu Xiang University of Washington 3D Object Recognition and Scene Understanding Yu Xiang University of Washington 1 2 Image classification tagging/annotation Room Chair Fergus et al. CVPR 03 Fei-Fei et al. CVPRW 04 Chua et al. CIVR 09

More information

Comparison of Fine-tuning and Extension Strategies for Deep Convolutional Neural Networks

Comparison of Fine-tuning and Extension Strategies for Deep Convolutional Neural Networks Comparison of Fine-tuning and Extension Strategies for Deep Convolutional Neural Networks Nikiforos Pittaras 1, Foteini Markatopoulou 1,2, Vasileios Mezaris 1, and Ioannis Patras 2 1 Information Technologies

More information

Adversarial Discriminative Domain Adaptation

Adversarial Discriminative Domain Adaptation Adversarial Discriminative Domain Adaptation Eric Tzeng University of California, Berkeley etzeng@eecs.berkeley.edu Kate Saenko Boston University saenko@bu.edu Judy Hoffman Stanford University jhoffman@cs.stanford.edu

More information

INTERACTIVE WHITEBOARDS: SMART Board Overview

INTERACTIVE WHITEBOARDS: SMART Board Overview INTERACTIVE WHITEBOARDS: SMART Board Overview SMART Notebook Basic 2 SMART Board Equipment The SMART interactive whiteboard uses several main components. They include: THE BOARD The SMART Board has a touch-

More information

Multi-Glance Attention Models For Image Classification

Multi-Glance Attention Models For Image Classification Multi-Glance Attention Models For Image Classification Chinmay Duvedi Stanford University Stanford, CA cduvedi@stanford.edu Pararth Shah Stanford University Stanford, CA pararth@stanford.edu Abstract We

More information

ECCV Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016

ECCV Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016 ECCV 2016 Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016 Fundamental Question What is a good vector representation of an object? Something that can be easily predicted from 2D

More information

Image Analogies for Visual Domain Adaptation

Image Analogies for Visual Domain Adaptation Image Analogies for Visual Domain Adaptation Jeff Donahue UC Berkeley EECS jdonahue@eecs.berkeley.edu Abstract In usual approaches to visual domain adaptation, algorithms are used to infer a common domain

More information

Fully Convolutional Networks for Semantic Segmentation

Fully Convolutional Networks for Semantic Segmentation Fully Convolutional Networks for Semantic Segmentation Jonathan Long* Evan Shelhamer* Trevor Darrell UC Berkeley Chaim Ginzburg for Deep Learning seminar 1 Semantic Segmentation Define a pixel-wise labeling

More information

arxiv: v1 [cs.cv] 9 Dec 2015

arxiv: v1 [cs.cv] 9 Dec 2015 Window-Object Relationship Guided Representation Learning for Generic Object Detections Xingyu ZENG Wanli OUYANG Xiaogang WANG The Chinese University of Hong Kong {xyzeng,wlouyang,xgwang}@ee.cuhk.edu.hk

More information

Constrained Convolutional Neural Networks for Weakly Supervised Segmentation. Deepak Pathak, Philipp Krähenbühl and Trevor Darrell

Constrained Convolutional Neural Networks for Weakly Supervised Segmentation. Deepak Pathak, Philipp Krähenbühl and Trevor Darrell Constrained Convolutional Neural Networks for Weakly Supervised Segmentation Deepak Pathak, Philipp Krähenbühl and Trevor Darrell 1 Multi-class Image Segmentation Assign a class label to each pixel in

More information

Perceiving the 3D World from Images and Videos. Yu Xiang Postdoctoral Researcher University of Washington & NVIDIA Research

Perceiving the 3D World from Images and Videos. Yu Xiang Postdoctoral Researcher University of Washington & NVIDIA Research Perceiving the 3D World from Images and Videos Yu Xiang Postdoctoral Researcher University of Washington & NVIDIA Research 1 2 Acting in the 3D World Sensing Sound sensor Camera Depth sensor Touch sensor

More information

POINT CLOUD DEEP LEARNING

POINT CLOUD DEEP LEARNING POINT CLOUD DEEP LEARNING Innfarn Yoo, 3/29/28 / 57 Introduction AGENDA Previous Work Method Result Conclusion 2 / 57 INTRODUCTION 3 / 57 2D OBJECT CLASSIFICATION Deep Learning for 2D Object Classification

More information

Deep Learning with Tensorflow AlexNet

Deep Learning with Tensorflow   AlexNet Machine Learning and Computer Vision Group Deep Learning with Tensorflow http://cvml.ist.ac.at/courses/dlwt_w17/ AlexNet Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton, "Imagenet classification

More information

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)

More information

arxiv: v1 [cs.cv] 30 Dec 2018

arxiv: v1 [cs.cv] 30 Dec 2018 DART: DOMAIN-ADVERSARIAL RESIDUAL-TRANSFER NETWORKS FOR UNSUPERVISED CROSS-DOMAIN IMAGE CLASSIFICATION A PREPRINT arxiv:1812.11478v1 [cs.cv] 30 Dec 2018 Xianghong Fang SMILE Lab Univ. of Electronic Sci

More information