Hybrid Learning Framework for Large-Scale Web Image Annotation and Localization

Size: px
Start display at page:

Download "Hybrid Learning Framework for Large-Scale Web Image Annotation and Localization"

Transcription

1 .. Hybrid Learning Framework for Large-Scale Web Image Annotation and Localization Yong Li, Jing Liu, Yuhang Wang, Bingyuan Liu, Jun Fu, Yunze Gao, Hui Wu, Hang Song, Peng Ying, Hanqing lu Image & Video Analysis Group, Institute of Automation, Chinese Academy of Sciences September 9, 2015 Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

2 Outline.1 Introduction to The Image Annotation Task.2 The Proposed Hybrid Learning Framework.3 Experiments and Results.4 Conclusions Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

3 Introduction to The Image Annotation Task Two subtasks in Scalable Concept Image Annotation challenge Image concept detection and localisation: it is to annotate and localize 500,000 web images with 251 concepts. Generation of textual descriptions of images: it is to describe an image with a textual description of the visual content depicted in the image. We focus on the subtask 1. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

4 Introduction to The Image Annotation Task Two subtasks in Scalable Concept Image Annotation challenge Image concept detection and localisation: it is to annotate and localize 500,000 web images with 251 concepts. Generation of textual descriptions of images: it is to describe an image with a textual description of the visual content depicted in the image. We focus on the subtask 1. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

5 The Proposed Hybrid Learning Framework We propose a two-stage hybrid learning framework as follows. Annotation Localization Query Image Webpage Description Annotation By Classification Annotation By Search Localization By Fast RCNN Localization By Search Face Detection Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

6 Flowchart of The Proposed Approach. Yong Li (IVA, NLPR, CASIA) HLF.... September 9, / 25

7 Data Preparation We prefer multiple online resources to perform such task, including the ImageNet database, the Sun database, the WordNet, and the online image sharing website Flicker. There are 175 concepts concurrent in the ImageNet dataset and the ImageCLEF task simultaneously. Meanwhile, there are 217 concepts concurrent in the Sun dataset and the ImageCLEF task. For the concepts not in the ImageNet and Sun database or with very few annotated images, we crawled images from the website Flicker and filtered it with 50 images left for each concept. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

8 The Proposed Hybrid Learning Framework Image Annotation Annotation by classification. Annotation by search. Concept Localization Localization By Fast RCNN. Localization By Search. Localization of Face Related Concepts. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

9 Annotation by classification We train a linear SVM for each concept with one-vs-rest strategy. The visual feature is extracted with the released VGG19 model with dimension Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

10 Annotation by classification The ratio between the positive samples and negative samples is between 1:10 to 1:5. Due to images usually being labelled with multiple concepts in training data, the negative samples for a given concept classifier are selected as the ones whose all labels do not include the concept. The final output of SVM is normalized with the Logistic function 1 f(x) = to make the output value between [0, 1]. 1+exp( w T x) Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

11 The Proposed Hybrid Learning Framework Image Annotation Annotation by classification. Annotation by search. Concept Localization Localization By Fast RCNN. Localization By Search. Localization of Face Related Concepts. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

12 Annotation by Search The search-based approach for image annotation works on the assumption that visual similar images should reflect similar semantical concepts, and most textual information of web images is relevant to their visual content. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

13 Annotation by Search Annotation by search mainly consists of the following three steps, Search for similar images: The extracted 4096-dimensional deep features are mapped to dimensional binary hash codes leveraging the random projection algorithm to save memory space and reduce retrieval time. Semantic Reranking: To further improve the results of visual similarity search, we explore the textual information of the given image, and perform the semantic similarity search on the top-n A visually similar images to rerank the similar image set. Relevant concept selection: We employ a WordNet-based approach, which is similar to the solution in [1]. [1]: Budikova, P., Botorek, J., Batko, M., Zezula, P.: DISA at imageclef 2014: The search-based solution for scalable image annotation. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

14 Annotation by Search Annotation by search mainly consists of the following three steps, Search for similar images: The extracted 4096-dimensional deep features are mapped to dimensional binary hash codes leveraging the random projection algorithm to save memory space and reduce retrieval time. Semantic Reranking: To further improve the results of visual similarity search, we explore the textual information of the given image, and perform the semantic similarity search on the top-n A visually similar images to rerank the similar image set. Relevant concept selection: We employ a WordNet-based approach, which is similar to the solution in [1]. [1]: Budikova, P., Botorek, J., Batko, M., Zezula, P.: DISA at imageclef 2014: The search-based solution for scalable image annotation. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

15 Annotation by Search Annotation by search mainly consists of the following three steps, Search for similar images: The extracted 4096-dimensional deep features are mapped to dimensional binary hash codes leveraging the random projection algorithm to save memory space and reduce retrieval time. Semantic Reranking: To further improve the results of visual similarity search, we explore the textual information of the given image, and perform the semantic similarity search on the top-n A visually similar images to rerank the similar image set. Relevant concept selection: We employ a WordNet-based approach, which is similar to the solution in [1]. [1]: Budikova, P., Botorek, J., Batko, M., Zezula, P.: DISA at imageclef 2014: The search-based solution for scalable image annotation. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

16 The Proposed Hybrid Learning Framework Image Annotation Annotation by classification. Annotation by search. Concept Localization Localization By Fast RCNN. Localization By Search. Localization of Face Related Concepts. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

17 Localization By Fast RCNN Fast RCNN originates from the method RCNN, which provides classification result and regressed location simultaneously. Deep ConvNet RoI projection Conv feature map RoI pooling layer Outputs: softmax FCs FC RoI feature vector bbox regressor FC For each RoI Comparisons of such two methods will be given. We have used both methods during the competition. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

18 Localization By Fast RCNN Fast RCNN originates from the method RCNN, which provides classification result and regressed location simultaneously. Deep ConvNet RoI projection Conv feature map RoI pooling layer Outputs: softmax FCs FC RoI feature vector bbox regressor FC For each RoI Comparisons of such two methods will be given. We have used both methods during the competition. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

19 Localization By Fast RCNN RCNN has demonstrated its advantage in object detection. However, it has notable drawbacks. Training is a multi-stage pipeline, which is expensive in space and time. One first finetunes a ConvNet for detection using cross-entropy loss. Then linear SVMs are fit to ConvNet features computed on warped object proposals. Finally, bounding-box regressors are learned. Test-time detection is slow. At test time, features are extracted from each warped object proposal in each test image. Detection with VGG16 takes 47s per image. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

20 Localization By Fast RCNN Compared with RCNN, Fast RCNN has the following advantages. It achieves higher detection quality(map). Training is single-stage, using a multi-task loss. It provides classification result and regressed location simultaneously for each candidate object proposal. No disk storage is required for feature caching. For us, we preferred the RCNN method when decided to take part in the task. However, it is really tough to deal with 500,000 images with RCNN. We met fast RCNN two weeks before the deadline. It helped us a lot. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

21 Localization By Fast RCNN Compared with RCNN, Fast RCNN has the following advantages. It achieves higher detection quality(map). Training is single-stage, using a multi-task loss. It provides classification result and regressed location simultaneously for each candidate object proposal. No disk storage is required for feature caching. For us, we preferred the RCNN method when decided to take part in the task. However, it is really tough to deal with 500,000 images with RCNN. We met fast RCNN two weeks before the deadline. It helped us a lot. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

22 The Proposed Hybrid Learning Framework Image Annotation Annotation by classification. Annotation by search. Concept Localization Localization By Fast RCNN. Localization By Search. Localization of Face Related Concepts. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

23 Localization By Search We give a special consideration to the 25 scene related concepts for concept localization ( e.g., beach, sea, river and valley ). We first find its top-n L visually similar neighbors with the same concept in the localization training data. We use their merged bounding box as the location of the scenery concept. Annotation Query Image Similar Image 1 Similar Image 2 Similar Image 3 Valley Shore Bathroom Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

24 The Proposed Hybrid Learning Framework Image Annotation Annotation by classification. Annotation by search. Concept Localization Localization By Fast RCNN. Localization By Search. Localization of Face Related Concepts. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

25 Localization of Face Related Concepts We give a special consideration to the person and face related concepts. Face detection and facial point localization have been actively studied over the past years and achieve satisfactory performance. Face Beard Hand Head Hair Leg Mouth Tongue Foot Eye Ear Arm Nose Neck Linear classifiers are trained with the SIFT features extracted on the facial points to determine the following concepts. Man Woman Male child Female child Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

26 Localization of Face Related Concepts We give a special consideration to the person and face related concepts. Face detection and facial point localization have been actively studied over the past years and achieve satisfactory performance. Face Beard Hand Head Hair Leg Mouth Tongue Foot Eye Ear Arm Nose Neck Linear classifiers are trained with the SIFT features extracted on the facial points to determine the following concepts. Man Woman Male child Female child Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

27 Experiments and Results We have submitted 8 runs with 5 different settings of combinations with the above model modules, including Annotation By Classification (ABC), Annotation By Search (ABS), localization by Fast R-CNN (FRCN), Localization By Search (LBS) and Concept Extension (CE). Method ABC CE ABS LBS FRCN SVM Threshold Overlap 0.5 Overlap 0 Setting 1 yes no yes yes yes Setting 2 yes yes yes yes yes Setting 3 yes yes no yes yes Setting 4 yes no no yes yes Setting 5 no no no no yes Effectiveness ABS is verified by comparing the result of setting 2 and setting 3 with 2 percent improvement. Effectiveness of FRCN is verified with setting 5, which is unsatisfactory. The proposed two stage process is more suitable to deal with such task. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

28 MAP with Different Recall Rates Mean average precision(%) Setting 1 Setting 2 Setting 3 Setting 4 Setting Overlap with GT labels(%) Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

29 Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25 CEA_LIST CNRS_TPT-1 CNRS_TPT-2 CNRS_TPT-3 CNRS_TPT-4 CNRS_TPT-5 CNRS_TPT-6 CNRS_TPT-7 CNRS_TPT-8 CNRS_TPT-9 CNRS_TPT-10 IRIP-iCC-1 IRIP-iCC-2 IRIP-iCC-3 IRIP-iCC-4 IRIP-iCC-5 IRIP-iCC-6 IRIP-iCC-7 IRIP-iCC-8 IRIP-iCC-9 IRIP-iCC-10 ISIA IVANLPR-1 IVANLPR-2 IVANLPR-3 IVANLPR-4 IVANLPR-5 IVANLPR-6 IVANLPR-7 IVANLPR-8 kdevir-1 kdevir-2 kdevir-3 kdevir-4 kdevir-5 Lip6.fr-1 Lip6.fr-2 Lip6.fr-3 Lip6.fr-4 Lip6.fr-5 Lip6.fr-6 Lip6.fr-7 MLVISP6-1 MLVISP6-2 MLVISP6-3 MLVISP6-4 MLVISP6-5 MLVISP6-6 MLVISP6-7 MLVISP6-8 MLVISP6-9 MLVISP6-10 MLVISP6-11 MLVISP6-12 MLVISP6-13 MLVISP6-14 RUC-1 RUC-2 RUC-3 RUC-4 RUC-5 RUC-6 RUC-7 RUC-8 SMIVA-1 SMIVA-2 SMIVA-3 SMIVA-4 SMIVA-5 SMIVA-6 SMIVA-7 SMIVA-8 SMIVA-9 SMIVA-10 SMIVA-11 SMIVA-12 SMIVA-13 SMIVA-14 SMIVA-15 SMIVA Map with 50% overlap with the GT labels Submission Results of Different Teams

30 Conclusions We propose a two-stage hybrid learning framework for Large-Scale Web Image Annotation and Localization. It is important to analyse the concepts carefully and dealing with different kinds of concepts with different modules is necessary. For the proposed method, the performance is limited by the number of training samples with full annotations. Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

31 Thanks for your attention! More info: Yong Li (IVA, NLPR, CASIA) HLF September 9, / 25

Deep learning for object detection. Slides from Svetlana Lazebnik and many others

Deep learning for object detection. Slides from Svetlana Lazebnik and many others Deep learning for object detection Slides from Svetlana Lazebnik and many others Recent developments in object detection 80% PASCAL VOC mean0average0precision0(map) 70% 60% 50% 40% 30% 20% 10% Before deep

More information

Object detection with CNNs

Object detection with CNNs Object detection with CNNs 80% PASCAL VOC mean0average0precision0(map) 70% 60% 50% 40% 30% 20% 10% Before CNNs After CNNs 0% 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 year Region proposals

More information

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Shaoqing Ren Kaiming He Ross Girshick Jian Sun Present by: Yixin Yang Mingdong Wang 1 Object Detection 2 1 Applications Basic

More information

CS6501: Deep Learning for Visual Recognition. Object Detection I: RCNN, Fast-RCNN, Faster-RCNN

CS6501: Deep Learning for Visual Recognition. Object Detection I: RCNN, Fast-RCNN, Faster-RCNN CS6501: Deep Learning for Visual Recognition Object Detection I: RCNN, Fast-RCNN, Faster-RCNN Today s Class Object Detection The RCNN Object Detector (2014) The Fast RCNN Object Detector (2015) The Faster

More information

Spatial Localization and Detection. Lecture 8-1

Spatial Localization and Detection. Lecture 8-1 Lecture 8: Spatial Localization and Detection Lecture 8-1 Administrative - Project Proposals were due on Saturday Homework 2 due Friday 2/5 Homework 1 grades out this week Midterm will be in-class on Wednesday

More information

Object Detection Based on Deep Learning

Object Detection Based on Deep Learning Object Detection Based on Deep Learning Yurii Pashchenko AI Ukraine 2016, Kharkiv, 2016 Image classification (mostly what you ve seen) http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf

More information

Deep Learning for Object detection & localization

Deep Learning for Object detection & localization Deep Learning for Object detection & localization RCNN, Fast RCNN, Faster RCNN, YOLO, GAP, CAM, MSROI Aaditya Prakash Sep 25, 2018 Image classification Image classification Whole of image is classified

More information

BUAA-iCC at ImageCLEF 2015 Scalable Concept Image Annotation Challenge

BUAA-iCC at ImageCLEF 2015 Scalable Concept Image Annotation Challenge BUAA-iCC at ImageCLEF 2015 Scalable Concept Image Annotation Challenge Yunhong Wang and Jiaxin Chen Intelligent Recognition and Image Processing Lab, Beihang University, Beijing 100191, P.R.China yhwang@buaa.edu.cn;

More information

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation Object detection using Region Proposals (RCNN) Ernest Cheung COMP790-125 Presentation 1 2 Problem to solve Object detection Input: Image Output: Bounding box of the object 3 Object detection using CNN

More information

Project 3 Q&A. Jonathan Krause

Project 3 Q&A. Jonathan Krause Project 3 Q&A Jonathan Krause 1 Outline R-CNN Review Error metrics Code Overview Project 3 Report Project 3 Presentations 2 Outline R-CNN Review Error metrics Code Overview Project 3 Report Project 3 Presentations

More information

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun Presented by Tushar Bansal Objective 1. Get bounding box for all objects

More information

OBJECT DETECTION HYUNG IL KOO

OBJECT DETECTION HYUNG IL KOO OBJECT DETECTION HYUNG IL KOO INTRODUCTION Computer Vision Tasks Classification + Localization Classification: C-classes Input: image Output: class label Evaluation metric: accuracy Localization Input:

More information

Yiqi Yan. May 10, 2017

Yiqi Yan. May 10, 2017 Yiqi Yan May 10, 2017 P a r t I F u n d a m e n t a l B a c k g r o u n d s Convolution Single Filter Multiple Filters 3 Convolution: case study, 2 filters 4 Convolution: receptive field receptive field

More information

Introduction to Deep Learning for Facial Understanding Part III: Regional CNNs

Introduction to Deep Learning for Facial Understanding Part III: Regional CNNs Introduction to Deep Learning for Facial Understanding Part III: Regional CNNs Raymond Ptucha, Rochester Institute of Technology, USA Tutorial-9 May 19, 218 www.nvidia.com/dli R. Ptucha 18 1 Fair Use Agreement

More information

Kaggle Data Science Bowl 2017 Technical Report

Kaggle Data Science Bowl 2017 Technical Report Kaggle Data Science Bowl 2017 Technical Report qfpxfd Team May 11, 2017 1 Team Members Table 1: Team members Name E-Mail University Jia Ding dingjia@pku.edu.cn Peking University, Beijing, China Aoxue Li

More information

3 Object Detection. BVM 2018 Tutorial: Advanced Deep Learning Methods. Paul F. Jaeger, Division of Medical Image Computing

3 Object Detection. BVM 2018 Tutorial: Advanced Deep Learning Methods. Paul F. Jaeger, Division of Medical Image Computing 3 Object Detection BVM 2018 Tutorial: Advanced Deep Learning Methods Paul F. Jaeger, of Medical Image Computing What is object detection? classification segmentation obj. detection (1 label per pixel)

More information

ECCV Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016

ECCV Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016 ECCV 2016 Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016 Fundamental Question What is a good vector representation of an object? Something that can be easily predicted from 2D

More information

Automatic detection of books based on Faster R-CNN

Automatic detection of books based on Faster R-CNN Automatic detection of books based on Faster R-CNN Beibei Zhu, Xiaoyu Wu, Lei Yang, Yinghua Shen School of Information Engineering, Communication University of China Beijing, China e-mail: zhubeibei@cuc.edu.cn,

More information

Places Challenge 2017

Places Challenge 2017 Places Challenge 2017 Scene Parsing Task CASIA_IVA_JD Jun Fu, Jing Liu, Longteng Guo, Haijie Tian, Fei Liu, Hanqing Lu Yong Li, Yongjun Bao, Weipeng Yan National Laboratory of Pattern Recognition, Institute

More information

CEA LIST s participation to the Scalable Concept Image Annotation task of ImageCLEF 2015

CEA LIST s participation to the Scalable Concept Image Annotation task of ImageCLEF 2015 CEA LIST s participation to the Scalable Concept Image Annotation task of ImageCLEF 2015 Etienne Gadeski, Hervé Le Borgne, and Adrian Popescu CEA, LIST, Laboratory of Vision and Content Engineering, France

More information

Deep Face Recognition. Nathan Sun

Deep Face Recognition. Nathan Sun Deep Face Recognition Nathan Sun Why Facial Recognition? Picture ID or video tracking Higher Security for Facial Recognition Software Immensely useful to police in tracking suspects Your face will be an

More information

Joint Object Detection and Viewpoint Estimation using CNN features

Joint Object Detection and Viewpoint Estimation using CNN features Joint Object Detection and Viewpoint Estimation using CNN features Carlos Guindel, David Martín and José M. Armingol cguindel@ing.uc3m.es Intelligent Systems Laboratory Universidad Carlos III de Madrid

More information

FaceNet. Florian Schroff, Dmitry Kalenichenko, James Philbin Google Inc. Presentation by Ignacio Aranguren and Rahul Rana

FaceNet. Florian Schroff, Dmitry Kalenichenko, James Philbin Google Inc. Presentation by Ignacio Aranguren and Rahul Rana FaceNet Florian Schroff, Dmitry Kalenichenko, James Philbin Google Inc. Presentation by Ignacio Aranguren and Rahul Rana Introduction FaceNet learns a mapping from face images to a compact Euclidean Space

More information

Flow-Based Video Recognition

Flow-Based Video Recognition Flow-Based Video Recognition Jifeng Dai Visual Computing Group, Microsoft Research Asia Joint work with Xizhou Zhu*, Yuwen Xiong*, Yujie Wang*, Lu Yuan and Yichen Wei (* interns) Talk pipeline Introduction

More information

CIS680: Vision & Learning Assignment 2.b: RPN, Faster R-CNN and Mask R-CNN Due: Nov. 21, 2018 at 11:59 pm

CIS680: Vision & Learning Assignment 2.b: RPN, Faster R-CNN and Mask R-CNN Due: Nov. 21, 2018 at 11:59 pm CIS680: Vision & Learning Assignment 2.b: RPN, Faster R-CNN and Mask R-CNN Due: Nov. 21, 2018 at 11:59 pm Instructions This is an individual assignment. Individual means each student must hand in their

More information

Overview of ImageCLEF Mauricio Villegas (on behalf of all organisers)

Overview of ImageCLEF Mauricio Villegas (on behalf of all organisers) Overview of ImageCLEF 2016 Mauricio Villegas (on behalf of all organisers) ImageCLEF history Started in 2003 with a photo retrieval task 4 participants submitting results In 2009 we had 6 tasks and 65

More information

Lecture 5: Object Detection

Lecture 5: Object Detection Object Detection CSED703R: Deep Learning for Visual Recognition (2017F) Lecture 5: Object Detection Bohyung Han Computer Vision Lab. bhhan@postech.ac.kr 2 Traditional Object Detection Algorithms Region-based

More information

Classification of objects from Video Data (Group 30)

Classification of objects from Video Data (Group 30) Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time

More information

Object Detection on Self-Driving Cars in China. Lingyun Li

Object Detection on Self-Driving Cars in China. Lingyun Li Object Detection on Self-Driving Cars in China Lingyun Li Introduction Motivation: Perception is the key of self-driving cars Data set: 10000 images with annotation 2000 images without annotation (not

More information

PIXELS TO VOXELS: MODELING VISUAL REPRESENTATION IN THE HUMAN BRAIN

PIXELS TO VOXELS: MODELING VISUAL REPRESENTATION IN THE HUMAN BRAIN PIXELS TO VOXELS: MODELING VISUAL REPRESENTATION IN THE HUMAN BRAIN By Pulkit Agrawal, Dustin Stansbury, Jitendra Malik, Jack L. Gallant University of California Berkeley Presented by Tim Patzelt AGENDA

More information

Rich feature hierarchies for accurate object detection and semant

Rich feature hierarchies for accurate object detection and semant Rich feature hierarchies for accurate object detection and semantic segmentation Speaker: Yucong Shen 4/5/2018 Develop of Object Detection 1 DPM (Deformable parts models) 2 R-CNN 3 Fast R-CNN 4 Faster

More information

Yield Estimation using faster R-CNN

Yield Estimation using faster R-CNN Yield Estimation using faster R-CNN 1 Vidhya Sagar, 2 Sailesh J.Jain and 2 Arjun P. 1 Assistant Professor, 2 UG Scholar, Department of Computer Engineering and Science SRM Institute of Science and Technology,Chennai,

More information

Learning to Segment Object Candidates

Learning to Segment Object Candidates Learning to Segment Object Candidates Pedro Pinheiro, Ronan Collobert and Piotr Dollar Presented by - Sivaraman, Kalpathy Sitaraman, M.S. in Computer Science, University of Virginia Facebook Artificial

More information

Mask R-CNN. presented by Jiageng Zhang, Jingyao Zhan, Yunhan Ma

Mask R-CNN. presented by Jiageng Zhang, Jingyao Zhan, Yunhan Ma Mask R-CNN presented by Jiageng Zhang, Jingyao Zhan, Yunhan Ma Mask R-CNN Background Related Work Architecture Experiment Mask R-CNN Background Related Work Architecture Experiment Background From left

More information

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,

More information

R-FCN: Object Detection with Really - Friggin Convolutional Networks

R-FCN: Object Detection with Really - Friggin Convolutional Networks R-FCN: Object Detection with Really - Friggin Convolutional Networks Jifeng Dai Microsoft Research Li Yi Tsinghua Univ. Kaiming He FAIR Jian Sun Microsoft Research NIPS, 2016 Or Region-based Fully Convolutional

More information

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University LSTM and its variants for visual recognition Xiaodan Liang xdliang328@gmail.com Sun Yat-sen University Outline Context Modelling with CNN LSTM and its Variants LSTM Architecture Variants Application in

More information

AttentionNet for Accurate Localization and Detection of Objects. (To appear in ICCV 2015)

AttentionNet for Accurate Localization and Detection of Objects. (To appear in ICCV 2015) AttentionNet for Accurate Localization and Detection of Objects. (To appear in ICCV 2015) Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony Paek, In So Kweon. State-of-the-art frameworks for object

More information

SSD: Single Shot MultiBox Detector. Author: Wei Liu et al. Presenter: Siyu Jiang

SSD: Single Shot MultiBox Detector. Author: Wei Liu et al. Presenter: Siyu Jiang SSD: Single Shot MultiBox Detector Author: Wei Liu et al. Presenter: Siyu Jiang Outline 1. Motivations 2. Contributions 3. Methodology 4. Experiments 5. Conclusions 6. Extensions Motivation Motivation

More information

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin

More information

Rich feature hierarchies for accurate object detection and semantic segmentation

Rich feature hierarchies for accurate object detection and semantic segmentation Rich feature hierarchies for accurate object detection and semantic segmentation BY; ROSS GIRSHICK, JEFF DONAHUE, TREVOR DARRELL AND JITENDRA MALIK PRESENTER; MUHAMMAD OSAMA Object detection vs. classification

More information

Recap Image Classification with Bags of Local Features

Recap Image Classification with Bags of Local Features Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval

More information

Fully-Convolutional Siamese Networks for Object Tracking

Fully-Convolutional Siamese Networks for Object Tracking Fully-Convolutional Siamese Networks for Object Tracking Luca Bertinetto*, Jack Valmadre*, João Henriques, Andrea Vedaldi and Philip Torr www.robots.ox.ac.uk/~luca luca.bertinetto@eng.ox.ac.uk Tracking

More information

DEEP NEURAL NETWORKS FOR OBJECT DETECTION

DEEP NEURAL NETWORKS FOR OBJECT DETECTION DEEP NEURAL NETWORKS FOR OBJECT DETECTION Sergey Nikolenko Steklov Institute of Mathematics at St. Petersburg October 21, 2017, St. Petersburg, Russia Outline Bird s eye overview of deep learning Convolutional

More information

Large Scale Chinese News Categorization. Peng Wang. Joint work with H. Zhang, B. Xu, H.W. Hao

Large Scale Chinese News Categorization. Peng Wang. Joint work with H. Zhang, B. Xu, H.W. Hao Large Scale Chinese News Categorization --based on Improved Feature Selection Method Peng Wang Joint work with H. Zhang, B. Xu, H.W. Hao Computational-Brain Research Center Institute of Automation, Chinese

More information

Machine Learning 13. week

Machine Learning 13. week Machine Learning 13. week Deep Learning Convolutional Neural Network Recurrent Neural Network 1 Why Deep Learning is so Popular? 1. Increase in the amount of data Thanks to the Internet, huge amount of

More information

Single-Shot Refinement Neural Network for Object Detection -Supplementary Material-

Single-Shot Refinement Neural Network for Object Detection -Supplementary Material- Single-Shot Refinement Neural Network for Object Detection -Supplementary Material- Shifeng Zhang 1,2, Longyin Wen 3, Xiao Bian 3, Zhen Lei 1,2, Stan Z. Li 4,1,2 1 CBSR & NLPR, Institute of Automation,

More information

Real-time Object Detection CS 229 Course Project

Real-time Object Detection CS 229 Course Project Real-time Object Detection CS 229 Course Project Zibo Gong 1, Tianchang He 1, and Ziyi Yang 1 1 Department of Electrical Engineering, Stanford University December 17, 2016 Abstract Objection detection

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Computer Vision Lecture 16 Deep Learning Applications 11.01.2017 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar registration period starts

More information

arxiv: v3 [cs.cv] 2 Jun 2017

arxiv: v3 [cs.cv] 2 Jun 2017 Incorporating the Knowledge of Dermatologists to Convolutional Neural Networks for the Diagnosis of Skin Lesions arxiv:1703.01976v3 [cs.cv] 2 Jun 2017 Iván González-Díaz Department of Signal Theory and

More information

Deep Learning for Computer Vision II

Deep Learning for Computer Vision II IIIT Hyderabad Deep Learning for Computer Vision II C. V. Jawahar Paradigm Shift Feature Extraction (SIFT, HoG, ) Part Models / Encoding Classifier Sparrow Feature Learning Classifier Sparrow L 1 L 2 L

More information

Unsupervised Deep Learning. James Hays slides from Carl Doersch and Richard Zhang

Unsupervised Deep Learning. James Hays slides from Carl Doersch and Richard Zhang Unsupervised Deep Learning James Hays slides from Carl Doersch and Richard Zhang Recap from Previous Lecture We saw two strategies to get structured output while using deep learning With object detection,

More information

Why Is Recognition Hard? Object Recognizer

Why Is Recognition Hard? Object Recognizer Why Is Recognition Hard? Object Recognizer panda Why is Recognition Hard? Object Recognizer panda Pose Why is Recognition Hard? Object Recognizer panda Occlusion Why is Recognition Hard? Object Recognizer

More information

Bidirectional Recurrent Convolutional Networks for Video Super-Resolution

Bidirectional Recurrent Convolutional Networks for Video Super-Resolution Bidirectional Recurrent Convolutional Networks for Video Super-Resolution Qi Zhang & Yan Huang Center for Research on Intelligent Perception and Computing (CRIPAC) National Laboratory of Pattern Recognition

More information

Martian lava field, NASA, Wikipedia

Martian lava field, NASA, Wikipedia Martian lava field, NASA, Wikipedia Old Man of the Mountain, Franconia, New Hampshire Pareidolia http://smrt.ccel.ca/203/2/6/pareidolia/ Reddit for more : ) https://www.reddit.com/r/pareidolia/top/ Pareidolia

More information

Instance-aware Semantic Segmentation via Multi-task Network Cascades

Instance-aware Semantic Segmentation via Multi-task Network Cascades Instance-aware Semantic Segmentation via Multi-task Network Cascades Jifeng Dai, Kaiming He, Jian Sun Microsoft research 2016 Yotam Gil Amit Nativ Agenda Introduction Highlights Implementation Further

More information

Deep Residual Learning

Deep Residual Learning Deep Residual Learning MSRA @ ILSVRC & COCO 2015 competitions Kaiming He with Xiangyu Zhang, Shaoqing Ren, Jifeng Dai, & Jian Sun Microsoft Research Asia (MSRA) MSRA @ ILSVRC & COCO 2015 Competitions 1st

More information

A FRAMEWORK OF EXTRACTING MULTI-SCALE FEATURES USING MULTIPLE CONVOLUTIONAL NEURAL NETWORKS. Kuan-Chuan Peng and Tsuhan Chen

A FRAMEWORK OF EXTRACTING MULTI-SCALE FEATURES USING MULTIPLE CONVOLUTIONAL NEURAL NETWORKS. Kuan-Chuan Peng and Tsuhan Chen A FRAMEWORK OF EXTRACTING MULTI-SCALE FEATURES USING MULTIPLE CONVOLUTIONAL NEURAL NETWORKS Kuan-Chuan Peng and Tsuhan Chen School of Electrical and Computer Engineering, Cornell University, Ithaca, NY

More information

Object Detection. Part1. Presenter: Dae-Yong

Object Detection. Part1. Presenter: Dae-Yong Object Part1 Presenter: Dae-Yong Contents 1. What is an Object? 2. Traditional Object Detector 3. Deep Learning-based Object Detector What is an Object? Subset of Object Recognition What is an Object?

More information

Abandoned Luggage Detection

Abandoned Luggage Detection Abandoned Luggage Detection Using deep neural networks to detect and track abandoned luggage in video By Evan Miller and Eli Selkin For CS519 instructed by Dr. Hao Ji At California State Polytechnic University,

More information

Regionlet Object Detector with Hand-crafted and CNN Feature

Regionlet Object Detector with Hand-crafted and CNN Feature Regionlet Object Detector with Hand-crafted and CNN Feature Xiaoyu Wang Research Xiaoyu Wang Research Ming Yang Horizon Robotics Shenghuo Zhu Alibaba Group Yuanqing Lin Baidu Overview of this section Regionlet

More information

Mask R-CNN. Kaiming He, Georgia, Gkioxari, Piotr Dollar, Ross Girshick Presenters: Xiaokang Wang, Mengyao Shi Feb. 13, 2018

Mask R-CNN. Kaiming He, Georgia, Gkioxari, Piotr Dollar, Ross Girshick Presenters: Xiaokang Wang, Mengyao Shi Feb. 13, 2018 Mask R-CNN Kaiming He, Georgia, Gkioxari, Piotr Dollar, Ross Girshick Presenters: Xiaokang Wang, Mengyao Shi Feb. 13, 2018 1 Common computer vision tasks Image Classification: one label is generated for

More information

Fully-Convolutional Siamese Networks for Object Tracking

Fully-Convolutional Siamese Networks for Object Tracking Fully-Convolutional Siamese Networks for Object Tracking Luca Bertinetto*, Jack Valmadre*, João Henriques, Andrea Vedaldi and Philip Torr www.robots.ox.ac.uk/~luca luca.bertinetto@eng.ox.ac.uk Tracking

More information

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet Classification with Deep Convolutional Neural Networks ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky Ilya Sutskever Geoffrey Hinton University of Toronto Canada Paper with same name to appear in NIPS 2012 Main idea Architecture

More information

Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task

Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task Kyunghee Kim Stanford University 353 Serra Mall Stanford, CA 94305 kyunghee.kim@stanford.edu Abstract We use a

More information

3D model classification using convolutional neural network

3D model classification using convolutional neural network 3D model classification using convolutional neural network JunYoung Gwak Stanford jgwak@cs.stanford.edu Abstract Our goal is to classify 3D models directly using convolutional neural network. Most of existing

More information

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

Deep Learning for Computer Vision with MATLAB By Jon Cherrie Deep Learning for Computer Vision with MATLAB By Jon Cherrie 2015 The MathWorks, Inc. 1 Deep learning is getting a lot of attention "Dahl and his colleagues won $22,000 with a deeplearning system. 'We

More information

Optimizing Object Detection:

Optimizing Object Detection: Lecture 10: Optimizing Object Detection: A Case Study of R-CNN, Fast R-CNN, and Faster R-CNN Visual Computing Systems Today s task: object detection Image classification: what is the object in this image?

More information

Todo before next class

Todo before next class Todo before next class Each project group should submit a short project report (4 pages presentation slides) including 1. Problem definition 2. Related work 3. Preliminary results 4. Future plan Submission:

More information

R-FCN: OBJECT DETECTION VIA REGION-BASED FULLY CONVOLUTIONAL NETWORKS

R-FCN: OBJECT DETECTION VIA REGION-BASED FULLY CONVOLUTIONAL NETWORKS R-FCN: OBJECT DETECTION VIA REGION-BASED FULLY CONVOLUTIONAL NETWORKS JIFENG DAI YI LI KAIMING HE JIAN SUN MICROSOFT RESEARCH TSINGHUA UNIVERSITY MICROSOFT RESEARCH MICROSOFT RESEARCH SPEED/ACCURACY TRADE-OFFS

More information

Machine Learning for Signal Processing Detecting faces (& other objects) in images

Machine Learning for Signal Processing Detecting faces (& other objects) in images Machine Learning for Signal Processing Detecting faces (& other objects) in images Class 8. 27 Sep 2016 11755/18979 1 Last Lecture: How to describe a face The typical face A typical face that captures

More information

Lecture: Deep Convolutional Neural Networks

Lecture: Deep Convolutional Neural Networks Lecture: Deep Convolutional Neural Networks Shubhang Desai Stanford Vision and Learning Lab 1 Today s agenda Deep convolutional networks History of CNNs CNN dev Architecture search 2 Previously argmax

More information

Return of the Devil in the Details: Delving Deep into Convolutional Nets

Return of the Devil in the Details: Delving Deep into Convolutional Nets Return of the Devil in the Details: Delving Deep into Convolutional Nets Ken Chatfield - Karen Simonyan - Andrea Vedaldi - Andrew Zisserman University of Oxford The Devil is still in the Details 2011 2014

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Announcements Computer Vision Lecture 16 Deep Learning Applications 11.01.2017 Seminar registration period starts on Friday We will offer a lab course in the summer semester Deep Robot Learning Topic:

More information

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION Kingsley Kuan 1, Gaurav Manek 1, Jie Lin 1, Yuan Fang 1, Vijay Chandrasekhar 1,2 Institute for Infocomm Research, A*STAR, Singapore 1 Nanyang Technological

More information

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon Deep Learning For Video Classification Presented by Natalie Carlebach & Gil Sharon Overview Of Presentation Motivation Challenges of video classification Common datasets 4 different methods presented in

More information

Content-Based Image Recovery

Content-Based Image Recovery Content-Based Image Recovery Hong-Yu Zhou and Jianxin Wu National Key Laboratory for Novel Software Technology Nanjing University, China zhouhy@lamda.nju.edu.cn wujx2001@nju.edu.cn Abstract. We propose

More information

Gender Classification Technique Based on Facial Features using Neural Network

Gender Classification Technique Based on Facial Features using Neural Network Gender Classification Technique Based on Facial Features using Neural Network Anushri Jaswante Dr. Asif Ullah Khan Dr. Bhupesh Gour Computer Science & Engineering, Rajiv Gandhi Proudyogiki Vishwavidyalaya,

More information

MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK. Wenjie Guan, YueXian Zou*, Xiaoqun Zhou

MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK. Wenjie Guan, YueXian Zou*, Xiaoqun Zhou MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK Wenjie Guan, YueXian Zou*, Xiaoqun Zhou ADSPLAB/Intelligent Lab, School of ECE, Peking University, Shenzhen,518055, China

More information

Improved Face Detection and Alignment using Cascade Deep Convolutional Network

Improved Face Detection and Alignment using Cascade Deep Convolutional Network Improved Face Detection and Alignment using Cascade Deep Convolutional Network Weilin Cong, Sanyuan Zhao, Hui Tian, and Jianbing Shen Beijing Key Laboratory of Intelligent Information Technology, School

More information

FAce detection is an important and long-standing problem in

FAce detection is an important and long-standing problem in 1 Faceness-Net: Face Detection through Deep Facial Part Responses Shuo Yang, Ping Luo, Chen Change Loy, Senior Member, IEEE and Xiaoou Tang, Fellow, IEEE arxiv:11.08393v3 [cs.cv] 25 Aug 2017 Abstract We

More information

Representation Learning using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval

Representation Learning using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval Representation Learning using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval Xiaodong Liu 12, Jianfeng Gao 1, Xiaodong He 1 Li Deng 1, Kevin Duh 2, Ye-Yi Wang 1 1

More information

Using Machine Learning for Classification of Cancer Cells

Using Machine Learning for Classification of Cancer Cells Using Machine Learning for Classification of Cancer Cells Camille Biscarrat University of California, Berkeley I Introduction Cell screening is a commonly used technique in the development of new drugs.

More information

Hide-and-Seek: Forcing a network to be Meticulous for Weakly-supervised Object and Action Localization

Hide-and-Seek: Forcing a network to be Meticulous for Weakly-supervised Object and Action Localization Hide-and-Seek: Forcing a network to be Meticulous for Weakly-supervised Object and Action Localization Krishna Kumar Singh and Yong Jae Lee University of California, Davis ---- Paper Presentation Yixian

More information

Feature-Fused SSD: Fast Detection for Small Objects

Feature-Fused SSD: Fast Detection for Small Objects Feature-Fused SSD: Fast Detection for Small Objects Guimei Cao, Xuemei Xie, Wenzhe Yang, Quan Liao, Guangming Shi, Jinjian Wu School of Electronic Engineering, Xidian University, China xmxie@mail.xidian.edu.cn

More information

1 MS Student OR 1 Undergrad Student

1 MS Student OR 1 Undergrad Student Fast face/hand detection and tracking for gesture video SVCL is interested in building a purely vision-based system to understand continuous gesture/sign language videos. One of the key component of the

More information

Cascade Region Regression for Robust Object Detection

Cascade Region Regression for Robust Object Detection Large Scale Visual Recognition Challenge 2015 (ILSVRC2015) Cascade Region Regression for Robust Object Detection Jiankang Deng, Shaoli Huang, Jing Yang, Hui Shuai, Zhengbo Yu, Zongguang Lu, Qiang Ma, Yali

More information

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601 Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601 Introduction Face ID is complicated by alterations to an individual s appearance Beard,

More information

A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task

A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task Wei Li, Jinming Min, Gareth J. F. Jones Center for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland

More information

Overview of the ImageCLEF 2016 Scalable Concept Image Annotation Task

Overview of the ImageCLEF 2016 Scalable Concept Image Annotation Task Overview of the ImageCLEF 2016 Scalable Concept Image Annotation Task Andrew Gilbert, Luca Piras, Josiah Wang*, Fei Yan*, Arnau Ramisa*, Emmanuel Dellandrea*, Robert Gaizauskas*, Mauricio Villegas & Krystian

More information

Multi-View 3D Object Detection Network for Autonomous Driving

Multi-View 3D Object Detection Network for Autonomous Driving Multi-View 3D Object Detection Network for Autonomous Driving Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia CVPR 2017 (Spotlight) Presented By: Jason Ku Overview Motivation Dataset Network Architecture

More information

Available online at ScienceDirect. Procedia Computer Science 96 (2016 )

Available online at   ScienceDirect. Procedia Computer Science 96 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 1409 1417 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems,

More information

CS 2750: Machine Learning. Neural Networks. Prof. Adriana Kovashka University of Pittsburgh April 13, 2016

CS 2750: Machine Learning. Neural Networks. Prof. Adriana Kovashka University of Pittsburgh April 13, 2016 CS 2750: Machine Learning Neural Networks Prof. Adriana Kovashka University of Pittsburgh April 13, 2016 Plan for today Neural network definition and examples Training neural networks (backprop) Convolutional

More information

Semantic Image Search. Alex Egg

Semantic Image Search. Alex Egg Semantic Image Search Alex Egg Inspiration Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classification with deep convolutional neural networks." Advances in neural information processing

More information

Face Recognition A Deep Learning Approach

Face Recognition A Deep Learning Approach Face Recognition A Deep Learning Approach Lihi Shiloh Tal Perl Deep Learning Seminar 2 Outline What about Cat recognition? Classical face recognition Modern face recognition DeepFace FaceNet Comparison

More information

SEMANTIC SEGMENTATION AVIRAM BAR HAIM & IRIS TAL

SEMANTIC SEGMENTATION AVIRAM BAR HAIM & IRIS TAL SEMANTIC SEGMENTATION AVIRAM BAR HAIM & IRIS TAL IMAGE DESCRIPTIONS IN THE WILD (IDW-CNN) LARGE KERNEL MATTERS (GCN) DEEP LEARNING SEMINAR, TAU NOVEMBER 2017 TOPICS IDW-CNN: Improving Semantic Segmentation

More information

From Keyboards to Neural Networks 从键盘到神经网络

From Keyboards to Neural Networks 从键盘到神经网络 From Keyboards to Neural Networks 从键盘到神经网络 Qcon Beijing April 21, 2018 Biye Li Team Manager, Data Technologies Automation Xiangqian Yu Team Lead, Derivatives Data What is Bloomberg? The Bloomberg Terminal

More information

Deep Learning and Its Applications

Deep Learning and Its Applications Convolutional Neural Network and Its Application in Image Recognition Oct 28, 2016 Outline 1 A Motivating Example 2 The Convolutional Neural Network (CNN) Model 3 Training the CNN Model 4 Issues and Recent

More information

BUAA AUDR at ImageCLEF 2012 Photo Annotation Task

BUAA AUDR at ImageCLEF 2012 Photo Annotation Task BUAA AUDR at ImageCLEF 2012 Photo Annotation Task Lei Huang, Yang Liu State Key Laboratory of Software Development Enviroment, Beihang University, 100191 Beijing, China huanglei@nlsde.buaa.edu.cn liuyang@nlsde.buaa.edu.cn

More information

Deep Learning Workshop. Nov. 20, 2015 Andrew Fishberg, Rowan Zellers

Deep Learning Workshop. Nov. 20, 2015 Andrew Fishberg, Rowan Zellers Deep Learning Workshop Nov. 20, 2015 Andrew Fishberg, Rowan Zellers Why deep learning? The ImageNet Challenge Goal: image classification with 1000 categories Top 5 error rate of 15%. Krizhevsky, Alex,

More information