Web-Scale Image Search and Their Applications
|
|
- Esther Bradley
- 5 years ago
- Views:
Transcription
1 Web-Scale Image Search and Their Applications Sung-Eui Yoon KAIST
2 Project Guidelines: Project Topics Any topics related to the course theme are okay You can find topics by browsing recent papers
3 Expectations Mid-term project presentation Introduce problems and explain why it is important Give an overall idea on the related work Explain what problems those existing techniques have (Optional) explain how you can address those problems Explain roles of each member
4 Expectations Final-term project presentation Cover all the materials that you talked for your mid-term project Present your ideas that can address problems of those state-of-the-art techniques Give your qualitatively (or intuitive) reasons how your ideas address them Also, explain expected benefits and drawbacks of your approach (Optional) backup your claims with quantitative results collected by some implementations Explain roles of each members
5 A few more comments Start to implement a paper, if you don t have any clear ideas While you implement it, you may get ideas about improving it Speaker Novelty of the project and idea (1 ~ 5) Practical benefits of the method (1 ~ 5) Completeness level of the project (1 ~ 5) Total score (3 ~ 15) Role of each student is clear and well balanced? (Yes or No) XXX YYY
6 Project evaluation sheet You name: ID: Score table: higher score is better. Speaker Novelty of the project and idea (1 ~ 5) Practical benefits of the method (1 ~ 5) Completeness level of the project (1 ~ 5) Total score (3 ~ 15) Role of each student is clear and well balanced? (Yes or No) XXX YYY
7 Web-Scale Visual Data and Novel Applications Visual data are widely used for various communication and, and are more widely consumed at Web and mobile devices YouTube, Facebook, Flickr, etc. Processing them requires scalable algorithms Web-scale visual data can enable new applications
8 Ack.: Zhe Lin Review: Efficient Image Search Deep Convolutional Neural Network Distance Encoded Optimized PQ
9 Object Retrieval and Localization [X. Shen et al., CVPR 2012] 9
10 Object Retrieval and Localization Local correspondence voting for non-rigid object matching g 2 f 2 g 5 f 5 g 1 f 1 f 4 f 3 Q g 4 g 3 D
11 Object Retrieval and Localization Examples of Voting Maps 11
12 Object Retrieval and Localization 12 Non-rigid cases
13 Product Image Recognition [X. Shen et al., ECCV 2012] Examples of product images in the database Examples of query images taken by mobile phones 13
14 Product Image Recognition a) A query b) DB image c) A vote map 14 d) Aggregated voting maps e) Tri map f) Segmented result
15 Product Image Recognition Images Support map Extraction GrabCut w/ manual init. 15
16 Face Detection by Image Retrieval [X. Shen et al., CVPR 2013] [H. Li et al., CVPR 2014] 16
17 Face Detection by Image Retrieval Database Images Voting Maps Aggregation By Boosting 17
18 Face Detection by Image Retrieval Example detection results 18
19 Facial Attribute Recognition transfer landmark, pose, age, gender, expression 19
20 20 Facial Attribute Recognition
21 Data-Driven Object Segmentation Find seg. examples and transfer [J. Yang et al. CVPR 2014] 21
22 Data-Driven Automatic Cropping [A. Samii et al. CGF 2015] 22
23 23 Automatic Image Tagging
24 Deep-kNN Tagging System white tiger, snow, cute keyword freq. white tiger, tiger, beast white tiger 3 snow 3 Neural Net Feature Extractor Search Engine (DOPQ) white tiger, cage, zoo snow, winter, forest knn Voting winter 2 cute 1 tiger 1 beast 1 cage 1 zoo 1 Labeled Image DB (17M) Search Index zebra, snow, winter forest 1 zebra 1 24
25 Free-Text Image Search sydney opera house 25
26 Image Recommendation: Collaborative Feature Learning from Social Media 26 [C. Fang et al. CVPR 2015]
27 Image Recommendation: Collaborative Feature Learning from Social Media 27 [C. Fang et al. CVPR 2015]
28 Image Retrieval based Image Watermarking [IWDW11] Exhaustive watermark matching Sequential one-to-one comparison Time-consuming job Image Retrieval based Image watermarking (IRIW) Reduce search domain by image search Achieve performance enhancement Large-scale Large-scaleImage Watermark Watermark Database Databaseretrievalcomparison comparison 28
29 Result Accuracy (100 tests) Precision Recall #of ( I R) #of ( R) #of ( I R) #of ( I) I : ground R :result set truth set 29
30 Scene Completion using Millions of Photographs [SIG. 07] Input image Scene Descriptor Image Collection 20 completions Context matching + blending 200 matches Hays and Efros, SIGGRAPH 2007
31 Results
32 Hays and Efros, SIGGRAPH 2007
33 Hays and Efros, SIGGRAPH 2007
34 Photo Tourism [SIG. 11] 2006 Noah Snavely
35 15,464 37,383 76, Noah Snavely
36 Photo Tourism overview Input photographs Scene reconstruction Relative camera positions and orientations Point cloud Sparse correspondence Photo Explorer 2006 Noah Snavely
37 Visual Prediction Predict possible actions by: Identify similar patches in the training videos based on NNS Propagating them in the query image 37
38 38 Summary
39 Conclusions Visual data are more widely used for various communication and are thus associated at Web Processing them requires scalable algorithms Web-scale visual data can enable new applications Examples Photo tourism Scene completion Image-retrieval based image watermarking 39
Augmented Reality. Sung-eui Yoon
Augmented Reality Sung-eui Yoon 1 Project Guidelines: Project Topics Any topics related to the course theme are okay You can find topics by browsing recent papers 2 Expectations Mid-term project presentation
More informationSingle Image Super-resolution. Slides from Libin Geoffrey Sun and James Hays
Single Image Super-resolution Slides from Libin Geoffrey Sun and James Hays Cs129 Computational Photography James Hays, Brown, fall 2012 Types of Super-resolution Multi-image (sub-pixel registration) Single-image
More informationA Systems View of Large- Scale 3D Reconstruction
Lecture 23: A Systems View of Large- Scale 3D Reconstruction Visual Computing Systems Goals and motivation Construct a detailed 3D model of the world from unstructured photographs (e.g., Flickr, Facebook)
More informationOpportunities of Scale
Opportunities of Scale 11/07/17 Computational Photography Derek Hoiem, University of Illinois Most slides from Alyosha Efros Graphic from Antonio Torralba Today s class Opportunities of Scale: Data-driven
More informationSkyFinder: Attribute-based Sky Image Search
SkyFinder: Attribute-based Sky Image Search SIGGRAPH 2009 Litian Tao, Lu Yuan, Jian Sun Kim, Wook 2016. 1. 12 Abstract Interactive search system of over a half million sky images Automatically extracted
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationDetecting and Parsing of Visual Objects: Humans and Animals. Alan Yuille (UCLA)
Detecting and Parsing of Visual Objects: Humans and Animals Alan Yuille (UCLA) Summary This talk describes recent work on detection and parsing visual objects. The methods represent objects in terms of
More informationModeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs
Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs Xiaowei Li, Changchang Wu, Christopher Zach, Svetlana Lazebnik, Jan-Michael Frahm 1 Motivation Target problem: organizing
More informationLearning to generate 3D shapes
Learning to generate 3D shapes Subhransu Maji College of Information and Computer Sciences University of Massachusetts, Amherst http://people.cs.umass.edu/smaji August 10, 2018 @ Caltech Creating 3D shapes
More informationCS5670: Computer Vision
CS5670: Computer Vision Noah Snavely Lecture 33: Recognition Basics Slides from Andrej Karpathy and Fei-Fei Li http://vision.stanford.edu/teaching/cs231n/ Announcements Quiz moved to Tuesday Project 4
More informationData-driven Depth Inference from a Single Still Image
Data-driven Depth Inference from a Single Still Image Kyunghee Kim Computer Science Department Stanford University kyunghee.kim@stanford.edu Abstract Given an indoor image, how to recover its depth information
More informationACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang
ACM MM 2010 Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang Harbin Institute of Technology National University of Singapore Microsoft Corporation Proliferation of images and videos on the Internet
More informationApparel Classifier and Recommender using Deep Learning
Apparel Classifier and Recommender using Deep Learning Live Demo at: http://saurabhg.me/projects/tag-that-apparel Saurabh Gupta sag043@ucsd.edu Siddhartha Agarwal siagarwa@ucsd.edu Apoorve Dave a1dave@ucsd.edu
More informationSky is Not the Limit: Semantic-Aware Sky Replacement
Sky is Not the Limit: Semantic-Aware Sky Replacement ACM Transactions on Graphics (SIGGRAPH), 2016 Yi-Hsuan Tsai UC Merced Xiaohui Shen Adobe Research Zhe Lin Adobe Research Kalyan Sunkavalli Adobe Research
More informationAdaDepth: Unsupervised Content Congruent Adaptation for Depth Estimation
AdaDepth: Unsupervised Content Congruent Adaptation for Depth Estimation Introduction Supplementary material In the supplementary material, we present additional qualitative results of the proposed AdaDepth
More informationGAN Related Works. CVPR 2018 & Selective Works in ICML and NIPS. Zhifei Zhang
GAN Related Works CVPR 2018 & Selective Works in ICML and NIPS Zhifei Zhang Generative Adversarial Networks (GANs) 9/12/2018 2 Generative Adversarial Networks (GANs) Feedforward Backpropagation Real? z
More informationIntroduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah
Introduction to Computer Vision Srikumar Ramalingam School of Computing University of Utah srikumar@cs.utah.edu Course Website http://www.eng.utah.edu/~cs6320/ What is computer vision? Light source 3D
More informationTracking. Hao Guan( 管皓 ) School of Computer Science Fudan University
Tracking Hao Guan( 管皓 ) School of Computer Science Fudan University 2014-09-29 Multimedia Video Audio Use your eyes Video Tracking Use your ears Audio Tracking Tracking Video Tracking Definition Given
More informationLearning from 3D Data
Learning from 3D Data Thomas Funkhouser Princeton University* * On sabbatical at Stanford and Google Disclaimer: I am talking about the work of these people Shuran Song Andy Zeng Fisher Yu Yinda Zhang
More informationLecture 7: Semantic Segmentation
Semantic Segmentation CSED703R: Deep Learning for Visual Recognition (207F) Segmenting images based on its semantic notion Lecture 7: Semantic Segmentation Bohyung Han Computer Vision Lab. bhhanpostech.ac.kr
More informationELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4. Prof. James She
ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4 Prof. James She james.she@ust.hk 1 Selected Works of Activity 4 2 Selected Works of Activity 4 3 Last lecture 4 Mid-term
More informationTri-modal Human Body Segmentation
Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4
More information3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry
3D Wikipedia: Using online text to automatically label and navigate reconstructed geometry Bryan C. Russell et al. SIGGRAPH Asia 2013 Presented by YoungBin Kim 2014. 01. 10 Abstract Produce annotated 3D
More informationTwo Routes for Image to Image Translation: Rule based vs. Learning based. Minglun Gong, Memorial Univ. Collaboration with Mr.
Two Routes for Image to Image Translation: Rule based vs. Learning based Minglun Gong, Memorial Univ. Collaboration with Mr. Zili Yi Introduction A brief history of image processing Image to Image translation
More informationLearning bottom-up visual processes using automatically generated ground truth data
Learning bottom-up visual processes using automatically generated ground truth data Kalle Åström, Yubin Kuang, Magnus Oskarsson, Lars Kopp an Martin Byröd Mathematical Imaging Group Center for Mathematical
More informationDescribable Visual Attributes for Face Verification and Image Search
Advanced Topics in Multimedia Analysis and Indexing, Spring 2011, NTU. 1 Describable Visual Attributes for Face Verification and Image Search Kumar, Berg, Belhumeur, Nayar. PAMI, 2011. Ryan Lei 2011/05/05
More informationPhoto Tourism: Exploring Photo Collections in 3D
Photo Tourism: Exploring Photo Collections in 3D SIGGRAPH 2006 Noah Snavely Steven M. Seitz University of Washington Richard Szeliski Microsoft Research 2006 2006 Noah Snavely Noah Snavely Reproduced with
More informationBilinear Models for Fine-Grained Visual Recognition
Bilinear Models for Fine-Grained Visual Recognition Subhransu Maji College of Information and Computer Sciences University of Massachusetts, Amherst Fine-grained visual recognition Example: distinguish
More informationarxiv: v1 [cs.mm] 12 Jan 2016
Learning Subclass Representations for Visually-varied Image Classification Xinchao Li, Peng Xu, Yue Shi, Martha Larson, Alan Hanjalic Multimedia Information Retrieval Lab, Delft University of Technology
More informationarxiv: v1 [cs.cv] 19 May 2018
Long-term face tracking in the wild using deep learning Kunlei Zhang a, Elaheh Barati a, Elaheh Rashedi a,, Xue-wen Chen a a Dept. of Computer Science, Wayne State University, Detroit, MI arxiv:1805.07646v1
More informationInternet of things that video
Video recognition from a sentence Cees Snoek Intelligent Sensory Information Systems Lab University of Amsterdam The Netherlands Internet of things that video 45 billion cameras by 2022 [LDV Capital] 2
More informationAn Efficient Methodology for Image Rich Information Retrieval
An Efficient Methodology for Image Rich Information Retrieval 56 Ashwini Jaid, 2 Komal Savant, 3 Sonali Varma, 4 Pushpa Jat, 5 Prof. Sushama Shinde,2,3,4 Computer Department, Siddhant College of Engineering,
More informationRequesting changes to the grades, please write to me in an with descriptions. Move my office hours to the morning. 11am - 12:30pm Thursdays
Lab1 graded. Requesting changes to the grades, please write to me in an email with descriptions. Move my office hours to the morning. 11am - 12:30pm Thursdays Internet-scale texture synthesis With slides
More informationRecognition Tools: Support Vector Machines
CS 2770: Computer Vision Recognition Tools: Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh January 12, 2017 Announcement TA office hours: Tuesday 4pm-6pm Wednesday 10am-12pm Matlab
More informationExploring the Structure of Data at Scale. Rudy Agovic, PhD CEO & Chief Data Scientist at Reliancy January 16, 2019
Exploring the Structure of Data at Scale Rudy Agovic, PhD CEO & Chief Data Scientist at Reliancy January 16, 2019 Outline Why exploration of large datasets matters Challenges in working with large data
More informationDeep Incremental Scene Understanding. Federico Tombari & Christian Rupprecht Technical University of Munich, Germany
Deep Incremental Scene Understanding Federico Tombari & Christian Rupprecht Technical University of Munich, Germany C. Couprie et al. "Toward Real-time Indoor Semantic Segmentation Using Depth Information"
More informationWhat was Monet seeing while painting? Translating artworks to photo-realistic images M. Tomei, L. Baraldi, M. Cornia, R. Cucchiara
What was Monet seeing while painting? Translating artworks to photo-realistic images M. Tomei, L. Baraldi, M. Cornia, R. Cucchiara COMPUTER VISION IN THE ARTISTIC DOMAIN The effectiveness of Computer Vision
More informationarxiv: v1 [cs.cv] 16 Nov 2015
Coarse-to-fine Face Alignment with Multi-Scale Local Patch Regression Zhiao Huang hza@megvii.com Erjin Zhou zej@megvii.com Zhimin Cao czm@megvii.com arxiv:1511.04901v1 [cs.cv] 16 Nov 2015 Abstract Facial
More informationYelp Restaurant Photo Classification
Yelp Restaurant Photo Classification Rajarshi Roy Stanford University rroy@stanford.edu Abstract The Yelp Restaurant Photo Classification challenge is a Kaggle challenge that focuses on the problem predicting
More informationCourse Administration
Course Administration Project 2 results are online Project 3 is out today The first quiz is a week from today (don t panic!) Covers all material up to the quiz Emphasizes lecture material NOT project topics
More informationNoah Snavely Steven M. Seitz. Richard Szeliski. University of Washington. Microsoft Research. Modified from authors slides
Photo Tourism: Exploring Photo Collections in 3D Noah Snavely Steven M. Seitz University of Washington Richard Szeliski Microsoft Research 2006 2006 Noah Snavely Noah Snavely Modified from authors slides
More informationPhoto Tourism: Exploring Photo Collections in 3D
Click! Click! Oooo!! Click! Zoom click! Click! Some other camera noise!! Photo Tourism: Exploring Photo Collections in 3D Click! Click! Ahhh! Click! Click! Overview of Research at Microsoft, 2007 Jeremy
More informationLearning Visual Semantics: Models, Massive Computation, and Innovative Applications. Tutorial at CVPR 2014 June 23rd, 1:00pm-5:00pm, Columbus, OH
Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Tutorial at CVPR 2014 June 23rd, 1:00pm-5:00pm, Columbus, OH Introduction Instructors: Shih-Fu Chang John Smith Rogerio
More informationSparse 3D Model Reconstruction from photographs
Sparse 3D Model Reconstruction from photographs Kanchalai Suveepattananont ksuvee@seas.upenn.edu Professor Norm Badler badler@seas.upenn.edu Advisor Aline Normoyle alinen@seas.upenn.edu Abstract Nowaday,
More informationExploiting noisy web data for largescale visual recognition
Exploiting noisy web data for largescale visual recognition Lamberto Ballan University of Padova, Italy CVPRW WebVision - Jul 26, 2017 Datasets drive computer vision progress ImageNet Slide credit: O.
More informationJOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA
JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS Zhao Chen Machine Learning Intern, NVIDIA ABOUT ME 5th year PhD student in physics @ Stanford by day, deep learning computer vision scientist
More informationSocial Tourism Using Photos
Social Tourism Using Photos Mark Lenz Abstract Mobile phones are frequently used to determine one's location and navigation directions, and they can also provide a platform for connecting tourists with
More information3 Object Detection. BVM 2018 Tutorial: Advanced Deep Learning Methods. Paul F. Jaeger, Division of Medical Image Computing
3 Object Detection BVM 2018 Tutorial: Advanced Deep Learning Methods Paul F. Jaeger, of Medical Image Computing What is object detection? classification segmentation obj. detection (1 label per pixel)
More informationCost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling
[DOI: 10.2197/ipsjtcva.7.99] Express Paper Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling Takayoshi Yamashita 1,a) Takaya Nakamura 1 Hiroshi Fukui 1,b) Yuji
More informationPart-based and local feature models for generic object recognition
Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza
More informationBeyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets
More informationCan Similar Scenes help Surface Layout Estimation?
Can Similar Scenes help Surface Layout Estimation? Santosh K. Divvala, Alexei A. Efros, Martial Hebert Robotics Institute, Carnegie Mellon University. {santosh,efros,hebert}@cs.cmu.edu Abstract We describe
More informationPicture-in-Picture. Video-in-Video
Video-in-Video MediaTek Technology White Paper May 2015 2015 MediaTek Inc. Introducing Picture-in-Picture MediaTek s Picture-In-Picture and Video-In-Video technology is a customer-appealing camera feature.
More informationPhoto Tourism: Exploring Photo Collections in 3D
Photo Tourism: Exploring Photo Collections in 3D Noah Snavely Steven M. Seitz University of Washington Richard Szeliski Microsoft Research 15,464 37,383 76,389 2006 Noah Snavely 15,464 37,383 76,389 Reproduced
More informationUnderstanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17
Understanding Faces Detection, Recognition, and 12/5/17 Transformation of Faces Lucas by Chuck Close Chuck Close, self portrait Some slides from Amin Sadeghi, Lana Lazebnik, Silvio Savarese, Fei-Fei Li
More informationPointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space Sikai Zhong February 14, 2018 COMPUTER SCIENCE Table of contents 1. PointNet 2. PointNet++ 3. Experiments 1 PointNet Property
More informationInteractive Image Search with Attributes
Interactive Image Search with Attributes Adriana Kovashka Department of Computer Science January 13, 2015 Joint work with Kristen Grauman and Devi Parikh We Need Search to Access Visual Data 144,000 hours
More informationDepth from Stereo. Dominic Cheng February 7, 2018
Depth from Stereo Dominic Cheng February 7, 2018 Agenda 1. Introduction to stereo 2. Efficient Deep Learning for Stereo Matching (W. Luo, A. Schwing, and R. Urtasun. In CVPR 2016.) 3. Cascade Residual
More informationStereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33
Stereo Epipolar Geometry for General Cameras Sanja Fidler CSC420: Intro to Image Understanding 1 / 33 Stereo Epipolar geometry Case with two cameras with parallel optical axes General case Now this Sanja
More informationFace Recognition A Deep Learning Approach
Face Recognition A Deep Learning Approach Lihi Shiloh Tal Perl Deep Learning Seminar 2 Outline What about Cat recognition? Classical face recognition Modern face recognition DeepFace FaceNet Comparison
More informationWISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley
WISE: Large Scale Content Based Web Image Search Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley 1 A picture is worth a thousand words. Query by Images What leaf?
More informationSwitchable Temporal Propagation Network
Switchable Temporal Propagation Network Sifei Liu 1, Guangyu Zhong 1,3, Shalini De Mello 1, Jinwei Gu 1 Varun Jampani 1, Ming-Hsuan Yang 2, Jan Kautz 1 1 NVIDIA, 2 UC Merced, 3 Dalian University of Technology
More informationLocation Recognition using Prioritized Feature Matching
Location Recognition using Prioritized Feature Matching Yunpeng Li Noah Snavely Daniel P. Huttenlocher Department of Computer Science, Cornell University, Ithaca, NY 14853 {yuli,snavely,dph}@cs.cornell.edu
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:
Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,
More informationTag Based Image Search by Social Re-ranking
Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule
More informationFast Indexing and Search. Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation
Fast Indexing and Search Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation Motivation Object categorization? http://www.cs.utexas.edu/~grauman/slides/jain_et_al_cvpr2008.ppt Motivation
More informationDeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs Zhipeng Yan, Moyuan Huang, Hao Jiang 5/1/2017 1 Outline Background semantic segmentation Objective,
More informationStoryline Reconstruction for Unordered Images
Introduction: Storyline Reconstruction for Unordered Images Final Paper Sameedha Bairagi, Arpit Khandelwal, Venkatesh Raizaday Storyline reconstruction is a relatively new topic and has not been researched
More informationShifting from Naming to Describing: Semantic Attribute Models. Rogerio Feris, June 2014
Shifting from Naming to Describing: Semantic Attribute Models Rogerio Feris, June 2014 Recap Large-Scale Semantic Modeling Feature Coding and Pooling Low-Level Feature Extraction Training Data Slide credit:
More informationAN EFFICIENT PROCESSING OF WEBPAGE METADATA AND DOCUMENTS USING ANNOTATION Sabna N.S 1, Jayaleshmi S 2
AN EFFICIENT PROCESSING OF WEBPAGE METADATA AND DOCUMENTS USING ANNOTATION Sabna N.S 1, Jayaleshmi S 2 1 M.Tech Scholar, Dept of CSE, LBSITW, Poojappura, Thiruvananthapuram sabnans1988@gmail.com 2 Associate
More informationFace Detection and Tracking Control with Omni Car
Face Detection and Tracking Control with Omni Car Jheng-Hao Chen, Tung-Yu Wu CS 231A Final Report June 31, 2016 Abstract We present a combination of frontal and side face detection approach, using deep
More informationHuman Detection and Tracking for Video Surveillance: A Cognitive Science Approach
Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach Vandit Gajjar gajjar.vandit.381@ldce.ac.in Ayesha Gurnani gurnani.ayesha.52@ldce.ac.in Yash Khandhediya khandhediya.yash.364@ldce.ac.in
More informationAn Exploration of Computer Vision Techniques for Bird Species Classification
An Exploration of Computer Vision Techniques for Bird Species Classification Anne L. Alter, Karen M. Wang December 15, 2017 Abstract Bird classification, a fine-grained categorization task, is a complex
More informationMatching and Predicting Street Level Images
Matching and Predicting Street Level Images Biliana Kaneva 1, Josef Sivic 2, Antonio Torralba 1, Shai Avidan 3, and William T. Freeman 1 1 Massachusetts Institute of Technology {biliana,torralba,billf}@csail.mit.edu
More informationA THREE LAYERED MODEL TO PERFORM CHARACTER RECOGNITION FOR NOISY IMAGES
INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONSAND ROBOTICS ISSN 2320-7345 A THREE LAYERED MODEL TO PERFORM CHARACTER RECOGNITION FOR NOISY IMAGES 1 Neha, 2 Anil Saroliya, 3 Varun Sharma 1,
More informationECE 6554:Advanced Computer Vision Pose Estimation
ECE 6554:Advanced Computer Vision Pose Estimation Sujay Yadawadkar, Virginia Tech, Agenda: Pose Estimation: Part Based Models for Pose Estimation Pose Estimation with Convolutional Neural Networks (Deep
More informationDOMAIN-ADAPTIVE GENERATIVE ADVERSARIAL NETWORKS FOR SKETCH-TO-PHOTO INVERSION
DOMAIN-ADAPTIVE GENERATIVE ADVERSARIAL NETWORKS FOR SKETCH-TO-PHOTO INVERSION Yen-Cheng Liu 1, Wei-Chen Chiu 2, Sheng-De Wang 1, and Yu-Chiang Frank Wang 1 1 Graduate Institute of Electrical Engineering,
More informationINFORMATION MANAGEMENT FOR SEMANTIC REPRESENTATION IN RANDOM FOREST
International Journal of Computer Engineering and Applications, Volume IX, Issue VIII, August 2015 www.ijcea.com ISSN 2321-3469 INFORMATION MANAGEMENT FOR SEMANTIC REPRESENTATION IN RANDOM FOREST Miss.Priyadarshani
More informationPerceiving the 3D World from Images and Videos. Yu Xiang Postdoctoral Researcher University of Washington
Perceiving the 3D World from Images and Videos Yu Xiang Postdoctoral Researcher University of Washington 1 2 Act in the 3D World Sensing & Understanding Acting Intelligent System 3D World 3 Understand
More informationThe Analysis of Faces in Brains and Machines
CS 332 Visual Processing in Computer and Biological Vision Systems The Analysis of Faces in Brains and Machines Paula Johnson Elizabeth Warren HMAX model Why is face analysis important? Remember/recognize
More informationPerson Re-identification for Improved Multi-person Multi-camera Tracking by Continuous Entity Association
Person Re-identification for Improved Multi-person Multi-camera Tracking by Continuous Entity Association Neeti Narayan, Nishant Sankaran, Devansh Arpit, Karthik Dantu, Srirangaraj Setlur, Venu Govindaraju
More informationGeneric Face Alignment Using an Improved Active Shape Model
Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn
More informationMulti-View 3D Object Detection Network for Autonomous Driving
Multi-View 3D Object Detection Network for Autonomous Driving Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia CVPR 2017 (Spotlight) Presented By: Jason Ku Overview Motivation Dataset Network Architecture
More informationLearning 6D Object Pose Estimation and Tracking
Learning 6D Object Pose Estimation and Tracking Carsten Rother presented by: Alexander Krull 21/12/2015 6D Pose Estimation Input: RGBD-image Known 3D model Output: 6D rigid body transform of object 21/12/2015
More informationImage correspondences and structure from motion
Image correspondences and structure from motion http://graphics.cs.cmu.edu/courses/15-463 15-463, 15-663, 15-862 Computational Photography Fall 2017, Lecture 20 Course announcements Homework 5 posted.
More informationClass 5: Attributes and Semantic Features
Class 5: Attributes and Semantic Features Rogerio Feris, Feb 21, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch Project
More informationSrikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah
School of Computing University of Utah Presentation Outline 1 2 3 Forward Projection (Reminder) u v 1 KR ( I t ) X m Y m Z m 1 Backward Projection (Reminder) Q K 1 q Presentation Outline 1 2 3 Sample Problem
More informationDOMAIN-ADAPTIVE GENERATIVE ADVERSARIAL NETWORKS FOR SKETCH-TO-PHOTO INVERSION
2017 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, SEPT. 25 28, 2017, TOKYO, JAPAN DOMAIN-ADAPTIVE GENERATIVE ADVERSARIAL NETWORKS FOR SKETCH-TO-PHOTO INVERSION Yen-Cheng Liu 1,
More informationMachine Learning with Python
DEVNET-2163 Machine Learning with Python Dmitry Figol, SE WW Enterprise Sales @dmfigol Cisco Spark How Questions? Use Cisco Spark to communicate with the speaker after the session 1. Find this session
More informationLarge scale object/scene recognition
Large scale object/scene recognition Image dataset: > 1 million images query Image search system ranked image list Each image described by approximately 2000 descriptors 2 10 9 descriptors to index! Database
More information3D Shape Analysis with Multi-view Convolutional Networks. Evangelos Kalogerakis
3D Shape Analysis with Multi-view Convolutional Networks Evangelos Kalogerakis 3D model repositories [3D Warehouse - video] 3D geometry acquisition [KinectFusion - video] 3D shapes come in various flavors
More informationHomographies and RANSAC
Homographies and RANSAC Computer vision 6.869 Bill Freeman and Antonio Torralba March 30, 2011 Homographies and RANSAC Homographies RANSAC Building panoramas Phototourism 2 Depth-based ambiguity of position
More informationVisual Recognition and Search April 18, 2008 Joo Hyun Kim
Visual Recognition and Search April 18, 2008 Joo Hyun Kim Introduction Suppose a stranger in downtown with a tour guide book?? Austin, TX 2 Introduction Look at guide What s this? Found Name of place Where
More informationPainting-to-3D Model Alignment Via Discriminative Visual Elements. Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA)
Painting-to-3D Model Alignment Via Discriminative Visual Elements Mathieu Aubry (INRIA) Bryan Russell (Intel) Josef Sivic (INRIA) This work Browsing across time 1699 1740 1750 1800 1800 1699 1740 1750
More informationFisher House Foundation. Communications Presentation Friends of Fisher House Conference 2017
Fisher House Foundation Communications Presentation Friends of Fisher House Conference 2017 Sequence of Topics Communications Overview/Update Branding & Fisher House Website Using Fisher House Resources
More informationConnecting relevant video content to audiences CREDENTIALS DECK
Connecting relevant video content to audiences CREDENTIALS DECK 1 Hello, We re Vilynx We've developed machine learning technology to drive video discovery and engagement. US company, headquarters in Palo
More informationKnow your neighbours: Machine Learning on Graphs
Know your neighbours: Machine Learning on Graphs Andrew Docherty Senior Research Engineer andrew.docherty@data61.csiro.au www.data61.csiro.au 2 Graphs are Everywhere Online Social Networks Transportation
More informationTriangle Method for Fast Face Detection on the Wild
Journal of Multimedia Information System VOL. 5, NO. 1, March 2018 (pp. 15-20): ISSN 2383-7632(Online) http://dx.doi.org/10.9717/jmis.2018.5.1.15 Triangle Method for Fast Face Detection on the Wild Karimov
More informationDeep Face Recognition. Nathan Sun
Deep Face Recognition Nathan Sun Why Facial Recognition? Picture ID or video tracking Higher Security for Facial Recognition Software Immensely useful to police in tracking suspects Your face will be an
More informationLarge-scale Video Classification with Convolutional Neural Networks
Large-scale Video Classification with Convolutional Neural Networks Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei Note: Slide content mostly from : Bay Area
More information