Visual Search: 3 Levels of Real-Time Feedback
|
|
- Juniper Casey
- 5 years ago
- Views:
Transcription
1 Visual Search: 3 Levels of Real-Time Feedback Prof. Shih-Fu Chang Department of Electrical Engineering Digital Video and Multimedia Lab
2 A lot of work on Image Classification Analysis models Anchor Snow Soccer Building Outdoor Audio-visual features Geo, social, camera metadata User/mission context Rich semantic labels 2
3 Object Detection (PASCAL VOC) aeroplane bird bicycle boat train Person chair car cat bottle dining table tv/monitor cow dog horse potted plant motorbike bus sheep sofa
4 Research community growing fast! TRECVID Data domain amount types Lexicon size Broadcast news, documentary, surveillance, Internet, 400 hours Sound & Vision 170 hours Television News 100 hours BBC rushes (130,000+ subshots) video shots, keyframes 10 (2004, 2005) 39 (2006, 2007) 20 (2008) 130 (2010) LSCOM Broadcast news 170 hours Broadcast News video concepts CalTech256 Internet Images 30,607 images images 256 classes PASCAL Internet Images 9,963 images 24,640 annotated objects images, objects (as of Nov. 2009) 20 classes Tiny Image Internet Images 80,000,000 tiny images (32x32) images 75,378 WordNet nouns LabelMe Internet and UGC conten 30,369 images from 183 folders images, keyframes 111,490 object labels ImageNet Internet images 9,386,073 images images 14,847 WordNet synsets Lotus Hill Dataset Internet Images 500,000+ images and keyframes images, keyframes 280 object classes
5 CuZero: 400+ visual classifier models concept detection models: objects, people, location, scenes, events, etc airplane airplane_takeoff airport_or_airfieldarmed_personbuilding car cityscape crowd desert dirt_gravel_road entertainment explosion_fire foresthighway hospital insurgents landscape maps military military_base military_personnelmountainnighttime peoplemarching person powerplants riot river road rpg shooting smoke tanksurban vegetation vehicle waterscape_waterfront weaponsweather Columbia Runs Evaluation of 20 concepts at TRECVID 2008
6 TRECVID: Concept Detection Examples Top five classification results Classroom Demonstration Or Protest Cityscape Airplane flying Singing
7 Problem: User Gap Given a new search target, users have difficulty in choosing appropriate concept classifiers Find shots of something burning with flames visible Which classifiers to use? Which classifiers work? hundreds of classifiers car outdoor urban fire building airplane car crash road person Explosion
8 1 Specific exemplar Problem: Specific example vs. Generic concept Query: find person running around a building car Running vehicle building person road walking loading urban 2 Generic concept
9 Pains of Frustrated Users User forced to take one shot searches, iterating queries with a trial and error approach...
10 Semantic Search Target (query topic) C Concept Mapping olumbia 2 u Zero 1 Instant Query Concept Mapping Real-Time Navigation & Feedback Multi-Concept Navigation Concept Visualization niversity Real-time Query Manipulation Video Content (with metadata) content analytics thousands of semantic concepts car urban outdoo fire r building airplane car crash road sky person user CuZero Search Engine Concept Pool (Zavesky and Chang, Multimedia Info Retrieval MIR 08)
11 Instant Feedback: Instant visual concept suggestion Instant Classifier Suggestion query time concept mining S.-F. Chang, Columbia U. 11
12 Instant Feedback: Mapping formulate query refine Precise specification of search anchors Refine query by sliding inspector through the map Intuitive control: Closer to anchor is more like it Increases exploration breadth, instead of single list browsing Instant result display for each location, without new query computation anchor inspector anchor weights Road Car Outdoor Explosio n Fire 12
13 Demos Find lake front buildings in the Central Park Find person walking around building Find a car on a road in a snowy condition Find urban explosion scenes in UAV videos 13
14 Instant Feedback of Query Manipulation Multi-modal query input Visual examples (motion track, object/image samples) >350 object/scene models Textual, geo-spatio-temporal cues running 1 running 2 road fire smoke Real-Time Sliding Query Panel 2D sliding panel for arbitrary query manipulation Instant query result update after refinement Intuitive feedback for query weight adjustment Distributed, lightweight environment Distributed client-server approach for fast deployment with large data archive. Prototypesof 200+ hours of TRECVID videos and VIRAT videos Smoke Road Explosion Running 1 Running 2 Query refinement
15 User feedback can be used in many other ways 1 2 Query Formulation Query Processing 3 Results Visual Examples Classifiers Cues/constraints Feature Selection Distance Metric Ranking Model Updated results Handle novel data Online Update/Rerank New Classifiers Relevance Feedback ( track, event, activity) Query Log
16 Graph-based feedback propagation Propagate user feedback to larger collection Input samples with sparse labels Label propagation on graph Label inference results Positive Negative Unlabeled Positive Negative - - graph - - graph node - - weight matrix - - risk function - - classification - - label matrix
17 A hot topic in Machine Learning Given initial labels, Y, find classification function F over graph nodes Label smoothness Fit known labels (Zhou, et al NIPS04) Gaussian fields & Harmonic functions (Zhu et al ICML03) 17
18 Many Challenging Issues LGC Method GFHF Method Unbalanced Labels Bad Label Locations Noisy Labels S.-F. Chang, Columbia U
19 Graph Transduction via Alternating Minimization (GTAM) (Wang, Jebara, Chang, ICML08) (Wang and Chang, CVPR09) -- Bivariate Optimization over Labels (Y) and Prediction (F) Propagation Step Given label (Y), propagate over graph, predict F Label Selection Step Iteratively add good labels or remove bad labels 19 (TAG demo)
20 Iteration # 2 Initial Labels Label Diagnosis and Self Tuning (Wang and Chang CVPR 09) Iteration # 4 Iteration # 6 The values of the cost function Q during optimization procedure of LDST and GTAM methods.
21 Use Pseudo-Feedback when user not available Google Search Statue of Liberty Keyword Search Flickr Photo Site Top images as + Bottom imgs as - Label Diagnosis Diffusion
22 Use Pseudo-Feedback when user not available Rerank Keyword Search Flickr Photo Site Top images as + Bottom imgs as - Label Diagnosis Diffusion
23 Use Pseudo-Feedback when user not available Google Search Tiger Keyword Search Flickr Photo Site Top images as + Bottom imgs as - Label Diagnosis Diffusion
24 Use Pseudo-Feedback when user not available Rerank Keyword Search Flickr Photo Site Top images as + Bottom imgs as - Label Diagnosis Diffusion
25 Feedback at a faster time scale via Brain State Signaling Human Vision: Superb by quick gist in the Blink of an Eye (Hubel, 1995) Joint work with Paul Sajda s group 25
26 Hybrid Computer-Human Vision System Goal: optimally integrate neuro-vision and computer vision/machine learning to maximize information throughput and retrieval accuracy of image content 26
27 The Intention Readout Experiment User thinks about what he/she wants to search Database (any target that may interest users) 27
28 The Paradigm Neural (EEG) decoder Interest-scores Database 28
29 The Paradigm Neural (EEG) decoder Exemplar labels (noisy) Features from the entire DB Database Semi-supervised computer vision retrieval prediction score 29
30 The Paradigm Pre-triage Post-triage 30
31 The Paradigm Human inspects only a small sample set via BCI Machine filters out noise and retrieves targets from very large DB General: no predefined target models, no keyword High Throughput: neuro-vision as bootstrap of fast computer vision Pre-triage Post-triage 31
32 Identifying Discriminative Components in the EEG Using Single-Trial Analysis LDA or Logistic Regression is used to learn the contributions of EEG signal components at different spatial-temporal locations (Parra, Sajda et al. 2002, 2003) Optimal spatial filtering across electrodes within each short window (e.g., 100ms) Optimal temporal filtering over time windows after onset 32
33 EEG Feedback and Manifold Learning Opportunities and Issues: EEG results used as exemplar feedback indicating user interest Propagate interest scores over manifolds in the image space Challenge: EEG labels are noisy and limited No prior knowledge about target models 33
34 Experiments CalTech101: 3798 images from 62 categories Satellite images Generic neural decoder trained per user using images (Soccer Ball or Baseball Gloves) from Caltech256 A subset images randomly sampled to construct 6-Hz RSVP sequence Initial Trials: 4 subjects, 3 targets (Dalmatian, Chandelier/Menorah, & Starfish) 34
35 Example results Top 20 results of Neural EEG detection Top 20 results of Hybrid System (BCI-VPM) 35
36 Retrieval on Satellite Imagery The experimental results of helipad target RSVP, showing the top 20 ranked images. a) ranking by original EEG scores; b) ranking by the BCI-VPM refined interest score. 36
37 Conclusions Instant feedback and refinement tools improve the utility of small imperfect classifier pools As shown in CuZero search system Relevance feedback techniques maximizes the utility of limited imperfect user input As shown in graph-based propagation Other forms of relevance feedback Pseudo feedback from Web search Feedback from neuro state decoding digital video multimedia lab
38 References (many papers can be found at ) E. Zavesky, S.-F. Chang, CuZero: Embracing the Frontier of Interactive Visual Search for Informed Users, ACM Multimedia Information Retrieval, Y.-G. Jiang, A.Yanagawa, S.-F.Chang, C.-W.Ngo, CU-VIREO374: Fusing Columbia374 and VIREO374 for Large Scale Semantic Concept Detection, ADVENT Technical Report # Columbia University, August ( J. Wang, E. Pohlmeyer, B. Hanna, Y.-G. Jiang, P. Sajda, and S.-F. Chang, Brain State Decoding for Rapid Image Retrieval, ACM Multimedia Conference, Jun Wang, Yu-Gang Jiang, Shih-Fu Chang, Label Diagnosis through Self Tuning for Web Image Search, CVPR Jun Wang, Sanjiv Kumar, Shih-Fu Chang, Semi-Supervised Hashing for Scalable Image Retrieval, CVPR Junfeng He, Wei Liu, Shih-Fu Chang, Scalable Similarity Search with Optimized Kernel Hashing, ACM SIGKDD, Wei Liu, Junfeng He, Shih-Fu Chang, Large Graph Construction for Scalable Semi-Supervised Learning, ICML 2010.
Graph-based Techniques for Searching Large-Scale Noisy Multimedia Data
Graph-based Techniques for Searching Large-Scale Noisy Multimedia Data Shih-Fu Chang Department of Electrical Engineering Department of Computer Science Columbia University Joint work with Jun Wang (IBM),
More informationVisual Search. Shih-Fu Chang. Department of Electrical Engineering Columbia University. Boston University ECE Lecture, September 2010
Visual Search Shih-Fu Chang Department of Electrical Engineering Columbia University Boston University ECE Lecture, September 2010 (by Mac Funamizu) Many pictures are taken everyday/everywhere... Barack
More informationEE 6882 Visual Search Engine
EE 6882 Visual Search Engine Relevance Feedback March 5 th, 2012 Lecture #7 Graph Based Semi Supervised Learning Application of Image Matching: Manipulation Detection What We have Learned Image Representation
More informationColumbia University High-Level Feature Detection: Parts-based Concept Detectors
TRECVID 2005 Workshop Columbia University High-Level Feature Detection: Parts-based Concept Detectors Dong-Qing Zhang, Shih-Fu Chang, Winston Hsu, Lexin Xie, Eric Zavesky Digital Video and Multimedia Lab
More informationObject Detection Based on Deep Learning
Object Detection Based on Deep Learning Yurii Pashchenko AI Ukraine 2016, Kharkiv, 2016 Image classification (mostly what you ve seen) http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf
More informationFine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task
Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task Kyunghee Kim Stanford University 353 Serra Mall Stanford, CA 94305 kyunghee.kim@stanford.edu Abstract We use a
More informationAlternating Minimization. Jun Wang, Tony Jebara, and Shih-fu Chang
Graph Transduction via Alternating Minimization Jun Wang, Tony Jebara, and Shih-fu Chang 1 Outline of the presentation Brief introduction and related work Problems with Graph Labeling Imbalanced labels
More informationMULTIMODAL SEMI-SUPERVISED IMAGE CLASSIFICATION BY COMBINING TAG REFINEMENT, GRAPH-BASED LEARNING AND SUPPORT VECTOR REGRESSION
MULTIMODAL SEMI-SUPERVISED IMAGE CLASSIFICATION BY COMBINING TAG REFINEMENT, GRAPH-BASED LEARNING AND SUPPORT VECTOR REGRESSION Wenxuan Xie, Zhiwu Lu, Yuxin Peng and Jianguo Xiao Institute of Computer
More informationCPSC340. State-of-the-art Neural Networks. Nando de Freitas November, 2012 University of British Columbia
CPSC340 State-of-the-art Neural Networks Nando de Freitas November, 2012 University of British Columbia Outline of the lecture This lecture provides an overview of two state-of-the-art neural networks:
More informationLeveraging flickr images for object detection
Leveraging flickr images for object detection Elisavet Chatzilari Spiros Nikolopoulos Yiannis Kompatsiaris Outline Introduction to object detection Our proposal Experiments Current research 2 Introduction
More informationAttributes. Computer Vision. James Hays. Many slides from Derek Hoiem
Many slides from Derek Hoiem Attributes Computer Vision James Hays Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed. Human-in-the-loop recognition:
More informationBeyond Sliding Windows: Object Localization by Efficient Subwindow Search
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search Christoph H. Lampert, Matthew B. Blaschko, & Thomas Hofmann Max Planck Institute for Biological Cybernetics Tübingen, Germany Google,
More informationColumbia University TRECVID-2005 Video Search and High-Level Feature Extraction
Columbia University TRECVID-2005 Video Search and High-Level Feature Extraction Shih-Fu Chang, Winston Hsu, Lyndon Kennedy, Lexing Xie, Akira Yanagawa, Eric Zavesky, Dong-Qing Zhang Digital Video and Multimedia
More information24 hours of Photo Sharing. installation by Erik Kessels
24 hours of Photo Sharing installation by Erik Kessels And sometimes Internet photos have useful labels Im2gps. Hays and Efros. CVPR 2008 But what if we want more? Image Categorization Training Images
More informationMultimedia Information Retrieval The case of video
Multimedia Information Retrieval The case of video Outline Overview Problems Solutions Trends and Directions Multimedia Information Retrieval Motivation With the explosive growth of digital media data,
More informationover Multi Label Images
IBM Research Compact Hashing for Mixed Image Keyword Query over Multi Label Images Xianglong Liu 1, Yadong Mu 2, Bo Lang 1 and Shih Fu Chang 2 1 Beihang University, Beijing, China 2 Columbia University,
More informationDeep condolence to Professor Mark Everingham
Deep condolence to Professor Mark Everingham Towards VOC2012 Object Classification Challenge Generalized Hierarchical Matching for Sub-category Aware Object Classification National University of Singapore
More informationUnified, real-time object detection
Unified, real-time object detection Final Project Report, Group 02, 8 Nov 2016 Akshat Agarwal (13068), Siddharth Tanwar (13699) CS698N: Recent Advances in Computer Vision, Jul Nov 2016 Instructor: Gaurav
More informationAnalysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009
Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context
More informationCategory-level localization
Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object
More informationColumbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search
Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search Shih-Fu Chang 1, Junfeng He 1, Yu-Gang Jiang 1,2, Elie El Khoury 3, Chong-Wah Ngo 2, Akira Yanagawa
More informationEVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari
EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it What is an Event? Dictionary.com definition: something that occurs in a certain place during a particular
More informationThe Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System
The Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System Our first participation on the TRECVID workshop A. F. de Araujo 1, F. Silveira 2, H. Lakshman 3, J. Zepeda 2, A. Sheth 2, P. Perez
More informationJoint Inference in Image Databases via Dense Correspondence. Michael Rubinstein MIT CSAIL (while interning at Microsoft Research)
Joint Inference in Image Databases via Dense Correspondence Michael Rubinstein MIT CSAIL (while interning at Microsoft Research) My work Throughout the year (and my PhD thesis): Temporal Video Analysis
More informationACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang
ACM MM 2010 Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang Harbin Institute of Technology National University of Singapore Microsoft Corporation Proliferation of images and videos on the Internet
More informationExtend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network
Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network Liwen Zheng, Canmiao Fu, Yong Zhao * School of Electronic and Computer Engineering, Shenzhen Graduate School of
More informationPASCAL VOC Classification: Local Features vs. Deep Features. Shuicheng YAN, NUS
PASCAL VOC Classification: Local Features vs. Deep Features Shuicheng YAN, NUS PASCAL VOC Why valuable? Multi-label, Real Scenarios! Visual Object Recognition Object Classification Object Detection Object
More informationLarge-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds
Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds Sudheendra Vijayanarasimhan Kristen Grauman Department of Computer Science University of Texas at Austin Austin,
More informationCS 1674: Intro to Computer Vision. Attributes. Prof. Adriana Kovashka University of Pittsburgh November 2, 2016
CS 1674: Intro to Computer Vision Attributes Prof. Adriana Kovashka University of Pittsburgh November 2, 2016 Plan for today What are attributes and why are they useful? (paper 1) Attributes for zero-shot
More informationHierarchical Image-Region Labeling via Structured Learning
Hierarchical Image-Region Labeling via Structured Learning Julian McAuley, Teo de Campos, Gabriela Csurka, Florent Perronin XRCE September 14, 2009 McAuley et al (XRCE) Hierarchical Image-Region Labeling
More informationVisual Query Suggestion
Visual Query Suggestion Zheng-Jun Zha, Linjun Yang, Tao Mei, Meng Wang, Zengfu Wang University of Science and Technology of China Textual Visual Query Suggestion Microsoft Research Asia Motivation Framework
More informationVideo Semantic Indexing using Object Detection-Derived Features
Video Semantic Indexing using Object Detection-Derived Features Kotaro Kikuchi, Kazuya Ueki, Tetsuji Ogawa, and Tetsunori Kobayashi Dept. of Computer Science, Waseda University, Japan Abstract A new feature
More informationObject Recognition II
Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based
More informationRongrong Ji (Columbia), Yu Gang Jiang (Fudan), June, 2012
Supervised Hashing with Kernels Wei Liu (Columbia Columbia), Jun Wang (IBM IBM), Rongrong Ji (Columbia), Yu Gang Jiang (Fudan), and Shih Fu Chang (Columbia Columbia) June, 2012 Outline Motivations Problem
More informationEasy Samples First: Self-paced Reranking for Zero-Example Multimedia Search
Easy Samples First: Self-paced Reranking for Zero-Example Multimedia Search Lu Jiang 1, Deyu Meng 2, Teruko Mitamura 1, Alexander G. Hauptmann 1 1 School of Computer Science, Carnegie Mellon University
More informationOptimal Video Adaptation and Skimming Using a Utility-Based Framework
Optimal Video Adaptation and Skimming Using a Utility-Based Framework Shih-Fu Chang Digital Video and Multimedia Lab ADVENT University-Industry Consortium Columbia University Sept. 9th 2002 http://www.ee.columbia.edu/dvmm
More informationA Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task
A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task Wei Li, Jinming Min, Gareth J. F. Jones Center for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland
More informationSemantic Pooling for Image Categorization using Multiple Kernel Learning
Semantic Pooling for Image Categorization using Multiple Kernel Learning Thibaut Durand (1,2), Nicolas Thome (1), Matthieu Cord (1), David Picard (2) (1) Sorbonne Universités, UPMC Univ Paris 06, UMR 7606,
More informationRushes Video Segmentation Using Semantic Features
Rushes Video Segmentation Using Semantic Features Athina Pappa, Vasileios Chasanis, and Antonis Ioannidis Department of Computer Science and Engineering, University of Ioannina, GR 45110, Ioannina, Greece
More informationImage Similarity Based on Direct Human Judgment
Image Similarity Based on Direct Human Judgment Raul Guerra Dept. of Computer Science University of Maryland College Park, MD 20742 rguerra@cs.umd.edu Abstract Recently the field of human-based computation
More informationExperimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and Search
Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and Search Chong-Wah Ngo, Yu-Gang Jiang, Xiaoyong Wei Feng Wang, Wanlei Zhao, Hung-Khoon Tan and Xiao
More informationImage Annotation Using Multi-label Correlated Green s Function
Image Annotation Using Multi-label Correlated Green s Function Hua Wang, Heng Huang, Chris Ding Computer Science and Engineering Department University of Texas at Arlington huawang27@mavs.uta.edu {heng,
More informationLEARNING OBJECT SEGMENTATION USING A MULTI NETWORK SEGMENT CLASSIFICATION APPROACH
Università degli Studi dell'insubria Varese, Italy LEARNING OBJECT SEGMENTATION USING A MULTI NETWORK SEGMENT CLASSIFICATION APPROACH Simone Albertini Ignazio Gallo, Angelo Nodari, Marco Vanetti albertini.simone@gmail.com
More informationSpatial Localization and Detection. Lecture 8-1
Lecture 8: Spatial Localization and Detection Lecture 8-1 Administrative - Project Proposals were due on Saturday Homework 2 due Friday 2/5 Homework 1 grades out this week Midterm will be in-class on Wednesday
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:
Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,
More informationLearning Representations for Visual Object Class Recognition
Learning Representations for Visual Object Class Recognition Marcin Marszałek Cordelia Schmid Hedi Harzallah Joost van de Weijer LEAR, INRIA Grenoble, Rhône-Alpes, France October 15th, 2007 Bag-of-Features
More informationConstrained Convolutional Neural Networks for Weakly Supervised Segmentation. Deepak Pathak, Philipp Krähenbühl and Trevor Darrell
Constrained Convolutional Neural Networks for Weakly Supervised Segmentation Deepak Pathak, Philipp Krähenbühl and Trevor Darrell 1 Multi-class Image Segmentation Assign a class label to each pixel in
More informationWeakly Supervised Object Recognition with Convolutional Neural Networks
GDR-ISIS, Paris March 20, 2015 Weakly Supervised Object Recognition with Convolutional Neural Networks Ivan Laptev ivan.laptev@inria.fr WILLOW, INRIA/ENS/CNRS, Paris Joint work with: Maxime Oquab Leon
More informationFeature-Fused SSD: Fast Detection for Small Objects
Feature-Fused SSD: Fast Detection for Small Objects Guimei Cao, Xuemei Xie, Wenzhe Yang, Quan Liao, Guangming Shi, Jinjian Wu School of Electronic Engineering, Xidian University, China xmxie@mail.xidian.edu.cn
More informationImage Processing (IP)
Image Processing Pattern Recognition Computer Vision Xiaojun Qi Utah State University Image Processing (IP) Manipulate and analyze digital images (pictorial information) by computer. Applications: The
More informationDeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs Zhipeng Yan, Moyuan Huang, Hao Jiang 5/1/2017 1 Outline Background semantic segmentation Objective,
More informationXian-Sheng Hua ( 华先胜 )
Machine Learning and Applications Workshop 2009 Xian-Sheng Hua ( 华先胜 ) Lead Researcher, Media Computing Group, Microsoft Research Asia Nanjing China Nov 7-8, 2009 Introduce research problems in multimedia
More informationOptimizing Intersection-Over-Union in Deep Neural Networks for Image Segmentation
Optimizing Intersection-Over-Union in Deep Neural Networks for Image Segmentation Md Atiqur Rahman and Yang Wang Department of Computer Science, University of Manitoba, Canada {atique, ywang}@cs.umanitoba.ca
More informationClass 5: Attributes and Semantic Features
Class 5: Attributes and Semantic Features Rogerio Feris, Feb 21, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch Project
More informationSemantic Context Transfer across Heterogeneous Sources for Domain Adaptive Video Search
Semantic Context Transfer across Heterogeneous Sources for Domain Adaptive Video Search Yu-Gang Jiang, Chong-Wah Ngo, Shih-Fu Chang Department of Computer Science, City University of Hong Kong Department
More informationAN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH
AN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH Sai Tejaswi Dasari #1 and G K Kishore Babu *2 # Student,Cse, CIET, Lam,Guntur, India * Assistant Professort,Cse, CIET, Lam,Guntur, India Abstract-
More informationEECS 442 Computer vision Databases for object recognition and beyond Segments of this lectures are courtesy of Prof F. Li, R. Fergus and A.
EECS 442 Computer vision Databases for object recognition and beyond Segments of this lectures are courtesy of Prof F. Li, R. Fergus and A. Zisserman Why are we talking about datasets? This is computer
More informationTowards Building Large-Scale Multimodal Knowledge Bases
Towards Building Large-Scale Multimodal Knowledge Bases Dihong Gong Advised by Dr. Daisy Zhe Wang Knowledge Itself is Power --Francis Bacon Analytics Social Robotics Knowledge Graph Nodes Represent entities
More informationEEG analysis for implicit tagging of video data
EEG analysis for implicit tagging of video data Sander Koelstra, Christian Mühl, Ioannis Patras MultiMedia and Vision Group, Queen Mary, University of London, UK Human Media Interaction, University of
More informationMultimedia Data Management M
ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Multimedia Data Management M Second cycle degree programme (LM) in Computer Engineering University of Bologna Semantic Multimedia Data Annotation Home page:
More informationDeep learning for object detection. Slides from Svetlana Lazebnik and many others
Deep learning for object detection Slides from Svetlana Lazebnik and many others Recent developments in object detection 80% PASCAL VOC mean0average0precision0(map) 70% 60% 50% 40% 30% 20% 10% Before deep
More informationSingle-Shot Refinement Neural Network for Object Detection -Supplementary Material-
Single-Shot Refinement Neural Network for Object Detection -Supplementary Material- Shifeng Zhang 1,2, Longyin Wen 3, Xiao Bian 3, Zhen Lei 1,2, Stan Z. Li 4,1,2 1 CBSR & NLPR, Institute of Automation,
More informationDeformable Part Models
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones
More informationSegmenting Objects in Weakly Labeled Videos
Segmenting Objects in Weakly Labeled Videos Mrigank Rochan, Shafin Rahman, Neil D.B. Bruce, Yang Wang Department of Computer Science University of Manitoba Winnipeg, Canada {mrochan, shafin12, bruce, ywang}@cs.umanitoba.ca
More informationSupplementary Material: Pixelwise Instance Segmentation with a Dynamically Instantiated Network
Supplementary Material: Pixelwise Instance Segmentation with a Dynamically Instantiated Network Anurag Arnab and Philip H.S. Torr University of Oxford {anurag.arnab, philip.torr}@eng.ox.ac.uk 1. Introduction
More informationStructured Prediction using Convolutional Neural Networks
Overview Structured Prediction using Convolutional Neural Networks Bohyung Han bhhan@postech.ac.kr Computer Vision Lab. Convolutional Neural Networks (CNNs) Structured predictions for low level computer
More informationAn Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b
5th International Conference on Advanced Materials and Computer Science (ICAMCS 2016) An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b 1
More informationMultimedia Information Retrieval
Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis Multimedia Information Retrieval 1. What are
More informationLearning Spatial Context: Using Stuff to Find Things
Learning Spatial Context: Using Stuff to Find Things Wei-Cheng Su Motivation 2 Leverage contextual information to enhance detection Some context objects are non-rigid and are more naturally classified
More informationThe LICHEN Framework: A new toolbox for the exploitation of corpora
The LICHEN Framework: A new toolbox for the exploitation of corpora Lisa Lena Opas-Hänninen, Tapio Seppänen, Ilkka Juuso and Matti Hosio (University of Oulu, Finland) Background cultural inheritance is
More informationCOSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor
COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality
More informationInstances on a Budget
Retrieving Similar or Informative Instances on a Budget Kristen Grauman Dept. of Computer Science University of Texas at Austin Work with Sudheendra Vijayanarasimham Work with Sudheendra Vijayanarasimham,
More informationKeywords TBIR, Tag completion, Matrix completion, Image annotation, Image retrieval
Volume 4, Issue 9, September 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Tag Completion
More informationList of Accepted Papers for ICVGIP 2018
List of Accepted Papers for ICVGIP 2018 Paper ID ACM Article Title 3 1 PredGAN - A deep multi-scale video prediction framework for anomaly detection in videos 7 2 Handwritten Essay Grading on Mobiles using
More informationMULTIVIEW RANK LEARNING FOR MULTIMEDIA KNOWN ITEM SEARCH
MULTIVIEW RANK LEARNING FOR MULTIMEDIA KNOWN ITEM SEARCH by David L Etter A Dissertation Submitted to the Graduate Faculty of George Mason University In Partial fulfillment of The Requirements for the
More informationEE 6882 Statistical Methods for Video Indexing and Analysis
EE 6882 Statistical Methods for Video Indexing and Analysis Fall 2004 Prof. ShihFu Chang http://www.ee.columbia.edu/~sfchang Lecture 1 part A (9/8/04) 1 EE E6882 SVIA Lecture #1 Part I Introduction Course
More informationObject recognition (part 2)
Object recognition (part 2) CSE P 576 Larry Zitnick (larryz@microsoft.com) 1 2 3 Support Vector Machines Modified from the slides by Dr. Andrew W. Moore http://www.cs.cmu.edu/~awm/tutorials Linear Classifiers
More informationA Comparison of l 1 Norm and l 2 Norm Multiple Kernel SVMs in Image and Video Classification
A Comparison of l 1 Norm and l 2 Norm Multiple Kernel SVMs in Image and Video Classification Fei Yan Krystian Mikolajczyk Josef Kittler Muhammad Tahir Centre for Vision, Speech and Signal Processing University
More information2282 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 21, NO. 4, APRIL 2012
2282 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 21, NO. 4, APRIL 2012 Task-Dependent Visual-Codebook Compression Rongrong Ji, Member, IEEE, Hongxun Yao, Member, IEEE, Wei Liu, Student Member, IEEE, Xiaoshuai
More informationTowards Weakly- and Semi- Supervised Object Localization and Semantic Segmentation
Towards Weakly- and Semi- Supervised Object Localization and Semantic Segmentation Lecturer: Yunchao Wei Image Formation and Processing (IFP) Group University of Illinois at Urbanahttps://weiyc.githu Champaign
More informationSemi-supervised Data Representation via Affinity Graph Learning
1 Semi-supervised Data Representation via Affinity Graph Learning Weiya Ren 1 1 College of Information System and Management, National University of Defense Technology, Changsha, Hunan, P.R China, 410073
More informationTowards Large-Scale Semantic Representations for Actionable Exploitation. Prof. Trevor Darrell UC Berkeley
Towards Large-Scale Semantic Representations for Actionable Exploitation Prof. Trevor Darrell UC Berkeley traditional surveillance sensor emerging crowd sensor Desired capabilities: spatio-temporal reconstruction
More informationHigh-level Event Recognition in Internet Videos
High-level Event Recognition in Internet Videos Yu-Gang Jiang School of Computer Science Fudan University, Shanghai, China ygj@fudan.edu.cn Joint work with Guangnan Ye 1, Subh Bhattacharya 2, Dan Ellis
More informationSemantic Video Indexing
Semantic Video Indexing T-61.6030 Multimedia Retrieval Stevan Keraudy stevan.keraudy@tkk.fi Helsinki University of Technology March 14, 2008 What is it? Query by keyword or tag is common Semantic Video
More informationSemantic Image Clustering Using Object Relation Network
Semantic Image Clustering Using Object Relation Network Na Chen and Viktor K. Prasanna University of Southern California Abstract. This paper presents a novel method to organize a collection of images
More informationOptimizing Object Detection:
Lecture 10: Optimizing Object Detection: A Case Study of R-CNN, Fast R-CNN, and Faster R-CNN Visual Computing Systems Today s task: object detection Image classification: what is the object in this image?
More informationExperiments of Image Retrieval Using Weak Attributes
Columbia University Computer Science Department Technical Report # CUCS 005-12 (2012) Experiments of Image Retrieval Using Weak Attributes Felix X. Yu, Rongrong Ji, Ming-Hen Tsai, Guangnan Ye, Shih-Fu
More informationConsumer Video Understanding
Consumer Video Understanding A Benchmark Database + An Evaluation of Human & Machine Performance Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel Ellis, Alexander C. Loui Columbia University Kodak Research
More informationYUSUF AYTAR B.S. Ege University
SEMANTIC VIDEO RETRIEVAL USING HIGH LEVEL CONTEXT by YUSUF AYTAR B.S. Ege University A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in the School of Electrical
More informationR-CNN Based Object Detection and Classification Methods for Complex Sceneries
R-CNN Based Object Detection and Classification Methods for Complex Sceneries Jaswinder Singh #1, Dr. B.K. Sharma *2 # Research scholar, Dr. A.P.J Abdul Kalam Technical University, Lucknow, India. * Principal
More informationA Reranking Approach for Context-based Concept Fusion in Video Indexing and Retrieval
A Reranking Approach for Context-based Concept Fusion in Video Indexing and Retrieval Lyndon S. Kennedy Dept. of Electrical Engineering Columbia University New York, NY 10027 lyndon@ee.columbia.edu Shih-Fu
More informationStealing Objects With Computer Vision
Stealing Objects With Computer Vision Learning Based Methods in Vision Analysis Project #4: Mar 4, 2009 Presented by: Brian C. Becker Carnegie Mellon University Motivation Goal: Detect objects in the photo
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationDetection III: Analyzing and Debugging Detection Methods
CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can
More informationDeep Learning for Visual Manipulation and Synthesis
Deep Learning for Visual Manipulation and Synthesis Jun-Yan Zhu 朱俊彦 UC Berkeley 2017/01/11 @ VALSE What is visual manipulation? Image Editing Program input photo User Input result Desired output: stay
More informationLearning Visual Semantics: Models, Massive Computation, and Innovative Applications. Tutorial at CVPR 2014 June 23rd, 1:00pm-5:00pm, Columbus, OH
Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Tutorial at CVPR 2014 June 23rd, 1:00pm-5:00pm, Columbus, OH Introduction Instructors: Shih-Fu Chang John Smith Rogerio
More informationarxiv: v1 [cs.mm] 12 Jan 2016
Learning Subclass Representations for Visually-varied Image Classification Xinchao Li, Peng Xu, Yue Shi, Martha Larson, Alan Hanjalic Multimedia Information Retrieval Lab, Delft University of Technology
More informationInformation Technology for Documentary Data Representation
ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Information Technology for Documentary Data Representation Laurea Magistrale in Scienze del Libro e del Documento University of Bologna Multimedia Information
More informationShifting from Naming to Describing: Semantic Attribute Models. Rogerio Feris, June 2014
Shifting from Naming to Describing: Semantic Attribute Models Rogerio Feris, June 2014 Recap Large-Scale Semantic Modeling Feature Coding and Pooling Low-Level Feature Extraction Training Data Slide credit:
More informationREGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION
REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION Kingsley Kuan 1, Gaurav Manek 1, Jie Lin 1, Yuan Fang 1, Vijay Chandrasekhar 1,2 Institute for Infocomm Research, A*STAR, Singapore 1 Nanyang Technological
More information