EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari
|
|
- Job Dennis
- 5 years ago
- Views:
Transcription
1 EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION Ing. Lorenzo Seidenari
2 What is an Event? Dictionary.com definition: something that occurs in a certain place during a particular interval of time. Examples from various domains: Sports: shot on goal Surveillance: enter in car Movies: drink
3 Importance of Human Actions Most videos recorded and downloadable from the web contain people; the semantic is therefore defined by people behavior. Third generation video-surveillance systems benefit from automatic interpretation of human actions and behaviors. Definition 1: physical body motion. Definition 2: interaction with environment (objects or people) on a specific purpose.
4 Human action recognition challenges Actor appearance variation. Gender, clothing body posture and size. Scale, illumination and background change as in object categorization. Different ways of executing the same action. This results in limbs trajectory and speed change. Semantically different but perceptually similar actions (e.g. running and jogging).
5 Are actions space-time objects? We already know how to detect instances of object categories in static images. How do we take advantage of time to describe dynamic concepts (i.e. human actions)? time time time time
6 Framework Overview: Same three steps of object categorization (feature extraction, dictionary formation, classification) Features detector and descriptor here differ! Interest points extraction Bag-of-features Visual Dictionary running walking jogging handwaving handclapping boxing SVM classifier Bag-of-words
7 Descriptor combination strategy Descriptor Visual Dictionary Action Representation ST Patch 3DGrad_HoF BoW Visual Dictionaries Descriptors Action Representation ST Patch 3DGrad 3DGrad + HoF BoW HoF
8 Effective codebooks: Spatio-temporal descriptors span an extremely high-dimensional feature space Our dense multi-scale sampling produce a non-uniform feature space. K-means clusters are attracted towards densely populated regions. Less dense zone are not represented correctly. Radius-based clustering [Jurie ICCV05] exploits mode finding to place cluster centers. More accurate coding of the feature space. Note: to reduce the uncertainty we perform soft assignment.
9 Results: codebook performance Words are sorted by frequency and added incrementally to dictionary. KTH codebook size Non-informative high-frequency terms. Informative mid-frequency terms.
10 Results: codebook performance Words are sorted by frequency and added incrementally to dictionary. Weizmann codebook size Non-informative high-frequency terms. Informative mid-frequency terms.
11 Results: dataset The approach is tested on two standard datasets Weizmann dataset is considered less challenging for the reduce variability of shooting conditions and amount of actors. KTH 25 actors 6 actions 4 viewing conditions 2931 clips Weizmann 9 actors 10 actions 1 viewing conditions 93 clips
12 Results: comparison with the state of the art We compare our results by using the same methodology to measure the Improvement w.r.t. to the current state-of-the-art Method KTH Weizmann Our method Laptev et al. - HoG ['08] Laptev et al. - HoF [ 08] Dollár et al. [ 05] Wong e Cipolla [ 07] Scovanner et al. [ 07] Niebles et al. [ 08] Liu et al. [ 08] Kläser et al. [ 08] Willems et al. [ 08]
13 Real video footage We test our detector on a sequence taken in a garage. A sliding temporal window is used to perform the segmentation. walking running
14 Recognizing generic video events Online video search and video indexing Events characterized by an evolution of scenes, objects and actions over time. 56 events are defined in LSCOM. Event examples in the news domain: Airplane Flying Car Exiting
15 Event Recognition: Object Tracking A possible approach, which exploit object recognition is to detect interest object, track over time, and model spatio-temporal dynamics. Some events are well defined by the presence and motion of an object. Object Detection & Localization Tracking Inference Airplane Landing Hard to detect events without explicit object motion, such as Riot?
16 Event recognition: exploit dynamic concept evolution Global low level feature are extracted such as edge histograms, Gabor texture descriptors and grid color moments. 108 concent detectors are trained on this features. Each frame is represented by 108 concept scores. Shots similarity is evaluated by computing Earth Mover s Distance. feature extraction concept detectors EMD distance Plug the EMD into a rbf kernel and use it in a SVM to predict category
17 Content Representation: Mid-level Semantic Concept Scores Image Database Concept Detectors Train detectors on low-level features Mid-level semantic concept feature is more robust Columbia developed and released 374 semantic concept detectors. Detectors are available online
18 Earth Mover s Distance (EMD): Approach Supplier P is with a given amount of goods Receiver Q is with a given limited capacity 1 1/2 d ij 1/2 Weights: Solved by linear programming Temporal shift: a frame at the beginning of P can be mapped to a frame at the end of Q Scale variations: a frame from P can be mapped to multiple frames in Q
19 Experiments: Keyframe based feature performance Dataset: TRECVID2005 Evaluation Metric: Average Precision 1,0 0,8 0,6 0,4 0,2 concept scores edge direction histogram Gabor texture color moment 0,0 Car Crash Protest Greeting Car Exiting Combat Marching Riot Running Shooting Walking (average)
20 Experiments: EMD concept performance
21 References On space-time interest points, Laptev, I. IJCV 2005 Behavior recognition via sparse spatio-temporal features, Dollar, P., Rabaud, V., Cottrel, G. and Belongie, S. ICCV VS-PETS 2005 Effective Codebooks for Human Action Recognition, Ballan, L., Bertini, M., Del Bimbo, A.,Seidenari, L. and Serra, G. ICCV VOEC 2009 Video Event Recognition using kernel methods with multilevel temporal alignement, Dong Xu, Shih-Fu Chang, TPAMI 2008
Content-based image and video analysis. Event Recognition
Content-based image and video analysis Event Recognition 21.06.2010 What is an event? a thing that happens or takes place, Oxford Dictionary Examples: Human gestures Human actions (running, drinking, etc.)
More informationEvaluation of Local Space-time Descriptors based on Cuboid Detector in Human Action Recognition
International Journal of Innovation and Applied Studies ISSN 2028-9324 Vol. 9 No. 4 Dec. 2014, pp. 1708-1717 2014 Innovative Space of Scientific Research Journals http://www.ijias.issr-journals.org/ Evaluation
More informationLecture 18: Human Motion Recognition
Lecture 18: Human Motion Recognition Professor Fei Fei Li Stanford Vision Lab 1 What we will learn today? Introduction Motion classification using template matching Motion classification i using spatio
More informationIMPROVING SPATIO-TEMPORAL FEATURE EXTRACTION TECHNIQUES AND THEIR APPLICATIONS IN ACTION CLASSIFICATION. Maral Mesmakhosroshahi, Joohee Kim
IMPROVING SPATIO-TEMPORAL FEATURE EXTRACTION TECHNIQUES AND THEIR APPLICATIONS IN ACTION CLASSIFICATION Maral Mesmakhosroshahi, Joohee Kim Department of Electrical and Computer Engineering Illinois Institute
More informationExtracting Spatio-temporal Local Features Considering Consecutiveness of Motions
Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions Akitsugu Noguchi and Keiji Yanai Department of Computer Science, The University of Electro-Communications, 1-5-1 Chofugaoka,
More informationAction recognition in videos
Action recognition in videos Cordelia Schmid INRIA Grenoble Joint work with V. Ferrari, A. Gaidon, Z. Harchaoui, A. Klaeser, A. Prest, H. Wang Action recognition - goal Short actions, i.e. drinking, sit
More informationUnderstanding Sport Activities from Correspondences of Clustered Trajectories
Understanding Sport Activities from Correspondences of Clustered Trajectories Francesco Turchini, Lorenzo Seidenari, Alberto Del Bimbo http://www.micc.unifi.it/vim Introduction The availability of multimedia
More informationAn evaluation of local action descriptors for human action classification in the presence of occlusion
An evaluation of local action descriptors for human action classification in the presence of occlusion Iveel Jargalsaikhan, Cem Direkoglu, Suzanne Little, and Noel E. O Connor INSIGHT Centre for Data Analytics,
More informationVision and Image Processing Lab., CRV Tutorial day- May 30, 2010 Ottawa, Canada
Spatio-Temporal Salient Features Amir H. Shabani Vision and Image Processing Lab., University of Waterloo, ON CRV Tutorial day- May 30, 2010 Ottawa, Canada 1 Applications Automated surveillance for scene
More informationSpatio-temporal Feature Classifier
Spatio-temporal Feature Classifier Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2015, 7, 1-7 1 Open Access Yun Wang 1,* and Suxing Liu 2 1 School
More informationOBJECT CATEGORIZATION
OBJECT CATEGORIZATION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it Slides: Ing. Lamberto Ballan November 18th, 2009 What is an Object? Merriam-Webster Definition: Something material that may be
More informationMoSIFT: Recognizing Human Actions in Surveillance Videos
MoSIFT: Recognizing Human Actions in Surveillance Videos CMU-CS-09-161 Ming-yu Chen and Alex Hauptmann School of Computer Science Carnegie Mellon University Pittsburgh PA 15213 September 24, 2009 Copyright
More informationEvaluation of local descriptors for action recognition in videos
Evaluation of local descriptors for action recognition in videos Piotr Bilinski and Francois Bremond INRIA Sophia Antipolis - PULSAR group 2004 route des Lucioles - BP 93 06902 Sophia Antipolis Cedex,
More informationAction Recognition & Categories via Spatial-Temporal Features
Action Recognition & Categories via Spatial-Temporal Features 华俊豪, 11331007 huajh7@gmail.com 2014/4/9 Talk at Image & Video Analysis taught by Huimin Yu. Outline Introduction Frameworks Feature extraction
More informationPerson Action Recognition/Detection
Person Action Recognition/Detection Fabrício Ceschin Visão Computacional Prof. David Menotti Departamento de Informática - Universidade Federal do Paraná 1 In object recognition: is there a chair in the
More informationA Motion Descriptor Based on Statistics of Optical Flow Orientations for Action Classification in Video-Surveillance
A Motion Descriptor Based on Statistics of Optical Flow Orientations for Action Classification in Video-Surveillance Fabio Martínez, Antoine Manzanera, Eduardo Romero To cite this version: Fabio Martínez,
More informationColumbia University High-Level Feature Detection: Parts-based Concept Detectors
TRECVID 2005 Workshop Columbia University High-Level Feature Detection: Parts-based Concept Detectors Dong-Qing Zhang, Shih-Fu Chang, Winston Hsu, Lexin Xie, Eric Zavesky Digital Video and Multimedia Lab
More informationClass 9 Action Recognition
Class 9 Action Recognition Liangliang Cao, April 4, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch Visual Recognition
More informationAction Recognition in Video by Sparse Representation on Covariance Manifolds of Silhouette Tunnels
Action Recognition in Video by Sparse Representation on Covariance Manifolds of Silhouette Tunnels Kai Guo, Prakash Ishwar, and Janusz Konrad Department of Electrical & Computer Engineering Motivation
More informationObject and Action Detection from a Single Example
Object and Action Detection from a Single Example Peyman Milanfar* EE Department University of California, Santa Cruz *Joint work with Hae Jong Seo AFOSR Program Review, June 4-5, 29 Take a look at this:
More informationComputer Vision. Exercise Session 10 Image Categorization
Computer Vision Exercise Session 10 Image Categorization Object Categorization Task Description Given a small number of training images of a category, recognize a-priori unknown instances of that category
More informationFirst-Person Animal Activity Recognition from Egocentric Videos
First-Person Animal Activity Recognition from Egocentric Videos Yumi Iwashita Asamichi Takamine Ryo Kurazume School of Information Science and Electrical Engineering Kyushu University, Fukuoka, Japan yumi@ieee.org
More informationVisual Event Recognition in News Video using Kernel Methods with Multi-Level Temporal Alignment
Visual Event Recognition in News Video using Kernel Methods with Multi-Level Temporal Alignment Dong Xu and Shih-Fu Chang Department of Electrical Engineering Columbia University, New York, NY 007 {dongxu,
More informationA Spatio-Temporal Descriptor Based on 3D-Gradients
A Spatio-Temporal Descriptor Based on 3D-Gradients Alexander Kläser Marcin Marszałek Cordelia Schmid INRIA Grenoble, LEAR, LJK {alexander.klaser,marcin.marszalek,cordelia.schmid}@inrialpes.fr Abstract
More informationTemporal Poselets for Collective Activity Detection and Recognition
Temporal Poselets for Collective Activity Detection and Recognition Moin Nabi Alessio Del Bue Vittorio Murino Pattern Analysis and Computer Vision (PAVIS) Istituto Italiano di Tecnologia (IIT) Via Morego
More informationCS229: Action Recognition in Tennis
CS229: Action Recognition in Tennis Aman Sikka Stanford University Stanford, CA 94305 Rajbir Kataria Stanford University Stanford, CA 94305 asikka@stanford.edu rkataria@stanford.edu 1. Motivation As active
More informationEvent Detection and Recognition for Semantic Annotation of Video
Multimed Tools Appl manuscript No. (will be inserted by the editor) Event Detection and Recognition for Semantic Annotation of Video Lamberto Ballan Marco Bertini Alberto Del Bimbo Lorenzo Seidenari Giuseppe
More informationHuman Focused Action Localization in Video
Human Focused Action Localization in Video Alexander Kläser 1, Marcin Marsza lek 2, Cordelia Schmid 1, and Andrew Zisserman 2 1 INRIA Grenoble, LEAR, LJK {klaser,schmid}@inrialpes.fr 2 Engineering Science,
More informationStatistics of Pairwise Co-occurring Local Spatio-Temporal Features for Human Action Recognition
Statistics of Pairwise Co-occurring Local Spatio-Temporal Features for Human Action Recognition Piotr Bilinski, Francois Bremond To cite this version: Piotr Bilinski, Francois Bremond. Statistics of Pairwise
More informationBag-of-features. Cordelia Schmid
Bag-of-features for category classification Cordelia Schmid Visual search Particular objects and scenes, large databases Category recognition Image classification: assigning a class label to the image
More informationLearning realistic human actions from movies
Learning realistic human actions from movies Ivan Laptev, Marcin Marszalek, Cordelia Schmid, Benjamin Rozenfeld CVPR 2008 Presented by Piotr Mirowski Courant Institute, NYU Advanced Vision class, November
More informationLocal Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study
Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study J. Zhang 1 M. Marszałek 1 S. Lazebnik 2 C. Schmid 1 1 INRIA Rhône-Alpes, LEAR - GRAVIR Montbonnot, France
More informationThe Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System
The Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System Our first participation on the TRECVID workshop A. F. de Araujo 1, F. Silveira 2, H. Lakshman 3, J. Zepeda 2, A. Sheth 2, P. Perez
More informationSampling Strategies for Real-time Action Recognition
2013 IEEE Conference on Computer Vision and Pattern Recognition Sampling Strategies for Real-time Action Recognition Feng Shi, Emil Petriu and Robert Laganière School of Electrical Engineering and Computer
More informationDescriptors for CV. Introduc)on:
Descriptors for CV Content 2014 1.Introduction 2.Histograms 3.HOG 4.LBP 5.Haar Wavelets 6.Video based descriptor 7.How to compare descriptors 8.BoW paradigm 1 2 1 2 Color RGB histogram Introduc)on: Image
More informationMultiple Kernel Learning for Emotion Recognition in the Wild
Multiple Kernel Learning for Emotion Recognition in the Wild Karan Sikka, Karmen Dykstra, Suchitra Sathyanarayana, Gwen Littlewort and Marian S. Bartlett Machine Perception Laboratory UCSD EmotiW Challenge,
More informationLeveraging Textural Features for Recognizing Actions in Low Quality Videos
Leveraging Textural Features for Recognizing Actions in Low Quality Videos Saimunur Rahman, John See, Chiung Ching Ho Centre of Visual Computing, Faculty of Computing and Informatics Multimedia University,
More informationDictionary of gray-level 3D patches for action recognition
Dictionary of gray-level 3D patches for action recognition Stefen Chan Wai Tim, Michele Rombaut, Denis Pellerin To cite this version: Stefen Chan Wai Tim, Michele Rombaut, Denis Pellerin. Dictionary of
More informationPreliminary Local Feature Selection by Support Vector Machine for Bag of Features
Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and
More informationAn Efficient Part-Based Approach to Action Recognition from RGB-D Video with BoW-Pyramid Representation
2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) November 3-7, 2013. Tokyo, Japan An Efficient Part-Based Approach to Action Recognition from RGB-D Video with BoW-Pyramid
More informationPeople Detection and Video Understanding
1 People Detection and Video Understanding Francois BREMOND INRIA Sophia Antipolis STARS team Institut National Recherche Informatique et Automatisme Francois.Bremond@inria.fr http://www-sop.inria.fr/members/francois.bremond/
More informationAction Recognition in Low Quality Videos by Jointly Using Shape, Motion and Texture Features
Action Recognition in Low Quality Videos by Jointly Using Shape, Motion and Texture Features Saimunur Rahman, John See, Chiung Ching Ho Centre of Visual Computing, Faculty of Computing and Informatics
More informationEvaluation of local spatio-temporal features for action recognition
Evaluation of local spatio-temporal features for action recognition Heng Wang, Muhammad Muneeb Ullah, Alexander Klaser, Ivan Laptev, Cordelia Schmid To cite this version: Heng Wang, Muhammad Muneeb Ullah,
More informationREJECTION-BASED CLASSIFICATION FOR ACTION RECOGNITION USING A SPATIO-TEMPORAL DICTIONARY. Stefen Chan Wai Tim, Michele Rombaut, Denis Pellerin
REJECTION-BASED CLASSIFICATION FOR ACTION RECOGNITION USING A SPATIO-TEMPORAL DICTIONARY Stefen Chan Wai Tim, Michele Rombaut, Denis Pellerin Univ. Grenoble Alpes, GIPSA-Lab, F-38000 Grenoble, France ABSTRACT
More informationHuman Action Recognition via Fused Kinematic Structure and Surface Representation
University of Denver Digital Commons @ DU Electronic Theses and Dissertations Graduate Studies 8-1-2013 Human Action Recognition via Fused Kinematic Structure and Surface Representation Salah R. Althloothi
More informationVideo annotation based on adaptive annular spatial partition scheme
Video annotation based on adaptive annular spatial partition scheme Guiguang Ding a), Lu Zhang, and Xiaoxu Li Key Laboratory for Information System Security, Ministry of Education, Tsinghua National Laboratory
More informationGesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation
Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation Lorenzo Baraldi 1, Francesco Paci 2, Giuseppe Serra 1, Luca Benini 2,3, Rita Cucchiara 1 1 Dipartimento di Ingegneria
More informationVideo Event Detection Using Motion Relativity and Feature Selection
IEEE TRANSACTIONS ON MULTIMEDIA, VOL. XX, NO. XX, XX 1 Video Event Detection Using Motion Relativity and Feature Selection Feng Wang, Zhanhu Sun, Yu-Gang Jiang, and Chong-Wah Ngo Abstract Event detection
More informationRecognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)
Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding
More informationLearning Realistic Human Actions from Movies
Learning Realistic Human Actions from Movies Ivan Laptev*, Marcin Marszałek**, Cordelia Schmid**, Benjamin Rozenfeld*** INRIA Rennes, France ** INRIA Grenoble, France *** Bar-Ilan University, Israel Presented
More informationIncremental Action Recognition Using Feature-Tree
Incremental Action Recognition Using Feature-Tree Kishore K Reddy Computer Vision Lab University of Central Florida kreddy@cs.ucf.edu Jingen Liu Computer Vision Lab University of Central Florida liujg@cs.ucf.edu
More informationPart-based and local feature models for generic object recognition
Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza
More informationHuman activity recognition in the semantic simplex of elementary actions
STUDENT, PROF, COLLABORATOR: BMVC AUTHOR GUIDELINES 1 Human activity recognition in the semantic simplex of elementary actions Beaudry Cyrille cyrille.beaudry@univ-lr.fr Péteri Renaud renaud.peteri@univ-lr.fr
More informationLocal Features and Bag of Words Models
10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering
More informationSPATIO-TEMPORAL PYRAMIDAL ACCORDION REPRESENTATION FOR HUMAN ACTION RECOGNITION. Manel Sekma, Mahmoud Mejdoub, Chokri Ben Amar
2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) SPATIO-TEMPORAL PYRAMIDAL ACCORDION REPRESENTATION FOR HUMAN ACTION RECOGNITION Manel Sekma, Mahmoud Mejdoub, Chokri
More informationSpatio-temporal Shape and Flow Correlation for Action Recognition
Spatio-temporal Shape and Flow Correlation for Action Recognition Yan Ke 1, Rahul Sukthankar 2,1, Martial Hebert 1 1 School of Computer Science, Carnegie Mellon; 2 Intel Research Pittsburgh {yke,rahuls,hebert}@cs.cmu.edu
More informationP-CNN: Pose-based CNN Features for Action Recognition. Iman Rezazadeh
P-CNN: Pose-based CNN Features for Action Recognition Iman Rezazadeh Introduction automatic understanding of dynamic scenes strong variations of people and scenes in motion and appearance Fine-grained
More informationPattern Recognition Letters
Pattern Recognition Letters 33 (2012) 1188 1195 Contents lists available at SciVerse ScienceDirect Pattern Recognition Letters journal homepage: www.elsevier.com/locate/patrec Motion recognition using
More informationLearning Human Actions with an Adaptive Codebook
Learning Human Actions with an Adaptive Codebook Yu Kong, Xiaoqin Zhang, Weiming Hu and Yunde Jia Beijing Laboratory of Intelligent Information Technology School of Computer Science, Beijing Institute
More informationEigenJoints-based Action Recognition Using Naïve-Bayes-Nearest-Neighbor
EigenJoints-based Action Recognition Using Naïve-Bayes-Nearest-Neighbor Xiaodong Yang and YingLi Tian Department of Electrical Engineering The City College of New York, CUNY {xyang02, ytian}@ccny.cuny.edu
More informationAction Recognition by Dense Trajectories
Action Recognition by Dense Trajectories Heng Wang, Alexander Kläser, Cordelia Schmid, Liu Cheng-Lin To cite this version: Heng Wang, Alexander Kläser, Cordelia Schmid, Liu Cheng-Lin. Action Recognition
More informationAction Recognition Using Hybrid Feature Descriptor and VLAD Video Encoding
Action Recognition Using Hybrid Feature Descriptor and VLAD Video Encoding Dong Xing, Xianzhong Wang, Hongtao Lu Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationBehavior Recognition in Video with Extended Models of Feature Velocity Dynamics
Behavior Recognition in Video with Extended Models of Feature Velocity Dynamics Ross Messing 1 1 Department of Computer Science University of Rochester Rochester, NY, 14627, USA {rmessing, cpal}@cs.rochester.edu
More informationView-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems
View-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems Avinash Ravichandran, Rizwan Chaudhry and René Vidal Center for Imaging Science, Johns Hopkins University, Baltimore, MD 21218,
More informationSupervised learning. y = f(x) function
Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the
More informationTA Section: Problem Set 4
TA Section: Problem Set 4 Outline Discriminative vs. Generative Classifiers Image representation and recognition models Bag of Words Model Part-based Model Constellation Model Pictorial Structures Model
More informationVideo Event Classification using String Kernels
Multimed Tools Appl manuscript No. (will be inserted by the editor) Video Event Classification using String Kernels Lamberto Ballan Marco Bertini Alberto Del Bimbo Giuseppe Serra Received: Apr, 9 / Accepted:
More informationClass 5: Attributes and Semantic Features
Class 5: Attributes and Semantic Features Rogerio Feris, Feb 21, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch Project
More informationAutomatic visual recognition for metro surveillance
Automatic visual recognition for metro surveillance F. Cupillard, M. Thonnat, F. Brémond Orion Research Group, INRIA, Sophia Antipolis, France Abstract We propose in this paper an approach for recognizing
More informationClassifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped
More informationDense Spatio-temporal Features For Non-parametric Anomaly Detection And Localization
Dense Spatio-temporal Features For Non-parametric Anomaly Detection And Localization Lorenzo Seidenari, Marco Bertini, Alberto Del Bimbo Dipartimento di Sistemi e Informatica - University of Florence Viale
More informationAction Recognition with HOG-OF Features
Action Recognition with HOG-OF Features Florian Baumann Institut für Informationsverarbeitung, Leibniz Universität Hannover, {last name}@tnt.uni-hannover.de Abstract. In this paper a simple and efficient
More informationSpatio-Temporal Optical Flow Statistics (STOFS) for Activity Classification
Spatio-Temporal Optical Flow Statistics (STOFS) for Activity Classification Vignesh Jagadeesh University of California Santa Barbara, CA-93106 vignesh@ece.ucsb.edu S. Karthikeyan B.S. Manjunath University
More informationAction Recognition using Randomised Ferns
Action Recognition using Randomised Ferns Olusegun Oshin Andrew Gilbert John Illingworth Richard Bowden Centre for Vision, Speech and Signal Processing, University of Surrey Guildford, Surrey United Kingdom
More informationQMUL-ACTIVA: Person Runs detection for the TRECVID Surveillance Event Detection task
QMUL-ACTIVA: Person Runs detection for the TRECVID Surveillance Event Detection task Fahad Daniyal and Andrea Cavallaro Queen Mary University of London Mile End Road, London E1 4NS (United Kingdom) {fahad.daniyal,andrea.cavallaro}@eecs.qmul.ac.uk
More informationLarge-scale Video Classification with Convolutional Neural Networks
Large-scale Video Classification with Convolutional Neural Networks Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei Note: Slide content mostly from : Bay Area
More informationPatch Descriptors. CSE 455 Linda Shapiro
Patch Descriptors CSE 455 Linda Shapiro How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar
More informationHuman Action Recognition Based on Oriented Motion Salient Regions
Human Action Recognition Based on Oriented Motion Salient Regions Baoxin Wu 1, Shuang Yang 1, Chunfeng Yuan 1, Weiming Hu 1, and Fangshi Wang 2 1 NLPR, Institute of Automation, Chinese Academy of Sciences,
More informationSparse coding for image classification
Sparse coding for image classification Columbia University Electrical Engineering: Kun Rong(kr2496@columbia.edu) Yongzhou Xiang(yx2211@columbia.edu) Yin Cui(yc2776@columbia.edu) Outline Background Introduction
More informationMotion Interchange Patterns for Action Recognition in Unconstrained Videos
Motion Interchange Patterns for Action Recognition in Unconstrained Videos Orit Kliper-Gross, Yaron Gurovich, Tal Hassner, Lior Wolf Weizmann Institute of Science The Open University of Israel Tel Aviv
More informationA Probabilistic Framework for Recognizing Similar Actions using Spatio-Temporal Features
A Probabilistic Framework for Recognizing Similar Actions using Spatio-Temporal Features Alonso Patron-Perez, Ian Reid Department of Engineering Science, University of Oxford OX1 3PJ, Oxford, UK {alonso,ian}@robots.ox.ac.uk
More informationFast Realistic Multi-Action Recognition using Mined Dense Spatio-temporal Features
Fast Realistic Multi-Action Recognition using Mined Dense Spatio-temporal Features Andrew Gilbert, John Illingworth and Richard Bowden CVSSP, University of Surrey, Guildford, Surrey GU2 7XH United Kingdom
More informationCombined Shape Analysis of Human Poses and Motion Units for Action Segmentation and Recognition
Combined Shape Analysis of Human Poses and Motion Units for Action Segmentation and Recognition Maxime Devanne 1,2, Hazem Wannous 1, Stefano Berretti 2, Pietro Pala 2, Mohamed Daoudi 1, and Alberto Del
More information88 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 1, FEBRUARY 2012
88 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 1, FEBRUARY 2012 Semantic Model Vectors for Complex Video Event Recognition Michele Merler, Student Member, IEEE, Bert Huang, Lexing Xie, Senior Member,
More informationMinimizing hallucination in Histogram of Oriented Gradients
Minimizing hallucination in Histogram of Oriented Gradients Javier Ortiz Sławomir Bąk Michał Koperski François Brémond INRIA Sophia Antipolis, STARS group 2004, route des Lucioles, BP93 06902 Sophia Antipolis
More informationPreviously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011
Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition
More informationHuman Activity Recognition Using a Dynamic Texture Based Method
Human Activity Recognition Using a Dynamic Texture Based Method Vili Kellokumpu, Guoying Zhao and Matti Pietikäinen Machine Vision Group University of Oulu, P.O. Box 4500, Finland {kello,gyzhao,mkp}@ee.oulu.fi
More informationRushes Video Segmentation Using Semantic Features
Rushes Video Segmentation Using Semantic Features Athina Pappa, Vasileios Chasanis, and Antonis Ioannidis Department of Computer Science and Engineering, University of Ioannina, GR 45110, Ioannina, Greece
More informationLocal Features based Object Categories and Object Instances Recognition
Local Features based Object Categories and Object Instances Recognition Eric Nowak Ph.D. thesis defense 17th of March, 2008 1 Thesis in Computer Vision Computer vision is the science and technology of
More informationString distance for automatic image classification
String distance for automatic image classification Nguyen Hong Thinh*, Le Vu Ha*, Barat Cecile** and Ducottet Christophe** *University of Engineering and Technology, Vietnam National University of HaNoi,
More informationApplication of 3D-Wavelet Statistics to Video Analysis
Application of 3D-Wavelet Statistics to Video Analysis M. Omidyeganeh¹ ², S. Ghaemmaghami¹ ³, S. Shirmohammadi ¹Electrical Engineering Department, ²Advanced Information & Communication Technology Center
More informationSpace-Time Shapelets for Action Recognition
Space-Time Shapelets for Action Recognition Dhruv Batra 1 Tsuhan Chen 1 Rahul Sukthankar 2,1 batradhruv@cmu.edu tsuhan@cmu.edu rahuls@cs.cmu.edu 1 Carnegie Mellon University 2 Intel Research Pittsburgh
More informationChapter 2 Action Representation
Chapter 2 Action Representation Abstract In this chapter, various action recognition issues are covered in a concise manner. Various approaches are presented here. In Chap. 1, nomenclatures, various aspects
More informationLeveraging Textural Features for Recognizing Actions in Low Quality Videos
Leveraging Textural Features for Recognizing Actions in Low Quality Videos Saimunur Rahman 1, John See 2, and Chiung Ching Ho 3 Centre of Visual Computing, Faculty of Computing and Informatics Multimedia
More informationCS6670: Computer Vision
CS6670: Computer Vision Noah Snavely Lecture 16: Bag-of-words models Object Bag of words Announcements Project 3: Eigenfaces due Wednesday, November 11 at 11:59pm solo project Final project presentations:
More informationBag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013
CS4670 / 5670: Computer Vision Noah Snavely Bag-of-words models Object Bag of words Bag of Words Models Adapted from slides by Rob Fergus and Svetlana Lazebnik 1 Object Bag of words Origin 1: Texture Recognition
More informationBag of Optical Flow Volumes for Image Sequence Recognition 1
RIEMENSCHNEIDER, DONOSER, BISCHOF: BAG OF OPTICAL FLOW VOLUMES 1 Bag of Optical Flow Volumes for Image Sequence Recognition 1 Hayko Riemenschneider http://www.icg.tugraz.at/members/hayko Michael Donoser
More informationMotion Tracking and Event Understanding in Video Sequences
Motion Tracking and Event Understanding in Video Sequences Isaac Cohen Elaine Kang, Jinman Kang Institute for Robotics and Intelligent Systems University of Southern California Los Angeles, CA Objectives!
More information