Understanding Sport Activities from Correspondences of Clustered Trajectories
|
|
- Ann Lindsey
- 5 years ago
- Views:
Transcription
1 Understanding Sport Activities from Correspondences of Clustered Trajectories Francesco Turchini, Lorenzo Seidenari, Alberto Del Bimbo
2 Introduction The availability of multimedia content is continuously growing Sport events are among the most watched tv content driving the pay-perview model for major broadcasters. Many commercial and professional application can be enabled by automatic action recognition in sports: Improved broadcast commentaries through fast similar event retrieval Gameplay analysis for head coaches Automatic game statistics collection
3 State-of-the-Art Few systems actually succeed in performing classification on sport videos without employing additional information Usually trackers need camera calibration and an ad-hoc setup Specific sport knowledge is often used to achieve player identification These requirements make these systems less general and hard to employ Method Sport-Specific Player bbox annotations* Player Team/Identity Player tracking Camera calibration Features Atmosukarto [CVPRW CVSports 2013] Yes No Yes Yes Yes Raw frames Ballan [CBMI 2009] No No No No No SIFT Waltner [ÖAGM 2014] Bialkowski [CVPRW CVSports 2013] No Yes Yes Yes Yes HoG, HoF, SC, RWPC Yes No Yes Yes Yes Raw frames Ours No No No No No Dense Trajectories * for training
4 Main Idea To avoid player detection, tracking and identification a system should be able to make partial correspondences of spatio-temporal patterns Starting from a set of trajectories we would like to decompose the action in order to perform partial correspondences.
5 The Method Video Representation PCA Trajectory Clusters LSC [Cai 15] Trajectory Description [Wang 13] Fisher Encoding [Perronin 12] 1. Video trajectories are clustered using Landmark Based Spectral Clustering (LSC) 2. Trajectories are represented with appearance and motion HoG,HoF and MBH 3. Each cluster is encoded using Fisher Vectors over a Gaussian Mixture Model 4. We end up with Fisher Vectors Ψ(X i ) computed from each cluster X i for each descriptor (HoG, HoF, MBH)
6 The Method In sport footage our method naturally groups trajectories stemming from motion of players or generated by the motion of relevant objects (e.g. ball) This approach allows to make partial correspondences of relevant features without detecting, tracking or recognizing player in the field.
7 Cluster Set Kernel Given the set of extracted motion features X, the clustering step yields a partition such that Given two feature sets X and Y, we define a kernel K exploiting trajectory grouping as follows The max operator allows to put into correspondence the most similar patterns from the compared videos We use a kernel SVM as a classifier
8 Feature Fusion To improve the representation we fuse multiple kernels First we consider multiple features (HoG, HoG, MBHx,MBHy) Second we can fuse kernels computed from different groupings (varying the number of clusters): Baselines: Global Representation Clustering only N =2
9 Results We tested our approach on three public datasets UCF Sports Actions dataset 150 Clips (6s), nine classes: Diving, Golf Swinging, Kicking, Lifting, Horseback Riding, Running, Skating, Swinging, Walking Performance measured with mean average precision MICC-SOCACT4 dataset 100 Clips (7s), four classes: Goal Kick, Throw In, Placed Kick, Shot on Goal Performance measured with mean per-class accuracy Volleyball Activity dataset 903 Clips (2s), seven classes with five volley specific classes: Serve, Reception, Setting, Attack, Block and two more general classes: Stand, Defense/Move Performance measured with mean per-class accuracy
10 Results We have state-of-the art results on the smaller MICC-SOCACT dataset Our Fusion Our Clustering FV Baseline String Kernel [Ballan09] NN + NWD [Ballan09] Our Fusion Our Clustering FV Baseline
11 Our Fusion Our Clustering FV Baseline Results Clustering and Global representation (FV) have complementary behavior, our fusion obtain a 20% improvement over Waltner et al. Our Fusion Clustering FV Baseline Waltner et al. 5 Classes Classes
12 Results We also tested our method on the more generic UCF Sports dataset Our method obtains state-of-the art performance hinting that our approach is also suitable for generic action recognition. Clustering (10 clusters) FV Baseline Karaman et al. [5] Kovashka et al. [10] Klaser [14]
13 Action Saliency Trajectory clustering yields a motion segmentation which identifies various salient patterns We perform relevant cluster mining exploiting the SVM scores variation Given a true positive video feature set Z, we search for the cluster Z i that, if removed, causes the higher classification score drop: We iterate this process in a greedy manner to score all clusters of a video.
14 Action Saliency Correctly classified examples of Service and Setting classes Service Setting Note that in the Setting action, spikers, the middle-blocker and the opposite player running up are localized instead of the setter.
15 Conclusions We have proposed a novel method for activity recognition based on local trajectory grouping and matching This feature grouping helps identifying some mid-level spatio-temporal patterns that are semantically sensible Preliminary results on cluster salience showing localization potential State-of-the-Art on various public benchmarks without: Player tracking and identification Exploiting sport-specific players position in the court Camera calibration
EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari
EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it What is an Event? Dictionary.com definition: something that occurs in a certain place during a particular
More informationAction recognition in videos
Action recognition in videos Cordelia Schmid INRIA Grenoble Joint work with V. Ferrari, A. Gaidon, Z. Harchaoui, A. Klaeser, A. Prest, H. Wang Action recognition - goal Short actions, i.e. drinking, sit
More informationPerson Action Recognition/Detection
Person Action Recognition/Detection Fabrício Ceschin Visão Computacional Prof. David Menotti Departamento de Informática - Universidade Federal do Paraná 1 In object recognition: is there a chair in the
More informationLecture 18: Human Motion Recognition
Lecture 18: Human Motion Recognition Professor Fei Fei Li Stanford Vision Lab 1 What we will learn today? Introduction Motion classification using template matching Motion classification i using spatio
More informationExtracting Spatio-temporal Local Features Considering Consecutiveness of Motions
Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions Akitsugu Noguchi and Keiji Yanai Department of Computer Science, The University of Electro-Communications, 1-5-1 Chofugaoka,
More informationPeople Detection and Video Understanding
1 People Detection and Video Understanding Francois BREMOND INRIA Sophia Antipolis STARS team Institut National Recherche Informatique et Automatisme Francois.Bremond@inria.fr http://www-sop.inria.fr/members/francois.bremond/
More informationAction Classification in Soccer Videos with Long Short-Term Memory Recurrent Neural Networks
Action Classification in Soccer Videos with Long Short-Term Memory Recurrent Neural Networks Moez Baccouche 1,2, Franck Mamalet 1, Christian Wolf 2, Christophe Garcia 1, and Atilla Baskurt 2 1 Orange Labs,
More informationP-CNN: Pose-based CNN Features for Action Recognition. Iman Rezazadeh
P-CNN: Pose-based CNN Features for Action Recognition Iman Rezazadeh Introduction automatic understanding of dynamic scenes strong variations of people and scenes in motion and appearance Fine-grained
More informationAction Recognition using Discriminative Structured Trajectory Groups
2015 IEEE Winter Conference on Applications of Computer Vision Action Recognition using Discriminative Structured Trajectory Groups Indriyati Atmosukarto 1,2, Narendra Ahuja 3, Bernard Ghanem 4 1 Singapore
More informationCS229: Action Recognition in Tennis
CS229: Action Recognition in Tennis Aman Sikka Stanford University Stanford, CA 94305 Rajbir Kataria Stanford University Stanford, CA 94305 asikka@stanford.edu rkataria@stanford.edu 1. Motivation As active
More informationRECOGNIZING HAND-OBJECT INTERACTIONS IN WEARABLE CAMERA VIDEOS. IBM Research - Tokyo The Robotics Institute, Carnegie Mellon University
RECOGNIZING HAND-OBJECT INTERACTIONS IN WEARABLE CAMERA VIDEOS Tatsuya Ishihara Kris M. Kitani Wei-Chiu Ma Hironobu Takagi Chieko Asakawa IBM Research - Tokyo The Robotics Institute, Carnegie Mellon University
More informationCS231N Section. Video Understanding 6/1/2018
CS231N Section Video Understanding 6/1/2018 Outline Background / Motivation / History Video Datasets Models Pre-deep learning CNN + RNN 3D convolution Two-stream What we ve seen in class so far... Image
More informationAutomatic Data Acquisition Based on Abrupt Motion Feature and Spatial Importance for 3D Volleyball Analysis
Automatic Data Acquisition Based on Abrupt Motion Feature and Spatial Importance for 3D Volleyball Analysis 1. Introduction Sports analysis technologies have attracted increasing attention with the hosting
More informationHistogram of Flow and Pyramid Histogram of Visual Words for Action Recognition
Histogram of Flow and Pyramid Histogram of Visual Words for Action Recognition Ethem F. Can and R. Manmatha Department of Computer Science, UMass Amherst Amherst, MA, 01002, USA [efcan, manmatha]@cs.umass.edu
More informationMultiple Kernel Learning for Emotion Recognition in the Wild
Multiple Kernel Learning for Emotion Recognition in the Wild Karan Sikka, Karmen Dykstra, Suchitra Sathyanarayana, Gwen Littlewort and Marian S. Bartlett Machine Perception Laboratory UCSD EmotiW Challenge,
More informationTrademark Matching and Retrieval in Sport Video Databases
Trademark Matching and Retrieval in Sport Video Databases Andrew D. Bagdanov, Lamberto Ballan, Marco Bertini and Alberto Del Bimbo {bagdanov, ballan, bertini, delbimbo}@dsi.unifi.it 9th ACM SIGMM International
More informationMotion analysis for broadcast tennis video considering mutual interaction of players
14-10 MVA2011 IAPR Conference on Machine Vision Applications, June 13-15, 2011, Nara, JAPAN analysis for broadcast tennis video considering mutual interaction of players Naoto Maruyama, Kazuhiro Fukui
More informationon learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015
on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015 Vector visual representation Fixed-size image representation High-dim (100 100,000) Generic, unsupervised: BoW,
More informationAn evaluation of local action descriptors for human action classification in the presence of occlusion
An evaluation of local action descriptors for human action classification in the presence of occlusion Iveel Jargalsaikhan, Cem Direkoglu, Suzanne Little, and Noel E. O Connor INSIGHT Centre for Data Analytics,
More informationDeep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon
Deep Learning For Video Classification Presented by Natalie Carlebach & Gil Sharon Overview Of Presentation Motivation Challenges of video classification Common datasets 4 different methods presented in
More informationIMPROVING SPATIO-TEMPORAL FEATURE EXTRACTION TECHNIQUES AND THEIR APPLICATIONS IN ACTION CLASSIFICATION. Maral Mesmakhosroshahi, Joohee Kim
IMPROVING SPATIO-TEMPORAL FEATURE EXTRACTION TECHNIQUES AND THEIR APPLICATIONS IN ACTION CLASSIFICATION Maral Mesmakhosroshahi, Joohee Kim Department of Electrical and Computer Engineering Illinois Institute
More informationHUMAN action recognition has received significant research
JOURNAL OF L A TEX CLASS FILES, VOL. 6, NO. 1, JANUARY 2007 1 Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling Yu-Gang Jiang, Qi Dai, Wei Liu, Xiangyang Xue, Chong-Wah Ngo Abstract
More informationREJECTION-BASED CLASSIFICATION FOR ACTION RECOGNITION USING A SPATIO-TEMPORAL DICTIONARY. Stefen Chan Wai Tim, Michele Rombaut, Denis Pellerin
REJECTION-BASED CLASSIFICATION FOR ACTION RECOGNITION USING A SPATIO-TEMPORAL DICTIONARY Stefen Chan Wai Tim, Michele Rombaut, Denis Pellerin Univ. Grenoble Alpes, GIPSA-Lab, F-38000 Grenoble, France ABSTRACT
More informationUnsupervised Spectral Dual Assignment Clustering of Human Actions in Context
Unsupervised Spectral Dual Assignment Clustering of Human Actions in Context Simon Jones, Ling Shao Department of Electronic and Electrical Engineering The University of Sheffield, Sheffield, S1 3JD, UK
More informationarxiv: v1 [cs.cv] 29 Apr 2016
Improved Dense Trajectory with Cross Streams arxiv:1604.08826v1 [cs.cv] 29 Apr 2016 ABSTRACT Katsunori Ohnishi Graduate School of Information Science and Technology University of Tokyo ohnishi@mi.t.utokyo.ac.jp
More informationMultilayer and Multimodal Fusion of Deep Neural Networks for Video Classification
Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification Xiaodong Yang, Pavlo Molchanov, Jan Kautz INTELLIGENT VIDEO ANALYTICS Surveillance event detection Human-computer interaction
More informationColumbia University High-Level Feature Detection: Parts-based Concept Detectors
TRECVID 2005 Workshop Columbia University High-Level Feature Detection: Parts-based Concept Detectors Dong-Qing Zhang, Shih-Fu Chang, Winston Hsu, Lexin Xie, Eric Zavesky Digital Video and Multimedia Lab
More informationAction Recognition by Dense Trajectories
Action Recognition by Dense Trajectories Heng Wang, Alexander Kläser, Cordelia Schmid, Liu Cheng-Lin To cite this version: Heng Wang, Alexander Kläser, Cordelia Schmid, Liu Cheng-Lin. Action Recognition
More informationIMA Preprint Series # 2378
SPARSE MODELING OF HUMAN ACTIONS FROM MOTION IMAGERY By Alexey Castrodad and Guillermo Sapiro IMA Preprint Series # 2378 ( September 2011 ) INSTITUTE FOR MATHEMATICS AND ITS APPLICATIONS UNIVERSITY OF
More informationHuman Action Recognition Based on Oriented Motion Salient Regions
Human Action Recognition Based on Oriented Motion Salient Regions Baoxin Wu 1, Shuang Yang 1, Chunfeng Yuan 1, Weiming Hu 1, and Fangshi Wang 2 1 NLPR, Institute of Automation, Chinese Academy of Sciences,
More informationA Unified Method for First and Third Person Action Recognition
A Unified Method for First and Third Person Action Recognition Ali Javidani Department of Computer Science and Engineering Shahid Beheshti University Tehran, Iran a.javidani@mail.sbu.ac.ir Ahmad Mahmoudi-Aznaveh
More informationLarge-scale Video Classification with Convolutional Neural Networks
Large-scale Video Classification with Convolutional Neural Networks Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei Note: Slide content mostly from : Bay Area
More informationLeveraging Textural Features for Recognizing Actions in Low Quality Videos
Leveraging Textural Features for Recognizing Actions in Low Quality Videos Saimunur Rahman, John See, Chiung Ching Ho Centre of Visual Computing, Faculty of Computing and Informatics Multimedia University,
More informationCombined Shape Analysis of Human Poses and Motion Units for Action Segmentation and Recognition
Combined Shape Analysis of Human Poses and Motion Units for Action Segmentation and Recognition Maxime Devanne 1,2, Hazem Wannous 1, Stefano Berretti 2, Pietro Pala 2, Mohamed Daoudi 1, and Alberto Del
More informationAssistive Sports Video Annotation: Modelling and Detecting Complex Events in Sports Video
: Modelling and Detecting Complex Events in Sports Video Aled Owen 1, David Marshall 1, Kirill Sidorov 1, Yulia Hicks 1, and Rhodri Brown 2 1 Cardiff University, Cardiff, UK 2 Welsh Rugby Union Abstract
More informationGesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation
Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation Lorenzo Baraldi 1, Francesco Paci 2, Giuseppe Serra 1, Luca Benini 2,3, Rita Cucchiara 1 1 Dipartimento di Ingegneria
More informationVideo Classification with Densely Extracted HOG/HOF/MBH Features: An Evaluation of the Accuracy/Computational Efficiency Trade-off
Noname manuscript No. (will be inserted by the editor) Video Classification with Densely Extracted HOG/HOF/MBH Features: An Evaluation of the Accuracy/Computational Efficiency Trade-off J. Uijlings I.C.
More informationAn evaluation of bags-of-words and spatio-temporal shapes for action recognition
An evaluation of bags-of-words and spatio-temporal shapes for action recognition Teófilo de Campos, Mark Barnard, Krystian Mikolajczyk, Josef Kittler, Fei Yan, William Christmas and David Windridge CVSSP,
More informationClass 9 Action Recognition
Class 9 Action Recognition Liangliang Cao, April 4, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch Visual Recognition
More informationModified Time Flexible Kernel for Video Activity Recognition using Support Vector Machines
Modified Time Flexible Kernel for Video Activity Recognition using Support Vector Machines Ankit Sharma 1, Apurv Kumar 1, Sony Allappa 1, Veena Thenkanidiyoor 1, Dileep Aroor Dinesh 2 and Shikha Gupta
More informationChapter 2 Action Representation
Chapter 2 Action Representation Abstract In this chapter, various action recognition issues are covered in a concise manner. Various approaches are presented here. In Chap. 1, nomenclatures, various aspects
More informationTwo-Stream Convolutional Networks for Action Recognition in Videos
Two-Stream Convolutional Networks for Action Recognition in Videos Karen Simonyan Andrew Zisserman Cemil Zalluhoğlu Introduction Aim Extend deep Convolution Networks to action recognition in video. Motivation
More informationDynamic Vision Sensors for Human Activity Recognition
Dynamic Vision Sensors for Human Activity Recognition Stefanie Anna Baby 1, Bimal Vinod 2, Chaitanya Chinni 3, Kaushik Mitra 4 Computational Imaging Lab IIT Madras, Chennai, India { 1 ee13b120, 2 ee15m005,
More informationMinimizing hallucination in Histogram of Oriented Gradients
Minimizing hallucination in Histogram of Oriented Gradients Javier Ortiz Sławomir Bąk Michał Koperski François Brémond INRIA Sophia Antipolis, STARS group 2004, route des Lucioles, BP93 06902 Sophia Antipolis
More informationAutomatic summarization of video data
Automatic summarization of video data Presented by Danila Potapov Joint work with: Matthijs Douze Zaid Harchaoui Cordelia Schmid LEAR team, nria Grenoble Khronos-Persyvact Spring School 1.04.2015 Definition
More informationAction Recognition From Videos using Sparse Trajectories
Action Recognition From Videos using Sparse Trajectories Alexandros Doumanoglou, Nicholas Vretos, Petros Daras Centre for Research and Technology - Hellas (ITI-CERTH) 6th Km Charilaou - Thermi, Thessaloniki,
More informationMotion Interchange Patterns for Action Recognition in Unconstrained Videos
Motion Interchange Patterns for Action Recognition in Unconstrained Videos Orit Kliper-Gross, Yaron Gurovich, Tal Hassner, Lior Wolf Weizmann Institute of Science The Open University of Israel Tel Aviv
More informationRobust Action Recognition Using Local Motion and Group Sparsity
Robust Action Recognition Using Local Motion and Group Sparsity Jungchan Cho a, Minsik Lee a, Hyung Jin Chang b, Songhwai Oh a, a Department of Electrical and Computer Engineering and ASRI, Seoul National
More informationVideo Summarization Using MPEG-7 Motion Activity and Audio Descriptors
Video Summarization Using MPEG-7 Motion Activity and Audio Descriptors Ajay Divakaran, Kadir A. Peker, Regunathan Radhakrishnan, Ziyou Xiong and Romain Cabasson Presented by Giulia Fanti 1 Overview Motivation
More informationEvaluation of Local Space-time Descriptors based on Cuboid Detector in Human Action Recognition
International Journal of Innovation and Applied Studies ISSN 2028-9324 Vol. 9 No. 4 Dec. 2014, pp. 1708-1717 2014 Innovative Space of Scientific Research Journals http://www.ijias.issr-journals.org/ Evaluation
More informationHuman Action Recognition from Gradient Boundary Histograms
Human Action Recognition from Gradient Boundary Histograms by Xuelu Wang Thesis submitted to the Faculty of Graduate and Postdoctoral Studies In partial fulfillment of the requirements For the M.A.SC.
More informationRecognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)
Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding
More informationAction Recognition Using Hybrid Feature Descriptor and VLAD Video Encoding
Action Recognition Using Hybrid Feature Descriptor and VLAD Video Encoding Dong Xing, Xianzhong Wang, Hongtao Lu Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive
More informationEigen-Evolution Dense Trajectory Descriptors
Eigen-Evolution Dense Trajectory Descriptors Yang Wang, Vinh Tran, and Minh Hoai Stony Brook University, Stony Brook, NY 11794-2424, USA {wang33, tquangvinh, minhhoai}@cs.stonybrook.edu Abstract Trajectory-pooled
More informationActivity Recognition in Temporally Untrimmed Videos
Activity Recognition in Temporally Untrimmed Videos Bryan Anenberg Stanford University anenberg@stanford.edu Norman Yu Stanford University normanyu@stanford.edu Abstract We investigate strategies to apply
More informationHighlight Ranking for Broadcast Tennis Video Based on Multi-modality Analysis and Relevance Feedback
Highlight Ranking for Broadcast Tennis Video Based on Multi-modality Analysis and Relevance Feedback Guangyu Zhu 1, Qingming Huang 2, and Yihong Gong 3 1 Harbin Institute of Technology, Harbin, P.R. China
More informationEXMOVES: Mid-level Features for Efficient Action Recognition and Video Analysis
Int J Comput Vis (2016) 119:239 253 DOI 10.1007/s11263-016-0905-6 EXMOVES: Mid-level Features for Efficient Action Recognition and Video Analysis Du Tran 1 Lorenzo Torresani 1 Received: 15 May 2014 / Accepted:
More informationAUTOMATED BALL TRACKING IN TENNIS VIDEO
AUTOMATED BALL TRACKING IN TENNIS VIDEO Tayeba Qazi*, Prerana Mukherjee~, Siddharth Srivastava~, Brejesh Lall~, Nathi Ram Chauhan* *Indira Gandhi Delhi Technical University for Women, Delhi ~Indian Institute
More informationVideo Classification with Densely Extracted HOG/HOF/MBH Features: An Evaluation of the Accuracy/Computational Efficiency Trade-off
Noname manuscript No. (will be inserted by the editor) Video Classification with Densely Extracted HOG/HOF/MBH Features: An Evaluation of the Accuracy/Computational Efficiency Trade-off J. Uijlings I.C.
More informationAction Recognition Using Global Spatio-Temporal Features Derived from Sparse Representations
Action Recognition Using Global Spatio-Temporal Features Derived from Sparse Representations Guruprasad Somasundaram, Anoop Cherian, Vassilios Morellas, and Nikolaos Papanikolopoulos Department of Computer
More informationContent-based image and video analysis. Event Recognition
Content-based image and video analysis Event Recognition 21.06.2010 What is an event? a thing that happens or takes place, Oxford Dictionary Examples: Human gestures Human actions (running, drinking, etc.)
More informationLOCAL VISUAL PATTERN MODELLING FOR IMAGE AND VIDEO CLASSIFICATION
LOCAL VISUAL PATTERN MODELLING FOR IMAGE AND VIDEO CLASSIFICATION Peng Wang A thesis submitted for the degree of Doctor of Philosophy at The University of Queensland in 2017 School of Information Technology
More informationHighlights Extraction from Unscripted Video
Highlights Extraction from Unscripted Video T 61.6030, Multimedia Retrieval Seminar presentation 04.04.2008 Harrison Mfula Helsinki University of Technology Department of Computer Science, Espoo, Finland
More informationStatistics of Pairwise Co-occurring Local Spatio-Temporal Features for Human Action Recognition
Statistics of Pairwise Co-occurring Local Spatio-Temporal Features for Human Action Recognition Piotr Bilinski, Francois Bremond To cite this version: Piotr Bilinski, Francois Bremond. Statistics of Pairwise
More informationA Hybrid Approach to News Video Classification with Multi-modal Features
A Hybrid Approach to News Video Classification with Multi-modal Features Peng Wang, Rui Cai and Shi-Qiang Yang Department of Computer Science and Technology, Tsinghua University, Beijing 00084, China Email:
More informationHUMAN ACTION RECOGNITION
HUMAN ACTION RECOGNITION Human Action Recognition 1. Hand crafted feature + Shallow classifier 2. Human localization + (Hand crafted features) + 3D CNN Input is a small chunk of video 3. 3D CNN Input is
More informationSummarization of Egocentric Moving Videos for Generating Walking Route Guidance
Summarization of Egocentric Moving Videos for Generating Walking Route Guidance Masaya Okamoto and Keiji Yanai Department of Informatics, The University of Electro-Communications 1-5-1 Chofugaoka, Chofu-shi,
More informationACTIVE CLASSIFICATION FOR HUMAN ACTION RECOGNITION. Alexandros Iosifidis, Anastasios Tefas and Ioannis Pitas
ACTIVE CLASSIFICATION FOR HUMAN ACTION RECOGNITION Alexandros Iosifidis, Anastasios Tefas and Ioannis Pitas Depeartment of Informatics, Aristotle University of Thessaloniki, Greece {aiosif,tefas,pitas}@aiia.csd.auth.gr
More informationQMUL-ACTIVA: Person Runs detection for the TRECVID Surveillance Event Detection task
QMUL-ACTIVA: Person Runs detection for the TRECVID Surveillance Event Detection task Fahad Daniyal and Andrea Cavallaro Queen Mary University of London Mile End Road, London E1 4NS (United Kingdom) {fahad.daniyal,andrea.cavallaro}@eecs.qmul.ac.uk
More informationSpatio-temporal Feature Classifier
Spatio-temporal Feature Classifier Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2015, 7, 1-7 1 Open Access Yun Wang 1,* and Suxing Liu 2 1 School
More informationHuman Detection and Tracking for Video Surveillance: A Cognitive Science Approach
Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach Vandit Gajjar gajjar.vandit.381@ldce.ac.in Ayesha Gurnani gurnani.ayesha.52@ldce.ac.in Yash Khandhediya khandhediya.yash.364@ldce.ac.in
More informationA Survey on Content-aware Video Analysis for Sports
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 99, NO. 9, JANUARY 2017 1 A Survey on Content-aware Video Analysis for Sports Huang-Chia Shih, Member, IEEE Abstract Sports data analysis
More informationAction Recognition with Improved Trajectories
Action Recognition with Improved Trajectories Heng Wang and Cordelia Schmid LEAR, INRIA, France firstname.lastname@inria.fr Abstract Recently dense trajectories were shown to be an efficient video representation
More informationVision and Image Processing Lab., CRV Tutorial day- May 30, 2010 Ottawa, Canada
Spatio-Temporal Salient Features Amir H. Shabani Vision and Image Processing Lab., University of Waterloo, ON CRV Tutorial day- May 30, 2010 Ottawa, Canada 1 Applications Automated surveillance for scene
More informationMultiple-Choice Questionnaire Group C
Family name: Vision and Machine-Learning Given name: 1/28/2011 Multiple-Choice naire Group C No documents authorized. There can be several right answers to a question. Marking-scheme: 2 points if all right
More informationGUANGHAN NING Pinard St, Milpitas, CA, 95035
GUANGHAN NING 573-825-8230 gnxr9@mail.missouri.edu 2294 Pinard St, Milpitas, CA, 95035 EDUCATION Ph.D. Candidate, Electrical and Computer Engineering, University of Missouri, Columbia, MO, Supervisor:
More informationConsumer Video Understanding
Consumer Video Understanding A Benchmark Database + An Evaluation of Human & Machine Performance Yu-Gang Jiang, Guangnan Ye, Shih-Fu Chang, Daniel Ellis, Alexander C. Loui Columbia University Kodak Research
More informationBEYOND BAG-OF-WORDS: FAST VIDEO CLASSIFICATION WITH FISHER KERNEL VECTOR OF LOCALLY AGGREGATED DESCRIPTORS
BEYOND BAG-OF-WORDS: FAST VIDEO CLASSIFICATION WITH FISHER KERNEL VECTOR OF LOCALLY AGGREGATED DESCRIPTORS Ionuţ Mironică 1, Ionuţ Duţă 2,1, Bogdan Ionescu 1, Nicu Sebe 2 1 LAPI, University Politehnica
More informationMatching Mixtures of Curves for Human Action Recognition
Matching Mixtures of Curves for Human Action Recognition Michalis Vrigkas 1, Vasileios Karavasilis 1, Christophoros Nikou 1, and Ioannis A. Kakadiaris 2 1 Department of Computer Science, University of
More informationSupervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information
Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department of Information
More informationSCENE TEXT RECOGNITION IN MULTIPLE FRAMES BASED ON TEXT TRACKING
SCENE TEXT RECOGNITION IN MULTIPLE FRAMES BASED ON TEXT TRACKING Xuejian Rong 1, Chucai Yi 2, Xiaodong Yang 1 and Yingli Tian 1,2 1 The City College, 2 The Graduate Center, City University of New York
More informationAction Localization in Video using a Graph-based Feature Representation
Action Localization in Video using a Graph-based Feature Representation Iveel Jargalsaikhan, Suzanne Little and Noel E O Connor Insight Centre for Data Analytics, Dublin City University, Ireland iveel.jargalsaikhan2@mail.dcu.ie
More informationEfficient and effective human action recognition in video through motion boundary description with a compact set of trajectories
biblio.ugent.be The UGent Institutional Repository is the electronic archiving and dissemination platform for all UGent research publications. Ghent University has implemented a mandate stipulating that
More informationVideo Action Detection with Relational Dynamic-Poselets
Video Action Detection with Relational Dynamic-Poselets Limin Wang 1,2, Yu Qiao 2, Xiaoou Tang 1,2 1 Department of Information Engineering, The Chinese University of Hong Kong 2 Shenzhen Key Lab of CVPR,
More informationReal-Time Content-Based Adaptive Streaming of Sports Videos
Real-Time Content-Based Adaptive Streaming of Sports Videos Shih-Fu Chang, Di Zhong, and Raj Kumar Digital Video and Multimedia Group ADVENT University/Industry Consortium Columbia University December
More informationThe Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System
The Stanford/Technicolor/Fraunhofer HHI Video Semantic Indexing System Our first participation on the TRECVID workshop A. F. de Araujo 1, F. Silveira 2, H. Lakshman 3, J. Zepeda 2, A. Sheth 2, P. Perez
More informationTeam SRI-Sarnoff s AURORA TRECVID 2011
Team SRI-Sarnoff s AURORA System @ TRECVID 2011 Hui Cheng, Amir Tamrakar, Saad Ali, Qian Yu, Omar Javed, Jingen Liu, Ajay Divakaran, Harpreet S. Sawhney, Alex Hauptmann, Mubarak Shah, Subhabrata Bhattacharya,
More informationLocal Part Model for Action Recognition in Realistic Videos
Local Part Model for Action Recognition in Realistic Videos by Feng Shi Thesis submitted to Faculty of Graduate and Postdoctoral Studies in partial fulfillment of the requirements for the Doctorate of
More informationEigenJoints-based Action Recognition Using Naïve-Bayes-Nearest-Neighbor
EigenJoints-based Action Recognition Using Naïve-Bayes-Nearest-Neighbor Xiaodong Yang and YingLi Tian Department of Electrical Engineering The City College of New York, CUNY {xyang02, ytian}@ccny.cuny.edu
More informationCategorizing Turn-Taking Interactions
Categorizing Turn-Taking Interactions Karthir Prabhakar and James M. Rehg Center for Behavior Imaging and RIM@GT School of Interactive Computing, Georgia Institute of Technology {karthir.prabhakar,rehg}@cc.gatech.edu
More informationLatent Variable Models for Structured Prediction and Content-Based Retrieval
Latent Variable Models for Structured Prediction and Content-Based Retrieval Ariadna Quattoni Universitat Politècnica de Catalunya Joint work with Borja Balle, Xavier Carreras, Adrià Recasens, Antonio
More informationAction is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization Nataliya Shapovalova Michalis Raptis Leonid Sigal Greg Mori Simon Fraser University Comcast Disney Research
More informationApproach to Metadata Production and Application Technology Research
Approach to Metadata Production and Application Technology Research In the areas of broadcasting based on home servers and content retrieval, the importance of segment metadata, which is attached in segment
More informationLesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval
Lesson 11 Media Retrieval Information Retrieval Image Retrieval Video Retrieval Audio Retrieval Information Retrieval Retrieval = Query + Search Informational Retrieval: Get required information from database/web
More informationImageCLEF 2011
SZTAKI @ ImageCLEF 2011 Bálint Daróczy joint work with András Benczúr, Róbert Pethes Data Mining and Web Search Group Computer and Automation Research Institute Hungarian Academy of Sciences Training/test
More informationLeveraging Textural Features for Recognizing Actions in Low Quality Videos
Leveraging Textural Features for Recognizing Actions in Low Quality Videos Saimunur Rahman 1, John See 2, and Chiung Ching Ho 3 Centre of Visual Computing, Faculty of Computing and Informatics Multimedia
More informationReal-Time Action Detection in Video Surveillance using Sub-Action Descriptor with Multi-CNN
Real-Time Action Detection in Video Surveillance using Sub-Action Descriptor with Multi-CNN Cheng-Bin Jin *, Shengzhe Li, and Hakil Kim * * Inha University, Incheon, Korea Visionin Inc., Incheon, Korea
More informationEfficient Activity Detection in Untrimmed Video with Max-Subgraph Search
1 Efficient Activity Detection in Untrimmed Video with Max- Search Chao Yeh Chen and Kristen Grauman arxiv:1607.02815v1 [cs.cv] 11 Jul 2016 Abstract We propose an efficient approach for activity detection
More informationVisual Action Recognition
Visual Action Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University, Evanston, IL 60208 yingwu@northwestern.edu http://www.eecs.northwestern.edu/~yingwu 1 / 57 Outline
More informationCLUSTER ENCODING FOR MODELLING TEMPORAL VARIATION IN VIDEO
CLUSTER ENCODING FOR MODELLING TEMPORAL VARIATION IN VIDEO Negar Rostamzadeh, Jasper Uijlings, Ionuţ Mironică, Mojtaba Khomami Abadi, Bogdan Ionescu, Nicu Sebe University of Trento, Italy CALVIN group,
More information