Crowd Scene Understanding with Coherent Recurrent Neural Networks
|
|
- Darcy Lucas
- 6 years ago
- Views:
Transcription
1 Crowd Scene Understanding with Coherent Recurrent Neural Networks Hang Su, Yinpeng Dong, Jun Zhu May 22, 2016 Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
2 Outline 1 Introduction 2 LSTM Recap 3 Coherent LSTM 4 Experimental Results Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
3 Outline 1 Introduction 2 LSTM Recap 3 Coherent LSTM 4 Experimental Results Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
4 Background Crowd scene is the scene of public places where a large group of people who have gathered together such as a university campus or the sidewalks of a busy street. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
5 Background Crowd scene is the scene of public places where a large group of people who have gathered together such as a university campus or the sidewalks of a busy street. Groups are the main entities that make up a crowd. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
6 Background Crowd scene is the scene of public places where a large group of people who have gathered together such as a university campus or the sidewalks of a busy street. Groups are the main entities that make up a crowd. When pedestrians walk in a crowded space, their trajectories are influenced by others and obstacles. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
7 Background Crowd scene is the scene of public places where a large group of people who have gathered together such as a university campus or the sidewalks of a busy street. Groups are the main entities that make up a crowd. When pedestrians walk in a crowded space, their trajectories are influenced by others and obstacles. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
8 Applications Understanding collective behaviors in crowd scenes has a wide range of applications in Video Surveillance Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
9 Applications Understanding collective behaviors in crowd scenes has a wide range of applications in Video Surveillance Crowd Management Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
10 Applications Understanding collective behaviors in crowd scenes has a wide range of applications in Video Surveillance Crowd Management Avoiding Tragic Accidents Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
11 Problem Formulation Obtain reliable tracklets from each scene using KLT trackers. At any time-instant t, the i th person is represented by his/her coordinate (x i (t), y i (t)). Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
12 Problem Formulation Obtain reliable tracklets from each scene using KLT trackers. At any time-instant t, the i th person is represented by his/her coordinate (x i (t), y i (t)). Predict future trajectories of pedestrians and use extracted hidden features to do other classification tasks. LSTM LSTM Coherent regularization Motion Prediction LSTM Coherent regularization Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
13 Challenge Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
14 Challenge Crowd spatio-temporal patterns behave nonlinear dynamics Limit cycles Quasi-period Chaos Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
15 Challenge Crowd spatio-temporal patterns behave nonlinear dynamics Limit cycles Quasi-period Chaos Collective effect (or coherent motion) Pedestrian tend to form groups Intra-group properties and inter-group properties. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
16 Previous Work Traditional approach such as Social Force model Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
17 Previous Work Traditional approach such as Social Force model Optimize energy function Hand-crafted functions Hard to generalize Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
18 Previous Work Traditional approach such as Social Force model Optimize energy function Hand-crafted functions Hard to generalize Probabilistic Forecasting such as Gaussian Process Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
19 Previous Work Traditional approach such as Social Force model Optimize energy function Hand-crafted functions Hard to generalize Probabilistic Forecasting such as Gaussian Process Recurrent Neural Networks N-LSTM (CVPR 2016) Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
20 Outline 1 Introduction 2 LSTM Recap 3 Coherent LSTM 4 Experimental Results Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
21 LSTM Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
22 LSTM Structure Input / Output / Forget gate Memory state c t Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
23 LSTM Structure Input / Output / Forget gate Memory state c t Advantage Prevent vanishing gradient problem Nonlinear characteristic Generalization Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
24 LSTM i t = σ(w xi x t + W hi h t 1 + W ci c t 1 + b i ) (1) f t = σ(w xf x t + W hf h t 1 + W cf c t 1 + b f ) (2) c t =f t c t 1 + i t tanh(w xc x t + W hc h t 1 + b c ) (3) o t =σ(w xo x t + W ho h t 1 + W co c t + b o ) (4) h t = o t tanh(c t ) (5) Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
25 Outline 1 Introduction 2 LSTM Recap 3 Coherent LSTM 4 Experimental Results Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
26 Why Coherent LSTM? LSTM can model individual behaviors but dose not capture the interaction of people in a group. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
27 Why Coherent LSTM? LSTM can model individual behaviors but dose not capture the interaction of people in a group. The individuals are always willing to engage with seed groups and form spatially coherent structures. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
28 Why Coherent LSTM? LSTM can model individual behaviors but dose not capture the interaction of people in a group. The individuals are always willing to engage with seed groups and form spatially coherent structures. When the neighboring relationship of individuals remain invariant over time and correlation of their velocities remain high, they tend to have similar hidden state. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
29 Why Coherent LSTM? LSTM can model individual behaviors but dose not capture the interaction of people in a group. The individuals are always willing to engage with seed groups and form spatially coherent structures. When the neighboring relationship of individuals remain invariant over time and correlation of their velocities remain high, they tend to have similar hidden state. The trajectories of pedestrians not only follow the old trend, but also are influenced by current environment. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
30 clstm Unit c t = f t c t 1 + j N λ j (t)f j t cj t 1 + i t tanh(w xc x t + W hc h t 1 + b c ) (6) Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
31 clstm Unit c t = f t c t 1 + j N λ j (t)f j t cj t 1 + i t tanh(w xc x t + W hc h t 1 + b c ) (6) σ Forget Gate σ Input Gate Cell ϕ Coherent Regularization x t ht 1 Output Gate σ h t Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
32 Coherent Motion Modeling Use coherent filtering [Zhou et al., 2012a] [Shao et al., 2014] to discover the coherent group. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
33 Coherent Motion Modeling Use coherent filtering [Zhou et al., 2012a] [Shao et al., 2014] to discover the coherent group. The dependency relationship between two tracklets within the same group is measured as: τ j (t) = v i(t) v j (t) v i (t) v j (t) (7) Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
34 Dependency Coefficient The dependency coefficient between the i th and j th tracklets in Eq. (6) is defined as λ j (t) = 1 ( ) τj (t) 1 exp Z i 2σ 2 (0, 1], (8) Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
35 Dependency Coefficient The dependency coefficient between the i th and j th tracklets in Eq. (6) is defined as λ j (t) = 1 ( ) τj (t) 1 exp Z i 2σ 2 (0, 1], (8) Z i : normalization constant corresponding to the i th tracklet. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
36 Dependency Coefficient The dependency coefficient between the i th and j th tracklets in Eq. (6) is defined as λ j (t) = 1 ( ) τj (t) 1 exp Z i 2σ 2 (0, 1], (8) Z i : normalization constant corresponding to the i th tracklet. λ j (t) 1 if v i (t) v j (t) which implies that tracklets i and j are similar. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
37 Dependency Coefficient The dependency coefficient between the i th and j th tracklets in Eq. (6) is defined as λ j (t) = 1 ( ) τj (t) 1 exp Z i 2σ 2 (0, 1], (8) Z i : normalization constant corresponding to the i th tracklet. λ j (t) 1 if v i (t) v j (t) which implies that tracklets i and j are similar. Coherent regularization encourages the tracklets to learn similar feature distributions by sharing information across tracklets within a coherent group. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
38 Framework Unsupervised encoder-decoder clstm framework: h T = clstm e (x T, h T 1 ), (9) ˆx t = clstm dr (h t, ˆx t+1 ), where t [1, T ], (10) ˆx t = clstm dp (h t, ˆx t 1 ). where t > T, (11) ˆx 3 ˆx ˆx 2 1 x1 x2 x3 Wrd Wrd Reconstruction Decoder We We h T ˆx 4 ˆx 5 ˆx 6 Encoder Learnt Hidden Features Wpd Wpd Prediction Decoder Coherent Regularization Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
39 Crowd Scene Profiling Solve critical tasks in crowd scene analysis: Estimating group state Gas, Solid, Pure Fluid and Impure Fluid Softmax classification using the feature learnt from the unsupervised clstm. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
40 Crowd Scene Profiling Solve critical tasks in crowd scene analysis: Estimating group state Gas, Solid, Pure Fluid and Impure Fluid Softmax classification using the feature learnt from the unsupervised clstm. Crowd video classification Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
41 Outline 1 Introduction 2 LSTM Recap 3 Coherent LSTM 4 Experimental Results Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
42 Datasets and Settings CUHK Crowd Dataset Scene: streets, shopping malls, airports and parks More than 400 sequences and more then 200,000 traklets Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
43 Datasets and Settings CUHK Crowd Dataset Scene: streets, shopping malls, airports and parks More than 400 sequences and more then 200,000 traklets Settings 128 hidden units in clstm 2/3 of tracklets as the input and 1/3 as the predicted tracklets to evaluate the performance. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
44 Future Path Forecasting Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
45 Future Path Forecasting Table 1: Error of Path Prediction(pixels) Kalman Filter Un-coherent LSTM Coherent LSTM 9.32 ± ± ±0.93 Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
46 Group State Estimation Gas: Particles move in different directions without forming collective behaviors Solid: Particles move in the same direction with relative positions unchanged Pure Fluid: Particles move towards the same direction with ever-changing relative positions Impure Fluid: Particles move in a pure fluid style with invasion of particles from other groups Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
47 Group State Estimation Gas: Particles move in different directions without forming collective behaviors Solid: Particles move in the same direction with relative positions unchanged Pure Fluid: Particles move towards the same direction with ever-changing relative positions Impure Fluid: Particles move in a pure fluid style with invasion of particles from other groups (a) Gas (b) Solid (c) Pure Fluid (d) Impure Fluid Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
48 Group State Estimation Confusion matrices of estimating group states using different methods: (a) collective transition [Shao et al., 2014]; (b) prediction LSTM; (c) reconstruction LSTM; (d) un-coherent LSTM; and (e) coherent LSTM. (a) Collective Transition (b) Prediction LSTM (c) Reconstruction LSTM (d) Un-coherent LSTM (e) Coherent LSTM Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
49 Crowd Video Classification All video clips are annotated into 8 classes as 1) Highly mixed pedestrian walking; 2) Crowd walking following a mainstream and well organized; 3) Crowd walking following a mainstream but poorly organized; 4) Crowd merge; 5) Crowd split; 6) Crowd crossing in opposite directions; 7) Intervened escalator traffic; and 8) Smooth escalator traffic. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
50 Crowd Video Classification All video clips are annotated into 8 classes as 1) Highly mixed pedestrian walking; 2) Crowd walking following a mainstream and well organized; 3) Crowd walking following a mainstream but poorly organized; 4) Crowd merge; 5) Crowd split; 6) Crowd crossing in opposite directions; 7) Intervened escalator traffic; and 8) Smooth escalator traffic. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
51 Crowd Video Classification All video clips are annotated into 8 classes as 1) Highly mixed pedestrian walking; 2) Crowd walking following a mainstream and well organized; 3) Crowd walking following a mainstream but poorly organized; 4) Crowd merge; 5) Crowd split; 6) Crowd crossing in opposite directions; 7) Intervened escalator traffic; and 8) Smooth escalator traffic. Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
52 Thanks for your time! Questions? Hang Su, Yinpeng Dong, Jun Zhu IJCAI 2016 May 22, / 26
Crowd Scene Understanding with Coherent Recurrent Neural Networks
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16) Crowd Scene Understanding with Coherent Recurrent Neural Networks Hang Su, Yinpeng Dong, Jun Zhu, Haibin
More informationRecurrent Neural Network (RNN) Industrial AI Lab.
Recurrent Neural Network (RNN) Industrial AI Lab. For example (Deterministic) Time Series Data Closed- form Linear difference equation (LDE) and initial condition High order LDEs 2 (Stochastic) Time Series
More informationPrediction of Pedestrian Trajectories Final Report
Prediction of Pedestrian Trajectories Final Report Mingchen Li (limc), Yiyang Li (yiyang7), Gendong Zhang (zgdsh29) December 15, 2017 1 Introduction As the industry of automotive vehicles growing rapidly,
More informationDefinition, Detection, and Evaluation of Meeting Events in Airport Surveillance Videos
Definition, Detection, and Evaluation of Meeting Events in Airport Surveillance Videos Sung Chun Lee, Chang Huang, and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA sungchun@usc.edu,
More informationRecurrent Neural Nets II
Recurrent Neural Nets II Steven Spielberg Pon Kumar, Tingke (Kevin) Shen Machine Learning Reading Group, Fall 2016 9 November, 2016 Outline 1 Introduction 2 Problem Formulations with RNNs 3 LSTM for Optimization
More informationMachine Learning: Chenhao Tan University of Colorado Boulder LECTURE 15
Machine Learning: Chenhao Tan University of Colorado Boulder LECTURE 15 Slides adapted from Jordan Boyd-Graber Machine Learning: Chenhao Tan Boulder 1 of 21 Logistics HW3 available on Github, due on October
More informationLSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University
LSTM and its variants for visual recognition Xiaodan Liang xdliang328@gmail.com Sun Yat-sen University Outline Context Modelling with CNN LSTM and its Variants LSTM Architecture Variants Application in
More informationMining Human Trajectory Data: A Study on Check-in Sequences. Xin Zhao Renmin University of China,
Mining Human Trajectory Data: A Study on Check-in Sequences Xin Zhao batmanfly@qq.com Renmin University of China, Check-in data What information these check-in data contain? User ID Location ID Check-in
More informationLSTM: An Image Classification Model Based on Fashion-MNIST Dataset
LSTM: An Image Classification Model Based on Fashion-MNIST Dataset Kexin Zhang, Research School of Computer Science, Australian National University Kexin Zhang, U6342657@anu.edu.au Abstract. The application
More informationTitle. Author(s)Noguchi, Wataru; Iizuka, Hiroyuki; Yamamoto, Masahit. CitationEAI Endorsed Transactions on Security and Safety, 16
Title Proposing Multimodal Integration Model Using LSTM an Author(s)Noguchi, Wataru; Iizuka, Hiroyuki; Yamamoto, Masahit CitationEAI Endorsed Transactions on Security and Safety, 16 Issue Date 216-12-28
More informationarxiv: v2 [cs.cv] 14 May 2018
ContextVP: Fully Context-Aware Video Prediction Wonmin Byeon 1234, Qin Wang 1, Rupesh Kumar Srivastava 3, and Petros Koumoutsakos 1 arxiv:1710.08518v2 [cs.cv] 14 May 2018 Abstract Video prediction models
More informationDiffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting
Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting Yaguang Li Joint work with Rose Yu, Cyrus Shahabi, Yan Liu Page 1 Introduction Traffic congesting is wasteful of time,
More information16-785: Integrated Intelligence in Robotics: Vision, Language, and Planning. Spring 2018 Lecture 14. Image to Text
16-785: Integrated Intelligence in Robotics: Vision, Language, and Planning Spring 2018 Lecture 14. Image to Text Input Output Classification tasks 4/1/18 CMU 16-785: Integrated Intelligence in Robotics
More informationRecurrent Neural Networks. Nand Kishore, Audrey Huang, Rohan Batra
Recurrent Neural Networks Nand Kishore, Audrey Huang, Rohan Batra Roadmap Issues Motivation 1 Application 1: Sequence Level Training 2 Basic Structure 3 4 Variations 5 Application 3: Image Classification
More informationMachine Learning 13. week
Machine Learning 13. week Deep Learning Convolutional Neural Network Recurrent Neural Network 1 Why Deep Learning is so Popular? 1. Increase in the amount of data Thanks to the Internet, huge amount of
More informationProbabilistic Vehicle Trajectory Prediction over Occupancy Grid Map via Recurrent Neural Network
Probabilistic Vehicle Trajectory Prediction over Occupancy Grid Map via Recurrent Neural Network ByeoungDo Kim, Chang Mook Kang, Jaekyum Kim, Seung Hi Lee, Chung Choo Chung, and Jun Won Choi* Hanyang University,
More informationGroup Activity Recognition with Differential Recurrent Convolutional Neural Networks
Group Activity Recognition with Differential Recurrent Convolutional Neural Networks Naifan Zhuang1, Tuoerhongjiang Yusufu1, Jun Ye2 and Kien A. Hua1 1 Department of Computer Science, University of Central
More informationRobotics. Chapter 25. Chapter 25 1
Robotics Chapter 25 Chapter 25 1 Outline Robots, Effectors, and Sensors Localization and Mapping Motion Planning Chapter 25 2 Mobile Robots Chapter 25 3 Manipulators P R R R R R Configuration of robot
More informationTracking Pedestrians using Local Spatio-temporal Motion Patterns in Extremely Crowded Scenes
1 Submitted to IEEE Trans. on Pattern Analysis and Machine Intelligence Regular Paper Tracking Pedestrians using Local Spatio-temporal Motion Patterns in Extremely Crowded Scenes Louis Kratz and Ko Nishino
More informationTracking with Local Spatio-Temporal Motion Patterns in Extremely Crowded Scenes
Tracking with Local Spatio-Temporal Motion Patterns in Extremely Crowded Scenes Louis Kratz Ko Nishino Department of Computer Science Drexel University {lak24, kon}@drexel.edu Abstract Tracking individuals
More informationPixel-level Generative Model
Pixel-level Generative Model Generative Image Modeling Using Spatial LSTMs (2015NIPS) L. Theis and M. Bethge University of Tübingen, Germany Pixel Recurrent Neural Networks (2016ICML) A. van den Oord,
More informationSequence Modeling: Recurrent and Recursive Nets. By Pyry Takala 14 Oct 2015
Sequence Modeling: Recurrent and Recursive Nets By Pyry Takala 14 Oct 2015 Agenda Why Recurrent neural networks? Anatomy and basic training of an RNN (10.2, 10.2.1) Properties of RNNs (10.2.2, 8.2.6) Using
More informationDetection of Video Anomalies Using Convolutional Autoencoders and One-Class Support Vector Machines
Detection of Video Anomalies Using Convolutional Autoencoders and One-Class Support Vector Machines Matheus Gutoski 1, Nelson Marcelo Romero Aquino 2 Manassés Ribeiro 3, André Engênio Lazzaretti 4, Heitor
More informationWHY do we study crowds and why is tracking individuals
JOURNAL OF L A T E X CLASS FILES, VOL. 4, NO. 8, AUGUST 2 Binary Quadratic Programing for Online Tracking of Hundreds of People in Extremely Crowded Scenes Afshin Dehghan, Member, IEEE, and Mubarak Shah,
More informationMobility Models. Larissa Marinho Eglem de Oliveira. May 26th CMPE 257 Wireless Networks. (UCSC) May / 50
Mobility Models Larissa Marinho Eglem de Oliveira CMPE 257 Wireless Networks May 26th 2015 (UCSC) May 2015 1 / 50 1 Motivation 2 Mobility Models 3 Extracting a Mobility Model from Real User Traces 4 Self-similar
More informationRECURRENT NEURAL NETWORKS
RECURRENT NEURAL NETWORKS Methods Traditional Deep-Learning based Non-machine Learning Machine-Learning based method Supervised SVM MLP CNN RNN (LSTM) Localizati on GPS, SLAM Self Driving Perception Pedestrian
More informationImage Captioning with Object Detection and Localization
Image Captioning with Object Detection and Localization Zhongliang Yang, Yu-Jin Zhang, Sadaqat ur Rehman, Yongfeng Huang, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
More informationCROWD MOTION ANALYSIS: SEGMENTATION, ANOMALY DETECTION, AND BEHAVIOR CLASSIFICATION. Habib Ullah
CROWD MOTION ANALYSIS: SEGMENTATION, ANOMALY DETECTION, AND BEHAVIOR CLASSIFICATION Habib Ullah Advisor: Nicola Conci, PhD February 2015 Abstract The objective of this doctoral study is to develop efficient
More informationSpatial Latent Dirichlet Allocation
Spatial Latent Dirichlet Allocation Xiaogang Wang and Eric Grimson Computer Science and Computer Science and Artificial Intelligence Lab Massachusetts Tnstitute of Technology, Cambridge, MA, 02139, USA
More informationLearning the Three Factors of a Non-overlapping Multi-camera Network Topology
Learning the Three Factors of a Non-overlapping Multi-camera Network Topology Xiaotang Chen, Kaiqi Huang, and Tieniu Tan National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy
More informationDeep Learning Applications
October 20, 2017 Overview Supervised Learning Feedforward neural network Convolution neural network Recurrent neural network Recursive neural network (Recursive neural tensor network) Unsupervised Learning
More informationCAP 6412 Advanced Computer Vision
CAP 6412 Advanced Computer Vision http://www.cs.ucf.edu/~bgong/cap6412.html Boqing Gong Feb 04, 2016 Today Administrivia Attention Modeling in Image Captioning, by Karan Neural networks & Backpropagation
More informationCrowd Behavior Detection for Abnormal Conditions
International Journal of Computer Systems (ISSN: 2394-1065), Volume 03 Issue 06, June, 2016 Available at http://www.ijcsonline.com/ Aniket A. Patil, Prof. S. A. Shinde Department of Computer Engineering,
More informationPREDICTION OF ANOMALOUS ACTIVITIES IN A VIDEO
PREDICTION OF ANOMALOUS ACTIVITIES IN A VIDEO Lekshmy K Nair 1 P.G. Student, Department of Computer Science and Engineering, LBS Institute of Technology for Women, Trivandrum, Kerala, India ---------------------------------------------------------------------***---------------------------------------------------------------------
More informationAction Classification in Soccer Videos with Long Short-Term Memory Recurrent Neural Networks
Action Classification in Soccer Videos with Long Short-Term Memory Recurrent Neural Networks Moez Baccouche 1,2, Franck Mamalet 1, Christian Wolf 2, Christophe Garcia 1, and Atilla Baskurt 2 1 Orange Labs,
More informationShow, Discriminate, and Tell: A Discriminatory Image Captioning Model with Deep Neural Networks
Show, Discriminate, and Tell: A Discriminatory Image Captioning Model with Deep Neural Networks Zelun Luo Department of Computer Science Stanford University zelunluo@stanford.edu Te-Lin Wu Department of
More informationReal-time Gesture Pattern Classification with IMU Data
Real-time Gesture Pattern Classification with IMU Data Alex Fu Stanford University Computer Science Department alexfu@stanford.edu Yangyang Yu Stanford University Electrical Engineering Department yyu10@stanford.edu
More informationCAP 6412 Advanced Computer Vision
CAP 6412 Advanced Computer Vision http://www.cs.ucf.edu/~bgong/cap6412.html Boqing Gong April 5th, 2016 Today Administrivia LSTM Attribute in computer vision, by Abdullah and Samer Project II posted, due
More informationarxiv: v1 [cs.cv] 16 Jun 2016
HOLISTIC FEATURES FOR REAL-TIME CROWD BEHAVIOUR ANOMALY DETECTION Mark Marsden Kevin McGuinness Suzanne Little Noel E. O Connor Insight Centre for Data Analytics Dublin City University Dublin, Ireland
More informationOverview Citation. ML Introduction. Overview Schedule. ML Intro Dataset. Introduction to Semi-Supervised Learning Review 10/4/2010
INFORMATICS SEMINAR SEPT. 27 & OCT. 4, 2010 Introduction to Semi-Supervised Learning Review 2 Overview Citation X. Zhu and A.B. Goldberg, Introduction to Semi- Supervised Learning, Morgan & Claypool Publishers,
More informationRealtime Anomaly Detection using Trajectory-level Crowd Behavior Learning
Realtime Anomaly Detection using Trajectory-level Crowd Behavior Learning Aniket Bera University of North Carolina Chapel Hill, NC, USA ab@cs.unc.edu Sujeong Kim SRI International Princeton, NJ, USA sujeong.kim@sri.com
More informationDeep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon
Deep Learning For Video Classification Presented by Natalie Carlebach & Gil Sharon Overview Of Presentation Motivation Challenges of video classification Common datasets 4 different methods presented in
More informationarxiv: v1 [cond-mat.dis-nn] 30 Dec 2018
A General Deep Learning Framework for Structure and Dynamics Reconstruction from Time Series Data arxiv:1812.11482v1 [cond-mat.dis-nn] 30 Dec 2018 Zhang Zhang, Jing Liu, Shuo Wang, Ruyue Xin, Jiang Zhang
More informationDeep Learning. Deep Learning. Practical Application Automatically Adding Sounds To Silent Movies
http://blog.csdn.net/zouxy09/article/details/8775360 Automatic Colorization of Black and White Images Automatically Adding Sounds To Silent Movies Traditionally this was done by hand with human effort
More informationMotion in 2D image sequences
Motion in 2D image sequences Definitely used in human vision Object detection and tracking Navigation and obstacle avoidance Analysis of actions or activities Segmentation and understanding of video sequences
More informationStructured Attention Networks
Structured Attention Networks Yoon Kim Carl Denton Luong Hoang Alexander M. Rush HarvardNLP ICLR, 2017 Presenter: Chao Jiang ICLR, 2017 Presenter: Chao Jiang 1 / Outline 1 Deep Neutral Networks for Text
More informationPairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement
Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement Daegeon Kim Sung Chun Lee Institute for Robotics and Intelligent Systems University of Southern
More informationW4. Perception & Situation Awareness & Decision making
W4. Perception & Situation Awareness & Decision making Robot Perception for Dynamic environments: Outline & DP-Grids concept Dynamic Probabilistic Grids Bayesian Occupancy Filter concept Dynamic Probabilistic
More informationais Rheinische Friedrich-Wilhelms-Universität Bonn Rheinische Master thesis
ais Rheinische Friedrich-Wilhelms- Universität Bonn Institute for Computer Science Department VI Autonomous Intelligent Systems Rheinische Friedrich-Wilhelms-Universität Bonn Master thesis Video Prediction
More informationCSE 586 Final Programming Project Spring 2011 Due date: Tuesday, May 3
CSE 586 Final Programming Project Spring 2011 Due date: Tuesday, May 3 What I have in mind for our last programming project is to do something with either graphical models or random sampling. A few ideas
More information2003 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes
2003 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or
More informationNon-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method
Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs
More informationDeep Learning and Its Applications
Convolutional Neural Network and Its Application in Image Recognition Oct 28, 2016 Outline 1 A Motivating Example 2 The Convolutional Neural Network (CNN) Model 3 Training the CNN Model 4 Issues and Recent
More informationChapter 9 Object Tracking an Overview
Chapter 9 Object Tracking an Overview The output of the background subtraction algorithm, described in the previous chapter, is a classification (segmentation) of pixels into foreground pixels (those belonging
More informationRandom Field Topic Model for Semantic Region Analysis in Crowded Scenes from Tracklets
Random Field Topic Model for Semantic Region Analysis in Crowded Scenes from Tracklets Bolei Zhou, Xiaogang Wang 2,3, and Xiaoou Tang,3 Department of Information Engineering, The Chinese University of
More informationConditional Random Fields as Recurrent Neural Networks
BIL722 - Deep Learning for Computer Vision Conditional Random Fields as Recurrent Neural Networks S. Zheng, S. Jayasumana, B. Romera-Paredes V. Vineet, Z. Su, D. Du, C. Huang, P.H.S. Torr Introduction
More informationModule 7 VIDEO CODING AND MOTION ESTIMATION
Module 7 VIDEO CODING AND MOTION ESTIMATION Lesson 20 Basic Building Blocks & Temporal Redundancy Instructional Objectives At the end of this lesson, the students should be able to: 1. Name at least five
More informationStochastic geometry. automatic object detection and tracking. remotely sensed image sequences
Stochastic geometry for automatic object detection and tracking in remotely sensed image sequences Paula CR https://team.inria.fr/ayin/paula-craciun/ This work has been done in collaboration with dr. Josiane
More informationRobotics Programming Laboratory
Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car
More informationMotion Tracking and Event Understanding in Video Sequences
Motion Tracking and Event Understanding in Video Sequences Isaac Cohen Elaine Kang, Jinman Kang Institute for Robotics and Intelligent Systems University of Southern California Los Angeles, CA Objectives!
More informationObject detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation
Object detection using Region Proposals (RCNN) Ernest Cheung COMP790-125 Presentation 1 2 Problem to solve Object detection Input: Image Output: Bounding box of the object 3 Object detection using CNN
More informationLearning Articulated Skeletons From Motion
Learning Articulated Skeletons From Motion Danny Tarlow University of Toronto, Machine Learning with David Ross and Richard Zemel (and Brendan Frey) August 6, 2007 Point Light Displays It's easy for humans
More informationIntroduction to Trajectory Clustering. By YONGLI ZHANG
Introduction to Trajectory Clustering By YONGLI ZHANG Outline 1. Problem Definition 2. Clustering Methods for Trajectory data 3. Model-based Trajectory Clustering 4. Applications 5. Conclusions 1 Problem
More informationOutline GF-RNN ReNet. Outline
Outline Gated Feedback Recurrent Neural Networks. arxiv1502. Introduction: RNN & Gated RNN Gated Feedback Recurrent Neural Networks (GF-RNN) Experiments: Character-level Language Modeling & Python Program
More informationEvaluation of Moving Object Tracking Techniques for Video Surveillance Applications
International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Evaluation
More informationAdaptive Action Detection
Adaptive Action Detection Illinois Vision Workshop Dec. 1, 2009 Liangliang Cao Dept. ECE, UIUC Zicheng Liu Microsoft Research Thomas Huang Dept. ECE, UIUC Motivation Action recognition is important in
More informationDeep Generative Models Variational Autoencoders
Deep Generative Models Variational Autoencoders Sudeshna Sarkar 5 April 2017 Generative Nets Generative models that represent probability distributions over multiple variables in some way. Directed Generative
More informationAction recognition in robot-assisted minimally invasive surgery
Action recognition in robot-assisted minimally invasive surgery Candidate: Laura Erica Pescatori Co-Tutor: Hirenkumar Chandrakant Nakawala Tutor: Elena De Momi 1 Project Objective da Vinci Robot: Console
More informationResearch on Recognition and Classification of Moving Objects in Mixed Traffic Based on Video Detection
Hu, Qu, Li and Wang 1 Research on Recognition and Classification of Moving Objects in Mixed Traffic Based on Video Detection Hongyu Hu (corresponding author) College of Transportation, Jilin University,
More informationLocally Weighted Learning
Locally Weighted Learning Peter Englert Department of Computer Science TU Darmstadt englert.peter@gmx.de Abstract Locally Weighted Learning is a class of function approximation techniques, where a prediction
More informationCrowd Event Recognition Using HOG Tracker
Crowd Event Recognition Using HOG Tracker Carolina Gárate Piotr Bilinski Francois Bremond Pulsar Pulsar Pulsar INRIA INRIA INRIA Sophia Antipolis, France Sophia Antipolis, France Sophia Antipolis, France
More informationCSC 578 Neural Networks and Deep Learning
CSC 578 Neural Networks and Deep Learning Fall 2018/19 7. Recurrent Neural Networks (Some figures adapted from NNDL book) 1 Recurrent Neural Networks 1. Recurrent Neural Networks (RNNs) 2. RNN Training
More informationText Recognition in Videos using a Recurrent Connectionist Approach
Author manuscript, published in "ICANN - 22th International Conference on Artificial Neural Networks, Lausanne : Switzerland (2012)" DOI : 10.1007/978-3-642-33266-1_22 Text Recognition in Videos using
More informationClinical Named Entity Recognition Method Based on CRF
Clinical Named Entity Recognition Method Based on CRF Yanxu Chen 1, Gang Zhang 1, Haizhou Fang 1, Bin He, and Yi Guan Research Center of Language Technology Harbin Institute of Technology, Harbin, China
More informationSeq2SQL: Generating Structured Queries from Natural Language Using Reinforcement Learning
Seq2SQL: Generating Structured Queries from Natural Language Using Reinforcement Learning V. Zhong, C. Xiong, R. Socher Salesforce Research arxiv: 1709.00103 Reviewed by : Bill Zhang University of Virginia
More informationIntroduction to behavior-recognition and object tracking
Introduction to behavior-recognition and object tracking Xuan Mo ipal Group Meeting April 22, 2011 Outline Motivation of Behavior-recognition Four general groups of behaviors Core technologies Future direction
More informationLecture 28 Intro to Tracking
Lecture 28 Intro to Tracking Some overlap with T&V Section 8.4.2 and Appendix A.8 Recall: Blob Merge/Split merge occlusion occlusion split When two objects pass close to each other, they are detected as
More informationRecall: Blob Merge/Split Lecture 28
Recall: Blob Merge/Split Lecture 28 merge occlusion Intro to Tracking Some overlap with T&V Section 8.4.2 and Appendix A.8 occlusion split When two objects pass close to each other, they are detected as
More information27: Hybrid Graphical Models and Neural Networks
10-708: Probabilistic Graphical Models 10-708 Spring 2016 27: Hybrid Graphical Models and Neural Networks Lecturer: Matt Gormley Scribes: Jakob Bauer Otilia Stretcu Rohan Varma 1 Motivation We first look
More informationAkarsh Pokkunuru EECS Department Contractive Auto-Encoders: Explicit Invariance During Feature Extraction
Akarsh Pokkunuru EECS Department 03-16-2017 Contractive Auto-Encoders: Explicit Invariance During Feature Extraction 1 AGENDA Introduction to Auto-encoders Types of Auto-encoders Analysis of different
More informationDeep Learning based Authorship Identification
Deep Learning based Authorship Identification Chen Qian Tianchang He Rao Zhang Department of Electrical Engineering Stanford University, Stanford, CA 94305 cqian23@stanford.edu th7@stanford.edu zhangrao@stanford.edu
More informationWhere-and-When to Look: Deep Siamese Attention Networks for Video-based Person Re-identification
1 Where-and-When to Look: Deep Siamese Attention Networks for Video-based Person Re-identification Lin Wu, Yang Wang, Junbin Gao, Xue Li arxiv:1808.01911v2 [cs.cv] 14 Oct 2018 Abstract Video-based person
More informationLecture 21 : A Hybrid: Deep Learning and Graphical Models
10-708: Probabilistic Graphical Models, Spring 2018 Lecture 21 : A Hybrid: Deep Learning and Graphical Models Lecturer: Kayhan Batmanghelich Scribes: Paul Liang, Anirudha Rayasam 1 Introduction and Motivation
More informationPeople Detection and Tracking with World-Z Map from a Single Stereo Camera
Author manuscript, published in "Dans The Eighth International Workshop on Visual Surveillance - VS2008 (2008)" People Detection and Tracking with World-Z Map from a Single Stereo Camera Sofiane Yous Trinity
More informationIntroduction to SLAM Part II. Paul Robertson
Introduction to SLAM Part II Paul Robertson Localization Review Tracking, Global Localization, Kidnapping Problem. Kalman Filter Quadratic Linear (unless EKF) SLAM Loop closing Scaling: Partition space
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationNetwork embedding. Cheng Zheng
Network embedding Cheng Zheng Outline Problem definition Factorization based algorithms --- Laplacian Eigenmaps(NIPS, 2001) Random walk based algorithms ---DeepWalk(KDD, 2014), node2vec(kdd, 2016) Deep
More informationDeep Learning with Tensorflow AlexNet
Machine Learning and Computer Vision Group Deep Learning with Tensorflow http://cvml.ist.ac.at/courses/dlwt_w17/ AlexNet Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton, "Imagenet classification
More informationA Feature Point Matching Based Approach for Video Objects Segmentation
A Feature Point Matching Based Approach for Video Objects Segmentation Yan Zhang, Zhong Zhou, Wei Wu State Key Laboratory of Virtual Reality Technology and Systems, Beijing, P.R. China School of Computer
More informationA Real-Time Street Actions Detection
A Real-Time Street Actions Detection Salah Alghyaline Department of Computer Science The World Islamic Sciences and Education University, Amman, Jordan Abstract Human action detection in real time is one
More informationDay 3 Lecture 1. Unsupervised Learning
Day 3 Lecture 1 Unsupervised Learning Semi-supervised and transfer learning Myth: you can t do deep learning unless you have a million labelled examples for your problem. Reality You can learn useful representations
More informationCOMP 551 Applied Machine Learning Lecture 16: Deep Learning
COMP 551 Applied Machine Learning Lecture 16: Deep Learning Instructor: Ryan Lowe (ryan.lowe@cs.mcgill.ca) Slides mostly by: Class web page: www.cs.mcgill.ca/~hvanho2/comp551 Unless otherwise noted, all
More informationList of Accepted Papers for ICVGIP 2018
List of Accepted Papers for ICVGIP 2018 Paper ID ACM Article Title 3 1 PredGAN - A deep multi-scale video prediction framework for anomaly detection in videos 7 2 Handwritten Essay Grading on Mobiles using
More informationAutomatic visual recognition for metro surveillance
Automatic visual recognition for metro surveillance F. Cupillard, M. Thonnat, F. Brémond Orion Research Group, INRIA, Sophia Antipolis, France Abstract We propose in this paper an approach for recognizing
More informationA Heat-Map-based Algorithm for Recognizing Group Activities in Videos
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1 A Heat-Map-based Algorithm for Recognizing Group Activities in Videos Weiyao Lin, Hang Chu, Jianxin Wu, Bin Sheng, and Zhenzhong Chen Abstract
More information(LSS Erlangen, Simon Bogner, Ulrich Rüde, Thomas Pohl, Nils Thürey in collaboration with many more
Parallel Free-Surface Extension of the Lattice-Boltzmann Method A Lattice-Boltzmann Approach for Simulation of Two-Phase Flows Stefan Donath (LSS Erlangen, stefan.donath@informatik.uni-erlangen.de) Simon
More informationVideo Texture. A.A. Efros
Video Texture A.A. Efros 15-463: Computational Photography Alexei Efros, CMU, Fall 2005 Weather Forecasting for Dummies Let s predict weather: Given today s weather only, we want to know tomorrow s Suppose
More informationSemantic-Based Surveillance Video Retrieval
Semantic-Based Surveillance Video Retrieval Weiming Hu, Dan Xie, Zhouyu Fu, Wenrong Zeng, and Steve Maybank, Senior Member, IEEE IEEE Transactions on Image Processing, Vol. 16, No. 4, April 2007 Present
More informationNatural Language Processing with Deep Learning CS224N/Ling284
Natural Language Processing with Deep Learning CS224N/Ling284 Lecture 8: Recurrent Neural Networks Christopher Manning and Richard Socher Organization Extra project office hour today after lecture Overview
More informationBilevel Sparse Coding
Adobe Research 345 Park Ave, San Jose, CA Mar 15, 2013 Outline 1 2 The learning model The learning algorithm 3 4 Sparse Modeling Many types of sensory data, e.g., images and audio, are in high-dimensional
More information