Human Violence Recognition and Detection in Surveillance Videos

Size: px
Start display at page:

Download "Human Violence Recognition and Detection in Surveillance Videos"

Transcription

1 Human Volence Recognton and Detecton n Survellance Vdeos Potr Blnsk and Francos Bremond INRIA Sopha Antpols, STARS team 2004 Route des Lucoles, BP93, Sopha Antpols, France {Potr.Blnsk,Francos.Bremond}@nra.fr Abstract In ths paper, we focus on the mportant topc of volence recognton and detecton n survellance vdeos. Our goal s to determne f a volence occurs n a vdeo (recognton) and when t happens (detecton). Frstly, we propose an extenson of the Improved Fsher Vectors (IFV) for vdeos, whch allows to represent a vdeo usng both local features and ther spato-temporal postons. Then, we study the popular sldng wndow approach for volence detecton, and we re-formulate the Improved Fsher Vectors and use the summed area table data structure to speed up the approach. We present an extensve evaluaton, comparson and analyss of the proposed mprovements on 4 state-ofthe-art datasets. We show that the proposed mprovements make the volence recognton more accurate (as compared to the standard IFV, IFV wth spato-temporal grd, and other state-of-the-art methods) and make the volence detecton sgnfcantly faster. 1. Introducton Vdeo survellance cameras are part of our lves. They are used almost everywhere, e.g. at streets, subways, tran and bus statons, arports, and sport stadums. Today s ncrease n threats to securty n ctes and towns around the world makes the use of vdeo cameras to montor people necessary. The attacks on humans, fghts, and vandalsm are just some cases where detecton, partcularly volence detecton, systems are needed. In ths paper, we focus on the mportant topc of volence recognton and detecton n survellance vdeos. Our goal s to determne f a volence occurs n a vdeo (recognton) and when t happens (detecton). Over the last years, several volence recognton and detecton technques have been proposed. [6] have used moton trajectory nformaton and orentaton nformaton of person s lmbs for person-on-person fght detecton. One of the man drawbacks of ths approach s that t requres precse segmentaton, whch s very dffcult to obtan n real world vdeos. Instead, [7, 9, 21, 23] have focused on local features and the bag-of-features approach; the man dfference between these technques les n the type of features used. [23] have appled the STIP and SIFT, and [7] have used the STIP and MoSIFT features. [9] have proposed the VIolence Flows descrptor, encodng how flow-vector magntudes change over tme. [21] have proposed a vdeo descrptor based on substantal dervatve. Despte recent mprovements n volence recognton and detecton, effectve solutons for real-world stuatons are stll unavalable. The Improved Fsher Vectors (IFV) [24] s a bag-offeatures-lke vdeo encodng strategy whch has shown to outperform the standard bag-of-features. It s a vdeo (and mage) descrptor obtaned by poolng local features nto a global representaton. It descrbes local features by ther devaton from the unversal generatve Gaussan Mxture Model. The IFV has been wdely appled for recognton tasks n vdeos [1, 2, 11, 27, 28]. One of the man drawbacks of the IFV s that t smplfes the structure of a vdeo assumng condtonal ndependence across spatal and temporal domans; t computes global statstcs of local features only, gnorng spato-temporal postons of features. Clearly, spatal nformaton may contan useful nformaton. A common way to use spato-temporal nformaton wth IFV s to use ether spato-temporal grds [16] or mult-scale pyramds [17]; however, these methods are stll lmted n terms of a detaled descrpton provdng only a coarse representaton. There are several other state-of-theart methods [13, 14, 19, 25], but as they were proposed for mages (for mage categorzaton and object recognton), they cannot be drectly appled for vdeos; moreover, [13, 14] acheve smlar results as compared to the spatal grds/pyramds, and [19] s parameter senstve and requres addtonal parameter learnng. As opposed to the exstng volence recognton and detecton methods (whch focus manly on new descrptors), we focus on a vdeo representaton model due to two reasons: to make t more accurate for volence recognton, and to make t faster for volence detecton. Frstly, we propose an extenson of the IFV for vdeos (Sec. 2.2), /16/$ IEEE IEEE AVSS 2016, August 2016, Colorado Sprngs, CO, USA

2 whch allows to use spato-temporal postons of features wth the IFV. The proposed extenson boosts the IFV and acheves better or smlar accuracy (keepng the representaton more compact) as compared to the spato-temporal grds. Then, we study and evaluate the popular sldng wndow approach [12] for volence detecton. We re-formulate the IFV and use the summed area table data structure to speed up the sldng wndow method (Sec. 2.3). Then, we present an extensve evaluaton, comparson and analyss of the proposed mprovements on 4 state-of-the-art datasets (Sec. 3 and Sec. 4). Fnally, we conclude n Sec. 5. Abnormal behavor detecton: There are several methods for abnormal behavor and anomaly detecton [4, 18, 20, 22]. However, abnormaltes do not represent a compact and well defned concept. Abnormalty detecton s a dfferent research topc, wth dfferent constrants and assumptons, and therefore we do not focus on these technques. 2. Boostng the Improved Fsher Vectors (IFV) 2.1. State-of-The-Art: Improved Fsher Vectors Ths secton provdes a bref descrpton of the Improved Fsher Vectors, ntroduced n [24]. The mathematcal notatons and formulas provded here are n accordance wth [24], and we refer to t for more detals. Let = {x t, t = 1... T } be a set of T local features extracted from a vdeo, where each local feature s of dmenson D, x t R D. Let λ = {w, µ, Σ, = 1... K} be parameters of a Gaussan Mxture Model (GMM): u λ (x) = K =1 w u (x) fttng the dstrbuton of local features, where w R, µ R D and Σ R D D are respectvely the mxture weght, mean vector and covarance matrx of the -th Gaussan u. We assume that the covarance matrces are dagonal and we denote by σ 2 the varance vector,.e. Σ = dag(σ 2 ), σ2 RD. Moreover, let γ t () be the soft assgnment of a descrptor x t to a Gaussan : γ t () = w u (x t ) K w ju j (x t ), (1) and let G µ, (resp. G σ,) be the gradent w.r.t. the mean µ (resp. standard devaton σ ) of a Gaussan : G µ, = 1 T w G 1 σ, = T 2w ( ) xt µ γ t (), (2) σ [ (xt µ ) 2 ] γ t () σ 2 1, (3) where the dvson between vectors s as a term-by-term operaton. Then, the gradent vector G λ s the concatenaton of all the K gradent vectors G µ, R D and all the K gradent vectors G σ, R D, = 1... K: G λ = [ G µ,1, G σ,1,..., G µ,k, G σ,k ]. (4) The IFV (Improved Fsher Vectors) representaton Φ λ, Φ λ R 2DK, s the gradent vector G λ normalzed by the power normalzaton and then the L2 norm: Φ λ = [ G µ,1, G σ,1,..., G µ,k, G σ,k ] +l2. (5) 2.2. Boostng the IFV wth spato-temporal nf. The Improved Fsher Vectors encodng smplfes the structure of a vdeo assumng condtonal ndependence across spatal and temporal domans (see Sec. 2.1). It computes global statstcs of local features only, gnorng spatotemporal postons of features. Thus, we propose an extenson of the Improved Fsher Vectors whch ncorporates spato-temporal postons of features nto the vdeo model. Frstly, we represent postons of local features n a vdeo normalzed manner. In ths paper, we focus on local trajectores only; however, the followng representaton can also be appled to spato-temporal nterest ponts [8, 15, 29] (wth assumptons: p t = (a t,1, b t,1, c t,1 ) s the spatotemporal poston of a pont and n t = 1). Let P = {p t, t = 1... T } be a set of T trajectores extracted from a vdeo sequence and p t = ((a t,1, b t,1, c t,1 ),..., (a t,nt, b t,nt, c t,nt )) s a sample trajectory, where a feature pont detected at a spatal poston (a t,1, b t,1 ) n a frame c t,1 s tracked n n t 1 subsequent frames untl a spatal poston (a t,nt, b t,nt ) n a frame c t,nt. We defne the vdeo normalzed poston ˆp t of a center of a trajectory p t as: ˆp t = [ 1 v w n t n t =1 1 n t 1 n t a t,, b t,, v h n t v l n t =1 =1 c t, ], (6) where v w s the vdeo wdth (wth the unts n pxels), v h s the vdeo heght (n pxels), and v l s the vdeo length (number of frames). We normalze the poston of a center of a trajectory, so that the vdeo sze does not sgnfcantly change the magntude of the feature poston vector. Once postons of local features are represented n a vdeo normalzed manner, we also consder usng the unty based normalzaton to reduce the nfluence of motonless regons at the boundares of a vdeo, so that the large motonless regons do not sgnfcantly change the magntude of the feature poston vector. Let ˆp t, be the -th dmenson of the vector ˆp t and mn(ˆp :, ) (resp. max(ˆp :, )) be the mnmum (resp. maxmum) value of the -th dmenson among all the vdeo normalzed poston vectors extracted from the tranng vdeos. When the condton : mn(ˆp :, ) max(ˆp :, ) s true, we can apply the unty based

3 normalzaton to calculate the vector p t. The -th dmenson of the vector p t s: p t, = ˆp t, mn(ˆp :, ) max(ˆp :, ) mn(ˆp :, ). (7) Then, we ncorporate the normalzed postons of local features nto the Improved Fsher Vectors model, so that vdeos are represented usng both local descrptors and ther spato-temporal postons. Let Y = {y t = [ p t, x t ], t = 1... T } be a set of local features, where x t R D s a local feature descrptor and p t R E s ts correspondng normalzed poston, typcally E = 3, calculated as above. Let λ = { w, µ, Σ, = 1... K} be parameters of a GMM u λ(y) = K =1 w u (y) fttng the dstrbuton of local features, where w R, µ R D+E and Σ R (D+E) (D+E) are respectvely the mxture weght, mean vector and covarance matrx of the -th Gaussan. As before, we assume that the covarance matrces are dagonal and we denote by σ 2 the varance vector,.e. Σ = dag( σ 2 ), σ2 R D+E. We calculate G Ỹ µ, (Eq. 2) and G Ỹ σ, (Eq. 3) for all K Gaussan components, and concatenate all the gradent vectors nto a vector G Ỹ λ. Fnally, the new Improved Fsher Vectors representaton s the gradent vector G Ỹ normalzed by the power normalzaton and then the L2 λ norm: Φ Ỹ λ = [ GỸ µ,1, G Ỹ σ,1,..., G Ỹ µ,k, G Ỹ σ,k ] +l2. (8) 2.3. Fast IFV-based Sldng Wndow Our goal s to determne f a volence occurs n a vdeo and when t happens; therefore, we search for a range of frames whch contans volence. We base our approach on the temporal sldng wndow [12] whch evaluates vdeo sub-sequences at varyng locatons and scales. Let v l be a vdeo length (n frames), s > 0 be the wndow step sze (n frames), and w = {s} =1...m be temporal wndow szes (scales) for the sldng wndow algorthm. Moreover, let v = ns be an approxmated vdeo length (where: ns v l > (n 1)s and n m 1). Vsualzaton of a sample vdeo and sample sldng wndows s presented n Fg. 1. Note the IFV are calculated for features from the same temporal segments multple tmes,.e. m(n m + 1) tmes for m segments (e.g. Fg. 1: 20 tmes for 8 segments). Therefore, to speed up the detecton framework, we re-formulate the IFV and use the summed area table data structure, so that the IFV are calculated for features from the temporal segments only ones. Let = {x t, t = 1... T } be a set of T local features extracted from a vdeo. Let = { j, j = 1... N} be a partton of a set nto N subsets j = {x j,k } j k=1 such that: j s the cardnalty of the set j, = N j, tme s 1 s 2 s 3 s 4 s 5 s 6 s 7 s = 20 Fgure 1. Temporal sldng wndow: a sample vdeo s dvded nto n 8 segments. We use m = 4 wndow scales. Note that the IFV are calculated for features from the same segments multple tmes (20 tmes for 8 segments). N j k, j,k=1 j k = and φ(j, k) t s the mappng functon such that x j,k = x t. We re-wrte Eq. (2): G µ, = 1 T w = 1 T w = 1 T ( ) xt µ γ t () H j µ, = Gj µ, j j w = 1 σ j ( ) xφ(j,k) µ k=1 G j µ, j = 1 T k=1 Smlarly, we re-wrte Eq. (3): G 1 σ, = T 2w 1 = T 2w = 1 T H j σ, = Gj σ, j = 1 j 2w H j µ,, σ ( xφ(j,k) µ σ ). (9) [ (xt µ ) 2 ] γ t () σ 2 1 j [ (xφ(j,k) µ ) 2 ] σ 2 1 k=1 G j σ, j = 1 T k=1 H j σ,, [ (xφ(j,k) µ ) 2 ] σ 2 1. (10) Then, let s defne the gradent vector H j λ as a concatenaton of all the K gradent vectors H j µ, and all the K grad-

4 ent vectors Hσ,j, = 1... K: 0 j j Hλ j = [ Hµ,1j, Hσ,1j,..., Hµ,K, Hσ,K ]. (11) The Improved Fsher Vectors representaton Φ of local λ SN features = j=m j, where 1 < M N, can be calculated usng: SN SN Gλ j=m j j SM 1 j H Hλ, = PNλ PM 1 j j (12) and applyng the power normalzaton and then the L2 norm to the obtaned gradent vector. The obtaned representaton s exactly the same as f we use Eq. (2)-(5). However, n contrast to the orgnal IFV, the above equatons can be drectly used wth data structures such as summed area table (Integral Images) and KDD-trees. For the task of volence localzaton, we use the above formulaton of the IFV (Eq. (9)-(12)), and drectly apply the summed area table (Integral Images [26]). The 2 man advantages of ths soluton are: (1) t allows to speed up the calculatons, as every feature s assgned to each Gaussan exactly once; e.g. we detected 25k features n a 84 frames long vdeo. Wth m = 4 and s = 5, every feature was assgned to each Gaussan 4 10 tmes; ths s lke 224k features were assgned to each Gaussan. In our algorthm, each feature s assgned to each Gaussan exactly once. Ths means nearly 9 tmes less calculatons. (2) t allows to reduce the memory usage, especally when a vdeo contans a lot of moton and dense features are extracted [27]; e.g. we extracted 130k features n a 35 seconds long vdeo ( 3.7k features per second on average). Wth Improved Dense Trajectores [27] (each trajectory s represented usng 426 floats), ths means 1.6M floats to store per second (segment), whch s 29 tmes more than the IFV representaton wth 128 Gaussans calculated for ths segment. 3. Expermental Setup: Approach Overvew Frstly, we extract local spato-temporal features n vdeos, and we use the Improved Dense Trajectores (IDT) [27] for that; we apply a dense samplng and track the extracted nterest ponts usng a dense optcal flow feld. Then, we extract local spato-temporal vdeo volumes around the detected trajectores, and we represent each trajectory usng: Hstogram of Orented Gradents (HOG) capturng appearance nformaton, and Trajectory Shape (TS), Hstogram of Optcal Flow (HOF) and Moton Boundary Hstogram (wth MBH-x and MBH-y components) descrptors capturng moton nformaton. The extracted IDT features provde a good coverage of a vdeo and ensure extracton of meanngful nformaton. As the results, they have shown to acheve excellent results for varous recognton Fgure 2. Sample vdeo frames from the Volent-Flows (frst row), Hockey Fght (second row), Moves (thrd row), and VolentFlows 21 (fourth row) datasets. tasks n vdeos and they have been wdely used n lterature [1, 27]. To represent a vdeo, we calculate a separate vdeo representaton for each descrptor ndependently (.e. HOG, etc.) usng the IFV / proposed Spato-Temporal IFV (Sec. 2.2), and we concatenate the obtaned representatons usng late fuson (.e. per vdeo: we concatenate the IFVbased vdeo representaton from HOG wth vdeo representaton from HOF, etc.). For volence recognton, we use lnear Support Vector Machnes (SVMs) [3] classfer, whch has shown to acheve excellent results wth hgh-dmensonal data such as Fsher Vectors; as typcally f the number of features s large, there s no need to map data to a hgher dmensonal space [10]. Moreover, lnear SVMs have shown to be effcent both n tranng and predcton steps. For volence detecton, we use the Fast Sldng Wndowbased framework, explaned n Sec Experments 4.1. Datasets We use 4 benchmark datasets for evaluaton and we follow the recommended evaluaton protocols provded by the authors of the datasets. We use Volent-Flows dataset [9], Hockey Fght dataset [23] and Moves dataset [23] for volence recognton task. We use Volence-Flows 21 dataset [9] for volence detecton task. Sample vdeo frames from the datasets are presented n Fg. 2. The Volent-Flows (Crowd Volence \ Non-volence) dataset [9] contans 246 vdeos wth real-world footage of crowd volence. Vdeos are collected from YouTube and

5 contan a varety of scenes, e.g. streets, football stadums, volleyball and ce hockey arenas, and schools. The dataset s dvded nto 5 folds and we follow the recommended 5- folds cross-valdaton to report the performance. The Hockey Fght dataset [23] contans 1000 real-world vdeos: 500 volent scenes (between two and more partcpants) and 500 non-volent scenes. Vdeos are dvded nto 5 folds, where each fold contans 50% volent and 50% nonvolent vdeos, and we follow the recommended 5-folds cross-valdaton to report the performance. The Moves dataset [23] contans 200 vdeo clps: 100 vdeos wth a person-on-person fght (collected from acton moves) and 100 vdeos wth non-fght scenaros (collected from varous acton recognton datasets). Ths dataset contans a wder varety of scenes than the Hockey Fght dataset, and scenes are captured at dfferent resolutons [23]. Vdeos are dvded nto 5 folds and we follow the recommended 5-folds cross-valdaton to report the performance. Although ths dataset does not contan survellance vdeos, t has been wdely used n the past for volence recognton task. The man dfferences between above datasets are: varous scenaros and scenes, volence/fght and nonvolence/non-fght classes varatons, number of tranng and testng samples, pose and camera vew pont varatons, moton blur, background clutter, occlusons, and llumnaton condtons. The Volent-Flows 21 dataset (Crowd Volence \ Nonvolence 21 Database) [9] contans 21 vdeos wth realworld vdeo footage of crowd volence. Vdeos are collected from YouTube, they are of spatal sze pxels, and they begn wth non-volent behavor, whch turns to volent md-way through the vdeo. The tranng s performed usng 227 out of 246 vdeos from the Volent-Flows dataset; 19 vdeos are removed as they are ncluded n the detecton set. The orgnal annotatons are not avalable. Therefore, as proposed n the orgnal paper [9], we manually mark the frame n each vdeo where the transton happens from non-volent to volent behavor Implementaton Detals We use the GMM wth K = 128 and K = 256 to compute the IFV / Spato-Temporal IFV, and we set the number of Gaussans usng 5-folds cross-valdaton. To ncrease clusterng precson, we ntalze the GMM 10 tmes and we keep the codebook wth the lowest error. To lmt the complexty, we cluster a subset of 100, 000 randomly selected tranng features. To report recognton results, we use the Mean Class Accuracy (MCA) metrc. For volence detecton, we use sx temporal wndows of length {5} 6 =1 frames and the wndow strde equal to 1 frame. To report 1 Dfferences can exst between our and [9] annotatons. Approach Sze Volent-F. Hockey F. Moves Baselne Ours: STIFV IFV 1x1x IFV 1x2x IFV 2x1x IFV 1x1x IFV 1x3x IFV 3x1x IFV 2x2x IFV 2x2x IFV 2x2x IFV 2x1x IFV 1x2x Table 1. Evaluaton results: the baselne (IFV wth 1x1x1) approach, our IFV wth spato-temporal nformaton (STIFV), and the IFV wth varous spato-temporal grds on the Volent-Flows, Hockey Fght, and Moves datasets. Second column presents the sze of the vdeo representaton relatvely to the sze of the vdeo representaton of the baselne approach. detecton results, we use the Recever Operatng Characterstc (ROC) curve and the Area Under Curve (AUC) metrcs Results: Volence Recognton For volence recognton, we evaluate the standard IFV approach (baselne approach) and our IFV wth spatotemporal nformaton (STIFV, Sec. 2.2). Moreover, we evaluate the IFV wth 11 varous spato-temporal grds (1x1x2, 1x2x1, 2x1x1, 1x1x3, 1x3x1, 3x1x1, 2x2x2, 2x2x3, 2x2x1, 2x1x2, and 1x2x2). The evaluatons are performed on 3 datasets: Volent-Flows, Hockey Fght and Moves datasets. The results are presented n Table 1. In all cases, our STIFV approach outperforms the IFV method, and acheves better or smlar performance as compared to the IFV wth spato-temporal grd. Note that fndng an approprate sze of the spato-temporal grd s tme consumng (there are 3 addtonal parameters to learn). Moreover, a spato-temporal grd-based representaton requres sgnfcantly more amount of memory (up to 12 tmes n our experments, see Table 1). Then, we compare our approach wth the state-of-theart. The comparson on the Volent-Flows, Hockey Fght, and Moves datasets s presented n Table 2. Note that our approach sgnfcantly outperforms remanng technques, achevng even up to 11% better results (on the Volent- Flows dataset). In summary, for volence recognton, the proposed mprovement (IFV wth spato-temporal nformaton) boosts the state-of-the-art IFV, and acheves better or smlar ac-

6 Volent-Flows Dataset Hockey Fght Dataset Moves Dataset Approach Acc. (%) Approach Acc. (%) HNF [16] 56.5 Approach Acc. (%) STIP-HOG + HIK [23] 49 HOG [16] 57.4 LTP [30] 71.9 STIP-HOF + HIK [23] 59 HOF [16] 58.3 VF [9] 82.9 BoW-MoSIFT [5] 86.5 LTP [30] 71.5 STIP-HOF + HIK [23] 88.6 MoSIFT + HIK [23] 89.5 Jerk [6] 74.2 Extreme Acceleratons [5] 90.1 VF [9] 91.3 Interacton Force [20] 74.5 MoSIFT + HIK [23] 90.9 Jerk [6] 95.0 VF [9] 81.3 BoW-MoSIFT [5] 91.2 Interacton Force [20] 95.5 HOT [22] 82.3 STIP-HOG + HIK [23] 91.7 F L F Cv [21] 96.9 F L F Cv [21] 85.4 Our Approach 93.7 Extreme Acceleratons [5] 98.9 Our Approach 96.4 Our Approach 99.5 Table 2. Comparson wth the state-of-the-art on the Volent-Flows (left table), Hockey Fght (mddle), and Moves (rght) datasets. same results). The results and comparson wth the state-ofthe-art are presented n Fgure 3 (usng the ROC curves) and n Table 3 (usng the AUC metrc). Then, we evaluate the speed of the Improved Dense Trajectores (IDT), and we compare the speed of the standard sldng wndow approach wth the speed of our Fast Sldng Wndow technque (Sec. 2.3). The results are presented n Table 4. We observe that the proposed Fast Sldng Wndow technque s more than 10 tmes faster than the standard sldng wndow approach. Fgure 3. ROC curves: our approach (on the left) vs. the state-ofthe-art (on the rght) on the Volent-Flows 21 dataset. Approach LTP HOG HOF HNF VIF Ours AUC Table 3. AUC metrc on the Volent-Flows 21 dataset [9]. Process Processng Tme (fps) Feature Extracton (IDT) 5.7 Sldng Wndow 9.28 Ours: Fast Sldng Wndow Table 4. Average processng tme on the Volent-Flows 21 dataset usng a sngle Intel(R) eon(r) CPU E GHz. curacy (keepng the representaton more compact) as compared to the IFV wth spato-temporal grds. Moreover, our approach sgnfcantly outperforms the exstng technques on all three volence recognton datasets Results: Volence Detecton We evaluate our Fast Sldng Wndow-based approach on the Volence-Flows 21 dataset. Frstly, we evaluate the accuracy of the sldng wndow / Fast Sldng Wndow approach (both technques acheve the 5. Conclusons We have proposed an extenson of the Improved Fsher Vectors (IFV) for volence recognton n vdeos, whch allows to represent a vdeo usng both local features and ther spato-temporal postons. The proposed extenson has shown to boost the IFV achevng better or smlar accuracy (and keepng the representaton more compact) as compared to the IFV wth spato-temporal grd. Moreover, our approach has shown to sgnfcantly outperform the exstng technques on three volence recognton datasets. Then, we have studed the popular sldng wndow approach for volence detecton. We have re-formulated the IFV and have used the summed area table data structure to sgnfcantly speed up the volence detecton framework. The evaluatons have been performed on 4 state-of-the-art datasets. Acknowledgements. The research leadng to these results has receved fundng from the People Programme (Mare Cure Actons) of the European Unon s Seventh Framework Programme FP7/ / under REA grant agreement n o [324359]. However, the vews and opnons expressed heren do not necessarly reflect those of the fnancng nsttutons.

7 References [1] P. Blnsk and F. Bremond. Vdeo Covarance Matrx Logarthm for Human Acton Recognton n Vdeos. In Internatonal Jont Conference on Artfcal Intellgence (IJCAI), [2] P. Blnsk, M. Kopersk, S. Bak, and F. Bremond. Representng Vsual Appearance by Vdeo Brownan Covarance Descrptor for Human Acton Recognton. In IEEE Internatonal Conference on Advanced Vdeo and Sgnal-Based Survellance (AVSS), [3] C.-C. Chang and C.-J. Ln. LIBSVM: A Lbrary for Support Vector Machnes. ACM Transactons on Intellgent Systems and Technology (TIST), 2(3):27, [4]. Cu, Q. Lu, M. Gao, and D. N. Metaxas. Abnormal Detecton Usng Interacton Energy Potentals. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR), [5] O. Danz, I. Serrano, G. Bueno, and T.-K. Km. Fast Volence Detecton n Vdeo. In Internatonal Conference on Computer Vson Theory and Applcatons (VISAPP), [6] A. Datta, M. Shah, and N. D. V. Lobo. Person-on-Person Volence Detecton n Vdeo Data. In Internatonal Conference on Pattern Recognton (ICPR), [7] F. D. de Souza, G. C. Chavez, E. A. do Valle, and A. de A Araujo. Volence Detecton n Vdeo Usng Spato- Temporal Features. In SIBGRAPI Conference on Graphcs, Patterns and Images, [8] P. Dollar, V. Rabaud, G. Cottrell, and S. Belonge. Behavor Recognton va Sparse Spato-Temporal Features. In IEEE Internatonal Workshop on Vsual Survellance and Performance Evaluaton of Trackng and Survellance, [9] T. Hassner, Y. Itcher, and O. Klper-Gross. Volent Flows: Real-Tme Detecton of Volent Crowd Behavor. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR) Workshops, [10] C.-W. Hsu, C.-C. Chang, and C.-J. Ln. A Practcal Gude to Support Vector Classfcaton. Techncal report, Department of Computer Scence, Natonal Tawan Unversty, [11] V. Kantorov and I. Laptev. Effcent feature extracton, encodng and classfcaton for acton recognton. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR), [12] A. Klaser, M. Marszalek, C. Schmd, and A. Zsserman. Human Focused Acton Localzaton n Vdeo. In Trends and Topcs n Computer Vson, pages Sprnger, [13] J. Krapac, J. Verbeek, and F. Jure. Modelng Spatal Layout wth Fsher Vectors for Image Categorzaton. In IEEE Internatonal Conference on Computer Vson (ICCV), [14] J. Krapac, J. Verbeek, and F. Jure. Spatal Fsher Vectors for Image Categorzaton. Research Report RR-7680, INRIA, [15] I. Laptev. On Space-Tme Interest Ponts. Internatonal Journal of Computer Vson (IJCV), 64(2-3): , [16] I. Laptev, M. Marszalek, C. Schmd, and B. Rozenfeld. Learnng realstc human actons from moves. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR), [17] S. Lazebnk, C. Schmd, and J. Ponce. Beyond Bags of Features: Spatal Pyramd Matchng for Recognzng Natural Scene Categores. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR), [18] V. Mahadevan, W. L, V. Bhaloda, and N. Vasconcelos. Anomaly Detecton n Crowded Scenes. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR), [19] S. McCann and D. G. Lowe. Spatally Local Codng for Object Recognton. In Asan Conference on Computer Vson (ACCV), [20] R. Mehran, A. Oyama, and M. Shah. Abnormal Crowd Behavor Detecton usng Socal Force Model. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR), [21] S. Mohammad, H. Kan, A. Perna, and V. Murno. Volence detecton n crowded scenes usng substantal dervatve. In IEEE Internatonal Conference on Advanced Vdeo and Sgnal-Based Survellance (AVSS), [22] H. Mousav, S. Mohammad, A. Perna, R. Chellal, and V. Murno. Analyzng Tracklets for the Detecton of Abnormal Crowd Behavor. In IEEE Wnter Conference on Applcatons of Computer Vson (WACV), [23] E. B. Nevas, O. D. Suarez, G. B. Garca, and R. Sukthankar. Volence Detecton n Vdeo Usng Computer Vson Technques. In Internatonal Conference on Computer Analyss of Images and Patterns (CAIP), [24] F. Perronnn, J. Sanchez, and T. Mensnk. Improvng the Fsher Kernel for Large-Scale Image Classfcaton. In European Conference on Computer Vson (ECCV), [25] J. Sanchez, F. Perronnn, and T. De Campos. Modelng the Spatal Layout of Images Beyond Spatal Pyramds. Pattern Recognton Letters, 33(16): , [26] O. Tuzel, F. Porkl, and P. Meer. Regon Covarance: A Fast Descrptor for Detecton And Classfcaton. In European Conference on Computer Vson (ECCV), [27] H. Wang and C. Schmd. Acton Recognton wth Improved Trajectores. In IEEE Internatonal Conference on Computer Vson (ICCV), [28] L. Wang, Y. Qao, and. Tang. Acton Recognton wth Trajectory-Pooled Deep-Convolutonal Descrptors. In IEEE Conference on Computer Vson and Pattern Recognton (CVPR), [29] G. Wllems, T. Tuytelaars, and L. Van Gool. An Effcent Dense and Scale-Invarant Spato-Temporal Interest Pont Detector. In European Conference on Computer Vson (ECCV), [30] L. Yeffet and L. Wolf. Local Trnary Patterns for human acton recognton. In IEEE Internatonal Conference on Computer Vson (ICCV), 2009.

Motion Boundary Trajectory for Human Action Recognition

Motion Boundary Trajectory for Human Action Recognition Moton Boundary Trajectory for Human Acton Recognton So-Long Lo and Ah-Chung Tso Faculty of Informaton Technology, Macau Unversty of Scence and Technology Abstract. In ths paper, we propose a novel approach

More information

Large-scale Web Video Event Classification by use of Fisher Vectors

Large-scale Web Video Event Classification by use of Fisher Vectors Large-scale Web Vdeo Event Classfcaton by use of Fsher Vectors Chen Sun and Ram Nevata Unversty of Southern Calforna, Insttute for Robotcs and Intellgent Systems Los Angeles, CA 90089, USA {chensun nevata}@usc.org

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Scale Selective Extended Local Binary Pattern For Texture Classification

Scale Selective Extended Local Binary Pattern For Texture Classification Scale Selectve Extended Local Bnary Pattern For Texture Classfcaton Yutng Hu, Zhlng Long, and Ghassan AlRegb Multmeda & Sensors Lab (MSL) Georga Insttute of Technology 03/09/017 Outlne Texture Representaton

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Histogram of Template for Pedestrian Detection

Histogram of Template for Pedestrian Detection PAPER IEICE TRANS. FUNDAMENTALS/COMMUN./ELECTRON./INF. & SYST., VOL. E85-A/B/C/D, No. xx JANUARY 20xx Hstogram of Template for Pedestran Detecton Shaopeng Tang, Non Member, Satosh Goto Fellow Summary In

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Audio Event Detection and classification using extended R-FCN Approach. Kaiwu Wang, Liping Yang, Bin Yang

Audio Event Detection and classification using extended R-FCN Approach. Kaiwu Wang, Liping Yang, Bin Yang Audo Event Detecton and classfcaton usng extended R-FCN Approach Kawu Wang, Lpng Yang, Bn Yang Key Laboratory of Optoelectronc Technology and Systems(Chongqng Unversty), Mnstry of Educaton, ChongQng Unversty,

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS P.G. Demdov Yaroslavl State Unversty Anatoly Ntn, Vladmr Khryashchev, Olga Stepanova, Igor Kostern EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS Yaroslavl, 2015 Eye

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Face Detection with Deep Learning

Face Detection with Deep Learning Face Detecton wth Deep Learnng Yu Shen Yus122@ucsd.edu A13227146 Kuan-We Chen kuc010@ucsd.edu A99045121 Yzhou Hao y3hao@ucsd.edu A98017773 Mn Hsuan Wu mhwu@ucsd.edu A92424998 Abstract The project here

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

WIRELESS CAPSULE ENDOSCOPY IMAGE CLASSIFICATION BASED ON VECTOR SPARSE CODING.

WIRELESS CAPSULE ENDOSCOPY IMAGE CLASSIFICATION BASED ON VECTOR SPARSE CODING. WIRELESS CAPSULE ENDOSCOPY IMAGE CLASSIFICATION BASED ON VECTOR SPARSE CODING Tao Ma 1, Yuexan Zou 1 *, Zhqang Xang 1, Le L 1 and Y L 1 ADSPLAB/ELIP, School of ECE, Pekng Unversty, Shenzhen 518055, Chna

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Margin-Constrained Multiple Kernel Learning Based Multi-Modal Fusion for Affect Recognition

Margin-Constrained Multiple Kernel Learning Based Multi-Modal Fusion for Affect Recognition Margn-Constraned Multple Kernel Learnng Based Mult-Modal Fuson for Affect Recognton Shzh Chen and Yngl Tan Electrcal Engneerng epartment The Cty College of New Yor New Yor, NY USA {schen, ytan}@ccny.cuny.edu

More information

Improved SIFT-Features Matching for Object Recognition

Improved SIFT-Features Matching for Object Recognition Improved SIFT-Features Matchng for Obect Recognton Fara Alhwarn, Chao Wang, Danela Rstć-Durrant, Axel Gräser Insttute of Automaton, Unversty of Bremen, FB / NW Otto-Hahn-Allee D-8359 Bremen Emals: {alhwarn,wang,rstc,ag}@at.un-bremen.de

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

High Five: Recognising human interactions in TV shows

High Five: Recognising human interactions in TV shows PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 1 Hgh Fve: Recognsng human nteractons n TV shows Alonso Patron-Perez alonso@robots.ox.ac.uk Marcn Marszalek marcn@robots.ox.ac.uk Andrew Zsserman

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction Encodng Crowd Interacton wth Deep Neural Network for Pedestran Trajectory Predcton Yanyu Xu ShanghaTech Unversty xuyy2@shanghatech.edu.cn Zhxn Pao ShanghaTech Unversty paozhx@shanghatech.edu.cn Shenghua

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Gender Classification using Interlaced Derivative Patterns

Gender Classification using Interlaced Derivative Patterns Gender Classfcaton usng Interlaced Dervatve Patterns Author Shobernejad, Ameneh, Gao, Yongsheng Publshed 2 Conference Ttle Proceedngs of the 2th Internatonal Conference on Pattern Recognton (ICPR 2) DOI

More information

Classification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM

Classification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM Classfcaton of Face Images Based on Gender usng Dmensonalty Reducton Technques and SVM Fahm Mannan 260 266 294 School of Computer Scence McGll Unversty Abstract Ths report presents gender classfcaton based

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng, Natonal Unversty of Sngapore {shva, phanquyt, tancl }@comp.nus.edu.sg

More information

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection 2009 10th Internatonal Conference on Document Analyss and Recognton A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng,

More information

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features A Probablstc Approach to Detect Urban Regons from Remotely Sensed Images Based on Combnaton of Local Features Berl Sırmaçek German Aerospace Center (DLR) Remote Sensng Technology Insttute Weßlng, 82234,

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Manifold Regularized Slow Feature Analysis for Dynamic Texture Recognition

Manifold Regularized Slow Feature Analysis for Dynamic Texture Recognition 1 Manfold Regularzed Slow Feature Analyss for Dynamc Texture Recognton Je Mao, Xangmn Xu, Member, IEEE, Xaofen Xng, and Dacheng Tao, Fellow, IEEE arxv:1706.03015v1 [cs.cv] 9 Jun 2017 Abstract Dynamc textures

More information

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures A Novel Adaptve Descrptor Algorthm for Ternary Pattern Textures Fahuan Hu 1,2, Guopng Lu 1 *, Zengwen Dong 1 1.School of Mechancal & Electrcal Engneerng, Nanchang Unversty, Nanchang, 330031, Chna; 2. School

More information

Detection of Human Actions from a Single Example

Detection of Human Actions from a Single Example Detecton of Human Actons from a Sngle Example Hae Jong Seo and Peyman Mlanfar Electrcal Engneerng Department Unversty of Calforna at Santa Cruz 1156 Hgh Street, Santa Cruz, CA, 95064 {rokaf,mlanfar}@soe.ucsc.edu

More information

Lower Body Pose Estimation in Team Sports Videos Using Label-Grid Classifier Integrated with Tracking-by-Detection

Lower Body Pose Estimation in Team Sports Videos Using Label-Grid Classifier Integrated with Tracking-by-Detection Informaton and Meda Technologes 10(2): 246-258 (2015) reprnted from: IPSJ Transactons on Computer Vson and Applcatons 7: 18-30 (2015) Informaton Processng Socety of Japan Research Paper Lower Body Pose

More information

Classifier Swarms for Human Detection in Infrared Imagery

Classifier Swarms for Human Detection in Infrared Imagery Classfer Swarms for Human Detecton n Infrared Imagery Yur Owechko, Swarup Medasan, and Narayan Srnvasa HRL Laboratores, LLC 3011 Malbu Canyon Road, Malbu, CA 90265 {owechko, smedasan, nsrnvasa}@hrl.com

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

Multi-view 3D Position Estimation of Sports Players

Multi-view 3D Position Estimation of Sports Players Mult-vew 3D Poston Estmaton of Sports Players Robbe Vos and Wlle Brnk Appled Mathematcs Department of Mathematcal Scences Unversty of Stellenbosch, South Afrca Emal: vosrobbe@gmal.com Abstract The problem

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

The Study of Remote Sensing Image Classification Based on Support Vector Machine

The Study of Remote Sensing Image Classification Based on Support Vector Machine Sensors & Transducers 03 by IFSA http://www.sensorsportal.com The Study of Remote Sensng Image Classfcaton Based on Support Vector Machne, ZHANG Jan-Hua Key Research Insttute of Yellow Rver Cvlzaton and

More information

Comparison Study of Textural Descriptors for Training Neural Network Classifiers

Comparison Study of Textural Descriptors for Training Neural Network Classifiers Comparson Study of Textural Descrptors for Tranng Neural Network Classfers G.D. MAGOULAS (1) S.A. KARKANIS (1) D.A. KARRAS () and M.N. VRAHATIS (3) (1) Department of Informatcs Unversty of Athens GR-157.84

More information

Learning a Class-Specific Dictionary for Facial Expression Recognition

Learning a Class-Specific Dictionary for Facial Expression Recognition BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 16, No 4 Sofa 016 Prnt ISSN: 1311-970; Onlne ISSN: 1314-4081 DOI: 10.1515/cat-016-0067 Learnng a Class-Specfc Dctonary for

More information

Face Tracking Using Motion-Guided Dynamic Template Matching

Face Tracking Using Motion-Guided Dynamic Template Matching ACCV2002: The 5th Asan Conference on Computer Vson, 23--25 January 2002, Melbourne, Australa. Face Trackng Usng Moton-Guded Dynamc Template Matchng Lang Wang, Tenu Tan, Wemng Hu atonal Laboratory of Pattern

More information

Novel Fuzzy logic Based Edge Detection Technique

Novel Fuzzy logic Based Edge Detection Technique Novel Fuzzy logc Based Edge Detecton Technque Aborsade, D.O Department of Electroncs Engneerng, adoke Akntola Unversty of Tech., Ogbomoso. Oyo-state. doaborsade@yahoo.com Abstract Ths paper s based on

More information

Feature Selection for Target Detection in SAR Images

Feature Selection for Target Detection in SAR Images Feature Selecton for Detecton n SAR Images Br Bhanu, Yngqang Ln and Shqn Wang Center for Research n Intellgent Systems Unversty of Calforna, Rversde, CA 95, USA Abstract A genetc algorthm (GA) approach

More information

Palmprint Feature Extraction Using 2-D Gabor Filters

Palmprint Feature Extraction Using 2-D Gabor Filters Palmprnt Feature Extracton Usng 2-D Gabor Flters Wa Kn Kong Davd Zhang and Wenxn L Bometrcs Research Centre Department of Computng The Hong Kong Polytechnc Unversty Kowloon Hong Kong Correspondng author:

More information

MOTION BLUR ESTIMATION AT CORNERS

MOTION BLUR ESTIMATION AT CORNERS Gacomo Boracch and Vncenzo Caglot Dpartmento d Elettronca e Informazone, Poltecnco d Mlano, Va Ponzo, 34/5-20133 MILANO boracch@elet.polm.t, caglot@elet.polm.t Keywords: Abstract: Pont Spread Functon Parameter

More information

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE 1 Assocatng Groups of People We-Sh Zheng jason@dcs.qmul.ac.uk Shaogang Gong sgg@dcs.qmul.ac.uk Tao Xang txang@dcs.qmul.ac.uk School of EECS, Queen Mary Unversty

More information

CORRELATION ICP ALGORITHM FOR POSE ESTIMATION BASED ON LOCAL AND GLOBAL FEATURES

CORRELATION ICP ALGORITHM FOR POSE ESTIMATION BASED ON LOCAL AND GLOBAL FEATURES CORRELATION ICP ALGORITHM FOR POSE ESTIMATION BASED ON LOCAL AND GLOBAL FEATURES Marco A. Chavarra, Gerald Sommer Cogntve Systems Group. Chrstan-Albrechts-Unversty of Kel, D-2498 Kel, Germany {mc,gs}@ks.nformatk.un-kel.de

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking 2011 Internatonal Conference on Informaton and Intellgent Computng IPCSIT vol.18 (2011) (2011) IACSIT Press, Sngapore Robust Inler Feature Trackng Method for Multple Pedestran Trackng Young-Chul Lm a*

More information

Dynamic wetting property investigation of AFM tips in micro/nanoscale

Dynamic wetting property investigation of AFM tips in micro/nanoscale Dynamc wettng property nvestgaton of AFM tps n mcro/nanoscale The wettng propertes of AFM probe tps are of concern n AFM tp related force measurement, fabrcaton, and manpulaton technques, such as dp-pen

More information

Deep learning is a good steganalysis tool when embedding key is reused for different images, even if there is a cover source-mismatch

Deep learning is a good steganalysis tool when embedding key is reused for different images, even if there is a cover source-mismatch Deep learnng s a good steganalyss tool when embeddng key s reused for dfferent mages, even f there s a cover source-msmatch Lonel PIBRE 2,3, Jérôme PASQUET 2,3, Dno IENCO 2,3, Marc CHAUMONT 1,2,3 (1) Unversty

More information

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping.

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping. SIGGRAPH 004 Interactve Image Cutout Lazy Snappng Yn L Jan Sun Ch-Keung Tang Heung-Yeung Shum Mcrosoft Research Asa Hong Kong Unversty Separate an object from ts background Compose the object on another

More information

Data Mining: Model Evaluation

Data Mining: Model Evaluation Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct

More information

Metrol. Meas. Syst., Vol. XXIII (2016), No. 1, pp METROLOGY AND MEASUREMENT SYSTEMS. Index , ISSN

Metrol. Meas. Syst., Vol. XXIII (2016), No. 1, pp METROLOGY AND MEASUREMENT SYSTEMS. Index , ISSN Metrol. Meas. Syst., Vol. XXIII (2016), No. 1, pp. 27 36. METROLOGY AND MEASUREMENT SYSTEMS Index 330930, ISSN 0860-8229 www.metrology.pg.gda.pl HISTOGRAM OF ORIENTED GRADIENTS WITH CELL AVERAGE BRIGHTNESS

More information

Learning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris

Learning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris Learnng Ensemble of Local PDM-based Regressons Yen Le Computatonal Bomedcne Lab Advsor: Prof. Ioanns A. Kakadars 1 Problem statement Fttng a statstcal shape model (PDM) for mage segmentaton Callosum segmentaton

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Image Matching Algorithm based on Feature-point and DAISY Descriptor

Image Matching Algorithm based on Feature-point and DAISY Descriptor JOURNAL OF MULTIMEDIA, VOL. 9, NO. 6, JUNE 2014 829 Image Matchng Algorthm based on Feature-pont and DAISY Descrptor L L School of Busness, Schuan Agrcultural Unversty, Schuan Dujanyan 611830, Chna Abstract

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

Action Recognition by Matching Clustered Trajectories of Motion Vectors

Action Recognition by Matching Clustered Trajectories of Motion Vectors Acton Recognton by Matchng Clustered Trajectores of Moton Vectors Mchals Vrgkas 1, Vasleos Karavasls 1, Chrstophoros Nkou 1 and Ioanns Kakadars 2 1 Department of Computer Scence, Unversty of Ioannna, Ioannna,

More information

Video Copy Detection Based on Fusion of Spatio-temporal Features

Video Copy Detection Based on Fusion of Spatio-temporal Features Vdeo Copy Detecton Based on Fuson of Spato-temporal Features BAO We, JI Lxn, GAO Shln, LI Xng, Lu Lxong Natonal Dgtal swtchng System Engneerng & Technologcal R&D Center Zhengzhou, Chna 3202004075@163.com

More information

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 4, APRIL

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 4, APRIL IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 4, APRIL 2016 1713 Weakly Supervsed Fne-Graned Categorzaton Wth Part-Based Image Representaton Yu Zhang, Xu-Shen We, Janxn Wu, Member, IEEE, Janfe Ca,

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Deep Spatial-Temporal Joint Feature Representation for Video Object Detection

Deep Spatial-Temporal Joint Feature Representation for Video Object Detection sensors Artcle Deep Spatal-Temporal Jont Feature Representaton for Vdeo Object Detecton Baojun Zhao 1,2, Boya Zhao 1,2 ID, Lnbo Tang 1,2, *, Yuq Han 1,2 and Wenzheng Wang 1,2 1 School of Informaton and

More information

Using Spatial Pyramids with Compacted VLAT for Image Categorization

Using Spatial Pyramids with Compacted VLAT for Image Categorization Usng Spatal Pyramds wth Compacted VLAT for Image Categorzaton Roman Negrel, Davd Pcard, Phlppe-Henr Gosseln To cte ths verson: Roman Negrel, Davd Pcard, Phlppe-Henr Gosseln. Usng Spatal Pyramds wth Compacted

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

International Conference on Applied Science and Engineering Innovation (ASEI 2015)

International Conference on Applied Science and Engineering Innovation (ASEI 2015) Internatonal Conference on Appled Scence and Engneerng Innovaton (ASEI 205) Desgn and Implementaton of Novel Agrcultural Remote Sensng Image Classfcaton Framework through Deep Neural Network and Mult-

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information