High Five: Recognising human interactions in TV shows

Size: px
Start display at page:

Download "High Five: Recognising human interactions in TV shows"

Transcription

1 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 1 Hgh Fve: Recognsng human nteractons n TV shows Alonso Patron-Perez alonso@robots.ox.ac.uk Marcn Marszalek marcn@robots.ox.ac.uk Andrew Zsserman az@robots.ox.ac.uk Ian Red an@robots.ox.ac.uk Department of Engneerng Scence Unversty of Oxford Oxford, UK Abstract In ths paper we address the problem of recognsng nteractons between two people n realstc scenaros for vdeo retreval purposes. We develop a per-person descrptor that uses attenton (head orentaton) and the local spatal and temporal context n a neghbourhood of each detected person. Usng head orentaton mtgates camera vew ambgutes, whle the local context, comprsed of hstograms of gradents and moton, ams to capture cues such as hand and arm movement. We also employ structured learnng to capture spatal relatonshps between nteractng ndvduals. We tran an ntal set of one-vs-the-rest lnear SVM classfers, one for each nteracton, usng ths descrptor. Notng that people generally face each other whle nteractng, we learn a structured SVM that combnes head orentaton and the relatve locaton of people n a frame to mprove upon the ntal classfcaton obtaned wth our descrptor. To test the effcacy of our method, we have created a new dataset of realstc human nteractons comprsed of clps extracted from TV shows, whch represents a very dffcult challenge. Our experments show that usng structured learnng mproves the retreval results compared to usng the nteracton classfers ndependently. 1 Introducton The am of ths paper s the recognton of nteractons between two people n vdeos n the context of vdeo retreval. In partcular we focus on four nteractons: hand shakes, hgh fves, hugs and ksses. Recognsng human nteractons can be consdered an extenson of sngle-person acton recognton and can provde a dfferent crtera for content-based vdeo retreval. Two-person nteractons can also be used drectly or as a buldng block to create complex systems n applcatons lke survellance, vdeo games and human-computer nteracton. Prevous work n two-person nteracton recognton s scarce compared to closely related areas such as sngle-person acton recognton [7, 10, 12, 22], group acton recognton [14, 24] and human-object nteracton recognton [16, 23]. Closer to our work are [4, c The copyrght of ths document resdes wth ts authors. It may be dstrbuted unchanged freely n prnt or electronc forms.

2 2 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS Fgure 1: Dataset snapshots. Note the varaton n the actors, scale and camera vews. 17, 19], where nteractons are generally recognsed n a herarchcal manner puttng specal attenton on hgher level descrptons and usng very constraned data. These approaches rely heavly upon many low level mage pre-processng steps lke background subtracton and segmentaton of body parts whch are, by themselves, very dffcult problems to solve when workng wth more complex scenaros. In contrast, recent publcatons on sngle-acton recognton have shown a natural move from smplfed and constraned datasets to more realstc ones [11, 12, 13, 21, 22]. One of the contrbutons of ths paper s the complaton of a realstc human nteracton dataset extracted from a collecton of TV shows (Secton 2). Workng wth realstc datasets ntroduces a new set of challenges that have to be addressed n order to acheve successful recognton: background clutter, a varyng number of people n the scene, camera moton and changes of camera vewponts, to name a few. Our approach s to ntroduce a person-centred descrptor that uses a combnaton of smple features to deal n a systematc way wth these challenges. An upper body detector [6] s frst used to fnd people n every frame of the vdeo (Secton 3). The detectons are then clustered to form tracks. A track s defned as a set of upper body boundng boxes, n consecutve frames, correspondng to the same person. The am of ths frst step s to reduce the search space for nteractons to a lnear search along each track n an analogous way as [9]. We then calculate descrptors along these tracks and use them to learn a Support Vector Machne (SVM) classfer for each nteracton. Then nteracton scores are computed for each boundng box of each track. We also use the head orentaton of people detected n two novel ways: frst to acheve a weak vew nvarance n the descrptor (see Secton 3), and second to learn nteracton-based spatal relatons between people (Secton 4). The latter s based on our assumpton that people generally face each other whle nteractng. Ths assumpton s used to learn a structured SVM [20] that s traned to obtan the best jont classfcaton of a group of people n a frame. We show that usng structured learnng (SL) can mprove the retreval results obtaned by ndependently classfyng each track. An addtonal characterstc of our structured formulaton s that t provdes nformaton about whch people are nteractng. In Secton 4.2 we show the retreval results obtaned by the ndvdual and structured track classfcaton. Secton 5 presents our conclusons and future work. 2 Dataset We have compled a dataset of 300 vdeo clps extracted from 23 dfferent TV shows 1. Each of the clps contans one of four nteractons: hand shake, hgh fve, hug and kss (each 1 vgg/data/tv_human_nteractons

3 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 3 appearng n 50 vdeos). Negatve examples (clps that don t contan any of the nteractons) make up the remanng 100 vdeos. The length of the vdeo clps ranges from 30 to 600 frames. The nteractons are not temporally algned (.e. a clp contanng a hand shake mght start wth people walkng towards each other or drectly at the moment of the hand shake). There s a great degree of varaton between dfferent clps and also n several cases wthn the same clp (Fgure 1). Such varaton ncludes the number of actors n each scene, ther scales and the camera angle, ncludng abrupt vewpont changes (shot boundares). To have a ground truth for the evaluaton of the methods developed n ths paper, we have annotated every frame of each vdeo wth the followng: the upper body, dscrete head orentaton and nteracton label of all persons present whose upper body sze s wthn a certan range. Ths range goes from far shots that show the whole body to medum shots where only the upper body s vsble and s equvalent to pxels n our vdeos. We have also annotated whch persons are nteractng, f any, n each frame. For the purposes of tranng and testng, the dataset has been splt nto two groups, each contanng vdeos of mutually exclusve TV shows. The experments shown n the followng sectons were performed usng one set for tranng, the other for testng and vce versa. 3 Modelng human actvty Because of the complexty and varablty of the vdeos n our dataset, fndng relevant and dstnctve features becomes ncreasngly dffcult. The descrptor has to be smultaneously () relatvely coarse to deal wth varaton, and () to some extent focused to avod learnng background nose when codfyng the nteracton. We address these ponts by makng our descrptor person-centred, and by further organsng the data based on head orentaton. The person-centred descrptor focuses on the area around the upper body of a sngle person, enablng us to localse regons of potental nterest and to learn relevant nformaton nsde them. Our descrptor does ths by coarsely quantfyng appearance and moton nsde ths regon. Ths s n contrast to other approaches n sngle-acton recognton [7, 11, 12, 15, 22], where features are estmated n the whole frame or vdeo and then clustered to localse where the acton s happenng. Another advantage for mplementng a personcentred descrptor s that, dependng on the camera angle, both persons are not always vsble n a gven frame, and we would lke to be able to provde a classfcaton n these nstances. For the moment, we assume that we know the locaton and scale of people n each frame and leave the detecton method for secton Person-centred descrptor The followng descrbes the process for obtanng a descrptor gven an upper body locaton, whch s repeated for each person detected n a frame. Our descrptor supermposes an 8 8 grd around an upper body detecton. The sze of the grd, beng dependent on the detecton sze, deals wth changes of scale. We then calculate hstograms of gradents and optcal flow n each of ts cells. An example of ths can be seen n Fgure 2b. Ths technque of usng hstograms of gradents and flow s a coarse analog to the descrptor used n [3, 11]. Gradents are dscretsed nto fve bns: horzontal, vertcal, two dagonal orentatons and a no-gradent bn. Optcal flow s also dscretsed nto fve bns: no-moton, left, rght, up and down. The hstograms are ndependently normalsed and concatenated to create an ntal grd descrptor g (Note on notaton: whenever a vector s used n ths paper s consdered to be n row format by default). We also expermented wth several varants of the grd

4 4 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS (a) Fgure 2: (a) Upper body detectons and estmated dscrete head orentaton. (b) Grd showng domnant cell gradent and sgnfcant moton (red cells) for a hand shake. (b) descrptor: usng only moton, only gradents, only nformaton of the cells outsde the upper body detecton as well as dfferent normalsatons. The experments descrbed n Secton 3.3 show the results obtaned by selectng dfferent parameters. To obtan the fnal descrptor d, we take nto account the head orentaton, dscretsed nto one of fve orentatons: profle-left, front-left, front-rght, profle-rght and backwards. Perfect frontal vews are very rare and they are ncluded n ether of the two frontal categores. Effectvely, we want to create a compact and automatc representaton from whch we can learn a dfferent classfer for each dscrete head orentaton. To do ths, the dscrete head orentaton, θ, s used to perform the followng operaton: g + = g δ θ, d = [g + g] (1) where s the Kronecker product, δ θ s an ndcator vector wth fve elements (correspondng to the dscrete head orentatons) havng a one at poston θ and zero everywhere else. By usng the head orentaton, we are amng to capture nformaton correlated wth t. Assumng that an nteracton occurs n the drecton a person s facng (Fgure2a) ths can provde us wth a weak knd of vew nvarance. We add an extra copy of g at the end of the descrptor d to account for any nformaton that s ndependent of the head orentaton and to help n cases where the automatc estmaton of the head orentaton s wrong. We can duplcate the amount of examples used for tranng by horzontally flppng the vdeo frames resultng n oposte head orentatons (.e. profle-left becomes profle-rght). The descrptor d s used as a data vector for tranng a lnear SVM classfer. An llustratve example of the results that we obtan, Fgure 3, shows the moton regons (outsde the upper body detecton) learnt by a lnear SVM classfer traned to dscrmnate between hand shakes and hgh fves. As expected, mportant moton regons are correlated wth the head orentaton and occur n lower locatons for hand shakes and hgher ones for hgh fves. 3.2 Localsng humans and estmatng head orentaton To be able to use the descrptor proposed above, we need to pre-process our vdeo clps. The pre-processng follows the same steps as n [6], and we brefly explan them here for completeness. Frst we run an upper body detector n each frame. Ths detector s traned usng a standard Hstogram of Orented Gradents (HOG) descrptor [2] and a smple lnear SVM classfer. We tran two such detectors at a dfferent ntal scale (to mprove the detecton rate). Next, we cluster these detectons usng clque parttonng to form tracks. Very short tracks and tracks wth low average SVM scores are elmnated, and those that reman are used n the experments. As n [1, 18] we learn a classfer for dscrete head orentatons,

5 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 5 Fgure 3: Moton weghts outsde upper body detecton (blue square) learnt by a lnear SVM classfer traned to dscrmnate between hand shakes and hgh fves. Hgher weghts are ndcated by lghter areas. As expected, the more mportant moton regons are n lower locatons for hand shakes and n hgher ones for hgh fves. These also follow the drecton of the face. however we smply tran a one-vs-the-rest lnear SVM usng HOG descrptors. Once the classfer s learnt, we estmate the head locaton n each boundng box of each track and obtan a dscrete head orentaton classfcaton. 3.3 Experments Gven that people s tracks have been calculated n every vdeo as prevously descrbed, we want to evaluate the accuracy of our descrptor when classfyng nteractons. We have desgned a set of experments to show the effect of: () not usng head orentaton nformaton vs addng t ether by manual annotaton or by automatc classfcaton; () changng descrptor nformaton: usng only moton, only gradents or both; () addng weak temporal nformaton by concatenatng descrptors of consecutve frames to form a sngle descrptor. The term n-frame descrptor refers to a concatenaton of n descrptors from consecutve frames. To be able to compare the results obtaned, all of the experments follow the next steps. We manually select from each clp fve consecutve frames that are nsde the temporal regon where the nteracton s happenng. From these frames we extract descrptors from a track of one of the people performng the nteracton (agan we manually select the track). The same process s appled to the negatve vdeos. As descrbed n Secton 2, the dataset s dvded nto two sets for tranng and testng. We use n turn the descrptors of each set to tran a one-vs-the-rest lnear SVM classfer for each nteracton n a supervsed way. The classfcaton of a clp s done by addng the SVM classfcaton scores of each one of the descrptors extracted from ts fve selected frames. Fgure 4 provdes a vsual representaton of the results. Column-wse we observe accuracy results obtaned usng dfferent n-frame descrptors. Row-wse represents the average accuracy when choosng dfferent nformaton to nclude n the descrptor: only moton, only gradents and both. Each row s an average over tests usng full or external cells and dfferent normalsatons (L1, L2 or no-norm). The table tself s an average of the results obtaned when testng on both sets. Several thngs can be concluded from ths representaton. Frst, we can readly observe that the use of head orentaton mproves the classfcaton accuracy when correctly estmated, but errors when automatcally classfyng the head orentaton reduce t. Takng the

6 6 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS Fgure 4: Average classfcaton accuracy results wth dfferent parameter combnatons. No consstent mprovement s notced by usng hgher n-frame descrptors. Moton nformaton s a more dscrmnatve feature than gradents n three of the four nteractons. On average usng head nformaton mproves the accuracy. (Best vewed n color). best combnaton of parameters for each nteracton (usng 1-frame descrptors), the average accuracy when usng manually annotated head orentaton s 59.4%, for automatc head orentaton 52.2% and for no head orentaton 48.8%. We noted that the concatenaton of descrptors dd not consstently mprove the classfcaton results. Another easly dstngushable characterstc s that the use of moton features alone has better performance when classfyng hgh fves and ksses, whle a combnaton of both works better for hugs. Ths s very ntutve because hugs contan mnmal moton n contrast to the other actons. The bad performance of usng only gradents could be explaned by the coarseness of our descrptor, whch results n learnng gradents that are too general to be dstnctve. We tred to mprove these results by ncreasng the number of cells. The resultng ncreased sze of the descrptor combned wth a reduced number of tranng examples led to worse classfcaton results. 4 Learnng human nteractons As mentoned before, sometmes only one of the two people performng an nteracton appears n the vdeo clp. However, when the locaton of two or more people s avalable n a specfc frame, we should use ths to mprove our classfcaton. The assumpton we make s that people face each other whle nteractng. Thus we want to learn relatve locatons of people gven both ther head orentaton and an nteracton label. We propose to do ths by usng a structured learnng (SL) framework smlar to the one descrbed n [5]. The goal s to smultaneously estmate the best jont classfcaton for a set of detectons n a vdeo frame rather than classfyng each detecton ndependently. In contrast to [5], where SL s used to learn spatal relatons between object classes, we want to learn spatal relatons between people gven ther nteracton class and head orentaton. 4.1 Structured learnng We pose the SL problem n the followng terms: n each frame we have a set of upper body detectons X = [x 1...x M ]. Each detecton x = [l x l y s θ v], has nformaton about ts upper left corner locaton (l x,l y ), scale (s), dscrete head orentaton (θ), and SVM classfcaton scores (v) obtaned by classfyng the descrptor assocated wth ths detecton usng the n-

7 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 7 Fgure 5: (a) Spatal relatons (δ j ) used n our structured learnng method. The black square at the centre represents the head locaton nsde an upper body detecton. (b) Weghts (β) learnt for each nteracton class and head orentaton combnaton. Lghter ntensty ndcates a hgher weght. teracton classfers prevously learnt. Assocated wth each frame s a label Y = [y 1...y M y c ]. Ths label s formed by a class label y {0,..,K} for each detecton (where K s the number of nteracton classes, wth 0 representng the no-nteracton class) and a confguraton label y c that serves as an ndex for one of the vald parngs of detectons. For example, for three detectons there are four vald confguratons: {(1,0), (2,0), (3,0)}, {(1,0), (2,3)}, {(1,3), (2,0)} and {(1,2) (3,0)}, where (, j) ndcates that detecton s nteractng wth detecton j and the 0 ndex means there s no nteracton. We measure the match between an nput X and a labelng Y by the followng cost functon: S(X,Y) = M α 0 y θ v y + M αy 1 θ + (δ j β T y θ + δ j β T y j θ j ) (2) (, j) P yc where v y s the SVM classfcaton score for class y of detecton, P yc s the set of vald pars defned by confguraton ndex y c, δ j and δ j are ndcator vectors codfyng the relatve locaton of detecton j wth respect to detecton (and vce versa) nto one of R = 6 spatal relatons shown n Fgure 5a. α 0 y θ and α 1 y θ are scalar weghtng and bas parameters that measure the confdence that we have n the SVM score of class y when the head dscrete orentaton s θ {1,...,D}. β y θ s a vector that weghts each spatal confguraton gven a class label and dscrete head orentaton. Once the weghts are learnt, we can fnd the label that maxmses the cost functon by exhaustve search, whch s possble gven the small number of nteracton classes and number of people n each frame. Learnng. We use the SV M struct package [8] to learn the weghts α and β descrbed prevously. To do ths, we must frst re-arrange equaton 2 to defne a sngle weght vector and encapsulate the X and Y components nto a potental functon Ψ (see [20]), and second we need to defne a sutable loss functon. We start by defnng: δ + j = δ j δ y θ and δ + j = δ j δ y j θ j, where means the Kronecker product and δ y θ s an ndcator vector of sze KD havng a one at poston y K + θ and zeros everywhere else. Also, let α 0 = [α α0 KD ], α 1 = [α α1 KD ] and β = [β β KD ]. By substtutng nto equaton 2 we obtan: M S(X,Y) = [α 0 α 1 β ] [ v y δ y θ }{{} δ y θ (δ + j + δ + j) ] T (, j) P yc w }{{} Ψ M (3)

8 8 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS A key element of a SL framework s to defne an adequate loss functon for the problem n consderaton. Here we would lke the loss functon not only to penalse wrong assgnments of nteracton labels but confguraton labels as well. We also want addtonally to penalse a label msmatch between detectons that are labeled as nteractng. Takng these elements nto consderaton, we defne our loss functon as: (Y,Ŷ) = c (, j) = M 01 (y,ŷ ) + c (, j) (4) (, j) P yc 1 f (, j) / Pŷc 1 f (, j) Pŷc and ŷ ŷ j 0 otherwse were 01 s the zero-one loss, Y s the ground truth labelng and Ŷ s a labelng hypothess. Consder a frame wth three people, two of them nteractng. A canddate label that assgns an ncorrect nteracton label to a person that s not nteractng wll result n a loss of 1 from 01. If nstead ths error occurs n one of the people that are nteractng the loss wll be 2 (1 for the ncorrect label n 01 plus 1 for assgnng dfferent labels to nteractng people n c ). Errors n the confguraton label (y c ) tend to ncrease the loss sgnfcantly dependng on the number of actors present. An example of the spatal weghts learned usng ths method can be seen n Fgure 5b. 4.2 Experments In ths secton we compare the retreval results obtaned by ndvdual classfcaton and by SL. As ndcated n Secton 3.3, the concatenaton of descrptors dd not consstently mprove the classfcaton accuracy. Therefore, we selected a smple 1-frame descrptor that uses both moton and gradents wth L1 normalsaton and all cells n the grd. The classfers were traned to dscrmnate between fve classes: the four nteractons and a no-nteracton class. For a retreval task we need to defne a scorng functon for a vdeo clp. We propose a score based on the classfcaton of each track extracted from the clp. In each frame a detecton belongng to a track s classfed ether ndependently usng the classfers learned n Secton 3.3 or usng the SL framework. The score of each nteracton n a track s smply the percentage of ts detectons that were classfed as that nteracton. The overall nteracton scores of a clp are the average of the track scores. The average s calculated over the tracks where at least one frame was classfed as an nteracton. Ths s to avod assgnng low (5) Method HS HF HG KS AVG M + ID M + SL M + ID + N M + SL + N A + ID A + SL A + ID + N A + SL + N Table 1: Average precson results for the vdeo retreval task, when usng manual (M) or automatc (A) annotatons, ndependent (ID) or structured (SL) classfcaton and when ncludng the negatve (N) vdeos as part of the retreval task. In every case, the use of structured learnng mproves the average results.

9 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 9 Fgure 6: Hghest ranked true and false postves for each nteracton obtaned usng the automatc method. The red square ndcates negatve vdeos. nteracton scores to vdeos wth many actors (most of whom are not nteractng). The score for no-nteracton s an average over all tracks. The same process s used for scorng the negatve vdeos and evaluate the effect that ncludng these clps has on the overall rankng. Average precson (AP) results obtaned usng ths rankng measure are shown n Table 1. We tested the nfluence of usng SL when we have manually labeled upper body detectons and head orentatons, and when we use the automatc method descrbed n Secton 3.2. Consderng the substantal challenges of the task, our results fall wthn those obtaned by state-of-the-art methods n sngle-acton recognton that use smlar datasets [7, 10, 12, 22], although a drect comparson s not possble. In every case the mean AP s mproved by the use of SL. Ths mprovement s more obvous n the manually labeled case. When usng the automatc method, there are many factors that can account for the smaller degree of mprovement when usng SL, namely: the nablty to always detect both people performng the nteracton (SL, as we have employed t, can t mprove the results n ths case), the appearance of false postves and the ncorrect automatc classfcaton of head orentaton. In the last two cases, the nput to the SL method s corrupted, and attempts to derve a jont classfcaton wll most lkely produce ncorrect results. To gve an nsght nto the dffculty of ths task Fgure 6 shows the best ranked true and false postves when generatng tracks automatcally and usng the full dataset ncludng negatve vdeos (complete average precson results for ths setup are shown n the last two rows of Table 1). We observed that hand shakes tend to be detected where no nteracton s happenng, ths could be because the natural moton of the arms (when walkng or talkng) resembles the moton pattern of a hand shake n some frames.

10 10 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 5 Concluson and future work In ths paper we have proposed a new descrptor for human nteractons that captures nformaton n a regon around a person and uses head orentaton to focus attenton on specfc places nsde ths regon. We have also ntroduced a new dataset of realstc nteractons extracted from TV shows, and have shown good classfcaton and retreval results usng our descrptor. Furthermore, we have shown that usng SL to ncorporate spatal relatonshps between detected people n the scene mproves the retreval results obtaned by ndependently classfyng each detecton. Several deas for future work are readly avalable by analysng the results obtaned n Sectons 3.3 and 4.2. It s clear that an mprovement n the automatc head orentaton classfcaton and the automatc generaton of vdeo tracks wll have a postve effect on the classfcaton and retreval results. Although concatenatng descrptors of consecutve frames ddn t mprove the classfcaton scores n a consstent way, ths may be due to the fact that there wasn t much temporal varance to be captured n the fve frames of an nteracton that these experments consdered. It s lkely that capturng moton and appearance nformaton n longer perods of tme could gve us a better classfcaton. Acknowledgements. We are grateful for fnancal support from CONACYT and ERC grant VsRec no References [1] B. Benfold and I. Red. Gudng vsual survellance by trackng human attenton. In Brtsh Machne Vson Conference, [2] N. Dalal and B. Trggs. Hstograms of Orented Gradents for Human Detecton. In Conference on Computer Vson and Pattern Recognton, [3] N. Dalal, B. Trggs, and C. Schmd. Human Detecton Usng Orented Hstograms of Flow and Appearance. In European Conference on Computer Vson, [4] A. Datta, M. Shah, and N. Da Vtora Lobo. Person-on-Person Volence Detecton n Vdeo Data. In Internatonal Conference on Pattern Recognton, [5] C. Desa, D. Ramanan, and C. Fowlkes. Dscrmnatve models for mult-class object layout. In Internatonal Conference on Computer Vson, [6] V. Ferrar, M. Marn-Jmenez, and A. Zsserman. Pose Search: retrevng people usng ther pose. In Conference on Computer Vson and Pattern Recognton, [7] A. Glbert, J. Illngworth, and R. Bowden. Fast Realstc Mult-Acton Recognton usng Mned Dense Spato-temporal Features. In Internatonal Conference on Computer Vson, [8] T. Joachms, T. Fnley, and C. Yu. Cuttng plane tranng of structural SVMs. Machne Learnng, 77(1):27 59, [9] A. Kläser, M. Marszalek, C. Schmd, and A. Zsserman. Human Focused Acton Localzaton n Vdeo. In SGA, 2010.

11 PATRON-PEREZ ET AL.: RECOGNISING INTERACTIONS IN TV SHOWS 11 [10] I. Laptev and P. Perez. Retrevng Actons n Moves. In Internatonal Conference on Computer Vson, [11] I. Laptev, M. Marszalek, C. Schmd, and B. Rozenfeld. Learnng realstc human actons from moves. In Conference on Computer Vson and Pattern Recognton, [12] J. Lu, J. Luo, and M. Shah. Recognzng Realstc Actons from Vdeos "n the Wld". In Conference on Computer Vson and Pattern Recognton, [13] M Marszalek, I. Laptev, and C. Schmd. Actons n Context. In Conference on Computer Vson and Pattern Recognton, [14] B. N, S. Yan, and A. Kassm. Recognzng Human Group Actvtes wth Localzed Causaltes. In Conference on Computer Vson and Pattern Recognton, [15] J. Nebles, H. Wang, and L. Fe-Fe. Unsupervsed Learnng of Human Acton Categores Usng Spatal-Temporal Words. In Brtsh Machne Vson Conference, [16] K. Ogawara, Y. Tanabe, R. Kurazume, and T. Hasegawa. Learnng Meannful Interactons from Repettous Moton Patterns. In Internatonal Conference on Intellgent Robots and Systems, [17] S. Park and J.K. Aggarwal. Smultaneous trackng of multple body parts of nteractng persons. Computer Vson and Image Understandng, 102(1):1 21, [18] N. Robertson and I. Red. Estmatng gaze drecton from low-resoluton faces n vdeo. In European Conference on Computer Vson, [19] M. S. Ryoo and J. K. Aggarwal. Spato-Temporal Relatonshp Match: Vdeo Structure Comparson for Recognton of Complex Human Actvtes. In Internatonal Conference on Computer Vson, [20] I. Tsochantards, T. Hofman, T. Joachms, and Y. Altun. Support Vector Machne Learnng for Interdependent and Structured Output Spaces. In Internatonal Conference on Machne Learnng, [21] G. Wllems, J. H. Becker, T. Tuytelaars, and L. Van Gool. Exemplar-based Acton Recognton n Vdeo. In Brtsh Machne Vson Conference, [22] X. Wu, C. W. Ngo, J. L, and Y. Zhang. Localzng Volumetrc Moton for Acton Recognton n Realstc Vdeos. In ACM nternatonal conference on Multmeda, [23] B. Yao and L. Fe-Fe. Grouplet: a Structured Image Representaton for Recognzng Human and Object Interactons. In Conference on Computer Vson and Pattern Recognton, [24] W. Zhang, F. Chen, W. Xu, and Y. Du. Herarchcal group process representaton n mult-agent actvty recognton. Sgnal Processng: Image Communcaton, 23(10): , 2008.

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

What is Object Detection? Face Detection using AdaBoost. Detection as Classification. Principle of Boosting (Schapire 90)

What is Object Detection? Face Detection using AdaBoost. Detection as Classification. Principle of Boosting (Schapire 90) CIS 5543 Coputer Vson Object Detecton What s Object Detecton? Locate an object n an nput age Habn Lng Extensons Vola & Jones, 2004 Dalal & Trggs, 2005 one or ultple objects Object segentaton Object detecton

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature Detecton of hand graspng an object from complex background based on machne learnng co-occurrence of local mage feature Shnya Moroka, Yasuhro Hramoto, Nobutaka Shmada, Tadash Matsuo, Yoshak Shra Rtsumekan

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Improving Web Image Search using Meta Re-rankers

Improving Web Image Search using Meta Re-rankers VOLUME-1, ISSUE-V (Aug-Sep 2013) IS NOW AVAILABLE AT: www.dcst.com Improvng Web Image Search usng Meta Re-rankers B.Kavtha 1, N. Suata 2 1 Department of Computer Scence and Engneerng, Chtanya Bharath Insttute

More information

Motion Boundary Trajectory for Human Action Recognition

Motion Boundary Trajectory for Human Action Recognition Moton Boundary Trajectory for Human Acton Recognton So-Long Lo and Ah-Chung Tso Faculty of Informaton Technology, Macau Unversty of Scence and Technology Abstract. In ths paper, we propose a novel approach

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Face Detection with Deep Learning

Face Detection with Deep Learning Face Detecton wth Deep Learnng Yu Shen Yus122@ucsd.edu A13227146 Kuan-We Chen kuc010@ucsd.edu A99045121 Yzhou Hao y3hao@ucsd.edu A98017773 Mn Hsuan Wu mhwu@ucsd.edu A92424998 Abstract The project here

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Large-scale Web Video Event Classification by use of Fisher Vectors

Large-scale Web Video Event Classification by use of Fisher Vectors Large-scale Web Vdeo Event Classfcaton by use of Fsher Vectors Chen Sun and Ram Nevata Unversty of Southern Calforna, Insttute for Robotcs and Intellgent Systems Los Angeles, CA 90089, USA {chensun nevata}@usc.org

More information

Histogram of Template for Pedestrian Detection

Histogram of Template for Pedestrian Detection PAPER IEICE TRANS. FUNDAMENTALS/COMMUN./ELECTRON./INF. & SYST., VOL. E85-A/B/C/D, No. xx JANUARY 20xx Hstogram of Template for Pedestran Detecton Shaopeng Tang, Non Member, Satosh Goto Fellow Summary In

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract

More information

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS P.G. Demdov Yaroslavl State Unversty Anatoly Ntn, Vladmr Khryashchev, Olga Stepanova, Igor Kostern EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS Yaroslavl, 2015 Eye

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng, Natonal Unversty of Sngapore {shva, phanquyt, tancl }@comp.nus.edu.sg

More information

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection 2009 10th Internatonal Conference on Document Analyss and Recognton A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng,

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law) Machne Learnng Support Vector Machnes (contans materal adapted from talks by Constantn F. Alfers & Ioanns Tsamardnos, and Martn Law) Bryan Pardo, Machne Learnng: EECS 349 Fall 2014 Support Vector Machnes

More information

Prof. Feng Liu. Spring /24/2017

Prof. Feng Liu. Spring /24/2017 Prof. Feng Lu Sprng 2017 ttp://www.cs.pd.edu/~flu/courses/cs510/ 05/24/2017 Last me Compostng and Mattng 2 oday Vdeo Stablzaton Vdeo stablzaton ppelne 3 Orson Welles, ouc of Evl, 1958 4 Images courtesy

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

Online Detection and Classification of Moving Objects Using Progressively Improving Detectors

Online Detection and Classification of Moving Objects Using Progressively Improving Detectors Onlne Detecton and Classfcaton of Movng Objects Usng Progressvely Improvng Detectors Omar Javed Saad Al Mubarak Shah Computer Vson Lab School of Computer Scence Unversty of Central Florda Orlando, FL 32816

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE 1 Assocatng Groups of People We-Sh Zheng jason@dcs.qmul.ac.uk Shaogang Gong sgg@dcs.qmul.ac.uk Tao Xang txang@dcs.qmul.ac.uk School of EECS, Queen Mary Unversty

More information

Learning-based License Plate Detection on Edge Features

Learning-based License Plate Detection on Edge Features Learnng-based Lcense Plate Detecton on Edge Features Wng Teng Ho, Woo Hen Yap, Yong Haur Tay Computer Vson and Intellgent Systems (CVIS) Group Unverst Tunku Abdul Rahman, Malaysa wngteng_h@yahoo.com, woohen@yahoo.com,

More information

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking 2011 Internatonal Conference on Informaton and Intellgent Computng IPCSIT vol.18 (2011) (2011) IACSIT Press, Sngapore Robust Inler Feature Trackng Method for Multple Pedestran Trackng Young-Chul Lm a*

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

Multi-view 3D Position Estimation of Sports Players

Multi-view 3D Position Estimation of Sports Players Mult-vew 3D Poston Estmaton of Sports Players Robbe Vos and Wlle Brnk Appled Mathematcs Department of Mathematcal Scences Unversty of Stellenbosch, South Afrca Emal: vosrobbe@gmal.com Abstract The problem

More information

IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH

IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH Jyot Joglekar a, *, Shrsh S. Gedam b a CSRE, IIT Bombay, Doctoral Student, Mumba, Inda jyotj@tb.ac.n b Centre of Studes n Resources Engneerng,

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Combined Object Detection and Segmentation

Combined Object Detection and Segmentation Combned Object Detecton and Segmentaton Jarch Vansteenberge, Masayuk Mukunok, and Mchhko Mnoh Abstract We develop a method for combned object detecton and segmentaton n natural scene. In our approach segmentaton

More information

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning Computer Anmaton and Vsualsaton Lecture 4. Rggng / Sknnng Taku Komura Overvew Sknnng / Rggng Background knowledge Lnear Blendng How to decde weghts? Example-based Method Anatomcal models Sknnng Assume

More information

Robust Shot Boundary Detection from Video Using Dynamic Texture

Robust Shot Boundary Detection from Video Using Dynamic Texture Sensors & Transducers 204 by IFSA Publshng, S. L. http://www.sensorsportal.com Robust Shot Boundary Detecton from Vdeo Usng Dynamc Teture, 3 Peng Tale, 2 Zhang Wenjun School of Communcaton & Informaton

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

Classification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM

Classification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM Classfcaton of Face Images Based on Gender usng Dmensonalty Reducton Technques and SVM Fahm Mannan 260 266 294 School of Computer Scence McGll Unversty Abstract Ths report presents gender classfcaton based

More information

Deformable Part-based Robust Face Detection under Occlusion by Using Face Decomposition into Face Components

Deformable Part-based Robust Face Detection under Occlusion by Using Face Decomposition into Face Components Deformable Part-based Robust Face Detecton under Occluson by Usng Face Decomposton nto Face Components Darjan Marčetć, Slobodan Rbarć Unversty of Zagreb, Faculty of Electrcal Engneerng and Computng, Croata

More information

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introducton to Bonformatcs Sequence Algnment Luke Huan Electrcal Engneerng and Computer Scence http://people.eecs.ku.edu/~huan/ HMM Π s a set of states Transton Probabltes a kl Pr( l 1 k Probablty

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

MOTION BLUR ESTIMATION AT CORNERS

MOTION BLUR ESTIMATION AT CORNERS Gacomo Boracch and Vncenzo Caglot Dpartmento d Elettronca e Informazone, Poltecnco d Mlano, Va Ponzo, 34/5-20133 MILANO boracch@elet.polm.t, caglot@elet.polm.t Keywords: Abstract: Pont Spread Functon Parameter

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Accurate Overlay Text Extraction for Digital Video Analysis

Accurate Overlay Text Extraction for Digital Video Analysis Accurate Overlay Text Extracton for Dgtal Vdeo Analyss Dongqng Zhang, and Shh-Fu Chang Electrcal Engneerng Department, Columba Unversty, New York, NY 10027. (Emal: dqzhang, sfchang@ee.columba.edu) Abstract

More information

Gender Classification using Interlaced Derivative Patterns

Gender Classification using Interlaced Derivative Patterns Gender Classfcaton usng Interlaced Dervatve Patterns Author Shobernejad, Ameneh, Gao, Yongsheng Publshed 2 Conference Ttle Proceedngs of the 2th Internatonal Conference on Pattern Recognton (ICPR 2) DOI

More information

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

Dynamic Camera Assignment and Handoff

Dynamic Camera Assignment and Handoff 12 Dynamc Camera Assgnment and Handoff Br Bhanu and Ymng L 12.1 Introducton...338 12.2 Techncal Approach...339 12.2.1 Motvaton and Problem Formulaton...339 12.2.2 Game Theoretc Framework...339 12.2.2.1

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

Unsupervised object segmentation in video by efficient selection of highly probable positive features

Unsupervised object segmentation in video by efficient selection of highly probable positive features Unsupervsed object segmentaton n vdeo by effcent selecton of hghly probable postve features Emanuela Haller 1,2 and Marus Leordeanu 1,2 1 Unversty Poltehnca of Bucharest, Romana 2 Insttute of Mathematcs

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

3D vector computer graphics

3D vector computer graphics 3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

Data Mining: Model Evaluation

Data Mining: Model Evaluation Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Takahiro ISHIKAWA Takahiro Ishikawa Takahiro Ishikawa Takeo KANADE

Takahiro ISHIKAWA Takahiro Ishikawa Takahiro Ishikawa Takeo KANADE Takahro ISHIKAWA Takahro Ishkawa Takahro Ishkawa Takeo KANADE Monocular gaze estmaton s usually performed by locatng the pupls, and the nner and outer eye corners n the mage of the drver s head. Of these

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Real-Time View Recognition and Event Detection for Sports Video

Real-Time View Recognition and Event Detection for Sports Video Real-Tme Vew Recognton and Event Detecton for Sports Vdeo Authors: D Zhong and Shh-Fu Chang {dzhong, sfchang@ee.columba.edu} Department of Electrcal Engneerng, Columba Unversty For specal ssue on Multmeda

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Lower Body Pose Estimation in Team Sports Videos Using Label-Grid Classifier Integrated with Tracking-by-Detection

Lower Body Pose Estimation in Team Sports Videos Using Label-Grid Classifier Integrated with Tracking-by-Detection Informaton and Meda Technologes 10(2): 246-258 (2015) reprnted from: IPSJ Transactons on Computer Vson and Applcatons 7: 18-30 (2015) Informaton Processng Socety of Japan Research Paper Lower Body Pose

More information

Incremental Multiple Kernel Learning for Object Recognition

Incremental Multiple Kernel Learning for Object Recognition Incremental Multple Kernel Learnng for Obect Recognton Anruddha Kembhav, Behat Sddque, Roland Mezano, Scott McClosey, Larry S. Davs Unversty of Maryland, College Par Honeywell Labs Abstract A good tranng

More information

Metrol. Meas. Syst., Vol. XXIII (2016), No. 1, pp METROLOGY AND MEASUREMENT SYSTEMS. Index , ISSN

Metrol. Meas. Syst., Vol. XXIII (2016), No. 1, pp METROLOGY AND MEASUREMENT SYSTEMS. Index , ISSN Metrol. Meas. Syst., Vol. XXIII (2016), No. 1, pp. 27 36. METROLOGY AND MEASUREMENT SYSTEMS Index 330930, ISSN 0860-8229 www.metrology.pg.gda.pl HISTOGRAM OF ORIENTED GRADIENTS WITH CELL AVERAGE BRIGHTNESS

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information