Object Tracking with Dynamic Feature Graph

Size: px
Start display at page:

Download "Object Tracking with Dynamic Feature Graph"

Transcription

1 Object Tracking with Dynamic Feature Graph Feng Tang and Hai Tao Department o Computer Engineering, University o Caliornia, Santa Cruz {tang,tao}@soe.ucsc.edu Abstract Two major problems or model-based object tracking are: 1) how to represent an object so that it can eectively be discriminated with background and other objects; 2) how to dynamically update the model to accommodate the object appearance and structure changes. Traditional appearance based representations (like color histogram) will ail when the object has rich texture. In this paper, we present a novel eature based object representation attributed relational graph (ARG) or reliable object tracking. The object is modeled with invariant eatures (SIFT) and their relationship is encoded in the orm o an ARG that can eectively distinguish itsel rom background and other objects. We adopt a competitive and eicient dynamic model to adaptively update the object model by adding new stable eatures as well as deleting inactive eatures. A relaxation labeling method is used to match the model graph with the observation to get the best object position. Experiments show that our method can get reliable track even under dramatic appearance changes, occlusions, etc. 1 Introduction Model-based object tracking usually has a ixed object representation, and this so-called template is matched with the observation to get object position in visual tracking. Two major problems or model-based object tracking are 1) how to represent an object so that it can eectively be discriminated with background and other objects; 2) how to dynamically update the model to accommodate the object appearance and structure changes due to the changes in the surrounding conditions against which the tracked object is observed. Much work has been done on the irst problem but the second has received relatively little attention. In terms o object representation, appearance and shape based approaches have been widely used. Appearance based models usually model the object as color histogram, shape models usually model the object using 2D edges or 3D geometry models. In [5], the color histogram o a target region is used as the appearance representation. In [8], a contour-based ace-and-shoulder model is used or modeling people. In [18], a Gaussian distribution o the object pixel values is used. These algorithms build appearance models rom examples in training datasets, and then use the model to track objects. Appearance-based representations have been very successul, but most o they are holistic representations which completely lose the object structure inormation and are sensitive to illumination change and background clutter. For shape-based representation, shape contexts [1] characterize the local image by histogramming its edges into radial-polar bins. [3] takes a similar approach by characterizing each edge pixel with the local distribution o edges in its image neighborhood. Similarly, [15]measures the distributions o orientation inormation in a neighborhood chosen to be invariant to scale changes. Shape-based representations can eectively represent the object structure, but they don t consider the object appearance, which is also not satisying. Recent development in local invariant eature representations and their success in pattern recognition inspire us to use the advantage o local eatures in tracking. Such local eatures are usually designed to be invariant to appearance changes as well as geometric transormation (scale, rotation, etc). [10] describes eatures using scale-invariant salient convex local arrangement o contours in the image. The SIFT descriptor [13] uses histogram o gradient, which is scale and rotation invariant. [9] describes local eatures using spin-image, which is generated using a histogram o the relative position o neighborhood points to the interest point. [11] detects salient regions based on image complexity and uses local entropy as eature descriptor. [12] proposes an aine invariant descriptor or texture recognition. [16] gives a detailed perormance evaluation o dierent descriptors. One o the disadvantages o local eature based representation is it s lack o global structure inormation. But such structural inormation is crucial or object to distinguish it rom background and other objects. In this paper, to overcome the disadvantage o traditional appearance based representations, we propose to develop a new Attributed Relational Graph (ARG) based object representation that incorporates both distinctive eature Scale Invariant Feature Transorm (SIFT) and their relations or tracking. Locally, eatures describe the object details; globally, the relations between eatures encode the object structure. This elastic representation has the lexibility to handle objects with coherent motion and certain amount o variations caused by illumination changes, occlusion as well as discriminating between structurally dierent object types. Compared to object representation, relatively less work has been done on model/template updating to accommodate the object changes. In [4], a ranking system is proposed to select the best eature space among the 49 candi-

2 dates acquired by a linear combination o 3 likelihood images in R, G, B space. It actually uses the immediate previous rame as the training rame or eature selection and use the current rame as the test rame or oreground/background classiication; the results are very impressive or various background changes. The dierence between their work and our work is that their method is a discriminative tracker which cannot easily handle occlusions, while our tracker is a generative one which can handle occlusions easily. Kernel-based tracking [5] proposes a eature value weighting scheme based on the background color inormation and ocuses on salient target parts rom the representation o target and candidate model. In [14] the template is irst updated with the image at the current template location. To eliminate drit, this updated template is then aligned with the irst template to give the inal update. These methods are all global update, which are not suitable or our local eature based representation. In dynamic image sequence, local eatures may be unstable over time; it may show up in one rame, then disappear in the next rame then show up again. In this case, using only the previous rame to do the update is not enough. We model the object dynamic behaviors using a high order HMM that can eectively and adaptively update the model to handle new eatures show up, old eatures die o. Our object model can dynamically take in good eatures (stable eatures) and eliminate unstable eatures based on eature past perormances in a probabilistic sense. For example, i we know that in the previous consecutive several rames, one eature is never matched, it is not likely to be an active eature, so we delete it rom the model. I a new eature persistently appears in several consecutive rames, it is very likely to be a new eature, so we will add it to the model. This adaptively evolving scheme makes the tracker always settle on the optimal state even when the object undergos signiicant changes. Major contributions o this paper are: 1) We propose a novel, compact and robust eature-based object representation in the orm o attributed relational graph (ARG) that is invariant to various appearance changes as well as non-rigid motion. 2) We propose a competitive object dynamic model that can model the short term dynamics (inter-rame graph change) as well as the long term dynamics (dynamic model update - stable eature birth and death) that can eectively predict the evolving behavior o the object. 3) We present a general MAP ramework or eature based tracking. The rest o the paper is organized as ollows. Section 2 describes our eature-based object representation. Section 3 gives the main ramework. Section 4 presents the likelihood computation and object dynamics based on this representation. Experimental results are demonstrated in section 5. Section 6 concludes the paper. 2 Attributed relational graph based eature representation We use SIFT eatures [13] as our object primitive and organize SIFT eatures into an attributed relational graph (ARG). The ARG encodes the relations between dierent eatures, thus provides a more reliable representation or matching and tracking. 2.1 Feature Representation Each eature (or keypoint) is described by its location and scale, and the orientation o the main intensity gradient within a neighborhood and the gradient histogram in the local region. The eatures are located using the DOG detector, as developed by Lowe [13]. The SIFT descriptor is distinctive and it can have high probability to ind the exact match under certain extent o illumination changes and aine transormation. All these characteristics, especially the robustness under illumination changes, match the requirements o object tracking task. More over, the SIFT eatures can also be computed very eiciently so that it makes ast tracking possible. The SIFT eature is represented as : = { p, s, o, hist} where p is the 2-D position o the eature in terms o the image coordinate, s is the eature scale, o is the eature vector direction, and hist is the gradient orientation distribution quantized into 128 bins. 2.2 Organizing eatures with ARG Graph representations are widely used or representing structural inormation in dierent domains such as psychosociology, image interpretation and pattern recognition [7],[2]. Attributed relational graph is a more powerul approach or image representation than pure eature based representations. The semantic inormation o the relations among the image eatures is represented by the attributes associated with the relations between their corresponding eatures. This approach or image representation has shown to provide compact, concise and powerul representation that is capable o comprehending rich inormation contents o the images[7]. Feature relations encode their geometric structure that is a global description o how the object looks like. For simplicity and computational concerns, we deine the relation to be binary. Our deinition o neighborhood is adaptive. I two eatures are close to each other with respect to its scale, a relation is ormed. This is used in many image analysis algorithms, or example [17]. More speciically we deine the relations as ollows. Suppose the and ' are two eatures, their relation attributes are deined as: r(, ') = { rd, rs, ro}, where rd = p p ' is the Euclidean distance between two eatures, rs = s s ' / s + s ' is the scale dierence and ro = o o ' is the orientation dierence. The attributed relational graph G is deined as ollows:

3 G = {, r} is a relational graph. = { 1, 2... n} is the node set, as deined in 2.1. r r r r = { 1, 2... m} is the edge set. Since we use relative attributes as the relation, our graph representation is rotation and translation invariant. This will enhance the lexibility and robustness o our tracker. Figure 1. Object representation. The let image is the scene with the SIFT eatures imposed by blue arrows; the middle igure is the scaled version o the red rectangle area; the right image is the relational graph, with the green lines showing the edges. 3 The MAP ramework Our eature-based tracker is ormulated into a Maximum A Posteriori (MAP) ramework using the Hidden Markov Model. The hidden state includes the inormation about the eatures and relations (in the orm o an ARG). The object state at time t is identiied as Xt = Gt = We ignore the edge set here because the edges(relations) are completely dependent on the nodes, once the nodes are determined, edges are uniquely determined. Speciically, our eature tracker is ormulated as inding the Maximum A Posterior probability: max arg PX ( t It,... I0, Xt 1,..., X 0) Xt We assume an m-order Hidden Markov Model: max arg PX ( t It,... I0, Xt 1,..., X 0) = Xt max arg PX ( t It, It 1, Xt 1,..., Xt m) Xt PI Xt It 1 Xt 1 PXt It 1 Xt 1 Xt m Xt = max arg (,, ) (,,..., ) where PI ( Xt, It 1, Xt 1) is the likelihood and PX ( t It 1, Xt 1,..., Xt m) is the object dynamics. The reason why we use a high order hidden Markov model is that we need such a model to incorporate the object state history to predict the graph dynamics. Details o the model are in Section 4. Using our graph-based representation, the likelihood computation can be ormulated as a graph matching algorithm. The likelihood computation will be discussed in the ollowing section. 4 Object dynamics and likelihood (1) Unlike other trackers in which observations are image intensities, in our eature-based representation, the observation in each rame is the extracted SIFT eatures, which are used to generate the relational graph. The object dynamics express how the object (in the orm o a graph) evolves over time. To allow urther lexibility, we model the short term graph dynamics as well as long term dynamics. Short term dynamics relects the eature attributes changes with respect to the previous rame. Long term dynamics add new stable eatures and delete inactive eatures in a probabilistic sense based on past perormance o eatures. The likelihood is a measure o how well the model its the observation. We compute the likelihood using an eicient relaxation labeling based graph matching algorithm. 4.1 Object Dynamics Since both the graph nodes and edges evolve over time, their dynamics should be modeled simultaneously. However, the edges are ully dependent on nodes, and once nodes are determined, edges can be uniquely determined. So, we only need to model the node dynamics and the relation dynamics is automatically incorporated. Based on this, we ormulate the object dynamics as: t t 1 t m PX ( t It 1, Xt 1,..., Xt m) = P(,..., ) (2) t t t t Suppose = { 1, 2... Nt}, the state transition can be actorized as: N t t 1 t m t t 1 t m P(,..., ) = P( i i,..., ) (3) i= 1 t t 1 t t 1 P s( i i ) wheni, i t t 1 t m t t 1 t m t 1 i i i new i i i i t t 1 t m t Pdelete ( i i,..., i ) wheni P (,..., )) = P (,..., ) when = (4) = Ps is the short term dynamics which models the eature attribute prior. To better model the graph dynamics, we introduce the stable eatures. They are those persistent eatures that are observed in several consecutive rames. Stable eatures can adaptively model the evolving behavior o the object. Pnew and Pdelete model birth and death o stable eatures. They condition on eature s matching perormance in previous rames, because we want only the stable eature to stay in the stable eature set and unstable eatures which have a poor history will have very low probability o occurring in the next rame. We will discuss the 3 items in detail below Feature attribute prior We consider the dynamic prior or eature position, scale and orientation to be Gaussian with previous state attribute as the mean. The attribute prior can be ormulated as: P P P s s P o o P t t 1 s( i i ) = ( pt pt 1)* ( t t 1)* ( t t 1)* ( ht ht 1) T 1 st st 1 2 ot ot 1 2 pt 1 pt p pt 1 pt σs σo d( hh t, t 1) 2 h exp{ ( ) ( )}*exp{ ( ) )}*exp{ ( ) )}* exp{ ( ) )} (5) σ where p, s, o, h are the eature position, scale, orientation and histogram respectively, as deined in 2.1. d( hh t, t 1) is

4 the distance between histograms. p, σ s, σ o, σ h are the corresponding covariance matrices or variance Adding new stable eatures Since in each rame, new eatures may show up due to the appearance changes or pose changes, it s not wise to treat newcomers equally with the eatures that have been proved to be stable. We need a scheme to temporarily hold the new eatures, and ater some period o competition, add those really stable eatures into model. So, we maintain a candidate eature set to hold potential stable eatures. Each eature has an associated status vector, identiying on which rame it appears and on which rame not. We assume the probability or the eatures in the candidate set to be added into the model to be a Binomial distribution Bmp (, b) where m is the order o the HMM, i.e. the time window we use or evaluating the eature stableness. pb is the probability that the eature is observed. For eiciency concerns, in each rame, only those eatures whose probability o being boosted are higher than a threshold τ a can be added into the model. That is Pnew ~ B( no : m, pb) truncated between [ τ a, 1] where n o is the number o times the eature is observed in the previous m rames. This candidate eature set is updated in each rame ater state update Deleting inactive eatures Due to object pose or illumination changes, the eatures which have been stable in previous rames may become inactive in the uture. A scheme or deleting such inactive eatures is incorporated into our ramework. For those eatures already in the model, we also maintain a history o their perormances on the previous m rames, i a eature has not been matched or quite some times, we consider it to be inactive and delete it rom the model. Similar to the scenario to incorporate new eatures, we also model the eature deletion as a Binomial distribution: Pdelete ~ B( nm : m, p m) truncated between [ τ d, 1], τ d is the minimum probability that the eature can be deleted rom the model. Figure 2. Dynamic model update Graph nodes are shown as colored circles, and the edges are the lines connecting them. The irst row is the observation rom image sequence; the second row is the object model at each rame. At rame n, the model contains 5 eatures(1-5). In the 5 rames(n ~ n+4), the blue eature(5) is observed only twice, so it is selected as an unstable eature and deleted rom the model(second row) at rame n+4. And at rame n+3, the new eature 6 has been persistently observed or 3 consecutive rames, so, it is a stable eature, and is selected rom candidate set. (For simplicity, the candidate set is not drawn in the igure) The model eature addition and deletion is a survival o the ittest scheme. Figure 2 is a simpliied demonstration o this model update process. It always keeps good eatures (in terms o stableness) in the model. This competitive strategy greatly enhances the lexibility o the model, which makes it more suitable or tracking under appearance and pose changes. 4.2 Likelihood The likelihood unction PI ( t Xt) describes how the underlying state Xt its the observation I t. We propose a eaturebased likelihood unction that is computed as the matching score o the object representation with the observation. The likelihood computation is a graph matching problem Graph matching ormulation To handle the case o dierent numbers o eatures in two graphs, we add dummy nodes (null) into both observation graph node set and model graph node set.that is, or the matching unction F, we allow the eature to match to dummy node when no good match can be ound. The likelihood can be measured as the similarity o the observed data and the model graph (both matched node similarity and matched edge similarity). We assume the observation model to be: Gi = Mi + ei (); r Gi = r Mi + e'( i ) ; where Gi and Mi are observation graph and model graph eatures, rgi and rmi are observation graph and model graph relations, ei () and e'( i) are zero mean independent Gaussian noise. The likelihood can be ormalized as a Gibbs distribution with energy: EG ( X) = EG ( F) = E1( F( )) + α E2( r Fr ( )) (6) G r Gr where P1 and P2 are the eature and edge potentials deined below, and α is the coeicient to balance the eature matching and relation matching: K1 k k 2 k 2 k [ i F( i) ] /[ σ ] i F( i) isnotnull E1( i F( i)) = (7) i= 1 k Pv1 i F( i) null Where F( i) is the corresponding node o i and K 1 is the number o attributes associated with the node i. M k i is the k 2 k th component o the attributes, and ( σ i ) is the variance o its Gaussian noise distribution. I the matching o a eature is a dummy node (null), we assign a large penalty P v1. K2 k k 2 k 2 k [ ri F( ri) ] /[ σr] i F( ri) isnot null Er 1( i Fr ( i)) = (8) i= 1 k P2( ri F( ri)) = Pv2 i F( ri) is null

5 Where F( r i) is the corresponding edge o ri and K 2 is the number o properties associated with the edge e i, k ei is the k 2 k thcomponent o the edge properties, and ( σ 2 ) is the variance o Gaussian noise distribution. I the edge is matched to a dummy edge, it is assigned a penalty P v Relaxation labeling or graph matching Generally speaking, graph matching is an NP-hard problem. Local optimal search techniques are oten used in real applications, whose perormance highly depends on the initial solution. Thus, in the irst stage, we use eature distance to initialize the graph matching (without considering the relations), ollowed by a relaxation labeling process to reine the match. We irst construct a larger graph than that in the previous state, then using sub-graph matching to match the model with this graph, as shown in igure 2. To solve such a matching problem, we use a relaxation labeling method. The basic idea is to use iterated local context updates to achieve a globally consistent result. Details about relaxation labeling can be ound at [23], [19]. Figure 3. Graph matching. Let is the model graph. Right is the constructed larger graph than that in the previous state at the region around the predicted position. The sub-graph matching algorithm matches the model graph with this larger graph. The relaxation labeling method considers the compatibility o label probabilities as constraints in the labeling algorithm, i.e., consider its neighbors. The compatibility Cij( ' k, ' l) is deined as the conditional probability that eature i has a label ' k given that eature j has a label 'l, i.e. Cij( ' k, ' l) = P( ' k ' l). Thus, updating the probabilities o labels is done by considering the probabilities o labels or neighbor eatures. Let us assume that we have changed all probabilities up to some step, and we now seek an updated probability or the next step. We can estimate the change in conidence o Pi( ' k) by δ Pi( ' k) = wij Cij( ' k, ' l) Pj( ' l) (9) j N l L where N is the number o neighbors o i, and wij is the weight o these neighbors with the constraint w ij = 1. The new probability or label Pi( ' k) in generating next step can be computed rom the values rom previous iteration using pi( ' k) = pi( ' k)[1 + δ pi( ' k)] pi( ' l)[1 + δ pi( ' l)] l (10) In this way, ater some iterations, the probability or each eature labeling will stabilize, thus the eature to eature matching is obtained. The matching score is taken as the likelihood. It also gives the approximate graph state in a maximum likelihood sense. 5 Implementation and experiment results We implemented our algorithm on Pentium-4 3.2GHz machine, the computation time depends on the object complexity, i the object has rich texture, it tends to have more eatures, thus will take more time to do the graph matching and model update. Averagely, it runs at 3 rames per second using un-optimized C++ code. We tested our tracking algorithm using real video sequences with complex textures and are undergoing signiicant appearance changes and occlusions. For the eature description, we describe the eatures using a 128-bin gradient histogram. To construct the relations, we need a threshold to determine which nodes are neighbors, the larger the threshold, the more edges the graph will include, i it is very large, the graph will become a complete graph. I it is too small, there will be very ew edges; the structural inormation is not accurately described. The parameters are tuned by testing. For the parameters τ a, τ d, we both set to be 0.8, means that only those eatures that are really stable/unstable, are selected or addition/deletion. For those matching thresholds, we try dierent measure and the ix it, and works well or other scenes. Figure 4, 5, 6, 7 are the tracking results. They demonstrate that using our method, the object can be robustly tracked even through severe appearance changes and occlusion. We compared our method with CamShit which is implemented in OpenCV library. In igure 4, the coee pot is rotating when moving, it s appearance is very complex (highly textured) and changes dramatically(rom dark color to light then to dark again), the background is also very clutter. The upper row(with white cross line as the object locator) shows the tracking result o Cam- Shit. It completely loses track within ten rames. One reason is that CamShit representation is a holistic color histogram; it loses much inormation (highly textured area) which is crucial or correct tracking ( Camshit is good at tracking regions with homogeneous color). Another reason is that the histogram based representation cannot well adapt to the dramatic appearance changes. Our representation can model such details using local eature graph, and the adaptive model update algorithm can elegantly adapt to such changes and keep track. The lower row is our results with the red rectangle as the tracked object. In igure5, the pedestrian o interest is walking. The sun casts shadow on the tracked pedestrian that causes dramatic appearance

6 changes, but the invariant eature representation guarantees it s correctness or tracking. In the second row, the pedestrian is gradually occluded by another wearing a white shirt. However, our graph dynamics can be adaptively updated, keeping the good eatures into the model. This is because the occluding object is not stable with the object movements. The eatures o occluding object will change; they will only have a small chance to come into the candidate eature set, and an even smaller chance to be added into the model graph. Thus the proposed algorithm can track through partial occlusion and illumination changes. The Camshit will lose track ater a ew rames. Figure 6 shows a vehicle tracking result under severe weather conditions, the vehicle appearances are similar to the background that even human eye cannot easily distinguish them. Also the object made a sharp turn, which changes its pose signiicantly; however, our eature tracker can robustly handle such cases. Figure 7 shows a vehicle tracking with heavy occlusions. Almost 80% part o the vehicle is occluded, but our tracker can keep track o the object. This is because the vehicle in this video has a very salient and stable eature, which is a gray color along the car s door and ront window; it is highly distinctive with background. The detected eature alone is enough to locate the object even the trees may occlude most o other parts o the vehicle. 6 Conclusions In this paper, we presented a novel tracking ramework based on our graph-based object representation. The object is described using a collection o SIFT eatures; the relations between eatures are encoded as the edges in the attributed graph. We also model the graph dynamics with the eature/relation addition and deletion which provides a competitive mechanism that can always keep the stable eatures in the model. The likelihood computation is ormulated into a graph matching problem that can be eiciently solved using relaxation labeling. Experiments have demonstrated the power o our eature-based tracker under signiicant appearance changes, pose changes, occlusions and so on. However, our method relies on the local eature stableness, i the eatures are very unstable, it will degrade the perormance o our tracker. From the experiments, we can see that the eature stability over time is crucial to tracking. We see this as starting point o our uture research in the ollowing directions. (1)Come up new eatures to be suitable or tracking in terms o distinctiveness and stability. (2) Real-time tracking with salient eatures Using only a ew stable salient eatures, tracking can be perormed very ast. It is possible to use salient eatures to build a tracker that can simultaneously track a dozen or even more objects at real-time. Reerences [1] S. Belongie, J. Malik & J. Puzicha, "Shape Matching and Object Recognition Using Shape Contexts", PAMI 24(4): , 2002 [2] Horst Bunke, Bruno T. Messmer: Eicient Attributed Graph Matching and Its Application to Image Analysis. ICIAP: [3] Carmichael and M. Hebert. Shape-based recognition wiry objects. In CVPR03 II, pages: , [4] R.T. Collins and Y. Liu. "On-Line Selection o Discriminative Tracking Features", ICCV '03, Nice, France. October, pp [5]D.Comaniciu, Kernel-based Object Tracking, IEEE Trans.Pattern Anal. Machine Intell., vol. 25,no.5,pp ,2003 [6] R.Deriche, O.Faugeras, Tracking line segments Image and Vision Computing,ECCV,1990 Vol 8(4) [7] M A Eshera, K S Fu An image understanding system using attributed symbolic representation and inexact graph-matching IEEE PAMI Volume8, Issue 5 Pages: (September 1986) [8] M. Isard and A. Blake, "Contour tracking by stochastic propagation o conditional density" n, pp , Cambridge UK, [9] A.Johnson and M.Hebert. Object recognition by matching oriented points. CVPR, pp ,1997 [10]Jurie, F. & Schmid, C Scale-invariant shape eatures or recognition o object categories CVPR, 2004 [11] Kadir, T. and Brady, M. Scale, Saliency and Image description n International Journal o Computer Vision. 45 (2):83-105, November [12]Svetlana Lazebnik, Cordelia Schmid, Jean Ponce "Aine-Invariant Local Descriptors and Neighborhood Statistics or Texture Recognition" ICCV 2003 [13]David G. Lowe Distinctive image eatures rom scale-invariant keypoints, International Journal o Computer Vision, 60, 2, pp ,2004 [14]I. Matthews, T. Ishikawa, and S. Baker The Template Update Problem IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 26, No. 6, June, 2004, pp [15] K. Mikolajczk, A. Zisserman and C. Schmid Shape recognition with edge-based eatures. In British Machine Vision Conerence, September 2003 [16]K. Mikolajczyk, C. Schmid, "A perormance evaluation o local descriptors", CVPR '03 [17]Shams, L., Kamitani, Y., & Shimojo, S Graphmatching vs. entropy-based methods or object detection. Neural Networks Vol. 14, pp (2000). [18]Hai Tao, Harpreet S. Sawhney, Rakesh Kumar, "Object tracking with Bayesian estimation o dynamic layer representations," IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), vol. 24, no. 1, pp , 2002.

7 Figure 4 Tracking o moving coee pot. The upper row (with white cross line) shows the tracking result o CamShit, The lower row (with red rectangle as the object identiier) is our results with the red rectangle as the tracked object position. (Note that the highly textured object is undergoing dramatic appearance changes).

8 Figure 5 Tracking o a pedestrian under signiicant appearance changes and heavy occlusion The odd rows are the results o CamShit method, with the while line cross as the tracked object position. Figure a-1 is the tracked result and a-2 is the ground truth o the object (scaled up). The even rows are the results o our method; the red rectangles are the tracked position, and the model and clipped tracking results with SIFT eatures(blue line with arrow) overlaid are shown on the side. Figure 6, Tracking o a vehicle under signiicant appearance changes Figure 7. Tracking o a vehicle under heavy occlusion and view-point changes.

ROBUST FACE DETECTION UNDER CHALLENGES OF ROTATION, POSE AND OCCLUSION

ROBUST FACE DETECTION UNDER CHALLENGES OF ROTATION, POSE AND OCCLUSION ROBUST FACE DETECTION UNDER CHALLENGES OF ROTATION, POSE AND OCCLUSION Phuong-Trinh Pham-Ngoc, Quang-Linh Huynh Department o Biomedical Engineering, Faculty o Applied Science, Hochiminh University o Technology,

More information

Gesture Recognition using a Probabilistic Framework for Pose Matching

Gesture Recognition using a Probabilistic Framework for Pose Matching Gesture Recognition using a Probabilistic Framework or Pose Matching Ahmed Elgammal Vinay Shet Yaser Yacoob Larry S. Davis Computer Vision Laboratory University o Maryland College Park MD 20742 USA elgammalvinayyaserlsd

More information

Introduction to SLAM Part II. Paul Robertson

Introduction to SLAM Part II. Paul Robertson Introduction to SLAM Part II Paul Robertson Localization Review Tracking, Global Localization, Kidnapping Problem. Kalman Filter Quadratic Linear (unless EKF) SLAM Loop closing Scaling: Partition space

More information

CS485/685 Computer Vision Spring 2012 Dr. George Bebis Programming Assignment 2 Due Date: 3/27/2012

CS485/685 Computer Vision Spring 2012 Dr. George Bebis Programming Assignment 2 Due Date: 3/27/2012 CS8/68 Computer Vision Spring 0 Dr. George Bebis Programming Assignment Due Date: /7/0 In this assignment, you will implement an algorithm or normalizing ace image using SVD. Face normalization is a required

More information

CS 223B Computer Vision Problem Set 3

CS 223B Computer Vision Problem Set 3 CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.

More information

Classification Method for Colored Natural Textures Using Gabor Filtering

Classification Method for Colored Natural Textures Using Gabor Filtering Classiication Method or Colored Natural Textures Using Gabor Filtering Leena Lepistö 1, Iivari Kunttu 1, Jorma Autio 2, and Ari Visa 1, 1 Tampere University o Technology Institute o Signal Processing P.

More information

Distribution Fields with Adaptive Kernels for Large Displacement Image Alignment

Distribution Fields with Adaptive Kernels for Large Displacement Image Alignment MEARS et al.: DISTRIBUTION FIELDS WITH ADAPTIVE KERNELS 1 Distribution Fields with Adaptive Kernels or Large Displacement Image Alignment Benjamin Mears bmears@cs.umass.edu Laura Sevilla Lara bmears@cs.umass.edu

More information

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 231A Computer Vision (Fall 2012) Problem Set 3 CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest

More information

Face detection and recognition. Detection Recognition Sally

Face detection and recognition. Detection Recognition Sally Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification

More information

Binary Morphological Model in Refining Local Fitting Active Contour in Segmenting Weak/Missing Edges

Binary Morphological Model in Refining Local Fitting Active Contour in Segmenting Weak/Missing Edges 0 International Conerence on Advanced Computer Science Applications and Technologies Binary Morphological Model in Reining Local Fitting Active Contour in Segmenting Weak/Missing Edges Norshaliza Kamaruddin,

More information

The SIFT (Scale Invariant Feature

The SIFT (Scale Invariant Feature The SIFT (Scale Invariant Feature Transform) Detector and Descriptor developed by David Lowe University of British Columbia Initial paper ICCV 1999 Newer journal paper IJCV 2004 Review: Matt Brown s Canonical

More information

Automatic Video Segmentation for Czech TV Broadcast Transcription

Automatic Video Segmentation for Czech TV Broadcast Transcription Automatic Video Segmentation or Czech TV Broadcast Transcription Jose Chaloupka Laboratory o Computer Speech Processing, Institute o Inormation Technology and Electronics Technical University o Liberec

More information

Fast Image Matching Using Multi-level Texture Descriptor

Fast Image Matching Using Multi-level Texture Descriptor Fast Image Matching Using Multi-level Texture Descriptor Hui-Fuang Ng *, Chih-Yang Lin #, and Tatenda Muindisi * Department of Computer Science, Universiti Tunku Abdul Rahman, Malaysia. E-mail: nghf@utar.edu.my

More information

Evaluation and comparison of interest points/regions

Evaluation and comparison of interest points/regions Introduction Evaluation and comparison of interest points/regions Quantitative evaluation of interest point/region detectors points / regions at the same relative location and area Repeatability rate :

More information

Selection of Scale-Invariant Parts for Object Class Recognition

Selection of Scale-Invariant Parts for Object Class Recognition Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract

More information

Local Image Features

Local Image Features Local Image Features Ali Borji UWM Many slides from James Hayes, Derek Hoiem and Grauman&Leibe 2008 AAAI Tutorial Overview of Keypoint Matching 1. Find a set of distinctive key- points A 1 A 2 A 3 B 3

More information

Scale-invariant shape features for recognition of object categories

Scale-invariant shape features for recognition of object categories Scale-invariant shape features for recognition of object categories Frédéric Jurie and Cordelia Schmid GRAVIR, INRIA-CNRS, 655 avenue de l Europe, Montbonnot 38330, France {Frederic.Jurie, Cordelia.Schmid}@inrialpes.fr,

More information

Visual Saliency Based Object Tracking

Visual Saliency Based Object Tracking Visual Saliency Based Object Tracking Geng Zhang 1,ZejianYuan 1, Nanning Zheng 1, Xingdong Sheng 1,andTieLiu 2 1 Institution of Artificial Intelligence and Robotics, Xi an Jiaotong University, China {gzhang,

More information

Shape Descriptor using Polar Plot for Shape Recognition.

Shape Descriptor using Polar Plot for Shape Recognition. Shape Descriptor using Polar Plot for Shape Recognition. Brijesh Pillai ECE Graduate Student, Clemson University bpillai@clemson.edu Abstract : This paper presents my work on computing shape models that

More information

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Section 10 - Detectors part II Descriptors Mani Golparvar-Fard Department of Civil and Environmental Engineering 3129D, Newmark Civil Engineering

More information

Object detection using non-redundant local Binary Patterns

Object detection using non-redundant local Binary Patterns University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh

More information

OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE

OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE Wenju He, Marc Jäger, and Olaf Hellwich Berlin University of Technology FR3-1, Franklinstr. 28, 10587 Berlin, Germany {wenjuhe, jaeger,

More information

Detecting Object Instances Without Discriminative Features

Detecting Object Instances Without Discriminative Features Detecting Object Instances Without Discriminative Features Edward Hsiao June 19, 2013 Thesis Committee: Martial Hebert, Chair Alexei Efros Takeo Kanade Andrew Zisserman, University of Oxford 1 Object Instance

More information

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS Cognitive Robotics Original: David G. Lowe, 004 Summary: Coen van Leeuwen, s1460919 Abstract: This article presents a method to extract

More information

Local Feature Detectors

Local Feature Detectors Local Feature Detectors Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Slides adapted from Cordelia Schmid and David Lowe, CVPR 2003 Tutorial, Matthew Brown,

More information

Computer Vision for HCI. Topics of This Lecture

Computer Vision for HCI. Topics of This Lecture Computer Vision for HCI Interest Points Topics of This Lecture Local Invariant Features Motivation Requirements, Invariances Keypoint Localization Features from Accelerated Segment Test (FAST) Harris Shi-Tomasi

More information

A SAR IMAGE REGISTRATION METHOD BASED ON SIFT ALGORITHM

A SAR IMAGE REGISTRATION METHOD BASED ON SIFT ALGORITHM A SAR IMAGE REGISTRATION METHOD BASED ON SIFT ALGORITHM W. Lu a,b, X. Yue b,c, Y. Zhao b,c, C. Han b,c, * a College o Resources and Environment, University o Chinese Academy o Sciences, Beijing, 100149,

More information

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14 Announcements Computer Vision I CSE 152 Lecture 14 Homework 3 is due May 18, 11:59 PM Reading: Chapter 15: Learning to Classify Chapter 16: Classifying Images Chapter 17: Detecting Objects in Images Given

More information

MAPI Computer Vision. Multiple View Geometry

MAPI Computer Vision. Multiple View Geometry MAPI Computer Vision Multiple View Geometry Geometry o Multiple Views 2- and 3- view geometry p p Kpˆ [ K R t]p Geometry o Multiple Views 2- and 3- view geometry Epipolar Geometry The epipolar geometry

More information

A Feature Point Matching Based Approach for Video Objects Segmentation

A Feature Point Matching Based Approach for Video Objects Segmentation A Feature Point Matching Based Approach for Video Objects Segmentation Yan Zhang, Zhong Zhou, Wei Wu State Key Laboratory of Virtual Reality Technology and Systems, Beijing, P.R. China School of Computer

More information

Designing Applications that See Lecture 7: Object Recognition

Designing Applications that See Lecture 7: Object Recognition stanford hci group / cs377s Designing Applications that See Lecture 7: Object Recognition Dan Maynes-Aminzade 29 January 2008 Designing Applications that See http://cs377s.stanford.edu Reminders Pick up

More information

Research on Image Splicing Based on Weighted POISSON Fusion

Research on Image Splicing Based on Weighted POISSON Fusion Research on Image Splicing Based on Weighted POISSO Fusion Dan Li, Ling Yuan*, Song Hu, Zeqi Wang School o Computer Science & Technology HuaZhong University o Science & Technology Wuhan, 430074, China

More information

Face Detection for Automatic Avatar Creation by using Deformable Template and GA

Face Detection for Automatic Avatar Creation by using Deformable Template and GA Face Detection or Automatic Avatar Creation by using Deormable Template and GA Tae-Young Park*, Ja-Yong Lee **, and Hoon Kang *** * School o lectrical and lectronics ngineering, Chung-Ang University, Seoul,

More information

TA Section 7 Problem Set 3. SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999)

TA Section 7 Problem Set 3. SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999) TA Section 7 Problem Set 3 SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999) Sam Corbett-Davies TA Section 7 02-13-2014 Distinctive Image Features from Scale-Invariant

More information

Implementing the Scale Invariant Feature Transform(SIFT) Method

Implementing the Scale Invariant Feature Transform(SIFT) Method Implementing the Scale Invariant Feature Transform(SIFT) Method YU MENG and Dr. Bernard Tiddeman(supervisor) Department of Computer Science University of St. Andrews yumeng@dcs.st-and.ac.uk Abstract The

More information

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT Oct. 15, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John Wawrzynek)

More information

Object Tracking with an Adaptive Color-Based Particle Filter

Object Tracking with an Adaptive Color-Based Particle Filter Object Tracking with an Adaptive Color-Based Particle Filter Katja Nummiaro 1, Esther Koller-Meier 2, and Luc Van Gool 1,2 1 Katholieke Universiteit Leuven, ESAT/VISICS, Belgium {knummiar,vangool}@esat.kuleuven.ac.be

More information

Face detection in a video sequence - a temporal approach

Face detection in a video sequence - a temporal approach Face detection in a video sequence - a temporal approach K. Mikolajczyk R. Choudhury C. Schmid INRIA Rhône-Alpes GRAVIR-CNRS, 655 av. de l Europe, 38330 Montbonnot, France {Krystian.Mikolajczyk,Ragini.Choudhury,Cordelia.Schmid}@inrialpes.fr

More information

A Novel Algorithm for Color Image matching using Wavelet-SIFT

A Novel Algorithm for Color Image matching using Wavelet-SIFT International Journal of Scientific and Research Publications, Volume 5, Issue 1, January 2015 1 A Novel Algorithm for Color Image matching using Wavelet-SIFT Mupuri Prasanth Babu *, P. Ravi Shankar **

More information

Det De e t cting abnormal event n s Jaechul Kim

Det De e t cting abnormal event n s Jaechul Kim Detecting abnormal events Jaechul Kim Purpose Introduce general methodologies used in abnormality detection Deal with technical details of selected papers Abnormal events Easy to verify, but hard to describe

More information

Improving Alignment of Faces for Recognition

Improving Alignment of Faces for Recognition Improving Alignment o Faces or Recognition Md. Kamrul Hasan Département de génie inormatique et génie logiciel École Polytechnique de Montréal, Québec, Canada md-kamrul.hasan@polymtl.ca Christopher J.

More information

Neighbourhood Operations

Neighbourhood Operations Neighbourhood Operations Neighbourhood operations simply operate on a larger neighbourhood o piels than point operations Origin Neighbourhoods are mostly a rectangle around a central piel Any size rectangle

More information

Particle Filtering. CS6240 Multimedia Analysis. Leow Wee Kheng. Department of Computer Science School of Computing National University of Singapore

Particle Filtering. CS6240 Multimedia Analysis. Leow Wee Kheng. Department of Computer Science School of Computing National University of Singapore Particle Filtering CS6240 Multimedia Analysis Leow Wee Kheng Department of Computer Science School of Computing National University of Singapore (CS6240) Particle Filtering 1 / 28 Introduction Introduction

More information

Color Image Segmentation

Color Image Segmentation Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.

More information

CRF Based Point Cloud Segmentation Jonathan Nation

CRF Based Point Cloud Segmentation Jonathan Nation CRF Based Point Cloud Segmentation Jonathan Nation jsnation@stanford.edu 1. INTRODUCTION The goal of the project is to use the recently proposed fully connected conditional random field (CRF) model to

More information

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014 SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014 SIFT SIFT: Scale Invariant Feature Transform; transform image

More information

Motion illusion, rotating snakes

Motion illusion, rotating snakes Motion illusion, rotating snakes Local features: main components 1) Detection: Find a set of distinctive key points. 2) Description: Extract feature descriptor around each interest point as vector. x 1

More information

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882 Matching features Building a Panorama Computational Photography, 6.88 Prof. Bill Freeman April 11, 006 Image and shape descriptors: Harris corner detectors and SIFT features. Suggested readings: Mikolajczyk

More information

Deformation Invariant Image Matching

Deformation Invariant Image Matching Deformation Invariant Image Matching Haibin Ling David W. Jacobs Center for Automation Research, Computer Science Department University of Maryland, College Park {hbling, djacobs}@ umiacs.umd.edu Abstract

More information

A Review of Evaluation of Optimal Binarization Technique for Character Segmentation in Historical Manuscripts

A Review of Evaluation of Optimal Binarization Technique for Character Segmentation in Historical Manuscripts 010 Third International Conerence on Knowledge Discovery and Data Mining A Review o Evaluation o Optimal Binarization Technique or Character Segmentation in Historical Manuscripts Chun Che Fung and Rapeeporn

More information

Switching Hypothesized Measurements: A Dynamic Model with Applications to Occlusion Adaptive Joint Tracking

Switching Hypothesized Measurements: A Dynamic Model with Applications to Occlusion Adaptive Joint Tracking Switching Hypothesized Measurements: A Dynamic Model with Applications to Occlusion Adaptive Joint Tracking Yang Wang Tele Tan Institute for Infocomm Research, Singapore {ywang, telctan}@i2r.a-star.edu.sg

More information

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1 Feature Detection Raul Queiroz Feitosa 3/30/2017 Feature Detection 1 Objetive This chapter discusses the correspondence problem and presents approaches to solve it. 3/30/2017 Feature Detection 2 Outline

More information

Road Sign Analysis Using Multisensory Data

Road Sign Analysis Using Multisensory Data Road Sign Analysis Using Multisensory Data R.J. López-Sastre, S. Lauente-Arroyo, P. Gil-Jiménez, P. Siegmann, and S. Maldonado-Bascón University o Alcalá, Department o Signal Theory and Communications

More information

Study and Analysis of Edge Detection and Implementation of Fuzzy Set. Theory Based Edge Detection Technique in Digital Images

Study and Analysis of Edge Detection and Implementation of Fuzzy Set. Theory Based Edge Detection Technique in Digital Images Study and Analysis o Edge Detection and Implementation o Fuzzy Set Theory Based Edge Detection Technique in Digital Images Anju K S Assistant Proessor, Department o Computer Science Baselios Mathews II

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

Digital Image Processing. Image Enhancement in the Spatial Domain (Chapter 4)

Digital Image Processing. Image Enhancement in the Spatial Domain (Chapter 4) Digital Image Processing Image Enhancement in the Spatial Domain (Chapter 4) Objective The principal objective o enhancement is to process an images so that the result is more suitable than the original

More information

Face Recognition using Hough Peaks extracted from the significant blocks of the Gradient Image

Face Recognition using Hough Peaks extracted from the significant blocks of the Gradient Image Face Recognition using Hough Peaks extracted rom the signiicant blocks o the Gradient Image Arindam Kar 1, Debotosh Bhattacharjee, Dipak Kumar Basu, Mita Nasipuri, Mahantapas Kundu 1 Indian Statistical

More information

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing Tomoyuki Nagahashi 1, Hironobu Fujiyoshi 1, and Takeo Kanade 2 1 Dept. of Computer Science, Chubu University. Matsumoto 1200,

More information

Stable Interest Points for Improved Image Retrieval and Matching

Stable Interest Points for Improved Image Retrieval and Matching Stable Interest Points for Improved Image Retrieval and Matching Matthew Johnson and Roberto Cipolla University of Cambridge September 16, 2006 Abstract Local interest points and descriptors have been

More information

Conditional Random Fields for Object Recognition

Conditional Random Fields for Object Recognition Conditional Random Fields for Object Recognition Ariadna Quattoni Michael Collins Trevor Darrell MIT Computer Science and Artificial Intelligence Laboratory Cambridge, MA 02139 {ariadna, mcollins, trevor}@csail.mit.edu

More information

Local Features and Bag of Words Models

Local Features and Bag of Words Models 10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering

More information

Local Image Features

Local Image Features Local Image Features Computer Vision CS 143, Brown Read Szeliski 4.1 James Hays Acknowledgment: Many slides from Derek Hoiem and Grauman&Leibe 2008 AAAI Tutorial This section: correspondence and alignment

More information

Relaxing the 3L algorithm for an accurate implicit polynomial fitting

Relaxing the 3L algorithm for an accurate implicit polynomial fitting Relaxing the 3L algorithm or an accurate implicit polynomial itting Mohammad Rouhani Computer Vision Center Ediici O, Campus UAB 08193 Bellaterra, Barcelona, Spain rouhani@cvc.uab.es Angel D. Sappa Computer

More information

SUPER RESOLUTION IMAGE BY EDGE-CONSTRAINED CURVE FITTING IN THE THRESHOLD DECOMPOSITION DOMAIN

SUPER RESOLUTION IMAGE BY EDGE-CONSTRAINED CURVE FITTING IN THE THRESHOLD DECOMPOSITION DOMAIN SUPER RESOLUTION IMAGE BY EDGE-CONSTRAINED CURVE FITTING IN THE THRESHOLD DECOMPOSITION DOMAIN Tsz Chun Ho and Bing Zeng Department o Electronic and Computer Engineering The Hong Kong University o Science

More information

Topological Mapping. Discrete Bayes Filter

Topological Mapping. Discrete Bayes Filter Topological Mapping Discrete Bayes Filter Vision Based Localization Given a image(s) acquired by moving camera determine the robot s location and pose? Towards localization without odometry What can be

More information

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM Karthik Krish Stuart Heinrich Wesley E. Snyder Halil Cakir Siamak Khorram North Carolina State University Raleigh, 27695 kkrish@ncsu.edu sbheinri@ncsu.edu

More information

Lecture 10 Detectors and descriptors

Lecture 10 Detectors and descriptors Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of detectors SIFT Shape context Silvio Savarese Lecture 10-26-Feb-14 From the 3D to 2D & vice versa P =

More information

Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement

Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement Daegeon Kim Sung Chun Lee Institute for Robotics and Intelligent Systems University of Southern

More information

Video Google: A Text Retrieval Approach to Object Matching in Videos

Video Google: A Text Retrieval Approach to Object Matching in Videos Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic, Frederik Schaffalitzky, Andrew Zisserman Visual Geometry Group University of Oxford The vision Enable video, e.g. a feature

More information

ECSE-626 Project: An Adaptive Color-Based Particle Filter

ECSE-626 Project: An Adaptive Color-Based Particle Filter ECSE-626 Project: An Adaptive Color-Based Particle Filter Fabian Kaelin McGill University Montreal, Canada fabian.kaelin@mail.mcgill.ca Abstract The goal of this project was to discuss and implement a

More information

A New Feature Local Binary Patterns (FLBP) Method

A New Feature Local Binary Patterns (FLBP) Method A New Feature Local Binary Patterns (FLBP) Method Jiayu Gu and Chengjun Liu The Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA Abstract - This paper presents

More information

Local features: detection and description. Local invariant features

Local features: detection and description. Local invariant features Local features: detection and description Local invariant features Detection of interest points Harris corner detection Scale invariant blob detection: LoG Description of local patches SIFT : Histograms

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing CS 4495 Computer Vision Features 2 SIFT descriptor Aaron Bobick School of Interactive Computing Administrivia PS 3: Out due Oct 6 th. Features recap: Goal is to find corresponding locations in two images.

More information

3-D TERRAIN RECONSTRUCTION WITH AERIAL PHOTOGRAPHY

3-D TERRAIN RECONSTRUCTION WITH AERIAL PHOTOGRAPHY 3-D TERRAIN RECONSTRUCTION WITH AERIAL PHOTOGRAPHY Bin-Yih Juang ( 莊斌鎰 ) 1, and Chiou-Shann Fuh ( 傅楸善 ) 3 1 Ph. D candidate o Dept. o Mechanical Engineering National Taiwan University, Taipei, Taiwan Instructor

More information

Bridging the Gap Between Local and Global Approaches for 3D Object Recognition. Isma Hadji G. N. DeSouza

Bridging the Gap Between Local and Global Approaches for 3D Object Recognition. Isma Hadji G. N. DeSouza Bridging the Gap Between Local and Global Approaches for 3D Object Recognition Isma Hadji G. N. DeSouza Outline Introduction Motivation Proposed Methods: 1. LEFT keypoint Detector 2. LGS Feature Descriptor

More information

SHIP RECOGNITION USING OPTICAL IMAGERY FOR HARBOR SURVEILLANCE

SHIP RECOGNITION USING OPTICAL IMAGERY FOR HARBOR SURVEILLANCE SHIP RECOGNITION USING OPTICAL IMAGERY FOR HARBOR SURVEILLANCE Dr. Patricia A. Feineigle, Dr. Daniel D. Morris, and Dr. Franklin D. Snyder General Dynamics Robotic Systems, 412-473-2159 (phone), 412-473-2190

More information

Computer Vision I - Filtering and Feature detection

Computer Vision I - Filtering and Feature detection Computer Vision I - Filtering and Feature detection Carsten Rother 30/10/2015 Computer Vision I: Basics of Image Processing Roadmap: Basics of Digital Image Processing Computer Vision I: Basics of Image

More information

Object Recognition with Invariant Features

Object Recognition with Invariant Features Object Recognition with Invariant Features Definition: Identify objects or scenes and determine their pose and model parameters Applications Industrial automation and inspection Mobile robots, toys, user

More information

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

SCALE INVARIANT FEATURE TRANSFORM (SIFT) 1 SCALE INVARIANT FEATURE TRANSFORM (SIFT) OUTLINE SIFT Background SIFT Extraction Application in Content Based Image Search Conclusion 2 SIFT BACKGROUND Scale-invariant feature transform SIFT: to detect

More information

Local Features Tutorial: Nov. 8, 04

Local Features Tutorial: Nov. 8, 04 Local Features Tutorial: Nov. 8, 04 Local Features Tutorial References: Matlab SIFT tutorial (from course webpage) Lowe, David G. Distinctive Image Features from Scale Invariant Features, International

More information

Definition, Detection, and Evaluation of Meeting Events in Airport Surveillance Videos

Definition, Detection, and Evaluation of Meeting Events in Airport Surveillance Videos Definition, Detection, and Evaluation of Meeting Events in Airport Surveillance Videos Sung Chun Lee, Chang Huang, and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA sungchun@usc.edu,

More information

Motion Estimation and Optical Flow Tracking

Motion Estimation and Optical Flow Tracking Image Matching Image Retrieval Object Recognition Motion Estimation and Optical Flow Tracking Example: Mosiacing (Panorama) M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 Example 3D Reconstruction

More information

Detecting and Segmenting Humans in Crowded Scenes

Detecting and Segmenting Humans in Crowded Scenes Detecting and Segmenting Humans in Crowded Scenes Mikel D. Rodriguez University of Central Florida 4000 Central Florida Blvd Orlando, Florida, 32816 mikel@cs.ucf.edu Mubarak Shah University of Central

More information

TEXTURE CLASSIFICATION METHODS: A REVIEW

TEXTURE CLASSIFICATION METHODS: A REVIEW TEXTURE CLASSIFICATION METHODS: A REVIEW Ms. Sonal B. Bhandare Prof. Dr. S. M. Kamalapur M.E. Student Associate Professor Deparment of Computer Engineering, Deparment of Computer Engineering, K. K. Wagh

More information

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of

More information

Image Features: Detection, Description, and Matching and their Applications

Image Features: Detection, Description, and Matching and their Applications Image Features: Detection, Description, and Matching and their Applications Image Representation: Global Versus Local Features Features/ keypoints/ interset points are interesting locations in the image.

More information

3D Hand and Fingers Reconstruction from Monocular View

3D Hand and Fingers Reconstruction from Monocular View 3D Hand and Fingers Reconstruction rom Monocular View 1. Research Team Project Leader: Graduate Students: Pro. Isaac Cohen, Computer Science Sung Uk Lee 2. Statement o Project Goals The needs or an accurate

More information

Xavier: A Robot Navigation Architecture Based on Partially Observable Markov Decision Process Models

Xavier: A Robot Navigation Architecture Based on Partially Observable Markov Decision Process Models Xavier: A Robot Navigation Architecture Based on Partially Observable Markov Decision Process Models Sven Koenig and Reid G. Simmons Carnegie Mellon University School o Computer Science Pittsburgh, PA

More information

A Comparison and Matching Point Extraction of SIFT and ISIFT

A Comparison and Matching Point Extraction of SIFT and ISIFT A Comparison and Matching Point Extraction of SIFT and ISIFT A. Swapna A. Geetha Devi M.Tech Scholar, PVPSIT, Vijayawada Associate Professor, PVPSIT, Vijayawada bswapna.naveen@gmail.com geetha.agd@gmail.com

More information

The most cited papers in Computer Vision

The most cited papers in Computer Vision COMPUTER VISION, PUBLICATION The most cited papers in Computer Vision In Computer Vision, Paper Talk on February 10, 2012 at 11:10 pm by gooly (Li Yang Ku) Although it s not always the case that a paper

More information

Local features and image matching. Prof. Xin Yang HUST

Local features and image matching. Prof. Xin Yang HUST Local features and image matching Prof. Xin Yang HUST Last time RANSAC for robust geometric transformation estimation Translation, Affine, Homography Image warping Given a 2D transformation T and a source

More information

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58 Image Features: Local Descriptors Sanja Fidler CSC420: Intro to Image Understanding 1/ 58 [Source: K. Grauman] Sanja Fidler CSC420: Intro to Image Understanding 2/ 58 Local Features Detection: Identify

More information

Online Spatial-temporal Data Fusion for Robust Adaptive Tracking

Online Spatial-temporal Data Fusion for Robust Adaptive Tracking Online Spatial-temporal Data Fusion for Robust Adaptive Tracking Jixu Chen Qiang Ji Department of Electrical, Computer, and Systems Engineering Rensselaer Polytechnic Institute, Troy, NY 12180-3590, USA

More information

Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi

Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi hrazvi@stanford.edu 1 Introduction: We present a method for discovering visual hierarchy in a set of images. Automatically grouping

More information

Generic Face Alignment Using an Improved Active Shape Model

Generic Face Alignment Using an Improved Active Shape Model Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 6 Role of context in object detection Objects and cognition Ambiguous objects Unfavorable viewing condition Context helps in object

More information

Supervised texture detection in images

Supervised texture detection in images Supervised texture detection in images Branislav Mičušík and Allan Hanbury Pattern Recognition and Image Processing Group, Institute of Computer Aided Automation, Vienna University of Technology Favoritenstraße

More information

A Research on Moving Human Body Detection Based on the Depth Images of Kinect

A Research on Moving Human Body Detection Based on the Depth Images of Kinect Sensors & Transducers 2014 by IFSA Publishing, S. L. http://www.sensorsportal.com A Research on Moving Human Body Detection Based on the Depth Images o Kinect * Xi an Zhu, Jiaqi Huo Institute o Inormation

More information

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints Last week Multi-Frame Structure from Motion: Multi-View Stereo Unknown camera viewpoints Last week PCA Today Recognition Today Recognition Recognition problems What is it? Object detection Who is it? Recognizing

More information