3D Hand Pose Reconstruction Using Specialized Mappings

Size: px
Start display at page:

Download "3D Hand Pose Reconstruction Using Specialized Mappings"

Transcription

1 Boston University Coputer Science Tech. Report No ,Dec (revised Apr. 2001). To Appear in Proc. IEEE International Conf. on Coputer Vision (ICCV). Canada. Jul D and Pose Reconstruction Using Specialized Mappings Róer Rosales, Vassilis Athitsos, Leonid Sigal, and Stan Sclaroff Boston University, Coputer Science Departent 111 Cuington St., Boston, MA eail: Abstract A syste for recovering 3D hand pose fro onocular color sequences is proposed. The syste eploys a non-linear supervised learning fraework, the specialized appings architecture (SMA), to ap iage features to likely 3D hand poses. The SMA s fundaental coponents are a set of specialized forward apping functions, and a single feedback atching function. The forward functions are estiated directly fro training data, which in our case are exaples of hand joint configurations and their corresponding visual features. The joint angle data in the training set is obtained via a Cyberlove, a glove with 22 sensors that onitor the angular otions of the pal and fingers. In training, the visual features are generated using a coputer graphics odule that renders the hand fro arbitrary viewpoints given the 22 joint angles. The viewpoint is encoded by two real values, therefore 24 real values represent a hand pose. We test our syste both on synthetic sequences and on sequences taken with a color caera. The syste autoatically detects and tracks both hands of the user, calculates the appropriate features, and estiates the 3D hand joint angles and viewpoint fro those features. Results are encouraging given the coplexity of the task. 1 Introduction The estiation of hand pose fro visual cues is a key proble in the developent of intuitive, non-intrusive huancoputer interfaces. The shape and otion of the hand during a gesture can be used to recognize the gesture and classify it as a eber of a predefined class. The iportance of hand pose estiation is evident in other areas as well; e.g.,video coding, video indexing/retrieval, sign language understanding, coputer-aided otion analysis for ergonoics, etc. In this paper, we address the proble of recovering 3D hand pose fro a onocular color sequence. Our solution to this proble akes use of concepts fro stochastic visual segentation, coputer graphics, and non-linear supervised learning. Our contribution is an autoatic syste that tracks the hand and estiates its 3D configuration on every frae, that does not ipose any restrictions on the hand shape, does not require anual initialization, and can easily recover fro estiation errors. 2 Related Work Several existing systes include autoated hand detection and tracking. Such systes typically ake restrictive assuptions on the doain: only hands ove, the hands are the fastest oving objects in the scene [17, This work was supported in part through Office of Naval Research Young Investigator Award N , and National Science Foundation grants IIS and EIA Figure 1: and pose estiation overview. 38, 40, 3, 21, 23], hands are skin colored, or they are the only skin-colored objects in the scene [21, 34]. Often the background is assued to be static, and known [21, 8]. Soe systes use such assuptions to obtain several possible regions where the hands are, and use atching with appearance-based odels to choose aong those regions [38, 9]. Stochastic tools, such as Kalan filtering [34, 38, 40], can be used to predict the hand position in a future frae. Overall, hand detection and tracking algoriths tend to perfor well in restricted environents, where assuptions about the nuber, location, appearance and otion of hands are valid, and the background is known. Reliable perforance in ore general doains, is still beyond the current state of the art. Previous systes representation of hand pose varies widely. For certain applications, hand trajectories can be sufficient for gesture classification [3, 23]. owever, in soe doains, knowledge of ore detailed hand configuration ust be used to disabiguate between different gestures; e.g.,in signed languages. Pose can be estiated in 2D or 3D. Most 2D-based approaches try to atch the iage of the hand with view-based odels corresponding to a liited nuber of predefined hand poses [9, 4, 38, 11, 17, 35, 34]. In [20] the condensation algorith is used to track the index and thub of a hand. Such ethods are valid in restricted doains, in which users are observed fro a known viewpoint, perforing a liited variety of otions. One liitation in view-based ethods is that pose recog-

2 nition is not viewpoint invariant. Iages of the sae 3D hand shape fro different viewpoints, or even rotated exaples of the sae iage would be considered different poses. Soe of those liits have been addressed by using ultiple caeras [35], and stereo [8]; naturally, such ethods will not work in onocular sequences. Our approach will avoid this liitation through the use of probabilistic odeling, Specialized Mappings (SMA), to ap iage features to likely 3D hand poses. A related approach to SMA is described in [39], where a syste is trained with views corresponding to any different hand orientations and viewpoints. Soe training views are labeled with the 3D pose category they correspond to, but ost of the are unlabeled. The categories of the unlabeled data are treated as issing values in a D-EM (Discriinant Expectation-Maxiization) fraework. The syste can recognize 14 hand configurations, observed fro a variety of viewpoints. A difference between that approach and ours is that, in their syste, the configuration estiation is forulated as a classification proble, in which a finite nuber of classes are defined. Our SMA approach is based on regression rather than classification, allowing for theoretically continuous solutions of the estiation proble. Soeties, such continuous solutions are preferable to siply recognizing a liited nuber of classes. For exaple, in a virtual reality application, we ay want to accurately reconstruct the hand of the user in the virtual environent and estiate the effects of that particular configuration on the environent. Even in cases where the ultiate goal is classification, accurate 3D inforation can iprove recognition by aking it robust to viewpoint variations. An iportant decision in estiating 3D pose is the representation and paraeterization. Link-and-joint odels are used by [25, 31], whereas a esh odel is used by [10]. In those three systes, the hand configuration at the beginning of a sequence ust be known a priori. In addition, self-occlusions and fast otions ake it hard to aintain accuracy while tracking. Our proposed SMA approach avoids these drawbacks. SMA is related to achine learning odels [16, 12, 6, 28] that use the principle of divide-and-conquer to reduce the coplexity of the learning proble by splitting it into several sipler ones. In general these algoriths try to fit surfaces to the observed data by (1) splitting the input space into several regions, and (2) approxiating sipler functions to fit the input-output relationship inside these regions. The splitting process ay create a new proble: how to optially partition the proble such that we obtain several sub-probles that can be solved using the specific solver capabilities (i.e.,for of apping functions). In SMA s, we address this proble by solving for the partitions and the appings siultaneously. In the work of [6], hard splits of the data were used, i.e.,the paraeters in one region only depend on the data falling in that region. In [16], soe of the drawbacks of the hard-split approach were pointed out (e.g.,increase in the variance of the estiator), and an architecture that uses soft splits of the data, the ierarchical Mixture of Experts, was described. In this architecture, as in [12], at each level of the tree, a gating network is used to control the influence (weight) of the expert units (apping functions) to odel the data. owever, in [12] arbitrary subsets of the experts units can be chosen. Unlike these architectures, in SMA s the apping selection is done using a feedback atching process, currently in a winner-take-all fashion, but soft splitting is done during training. In applications where a feedback ap can be coputed easily and accurately, this is an iportant advantage. Also, the shape of the regions that deterine ownership to given specialized functions is general; therefore, we do not assue any fixed functional for or discriinant function to define these regions (gating networks). With respect to work on learning based approaches for estiating articulated body pose, Point Distribution Models have been applied to recovering upper-body pose fro silhouettes or skin-colored blobs [1, 24]. In [13], a aussian probability odel for short huan otion sequences was built. owever, this ethod assues that 2D tracking of joints in the iage is given. In [2], the anifold of huan body configurations was odeled via a hidden Markov odel and learned via entropy iniization. In [33] dynaic prograing is used to calculate the best global labeling of the joint probability density function of the position and velocity of body features; it was also assued that it is possible to track these features for pairs of fraes. These last three approaches odel the dynaics of otion, a proble that in general requires uch ore training data to build a reasonable approxiation to the underlying probability distribution. 3 Overview An overview of our approach can be seen in Fig. 1. First is first trained, given a nuber of exaple hand joint configurations are acquired using a Cyberlove (at approx. 15 z). The Cyberlove easures 22 angular DOF of the hand. Coputer graphics software can be used to render a shaded view of any hand configuration captured by the Cyberlove. Using this coputer graphics rendering function, we can generate a unifor sapling (with size ) on the whole view sphere, and render views (iages) of every hand configuration fro all sapled viewpoints. We can then use iage processing to extract visual feature vector fro each of the iages generated; in our case we extract oent based-features, but other features are possible [13]. This process yields a set, where is each of the hand joint configurations fro each viewpoint 1, and, where is a vector of visual features corresponding to each. These sets and constitute saples fro the inputoutput relationship that we will attept to learn using our architecture. iven a new iage of a hand, we will copute its visual feature vector. We then copute the apping fro to the ost likely 24 DOF hand configuration. Note that this apping is highly abiguous. In fact the relationship is any to any; therefore no single function can perfor this task. Using the Specialized Mapping Architecture (SMA), we split (partition) this apping into any appings. Each of these hopefully sipler probles is then solved using a different specialized function. The SMA learning schee solves for partitions and appings siultaneously. The SMA tries to learn a ultiple apping so that, when perforing inference, given a vector of visual features, an output in the output space of hand configurations can be 1 This vector is then coposed of 22 internal pose paraeters plus two global orientation paraeters.

3 provided. On the right colun of Fig. 1, a diagra of the inference process is shown. First video input is obtained, and using a segentation odule, regions with high likelihood of being skin colored are found. Fro these regions we extract visual features (e.g.,oents). Then the given vector of visual features is presented to SMA, which generates several output estiates, one of which is chosen using a defined cost function. Most of the details, including the processes of learning and inference by SMA are presented in the following sections. Our approach can easily integrate different choices of features. Furtherore, the sae approach can be used to estiate the pose of articulated objects other than hands. 4 and Shape Representation The hand odel that we use is ipleented in the Virtualand prograing library [36]. The paraeters of the odel are 22 joint angles. For the index, iddle, ring and pinky finger, there is an angle for each of the distal, proxial and etacarpophalangeal joints. For the thub, there is an inner joint angle, an outer joint angle and two angles for the trapezioetacarpal joint. There are also abduction angles between the following pairs of successive fingers: index/iddle, iddle/ring and ring/pinky. Finally, there is an angle for the pal arch, an angle easuring wrist flexion and an angle easuring wrist bending towards the pinky finger. The Virtualand library provides tools that can render an artificial hand fro an arbitrary viewpoint, given values for the 22 angles. Fig. 3 shows exaples of hand renderings. Using a Cyberlove (anufactured by VirtualTechnologies) we collected about 2,400 exaples of hand poses (paraeterized as vectors containing the 22 angles). We rendered each pose fro 86 different viewpoints. Those viewpoints fored an approxiately uniforly distributed set on the surface of a sphere centered at the hand. The synthetic iages obtained this way were used for training and testing as described in the experiental results. The Virtualand library was also used to reconstruct the estiated 3D hand shape for testing data, based on the output of our syste. 5 Learning Algorith The estiation paradig used in this work consist of apping the observed low-level visual features to hand joint configurations. The underlying approach for finding this apping is based on the Specialized Mappings Architecture (SMA), a non-linear supervised learning architecture. iven an input and output space and respectively, SMA s consist of several specialized forward apping functions and a feedback atching function, which in this case is known (visual features can be obtained given the joint configurations by using coputer graphics based rendering). In order to estiate these appings, we use a supervised learning approach with training data, with! #" an input-output pair (visual features and hand joint angles respectively). Our architecture generates a series of $ functions % in which each of these functions is specialized to ap certain inputs (their specialized doain) better than others. The specialized doain can be for exaple a region of the input space. owever, this specialized doain of can be ore general than just a connected region in the input Figure 2: SMA diagra illustrating (a) an estiated SMA odel with & specialized functions apping subsets of the training data (each subset is drawn with a different color) and (b) the inference process in which a given observation is apped by all the specialized functions, and then a feedback atching step is perfored to choose the best of the & estiates. space. We propose to deterine these specialized doains and functions siultaneously. Fig. 2(a) illustrates the basic idea of this odel. We use different colors (gray-levels) to represent the doain of each specialized function. At initialization rando colors are assigned to each point, the goal is to find an optial apping and partition that is efficient in reducing soe error function. Once the odel has been learned our apping ay look like Fig. 2(a), in which each function is in charge of apping certain inputs only. 5.1 Probabilistic Model Let the training sets of output-input observations be ' )( +*,**, and -. /( +*,**, respectively. We will use 0 1 2" to define a given output-input training pair,.3 *** 4 represents our observed training set. Define the unobserved rando variables 5 with 6 87 ** 9 and : ;58 5 ( <**,* 5= ". In our odel the variables 5 have doain the discrete set > 87 ** $ of labels for the specialized functions, and can be thought as the func- is the tion nuber used to ap data point 6, therefore $ nuber of specialized functions in the odel. Define the odel paraeters? ;??( <***?A@CBD", where? represents the paraeters of the apping function 6. The vector B E B FCB ( ", where B represent ;5 JI? ". Using Bayes rule and assuing independence aong observations, we have the joint probability of the observed and hidden variables conditioned on our odel paraeters: ; :I? " ;KI :? " ;:I? " ML ; I 5? " (a) (b) ;5 CI? " (1)

4 7 9 L + : : 5.2 SMA Paraeter Estiation and the EM Algorith The optiization proble defined by Eq. 1 is coputationally very expensive. ere, the probabilistic paraeter estiation proble is approached under the Expectation Maxiization (EM) algorith fraework [5]. We use the notation followed by [22]. Note that Eq. 1 akes reference to a still undefined distribution # I 5 1? ". Several options had been proposed [27]. ere we will use a aussian distribution with ean defined by the error incurred in using the possibly nonlinear function as a apping function, and a variance : ;DI 5? "? " " (2) Using this distribution, the E-step consists of finding ;: " ;: " L :I? ". In our case, this factorizes as: B ; CI 5? " B ; I 5? " B!" $#%#&('*),+ $-!" $#.# B -0/!/ #%# & ' ),+ -0/!1/ #%# The M-step consists of finding? # !9;: <>=@?BADC EF 4 ;: I? "$. can be shown that:? # ! I $ # ;5 2" C EF 4 Using our odel, it ; I 5!? "1J EF 4 (5) This gives the following update rules for B and (where Lagrange ultipliers were used to incorporate the constraint B 7 ). B The update for?f depends on the for of. ere we have chosen a non-linear function of the for:?f " M ON=(4QPSR T UV ( M T # N WP UXV (3) (4) T ZY # "!" (8) where Y is the 6 K\[] coponent of the visual feature vector, ^ # and ^ ( # are weights and biases (part of?= ), N and N ( are a sigoidal and linear function respectively, _ and _ ( are the nuber of nodes in each layer, and ` is just the diension index of the output vector. This is a 1-hidden layer feed-forward network. Unfortunately, using this function (as it would be by using ost non-linear functions) forces us to use iterative optiization for the M-step. 5 I? "$2* 5 %I 1? " (6) using # ;5 " K?F "!" *K /!?F "1"L # ;5 " 5.3 Stochastic Learning The update equations described above are useful to find a local iniu given the initial values of the paraeters. In order to iprove this process, and avoid soe of the local inia that inevitably arise, we use an annealing schedule on the # probabilities during the M-step. In this way, we redefine: # 5 a " < cb de =f?a T #.#.gh # < b de =f?a #%#.gh # In our experients the teperature paraeter i decays exponentially. This step not only does help in avoiding local inia, but it also creates two desirable effects. It forces # ;5 ja " to be binary (either 7 or k ) at low teperatures, as a consequence each point will tend to be apped by only one specialized function at the end of optiization. Moreover, it akes # 5 " ( 7 ** $ ) be fairly even at high teperatures, aking the optiization less dependent on initialization. Note that there is no closed for solution for the M-step as described above. In practice we have decided to perfor two or three iterations per M-step. Another source of randoness added to the process so far described consists in choosing data points randoly uniforly distributed when perforing the M-step. These two variants of the M-step have been justified in the sense of a partial M-step [22]. 5.4 Feedback Matching Once the odel paraeters have been estiated each specialized function aps (with different levels of accuracy) the whole input space. Therefore, the following question arises: during reconstruction, given a point in input space, how do we choose the apping function % that should be used to ap this point? Fig. 2(b) illustrates the inference process. When generating an estiate l of body pose given an input (the gray point with a dark contour in the lower plane), SMA s generate a series of output hypotheses n obtained points pointed by the arrows). iven the set n, we define (7) the ost accurate hypothesis to be that one that iniizes the function q2/ T " ", over a, in this paper we use: srt T (9) ", with po > (illustrated by each of the # T "XK " L u 2/ T "vk " (10) and ake l, where u is the covariance atrix of the eleents in the set (i.e.,the input vectors in our training set) and 6 is the assigned label. In Fig. 2(b) we can see that each of the points in the output space is apped back to the input space, once in this space, these points can be copared (using a given cost function e.g.,eq. 10) to the initial input observation. The for of the cost function could vary, using Eq. 10 is the sae as assuing that u ". I " w 6 and Detection and Segentation Soe of our test data consists of video sequences collected with a color digital caera. In those sequences the background is static, there is only one person present, and the

5 person is facing towards the caera. Our syste tracks both hands of the user autoatically, using a skin color tracker. In the first frae of the sequence, the tracker needs to be initialized, by locating in the iage the objects that we want to track. That could be done by applying a skin detector syste, like the one described in [15]. owever, using that detector, clothes are labeled as skin, soeties, because of their color. We can locate and segent the hands ore accurately using the fact that their color is very siilar to the color of the face. The position of the face can be found reliably using a face detector syste [29]. For each pixel in the detected face we copute a easure of how skin-like the pixel color is. That easure is based on histogras of skin and non-skin color distributions, coputed fro a database of thousands of iages in which regions were labeled as skin and non-skin. Those labeled iages were fraes fro coercially available DVD ovies. We select the top 50% of the pixels in the face, for which the easure of skin siilarity is the highest. For each of those pixels we copute itsn color ( N ), and we find the eancn color of all selected pixels. Then, for each pixel in the iage, we calculate the distance of itsn color fro the eann color. We label as skin all pixels for which that distance is less than a threshold. The threshold we use is 17, for RB values between 0 and 255. The objects we want to track are the three largest connected coponents of the skin pixels. One of the overlaps with the face, and the other two are considered to be the hands. We initialize the skin tracker with the position of the face and hand regions, and the tracker locates the face and hands in the rest of the fraes in each sequence. The skin tracker odels skin color distribution as a histogra in SV space. It can handle distributions that change fro one frae to the next, because of varying illuination or otion with respect to light sources. The changes in skin color that occur in a new frae are odeled as the results of translating, rotating and scaling the current histogra. Furtherore, the evolution of the histogra is odeled as a second-order Markov process. The tracker is initialized in the first frae, by being told which regions to track, and it estiates the initial color distribution. In the next 8-30 fraes, in addition to tracking and adapting the skin color histogra, it also learns the paraeters of the Markov process. After the learning stage, it uses those paraeters to predict the color distribution in every new frae, while still updating the Markov odel, based on the actual histogra that is observed in the new frae. The learning and tracking stage are explained in detail in [32]. Our siple hand detection and tracking algorith would not work at any frae where the hands overlap with each other or with the face. In our video sequences we took care to avoid such situations. Our syste could be ade ore general by including odules to predict occlusion of an object by another and to detect when those objects are separated again. A siilar approach has been successfully applied in the doain of ultiple person tracking with occlusion handling [26]. 7 Experiental Results The described approach was tested in experients with training data consisting of approxiately 30 sequences obtained through the use of a Cyberglove. Input-output pairs were generated using coputer graphics by rendering fro 86 viewpoints roughly uniforly distributed on the view sphere. The output consisted of 24 joint angles of a huan hand linearly encoded by nine real values using Principal Coponent Analysis (PCA). The input consisted of seven real-valued u oents [14] coputed on synthetically generated silhouettes of the hand. u oents are functions of central iage oents. They are invariant to translation, scaling, and rotation on the iage plane. These invariances ease the observation process (e.g.,we do not need to be concern about where and how large the hand appears on the iage). owever, rotation invariance akes hand rotation parallel to the iage plane unobservable. For the real experients observation inputs were obtained tracking skin color distributions [32]. Approxiately 300,000 iages were generated synthetically. Of these, 8,000 were used for training and the rest for testing. We used cross-validation for early stopping the training procedure and avoid overfitting. In the experients shown, the nuber of specialized functions was set to 30. Each of these functions was a one hidden layer, feedforward network with 5 hidden neurons. The annealing schedule was 7 where was the iteration nuber in the EM algorith. Other experients were perfored to test the convergence and fitting properties of the odel, due to space liitations these results will not be presented in this paper. 7.1 Quantitative Experients Fig. 3 shows exaple hand configuration estiates obtained in representative test fraes (not in the training set). Synthetic iages were used in this experient, because ground-truth data was available for quantitative perforance evaluation. As can be seen in the figure, selfoccluding configurations are obviously harder, but still the estiate is close to ground-truth given that no huan intervention nor pose initialization was required. In order to provide quantitative easures of perforance, test data was used to generate viewpoint dependent error easures. Fig. 4 shows the ean squared error and its variance per viewpoint at the equator 4(a) and at different latitudes 4(b). Note in 4(a) that for views on the equator the error is saller for longitudes closer to radians, this corresponds to a view of the pal (fro different latitudes). These perforance differences are ost likely due to that at side-view angles there is an increased aount of self-occlusion and also because the projections involve fewer pixels, reducing the saples used to calculate iage oents. In 4(b) we can observe that reconstruction errors increase at the poles of the view sphere, where there is also little inforation projected to the iage plane. While the MSE result is encouraging, the variance suggests that certain hand poses are not accurately recovered (we discover they ostly correspond to coplex hand configurations coing fro the Aerican Sign Language part of our data).

6 MSE in 22 joint config. space Mean error and variance for all views on the viewsphere equator Views (x 2π/16 rads.) MSE in 22 joint config. space Mean error and variance grouped by latitude Latitude (x 2π/16 rads.) evenly spaced fro 90 o to +90 o Figure 4: Quantitative experiental results. Mean square error in the reconstruction is shown in (a) taken at the equator of the view sphere, varying the longitude and (b) at different latitudes, averaging over all the longitudes. Longitude and latitude radians represents a view towards the pal of the hand. 7.2 Experients with Real Sequences In the next set of experients, we tested the syste against real segented visual data. The sequences were segented to yield blobs that corresponded to hands in each frae, as described in Sec. 6. The resulting reconstruction for several relatively coplex gesture sequences is shown in Fig. 5. Note that given blob iages, recovering 3D hand pose is a difficult task even for a huan observer. This difficulty is increased by perforing inference fro blob oents, obviously with an inferior descriptive power. Methods for addressing this issue will be covered further in Sec D Reconstruction Reliability It should be noted that SMA s can provide a easure of reconstruction reliability by using the log-probabilities coputed in Eq. 10. Abiguous inputs can be discovered by looking at the relative scores given by Eq. 10 (another option is to look at the entropy of I " ). This is extreely iportant because even though the forward aps are designed to handle abiguities, the inference process clearly still suffers fro abiguities. Therefore, it can be ipossible to recover soe configurations with enough reliability. As an exaple, in Fig. 5, the configurations 5-6 have low reliability score, even though we obtain good estiates. Soe of the copeting hypotheses include estiates that are also consistent (in ters of the visual features used) with the input presented, and soe of these consistent hypothesis are far fro the true 3D reconstruction. Thus, it was very likely to choose one of the bad estiates instead of the good ones shown in configurations Discussion and Conclusions In this paper we addressed the proble of recovering 3D hand pose fro a onocular color sequence. The ain contributions of our work are: 1. A single observed frae can be used for estiation 2. As a consequence, no anual initialization is required. Furtherore, the sequence can start with the hand in any position and orientation. 2. No liitation is iposed in the caera viewpoints allowed. 3. The syste does regression rather than classification, thereby providing a continuu of pose estiates rather than recognizing a finite nuber of classes. 4. A novel non-linear supervised learning fraework is adapted to the pose estiation proble. This fraework allows us, aong other things, to avoid the pitfalls of explicit tracking and to easure reliability of estiates during inference. 5. Reconstruction can be accoplished at near frae rate. The ain advantage of using SMA s in this doain over other function estiation paradigs is that it allows odeling of the abiguous input-output relationships that arise. For instance, different hand configurations can generate the sae visual features, due to self-occlusion. Different visual features can be related to a single hand configuration, due to inaccurate observations or variations in hand orphology. SMA s splits (partitions) the proble into sipler apping probles. This allows for odeling different parts of the output space independently, as well as coputation of ultiple possible configurations in abiguous situations. owever, so far we choose one estiate only. This is an interesting aspect not fully addressed in this paper, Sec. 7.3 extends a little on this topic. In our current ipleentation, teporal context is not used for iproving the output estiates during apping, but only for segentation. The hand pose is re-estiated at every frae given the segented data. We expect that using previous estiates in coputing the current hand shape will iprove accuracy, and we plan to extend our approach to allow this. owever frae independence allows a very attractive inference tie of ", with specialized functions. Our algorith could be used as a front end in several gesture recognition applications that take the hand configuration as input. Current systes rely alost exclusively on non-vision techniques to obtain such data, such as Cyberloves [7, 18, 19, 30] and color arkers [11]. An autoated coputer vision technique like ours iposes no restrictions on users. It can also be used in doains where we do not have control of the data collection, and therefore we cannot require the use of ore sophisticated input devices. In future work, we plan to experient with sets of features that are richer and ore descriptive than binary silhouettes; e.g.,orientation histogras, or other texture features. Using stereo should further increase the accuracy of the syste, by providing ore shape constraints than a single 2D iage does. Finally, ore sophisticated odels of teporal dependencies, like linear aussian Models in general [11, 34, 37], could be used in the feedback atching to guide the choice of best reconstruction. Even though we have a useful estiate of confidence, given by Eq. 10, we are looking at alternatives for decreasing the error variance. 3D and pose reconstruction fro a single iage is a very difficult task, and at present no fully-general solution to the proble exists. Our results show that it is possible to approach this proble using a cobination of vision and statistical learning tools. We consider this an iportant step considering the coplexity of the task and the low descriptive power of the features currently eployed. 2 We insist that in applications where highly correlated fraes can be observed, it is iperative to use this teporal inforation. owever, the ability of our fraework to estiate hand pose given only a single frae affords autoatic initialization, faster estiation, and could be used as a bootstrap echanis in ore coplex systes.

7 References [1] R. Bowden, T. Mitchell, and M.Sarhadi. Non-linear statistical odels for the 3d reconstruction of huan body pose and otion fro onocular iage sequences. Iage Vision Cop., 18(9: ), [2] M. Brand. Shadow puppetry. In ICCV, [3] R. Cutler and M. Turk. View-based interpretation of realtie optical flow for gesture recognition. In Face and esture Recognition, pages , [4] T.J. Darrell, I.A. Essa, and A.P. Pentland. Task-specific gesture analysis in real-tie using interpolated views. PAMI, 18(12), [5] A. Depster, N. Laird, and D. Rubin. Maxiu likelihood estiation fro incoplete data. Journal of the Royal Statistical Society (B), 39(1), [6] J.. Friedan. Multivatiate adaptive regression splines. The Annals of Statistics, 19,1-141, [7] M. Fröhlich and I. Wachsuth. esture recognition of the upper libs : Fro signal to sybol. In I. Wachsuth and M. Fröhlich, editors, esture and Sign Language in uan- Coputer Interaction, esture Workshop, pages , Bielefeld, erany, [8] R. rzeszczuk,. Bradski, M.. Chu, and Jean-Yves Bouguet. Stereo based gesture recognition invariant to 3d pose and lighting. In CVPR, volue 1, pages , [9] R. adan, F. eitz, and L. Thoraval. esture localization and recognition using probabilistic visual learning. In CVPR, volue 2, pages , [10] T. eap and D. ogg. Towards 3d hand tracking using a deforable odel. In Face and esture Recognition, pages , [11]. ienz, K. Kraiss, and B. Bauer. Continuous sign language recognition using hidden arkov odels. In Intl. Conf. on Multiodal Interfaces, volue 4, pages 10 15, [12]. inton, B. Sallans, and Z. hahraani. A hierarchical counity of experts. Learning in raphical Models, M. Jordan (editor), [13] N. owe, M. Leventon, and B. Freean. Bayesian reconstruction of 3d huan otion fro single-caera video. In NIPS, [14] M. K. u. Visual pattern recognition by oent invariants. IRE Trans. Infor. Theory, IT(8), [15] M.J. Jones and J.M. Rehg. Statistical color odels with application to skin detection. In CVPR, pages I: , [16] M. I. Jordan and R. A. Jacobs. ierarchical ixtures of experts and the EM algorith. Neural Coputation, 6, , [17] M. Kohler. Special topics of gesture recognition applied in intelligent hoe environents. In Proceedings of the esture Workshop, pages , [18] R. Liang and M. Ouhyoung. A real-tie continuous gesture recognition syste for sign language. In Face and esture Recognition, pages , [19] J. Ma, W. ao, and C. Wang J. Wu. A continuous chinese sign language recognition syste. In Face and esture Recognition, pages , [20] J.P. MacCorick and M. Isard. Partitioned sapling, articulated objects, and interface-quality hand tracking. In ECCV, [21] J. Martin, V. Devin,, and J.L. Crowley. Active hand tracking. In Face and esture Recognition, pages , [22] R. Neal and. inton. A view of the e algorith that justifies increental, sparse, and other variants. Learning in raphical Models, M. Jordan (editor), [23] A. Nishikawa, A. Ohnishi, and F. Miyazaki. Description and recognition of huan gestures based on the transition of curvature fro otion iages. In Face and esture Recognition, pages , [24] E-J. Ong and S. ong. Tracking hybrid 2d-3d huan odels through ultiple views. In ICCV Workshop on Modelling People, Corfu, reece, [25] J.M. Rehg. Visual Analysis of igh DOF Articulated Objects with Application to and Tracking. PhD thesis, Electrical and Coputer Eng., Carnegie Mellon University, [26] R. Rosales and S. Sclaroff. Iproved tracking of ultiple huans with trajectory prediction and occlusion odeling. In IEEE CVPR Workshop on the Interpretation of Visual Motion, [27] R. Rosales and S. Sclaroff. Specialized appings and the estiation of body pose fro a single iage. In IEEE uan Motion Workshop. Austin, TX, [28] R. Rosales and Stan Sclaroff. Inferring body pose without tracking body parts. In CVPR, [29].A. Rowley, S. Baluja, and T. Kanade. Rotation invariant neural network-based face detection. In CVPR, pages 38 44, [30]. Sagawa and M. Takeuchi. A ethod for recognizing a sequence of sign language words represented in a japanese sign language sentence. In Face and esture Recognition, pages , [31] N. Shiada, Y. Shirai, Y. Kuno, and J. Miura. and gesture estiation and odel refineent using onocular caera - abiguity liitation by inequality constraints. In Face and esture Recognition, pages , [32] L. Sigal, S. Sclaroff, and V. Athitsos. Estiation and prediction of evolving color distributions for skin segentation under varying illuination. In CVPR, [33] Y. Song, Xiaoling Feng, and P. Perona. Towards detection of huan otion. In CVPR, [34] T. Starner and A. Pentland. Real-tie aerican sign language recognition using desk and wearable coputer based video. PAMI, 20(12): , [35] A. Utsui and J. Ohya. Multiple-hand-gesture tracking using ultiple caeras. In CVPR, volue 1, pages , [36] Virtual Technologies, Inc., Palo Alto, CA. Virtualand Software Library Reference Manual, August [37] C. Vogler and D. Metaxas. Toward scalability in asl recognition: Breaking down signs into phonees. In Proceedings of the esture Workshop, [38] J. Weng and Y. Cui. Recognition of hand signs fro coplex backgrounds. In R. Cipolla and A. Pentland, editors, Coputer Vision for uan-machine Interaction. Cabridge University Press, [39] Y. Wu and T.S. uang. View-independent recognition of hand postures. In CVPR, volue 2, pages 88 94, [40] M. Yang and N. Ahuja. Recognizing hand gesture using otion trajectories. In CVPR, volue 1, pages , 1999.

8 (a) (b) (a) (b) (a) (b) Figure 3: Exaple reconstruction of several synthetic test sequences. Each set (2 rows each) consists of (a)input iages, (b)reconstruction. Because our approach can provide us with a reconstruction confidence, we used this to show high-ediu-low confidence estiates (one pair of rows each). Figure 5: Reconstruction obtained fro perforing hand segentation in a huan subject. The two top pairs of rows show good reconstruction while the botto pair show exaples of bad perforance. Reconstruction is shown fro a fixed viewpoint (latitude -longitude rads.).

NON-RIGID OBJECT TRACKING: A PREDICTIVE VECTORIAL MODEL APPROACH

NON-RIGID OBJECT TRACKING: A PREDICTIVE VECTORIAL MODEL APPROACH NON-RIGID OBJECT TRACKING: A PREDICTIVE VECTORIAL MODEL APPROACH V. Atienza; J.M. Valiente and G. Andreu Departaento de Ingeniería de Sisteas, Coputadores y Autoática Universidad Politécnica de Valencia.

More information

Boosted Detection of Objects and Attributes

Boosted Detection of Objects and Attributes L M M Boosted Detection of Objects and Attributes Abstract We present a new fraework for detection of object and attributes in iages based on boosted cobination of priitive classifiers. The fraework directly

More information

Computer Aided Drafting, Design and Manufacturing Volume 26, Number 2, June 2016, Page 13

Computer Aided Drafting, Design and Manufacturing Volume 26, Number 2, June 2016, Page 13 Coputer Aided Drafting, Design and Manufacturing Volue 26, uber 2, June 2016, Page 13 CADDM 3D reconstruction of coplex curved objects fro line drawings Sun Yanling, Dong Lijun Institute of Mechanical

More information

3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views

3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views 3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views Vassilis Athitsos and Stan Sclaroff Computer Science Department Boston University 111 Cummington Street

More information

A Novel 2D Texture Classifier For Gray Level Images

A Novel 2D Texture Classifier For Gray Level Images 2012, TextRoad Publication ISSN 2090-4304 Journal of Basic and Applied Scientific Research www.textroad.co A Novel 2D Texture Classifier For Gray Level Iages B.S. Mousavi 1 Young Researchers Club, Zahedan

More information

The optimization design of microphone array layout for wideband noise sources

The optimization design of microphone array layout for wideband noise sources PROCEEDINGS of the 22 nd International Congress on Acoustics Acoustic Array Systes: Paper ICA2016-903 The optiization design of icrophone array layout for wideband noise sources Pengxiao Teng (a), Jun

More information

IMAGE MOSAICKING FOR ESTIMATING THE MOTION OF AN UNDERWATER VEHICLE. Rafael García, Xevi Cufí and Lluís Pacheco

IMAGE MOSAICKING FOR ESTIMATING THE MOTION OF AN UNDERWATER VEHICLE. Rafael García, Xevi Cufí and Lluís Pacheco IMAGE MOSAICKING FOR ESTIMATING THE MOTION OF AN UNDERWATER VEHICLE Rafael García, Xevi Cufí and Lluís Pacheco Coputer Vision and Robotics Group Institute of Inforatics and Applications, University of

More information

A wireless sensor network for visual detection and classification of intrusions

A wireless sensor network for visual detection and classification of intrusions A wireless sensor network for visual detection and classification of intrusions ANDRZEJ SLUZEK 1,3, PALANIAPPAN ANNAMALAI 2, MD SAIFUL ISLAM 1 1 School of Coputer Engineering, 2 IntelliSys Centre Nanyang

More information

EE 364B Convex Optimization An ADMM Solution to the Sparse Coding Problem. Sonia Bhaskar, Will Zou Final Project Spring 2011

EE 364B Convex Optimization An ADMM Solution to the Sparse Coding Problem. Sonia Bhaskar, Will Zou Final Project Spring 2011 EE 364B Convex Optiization An ADMM Solution to the Sparse Coding Proble Sonia Bhaskar, Will Zou Final Project Spring 20 I. INTRODUCTION For our project, we apply the ethod of the alternating direction

More information

POSITION-PATCH BASED FACE HALLUCINATION VIA LOCALITY-CONSTRAINED REPRESENTATION. Junjun Jiang, Ruimin Hu, Zhen Han, Tao Lu, and Kebin Huang

POSITION-PATCH BASED FACE HALLUCINATION VIA LOCALITY-CONSTRAINED REPRESENTATION. Junjun Jiang, Ruimin Hu, Zhen Han, Tao Lu, and Kebin Huang IEEE International Conference on ultiedia and Expo POSITION-PATCH BASED FACE HALLUCINATION VIA LOCALITY-CONSTRAINED REPRESENTATION Junjun Jiang, Ruiin Hu, Zhen Han, Tao Lu, and Kebin Huang National Engineering

More information

Brian Noguchi CS 229 (Fall 05) Project Final Writeup A Hierarchical Application of ICA-based Feature Extraction to Image Classification Brian Noguchi

Brian Noguchi CS 229 (Fall 05) Project Final Writeup A Hierarchical Application of ICA-based Feature Extraction to Image Classification Brian Noguchi A Hierarchical Application of ICA-based Feature Etraction to Iage Classification Introduction Iage classification poses one of the greatest challenges in the achine vision and achine learning counities.

More information

COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL

COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL 1 Te-Wei Chiang ( 蔣德威 ), 2 Tienwei Tsai ( 蔡殿偉 ), 3 Jeng-Ping Lin ( 林正平 ) 1 Dept. of Accounting Inforation Systes, Chilee Institute

More information

Evaluation of a multi-frame blind deconvolution algorithm using Cramér-Rao bounds

Evaluation of a multi-frame blind deconvolution algorithm using Cramér-Rao bounds Evaluation of a ulti-frae blind deconvolution algorith using Craér-Rao bounds Charles C. Beckner, Jr. Air Force Research Laboratory, 3550 Aberdeen Ave SE, Kirtland AFB, New Mexico, USA 87117-5776 Charles

More information

Vision Based Mobile Robot Navigation System

Vision Based Mobile Robot Navigation System International Journal of Control Science and Engineering 2012, 2(4): 83-87 DOI: 10.5923/j.control.20120204.05 Vision Based Mobile Robot Navigation Syste M. Saifizi *, D. Hazry, Rudzuan M.Nor School of

More information

A Novel Fast Constructive Algorithm for Neural Classifier

A Novel Fast Constructive Algorithm for Neural Classifier A Novel Fast Constructive Algorith for Neural Classifier Xudong Jiang Centre for Signal Processing, School of Electrical and Electronic Engineering Nanyang Technological University Nanyang Avenue, Singapore

More information

Image Filter Using with Gaussian Curvature and Total Variation Model

Image Filter Using with Gaussian Curvature and Total Variation Model IJECT Vo l. 7, Is s u e 3, Ju l y - Se p t 016 ISSN : 30-7109 (Online) ISSN : 30-9543 (Print) Iage Using with Gaussian Curvature and Total Variation Model 1 Deepak Kuar Gour, Sanjay Kuar Shara 1, Dept.

More information

Hand Gesture Recognition for Human-Computer Interaction

Hand Gesture Recognition for Human-Computer Interaction Journal of Coputer Science 6 (9): 002-007, 200 ISSN 549-3636 200 Science Publications Hand Gesture Recognition for Huan-Coputer Interaction S. ohaed ansoor Rooi, R. Jyothi Priya and H. Jayalakshi Departent

More information

A simplified approach to merging partial plane images

A simplified approach to merging partial plane images A siplified approach to erging partial plane iages Mária Kruláková 1 This paper introduces a ethod of iage recognition based on the gradual generating and analysis of data structure consisting of the 2D

More information

TensorFlow and Keras-based Convolutional Neural Network in CAT Image Recognition Ang LI 1,*, Yi-xiang LI 2 and Xue-hui LI 3

TensorFlow and Keras-based Convolutional Neural Network in CAT Image Recognition Ang LI 1,*, Yi-xiang LI 2 and Xue-hui LI 3 2017 2nd International Conference on Coputational Modeling, Siulation and Applied Matheatics (CMSAM 2017) ISBN: 978-1-60595-499-8 TensorFlow and Keras-based Convolutional Neural Network in CAT Iage Recognition

More information

Effective Tracking of the Players and Ball in Indoor Soccer Games in the Presence of Occlusion

Effective Tracking of the Players and Ball in Indoor Soccer Games in the Presence of Occlusion Effective Tracking of the Players and Ball in Indoor Soccer Gaes in the Presence of Occlusion Soudeh Kasiri-Bidhendi and Reza Safabakhsh Airkabir Univerisity of Technology, Tehran, Iran {kasiri, safa}@aut.ac.ir

More information

Novel Image Representation and Description Technique using Density Histogram of Feature Points

Novel Image Representation and Description Technique using Density Histogram of Feature Points Novel Iage Representation and Description Technique using Density Histogra of Feature Points Keneilwe ZUVA Departent of Coputer Science, University of Botswana, P/Bag 00704 UB, Gaborone, Botswana and Tranos

More information

PROBABILISTIC LOCALIZATION AND MAPPING OF MOBILE ROBOTS IN INDOOR ENVIRONMENTS WITH A SINGLE LASER RANGE FINDER

PROBABILISTIC LOCALIZATION AND MAPPING OF MOBILE ROBOTS IN INDOOR ENVIRONMENTS WITH A SINGLE LASER RANGE FINDER nd International Congress of Mechanical Engineering (COBEM 3) Noveber 3-7, 3, Ribeirão Preto, SP, Brazil Copyright 3 by ABCM PROBABILISTIC LOCALIZATION AND MAPPING OF MOBILE ROBOTS IN INDOOR ENVIRONMENTS

More information

Clustering. Cluster Analysis of Microarray Data. Microarray Data for Clustering. Data for Clustering

Clustering. Cluster Analysis of Microarray Data. Microarray Data for Clustering. Data for Clustering Clustering Cluster Analysis of Microarray Data 4/3/009 Copyright 009 Dan Nettleton Group obects that are siilar to one another together in a cluster. Separate obects that are dissiilar fro each other into

More information

Solving the Damage Localization Problem in Structural Health Monitoring Using Techniques in Pattern Classification

Solving the Damage Localization Problem in Structural Health Monitoring Using Techniques in Pattern Classification Solving the Daage Localization Proble in Structural Health Monitoring Using Techniques in Pattern Classification CS 9 Final Project Due Dec. 4, 007 Hae Young Noh, Allen Cheung, Daxia Ge Introduction Structural

More information

Feature Based Registration for Panoramic Image Generation

Feature Based Registration for Panoramic Image Generation IJCSI International Journal of Coputer Science Issues, Vol. 10, Issue 6, No, Noveber 013 www.ijcsi.org 13 Feature Based Registration for Panoraic Iage Generation Kawther Abbas Sallal 1, Abdul-Mone Saleh

More information

Feature Selection to Relate Words and Images

Feature Selection to Relate Words and Images The Open Inforation Systes Journal, 2009, 3, 9-13 9 Feature Selection to Relate Words and Iages Wei-Chao Lin 1 and Chih-Fong Tsai*,2 Open Access 1 Departent of Coputing, Engineering and Technology, University

More information

Leveraging Relevance Cues for Improved Spoken Document Retrieval

Leveraging Relevance Cues for Improved Spoken Document Retrieval Leveraging Relevance Cues for Iproved Spoken Docuent Retrieval Pei-Ning Chen 1, Kuan-Yu Chen 2 and Berlin Chen 1 National Taiwan Noral University, Taiwan 1 Institute of Inforation Science, Acadeia Sinica,

More information

COMPUTER GENERATED HOLOGRAMS Optical Sciences 627 W.J. Dallas (Monday, August 23, 2004, 12:38 PM) PART III: CHAPTER ONE DIFFUSERS FOR CGH S

COMPUTER GENERATED HOLOGRAMS Optical Sciences 627 W.J. Dallas (Monday, August 23, 2004, 12:38 PM) PART III: CHAPTER ONE DIFFUSERS FOR CGH S COPUTER GEERATED HOLOGRAS Optical Sciences 67 W.J. Dallas (onday, August 3, 004, 1:38 P) PART III: CHAPTER OE DIFFUSERS FOR CGH S Part III: Chapter One Page 1 of 8 Introduction Hologras for display purposes

More information

Identifying Converging Pairs of Nodes on a Budget

Identifying Converging Pairs of Nodes on a Budget Identifying Converging Pairs of Nodes on a Budget Konstantina Lazaridou Departent of Inforatics Aristotle University, Thessaloniki, Greece konlaznik@csd.auth.gr Evaggelia Pitoura Coputer Science and Engineering

More information

Relief shape inheritance and graphical editor for the landscape design

Relief shape inheritance and graphical editor for the landscape design Relief shape inheritance and graphical editor for the landscape design Egor A. Yusov Vadi E. Turlapov Nizhny Novgorod State University after N. I. Lobachevsky Nizhny Novgorod Russia yusov_egor@ail.ru vadi.turlapov@cs.vk.unn.ru

More information

Defining and Surveying Wireless Link Virtualization and Wireless Network Virtualization

Defining and Surveying Wireless Link Virtualization and Wireless Network Virtualization 1 Defining and Surveying Wireless Link Virtualization and Wireless Network Virtualization Jonathan van de Belt, Haed Ahadi, and Linda E. Doyle The Centre for Future Networks and Counications - CONNECT,

More information

3D Building Detection and Reconstruction from Aerial Images Using Perceptual Organization and Fast Graph Search

3D Building Detection and Reconstruction from Aerial Images Using Perceptual Organization and Fast Graph Search 436 Journal of Electrical Engineering & Technology, Vol. 3, No. 3, pp. 436~443, 008 3D Building Detection and Reconstruction fro Aerial Iages Using Perceptual Organization and Fast Graph Search Dong-Min

More information

Optimized stereo reconstruction of free-form space curves based on a nonuniform rational B-spline model

Optimized stereo reconstruction of free-form space curves based on a nonuniform rational B-spline model 1746 J. Opt. Soc. A. A/ Vol. 22, No. 9/ Septeber 2005 Y. J. Xiao and Y. F. Li Optiized stereo reconstruction of free-for space curves based on a nonunifor rational B-spline odel Yi Jun Xiao Departent of

More information

Simulated Sensor Reading. Heading. Y [m] local minimum ; m. local maximum ; M. discontinuity ; D. connection ; c X [m]

Simulated Sensor Reading. Heading. Y [m] local minimum ; m. local maximum ; M. discontinuity ; D. connection ; c X [m] Localization based on Visibility Sectors using Range Sensors Sooyong Lee y Nancy. Aato Jaes Fellers Departent of Coputer Science Texas A& University College Station, TX 7784 fsooyong,aato,jpf938g@cs.tau.edu

More information

Designing High Performance Web-Based Computing Services to Promote Telemedicine Database Management System

Designing High Performance Web-Based Computing Services to Promote Telemedicine Database Management System Designing High Perforance Web-Based Coputing Services to Proote Teleedicine Database Manageent Syste Isail Hababeh 1, Issa Khalil 2, and Abdallah Khreishah 3 1: Coputer Engineering & Inforation Technology,

More information

Affine Invariant Texture Analysis Based on Structural Properties 1

Affine Invariant Texture Analysis Based on Structural Properties 1 ACCV: The 5th Asian Conference on Coputer Vision, --5 January, Melbourne, Australia Affine Invariant Texture Analysis Based on tructural Properties Jianguo Zhang, Tieniu Tan National Laboratory of Pattern

More information

HUMAN pose estimation is an important problem in computer

HUMAN pose estimation is an important problem in computer JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO., JANUARY 7 Robust 3D Huan Pose Estiation fro Single Iages or Video Sequences Chunyu Wang, Student Meber, IEEE, Yizhou Wang, Meber, IEEE, Zhouchen Lin, Meber,

More information

AGV PATH PLANNING BASED ON SMOOTHING A* ALGORITHM

AGV PATH PLANNING BASED ON SMOOTHING A* ALGORITHM International Journal of Software Engineering & Applications (IJSEA), Vol.6, No.5, Septeber 205 AGV PATH PLANNING BASED ON SMOOTHING A* ALGORITHM Xie Yang and Cheng Wushan College of Mechanical Engineering,

More information

A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing

A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing A Trajectory Splitting Model for Efficient Spatio-Teporal Indexing Slobodan Rasetic Jörg Sander Jaes Elding Mario A. Nasciento Departent of Coputing Science University of Alberta Edonton, Alberta, Canada

More information

MiPPS: A Generative Model for Multi-Manifold Clustering

MiPPS: A Generative Model for Multi-Manifold Clustering Manifold Learning and its Applications: Papers fro the AAAI Fall Syposiu (FS-9-) MiPPS: A Generative Model for Multi-Manifold Clustering Oluwasani Koyejo and Joydeep Ghosh Electrical and Coputer Engineering

More information

3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views

3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views Technical Report BU-CS-TR-2001-021 A shorter version of this paper is published in the proceedings of IEEE Workshop on Cues in Communication, 2001 3D Hand Pose Estimation by Finding Appearance-Based Matches

More information

EFFICIENT VIDEO SEARCH USING IMAGE QUERIES A. Araujo1, M. Makar2, V. Chandrasekhar3, D. Chen1, S. Tsai1, H. Chen1, R. Angst1 and B.

EFFICIENT VIDEO SEARCH USING IMAGE QUERIES A. Araujo1, M. Makar2, V. Chandrasekhar3, D. Chen1, S. Tsai1, H. Chen1, R. Angst1 and B. EFFICIENT VIDEO SEARCH USING IMAGE QUERIES A. Araujo1, M. Makar2, V. Chandrasekhar3, D. Chen1, S. Tsai1, H. Chen1, R. Angst1 and B. Girod1 1 Stanford University, USA 2 Qualco Inc., USA ABSTRACT We study

More information

Action Recognition Using Local SpatioTemporal Oriented Energy Features and Additive. Kernel SVMs

Action Recognition Using Local SpatioTemporal Oriented Energy Features and Additive. Kernel SVMs International Journal of Electronics and Electrical Engineering Vol., No., June, 4 Action Recognition Using Local SpatioTeporal Oriented Energy Features and Additive Kernel SVMs Jiangfeng Yang and Zheng

More information

Detection of Outliers and Reduction of their Undesirable Effects for Improving the Accuracy of K-means Clustering Algorithm

Detection of Outliers and Reduction of their Undesirable Effects for Improving the Accuracy of K-means Clustering Algorithm Detection of Outliers and Reduction of their Undesirable Effects for Iproving the Accuracy of K-eans Clustering Algorith Bahan Askari Departent of Coputer Science and Research Branch, Islaic Azad University,

More information

Rule Extraction using Artificial Neural Networks

Rule Extraction using Artificial Neural Networks Rule Extraction using Artificial Neural Networks S. M. Karuzzaan 1 Ahed Ryadh Hasan 2 Abstract Artificial neural networks have been successfully applied to a variety of business application probles involving

More information

Real Time Displacement Measurement of an image in a 2D Plane

Real Time Displacement Measurement of an image in a 2D Plane International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882 Volue 5, Issue 3, March 2016 176 Real Tie Displaceent Measureent of an iage in a 2D Plane Abstract Prashant

More information

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science. Professor Willia Hoff Dept of Electrical Engineering &Coputer Science http://inside.ines.edu/~whoff/ 1 Caera Calibration 2 Caera Calibration Needed for ost achine vision and photograetry tasks (object

More information

AN INTEGRATED APPROACH TO MUSIC BOUNDARY DETECTION

AN INTEGRATED APPROACH TO MUSIC BOUNDARY DETECTION 10th International Society for Music Inforation Retrieval Conference (ISMIR 2009) AN INTEGRATED APPROACH TO MUSIC BOUNDARY DETECTION Min-Yian Su, Yi-Hsuan Yang, Yu-Ching Lin, Hoer Chen National Taiwan

More information

Resolution. Super-Resolution Imaging. Problem

Resolution. Super-Resolution Imaging. Problem Resolution Super-Resolution Iaging Resolution: Sallest easurable detail in a visual presentation Subhasis Chaudhuri Departent of Electrical Engineering Indian institute of Technology Bobay Powai, Mubai-400

More information

On computing global similarity in images

On computing global similarity in images On coputing global siilarity in iages S Ravela and R Manatha Coputer Science Departent University of Massachusetts, Aherst, MA 01003 Eail: ravela,anatha @csuassedu Abstract The retrieval of iages based

More information

THE rapid growth and continuous change of the real

THE rapid growth and continuous change of the real IEEE TRANSACTIONS ON SERVICES COMPUTING, VOL. 8, NO. 1, JANUARY/FEBRUARY 2015 47 Designing High Perforance Web-Based Coputing Services to Proote Teleedicine Database Manageent Syste Isail Hababeh, Issa

More information

Abstract. 2. Segmentation Techniques. Keywords. 1. Introduction. 3. Threshold based Image Segmentation

Abstract. 2. Segmentation Techniques. Keywords. 1. Introduction. 3. Threshold based Image Segmentation International Journal of Advanced Coputer Research (ISSN (print): 49-777 ISSN (online): 77-7970) Volue-3 Nuber- Issue-8 March-03 MRI Brain Iage Segentation based on hresholding G. Evelin Sujji, Y.V.S.

More information

ELEVATION SURFACE INTERPOLATION OF POINT DATA USING DIFFERENT TECHNIQUES A GIS APPROACH

ELEVATION SURFACE INTERPOLATION OF POINT DATA USING DIFFERENT TECHNIQUES A GIS APPROACH ELEVATION SURFACE INTERPOLATION OF POINT DATA USING DIFFERENT TECHNIQUES A GIS APPROACH Kulapraote Prathuchai Geoinforatics Center, Asian Institute of Technology, 58 Moo9, Klong Luang, Pathuthani, Thailand.

More information

Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression

Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression Sha M. Kakade Microsoft Research and Wharton, U Penn skakade@icrosoft.co Varun Kanade SEAS, Harvard University

More information

A Learning Framework for Nearest Neighbor Search

A Learning Framework for Nearest Neighbor Search A Learning Fraework for Nearest Neighbor Search Lawrence Cayton Departent of Coputer Science University of California, San Diego lcayton@cs.ucsd.edu Sanjoy Dasgupta Departent of Coputer Science University

More information

Super-Resolution on Moving Objects using a Polygon-Based Object Description

Super-Resolution on Moving Objects using a Polygon-Based Object Description Super-Resolution on Moving Objects using a Polgon-Based Object Description A. van Eekeren K. Schutte O.R. Oudegeest L.J. van Vliet TNO Defence, Securit and Safet, P.O. Box 96864, 2509 JG, The Hague, The

More information

A Model Free Automatic Tuning Method for a Restricted Structured Controller. by using Simultaneous Perturbation Stochastic Approximation

A Model Free Automatic Tuning Method for a Restricted Structured Controller. by using Simultaneous Perturbation Stochastic Approximation 28 Aerican Control Conference Westin Seattle Hotel, Seattle, Washington, USA June -3, 28 WeC9.5 A Model Free Autoatic Tuning Method for a Restricted Structured Controller by Using Siultaneous Perturbation

More information

Summary. Reconstruction of data from non-uniformly spaced samples

Summary. Reconstruction of data from non-uniformly spaced samples Is there always extra bandwidth in non-unifor spatial sapling? Ralf Ferber* and Massiiliano Vassallo, WesternGeco London Technology Center; Jon-Fredrik Hopperstad and Ali Özbek, Schluberger Cabridge Research

More information

Using Imperialist Competitive Algorithm in Optimization of Nonlinear Multiple Responses

Using Imperialist Competitive Algorithm in Optimization of Nonlinear Multiple Responses International Journal of Industrial Engineering & Production Research Septeber 03, Volue 4, Nuber 3 pp. 9-35 ISSN: 008-4889 http://ijiepr.iust.ac.ir/ Using Iperialist Copetitive Algorith in Optiization

More information

Geo-activity Recommendations by using Improved Feature Combination

Geo-activity Recommendations by using Improved Feature Combination Geo-activity Recoendations by using Iproved Feature Cobination Masoud Sattari Middle East Technical University Ankara, Turkey e76326@ceng.etu.edu.tr Murat Manguoglu Middle East Technical University Ankara,

More information

An Efficient Approach for Content Delivery in Overlay Networks

An Efficient Approach for Content Delivery in Overlay Networks An Efficient Approach for Content Delivery in Overlay Networks Mohaad Malli, Chadi Barakat, Walid Dabbous Projet Planète, INRIA-Sophia Antipolis, France E-ail:{alli, cbarakat, dabbous}@sophia.inria.fr

More information

Evidence Filtering in a Sequence of Images for Recognition

Evidence Filtering in a Sequence of Images for Recognition Evidence Filtering in a Sequence of Iages for Recognition Sukhan Lee 1,2, * 1 Director, Intelligent Systes Research Institute (ISRI), School of Inforation and Counication Engineering. 2 Interaction Science

More information

A robust incremental learning framework for accurate skin region segmentation in color images

A robust incremental learning framework for accurate skin region segmentation in color images Pattern Recognition 40 (2007) 3621 3632 www.elsevier.co/locate/pr A robust increental learning fraework for accurate skin region segentation in color iages Bin Li a,, Xiangyang Xue a, Jianping Fan b a

More information

INTRINSIC DECOMPOSITION FOR STEREOSCOPIC IMAGES

INTRINSIC DECOMPOSITION FOR STEREOSCOPIC IMAGES INTRINSIC DECOMPOSITION FOR STEREOSCOPIC IMAGES Dehua Xie 1, Shuaicheng Liu 1, Kaio Lin 2, Shuyuan Zhu 1, and Bing Zeng 1 1 School of Electronic Engineering, University of Electronic Science and Technology

More information

Collaborative Web Caching Based on Proxy Affinities

Collaborative Web Caching Based on Proxy Affinities Collaborative Web Caching Based on Proxy Affinities Jiong Yang T J Watson Research Center IBM jiyang@usibco Wei Wang T J Watson Research Center IBM ww1@usibco Richard Muntz Coputer Science Departent UCLA

More information

Set Theoretic Estimation for Problems in Subtractive Color

Set Theoretic Estimation for Problems in Subtractive Color Set Theoretic Estiation for Probles in Subtractive Color Gaurav Shara Digital Iaging Technology Center, Xerox Corporation, MS0128-27E, 800 Phillips Rd, Webster, NY 14580 Received 25 March 1999; accepted

More information

The Internal Conflict of a Belief Function

The Internal Conflict of a Belief Function The Internal Conflict of a Belief Function Johan Schubert Abstract In this paper we define and derive an internal conflict of a belief function We decopose the belief function in question into a set of

More information

A Learning Framework for Nearest Neighbor Search

A Learning Framework for Nearest Neighbor Search A Learning Fraework for Nearest Neighbor Search Lawrence Cayton Departent of Coputer Science University of California, San Diego lcayton@cs.ucsd.edu Sanjoy Dasgupta Departent of Coputer Science University

More information

Video summarization based on shot boundary detection with penalized contrasts

Video summarization based on shot boundary detection with penalized contrasts Video suarization based on shot boundary detection with penalized contrasts Paschalina Medentzidou and Constantine Kotropoulos Departent of Inforatics Aristotle University of Thessaloniki Thessaloniki

More information

Efficient Estimation of Inclusion Coefficient using HyperLogLog Sketches

Efficient Estimation of Inclusion Coefficient using HyperLogLog Sketches Efficient Estiation of Inclusion Coefficient using HyperLogLog Sketches Azade Nazi, Bolin Ding, Vivek Narasayya, Surajit Chaudhuri Microsoft Research {aznazi, bolind, viveknar, surajitc}@icrosoft.co ABSTRACT

More information

1 Extended Boolean Model

1 Extended Boolean Model 1 EXTENDED BOOLEAN MODEL It has been well-known that the Boolean odel is too inflexible, requiring skilful use of Boolean operators to obtain good results. On the other hand, the vector space odel is flexible

More information

Feature-Centric Evaluation for Efficient Cascaded Object Detection

Feature-Centric Evaluation for Efficient Cascaded Object Detection IEEE Conference on Coputer Vision and Pattern (CVPR, 2004 Feature-Centric Evaluation for Efficient Cascaded Object Detection Henry Schneideran 1 Robotics Institute Carnegie Mellon University Pittsburgh,

More information

(Geometric) Camera Calibration

(Geometric) Camera Calibration (Geoetric) Caera Calibration CS635 Spring 217 Daniel G. Aliaga Departent of Coputer Science Purdue University Caera Calibration Caeras and CCDs Aberrations Perspective Projection Calibration Caeras First

More information

Modeling Parallel Applications Performance on Heterogeneous Systems

Modeling Parallel Applications Performance on Heterogeneous Systems Modeling Parallel Applications Perforance on Heterogeneous Systes Jaeela Al-Jaroodi, Nader Mohaed, Hong Jiang and David Swanson Departent of Coputer Science and Engineering University of Nebraska Lincoln

More information

A New Generic Model for Vision Based Tracking in Robotics Systems

A New Generic Model for Vision Based Tracking in Robotics Systems A New Generic Model for Vision Based Tracking in Robotics Systes Yanfei Liu, Ada Hoover, Ian Walker, Ben Judy, Mathew Joseph and Charly Heranson lectrical and Coputer ngineering Departent Cleson University

More information

Gromov-Hausdorff Distance Between Metric Graphs

Gromov-Hausdorff Distance Between Metric Graphs Groov-Hausdorff Distance Between Metric Graphs Jiwon Choi St Mark s School January, 019 Abstract In this paper we study the Groov-Hausdorff distance between two etric graphs We copute the precise value

More information

A Discrete Spring Model to Generate Fair Curves and Surfaces

A Discrete Spring Model to Generate Fair Curves and Surfaces A Discrete Spring Model to Generate Fair Curves and Surfaces Atsushi Yaada Kenji Shiada 2 Tootake Furuhata and Ko-Hsiu Hou 2 Tokyo Research Laboratory IBM Japan Ltd. LAB-S73 623-4 Shiotsurua Yaato Kanagawa

More information

Image Processing for fmri John Ashburner. Wellcome Trust Centre for Neuroimaging, 12 Queen Square, London, UK.

Image Processing for fmri John Ashburner. Wellcome Trust Centre for Neuroimaging, 12 Queen Square, London, UK. Iage Processing for fmri John Ashburner Wellcoe Trust Centre for Neuroiaging, 12 Queen Square, London, UK. Contents * Preliinaries * Rigid-Body and Affine Transforations * Optiisation and Objective Functions

More information

Automatic Graph Drawing Algorithms

Automatic Graph Drawing Algorithms Autoatic Graph Drawing Algoriths Susan Si sisuz@turing.utoronto.ca Deceber 7, 996. Ebeddings of graphs have been of interest to theoreticians for soe tie, in particular those of planar graphs and graphs

More information

HIGH PERFORMANCE PRE-SEGMENTATION ALGORITHM FOR SONAR IMAGES

HIGH PERFORMANCE PRE-SEGMENTATION ALGORITHM FOR SONAR IMAGES HIGH PERFORMANCE PRE-SEGMENTATION ALGORITHM FOR SONAR IMAGES Benjain Lehann*, Konstantinos Siantidis*, Dieter Kraus** *ATLAS ELEKTRONIK GbH Sebaldsbrücker Heerstraße 235 D-28309 Breen, GERMANY Eail: benjain.lehann@atlas-elektronik.co

More information

Utility-Based Resource Allocation for Mixed Traffic in Wireless Networks

Utility-Based Resource Allocation for Mixed Traffic in Wireless Networks IEEE IFOCO 2 International Workshop on Future edia etworks and IP-based TV Utility-Based Resource Allocation for ixed Traffic in Wireless etworks Li Chen, Bin Wang, Xiaohang Chen, Xin Zhang, and Dacheng

More information

GAN-OPC: Mask Optimization with Lithography-guided Generative Adversarial Nets

GAN-OPC: Mask Optimization with Lithography-guided Generative Adversarial Nets GAN-OPC: Mask Optiization with Lithography-guided Generative Adversarial Nets Haoyu Yang hyyang@cse.cuhk.edu.hk Shuhe Li shli@cse.cuhk.edu.hk Yuzhe Ma yza@cse.cuhk.edu.hk ABSTRACT Bei Yu byu@cse.cuhk.edu.hk

More information

Predicting x86 Program Runtime for Intel Processor

Predicting x86 Program Runtime for Intel Processor Predicting x86 Progra Runtie for Intel Processor Behra Mistree, Haidreza Haki Javadi, Oid Mashayekhi Stanford University bistree,hrhaki,oid@stanford.edu 1 Introduction Progras can be equivalent: given

More information

Design Optimization of Mixed Time/Event-Triggered Distributed Embedded Systems

Design Optimization of Mixed Time/Event-Triggered Distributed Embedded Systems Design Optiization of Mixed Tie/Event-Triggered Distributed Ebedded Systes Traian Pop, Petru Eles, Zebo Peng Dept. of Coputer and Inforation Science, Linköping University {trapo, petel, zebpe}@ida.liu.se

More information

An Optimization Clustering Algorithm Based on Texture Feature Fusion for Color Image Segmentation

An Optimization Clustering Algorithm Based on Texture Feature Fusion for Color Image Segmentation Algoriths 2015, 8, 234-247; doi:10.3390/a8020234 Article OPEN ACCESS algoriths ISSN 1999-4893 www.dpi.co/journal/algoriths An Optiization Clustering Algorith Based on Texture Feature Fusion for Color Iage

More information

Problem Solving of graph correspondence using Genetics Algorithm and ACO Algorithm

Problem Solving of graph correspondence using Genetics Algorithm and ACO Algorithm Proble Solving of graph correspondence using Genetics Algorith and ACO Algorith Alireza Rezaee, 1, Azizeh Ajalli 2 Assistant professor,departent of Mechatronics Engineering, Faculty of New Sciences and

More information

A Broadband Spectrum Sensing Algorithm in TDCS Based on ICoSaMP Reconstruction

A Broadband Spectrum Sensing Algorithm in TDCS Based on ICoSaMP Reconstruction MATEC Web of Conferences 73, 03073(08) https://doi.org/0.05/atecconf/087303073 SMIMA 08 A roadband Spectru Sensing Algorith in TDCS ased on I Reconstruction Liu Yang, Ren Qinghua, Xu ingzheng and Li Xiazhao

More information

Supplementary Section. A. Algorithm. B. Datasets. C. More on Labeling Functions

Supplementary Section. A. Algorithm. B. Datasets. C. More on Labeling Functions Suppleentary Section In this section, we include ore details about our algorith, labeling functions, datasets as well as additional results and coparisons, which could not be included in the ain paper

More information

Detecting Anomalous Structures by Convolutional Sparse Models

Detecting Anomalous Structures by Convolutional Sparse Models Detecting Anoalous Structures by Convolutional Sparse Models Diego Carrera, Giacoo Boracchi Dipartiento di Elettronica, Inforazione e Bioingegeria Politecnico di Milano, Italy {giacoo.boracchi, diego.carrera}@polii.it

More information

OPTIMAL COMPLEX SERVICES COMPOSITION IN SOA SYSTEMS

OPTIMAL COMPLEX SERVICES COMPOSITION IN SOA SYSTEMS Key words SOA, optial, coplex service, coposition, Quality of Service Piotr RYGIELSKI*, Paweł ŚWIĄTEK* OPTIMAL COMPLEX SERVICES COMPOSITION IN SOA SYSTEMS One of the ost iportant tasks in service oriented

More information

Secure Wireless Multihop Transmissions by Intentional Collisions with Noise Wireless Signals

Secure Wireless Multihop Transmissions by Intentional Collisions with Noise Wireless Signals Int'l Conf. Wireless etworks ICW'16 51 Secure Wireless Multihop Transissions by Intentional Collisions with oise Wireless Signals Isau Shiada 1 and Hiroaki Higaki 1 1 Tokyo Denki University, Japan Abstract

More information

An Ad Hoc Adaptive Hashing Technique for Non-Uniformly Distributed IP Address Lookup in Computer Networks

An Ad Hoc Adaptive Hashing Technique for Non-Uniformly Distributed IP Address Lookup in Computer Networks An Ad Hoc Adaptive Hashing Technique for Non-Uniforly Distributed IP Address Lookup in Coputer Networks Christopher Martinez Departent of Electrical and Coputer Engineering The University of Texas at San

More information

Support Vector Machine Based Robotic Traversability Prediction with Vision Features

Support Vector Machine Based Robotic Traversability Prediction with Vision Features International Journal of Coputational Intelligence Systes, Vol. 6, No. (July, 0), 96-608 Support Vector Machine Based Robotic Traversability Prediction with Vision Features Jianwei Cui, Yan Guo, Huatao

More information

A Low-Cost Multi-Failure Resilient Replication Scheme for High Data Availability in Cloud Storage

A Low-Cost Multi-Failure Resilient Replication Scheme for High Data Availability in Cloud Storage 216 IEEE 23rd International Conference on High Perforance Coputing A Low-Cost Multi-Failure Resilient Replication Schee for High Data Availability in Cloud Storage Jinwei Liu* and Haiying Shen *Departent

More information

Verdict Accuracy of Quick Reduct Algorithm using Clustering, Classification Techniques for Gene Expression Data

Verdict Accuracy of Quick Reduct Algorithm using Clustering, Classification Techniques for Gene Expression Data Verdict Accuracy of Quick Reduct Algorith using Clustering, Classification Techniques for Gene Expression Data T.Chandrasekhar 1, K.Thangavel 2 and E.N.Sathishkuar 3 1 Departent of Coputer Science, eriyar

More information

Cassia County School District #151. Expected Performance Assessment Students will: Instructional Strategies. Performance Standards

Cassia County School District #151. Expected Performance Assessment Students will: Instructional Strategies. Performance Standards Unit 1 Congruence, Proof, and Constructions Doain: Congruence (CO) Essential Question: How do properties of congruence help define and prove geoetric relationships? Matheatical Practices: 1. Make sense

More information

Enhancing Real-Time CAN Communications by the Prioritization of Urgent Messages at the Outgoing Queue

Enhancing Real-Time CAN Communications by the Prioritization of Urgent Messages at the Outgoing Queue Enhancing Real-Tie CAN Counications by the Prioritization of Urgent Messages at the Outgoing Queue ANTÓNIO J. PIRES (1), JOÃO P. SOUSA (), FRANCISCO VASQUES (3) 1,,3 Faculdade de Engenharia da Universidade

More information

DEBARRED OBJECTS RECOGNITION BY PFL OPERATOR

DEBARRED OBJECTS RECOGNITION BY PFL OPERATOR DEBARRED OBJECTS RECOGNITION BY PFL OPERATOR Manish Kuar Srivastava,Madan Kushwah, Abhishe uar 3 Assistant Professor, Departent of CSE, Lal Bahadur Shastri Group of Institutions Lucnow, Uttar Pradesh,

More information

Modal Masses Estimation in OMA by a Consecutive Mass Change Method.

Modal Masses Estimation in OMA by a Consecutive Mass Change Method. Modal Masses Estiation in OMA by a Consecutive Mass Change Method. F. Pelayo University of Oviedo, Departent of Construction and Manufacturing Engineering, Gijón, Spain M. López-Aenlle University of Oviedo,

More information

AN APPROACH ON BIMODAL BIOMETRIC SYSTEMS

AN APPROACH ON BIMODAL BIOMETRIC SYSTEMS AN APPROACH ON BIODAL BIOETRIC SYSTES Eugen LUPU, Siina EERICH Technical University of Cluj-Napoca, 26-28 Baritiu str. Cluj-Napoca phone: +40-264-40-266; fax: +40-264-592-055; e-ail: Eugen.Lupu @co.utcluj.ro

More information