arxiv: v2 [cs.cv] 9 Apr 2018

Size: px
Start display at page:

Download "arxiv: v2 [cs.cv] 9 Apr 2018"

Transcription

1 Boundary-senstve Network for Portrat Segmentaton Xanzh Du 1, Xaolong Wang 2, Dawe L 2, Jngwen Zhu 2, Serafettn Tasc 2, Cameron Uprght 2, Stephen Walsh 2, Larry Davs 1 1 Computer Vson Lab, UMIACS, Unversty of Maryland, College Park, Samsung Research Amerca, Mountan Vew, arxv: v2 [cs.cv] 9 Apr 2018 {xanzh,lsd}@umacs.umd.edu {xaolong.w,dawe.l,jngwen.z,s.tasc,c.uprght,s1.walsh}@samsung.com Abstract Compared to the general semantc segmentaton problem, portrat segmentaton has hgher precson requrement on boundary area. However, ths problem has not been well studed n prevous works. In ths paper, we propose a boundary-senstve deep neural network (BSN) for portrat segmentaton. BSN ntroduces three novel technques. Frst, an ndvdual boundary-senstve kernel s proposed by dlatng the contour lne and assgnng the boundary pxels wth mult-class labels. Second, a global boundarysenstve kernel s employed as a poston senstve pror to further constran the overall shape of the segmentaton map. Thrd, we tran a boundary-senstve attrbute classfer jontly wth the segmentaton network to renforce the network wth semantc boundary shape nformaton. We have evaluated BSN on the current largest publc portrat segmentaton dataset,.e., the PFCN dataset, as well as the portrat mages collected from other three popular mage segmentaton datasets: COCO, COCO-Stuff, and PAS- CAL VOC. Our method acheves the superor quanttatve and qualtatve performance over state-of-the-arts on all the datasets, especally on the boundary area. 1. Introducton Semantc segmentaton s a fundamental problem n computer vson communty whch ams to classfy pxels nto semantc categores. In ths paper, we target a specal bnary class semantc segmentaton problem, namely portrat segmentaton, whch generates pxel-wse predctons as foreground (.e., people) or background. Recently, t s becomng a hot topc and has been wdely used n many real-world applcatons, such as augmented realty (AR), background replacement, portrat stylzaton, depth of feld, advanced drver assstance systems [5], etc. Although numerous deep learnng based approaches (e.g., [6] [1] [19] [24] [16] [15] [14]) were proposed to solve the general semantc segmentaton problem, drect adaptaton of these methods cannot satsfy the hgh precson requrement n the portrat segmentaton problem. In portrat segmentaton, precse segmentaton around object boundares s crucal but challengng. For applcatons lke background replacement, accurate and smooth boundary segmentaton (such as har and clothes) s the key for better vsual effects. However, ths has long been one of the most challengng part of portrat segmentaton, especally when usng convolutonal neural networks (CNN). Snce the neghborhood of boundary pxels contans a mxture of both foreground and background labels, convolutonal flters fuse nformaton of dfferent classes, whch may confuse the network when segmentng boundary pxels. Prevous CNN based semantc segmentaton methods, whch use ether the conventonal hard-label method or gnore the boundary pxels durng tranng [6] [1] [10], fal to solve ths problem. These methods am to tran a better model to separate foreground and background whle sacrfcng the accuracy when predctng the boundary pxels. In ths paper, we propose a new boundary-senstve network (BSN) for more accurate portrat segmentaton. In contrast to conventonal semantc mage segmentaton systems, we dlate the contour lne of the portrat foreground and label the boundary pxels as the thrd class wth the proposed soft-label method. Two boundary-senstve kernels are ntroduced nto the loss functon to help the network learn better representatons for the boundary class as well as govern an overall shape of the portrat. The frst boundarysenstve kernel s desgned for each tranng mage such that a floatng pont vector s assgned as a soft label for each pxel n the boundary class. The second boundary-senstve kernel s a global kernel where each locaton n the kernel ndcates the probablty of the current locaton belongng to the boundary class. Furthermore, a boundary-senstve attrbute classfer s traned jontly wth the segmentaton network to renforce the tranng process. We evaluate our 1

2 Fgure 1: The whole archtecture of our framework. method on PFCN [21], the largest avalable portrat segmentaton dataset. Our method acheves the best quanttatve performance n mean IoU at 96.7%. In order to show the effectveness and generalzaton capablty of our method, we further test on the portrat mages collected from COCO [17], COCO-Stuff [3], PASCAL VOC [11] and the experment results demonstrate that our method sgnfcantly outperforms all other state-of-the-art methods. The rest of ths paper s organzed as follows: Secton 2 revews the prevous work on related problems. Secton 3 descrbes the general framework and the three boundarysenstve technques n detal. Secton 4 dscusses and analyzes the expermental results. Secton 5 draws conclusons and dscusses further work. 2. Related work Semantc segmentaton systems can be categorzed as unsupervsed methods and supervsed methods. Unsupervsed methods solve the semantc segmentaton problem wth classc machne learnng technques nclude thresholdng, hashng, K-means clusterng, topc model, graph-cut [22] [8] [9], etc. On the other hand, conventonal supervsed methods treat the semantc segmentaton problem as a pxel-wse classfcaton problem whch frst buld handcrafted features and then tran classfers such as Support Vector Machnes [12], Random Forest [2], etc. In recent years, convolutonal neural network (CNN) based methods have been successfully appled to semantc segmentaton. In 2014, Long et al. [18] ntroduced the endto-end Fully Convolutonal Networks (FCN) whch takes a natural mage as nput and performs dense pxel-wse predctons to generate a segmentaton map of the same sze as the nput mage. Fully connected layers are removed from ths network to preserve the spatal nformaton and deconvolutonal layers are proposed for up-samplng to recover the full mage sze. Ths paradgm popularzed the CNN based method and was quckly adopted by subsequent approaches. In tradtonal CNN archtectures, poolng layer was ntroduced to ncrease the receptve feld as the network goes deeper. However, t also decreases the resoluton of feature map. Yu et al. [23] proposed the dlated convolutonal layer to replace the poolng layer, whch allows for ncreasng the sze of the receptve feld wthout losng resoluton n feature maps. Chen et al. [7] proposed the DeepLab system whch passes multple rescaled nput mages to dfferent network branches n parallel and combnes the features maps wth max operaton at the end. Portrat segmentaton s generally regarded as a subproblem of semantc segmentaton, and t s dfferent from tradtonal segmentaton n two aspects. Frst, the foreground object s lmted to only people whch provdes addtonal pror nformaton. Meanwhle, portrat segmentaton has hgher precson requrement on boundary area. Shen et al. [21] fne-tuned a portrat segmentaton system from a pre-traned FCN network wth portrat mages. To provde more portrat-specfc nformaton to the network, two normalzed x and y poston channels and one mean mask shape channel are added to the nput mage. Shen et al. [20] proposed a jont correspondence and segmentaton estmaton method by usng extra nformaton provded by 2

3 Fgure 2: The kernel generatng process n our method: (a) represents the orgnal mage; (b) represents the detected contour lne; (c) shows the three class labels: foreground, background, and dlated boundary; (d) shows the ndvdual boundarysenstve kernel; (e) shows the global boundary-senstve kernel. dual-lens camera. Whle most methods can easly generate a rough segmentaton, they generally fal to provde precse segmentaton near the object boundares. For refnng the predctons near the boundares, the most commonly used soluton s employng Condtonal Random Felds (CRF) along wth CNN. Deeplab[7] employs dense CRF after CNN as a post processng method to smooth out the predctons. However, CRF s generally used as a post-precessng step and may be qute tme-consumng. 3. Boundary-senstve portrat segmentaton The archtecture of our framework s shown n Fgure 1. We use DeepLabv2 ResNet101 model as the base segmentaton network. DeepLabv2 ResNet101 conssts of three ResNet101 branches at the base whch process dfferent scales of the nput mage. Then the three branches are followed by the atrous spatal pyramd poolng (ASPP) at dfferent dlaton rates and fused together at the end. For more detals please refer to [7]. To make the model more senstve to a portrat s boundary, durng tranng, we label the tranng samples wth three non-overlappng classes: foreground, boundary, and background, usng the softlabel method descrbed below. One ndvdual boundarysenstve kernel and one global boundary-senstve kernel are ntroduced when updatng the loss functon, whch affect both the forward pass and the back-propagaton. The generaton process of the two kernels are shown n Fgure 2. Furthermore, an attrbute classfer whch shares the base layers wth BSN s traned jontly wth the segmentaton task to renforce the tranng process The ndvdual boundary-senstve kernel and the soft-label method To better address the boundary predcton problem, we ntroduce the ndvdual boundary-senstve kernel. We label the boundary class as a thrd class to separate from foreground and background classes and assgn soft-labels to pxels n the boundary class as follows. Frst, the portrat s contour lne s dentfed n the ground truth segmentaton map wth the Canny edge detector [4]. The contour s then dlated to be P-pxels n wdth and that map s overlayed onto the ground truth segmentaton map. We call the new label map the ndvdual boundary-senstve kernel. For each pxel n the kernel, a 1 3 floatng-pont vector K ndv = [l fg, l bdry, l bg ] s assgned as the soft-label to represent how lkely the current pxel belongs to each class. The K ndv s computed as Equatons (1) (2) (3). mn I C j C j C l bdry mn = I, f boundary k C j k C j C 0, f foreground 0, f background (1) 1(M fg)(1 l bdry l fg ), f boundary = 1, f foreground 0, f background (2) l bg = 1(M bg)(1 l bdry ), f boundary 0, f foreground 1, f background (3) 3

4 where mn C j C I C j represents the dstance from the current pxel I to the nearest pont on the contour lne C. M represents the bnary label of the current pxel n the orgnal label map M. We can see that pxels n the foreground/background class are labeled as [1, 0, 0]/[0, 0, 1] and pxels n the boundary class are labeled wth a floatng-pont vector. The soft-label method computes l bdry as the normalzed dstance from the current pxel to the nearest pont on the contour and sets l fg and l bg to ether (1 l bdry ) or 0 based on the class label of the current pxel n the ground truth segmentaton map. Durng the forward pass for each pxel n one sample, the new formula for updatng the loss functon can be expressed as Equaton (4): ɛ = j=1 j log( ezj K ndv k e z k ) = j=1 K ndv j log(y j ) (4) where l j denotes the soft-label for class j and y j = e zj / e z k denotes the softmax probablty for ths class. c k represents all the three classes. The new back-propagaton for ths sample can be derved as n Equaton (5): ɛ z = j=1 = ( Kndv y = ( Kndv y K ndv j log(y j) z y z + = (K ndv y j Kndv j y j y (1 y ) j=1 j y j z ) l j ) = (K ndv y ) K ndv j y j (y j y )) (5) The last step holds snce the soft-label vector sums to one. By usng the soft-label method, we can see that boundary pxels contrbute not only to the boundary class but also to the foreground/background class n a weghted manner based on how close t s to the contour lne The global boundary-senstve kernel By the nature of algned portrat mages, t s lkely that some locatons n the mage, such as the upper corner pxels, should belong to the background wth very hgh probabltes whle some other locatons, such as the mddle bottom pxels, should belong to the foreground wth hgh probabltes. These pxels should be more easly classfed, whle pxels n between should be harder to classfy. We estmate a poston senstve pror from the tranng data. We desgn a global boundary-senstve kernel to gude the network to learn a better shape predcton specfcally for portrat mages. The global kernel s desgned as follows. Frst, a mean mask M s computed usng the average of all ground truth segmentaton maps from the tranng samples. Ths generates a probablty map where the value at each locaton ndcates how lkely the current locaton belongs to foreground/background. Second, Equaton (6) s employed to generate the global boundary-senstve kernel. All the values are normalzed to range [a, b]. A larger value close to b n the global kernel ndcates that the current locaton has a hgher probablty to be boundary. In other words, ths locaton should be more dffcult for the network to classfy. To force the network to focus more on the possble boundary locatons, we weght the locatons wth ther correspondng kernel values when updatng the loss functon. When performng the forward pass for one pxel locaton n one sample, we update the loss functon as equatons (7) K global = b (1 ɛ = K global s j M 0.5 ) (b a) (6) 0.5 1(j = c) log(y j ) (7) where Ks global denotes the global kernel value at the pxel locaton s. g denotes the ground truth class label for the current pxel locaton. Durng back-propagaton, the new gradent s computed as Equaton (8): ɛ = Ks global z j = K global s 1(j = g) log(y j) z ( 1 y y z ) = K global s ( 1 y y (1( = g) y )) = K global s (1( = g) y ) From the new forward pass and back-propagaton functons we can see that the pxels that are more lkely to be located n the boundary (e.g., the pxels lyng wthn the brghter regon n Fgure 2 (e)) are weghted hgher so that they contrbute more to the loss. Ths gudes the network to be more senstve to the dffcult locatons The boundary-senstve attrbute classfer Portrat attrbutes such as long/short har play an mportant role n determnng a portrat s shape. Tranng a network whch s capable of classfyng boundary-senstve attrbutes wll gve more pror nformaton to the system, whch further makes the system more accurate and effcent on boundary predcton. Motvated by ths dea, we tran an attrbute classfer jontly wth the portrat segmentaton network for mult-task learnng. An example of how the (8) 4

5 har style attrbute changes the boundary shape s shown n Fgure 3. the attrbute classfer to tran the model for another 20K teratons. The whole network s bult wth the Caffe deep learnng framework [13]. Durng testng, we gnore the boundary class and the attrbute classfer. Only probabltes from foreground and background classes are used for segmentaton. Mean IoU: The standard mean Intersecton-over-Unon (IoU) metrc s used to evaluate the segmentaton performance. The mean IoU s computed as followng. N IoU = X Aseg Agt 1 seg gt N A A (9) and Agt where Aseg represent the area of the segmentaton results and the ground-truth label mask for the th testng sample, respectvely. Fgure 3: An example of how boundary-senstve attrbutes affect the portrat s shape: long har vs. short har Results on the PFCN dataset To desgn the attrbute classfer, the base layers from conv1 1 to pool5 are shared between the segmentaton network and the classfer. Above ths, for each channel, we add three more fully connected layers. The frst two fully connected layers have 1024 neurons and are followed by a dropout layer and a ReLU layer. The last fully connected layer has two neurons for bnary classfcaton. We evaluate the proposed method on the largest publcly avalable portrat segmentaton dataset [21]. Ths dataset s collected from Flckr and manually labeled wth varatons n age, pose, appearance, background, lghtng condton, har style, accessory, etc. Most of the portrat mages are captured by the frontal cameras of moble phones. Ths dataset conssts of 1800 portrat mages whch are splt nto 1500 tranng mages and 300 testng mages. All the mages are scaled and cropped nto sze In one portrat mage, the pxels are labeled as ether foreground or background. We wll refer to ths dataset as PFCN dataset. Some sample mages from the PFCN dataset are gven n Fgure Experments and results analyss 4.1. Tranng settngs and evaluaton settngs Model detals: To tran our portrat segmentaton system, we fne-tune the DeepLabv2 ResNet101 model usng the tranng set of the PFCN dataset. We wll ntroduce ths dataset n the next subsecton. There are three ResNet branches n DeepLabv2. In each branch, 4 atrous convoluton layers are added n parallel wth dlaton factors [6, 12, 18, 24] and then summed together to produce the fnal feature map. Element-wse max operaton s performed at the end over the three branches to produce the fnal predcton. To generate the ndvdual kernel, we dlate the contour lne to 10-pxels n wdth and label the dlated boundary usng the soft-label method. We select the weght range n the global kernel as [0.9, 1]. Followng PortratFCN+, n addton to the three RGB channels, we add two normalzed x and y poston channels and one mean mask shape channel nto the nput. For more detals please refer to [21]. At each teraton, a random patch of sze s cropped out from the orgnal mage and randomly flpped wth probablty 0.5 for data augmentaton. Then the nput mage s rescaled by factors of [0.5, 0.75, 1.0] as the new nput mages to the three branches of the DeepLabv2 network. To tran the attrbute classfer, we label the tranng mages nto long/short har classes. We use Stochastc Gradent Descent (SGD) wth a learnng rate of 2.5e 4 to tran the model for 20K teratons wthout the attrbute classfer. Then we decrease the learnng rate by a factor of 10 and add Fgure 4: Sample mages from the PFCN dataset. We compare wth the state-of-the-art method reported on ths dataset: PortratFCN+ [21] and the 5

6 Dataset COCO portrat COCO-Stuff portrat PASCAL VOC portrat Num. of Portrat Table 2: Statstcs of the three portrat datasets Evaluaton on other datasets: Snce the performance on the PFCN dataset s pretty hgh and data for the boundary class s unbalanced compare to foreground/background, a good performance on boundary segmentaton may only lead to margnal mprovement n mean IoU on ths dataset. Thus we further test our method on the portrat mages collected from three more popular semantc segmentaton datasets to evaluate the effectveness of our boundary-senstve technques. COCO portrat: We automatcally collect all the portrat and portrat-lke mages from the COCO dataset. We run a face detector over the dataset and keep the mages only contanng one person where the face area covers at least 10% of the whole mage. There are 626 mages n total wth ground truth segmentaton maps. We wll refer to ths dataset as COCO portrat. COCO portrat s more challengng than the PFCN data n varous ways such as large pose varatons, large occlusons, unlabeled ndvduals appear on the background, large porton of background, dfferent knds of accessores, etc. COCO-Stuff portrat: The COCO-Stuff dataset augments the COCO dataset wth refned pxel-level stuff annotatons on 10K mages. We collect 92 portrat and portratlke mages from ths dataset. The qualty of mages n ths dataset are same as COCO portrat. We wll refer to ths dataset as COCO-Stuff portrat. Pascal VOC portrat: We use the same method to collect portrat and portrat-lke mages from the Pascal VOC 2007, 2008, and 2012 datasets. Due to the lack of ground truth segmentaton maps on ths dataset, 62 mages are collected. The mages n ths dataset are also challengng and unconstraned. We wll refer to ths dataset as PASCAL VOC portrat. Some sample mages from the three datasets are llustrated n Fgure 5 and the statstcs are gven n Table 2. To test the generalzaton capablty of our model, we drectly test on these three datasets wthout fne-tunng. We acheve 77.7% mean IoU, 72.0% mean IoU, and 75.6% mean IoU on COCO portrat, COCO-stuff portrat, and PASCAL VOC portrat, respectvely. We sgnfcantly outperform PortratFCN+ on all the three datasets. The result comparsons are llustrated n Table 3. Snce the DeepLabv2 model s traned on these dataset, we can not compare wth t drectly. Fgure 5: Sample mages from COCO, COCO-Stuff, and Pascal VOC portrat datasets. Method Graph-cut PortratFCN+ PortratDeepLabv2 BSN AC (ours) BSN GK (ours) BSN IK (ours) BSN (ours) Mean IoU 80.0% 95.9% 96.1% 96.2% 96.2% 96.5% 96.7% Table 1: Quanttatve performance comparsons on the PFCN dataset. DeepLabv2 ResNet101 fne-tuned model, whch we wll refer to as PortratDeepLabv2. The PortratDeepLabv2 model s fne-tuned usng the same 6-channel tranng data as PortratFCN+ and the same tranng settngs as BSN. For ablaton study, we report results of four models from our work: tran wth the attrbute classfer only (BSN AC), tran wth the global boundary-senstve kernel only (BSN GK), tran wth the ndvdual boundarysenstve kernel only (BSN IK), and the all-n-one model (BSN). Our fnal model acheves the state-of-the-art mean IoU at 96.7%. The quanttatve result comparson s gven n Table 1. Result from graph-cut [22] s shown as the baselne. 6

7 Fgure 6: Result vsualzatons of three challengng examples. The frst row shows contans confusng objects n the background; the second row ncludes multple people n the background; n the thrd row the background color s close to the foreground. Method COCO PortratFCN+ BSN (ours) 68.6% 77.7% COCOStuff 60.8% 72.0% PASCAL VOC 59.5% 75.6% 4.4. Result analyss Results vsualzaton on challengng scenaros We vsualze the overall performance of our BSN model compared to DeepLabv2 and PortratFCN+ usng three challengng scenaros: confusng objects n the background, multple people appear n the mage, and the background color theme s close to the foreground. Fgure 6 shows that our model s more accurate and robust than other methods even under challengng condtons. Table 3: Quanttatve performance comparsons on COCO portrat, COCO-Stuff portrat and Pascal VOC portrat datasets Accurate boundary segmentaton Our method also delvers more precse boundary predctons thanks to ts novel boundary-senstve segmentaton 7

8 technques. Fgure 7 shows the comparson of our method wth DeepLabv2 and PortratFCN+ n three challengng scenaros: har segmentaton, accessory segmentaton and ear segmentaton. Results reveal that whle other methods have dffculty n segmentng accessores and small body parts, our method can provde a smooth and accurate segmentaton. Fgure 8: Trmaps generated from our segmentaton maps. Fgure 7: Boundary segmentaton comparsons. The frst column are the orgnal mages. The three subsequent columns represent the results from the PortratFCN+ method, the fne-tuned DeepLabv2 model wth the attrbute classfer, and our fnal model (magnfed for best vewng) Generatng trmap for mage mattng Snce our method can delver an accurate boundary predcton, t s a natural extenson to generate trmaps for mage mattng models. After performng segmentaton, we use the same technque durng tranng to dlate the boundary pxels to 10-pxels n wdth. Several examples are shown n Fgure Applcatons of portrat segmentaton Fgure 9: Some applcatons of portrat segmentaton. Portrat segmentaton has been wdely used n varous mage processng applcatons such as background replacement, depth of feld, augmented realty, mage cartoonzaton, etc. We show some applcatons n Fgure 9. 8

9 5. Concluson and dscusson We present a boundary-senstve portrat segmentaton system. Two boundary-senstve kernels are ntroduced nto the loss functon. One gves more boundary nformaton for each ndvdual mage and one governs the overall shape of portrat predcton. An attrbute classfer s traned jontly wth the segmentaton network to renforce the tranng process. Experments are conducted on the largest publcly avalable portrat segmentaton dataset as well as portrat mages collected from other three popular semantc segmentaton datasets. We outperform the prevous state-ofthe-arts n both quanttatve performance and vsual performance. For future work, we would lke to extend our boundarysenstve methods to general semantc segmentaton problem and explore more semantc attrbutes to renforce the tranng process. References [1] V. Badrnarayanan, A. Kendall, and R. Cpolla. Segnet: A deep convolutonal encoder-decoder archtecture for mage segmentaton. CoRR, abs/ , [2] L. Breman. Random forests. Machne Learnng, 45(1):5 32, Oct [3] H. Caesar, J. R. R. Ujlngs, and V. Ferrar. Coco-stuff: Thng and stuff classes n context. CoRR, abs/ , [4] J. Canny. A computatonal approach to edge detecton. IEEE Transactons on Pattern Analyss and Machne Intellgence, PAMI-8(6): , Nov [5] B. Chen, Z. Yang, S. Huang, X. Du, Z. Cu, J. Bhman, and N. M. Cyber-physcal system enabled nearby traffc flow modellng for autonomous vehcles. 36th IEEE Internatonal Performance Computng and Communcatons Conference, Specal Sesson on Cyber Physcal Systems: Securty, Computng, and Performance (IPCCC-CPS). IEEE, [6] L. Chen, G. Papandreou, F. Schroff, and H. Adam. Rethnkng atrous convoluton for semantc mage segmentaton. CoRR, abs/ , [7] L.-C. Chen, G. Papandreou, I. Kokknos, K. Murphy, and A. L. Yulle. Deeplab: Semantc mage segmentaton wth deep convolutonal nets, atrous convoluton, and fully connected crfs. arxv: , [8] X. Du, W. Abdalmageed, and D. Doermann. Large-scale sgnature matchng usng mult-stage hashng. In th Internatonal Conference on Document Analyss and Recognton, pages , Aug [9] X. Du, D. Doermann, and W. AbdAlmageed. A graphcal model approach for matchng partal sgnatures. In 2015 IEEE Conference on Computer Vson and Pattern Recognton (CVPR), pages , June [10] X. Du, M. El-Khamy, J. Lee, and L. Davs. Fused dnn: A deep neural network fuson approach to fast and robust pedestran detecton. In 2017 IEEE Wnter Conference on Applcatons of Computer Vson (WACV), pages , March [11] M. Everngham, L. Van Gool, C. K. I. Wllams, J. Wnn, and A. Zsserman. The pascal vsual object classes (voc) challenge. Internatonal Journal of Computer Vson, 88(2): , June [12] M. A. Hearst, S. T. Dumas, E. Osuna, J. Platt, and B. Scholkopf. Support vector machnes. IEEE Intellgent Systems and ther Applcatons, 13(4):18 28, July [13] Y. Ja, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Grshck, S. Guadarrama, and T. Darrell. Caffe: Convolutonal archtecture for fast feature embeddng. arxv preprnt arxv: , [14] D. L, T. Salonds, N. V. Desa, and M. C. Chuah. Deepcham: Collaboratve edge-medated adaptve deep learnng for moble object recognton. In 2016 IEEE/ACM Symposum on Edge Computng (SEC), pages 64 76, Oct [15] D. L, X. Wang, and D. Kong. Deeprebrth: Acceleratng deep neural network executon on moble devces. CoRR, abs/ , [16] G. Ln, A. Mlan, C. Shen, and I. D. Red. Refnenet: Multpath refnement networks for hgh-resoluton semantc segmentaton. CoRR, abs/ , [17] T. Ln, M. Mare, S. J. Belonge, L. D. Bourdev, R. B. Grshck, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Ztnck. Mcrosoft COCO: common objects n context. CoRR, abs/ , [18] J. Long, E. Shelhamer, and T. Darrell. Fully convolutonal networks for semantc segmentaton. In The IEEE Conference on Computer Vson and Pattern Recognton (CVPR), June [19] C. Peng, X. Zhang, G. Yu, G. Luo, and J. Sun. Large kernel matters - mprove semantc segmentaton by global convolutonal network. CoRR, abs/ , [20] X. Shen, H. Gao, X. Tao, C. Zhou, and J. Ja. Hgh-qualty correspondence and segmentaton estmaton for dual-lens smart-phone portrats. arxv preprnt arxv: , [21] X. Shen, A. Hertzmann, J. Ja,, S. Pars, B. Prce, E. Shechtman, and I. Sachs. Automatc portrat segmentaton for mage stylzaton. Computer Graphcs Forum, 35(3):93 102, [22] J. Sh and J. Malk. Normalzed cuts and mage segmentaton. IEEE Transactons on Pattern Analyss and Machne Intellgence, 22(8): , Aug [23] F. Yu and V. Koltun. Mult-scale context aggregaton by dlated convolutons. In ICLR, [24] H. Zhao, J. Sh, X. Q, X. Wang, and J. Ja. Pyramd scene parsng network. CoRR, abs/ ,

SEMANTIC image segmentation has been one of the most

SEMANTIC image segmentation has been one of the most 1 Stacked Deconvolutonal Network for Semantc Segmentaton Jun Fu, Jng Lu, Member, IEEE, Yuhang Wang, and Hanqng Lu, Senor Member, IEEE arxv:1708.04943v1 [cs.cv] 16 Aug 2017 Abstract Recent progress n semantc

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS P.G. Demdov Yaroslavl State Unversty Anatoly Ntn, Vladmr Khryashchev, Olga Stepanova, Igor Kostern EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS Yaroslavl, 2015 Eye

More information

Joint Object and Part Segmentation using Deep Learned Potentials

Joint Object and Part Segmentation using Deep Learned Potentials Jont Object and Part Segmentaton usng Deep Learned Potentals Peng Wang 1 Xaohu Shen 2 Zhe Ln 2 Scott Cohen 2 Bran Prce 2 Alan Yulle 1 1 Unversty of Calforna, Los Angeles 2 Adobe Research Abstract Segmentng

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY SSDH: Sem-supervsed Deep Hashng for Large Scale Image Retreval Jan Zhang, and Yuxn Peng arxv:607.08477v2 [cs.cv] 8 Jun 207 Abstract Hashng

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

(a) Input data X n. (b) VersNet. (c) Output data Y n. (d) Supervsed data D n. Fg. 2 Illustraton of tranng for proposed CNN. 2. Related Work In segment

(a) Input data X n. (b) VersNet. (c) Output data Y n. (d) Supervsed data D n. Fg. 2 Illustraton of tranng for proposed CNN. 2. Related Work In segment 一般社団法人電子情報通信学会 THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS 信学技報 IEICE Techncal Report SANE2017-92 (2018-01) Deep Learnng for End-to-End Automatc Target Recognton from Synthetc

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng, Natonal Unversty of Sngapore {shva, phanquyt, tancl }@comp.nus.edu.sg

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection 2009 10th Internatonal Conference on Document Analyss and Recognton A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng,

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Face Detection with Deep Learning

Face Detection with Deep Learning Face Detecton wth Deep Learnng Yu Shen Yus122@ucsd.edu A13227146 Kuan-We Chen kuc010@ucsd.edu A99045121 Yzhou Hao y3hao@ucsd.edu A98017773 Mn Hsuan Wu mhwu@ucsd.edu A92424998 Abstract The project here

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Audio Event Detection and classification using extended R-FCN Approach. Kaiwu Wang, Liping Yang, Bin Yang

Audio Event Detection and classification using extended R-FCN Approach. Kaiwu Wang, Liping Yang, Bin Yang Audo Event Detecton and classfcaton usng extended R-FCN Approach Kawu Wang, Lpng Yang, Bn Yang Key Laboratory of Optoelectronc Technology and Systems(Chongqng Unversty), Mnstry of Educaton, ChongQng Unversty,

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

CSCI 5417 Information Retrieval Systems Jim Martin!

CSCI 5417 Information Retrieval Systems Jim Martin! CSCI 5417 Informaton Retreval Systems Jm Martn! Lecture 11 9/29/2011 Today 9/29 Classfcaton Naïve Bayes classfcaton Ungram LM 1 Where we are... Bascs of ad hoc retreval Indexng Term weghtng/scorng Cosne

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Comparing Image Representations for Training a Convolutional Neural Network to Classify Gender

Comparing Image Representations for Training a Convolutional Neural Network to Classify Gender 2013 Frst Internatonal Conference on Artfcal Intellgence, Modellng & Smulaton Comparng Image Representatons for Tranng a Convolutonal Neural Network to Classfy Gender Choon-Boon Ng, Yong-Haur Tay, Bok-Mn

More information

Focal Loss in 3D Object Detection

Focal Loss in 3D Object Detection 1 Focal Loss n 3D Object Detecton eng Yun1 Le Ta2 Yuan Wang2 Chengju Lu3 Mng Lu2 Fg. 1. Upper two rows show projected 3D object detecton results from the detector traned wth bnary cross entropy. Lower

More information

A B-Snake Model Using Statistical and Geometric Information - Applications to Medical Images

A B-Snake Model Using Statistical and Geometric Information - Applications to Medical Images A B-Snake Model Usng Statstcal and Geometrc Informaton - Applcatons to Medcal Images Yue Wang, Eam Khwang Teoh and Dnggang Shen 2 School of Electrcal and Electronc Engneerng, Nanyang Technologcal Unversty

More information

Deep learning is a good steganalysis tool when embedding key is reused for different images, even if there is a cover source-mismatch

Deep learning is a good steganalysis tool when embedding key is reused for different images, even if there is a cover source-mismatch Deep learnng s a good steganalyss tool when embeddng key s reused for dfferent mages, even f there s a cover source-msmatch Lonel PIBRE 2,3, Jérôme PASQUET 2,3, Dno IENCO 2,3, Marc CHAUMONT 1,2,3 (1) Unversty

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Learning-based License Plate Detection on Edge Features

Learning-based License Plate Detection on Edge Features Learnng-based Lcense Plate Detecton on Edge Features Wng Teng Ho, Woo Hen Yap, Yong Haur Tay Computer Vson and Intellgent Systems (CVIS) Group Unverst Tunku Abdul Rahman, Malaysa wngteng_h@yahoo.com, woohen@yahoo.com,

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 4, APRIL

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 4, APRIL IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 4, APRIL 2016 1713 Weakly Supervsed Fne-Graned Categorzaton Wth Part-Based Image Representaton Yu Zhang, Xu-Shen We, Janxn Wu, Member, IEEE, Janfe Ca,

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction Encodng Crowd Interacton wth Deep Neural Network for Pedestran Trajectory Predcton Yanyu Xu ShanghaTech Unversty xuyy2@shanghatech.edu.cn Zhxn Pao ShanghaTech Unversty paozhx@shanghatech.edu.cn Shenghua

More information

Sequential Monte-Carlo Based Road Region Segmentation Algorithm with Uniform Spatial Sampling

Sequential Monte-Carlo Based Road Region Segmentation Algorithm with Uniform Spatial Sampling IPSJ Transactons on Computer Vson and Applcatons Vol.8 1 10 (Feb. 2016) [DOI: 10.2197/psjtcva.8.1] Regular Paper Sequental Monte-Carlo Based Road Regon Segmentaton Algorthm wth Unform Spatal Samplng Zdeněk

More information

Research of Image Recognition Algorithm Based on Depth Learning

Research of Image Recognition Algorithm Based on Depth Learning 208 4th World Conference on Control, Electroncs and Computer Engneerng (WCCECE 208) Research of Image Recognton Algorthm Based on Depth Learnng Zhang Jan, J Xnhao Zhejang Busness College, Hangzhou, Chna,

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Vol. 5, No. 3 March 2014 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

Vol. 5, No. 3 March 2014 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved. Journal of Emergng Trends n Computng and Informaton Scences 009-03 CIS Journal. All rghts reserved. http://www.csjournal.org Unhealthy Detecton n Lvestock Texture Images usng Subsampled Contourlet Transform

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

A Compositional Exemplar-Based Model for Hair Segmentation

A Compositional Exemplar-Based Model for Hair Segmentation A Compostonal Exemplar-Based Model for Har Segmentaton Nan Wang 1, Hazhou A 1, and Shhong Lao 2 1 Computer Scence & Technology Department, Tsnghua Unversty, Bejng, Chna ahz@mal.tsnghua.edu.cn 2 Core Technology

More information

Combined Object Detection and Segmentation

Combined Object Detection and Segmentation Combned Object Detecton and Segmentaton Jarch Vansteenberge, Masayuk Mukunok, and Mchhko Mnoh Abstract We develop a method for combned object detecton and segmentaton n natural scene. In our approach segmentaton

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

Efficient Inference for Fully-Connected CRFs with Stationarity

Efficient Inference for Fully-Connected CRFs with Stationarity Effcent Inference for Fully-onnected RFs wth Statonarty Ymeng Zhang Tsuhan hen School of Electrcal and omputer Engneerng, ornell Unversty {yz457,tsuhan}@cornell.edu Abstract The ondtonal Random Feld (RF)

More information

Font Recognition in Natural Images via Transfer Learning

Font Recognition in Natural Images via Transfer Learning Font Recognton n Natural Images va Transfer Learnng Yzh Wang, Zhouhu Lan, Yngmn Tang, and Janguo Xao Insttute of Computer Scence and Technology, Pekng Unversty Abstract. Font recognton s an mportant and

More information

arxiv: v2 [cs.cv] 3 Aug 2017

arxiv: v2 [cs.cv] 3 Aug 2017 Swtchng Convolutonal Neural Network for Crowd Countng Deepak Babu Sam Shv Surya R. Venkatesh Babu Indan Insttute of Scence Bangalore, INDIA 560012 arxv:1708.00199v2 [cs.cv] 3 Aug 2017 bsdeepak@grads.cds.sc.ac.n,

More information

Lecture 13: High-dimensional Images

Lecture 13: High-dimensional Images Lec : Hgh-dmensonal Images Grayscale Images Lecture : Hgh-dmensonal Images Math 90 Prof. Todd Wttman The Ctadel A grayscale mage s an nteger-valued D matrx. An 8-bt mage takes on values between 0 and 55.

More information

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation GeoNet: Geometrc Neural Network for Jont Depth and Surface Normal Estmaton Xaojuan Q Renje Lao, Zhengzhe Lu Raquel Urtasun, Jaya Ja, The Chnese Unversty of Hong Kong Unversty of Toronto Uber Advanced Technologes

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Deep GrabCut for Object Selection

Deep GrabCut for Object Selection NING XU, BRIAN PRICE, SCOTT COHEN, JIMEI YANG, THOMAS HUANG: 1 arxv:1707.00243v2 [cs.cv] 14 Jul 2017 Deep GrabCut for Object Selecton Nng Xu 1 nngxu2@llnos.edu Bran Prce 2 bprce@adobe.com Scott Cohen 2

More information

Lower Body Pose Estimation in Team Sports Videos Using Label-Grid Classifier Integrated with Tracking-by-Detection

Lower Body Pose Estimation in Team Sports Videos Using Label-Grid Classifier Integrated with Tracking-by-Detection Informaton and Meda Technologes 10(2): 246-258 (2015) reprnted from: IPSJ Transactons on Computer Vson and Applcatons 7: 18-30 (2015) Informaton Processng Socety of Japan Research Paper Lower Body Pose

More information

Multi-view 3D Position Estimation of Sports Players

Multi-view 3D Position Estimation of Sports Players Mult-vew 3D Poston Estmaton of Sports Players Robbe Vos and Wlle Brnk Appled Mathematcs Department of Mathematcal Scences Unversty of Stellenbosch, South Afrca Emal: vosrobbe@gmal.com Abstract The problem

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features A Probablstc Approach to Detect Urban Regons from Remotely Sensed Images Based on Combnaton of Local Features Berl Sırmaçek German Aerospace Center (DLR) Remote Sensng Technology Insttute Weßlng, 82234,

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

GA-Based Learning Algorithms to Identify Fuzzy Rules for Fuzzy Neural Networks

GA-Based Learning Algorithms to Identify Fuzzy Rules for Fuzzy Neural Networks Seventh Internatonal Conference on Intellgent Systems Desgn and Applcatons GA-Based Learnng Algorthms to Identfy Fuzzy Rules for Fuzzy Neural Networks K Almejall, K Dahal, Member IEEE, and A Hossan, Member

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET 1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Gender Classification using Interlaced Derivative Patterns

Gender Classification using Interlaced Derivative Patterns Gender Classfcaton usng Interlaced Dervatve Patterns Author Shobernejad, Ameneh, Gao, Yongsheng Publshed 2 Conference Ttle Proceedngs of the 2th Internatonal Conference on Pattern Recognton (ICPR 2) DOI

More information

Online Detection and Classification of Moving Objects Using Progressively Improving Detectors

Online Detection and Classification of Moving Objects Using Progressively Improving Detectors Onlne Detecton and Classfcaton of Movng Objects Usng Progressvely Improvng Detectors Omar Javed Saad Al Mubarak Shah Computer Vson Lab School of Computer Scence Unversty of Central Florda Orlando, FL 32816

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Pictures at an Exhibition

Pictures at an Exhibition 1 Pctures at an Exhbton Stephane Kwan and Karen Zhu Department of Electrcal Engneerng Stanford Unversty, Stanford, CA 9405 Emal: {skwan1, kyzhu}@stanford.edu Abstract An mage processng algorthm s desgned

More information

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents Supplementary Materal DESIRE: Dstant Future Predcton n Dynamc Scenes wth Interactng Agents Namhoon Lee 1, Wongun Cho 2, Paul Vernaza 2, Chrstopher B. Choy 3, Phlp H. S. Torr 1, Manmohan Chandraker 2,4

More information

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier Some materal adapted from Mohamed Youns, UMBC CMSC 611 Spr 2003 course sldes Some materal adapted from Hennessy & Patterson / 2003 Elsever Scence Performance = 1 Executon tme Speedup = Performance (B)

More information

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE 1 Assocatng Groups of People We-Sh Zheng jason@dcs.qmul.ac.uk Shaogang Gong sgg@dcs.qmul.ac.uk Tao Xang txang@dcs.qmul.ac.uk School of EECS, Queen Mary Unversty

More information

Switching Convolutional Neural Network for Crowd Counting

Switching Convolutional Neural Network for Crowd Counting Swtchng Convolutonal Neural Network for Crowd Countng Deepak Babu Sam Shv Surya R. Venkatesh Babu Indan Insttute of Scence Bangalore, INDIA 560012 bsdeepak@grads.cds.sc.ac.n, shv.surya314@gmal.com, venky@cds.sc.ac.n

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Deep Classification in Large-scale Text Hierarchies

Deep Classification in Large-scale Text Hierarchies Deep Classfcaton n Large-scale Text Herarches Gu-Rong Xue Dkan Xng Qang Yang 2 Yong Yu Dept. of Computer Scence and Engneerng Shangha Jao-Tong Unversty {grxue, dkxng, yyu}@apex.sjtu.edu.cn 2 Hong Kong

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Transformation Networks for Target-Oriented Sentiment Classification ACL / 25

Transformation Networks for Target-Oriented Sentiment Classification ACL / 25 Transformaton Networks for Target-Orented Sentment Classfcaton 1 Xn L 1, Ldong Bng 2, Wa Lam 1, Be Sh 1 1 The Chnese Unversty of Hong Kong 2 Tencent AI Lab ACL 2018 1 Jont work wth Tencent AI Lab Transformaton

More information

Unsupervised object segmentation in video by efficient selection of highly probable positive features

Unsupervised object segmentation in video by efficient selection of highly probable positive features Unsupervsed object segmentaton n vdeo by effcent selecton of hghly probable postve features Emanuela Haller 1,2 and Marus Leordeanu 1,2 1 Unversty Poltehnca of Bucharest, Romana 2 Insttute of Mathematcs

More information

Face Recognition Based on SVM and 2DPCA

Face Recognition Based on SVM and 2DPCA Vol. 4, o. 3, September, 2011 Face Recognton Based on SVM and 2DPCA Tha Hoang Le, Len Bu Faculty of Informaton Technology, HCMC Unversty of Scence Faculty of Informaton Scences and Engneerng, Unversty

More information

AUTOMATIC RECOGNITION OF TRAFFIC SIGNS IN NATURAL SCENE IMAGE BASED ON CENTRAL PROJECTION TRANSFORMATION

AUTOMATIC RECOGNITION OF TRAFFIC SIGNS IN NATURAL SCENE IMAGE BASED ON CENTRAL PROJECTION TRANSFORMATION AUTOMATIC RECOGNITION OF TRAFFIC SIGNS IN NATURAL SCENE IMAGE BASED ON CENTRAL PROJECTION TRANSFORMATION Ka Zhang a, Yehua Sheng a, Pefang Wang b, Ln Luo c, Chun Ye a, Zhjun Gong d a Key Laboratory of

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Learning to Co-Generate Object Proposals with a Deep Structured Network

Learning to Co-Generate Object Proposals with a Deep Structured Network Learnng to Co-enerate Object Proposals wth a Deep Structured Network Zeeshan Hayder,2, Xumng He 2, Australan Natonal Unversty & 2 NICTA {zeeshan.hayder, xumng.he}@anu.edu.au atheu Salzmann,3 3 CVLab, EPFL,

More information

Face Recognition using 3D Directional Corner Points

Face Recognition using 3D Directional Corner Points 2014 22nd Internatonal Conference on Pattern Recognton Face Recognton usng 3D Drectonal Corner Ponts Xun Yu, Yongsheng Gao School of Engneerng Grffth Unversty Nathan, QLD, Australa xun.yu@grffthun.edu.au,

More information

Development of an Active Shape Model. Using the Discrete Cosine Transform

Development of an Active Shape Model. Using the Discrete Cosine Transform Development of an Actve Shape Model Usng the Dscrete Cosne Transform Kotaro Yasuda A Thess n The Department of Electrcal and Computer Engneerng Presented n Partal Fulfllment of the Requrements for the

More information

Augmented Distinctive Features for Efficient Image Matching

Augmented Distinctive Features for Efficient Image Matching Augmented Dstnctve Features for Effcent Image Matchng Quan Wang, We Guan and Suya You CGIT/IMSC USC Los Angeles, CA 90089 quanwang@usc.edu, wguan@usc.edu and suyay@graphcs.usc.edu Abstract Fndng correspondng

More information

Histogram of Template for Pedestrian Detection

Histogram of Template for Pedestrian Detection PAPER IEICE TRANS. FUNDAMENTALS/COMMUN./ELECTRON./INF. & SYST., VOL. E85-A/B/C/D, No. xx JANUARY 20xx Hstogram of Template for Pedestran Detecton Shaopeng Tang, Non Member, Satosh Goto Fellow Summary In

More information