(a) Input data X n. (b) VersNet. (c) Output data Y n. (d) Supervsed data D n. Fg. 2 Illustraton of tranng for proposed CNN. 2. Related Work In segment

Size: px
Start display at page:

Download "(a) Input data X n. (b) VersNet. (c) Output data Y n. (d) Supervsed data D n. Fg. 2 Illustraton of tranng for proposed CNN. 2. Related Work In segment"

Transcription

1 一般社団法人電子情報通信学会 THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS 信学技報 IEICE Techncal Report SANE ( ) Deep Learnng for End-to-End Automatc Target Recognton from Synthetc Aperture Radar Imagery Hdetosh FURUKAWA Toshba Infrastructure Systems & Solutons Corporaton 1 Komukatoshba-cho, Sawa-ku, Kawasak-sh, Kanagawa, Japan E-mal: hdetosh.furukawa@toshba.co.jp arxv: v1 [cs.cv] 25 Jan 2018 Abstract The standard archtecture of synthetc aperture radar (SAR) automatc target recognton (ATR) conssts of three stages: detecton, dscrmnaton, and classfcaton. In recent years, convolutonal neural networks (CNNs) for SAR ATR have been proposed, but most of them classfy target classes from a target chp extracted from SAR magery, as a classfcaton for the thrd stage of SAR ATR. In ths report, we propose a novel CNN for end-to-end ATR from SAR magery. The CNN named verfcaton support network (VersNet) performs all three stages of SAR ATR end-to-end. VersNet nputs a SAR mage of arbtrary szes wth multple classes and multple targets, and outputs a SAR ATR mage representng the poston, class, and pose of each detected target. Ths report descrbes the evaluaton results of VersNet whch traned to output scores of all 12 classes: 10 target classes, a target front class, and a background class, for each pxel usng the movng and statonary target acquston and recognton (MSTAR) publc dataset. Key words Automatc target recognton (ATR), Mult-target detecton, Mult-target classfcaton, Pose estmaton, Convolutonal neural network (CNN), Synthetc aperture radar (SAR) 1. Introducton Synthetc aperture radar (SAR) transmts mcrowaves and generates magery usng mcrowaves reflected from objects, under all weather, day and nght condtons. However, t s dffcult for a human to recognze a target from SAR magery snce there s no color nformaton and the shape reflected (a) Input. (b) VersNet. (c) Output. from a target changes. Therefore, automatc target recognton (ATR) from SAR magery (or mage) has been studed for many years. The standard archtecture of SAR ATR conssts of three Fg. 1 Illustraton of nput and output of proposed CNN. The CNN named VersNet performs automatc target recognton of mult-class / mult-target n varable sze SAR mage. In ths case, the nput s a sngle mage wth three classes and four targets (upper left and lower rght targets stages: detecton, dscrmnaton, and classfcaton. Detecton: the frst stage of SAR ATR detects a regon of nterest are the same class). VersNet outputs the poston, class, and pose (front sde) of each detected target. (ROI) from a SAR mage. Dscrmnaton: the second stage of SAR ATR dscrmnates whether an ROI s a target or non-target regon, and outputs the dscrmnated ROI as a target chp. Classfcaton: the thrd stage of SAR ATR classfes target classes from a target chp. In recent years, methods usng convoluton neural network (CNN) [1] [4] have been successful n the classfcaton of mage recognton. Smlarly, CNNs for SAR ATR have been proposed. On the movng and statonary target acquston and recognton (MSTAR) publc dataset [5], the target classfcaton accuracy of the CNNs [6] [9] exceeds conventonal methods (support vector machne, etc.). However, most of CNNs for SAR ATR classfy target classes from a target chp extracted from SAR mage but do not classfy multple targets or a target chp (or SAR mage) of an arbtrary sze. In addton, a CNN for target classfcaton can output score or probablty of each class as classfcaton result, but t s dffcult for a human to verfy the classfcaton result. We propose a new CNN whch nputs a SAR mage of varable szes wth mult-target and outputs a SAR ATR mage. 35 Ths artcle s a techncal report wthout peer revew, and ts polshed and/or extended verson may be publshed elsewhere. Copyrght 2018 by IEICE

2 (a) Input data X n. (b) VersNet. (c) Output data Y n. (d) Supervsed data D n. Fg. 2 Illustraton of tranng for proposed CNN. 2. Related Work In segmentaton of mage recognton gvng classfcaton label for each pxel of an mage, methods usng CNN [10] [12] show a good performance n recent years. For SAR mage, segmentaton of a target regon whch reflected from a target, and a shadow regon whch not reflected from a target by radar shadow s performed. The CNN [13], WD-CFAR [14], and other methods [15] [17] have been proposed for segmentaton of a SAR mage. The reference [18] descrbes manually generatng the segmentaton of target and shadow regons as ground truth. Generally, for segmentaton usng CNN, supervsed learnng s performed usng label mages correspondng to nput mages, but the dffculty of the generaton of label mages for SAR ATR s a problem for applyng ths method. In response to ths problem, the reference [13] descrbes the CNN whch traned to output a contour usng the contour data of target and shadow regons generated by computer graphcs as ground truth. In contrast, our proposed CNN performs target detecton, target classfcaton, and pose estmaton by segmentaton. 3. Proposed Method A proposed CNN named verfcaton support network (VersNet) nputs an arbtrary sze SAR mage wth multple classes and multple targets, and outputs the poston, class, and pose of each detected target as a SAR ATR mage. Fgure 1 shows the outlne of VersNet for end-to-end SAR ATR. VersNet s a CNN composed of an encoder and a decoder. The encoder of VersNet extracts features from an nput SAR mage. The decoder converts the features based on the converson rule n the tranng data and outputs t as a SAR ATR mage. Here, we defne the end-to-end SAR ATR as a task of supervsed learnng. Let {(X n,d n),n=1,..., N} be the tranng dataset, where X n = {x (n),=1,..., X n } s SAR mage as nput data, D n = {d (n), =1,..., D n,d (n) {1,..., N c}} Table 1 Dataset. The tranng and testng data contan respectvely 2747 and 2420 target chps from the MSTAR. Class Tranng data Testng data 2S BMP2 (9563) BRDM BTR BTR D T T72 (132) ZIL ZSU Total s label mage for X n, whch s the supervsed data of VersNet output data Y n = f(x n; θ). The values of X n and D n represent the number of pxels (vertcal horzontal) of SAR and label mage, respectvely. When d (n) s 1, t represents a background class, and when d (n) s 2 or more, t ndcates a correspondng target class. Let L(θ) be a loss functon, the network parameters θ are adjusted usng tranng data so that the output of loss functon becomes small. 4. Experments 4. 1 Dataset For tranng and testng of VersNet, we used the ten classes data shown n Table 1 from the MSTAR [5]. The dataset contans 2747 target chps wth a depresson angle of 17 for the tranng and 2420 target chps wth a depresson angle of 15 for the testng. As descrbed later n Appendx 1, fve target chps of target class BTR60 for testng data were excluded. Of course, label mages for segmentaton do not exst n the MSTAR dataset. Therefore, we create label mages for VersNet. Fgure 2(d) shows samples of label mages. The label mages have all 12 classes: 10 target classes, a target front class, and a background class. 1 Fve target chps were excluded. Appendx 1 shows the detals. 36

3 Table 2 Classfcaton accuracy of testng data. The overall accuracy s 99.55%. Class Accuracy (%) 2S BMP BRDM BTR BTR Average Overall D T T ZIL ZSU Table 3 Defntons of TP, FP, FN, and TN. True Condton postve Condton negatve Predcted Condton postve True postve (TP) False postve (FP) Condton negatve False negatve (FN) True negatve (TN) Fg. 3 Detal archtecture of VersNet for experments. The VersNet refers to the fully convolutonal network called FCN- 32s VersNet: Proposed CNN Fgure 3 shows a detaled archtecture of VersNet for experments. The encoder of the VersNet conssts of four convoluton blocks and two convoluton layers. The convoluton block contans two convoluton layers of kernel sze 3 3 and a poolng layer smlarly to VGG [19]. The actvaton functon of all convolutons except the fnal convoluton uses rectfed lnear unt (ReLU) [20]. Dropout [21] s appled after a convoluton of kernel sze 6 6. Batch normalzaton [22] s not appled. The decoder of the VersNet conssts of a transposed convoluton [23] that performs 16 tmes upsamplng. As the loss functon, we use cross entropy expressed by L(θ) = p(x) log q(x). (1) x For the optmzaton of the loss functon, we use stochastc gradent descent (SGD) wth momentum. Snce the VersNet s a CNN wthout fully connected layers called fully convolutonal network (FCN) [10], even f tranng s done wth small sze mages, the VersNet can process SAR mages of arbtrary sze Classfcaton Accuracy Frst, we show results of classfcaton accuracy. Table 2 shows classfcaton accuracy for the target chps of testng f we smply select the majorty class of maxmum probablty for each pxel from ten target classes as the predcted class. An average accuracy of ten target classes s 99.52%, and an overall accuracy s 99.55% (2409/2420), whch s almost the same as a state-of-the-art accuracy. Also, Table A 2 of Appendx shows a confuson matrx for the target chps of testng Segmentaton Performance Next, we show results of segmentaton performance. We use precson, recall, F 1, and ntersecton over unon (IoU) as metrcs of segmentaton performance. Each metrcs s gven by Precson = TP TP + FP, (2) TP Recall = TP + FN, (3) precson recall F 1 =2 precson + recall, (4) TP IoU = TP + FP + FN, (5) where the defntons of TP, FP, FN, and TN are shown n Table 3. Table 4 shows precson, recall, F 1, and IoU for all the pxels of testng. The average IoU of all 12 classes and 10 target classes are and 0.923, respectvely. Also, Table A 3 of Appendx shows a confuson matrx for all the pxels of testng. Fgure 4 shows a hstogram of IoU for each mage wth ten target classes. A mean and a standard devaton of the IoU are and 0.082, respectvely. 37

4 Table 4 Segmentaton performance for all pxels of testng. The average IoU of ten target classes s Class Precson Recall F 1 IoU Background S BMP BRDM BTR BTR D T T ZIL ZSU Front Average of Average of Fg. 4 Hstogram of IoU. Fg. 5 Cumulatve dstrbuton of IoU. Fgure 5 shows a cumulatve dstrbuton of IoU for each mage wth ten target classes. The emprcal cumulatve dstrbuton functon P(IoU 0.5) and P(IoU 0.9) are about 0.01 and 0.1, respectvely Mult-Class and Mult-Target Fnally, we show the VersNet output for mult-class and mult-target nput. Fgure 6 shows nput (SAR mage of 10 target classes and 25 targets), output (SAR ATR mage), and ground truth. 5. Concluson By applyng CNN to the thrd stage classfcaton n the standard archtecture of SAR ATR, the performance has been mproved. In order to mprove the overall performance of SAR ATR, t s mportant not only to mprove the performance of the thrd stage classfcaton but also to mprove the performance of the frst stage detecton and the second stage dscrmnaton. In ths report, we proposed a CNN based on a new archtecture of SAR ATR that conssts of a sngle stage,.e. endto-end, not the standard archtecture of SAR ATR. Unlke conventonal CNNs for target classfcaton, the CNN named VersNet nputs a SAR mage of arbtrary szes wth multple classes and multple targets, and outputs a SAR ATR mage representng the poston, class, and pose of each detected target. We traned the VersNet to output scores nclude ten target classes on MSTAR dataset and evaluated ts performance. The average IoU for all the pxels of testng (2420 target chps) s over 0.9. Also, the classfcaton accuracy s about 99.5%, f we select the majorty class of maxmum probablty for each pxel as the predcted class. References [1] A. Krzhevsky, I. Sutskever, and G.E. Hnton, Imagenet classfcaton wth deep convolutonal neural networks, Advances n neural nformaton processng systems, pp , [2] M.D. Zeler and R. Fergus, Vsualzng and understandng convolutonal networks, European conference on computer vson, pp , [3] C. Szegedy, W. Lu, Y. Ja, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabnovch, Gong deeper wth convolutons, Proceedngs of the IEEE conference on computer vson and pattern recognton, pp.1 9, [4] K. He, X. Zhang, S. Ren, and J. Sun, Deep resdual learnng for mage recognton, Proceedngs of the IEEE conference on computer vson and pattern recognton, pp , [5] T. Ross, S. Worrell, V. Velten, J. Mossng, and M. Bryant, Standard sar atr evaluaton experments usng the mstar publc release data set, Proc. SPIE, vol.3370, pp , [6] S. Chen, H. Wang, F. Xu, and Y.Q. Jn, Target classfcaton usng the deep convolutonal networks for sar mages, IEEE Transactons on Geoscence and Remote Sensng, vol.54, no.8, pp , [7] S. Wagner, Sar atr by a combnaton of convolutonal neu- 2 The average of all 12 classes. 3 The average of ten target classes. 38

5 (a) Input (SAR mage of 10 target classes and 25 targets). (b) Output (SAR ATR mage). (c) Ground truth. Fg. 6 Input (SAR mage of multple classes and multple targets), output (SAR ATR mage), and ground truth. ral network and support vector machnes, IEEE Transactons on Aerospace and Electronc Systems, vol.52, no.6, pp , [8] Y. Zhong and G. Ettnger, Enlghtenng deep neural networks wth knowledge of confoundng factors, arxv preprnt arxv: , [9] H. Furukawa, Deep learnng for target classfcaton from sar magery: Data augmentaton and translaton nvarance, IEICE Tech. Rep., vol.117, no.182, SANE , pp.13 17, [10] J. Long, E. Shelhamer, and T. Darrell, Fully convolutonal networks for semantc segmentaton, Proceedngs of the IEEE Conference on Computer Vson and Pattern Recognton, pp , [11] O. Ronneberger, P. Fscher, and T. Brox, U-net: Convolutonal networks for bomedcal mage segmentaton, Internatonal Conference on Medcal Image Computng and Computer-Asssted Interventon, pp , [12] V. Badrnarayanan, A. Kendall, and R. Cpolla, Segnet: A deep convolutonal encoder-decoder archtecture for mage segmentaton, arxv preprnt arxv: , [13] D. Malmgren-Hansen and M. Nobel-J, Convolutonal neural networks for sar mage segmentaton, 2015 IEEE Internatonal Symposum on Sgnal Processng and Informaton Technology (ISSPIT), pp , [14] S. Huang and T.Z. Wenzhun Huang, A new sar mage segmentaton algorthm for the detecton of target and shadow regons, Scentfc reports 6, Artcle number: 38596, [15] Y. Han, Y. L, and W. Yu, Sar target segmentaton based on shape pror, 2014 IEEE Internatonal Geoscence and Remote Sensng Symposum (IGARSS), pp , [16] E. Atnour, S. Wang, and D. Zou, Segmentaton of small vehcle targets n sar mages, Proc. SPIE, vol.4726, no.1, pp.35 45, [17] R.A. Wesenseel, W.C. Karl, D.A. Castanon, G.J. Power, and P. Douvlle, Markov random feld segmentaton methods for sar target chps, Proc. SPIE, vol.3721, pp , [18] G.J. Power and R.A. Wesenseel, Atr subsystem performance measures usng manual segmentaton of sar target chps, Algorthms for Synthetc Aperture Radar Imagery VI, vol.3721, pp , [19] K. Smonyan and A. Zsserman, Very deep convolutonal networks for large-scale mage recognton, arxv preprnt Table A 1 Lst of target chps excluded from testng data. Class Flename Aspect angle ( ) BTR60 HB BTR60 HB BTR60 HB BTR60 HB BTR60 HB arxv: , [20] V. Nar and G.E. Hnton, Rectfed lnear unts mprove restrcted boltzmann machnes, Proceedngs of the 27th nternatonal conference on machne learnng (ICML-10), pp , [21] N. Srvastava, G.E. Hnton, A. Krzhevsky, I. Sutskever, and R. Salakhutdnov, Dropout: a smple way to prevent neural networks from overfttng., Journal of Machne Learnng Research, vol.15, no.1, pp , [22] S. Ioffe and C. Szegedy, Batch normalzaton: Acceleratng deep network tranng by reducng nternal covarate shft, Internatonal Conference on Machne Learnng, pp , [23] V. Dumouln and F. Vsn, A gude to convoluton arthmetc for deep learnng, arxv preprnt arxv: , Appendx 1. Excluded Target Chps from Testng Data Table A 1 shows a lst of target chps excluded from testng data. Fgure A 1 shows nputs (target chps) and outputs (SAR ATR mages) of the VersNet. 2. Confuson Matrx of Testng Data Table A 2 shows a confuson matrx for the mages (target chps) of testng, and Table A 3 shows a confuson matrx for all the pxels of testng. Each column n the confuson matrxes represents the actual target class, and each row represents the target class predcted by the VersNet. 39

6 (a) Inputs (target chps of BTR60 wth target aspect angle from (b) Outputs (SAR ATR mages). 292 to 313 ). Fg. A 1 Excluded target chps from testng data. As for the 2nd-row fve target chps of nputs (a), the nfluence of the radar shadow of a certan other object appears strongly. The nfluence appears also n the outputs (b) of the VersNet. Table A 2 Confuson matrx for mages of testng. True 2S1 BMP2 BRDM2 BTR60 BTR70 D7 T62 T72 ZIL131 ZSU234 Predcted 2S BMP BRDM BTR BTR D T T ZIL ZSU Accuracy (%) Average Overall Table A 3 Confuson matrx for all pxels of testng. True Background 2S1 BMP2 BRDM2 BTR60 BTR70 D7 T62 T72 ZIL131 ZSU234 Front Precson Predcted Background S BMP BRDM BTR BTR D T T ZIL ZSU Front Recall F IoU

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Deep learning is a good steganalysis tool when embedding key is reused for different images, even if there is a cover source-mismatch

Deep learning is a good steganalysis tool when embedding key is reused for different images, even if there is a cover source-mismatch Deep learnng s a good steganalyss tool when embeddng key s reused for dfferent mages, even f there s a cover source-msmatch Lonel PIBRE 2,3, Jérôme PASQUET 2,3, Dno IENCO 2,3, Marc CHAUMONT 1,2,3 (1) Unversty

More information

Audio Event Detection and classification using extended R-FCN Approach. Kaiwu Wang, Liping Yang, Bin Yang

Audio Event Detection and classification using extended R-FCN Approach. Kaiwu Wang, Liping Yang, Bin Yang Audo Event Detecton and classfcaton usng extended R-FCN Approach Kawu Wang, Lpng Yang, Bn Yang Key Laboratory of Optoelectronc Technology and Systems(Chongqng Unversty), Mnstry of Educaton, ChongQng Unversty,

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Comparing Image Representations for Training a Convolutional Neural Network to Classify Gender

Comparing Image Representations for Training a Convolutional Neural Network to Classify Gender 2013 Frst Internatonal Conference on Artfcal Intellgence, Modellng & Smulaton Comparng Image Representatons for Tranng a Convolutonal Neural Network to Classfy Gender Choon-Boon Ng, Yong-Haur Tay, Bok-Mn

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract

More information

ALEXNET FEATURE EXTRACTION AND MULTI-KERNEL LEARNING FOR OBJECT- ORIENTED CLASSIFICATION

ALEXNET FEATURE EXTRACTION AND MULTI-KERNEL LEARNING FOR OBJECT- ORIENTED CLASSIFICATION ALEXNET FEATURE EXTRACTION AND MULTI-KERNEL LEARNING FOR OBJECT- ORIENTED CLASSIFICATION Lng Dng 1, Hongy L 2, *, Changmao Hu 2, We Zhang 2, Shumn Wang 1 1 Insttute of Earthquake Forecastng, Chna Earthquake

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

arxiv: v2 [cs.cv] 9 Apr 2018

arxiv: v2 [cs.cv] 9 Apr 2018 Boundary-senstve Network for Portrat Segmentaton Xanzh Du 1, Xaolong Wang 2, Dawe L 2, Jngwen Zhu 2, Serafettn Tasc 2, Cameron Uprght 2, Stephen Walsh 2, Larry Davs 1 1 Computer Vson Lab, UMIACS, Unversty

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Feature Selection for Target Detection in SAR Images

Feature Selection for Target Detection in SAR Images Feature Selecton for Detecton n SAR Images Br Bhanu, Yngqang Ln and Shqn Wang Center for Research n Intellgent Systems Unversty of Calforna, Rversde, CA 95, USA Abstract A genetc algorthm (GA) approach

More information

PERFORMANCE EVALUATION FOR SCENE MATCHING ALGORITHMS BY SVM

PERFORMANCE EVALUATION FOR SCENE MATCHING ALGORITHMS BY SVM PERFORMACE EVALUAIO FOR SCEE MACHIG ALGORIHMS BY SVM Zhaohu Yang a, b, *, Yngyng Chen a, Shaomng Zhang a a he Research Center of Remote Sensng and Geomatc, ongj Unversty, Shangha 200092, Chna - yzhac@63.com

More information

Dropout: A Simple Way to Prevent Neural Networks from Overfitting

Dropout: A Simple Way to Prevent Neural Networks from Overfitting Journal of Machne Learnng Research 15 (2014) 1929-1958 Submtted 11/13; Publshed 6/14 Dropout: A Smple Way to Prevent Neural Networks from Overfttng Ntsh Srvastava Geoffrey Hnton Alex Krzhevsky Ilya Sutskever

More information

Font Recognition in Natural Images via Transfer Learning

Font Recognition in Natural Images via Transfer Learning Font Recognton n Natural Images va Transfer Learnng Yzh Wang, Zhouhu Lan, Yngmn Tang, and Janguo Xao Insttute of Computer Scence and Technology, Pekng Unversty Abstract. Font recognton s an mportant and

More information

Keywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines

Keywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines (IJCSIS) Internatonal Journal of Computer Scence and Informaton Securty, Herarchcal Web Page Classfcaton Based on a Topc Model and Neghborng Pages Integraton Wongkot Srura Phayung Meesad Choochart Haruechayasak

More information

Shape-adaptive DCT and Its Application in Region-based Image Coding

Shape-adaptive DCT and Its Application in Region-based Image Coding Internatonal Journal of Sgnal Processng, Image Processng and Pattern Recognton, pp.99-108 http://dx.do.org/10.14257/sp.2014.7.1.10 Shape-adaptve DCT and Its Applcaton n Regon-based Image Codng Yamn Zheng,

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Deep GrabCut for Object Selection

Deep GrabCut for Object Selection NING XU, BRIAN PRICE, SCOTT COHEN, JIMEI YANG, THOMAS HUANG: 1 arxv:1707.00243v2 [cs.cv] 14 Jul 2017 Deep GrabCut for Object Selecton Nng Xu 1 nngxu2@llnos.edu Bran Prce 2 bprce@adobe.com Scott Cohen 2

More information

Scale Selective Extended Local Binary Pattern For Texture Classification

Scale Selective Extended Local Binary Pattern For Texture Classification Scale Selectve Extended Local Bnary Pattern For Texture Classfcaton Yutng Hu, Zhlng Long, and Ghassan AlRegb Multmeda & Sensors Lab (MSL) Georga Insttute of Technology 03/09/017 Outlne Texture Representaton

More information

Learning-based License Plate Detection on Edge Features

Learning-based License Plate Detection on Edge Features Learnng-based Lcense Plate Detecton on Edge Features Wng Teng Ho, Woo Hen Yap, Yong Haur Tay Computer Vson and Intellgent Systems (CVIS) Group Unverst Tunku Abdul Rahman, Malaysa wngteng_h@yahoo.com, woohen@yahoo.com,

More information

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS P.G. Demdov Yaroslavl State Unversty Anatoly Ntn, Vladmr Khryashchev, Olga Stepanova, Igor Kostern EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS Yaroslavl, 2015 Eye

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

WIRELESS CAPSULE ENDOSCOPY IMAGE CLASSIFICATION BASED ON VECTOR SPARSE CODING.

WIRELESS CAPSULE ENDOSCOPY IMAGE CLASSIFICATION BASED ON VECTOR SPARSE CODING. WIRELESS CAPSULE ENDOSCOPY IMAGE CLASSIFICATION BASED ON VECTOR SPARSE CODING Tao Ma 1, Yuexan Zou 1 *, Zhqang Xang 1, Le L 1 and Y L 1 ADSPLAB/ELIP, School of ECE, Pekng Unversty, Shenzhen 518055, Chna

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

Multiclass Object Recognition based on Texture Linear Genetic Programming

Multiclass Object Recognition based on Texture Linear Genetic Programming Multclass Object Recognton based on Texture Lnear Genetc Programmng Gustavo Olague 1, Eva Romero 1 Leonardo Trujllo 1, and Br Bhanu 2 1 CICESE, Km. 107 carretera Tjuana-Ensenada, Mexco, olague@ccese.mx,

More information

Research and Application of Fingerprint Recognition Based on MATLAB

Research and Application of Fingerprint Recognition Based on MATLAB Send Orders for Reprnts to reprnts@benthamscence.ae The Open Automaton and Control Systems Journal, 205, 7, 07-07 Open Access Research and Applcaton of Fngerprnt Recognton Based on MATLAB Nng Lu* Department

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Classifier Swarms for Human Detection in Infrared Imagery

Classifier Swarms for Human Detection in Infrared Imagery Classfer Swarms for Human Detecton n Infrared Imagery Yur Owechko, Swarup Medasan, and Narayan Srnvasa HRL Laboratores, LLC 3011 Malbu Canyon Road, Malbu, CA 90265 {owechko, smedasan, nsrnvasa}@hrl.com

More information

Histogram of Template for Pedestrian Detection

Histogram of Template for Pedestrian Detection PAPER IEICE TRANS. FUNDAMENTALS/COMMUN./ELECTRON./INF. & SYST., VOL. E85-A/B/C/D, No. xx JANUARY 20xx Hstogram of Template for Pedestran Detecton Shaopeng Tang, Non Member, Satosh Goto Fellow Summary In

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

SRBIR: Semantic Region Based Image Retrieval by Extracting the Dominant Region and Semantic Learning

SRBIR: Semantic Region Based Image Retrieval by Extracting the Dominant Region and Semantic Learning Journal of Computer Scence 7 (3): 400-408, 2011 ISSN 1549-3636 2011 Scence Publcatons SRBIR: Semantc Regon Based Image Retreval by Extractng the Domnant Regon and Semantc Learnng 1 I. Felc Raam and 2 S.

More information

Implementation of Robust HOG-SVM based Pedestrian Classification

Implementation of Robust HOG-SVM based Pedestrian Classification Implementaton of Robust HOG-SVM based Pedestran Classfcaton Reecha P. Yadav K.K.W.I.E.E.R Nashk Inda Vnuchackravarthy Senthamlarasu and Krshnan Kutty KPIT Technologes Ltd. Pune Inda Sunta P. Ugale K.K.W.I.E.E.R

More information

Face Detection with Deep Learning

Face Detection with Deep Learning Face Detecton wth Deep Learnng Yu Shen Yus122@ucsd.edu A13227146 Kuan-We Chen kuc010@ucsd.edu A99045121 Yzhou Hao y3hao@ucsd.edu A98017773 Mn Hsuan Wu mhwu@ucsd.edu A92424998 Abstract The project here

More information

Face Recognition Based on SVM and 2DPCA

Face Recognition Based on SVM and 2DPCA Vol. 4, o. 3, September, 2011 Face Recognton Based on SVM and 2DPCA Tha Hoang Le, Len Bu Faculty of Informaton Technology, HCMC Unversty of Scence Faculty of Informaton Scences and Engneerng, Unversty

More information

Focal Loss in 3D Object Detection

Focal Loss in 3D Object Detection 1 Focal Loss n 3D Object Detecton eng Yun1 Le Ta2 Yuan Wang2 Chengju Lu3 Mng Lu2 Fg. 1. Upper two rows show projected 3D object detecton results from the detector traned wth bnary cross entropy. Lower

More information

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

arxiv: v1 [cs.sd] 22 Dec 2017

arxiv: v1 [cs.sd] 22 Dec 2017 Musc Genre Classfcaton wth Parallelng Recurrent Convolutonal Neural Network arxv:1712.08370v1 [cs.sd] 22 Dec 2017 Ln Feng, Shenlan Lu, Janng Yao December 2017 Abstract Deep learnng has been demonstrated

More information

Writer Identification using a Deep Neural Network

Writer Identification using a Deep Neural Network Wrter Identfcaton usng a Deep Neural Network Jun Chu and Sargur Srhar Department of Computer Scence and Engneerng Unversty at Buffalo, The State Unversty of New York Buffalo, NY 1469, USA {jchu6, srhar}@buffalo.edu

More information

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,

More information

Learning a Class-Specific Dictionary for Facial Expression Recognition

Learning a Class-Specific Dictionary for Facial Expression Recognition BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 16, No 4 Sofa 016 Prnt ISSN: 1311-970; Onlne ISSN: 1314-4081 DOI: 10.1515/cat-016-0067 Learnng a Class-Specfc Dctonary for

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Deep Spatial-Temporal Joint Feature Representation for Video Object Detection

Deep Spatial-Temporal Joint Feature Representation for Video Object Detection sensors Artcle Deep Spatal-Temporal Jont Feature Representaton for Vdeo Object Detecton Baojun Zhao 1,2, Boya Zhao 1,2 ID, Lnbo Tang 1,2, *, Yuq Han 1,2 and Wenzheng Wang 1,2 1 School of Informaton and

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Unsupervised object segmentation in video by efficient selection of highly probable positive features

Unsupervised object segmentation in video by efficient selection of highly probable positive features Unsupervsed object segmentaton n vdeo by effcent selecton of hghly probable postve features Emanuela Haller 1,2 and Marus Leordeanu 1,2 1 Unversty Poltehnca of Bucharest, Romana 2 Insttute of Mathematcs

More information

Combining the Best of Graphical Models and ConvNets for Semantic Segmentation

Combining the Best of Graphical Models and ConvNets for Semantic Segmentation Combnng the Best of Graphcal Models and ConvNets for Semantc Segmentaton Mchael Cogswell Vrgna Tech cogswell@vt.edu Xao Ln Vrgna Tech lnxao@vt.edu Senthl Purushwalam Vrgna Tech senthl@vt.edu Dhruv Batra

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Research of Image Recognition Algorithm Based on Depth Learning

Research of Image Recognition Algorithm Based on Depth Learning 208 4th World Conference on Control, Electroncs and Computer Engneerng (WCCECE 208) Research of Image Recognton Algorthm Based on Depth Learnng Zhang Jan, J Xnhao Zhejang Busness College, Hangzhou, Chna,

More information

Learning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris

Learning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris Learnng Ensemble of Local PDM-based Regressons Yen Le Computatonal Bomedcne Lab Advsor: Prof. Ioanns A. Kakadars 1 Problem statement Fttng a statstcal shape model (PDM) for mage segmentaton Callosum segmentaton

More information

Data Mining: Model Evaluation

Data Mining: Model Evaluation Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Comparing heterogeneous entities using artificial neural networks of trainable weighted structural components and machine-learned activation functions

Comparing heterogeneous entities using artificial neural networks of trainable weighted structural components and machine-learned activation functions Comparng heterogeneous enttes usng artfcal neural networks of tranable weghted structural components and machne-learned actvaton functons * Artt Wangperawong *, Kettp Krangchavech, Austn Lanar, Supu Lam,

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng, Natonal Unversty of Sngapore {shva, phanquyt, tancl }@comp.nus.edu.sg

More information

INTELLECT SENSING OF NEURAL NETWORK THAT TRAINED TO CLASSIFY COMPLEX SIGNALS. Reznik A. Galinskaya A.

INTELLECT SENSING OF NEURAL NETWORK THAT TRAINED TO CLASSIFY COMPLEX SIGNALS. Reznik A. Galinskaya A. Internatonal Journal "Informaton heores & Applcatons" Vol.10 173 INELLEC SENSING OF NEURAL NEWORK HA RAINED O CLASSIFY COMPLEX SIGNALS Reznk A. Galnskaya A. Abstract: An expermental comparson of nformaton

More information

3D Backbone Network for 3D Object Detection

3D Backbone Network for 3D Object Detection 3D Backbone Network for 3D Object Detecton Xuesong L 1, Jose E Guvant 1, Ngamng Kwok 1, Yongzh Xu 2 1 School of Mechancal and Manufacturng Engneerng, Unversty of New South Wales, NSW 2052, Australa 2 School

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection 2009 10th Internatonal Conference on Document Analyss and Recognton A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng,

More information

SEMANTIC image segmentation has been one of the most

SEMANTIC image segmentation has been one of the most 1 Stacked Deconvolutonal Network for Semantc Segmentaton Jun Fu, Jng Lu, Member, IEEE, Yuhang Wang, and Hanqng Lu, Senor Member, IEEE arxv:1708.04943v1 [cs.cv] 16 Aug 2017 Abstract Recent progress n semantc

More information

Convolutional Neural Network- based Human Recognition for Vision Occupancy Sensors

Convolutional Neural Network- based Human Recognition for Vision Occupancy Sensors 10 Int'l Conf. IP, Comp. Vson, and Pattern Recognton IPCV'18 Convolutonal Neural Network- based Human Recognton for Vson Occupancy Sensors Seung Soo Lee and Manbae Km * Dept. of Computer and Communcatons

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Fusion of Deep Features and Weighted VLAD Vectors based on Multiple Features for Image Retrieval

Fusion of Deep Features and Weighted VLAD Vectors based on Multiple Features for Image Retrieval MATEC Web of Conferences, 0500 (07) DTS-07 DO: 005/matecconf/070500 Fuson of Deep Features and Weghted VLAD Vectors based on Multple Features for mage Retreval Yanhong Wang,, Ygang Cen,, Lequan Lang,*,

More information

Extraction of Human Activities as Action Sequences using plsa and PrefixSpan

Extraction of Human Activities as Action Sequences using plsa and PrefixSpan Extracton of Human Actvtes as Acton Sequences usng plsa and PrefxSpan Takuya TONARU Tetsuya TAKIGUCHI Yasuo ARIKI Graduate School of Engneerng, Kobe Unversty Organzaton of Advanced Scence and Technology,

More information

Lecture 13: High-dimensional Images

Lecture 13: High-dimensional Images Lec : Hgh-dmensonal Images Grayscale Images Lecture : Hgh-dmensonal Images Math 90 Prof. Todd Wttman The Ctadel A grayscale mage s an nteger-valued D matrx. An 8-bt mage takes on values between 0 and 55.

More information

Competitive Sparse Representation Classification for Face Recognition

Competitive Sparse Representation Classification for Face Recognition Vol. 6, No. 8, 05 Compettve Sparse Representaton Classfcaton for Face Recognton Yng Lu Chongqng Key Laboratory of Computatonal Intellgence Chongqng Unversty of Posts and elecommuncatons Chongqng, Chna

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

A New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1

A New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1 A New Feature of Unformty of Image Texture Drectons Concdng wth the Human Eyes Percepton Xng-Jan He, De-Shuang Huang, Yue Zhang, Tat-Mng Lo 2, and Mchael R. Lyu 3 Intellgent Computng Lab, Insttute of Intellgent

More information

Human Action Recognition Using Dynamic Time Warping Algorithm and Reproducing Kernel Hilbert Space for Matrix Manifold

Human Action Recognition Using Dynamic Time Warping Algorithm and Reproducing Kernel Hilbert Space for Matrix Manifold IJCTA, 10(07), 2017, pp 79-85 Internatonal Scence Press Closed Loop Control of Soft Swtched Forward Converter Usng Intellgent Controller 79 Human Acton Recognton Usng Dynamc Tme Warpng Algorthm and Reproducng

More information

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents Supplementary Materal DESIRE: Dstant Future Predcton n Dynamc Scenes wth Interactng Agents Namhoon Lee 1, Wongun Cho 2, Paul Vernaza 2, Chrstopher B. Choy 3, Phlp H. S. Torr 1, Manmohan Chandraker 2,4

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Pictures at an Exhibition

Pictures at an Exhibition 1 Pctures at an Exhbton Stephane Kwan and Karen Zhu Department of Electrcal Engneerng Stanford Unversty, Stanford, CA 9405 Emal: {skwan1, kyzhu}@stanford.edu Abstract An mage processng algorthm s desgned

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information

Robust Dictionary Learning with Capped l 1 -Norm

Robust Dictionary Learning with Capped l 1 -Norm Proceedngs of the Twenty-Fourth Internatonal Jont Conference on Artfcal Intellgence (IJCAI 205) Robust Dctonary Learnng wth Capped l -Norm Wenhao Jang, Fepng Ne, Heng Huang Unversty of Texas at Arlngton

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation GeoNet: Geometrc Neural Network for Jont Depth and Surface Normal Estmaton Xaojuan Q Renje Lao, Zhengzhe Lu Raquel Urtasun, Jaya Ja, The Chnese Unversty of Hong Kong Unversty of Toronto Uber Advanced Technologes

More information

Extraction of Fuzzy Rules from Trained Neural Network Using Evolutionary Algorithm *

Extraction of Fuzzy Rules from Trained Neural Network Using Evolutionary Algorithm * Extracton of Fuzzy Rules from Traned Neural Network Usng Evolutonary Algorthm * Urszula Markowska-Kaczmar, Wojcech Trelak Wrocław Unversty of Technology, Poland kaczmar@c.pwr.wroc.pl, trelak@c.pwr.wroc.pl

More information

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay,

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay, Vanshng Hull Jnhu Hu Suya You Ulrch Neumann Unversty of Southern Calforna {jnhuhusuyay uneumann}@graphcs.usc.edu Abstract Vanshng ponts are valuable n many vson tasks such as orentaton estmaton pose recovery

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Invariant Shape Object Recognition Using B-Spline, Cardinal Spline, and Genetic Algorithm

Invariant Shape Object Recognition Using B-Spline, Cardinal Spline, and Genetic Algorithm Proceedngs of the 5th WSEAS Int. Conf. on Sgnal Processng, Robotcs and Automaton, Madrd, Span, February 5-7, 6 (pp4-45) Invarant Shape Obect Recognton Usng B-Splne, Cardnal Splne, and Genetc Algorthm PISIT

More information

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features A Probablstc Approach to Detect Urban Regons from Remotely Sensed Images Based on Combnaton of Local Features Berl Sırmaçek German Aerospace Center (DLR) Remote Sensng Technology Insttute Weßlng, 82234,

More information

2-Dimensional Image Representation. Using Beta-Spline

2-Dimensional Image Representation. Using Beta-Spline Appled Mathematcal cences, Vol. 7, 03, no. 9, 4559-4569 HIKARI Ltd, www.m-hkar.com http://dx.do.org/0.988/ams.03.3359 -Dmensonal Image Representaton Usng Beta-plne Norm Abdul Had Faculty of Computer and

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

Novel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition

Novel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition Mathematcal Methods for Informaton Scence and Economcs Novel Pattern-based Fngerprnt Recognton Technque Usng D Wavelet Decomposton TUDOR BARBU Insttute of Computer Scence of the Romanan Academy T. Codrescu,,

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY SSDH: Sem-supervsed Deep Hashng for Large Scale Image Retreval Jan Zhang, and Yuxn Peng arxv:607.08477v2 [cs.cv] 8 Jun 207 Abstract Hashng

More information

Accurate Overlay Text Extraction for Digital Video Analysis

Accurate Overlay Text Extraction for Digital Video Analysis Accurate Overlay Text Extracton for Dgtal Vdeo Analyss Dongqng Zhang, and Shh-Fu Chang Electrcal Engneerng Department, Columba Unversty, New York, NY 10027. (Emal: dqzhang, sfchang@ee.columba.edu) Abstract

More information

Modeling Inter-cluster and Intra-cluster Discrimination Among Triphones

Modeling Inter-cluster and Intra-cluster Discrimination Among Triphones Modelng Inter-cluster and Intra-cluster Dscrmnaton Among Trphones Tom Ko, Bran Mak and Dongpeng Chen Department of Computer Scence and Engneerng The Hong Kong Unversty of Scence and Technology Clear Water

More information