Universität Augsburg. Institut für Informatik. PLSA on Large Scale Image Databases. Rainer Lienhart and Malcolm Slaney.
|
|
- Kellie Payne
- 5 years ago
- Views:
Transcription
1 Unverstät Augsburg à ÊÇÅÍÆ ËÀǼ PLSA on Large Scale Image Databases Raner Lenhart and Malcolm Slaney Report Dezember 2006 Insttut für Informat D Augsburg
2 Copyrght c Raner Lenhart and Malcolm Slaney Insttut für Informat Unverstät Augsburg D Augsburg, Germany all rghts reserved
3 PLSA O LARGE SCALE IMAGE DATABASES Raner Lenhart Multmeda Computng Lab Unversty of Augsburg Augsburg, Germany ABSTRACT The web and mage repostores such as Fcr are the largest mage databases n the world. There are bllons of mages on the web, and hundreds of mllon hgh-qualty mages n mage repostores. Currently, these mages are ndexed based on manually-entered tags and ndvdual and group usage patterns. In ths wor we explore a thrd nformaton dmenson: mage features. We explore probablstc latent semantc analyss (plsa) n order to nfer whch vsual patterns descrbe each obect. We buld models that connect words and mage features, and use content features and tags to fnd smlar mages. We demonstrate that mage features usng gray-scale salent ponts and an aspect model based on plsa outperforms a conventonal word-frequency model as well as refned color-hstrogram approach on an mage-smlarty tas. Index Terms large scale mage retreval, probablstc semantc analyss, color coherence vectors. 1. ITRODUCTIO The usage of Probablstc Latent Semantc Analyss (plsa) [4] a statstcal technque to derve hdden concepts from data has recently become very popular n the mage doman. So far, plsa has only been appled to relatvely small, carefully selected mage databases rangng from a few hundred to a few thousand mages [2][8]. In ths paper we study plsa on a large-scale, real-world mage database for mprovng mage retreval based on mage smlarty as perceved by humans. Our wor centers around fndng vsual words that are typcal for the varous nds of aspects an mage can show. One of the largest mage repostores on the web s Flcr. For ths wor we have download 253,460 mages that were tagged wth at least one out of the 23 tags lsted n Table 1 1. These words where grouped nto 12 categores for our mage-retreval tas. The resultng mage database was not 1 These mages were selected from all publc Flcr mages uploaded pror to 8 Sep and labeled wth one of the followng tags: sanfrancsco, beach, toyo and geotagged. Malcolm Slaney Yahoo! Research Santa Clara, CA USA cleaned nor pre-processed n any way to ncrease consstency. Snce these tags are provded by the creators of the pctures wth unnown ntensons, the technques we nvestgate must be able to tolerate from a pure vsual smlarty standpont a sgnfcant fracton of ncorrect labels. A good example, for nstance, are the mages tagged Chrstmas n Flcr. Only a very small fracton of the mages depct a relgous event (as one mght expect). Instead the tag mostly denotes the tme and date of creaton. Thus thousands of vacaton and party photos pop up wth no real common theme. The ambguty of tags maes mage retreval more dffcult. On ths real-world database we explore two questons: (1) Does t matters how vsual words are created? We compare three dfferent technques: (a) random selecton, (b) clusterng random subsets, and (c) clusterng tagbased subsets. (2) Does plsa outperform a smple word-occurrence statstc? How does plsa on grayscale SIFT [5] features compare to well-nown global color-retreval technques such as color-coherence vectors [9]? Category # OR lst of tags # of mage 1 wldlfe anmal anmals cat cats dog dogs brd brds flower flowers grafft sgn sgns surf surfng nght food buldng buldngs goldengate goldengatebrdge baseball Total # of Images (ote mages may have multple tags) 253,460 Table 1: The mage database and ts 12 categores
4 We evaluate these dfferent retreval confguratons purely based on mage smlarty as perceved by a number of users wthout any specal context nowledge. 2. DERIVIG VISUAL WORDS plsa was orgnally derved n the context of document retreval, where words are the elementary parts of a document. For mages our vsual documents we need comparable elementary parts we call vsual words. In ths wor we use the popular SIFT features [5] to fnd salent vsual parts n each mage. SIFT features are calculated n a two-step process: Frst, a sparse set of salent areas n an mage are determned and descrbed by poston, scale, and orentaton. Then for each salent pont we derve a 128-dmensonal edge-based feature vector to descrbe the unque grayscale content of that salent area n a scale- and orentaton-nvarant manner. Snce SIFT feature vectors can tae on almost every value n 128!, we wsh to fnd a small set of representatve feature vectors to become our vsual words. Thus the problem of dervng vsual words s as follows: Gven a set of mages I={d } wth I = I = # of mages a set of feature F={f l } wth F = F = # of features (here 128-dm. SIFT features) derved from I mages a set C = {c r } of mage categores (see Table 1) wth C = C categores n total (here C =12) derve a vocabulary V = {v } of V = V vsual words. Fndng the structure n such a large set of data (mllons of mages, thousands of salent ponts per mage) s computatonally expensve. We nvestgate three ways to determne the V vsual words and we wll evaluate ther utlty later n ths paper: (v1) Random: Select all V sample features randomly from the set F of all features. (v2) K-means clusterng (wth subselecton): Randomly select S sample features from the set F of all features. Apply K-means clusterng to each set of S samples to derve (V /C ) vsual words. Perform ths subselecton C tmes. In total ths wll result n C * (V /C ) = V vsual words. (v3) Tag subselecton: For each of the C categores derve (V /C ) vsual words by means of K-means clusterng by randomly samplng S sample features from mages n each category only. In total ths wll result n C * (V /C ) = V vsual words. Method (v2) s the approach commonly used n mage retreval [2][8]. Snce K-means clusterng s computatonally expensve (quadratc n the number of samples and the number of clusters), t s more effcent to brea up (C * S ) samples nto C subsets of S samples and fnd V /C clusters from ths subset nstead of determnng all V clusters on the entre set of (C * S ) samples drectly. For our 12 categores (C = 12) the speedup s C * C = 144 tmes. The ratonal behnd method (v3) s to explore whether the tags n the database provde useful nformaton for dervng vsual words. Wthn each category the mages should be less dverse and thus mae t easer for K-means clusterng to fnd the domnant vsual words. The better the vsual words, the better plsa should wor and thus mprove retreval. Concepts that have no representatve vsual words cannot be learned. The random method (v1) s added to answer the queston whether K-means clusterng s necessary at all. The answer to ths queston has a few mportant mplcatons: Frstly, clusterng s often the slowest part of the learnng algorthm. If t can be spped wthout harm, t would greatly reduce the computatonal complexty. Secondly, f clusterng s not necessary, the set of vsual vocabulary can easly be extended any tme needed by addtonal random samples. In each experment we derved 12 * 200 = 2400 vsual words that are used to descrbe each mage n our database. In Secton 4 we wll compare these three methods based on ther smlarty retreval results n user studes. 3. MEASURIG IMAGE SIMILARITY 3.1 Term-Document Matrx Usng the vsual vocabulary V, each feature f l of F can be quantzed by ts most smlar feature vector n V. Thus we represent each mage d as an mage document consstng of L nstances of the vsual words {w 1,,w L }, w p V. Gven the collecton of I mage documents I={d } wth F vsual words W = {w } from the vocabulary V and gven that we gnore the sequental orderng of the word occurrences n the mages (the so-called bag-of-word model), the mage data can be summarzed by an I! V matrx of vsual word occurrence counts = (n(d,v )), where n(d,v ) specfes the number of tmes the word v occurred n document d. The resultng table s called the termdocument matrx (see Fgure 1). ote by normalzng each document vector to 1 usng the L1-norm, the document vector of d becomes the estmated mass probablty dstrbuton v d ). The smlarty between two documents can be calculated usng the cosne metrc between two document vectors a=d
5 and b=d p. The cosne metrc between to vector a and b s defned as < a, b > CSMetrc( a, b) = a! b It s commonly used n text retreval [1]. 3.2 plsa Fgure 1: Term-document matrx Each L1-normalzed row n the term-document matrx descrbes the dstrbuton of the vsual words n each document,.e., v d ). The dea of plsa s to ntroduce a medator nown as aspects or concepts between the document and the words. Thus, every word occurrng n a document s generated by an unobservable aspect varable z leadng to the followng generatve model for the document vector [4]: (1) Pc a document d wth pror probablty d ) (2) Select a latent concept z wth probablty z d ) (3) Generate a word v wth probablty v z ) An mportant aspect of ths model s that word occurrences are condtonally ndependent from the document gven the unobservable aspects. Thus K! = 1 v d ) = d ) v z ) z d ). In addton, every document s modeled as consstng of one or more aspects. Ths s very natural snce mages consst of multple obects and thus multple aspects n dfferent mage areas. plsa can model ths fact very effcently. For nstance, an mage wth a lon and eep each obect descrbed by a set of SIFT features mght be descrbed by two hdden aspects lon and eep. Dependent on the aspects the probabltes of each vsual word v s dfferent. We learn the unobservable probablty dstrbutons z d ) and v z ) from the data usng the Expectaton- Maxmzaton-Algorthm (EM-Algorthm) [3][4]: E-Step: M-Step: z d, v v z ) z d ) = ) = K! l = 1! = 1!! = M v z ) z d ) m= 1 = 1 M! (, = n d 1 l v z ) z d ) n( d, v ) z d, v ) l n( d, v ) z d, v ) v ) z n( d ) d, v ) Gven a new test mage d test, we estmate the aspect probabltes, smlar to above, from the observed words. The only dfference s that the learned condtonal word dstrbutons v z ) are never updated. The smlarty between two documents s calculated usng the cosne metrc between two the two aspect vectors a= (z d )) and b= (z d p )). We model the collecton of vsual words wth 48 aspects n total analogous to a 48- mxture Gaussan mxture model. 3.3 Color Coherence Vectors As a baselne for comparson we use one of the best tradtonal global color features: Color Coherence Vectors (CCVs) [9]. It s computed by frst quantzng each pxel s color by usng the 2 most sgnfcant bts per color channel, resultng n only 64 possble dfferent color values. Then, for each pxel we measure the area connected (wth an 8- neghborhood) of the same quantzed color. If the area s above a threshold (usually 1% of the pxel count n the mage), then the pxel s added to the coherent hstogram, otherwse to the ncoherent color hstogram. Combnng both hstograms results n a 128-dmensonal vector. Dssmlarty between two CCV vectors a and b s computed based on the L1-norm. 4. EXPERIMETAL RESULTS Performance Metrc: For evaluaton we selected randomly 5 query mages from each of our 12 categores,.e., 60 query mages n total. Then, for each query mage each retreval technque was used to return the top 20 most smlar mages. In each of the three experments below, three rval technques were compared based on the udgments of a number of users: For each query, the retreval results (top 20 mages, tled 5 by 4 on a sheet of paper) for the three technques under comparson were shown to the user. The user had to put the results from each query mage nto an order from best to worst retreval result. The technque wth the best retreval result receved 2 ponts, the second best 1 pont, and the worst 0 ponts. We computed the average score for each technque over the 60 samples queres to assgn a sngle performance number. The technque wth the hghest score obvously performs best.
6 (a) (b) (c) Fgure 2: Results from the three experments: a) cosne smlarty on word-hstogram feature, b) cosne smlarty on plsa, and c) comparng best cosne methods wth CCV baselne algorthm. Exp. 1: In ths experment we compared the three vsual word extracton technques (v1), (v2), and (v3) aganst each other by usng the document vectors from the termdocument matrx wth the cosne metrc for smlarty retreval. Fgure 2a shows the average scores for 8 dfferent subects. Exp. 2: In ths experment we compared the three vsual word extracton technques (v1), (v2), and (v3) aganst each other by usng the aspect vectors of the mage documents wth the cosne metrc for smlarty retreval. Fgure 2b shows the average scores for 8 dfferent subects. In both of these experments dervng vsual words usng plan clusterng produced the best results. Selectng vsual words completely at random s computatonally cheap, and should wor well asymptotcally, but not evdently at ths level. We are surprsed that dervng specfc vsual words based on category subsets dd not produce an overall beneft, but an nformal analyss suggests that these category-specfc words helped for a category le dogs. Exp. 3: In our fnal experment we compared the randomsubset vsual word selecton approaches that won from Exp.1 (cosne metrc of word hstograms) and Exp. 2 (cosne metrc of plsa hstograms) to a baselne usng CCV features. Ths test s dffcult for subects because n such a large database the matches n a color space are at frst glance dentcal to the query. It s only when the pcture s studed does one realzes that the obects shown are so dfferent. Ths s especally true when we loo color smlarty wth our full 2.5M mage database. The results of ths test are shown n Fgure 2c. Seven subects udged that mages found by usng a cosne metrc n plsa space are more smlar to the query mage than a drect comparson n word space, or the baselne CCV approach. Much le t does n text-based retreval, calculatng smlarty n subspace formed by the aspect model gves better results. 5. COCLUSIO In ths paper we have shown that the aspect model, usng an approach le plsa, s as mportant for mage-retreval as t s for text-retreval [1]. The aspect model learns the probablty of each vsual word gven an unobserved aspect. We have extended Bosch s wor [2] by showng that plsa mproves performance on a smlarty tas. The dmensonalty reducton due to an aspect model s mportant as we go to larger databases. In future wor we want to verfy our results wth a larger number of subects, and we want to test the smlarty on the full 2.5M mage database. References [1] Rcardo Baeza-Yates, Berther Rbero-eto. Modern Informaton Retreval. Addson-Wesley, [2] A. Bosch, A. Zsserman and X. Munoz. Scene Classfcaton va plsa. Proceedngs of the European Conference on Computer Vson (2006). [3] Dempster, A. P.,. M. Lard, and D. B. Rubn. Maxmum Lelhood from Incomplete Data va the EM Algorthm. Journal of the Loyal Statstcal Socety,B.39, [4] Thomas Hoffmann. Unsupervsed Learnng by Probablstc Latent Semantc Analyss. Machne Learnng, Vol. 42, Issue 1 2, pp , [5] D. Lowe. Dstnctve mage features from scale nvarant eyponts. In IJCV 60(2):91 110, [6] K. Molaczy, T. Tuytelaars, C. Schmd, A. Zsserman, J. Matas, F. Schaffaltzy, T. Kadr and L. Van Gool. A comparson of affne regon detectors. In IJCV 65(1/2):43 72, [7] K. Molaczy, C. Schmd. A performance evaluaton of local descrptors. In PAMI 27(10): [8] P. Quelhas, F. Monay, J.-M Odobez, D. Gatca-Perez, T. Tuytelaars, L. Van Gool. Modelng scenes wth local descrptors and latent aspects. ICCV 2005, Vol. 1, pp , Oct [9] G. Pass, R. Zabh, and J. Mller. Comparng Images Usng Color Coherence Vectors. In Proc. of the 4th ACM Int. Conf. on Multmeda, Boston, MA, pages 65 73, 1996.
A Fast Visual Tracking Algorithm Based on Circle Pixels Matching
A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng
More informationContent Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers
IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth
More informationSubspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;
Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features
More informationA Fast Content-Based Multimedia Retrieval Technique Using Compressed Data
A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,
More informationClassifier Selection Based on Data Complexity Measures *
Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.
More informationMachine Learning: Algorithms and Applications
14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of
More informationCS 534: Computer Vision Model Fitting
CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust
More informationSemantic Image Retrieval Using Region Based Inverted File
Semantc Image Retreval Usng Regon Based Inverted Fle Dengsheng Zhang, Md Monrul Islam, Guoun Lu and Jn Hou 2 Gppsland School of Informaton Technology, Monash Unversty Churchll, VIC 3842, Australa E-mal:
More informationOutline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1
4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:
More informationFEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur
FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents
More informationImage Alignment CSC 767
Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances
More informationPerformance Evaluation of Information Retrieval Systems
Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence
More informationMULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION
MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and
More informationCourse Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms
Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques
More informationProblem Definitions and Evaluation Criteria for Computational Expensive Optimization
Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty
More informationA new Unsupervised Clustering-based Feature Extraction Method
A new Unsupervsed Clusterng-based Feature Extracton Method Sabra El Ferchch ACS Natonal School of Engneerng at Tuns, Tunsa Salah Zd AGIS lle Unversty of Scence and Technology, France Kaouther aabd ACS
More informationA Binarization Algorithm specialized on Document Images and Photos
A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a
More informationKeywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines
(IJCSIS) Internatonal Journal of Computer Scence and Informaton Securty, Herarchcal Web Page Classfcaton Based on a Topc Model and Neghborng Pages Integraton Wongkot Srura Phayung Meesad Choochart Haruechayasak
More informationFuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval
Fuzzy -Means Intalzed by Fxed Threshold lusterng for Improvng Image Retreval NAWARA HANSIRI, SIRIPORN SUPRATID,HOM KIMPAN 3 Faculty of Informaton Technology Rangst Unversty Muang-Ake, Paholyotn Road, Patumtan,
More informationUnsupervised Learning and Clustering
Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned
More informationMachine Learning. Topic 6: Clustering
Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess
More informationA PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION
1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute
More informationApplying EM Algorithm for Segmentation of Textured Images
Proceedngs of the World Congress on Engneerng 2007 Vol I Applyng EM Algorthm for Segmentaton of Textured Images Dr. K Revathy, Dept. of Computer Scence, Unversty of Kerala, Inda Roshn V. S., ER&DCI Insttute
More informationParallelism for Nested Loops with Non-uniform and Flow Dependences
Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr
More informationSearching Large Image Databases using Color Information
Searchng Large Image Databases usng Color Informaton Ioan Racu CMSC350: Artfcal Intellgence Wnter Quarter 2004 Department of Computer Scence Unversty of Chcago racu@cs.uchcago.edu ABSTRACT The goal of
More informationIntelligent Information Acquisition for Improved Clustering
Intellgent Informaton Acquston for Improved Clusterng Duy Vu Unversty of Texas at Austn duyvu@cs.utexas.edu Mkhal Blenko Mcrosoft Research mblenko@mcrosoft.com Prem Melvlle IBM T.J. Watson Research Center
More informationFeature Reduction and Selection
Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components
More informationDetermining the Optimal Bandwidth Based on Multi-criterion Fusion
Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn
More informationCOMPLEX WAVELET TRANSFORM-BASED COLOR INDEXING FOR CONTENT-BASED IMAGE RETRIEVAL
COMPLEX WAVELET TRANSFORM-BASED COLOR INDEXING FOR CONTENT-BASED IMAGE RETRIEVAL Nader Safavan and Shohreh Kasae Department of Computer Engneerng Sharf Unversty of Technology Tehran, Iran skasae@sharf.edu
More informationTerm Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task
Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto
More informationAPPLIED MACHINE LEARNING
Methods for Clusterng K-means, Soft K-means DBSCAN 1 Objectves Learn basc technques for data clusterng K-means and soft K-means, GMM (next lecture) DBSCAN Understand the ssues and major challenges n clusterng
More informationSupport Vector Machines
Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned
More informationCollaboratively Regularized Nearest Points for Set Based Recognition
Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,
More informationQuery Clustering Using a Hybrid Query Similarity Measure
Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan
More informationCluster Analysis of Electrical Behavior
Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School
More informationAn Optimal Algorithm for Prufer Codes *
J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,
More informationComplex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.
Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal
More informationClassifying Acoustic Transient Signals Using Artificial Intelligence
Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)
More informationTsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance
Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for
More informationBioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.
[Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented
More informationTN348: Openlab Module - Colocalization
TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages
More informationA Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems
A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty
More informationHigh Dimensional Data Clustering
Hgh Dmensonal Data Clusterng Charles Bouveyron 1,2, Stéphane Grard 1, and Cordela Schmd 2 1 LMC-IMAG, BP 53, Unversté Grenoble 1, 38041 Grenoble Cede 9, France charles.bouveyron@mag.fr, stephane.grard@mag.fr
More informationMULTISPECTRAL REMOTE SENSING IMAGE CLASSIFICATION WITH MULTIPLE FEATURES
MULISPECRAL REMOE SESIG IMAGE CLASSIFICAIO WIH MULIPLE FEAURES QIA YI, PIG GUO, Image Processng and Pattern Recognton Laboratory, Bejng ormal Unversty, Bejng 00875, Chna School of Computer Scence and echnology,
More informationIMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH
IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH Jyot Joglekar a, *, Shrsh S. Gedam b a CSRE, IIT Bombay, Doctoral Student, Mumba, Inda jyotj@tb.ac.n b Centre of Studes n Resources Engneerng,
More informationImprovement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration
Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,
More informationDetection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature
Detecton of hand graspng an object from complex background based on machne learnng co-occurrence of local mage feature Shnya Moroka, Yasuhro Hramoto, Nobutaka Shmada, Tadash Matsuo, Yoshak Shra Rtsumekan
More information1. Introduction. Abstract
Image Retreval Usng a Herarchy of Clusters Danela Stan & Ishwar K. Seth Intellgent Informaton Engneerng Laboratory, Department of Computer Scence & Engneerng, Oaland Unversty, Rochester, Mchgan 48309-4478
More informationLearning-Based Top-N Selection Query Evaluation over Relational Databases
Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **
More informationHelsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)
Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute
More informationMachine Learning 9. week
Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below
More informationAnalysis of Continuous Beams in General
Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,
More informationKIDS Lab at ImageCLEF 2012 Personal Photo Retrieval
KD Lab at mageclef 2012 Personal Photo Retreval Cha-We Ku, Been-Chan Chen, Guan-Bn Chen, L-J Gaou, Rong-ng Huang, and ao-en Wang Knowledge, nformaton, and Database ystem Laboratory Department of Computer
More informationSkew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach
Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research
More informationCSCI 5417 Information Retrieval Systems Jim Martin!
CSCI 5417 Informaton Retreval Systems Jm Martn! Lecture 11 9/29/2011 Today 9/29 Classfcaton Naïve Bayes classfcaton Ungram LM 1 Where we are... Bascs of ad hoc retreval Indexng Term weghtng/scorng Cosne
More informationA Robust Method for Estimating the Fundamental Matrix
Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.
More informationCompiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz
Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster
More informationFederated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks
Federated Search of Text-Based Dgtal Lbrares n Herarchcal Peer-to-Peer Networks Je Lu School of Computer Scence Carnege Mellon Unversty Pttsburgh, PA 15213 jelu@cs.cmu.edu Jame Callan School of Computer
More informationVISUAL SELECTION OF SURFACE FEATURES DURING THEIR GEOMETRIC SIMULATION WITH THE HELP OF COMPUTER TECHNOLOGIES
UbCC 2011, Volume 6, 5002981-x manuscrpts OPEN ACCES UbCC Journal ISSN 1992-8424 www.ubcc.org VISUAL SELECTION OF SURFACE FEATURES DURING THEIR GEOMETRIC SIMULATION WITH THE HELP OF COMPUTER TECHNOLOGIES
More informationR s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes
SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges
More informationPrivate Information Retrieval (PIR)
2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market
More informationA new segmentation algorithm for medical volume image based on K-means clustering
Avalable onlne www.jocpr.com Journal of Chemcal and harmaceutcal Research, 2013, 5(12):113-117 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCRC5 A new segmentaton algorthm for medcal volume mage based
More information12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification
Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero
More informationImproving Web Image Search using Meta Re-rankers
VOLUME-1, ISSUE-V (Aug-Sep 2013) IS NOW AVAILABLE AT: www.dcst.com Improvng Web Image Search usng Meta Re-rankers B.Kavtha 1, N. Suata 2 1 Department of Computer Scence and Engneerng, Chtanya Bharath Insttute
More informationModule Management Tool in Software Development Organizations
Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,
More informationOutline. Type of Machine Learning. Examples of Application. Unsupervised Learning
Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton
More informationTOPOGRAPHIC OBJECT RECOGNITION THROUGH SHAPE
TOOGAHIC OBJECT ECOGNITION THOUGH SHAE Laura Keyes and Adam Wnstanley Techncal eport Submtted to Ordnance Survey, Southampton March 00 Department of Computer Scence Natonal Unversty of Ireland, Maynooth
More informationUser Authentication Based On Behavioral Mouse Dynamics Biometrics
User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA
More informationBackground Removal in Image indexing and Retrieval
Background Removal n Image ndexng and Retreval Y Lu and Hong Guo Department of Electrcal and Computer Engneerng The Unversty of Mchgan-Dearborn Dearborn Mchgan 4818-1491, U.S.A. Voce: 313-593-508, Fax:
More information3D Metric Reconstruction with Auto Calibration Method CS 283 Final Project Tarik Adnan Moon
3D Metrc Reconstructon wth Auto Calbraton Method CS 283 Fnal Project Tark Adnan Moon tmoon@collge.harvard.edu Abstract In ths paper, dfferent methods for auto camera calbraton have been studed for metrc
More informationFace Recognition using 3D Directional Corner Points
2014 22nd Internatonal Conference on Pattern Recognton Face Recognton usng 3D Drectonal Corner Ponts Xun Yu, Yongsheng Gao School of Engneerng Grffth Unversty Nathan, QLD, Australa xun.yu@grffthun.edu.au,
More informationHierarchical Image Retrieval by Multi-Feature Fusion
Preprnts (www.preprnts.org) NOT PEER-REVIEWED Posted: 26 Aprl 207 do:0.20944/preprnts20704.074.v Artcle Herarchcal Image Retreval by Mult- Fuson Xaojun Lu, Jaojuan Wang,Yngq Hou, Me Yang, Q Wang* and Xangde
More informationImpact of a New Attribute Extraction Algorithm on Web Page Classification
Impact of a New Attrbute Extracton Algorthm on Web Page Classfcaton Gösel Brc, Banu Dr, Yldz Techncal Unversty, Computer Engneerng Department Abstract Ths paper ntroduces a new algorthm for dmensonalty
More informationImproved SIFT-Features Matching for Object Recognition
Improved SIFT-Features Matchng for Obect Recognton Fara Alhwarn, Chao Wang, Danela Rstć-Durrant, Axel Gräser Insttute of Automaton, Unversty of Bremen, FB / NW Otto-Hahn-Allee D-8359 Bremen Emals: {alhwarn,wang,rstc,ag}@at.un-bremen.de
More informationThe Codesign Challenge
ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.
More informationLarge-Scale Multimodal Semantic Concept Detection for Consumer Video
Large-Scale Multmodal Semantc Concept Detecton for Consumer Vdeo Shh-Fu Chang, Dan Ells, We Jang, Keansub Lee, Akra Yanagawa, Alexander C. Lou, Jebo Luo ABSTRACT Columba Unversty, New York, NY {sfchang,
More informationThree supervised learning methods on pen digits character recognition dataset
Three supervsed learnng methods on pen dgts character recognton dataset Chrs Flezach Department of Computer Scence and Engneerng Unversty of Calforna, San Dego San Dego, CA 92093 cflezac@cs.ucsd.edu Satoru
More informationMathematics 256 a course in differential equations for engineering students
Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the
More informationX- Chart Using ANOM Approach
ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are
More informationLearning the Kernel Parameters in Kernel Minimum Distance Classifier
Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department
More informationQuery classification using topic models and support vector machine
Query classfcaton usng topc models and support vector machne Deu-Thu Le Unversty of Trento, Italy deuthu.le@ds.untn.t Raffaella Bernard Unversty of Trento, Italy bernard@ds.untn.t Abstract Ths paper descrbes
More informationA Bilinear Model for Sparse Coding
A Blnear Model for Sparse Codng Davd B. Grmes and Rajesh P. N. Rao Department of Computer Scence and Engneerng Unversty of Washngton Seattle, WA 98195-2350, U.S.A. grmes,rao @cs.washngton.edu Abstract
More informationA Multi-step Strategy for Shape Similarity Search In Kamon Image Database
A Mult-step Strategy for Shape Smlarty Search In Kamon Image Database Paul W.H. Kwan, Kazuo Torach 2, Kesuke Kameyama 2, Junbn Gao 3, Nobuyuk Otsu 4 School of Mathematcs, Statstcs and Computer Scence,
More informationCan We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search
Can We Beat the Prefx Flterng? An Adaptve Framework for Smlarty Jon and Search Jannan Wang Guolang L Janhua Feng Department of Computer Scence and Technology, Tsnghua Natonal Laboratory for Informaton
More informationA Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features
A Probablstc Approach to Detect Urban Regons from Remotely Sensed Images Based on Combnaton of Local Features Berl Sırmaçek German Aerospace Center (DLR) Remote Sensng Technology Insttute Weßlng, 82234,
More informationRecognizing Faces. Outline
Recognzng Faces Drk Colbry Outlne Introducton and Motvaton Defnng a feature vector Prncpal Component Analyss Lnear Dscrmnate Analyss !"" #$""% http://www.nfotech.oulu.f/annual/2004 + &'()*) '+)* 2 ! &
More informationABSTRACT 1. INTRODUCTION
Robust SIFT-Based Descrptor for Vdeo Classfcaton Razyeh Salarfard, Mahshd lsadat Hossen, Mahmood Karman and Shohreh Kasae Department of Computer Engneerng, Sharf Unversty of Technology Tehran, Iran BSTRCT
More informationFast Feature Value Searching for Face Detection
Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com
More informationUnsupervised Learning
Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and
More informationParallel Numerics. 1 Preconditioning & Iterative Solvers (From 2016)
Technsche Unverstät München WSe 6/7 Insttut für Informatk Prof. Dr. Thomas Huckle Dpl.-Math. Benjamn Uekermann Parallel Numercs Exercse : Prevous Exam Questons Precondtonng & Iteratve Solvers (From 6)
More informationPERFORMANCE EVALUATION FOR SCENE MATCHING ALGORITHMS BY SVM
PERFORMACE EVALUAIO FOR SCEE MACHIG ALGORIHMS BY SVM Zhaohu Yang a, b, *, Yngyng Chen a, Shaomng Zhang a a he Research Center of Remote Sensng and Geomatc, ongj Unversty, Shangha 200092, Chna - yzhac@63.com
More informationLarge-scale Web Video Event Classification by use of Fisher Vectors
Large-scale Web Vdeo Event Classfcaton by use of Fsher Vectors Chen Sun and Ram Nevata Unversty of Southern Calforna, Insttute for Robotcs and Intellgent Systems Los Angeles, CA 90089, USA {chensun nevata}@usc.org
More informationScale Selective Extended Local Binary Pattern For Texture Classification
Scale Selectve Extended Local Bnary Pattern For Texture Classfcaton Yutng Hu, Zhlng Long, and Ghassan AlRegb Multmeda & Sensors Lab (MSL) Georga Insttute of Technology 03/09/017 Outlne Texture Representaton
More informationK-means and Hierarchical Clustering
Note to other teachers and users of these sldes. Andrew would be delghted f you found ths source materal useful n gvng your own lectures. Feel free to use these sldes verbatm, or to modfy them to ft your
More informationEFFICIENT H.264 VIDEO CODING WITH A WORKING MEMORY OF OBJECTS
EFFICIENT H.264 VIDEO CODING WITH A WORKING MEMORY OF OBJECTS A Thess presented to the Faculty of the Graduate School at the Unversty of Mssour-Columba In Partal Fulfllment of the Requrements for the Degree
More informationFrom Comparing Clusterings to Combining Clusterings
Proceedngs of the Twenty-Thrd AAAI Conference on Artfcal Intellgence (008 From Comparng Clusterngs to Combnng Clusterngs Zhwu Lu and Yuxn Peng and Janguo Xao Insttute of Computer Scence and Technology,
More information6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour
6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the
More informationOutline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:
Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A
More informationSequential search. Building Java Programs Chapter 13. Sequential search. Sequential search
Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to
More informationFor instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)
Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A
More information