A Knowledge Management System for Organizing MEDLINE Database

Size: px
Start display at page:

Download "A Knowledge Management System for Organizing MEDLINE Database"

Transcription

1 A Knowledge Management System for Organzng MEDLINE Database Hyunk Km, Su-Shng Chen Computer and Informaton Scence Engneerng Department, Unversty of Florda, Ganesvlle, Florda 32611, USA Wth the exploson of bomedcal data, nformaton overload and users nablty of expressng ther nformaton needs may become more serous. To solve those problems, ths paper presents a text data mnng method that uses both text categorzaton and text clusterng for buldng concept herarches for MEDLINE ctatons. The approach we propose s a three-step data mnng process for organzng MEDLINE database: (1) categorzatons accordng to MeSH terms, MeSH major topcs, and the cooccurrence of MeSH descrptors; (2) clusterng usng the results of MeSH term categorzaton; and (3) vsualzaton of categores and herarchcal clusters. The herarches automatcally generated may be used to support users n browsng behavor and help them dentfy good startng ponts for searchng. An nterface for ths underlyng system s also presented. 1. INTRODUCTION MEDLINE, developed by the U.S. Natonal Lbrary of Medcne (NLM), s a database of ndexed bblographc ctatons and abstracts. It contans over 4,600 bomedcal journals [1]. MEDLINE ctatons and abstracts are searchable va PubMed or the NLM Gateway. The NLM produces the MeSH (Medcal Subject Headngs) for the purposes of subject ndexng, catalogng and searchng journal artcles n MEDLINE wth an annual update cycle. MeSH conssts of descrptors (or man headngs), qualfers (or subheadngs), and supplementary concept records. It contans more than 19,000 descrptors whch are used to descrbe the subject topc of an artcle. It also provdes less than 100 qualfers whch are used to express a certan aspect of the concept represented by the descrptor. MeSH terms are arranged both alphabetcally and n a herarchcal tree, n whch specfc subject categores are arranged beneath broader terms. MeSH terms provde a consstent way of retrevng nformaton regardless of dfferent termnology used by the authors n the orgnal artcles. By usng MeSH terms, the user s able to narrow the search space n MEDLINE. As a result, by addng more MeSH terms to the query, retreval performance may be mproved [2]. However, there are nherent challenges, as well. There may be nformaton overload [3], and users may be unable to express ther nformaton needs, n order to take full advantage of the MEDLINE database. MEDLINE contans over 12 mllon artcle ctatons. Begnnng n 2002, t began to add over 2,000 new references on a daly bass [1]. Although the user may be able to lmt the search space of MEDLINE wth MeSH terms, keyword searches often result n a long lst of results. For nstance, when the user queres the term Parknson s Dsease by lmtng t to the MeSH descrptors, PubMed returns over 21,000 results. Here, there s a problem of nformaton overload, wth the user havng dffculty fndng relevant nformaton. The nablty of users to express nformaton needs may become more serous, unless users have a precse knowledge n ther area of nterest, or an understandng of MeSH and ts structure. The use of common abbrevatons, techncal terms, and synonyms n bomedcal artcles prevents users from artculatng ther nformaton needs accurately. To avod the vocabulary problem, MeSH may be used. However, t s dffcult for an unfamlar user to locate approprate descrptors and/or qualfers, snce MeSH s a very complex thesaurus. Furthermore, new terms are added, some are modfed, and others are removed each year as bomedcal felds change. An mprecse query usually results n a long lst of rrelevant hts [4]. Under such crcumstances, a better mechansm s needed to organze nformaton n order to help users explore wthn an organzed nformaton space [5]. In order to arrange the contents n a useful way, text categorzaton and text clusterng have been researched extensvely. Text categorzaton s a bolng down of the specfc content of a document nto a set of one or more pre-defned labels [6]. Text clusterng can group smlar documents nto a set of clusters based on shared features among subsets of the documents [4], [7]. In ths paper, we present a text data mnng method that uses both text categorzaton and text clusterng for buldng a concept herarchy for MEDLINE ctatons. The approach we propose s a three-step data mnng process for organzng MEDLINE database: (1) categorzatons accordng to MeSH terms, MeSH major topcs, and the co-occurrence of MeSH descrptors, (2) clusterng usng the results of MeSH term categorzaton, and (3) vsualzaton of categores and herarchcal clusters. The herarches

2 automatcally generated may be used to support users n browsng behavor as well as help them dentfy good startng ponts for searchng. An nterface for ths underlyng system s also presented. 2. METHODS In ths Secton, we wll explan the data mnng method proposed n detal. We used MySQL to store MEDLINE ctatons and addtonal data that was generated by the data mnng process. 2.1 Data Collecton For the followng experment, we extracted a total of 1,736 ctatons encoded n XML (extensble Markup Language) from the query Secondary Parknson Dsease, lmtng the results to the MeSH major topc feld and to ctatons wth abstracts n MEDLINE. 2.2 Text Categorzaton Categorzaton refers to an algorthm or procedure whch results n the assgnment of categores to documents [6]. We chose the MeSH major topc, the MeSH descrptor and qualfer, and a co-occurrence of MeSH descrptors as a feature to be used n classfcaton. To categorze the collecton accordng to the selected features, we frst parsed the data collecton encoded n XML usng SAX (Smple API for XML). After extractng the MeSH major topcs, the MeSH descrptors, and the co-occurrence of MeSH descrptors for each ctaton, we nserted the data nto the correspondng MySQL tables. 2.3 Text Clusterng usng the Results of MeSH descrptor Categorzaton Snce many MeSH terms may be assgned to a ctaton and vce versa, categorzaton wth the MeSH terms or the co-occurrence of MeSH terms often results n a large lst or herarchy. Some categores may contan a large number of documents. Smply lstng categores assocated wth documents s nadequate for organzng data [6]. To allevate ths problem, the approach we propose here s to cluster the results of MeSH descrptor categorzaton usng the herarchcal Self-Organzng Map (SOM). We chose only those MeSH descrptor categores whose document frequences are over a predetermned threshold for clusterng. Document frequency s the number of documents n whch a term occurs. Terms are extracted and selected usng category dependent document frequency thresholdng from the categores chosen. There are two ways that document frequency s calculated: category ndependent term selecton and category dependent term selecton [8]. In category dependent term selecton, document frequency of each term s computed from all the documents n the collecton and the selected set of terms are used on each category. In category ndependent term selecton, document frequency of each term s calculated from only those documents belongng to that category. Thus, dfferent sets of terms are used for dfferent categores. After the feature selecton and extracton, and the SOM clusterng, a concept herarchy s obtaned, by relyng on the MeSH descrptors for the top layer, and by usng feature vectors extracted from the ttles and abstracts for the sub-layer Feature Extracton and Selecton To produce a concept herarchy usng the SOM, documents must be represented by a set of features. For ths purpose, we use full-text ndexng to extract a lst of terms (words or phrases). The nput vector s constructed by ndexng the ttle and abstract elements of the collecton. We then weght these terms usng the vector space model n Informaton Retreval [9]. In the vector space model, documents are represented as term vectors usng the product of the term frequency (TF) and the nverse document frequency (IDF). Each entry n the document vector corresponds to the weght of a term n the document. We used normalzed TF x IDF term weghtng scheme, best fully weghted scheme [9], so that longer documents are not gven more weght and all values of a document vector are dstrbuted n the range of 0 to 1. Thus, weghted word hstogram can be vewed as the feature vector descrbng the document [10]. The preprocessng procedure s manly dvded nto two stages: noun phrase extracton and term weghtng. In the noun phrase extracton phase, we frst fetched the MEDLINE dentfer, the ttle and abstract elements from the collecton and then tokenzed the ttle and abstract elements based on Penn Treebank tokenzaton scheme to detect sentence boundares, and to separate extraneous punctuatons from the nput text. The MEDLINE dentfer was used as a document dentfer. We then automatcally assgned part of speech tags to words reflectng ther syntactc category by usng the rulebased part of speech tagger [11]. After recognzng the chunks that consst of noun phrases from the tagged text, we extracted a set of noun phrases for each ctaton. At ths stage, we removed common terms by consultng a lst of 906 stop words. We computed document frequency of all terms usng category dependent term selecton for those MeSH descrptor categores whose document frequences were over a predetermned threshold (n ths experment, greater than 100 tmes). We then elmnated terms from the feature space whose

3 document frequency was less than a predetermned threshold (n ths experment, less than 10 tmes). Fnally, we weghted the terms ndexed usng the best fully weghted scheme [9], and assgned correspondng term weghts to each document for each category selected. Thus, the weghted term vector set can be used as the nput vector set for the SOM Constructon of a Concept Herarchy Document clusterng s defned as groupng smlar documents nto a cluster. To mprove retreval effcency and effectveness, related documents should be collected together n the same cluster based on some noton of smlarty The Self-Organzng Map s an unsupervsed learnng neural network algorthm for the vsualzaton of hgh-dmensonal data. The SOM defnes a mappng from the nput data space onto a two-dmensonal array of nodes. Every node s represented by a model vector, also called reference vector, m = [m 1, m 2,, m n ], where n s nput vector dmenson. Our algorthm s dfferent from other SOM-varant algorthms, n that each sub-layer SOM dynamcally reconstructs a new nput vector from an upper-level nput vector. The followng algorthm descrbes how to construct a subject-specfc concept herarchy. 1. Intalze network by usng the subject feature vector as the nput vector: Create a twodmensonal map and randomly ntalze model vectors m n the range of 0 to 1 to start from an arbtrary ntal state. 2. Present nput vector n sequental order: Cyclcally present nput vector x(t), the weghted subject nput vector of an n-dmensonal space, to all nodes n the network. Each entry n the nput vector corresponds to the weght of a term n the document. Zero means the term has no sgnfcance n the document or t smply does not exst n the document. 3. Fnd the wnnng node by computng the Eucldean dstance for each node: In order to compare the nput and weght vectors, each node computes the Eucldean dstance between ts weght vector and the nput. The smallest Eucldean dstance dentfes the best-matchng node, whch s chosen as the wnnng node for that partcular nput vector. The best-matchng node, denoted by the subscrpt c, s x m = mn{ x m }. c 4. Update weghts of the wnnng node and ts topologcal neghborhoods: The update rule for the model vector of node s [ x( t) m ( )] m ( t + 1) = m ( t) + α ( t) h ( t) t, where t s dscrete-tme coordnate, (t) s adaptaton coeffcent, and h c (t) s neghborhood functon, a smoothng kernel centered on the wnng node. 5. Repeat steps 2-4 untl all teratons have been completed. 6. Label nodes of the traned network wth the noun phrases of the subject feature vectors: For each node, we determned the dmenson wth the greatest value, and labeled the node wth a correspondng term for that node, and then labeled aggregate nodes wth the same term nto groups. Thus, the subject-specfc top-ter concept map s generated. 7. Repeat steps 1-6 by constructng new nput vector for each grouped concept regon: For each grouped concept regon contanng more than k documents (e.g. 100), recursvely create a SOM and repeat steps 1-6. At ths pont new nput feature vector s dynamcally created by selectng only terms that are contaned n the concept regon from the upper-level feature vector. For each MeSH descrptor category contanng more than 100 documents, we generated a concept herarchy usng the SOM, lmtng the maxmum level of herarchy to 3. We bult a 10 x 10 SOM, and presented each nput vector 100 tmes to the SOM. We then recursvely bult the sub-layer concept herarchy by tranng a new 10 x 10 SOM wth a new nput vector, whch s dynamcally constructed by selectng only a document feature vector contaned n the concept regon from the upper-level feature vector. The concept herarchy generated contans two knds of nformaton: category labels extracted from the MeSH descrptors for the top-level, and the concept herarchy usng the SOM for the sub-layer. We nserted ths nformaton nto the MySQL database to buld an nteractve user nterface. 2.4 Results For the results of categorzaton, we extracted 2,210 dstnct MeSH descrptors, 70 dstnct MeSH qualfers, 269 dstnct MeSH major topcs, and 60,192 co-occurrng MeSH descrptors from the collecton. On average, each ctaton n the collecton contans 14 MeSH descrptors, 10 MeSH qualfers, and 4 MeSH major topcs. c

4 Fgure 1. Interface of MeSH Major Topc Vew Fgure 2. Interface of SOM Tree Vew For text clusterng, we dentfed a total of 20,367 dstnct terms from the collecton after the stop word removal. A total of 22 categores contanng more than 100 ctatons were dentfed from the results of MeSH descrptor categorzaton. After the category dependent document frequency thresholdng, an average of 66 terms were selected per category, rangng from 14 terms for one category to 260 terms for another category. After the herarchcal SOM clusterng, 193 dstnct concepts were generated from 22 categores. 3. USER INTERFACES We provded four dfferent vews, three category herarches and one clusterng herarchy to users. We represented ths herarchy nformaton as herarchcal trees to help users understand MeSH qualfers and descrptors, so they could fnd a set of documents of nterest, and locate good startng ponts for searchng. 3.1 MeSH Major Topc Tree and MeSH Term Tree The MeSH term tree dsplays the categorzed nformaton space, arranged by frst descrptors and then qualfers. Fgure 1 shows the nterface of the MeSH term tree. In each level of herarchy, MeSH terms are lsted n alphabetcal order, along wth ther document frequences. When the user clcks on a category label that s ether a descrptor or a qualfer on the left pane, the assocated document set s dsplayed on the rght pane. At ths pont, f the category s a descrptor, the assocated qualfers n the collecton are also expanded as ts chldren n the tree. Users can see more detaled nformaton of a document by clckng on the ttle of a document that s shown on the rght pane. To help users better understand the meanng of an ambguous MeSH term, the correspondng descrptor data and context n the MeSH tree may be dsplayed by clckng on the lnk

5 MeSH Descrptor Data & Tree Structures wthn each level of the tree. In some cases, the user may want to see the category arranged by only MeSH major topcs. The MeSH major topc tree provdes the same nformaton as the MeSH term tree except that t shows the category herarchy arranged by only MeSH major topcs. 3.2 MeSH Co-occurrence Tree The MeSH co-occurrence tree provdes the cooccurrence of MeSH descrptors, along wth ther cooccurrence frequency n the collecton. Snce an average of 14 MeSH descrptors are assgned to each ctaton n the collecton, there are a large number of nodes n the co-occurrence tree. To better organze the co-occurrence tree, the nterface allows the user to select the co-occurrence frequency range. Thus, the user can easly dentfy co-occurrng semantc types n the collecton. 3.3 SOM Tree The SOM tree was constructed for each MeSH descrptor whose document frequency was less than some predetermned threshold. Typcally, 10 to 12 MeSH descrptors are assgned to each MEDLINE ctaton. Thus, some categores assocated wth a large number of ctatons do not characterze the nformaton n a way that s of nterest to the user [6]. To solve ths problem, we further arrange those categores herarchcally usng the SOM. In some cases, clusterng seems useful n helpng users flter out sets of documents that are clearly not relevant and should be gnored [6]. Fgure 2 show the nterface for browsng the SOM tree. 4. CONCLUSIONS We have proposed a three-step data mnng process for organzng MEDLINE database: (1) categorzatons accordng to MeSH terms, MeSH major topcs, and the co-occurrence of MeSH descrptors; (2) clusterng usng the results of MeSH term categorzaton; and (3) vsualzaton of categores and herarchcal clusters. The proposed SOM algorthm s dfferent from other SOM-varant algorthms. Frst, t uses the results of categorzaton. Second, after constructng the top-level concept map and aggregatng nodes wth the same concept on the map nto a group, t dynamcally reconstructs nput vector by selectng only terms that are contaned for each concept regon from the nput vector of the hgher level and recomputng ther weghts to generate the sub-layer map. Thus, the new nput vector would reflect only the contents of the regon and not the all collecton for each SOM. One of weak ponts of our approach s that the SOM algorthm s nadequate for the collecton updates when new documents are added. Another weakness s that we need to do user evaluaton for the herarchcal clusterng results. A future research wll evaluate the accuracy of clusterng results, and refne the algorthm more effectvely. REFERENCES 1. Natonal Lbrary of Medcne.: MEDLINE Fact Sheet. ed lne.html. 2. French, J. C., Powell, A. L., Gey, F. and Perelman, N.: Explotng a Controlled Vocabulary to Improve Collecton Selecton and Retreval Effectveness, In Proceedngs Tenth Internatonal Conference on Informaton and Knowledge Management (CIKM). November Pratt, W., Fagan, L.: The Usefulness of Dynamcally Categorzng Search Results. In Journal of the Amercan Medcal Informatcs Assocaton (JAMIA) (6), Chen, H., Schuffels, C., and Orwg, R.: Internet Categorzaton and Search: A Self-Organzng Approach. In Journal of Vsual Communcaton and Image Representaton (1), Chen, S.: Dgtal Lbrares: The Lfe Cycle of Informaton. Better Earth Publsher, Hearst, M. A.: The Use of Categores and Clusters for Organzng Retreval Results. Natural Language Informaton Retreval, Dordrecht, Kluwer Academc Publshers Kohonen, T.: Self-Organzaton of Very Large Document Collecton: State of the Art. In Proceedngs of ICANN98, the 8th Internatonal Conference on Artfcal Neural Networks, Skovde, Sweden Chen, H. and Ho, T. K.: Evaluaton of Decson Forests on Text Categorzaton. In Proceedngs of the 7th Conference on Document Recognton and Retreval Salton, G., and Buckley, C.: Term-Weghtng Approaches n Automatc Text Retreval. Informaton Processng and Management, (5), Kohonen, T., Kask, S., Lagus, K., Salojärv, J., Honkela, J., Paatero, V., and Saarela, A.: Self Organzng of a Massve Document Collecton. IEEE Transactons on Neural Networks. May (3). 11. Brll, E.: A Smple Rule-based Part of Speech Tagger. In Proceedngs of the 3rd Conference on Appled Natural Language Processng, Trento, Italy

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Visual Thesaurus for Color Image Retrieval using Self-Organizing Maps

Visual Thesaurus for Color Image Retrieval using Self-Organizing Maps Vsual Thesaurus for Color Image Retreval usng Self-Organzng Maps Chrstopher C. Yang and Mlo K. Yp Department of System Engneerng and Engneerng Management The Chnese Unversty of Hong Kong, Hong Kong ABSTRACT

More information

Combining Multiple Resources, Evidence and Criteria for Genomic Information Retrieval

Combining Multiple Resources, Evidence and Criteria for Genomic Information Retrieval Combnng Multple Resources, Evdence and Crtera for Genomc Informaton Retreval Luo S 1, Je Lu 2 and Jame Callan 2 1 Department of Computer Scence, Purdue Unversty, West Lafayette, IN 47907, USA ls@cs.purdue.edu

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15 CS434a/541a: Pattern Recognton Prof. Olga Veksler Lecture 15 Today New Topc: Unsupervsed Learnng Supervsed vs. unsupervsed learnng Unsupervsed learnng Net Tme: parametrc unsupervsed learnng Today: nonparametrc

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Keywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines

Keywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines (IJCSIS) Internatonal Journal of Computer Scence and Informaton Securty, Herarchcal Web Page Classfcaton Based on a Topc Model and Neghborng Pages Integraton Wongkot Srura Phayung Meesad Choochart Haruechayasak

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Querying by sketch geographical databases. Yu Han 1, a *

Querying by sketch geographical databases. Yu Han 1, a * 4th Internatonal Conference on Sensors, Measurement and Intellgent Materals (ICSMIM 2015) Queryng by sketch geographcal databases Yu Han 1, a * 1 Department of Basc Courses, Shenyang Insttute of Artllery,

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

SRBIR: Semantic Region Based Image Retrieval by Extracting the Dominant Region and Semantic Learning

SRBIR: Semantic Region Based Image Retrieval by Extracting the Dominant Region and Semantic Learning Journal of Computer Scence 7 (3): 400-408, 2011 ISSN 1549-3636 2011 Scence Publcatons SRBIR: Semantc Regon Based Image Retreval by Extractng the Domnant Regon and Semantc Learnng 1 I. Felc Raam and 2 S.

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Novel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition

Novel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition Mathematcal Methods for Informaton Scence and Economcs Novel Pattern-based Fngerprnt Recognton Technque Usng D Wavelet Decomposton TUDOR BARBU Insttute of Computer Scence of the Romanan Academy T. Codrescu,,

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval Fuzzy -Means Intalzed by Fxed Threshold lusterng for Improvng Image Retreval NAWARA HANSIRI, SIRIPORN SUPRATID,HOM KIMPAN 3 Faculty of Informaton Technology Rangst Unversty Muang-Ake, Paholyotn Road, Patumtan,

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Query classification using topic models and support vector machine

Query classification using topic models and support vector machine Query classfcaton usng topc models and support vector machne Deu-Thu Le Unversty of Trento, Italy deuthu.le@ds.untn.t Raffaella Bernard Unversty of Trento, Italy bernard@ds.untn.t Abstract Ths paper descrbes

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks Decson Strateges for Ratng Objects n Knowledge-Shared Research etwors ALEXADRA GRACHAROVA *, HAS-JOACHM ER **, HASSA OUR ELD ** OM SUUROE ***, HARR ARAKSE *** * nsttute of Control and System Research,

More information

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Keyword-based Document Clustering

Keyword-based Document Clustering Keyword-based ocument lusterng Seung-Shk Kang School of omputer Scence Kookmn Unversty & AIrc hungnung-dong Songbuk-gu Seoul 36-72 Korea sskang@kookmn.ac.kr Abstract ocument clusterng s an aggregaton of

More information

KIDS Lab at ImageCLEF 2012 Personal Photo Retrieval

KIDS Lab at ImageCLEF 2012 Personal Photo Retrieval KD Lab at mageclef 2012 Personal Photo Retreval Cha-We Ku, Been-Chan Chen, Guan-Bn Chen, L-J Gaou, Rong-ng Huang, and ao-en Wang Knowledge, nformaton, and Database ystem Laboratory Department of Computer

More information

A Clustering Algorithm for Key Frame Extraction Based on Density Peak

A Clustering Algorithm for Key Frame Extraction Based on Density Peak Journal of Computer and Communcatons, 2018, 6, 118-128 http://www.scrp.org/ournal/cc ISSN Onlne: 2327-5227 ISSN Prnt: 2327-5219 A Clusterng Algorthm for Key Frame Extracton Based on Densty Peak Hong Zhao

More information

1. Introduction. Abstract

1. Introduction. Abstract Image Retreval Usng a Herarchy of Clusters Danela Stan & Ishwar K. Seth Intellgent Informaton Engneerng Laboratory, Department of Computer Scence & Engneerng, Oaland Unversty, Rochester, Mchgan 48309-4478

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Alignment Results of SOBOM for OAEI 2010

Alignment Results of SOBOM for OAEI 2010 Algnment Results of SOBOM for OAEI 2010 Pegang Xu, Yadong Wang, Lang Cheng, Tany Zang School of Computer Scence and Technology Harbn Insttute of Technology, Harbn, Chna pegang.xu@gmal.com, ydwang@ht.edu.cn,

More information

Relevance Feedback for Image Retrieval

Relevance Feedback for Image Retrieval Vashal D Dhale et al, / (IJCSIT Internatonal Journal of Computer Scence and Informaton Technologes, Vol 4 (2, 203, 39-323 Relevance Feedback for Image Retreval Vashal D Dhale, Dr A R Mahaan, Prof Uma Thakur

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING An Improved K-means Algorthm based on Cloud Platform for Data Mnng Bn Xa *, Yan Lu 2. School of nformaton and management scence, Henan Agrcultural Unversty, Zhengzhou, Henan 450002, P.R. Chna 2. College

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Background Removal in Image indexing and Retrieval

Background Removal in Image indexing and Retrieval Background Removal n Image ndexng and Retreval Y Lu and Hong Guo Department of Electrcal and Computer Engneerng The Unversty of Mchgan-Dearborn Dearborn Mchgan 4818-1491, U.S.A. Voce: 313-593-508, Fax:

More information

Web Document Classification Based on Fuzzy Association

Web Document Classification Based on Fuzzy Association Web Document Classfcaton Based on Fuzzy Assocaton Choochart Haruechayasa, Me-Lng Shyu Department of Electrcal and Computer Engneerng Unversty of Mam Coral Gables, FL 33124, USA charuech@mam.edu, shyu@mam.edu

More information

Improving the Quality of Information Retrieval Using Syntactic Analysis of Search Query

Improving the Quality of Information Retrieval Using Syntactic Analysis of Search Query Improvng the Qualty of Informaton Retreval Usng Syntactc Analyss of Search Query Nadezhda Yarushkna 1[0000-0002-5718-8732], Aleksey Flppov 1[0000-0003-0008-5035], and Mara Grgorcheva 1[0000-0001-7492-5178]

More information

Online Text Mining System based on M2VSM

Online Text Mining System based on M2VSM FR-E2-1 SCIS & ISIS 2008 Onlne Text Mnng System based on M2VSM Yasufum Takama 1, Takash Okada 1, Toru Ishbash 2 1. Tokyo Metropoltan Unversty, 2. Tokyo Metropoltan Insttute of Technology 6-6 Asahgaoka,

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

A Novel Term_Class Relevance Measure for Text Categorization

A Novel Term_Class Relevance Measure for Text Categorization A Novel Term_Class Relevance Measure for Text Categorzaton D S Guru, Mahamad Suhl Department of Studes n Computer Scence, Unversty of Mysore, Mysore, Inda Abstract: In ths paper, we ntroduce a new measure

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

KOHONEN'S SELF ORGANIZING NETWORKS WITH "CONSCIENCE"

KOHONEN'S SELF ORGANIZING NETWORKS WITH CONSCIENCE Kohonen's Self Organzng Maps and ther use n Interpretaton, Dr. M. Turhan (Tury) Taner, Rock Sold Images Page: 1 KOHONEN'S SELF ORGANIZING NETWORKS WITH "CONSCIENCE" By: Dr. M. Turhan (Tury) Taner, Rock

More information

Performance Assessment and Fault Diagnosis for Hydraulic Pump Based on WPT and SOM

Performance Assessment and Fault Diagnosis for Hydraulic Pump Based on WPT and SOM Performance Assessment and Fault Dagnoss for Hydraulc Pump Based on WPT and SOM Be Jkun, Lu Chen and Wang Zl PERFORMANCE ASSESSMENT AND FAULT DIAGNOSIS FOR HYDRAULIC PUMP BASED ON WPT AND SOM. Be Jkun,

More information

Semantic Image Retrieval Using Region Based Inverted File

Semantic Image Retrieval Using Region Based Inverted File Semantc Image Retreval Usng Regon Based Inverted Fle Dengsheng Zhang, Md Monrul Islam, Guoun Lu and Jn Hou 2 Gppsland School of Informaton Technology, Monash Unversty Churchll, VIC 3842, Australa E-mal:

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION 1 FENG YONG, DANG XIAO-WAN, 3 XU HONG-YAN School of Informaton, Laonng Unversty, Shenyang Laonng E-mal: 1 fyxuhy@163.com, dangxaowan@163.com, 3 xuhongyan_lndx@163.com

More information

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks

Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks Federated Search of Text-Based Dgtal Lbrares n Herarchcal Peer-to-Peer Networks Je Lu School of Computer Scence Carnege Mellon Unversty Pttsburgh, PA 15213 jelu@cs.cmu.edu Jame Callan School of Computer

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Adaptive Knowledge-Based Visualization for Accessing Educational Examples

Adaptive Knowledge-Based Visualization for Accessing Educational Examples Adaptve Knowledge-Based Vsualzaton for Accessng Educatonal Examples Peter Bruslovsky, Jae-wook Ahn, Tbor Dumtru, Mchael Yudelson School of Informaton Scences, Unversty of Pttsburgh {peterb, jaa38, mvy3}@ptt.edu

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Cross-Language Information Retrieval

Cross-Language Information Retrieval Feature Artcle: Cross-Language Informaton Retreval 19 Cross-Language Informaton Retreval Jan-Yun Ne 1 Abstract A research group n Unversty of Montreal has worked on the problem of cross-language nformaton

More information

A new segmentation algorithm for medical volume image based on K-means clustering

A new segmentation algorithm for medical volume image based on K-means clustering Avalable onlne www.jocpr.com Journal of Chemcal and harmaceutcal Research, 2013, 5(12):113-117 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCRC5 A new segmentaton algorthm for medcal volume mage based

More information

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval Proceedngs of the Thrd NTCIR Workshop Descrpton of NTU Approach to NTCIR3 Multlngual Informaton Retreval Wen-Cheng Ln and Hsn-Hs Chen Department of Computer Scence and Informaton Engneerng Natonal Tawan

More information

An Intelligent Tool for Building E-Learning Contend- Material Using Natural Language in Digital Libraries

An Intelligent Tool for Building E-Learning Contend- Material Using Natural Language in Digital Libraries An Intellgent Tool for Buldng E-Learnng Contend- Materal Usng Natural Language n Dgtal Lbrares A. Drgas, J. Vrettaros NCSR Demokrtos DTE/YE, Department of technologcal applcaton, NetMeda Lab Ag. Paraskev

More information

Research on Categorization of Animation Effect Based on Data Mining

Research on Categorization of Animation Effect Based on Data Mining MATEC Web of Conferences 22, 0102 0 ( 2015) DOI: 10.1051/ matecconf/ 2015220102 0 C Owned by the authors, publshed by EDP Scences, 2015 Research on Categorzaton of Anmaton Effect Based on Data Mnng Na

More information

Robust Classification of ph Levels on a Camera Phone

Robust Classification of ph Levels on a Camera Phone Robust Classfcaton of ph Levels on a Camera Phone B.Y. Loh, N.K. Vuong, S. Chan and C.. Lau AbstractIn ths paper, we present a new algorthm that automatcally classfes the ph level on a test strp usng color

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

Hierarchical Image Retrieval by Multi-Feature Fusion

Hierarchical Image Retrieval by Multi-Feature Fusion Preprnts (www.preprnts.org) NOT PEER-REVIEWED Posted: 26 Aprl 207 do:0.20944/preprnts20704.074.v Artcle Herarchcal Image Retreval by Mult- Fuson Xaojun Lu, Jaojuan Wang,Yngq Hou, Me Yang, Q Wang* and Xangde

More information

Document Representation and Clustering with WordNet Based Similarity Rough Set Model

Document Representation and Clustering with WordNet Based Similarity Rough Set Model IJCSI Internatonal Journal of Computer Scence Issues, Vol. 8, Issue 5, No 3, September 20 ISSN (Onlne): 694-084 www.ijcsi.org Document Representaton and Clusterng wth WordNet Based Smlarty Rough Set Model

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

A Resources Virtualization Approach Supporting Uniform Access to Heterogeneous Grid Resources 1

A Resources Virtualization Approach Supporting Uniform Access to Heterogeneous Grid Resources 1 A Resources Vrtualzaton Approach Supportng Unform Access to Heterogeneous Grd Resources 1 Cunhao Fang 1, Yaoxue Zhang 2, Song Cao 3 1 Tsnghua Natonal Labatory of Inforamaton Scence and Technology 2 Department

More information

Brushlet Features for Texture Image Retrieval

Brushlet Features for Texture Image Retrieval DICTA00: Dgtal Image Computng Technques and Applcatons, 1 January 00, Melbourne, Australa 1 Brushlet Features for Texture Image Retreval Chbao Chen and Kap Luk Chan Informaton System Research Lab, School

More information

Application of k-nn Classifier to Categorizing French Financial News

Application of k-nn Classifier to Categorizing French Financial News Applcaton of k-nn Classfer to Categorzng French Fnancal News Huazhong KOU, Georges GARDARIN 2, Alan D'heygère 2, Karne Zetoun PRSM Laboratory, Unversty of Versalles Sant-Quentn 45 Etats-Uns Road, 78035

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment

A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment A Webpage Smlarty Measure for Web Sessons Clusterng Usng Sequence Algnment Mozhgan Azmpour-Kv School of Engneerng and Scence Sharf Unversty of Technology, Internatonal Campus Ksh Island, Iran mogan_az@ksh.sharf.edu

More information

Single Document Keyphrase Extraction Using Neighborhood Knowledge

Single Document Keyphrase Extraction Using Neighborhood Knowledge Proceedngs of the Twenty-Thrd AAAI Conference on Artfcal Intellgence (2008) Sngle Document Keyphrase Extracton Usng Neghborhood Knowledge Xaoun Wan and Janguo Xao Insttute of Computer Scence and Technology

More information

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems Determnng Fuzzy Sets for Quanttatve Attrbutes n Data Mnng Problems ATTILA GYENESEI Turku Centre for Computer Scence (TUCS) Unversty of Turku, Department of Computer Scence Lemmnkäsenkatu 4A, FIN-5 Turku

More information

Elliptical Rule Extraction from a Trained Radial Basis Function Neural Network

Elliptical Rule Extraction from a Trained Radial Basis Function Neural Network Ellptcal Rule Extracton from a Traned Radal Bass Functon Neural Network Andrey Bondarenko, Arkady Borsov DITF, Rga Techncal Unversty, Meza ¼, Rga, LV-1658, Latva Andrejs.bondarenko@gmal.com, arkadjs.borsovs@cs.rtu.lv

More information

Clustering Algorithm of Similarity Segmentation based on Point Sorting

Clustering Algorithm of Similarity Segmentation based on Point Sorting Internatonal onference on Logstcs Engneerng, Management and omputer Scence (LEMS 2015) lusterng Algorthm of Smlarty Segmentaton based on Pont Sortng Hanbng L, Yan Wang*, Lan Huang, Mngda L, Yng Sun, Hanyuan

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

A new Unsupervised Clustering-based Feature Extraction Method

A new Unsupervised Clustering-based Feature Extraction Method A new Unsupervsed Clusterng-based Feature Extracton Method Sabra El Ferchch ACS Natonal School of Engneerng at Tuns, Tunsa Salah Zd AGIS lle Unversty of Scence and Technology, France Kaouther aabd ACS

More information

Invariant Shape Object Recognition Using B-Spline, Cardinal Spline, and Genetic Algorithm

Invariant Shape Object Recognition Using B-Spline, Cardinal Spline, and Genetic Algorithm Proceedngs of the 5th WSEAS Int. Conf. on Sgnal Processng, Robotcs and Automaton, Madrd, Span, February 5-7, 6 (pp4-45) Invarant Shape Obect Recognton Usng B-Splne, Cardnal Splne, and Genetc Algorthm PISIT

More information

A KIND OF ROUTING MODEL IN PEER-TO-PEER NETWORK BASED ON SUCCESSFUL ACCESSING RATE

A KIND OF ROUTING MODEL IN PEER-TO-PEER NETWORK BASED ON SUCCESSFUL ACCESSING RATE A KIND OF ROUTING MODEL IN PEER-TO-PEER NETWORK BASED ON SUCCESSFUL ACCESSING RATE 1 TAO LIU, 2 JI-JUN XU 1 College of Informaton Scence and Technology, Zhengzhou Normal Unversty, Chna 2 School of Mathematcs

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

COMPLEX WAVELET TRANSFORM-BASED COLOR INDEXING FOR CONTENT-BASED IMAGE RETRIEVAL

COMPLEX WAVELET TRANSFORM-BASED COLOR INDEXING FOR CONTENT-BASED IMAGE RETRIEVAL COMPLEX WAVELET TRANSFORM-BASED COLOR INDEXING FOR CONTENT-BASED IMAGE RETRIEVAL Nader Safavan and Shohreh Kasae Department of Computer Engneerng Sharf Unversty of Technology Tehran, Iran skasae@sharf.edu

More information

Information Retrieval

Information Retrieval Anmol Bhasn abhasn[at]cedar.buffalo.edu Moht Devnan mdevnan[at]cse.buffalo.edu Sprng 2005 #$ "% &'" (! Informaton Retreval )" " * + %, ##$ + *--. / "#,0, #'",,,#$ ", # " /,,#,0 1"%,2 '",, Documents are

More information

Efficient Mean Shift Algorithm based Color Images Categorization and Searching

Efficient Mean Shift Algorithm based Color Images Categorization and Searching 152 Effcent Mean Shft Algorthm based Color Images Categorzaton and Searchng 1 Dr S K Vay, 2 Sanay Rathore, 3 Abhshek Verma and 4 Hemra Sngh Thakur 1 Professor, Head of Dept Physcs, Govt Geetanal Grl s

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

Impact of a New Attribute Extraction Algorithm on Web Page Classification

Impact of a New Attribute Extraction Algorithm on Web Page Classification Impact of a New Attrbute Extracton Algorthm on Web Page Classfcaton Gösel Brc, Banu Dr, Yldz Techncal Unversty, Computer Engneerng Department Abstract Ths paper ntroduces a new algorthm for dmensonalty

More information

CUM: An Efficient Framework for Mining Concept Units

CUM: An Efficient Framework for Mining Concept Units CUM: An Effcent Framework for Mnng Concept Unts P.Santh Thlagam Ananthanarayana V.S Department of Informaton Technology Natonal Insttute of Technology Karnataka - Surathkal Inda 575025 santh_soc@yahoo.co.n,

More information

Deep Classification in Large-scale Text Hierarchies

Deep Classification in Large-scale Text Hierarchies Deep Classfcaton n Large-scale Text Herarches Gu-Rong Xue Dkan Xng Qang Yang 2 Yong Yu Dept. of Computer Scence and Engneerng Shangha Jao-Tong Unversty {grxue, dkxng, yyu}@apex.sjtu.edu.cn 2 Hong Kong

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information