A Novel Optimization Technique for Translation Retrieval in Networks Search Engines
|
|
- Sharleen Rose
- 5 years ago
- Views:
Transcription
1 A Novel Optmzaton Technque for Translaton Retreval n Networks Search Engnes Yanyan Zhang Zhengzhou Unversty of Industral Technology, Henan, Chna Abstract - Ths paper studes models of Translaton Retreval.e. the relatonshp between enqurer s nput words and the retreved nformaton n network search engnes. In order to solve the dffcultes n the tradtonal model, a new mathematcal model s proposed to quantfy the correlaton between web content and user query, and the method s shown by experments to outperform other Translaton Retreval methods. The mproved model s a good soluton to the problems of the tradtonal model, greatly mprovng the query precson and recall rate of search engnes. Keywords - Search engne; Translaton Retreval model; Network searchng engne; Optmzaton. I. INTRODUCTION When a search engne provdes nformaton nqury servce, t only sees the query words. People from dfferent backgrounds may submt the same query words, but are often concerned about dfferent nformaton meanng of those query words. Moreover, the search engne usually does not know the background of the users, so n order not to mss any relevant nformaton, t places the focused nformaton as much as possble n the front of search lst. Ths s a basc requrement for search engnes. Therefore, the core work of a search engne s to sequence the crawled webpages accordng to some factors based on the query words. The three man factors affectng the Translaton Retreval results are the Network searchng engne of webpages, the lnk relatonshp of pages and the user s query ntenton. II. TRANSLATION RETRIEVAL MODEL FRAMEWORK Although there s a varety of Translaton Retreval models, ther status and functon n search engne s the same. Fgure shows a frame of calculaton smlarty of search engne. When the user has nformaton demand, the query words wll be constructed as a concrete manfestaton of the nformaton demand, and the search engne wll construct the nternal query representaton to the user s query words. For the massve web pages or document collecton, there s also correspondng document representaton method nsde the search system. The core of the search engne s to judge whch documents are relevant to user s demand, and to output n a sorted way. So the correlaton calculaton s a process of matchng the user query and document content, and the Translaton Retreval model s a theoretcal bass and core component whch s used to calculate the Network searchng engne. Fgure. Translaton Retreval Model Framework DOI 0.503/IJSSST.a ISSN: x onlne, prnt
2 III. THE BIM25 MODEL BIM (Bnary Independent Model) only consders whether a word appears n the document or not and does not consder ts own feature. BM25, based on BIM, ntroduces the weght value of the word n the query and the weght value of the word n the document. So, now BM25 model s a comparatvely successful content sortng model. The specfc calculaton method of BM25 model s as shown n the formula (). For each query word appeared n the query Q, ther scores n the document D wll be calculated n turn, and after the accumulaton, comes the correlaton score of document D to query Q. Q ( r 0.5) / ( Rr 0.5) log ( n r 0.5)/( N n Rr 0.5) ( k) f ( k2 ) qf K f k qf In the above formula (( ) dl K k ) b b avdl represents the consderaton of document length. In the calculaton formula of K, dl refers to the length of the document D, and avdl s the average length of all the documents n document collecton, and k and b are emprcal parameters. The parameters b s an adjustment factor, n some extreme cases, f b s set as 0, the document length factor wll not work. Generally, f b s set as 0.75, we wll get a better search effect. Overall, the BM25 model formula actually combnes four factors: the IDF factor, the length factor of document, the word amount of document and the query word frequency; and uses the three free adjustment factor (k, k2 and b) to adjust the weght of varous factors. IV. THE DIFFICULTIES AND SOLUTION A. The Dffcultes There s a dfferent frequency dstrbuton n query words. Qute a number of query words have hardly been quered by the users, whle a small number of query words are repeatedly quered. Ths leads to a problem that numerous relevant query words do not appear n the document, so the generaton probablty of the query word s 0, and ths means that the generaton probablty of the total query s 0. So f a document wth lmted words and content, especally some ndvdual query words do not appear n ths document, t wll lead to a falure to the tradtonal Translaton Retreval model. The problem s called data sparsty of Translaton Retreval model. The query words submtted by the Users may appear n the domans such as page ttle, descrpton nformaton, text, etc. In the calculaton of Network searchng engne, the weghts of the words n the ttle should be greater than 2 () that appear n the text. However, When the tradtonal Translaton Retreval model calculate the correlaton between a document and query, t takes the document as a whole, and not take nto account that dfferent doman gves dfferent weghts. That leads to the precson of Translaton Retreval model droppng and users cannot fnd pages wth whch they are satsfed. B. The Soluton B. Mult-Parameter Data Smoothng Fuson Strategy Ths paper proposes the data smoothng strategy to solve the problem of sparse data. The so-called data smoothng s that takng a part from the dstrbuton probablty value of the words appearng n a document and then assgnng the value to the words whch dd not appear n the document, so all the words have non-zero probablty values and the phenomenon that the whole probablty s zero n the calculaton s avoded. The specfc method s to ntroduce a background probablty to all the words to do data smoothng. The socalled background probablty s to set up a whole language model to document collecton, because of ts relatvely large sze, most of the query has a probablty value. So, for the language model method, f the document collecton contans N documents, t needs to establsh N+ dfferent language models, n whch each document has ts own language model and the data smoothng fuson strategy s establshed on the document collecton language model. f c (2) D C n q, D q ( )( ) (( ) ) PQ D Q D Formula (2) s the formula for calculatng the probablty of document generaton after data smoothng, t can be seen that the probablty of each query word s composed of two parts: model. The frst part The second part ( ) f, q D D s the document language c q s used to make the language C model of document collecton after data smoothng, and the weghts of both can be adjusted by the parameters. The strategy s useful for processng the nvsble words n a query document, especally for the content doman wth only a few words or keywords rarely appeared. The smoothng strategy can ntroduce global nformaton through the overall probablty estmaton, carryng on the revson to the zero probablty and mnmum probablty, whch helps to mprove the language model Translaton Retreval accuracy. DOI 0.503/IJSSST.a ISSN: x onlne, prnt
3 The object treated n the content analyss s the content block of the webpage. As for the representaton of content block, feature vector method s also applcable. Therefore, n calculatng the feature weght, we focus more on ts mportance n a page, but not the statstc mportance n a document collecton. Based on the above analyss, we use formula (3) to calculate the feature weght. W BN j n BN j Where BWe ght BWe ght ( BWe ght BTf ) j j BT j j 2 (3), the weght of the content doman j, s decded by an mportant label of the content doman; BN represents the total number of content domans dvded n webpage; n represents the total number of keywords n webpage; and BTf represents the word frequency that keywords appears n the content doman j. V. EXPERIMENT AND ANALYSIS A. The Optmalty Verfcaton of Language Model Smoothng Strateges Frst s the selecton of data sets, usng 20 Newsgroup data sets and subsets TD2003 and TD2004 of Letor3.0 data sets. In order to test the performance of the Translaton Retreval model proposed n ths paper, the average precson of the man ensemble (MAP) and the normalzed damage cumulatve gan (NDCG) are used as the evaluaton methods. TABLE THE PARAMETERS SELECTED IN DIFFERENT SMOOTHING STRATEGIES Base Lne SVD JM DIR 50 Newsgroup DIS 0.5 JM 0.7 TD 2003 DIR 2000 DIS 0. JM 0.7 TD 2004 DIR 2000 DIS 0. We select the entre document as a sngle doman, and then take the language model parameters n 20 newsgroup data set as a comparatve test, fnally compare the performance between the mult-parameter fuson sequencng and the sngle optmal parameter sortng of the language model n test set. Table shows the parameters selected n the smoothng strategy n data sets 20 newsgroup, TD 2003 and TD j 0 parameters are selected as the optonal parameters for each smoothng strategy. The parameters of the Ds and JM smoothng strateges are [0,], so ther parameter set can be set as {0.,0.2,0.3,,}. As for the Dr smoothng method, the selecton of ts parameters s centred on the parameters of the Letor data set. In ths way, we can get 0 page sortng features from each smoothng strategy of the language Translaton Retreval model. In ths experment, the MAP value s taken as an ndcator for evaluatng the performance of fuson method, and the expermental results of language Translaton Retreval models based on smoothng strategy are ganed, whch s shown n Table 2. TABLE 2 MULTI PARAMETER LANGUAGE MODEL SMOOTHING METHOD FUSION 0-feature SP MP Gan( %) NEWS_Jm NEWS_Dr NEWS_Ds TD3_Jm TD3_Dr TD3_Ds TD4_Jm TD4_Dr TD4_Ds Comparng the expermental results n Table 2, t can be seen that the expermental result of mult-parameter language model smoothng method s superor to the sngle parameter language model smoothng method, especally n TD 2003 data set. The SP method shows the general level of sortng. It also llustrates that there s strong complementarty between the mult-parameter sortng features, whch can greatly mprove the sortng effect. B. Performance Verfcaton of Feature Weghts Comprehensve Sortng n Dfferent Domans of Pages In ths experment, the classfer developed by Bejng Unversty network laboratory s taken as the basc classfer. And the tradtonal precson rato, recall rate and F value are adopted to evaluate the classfcaton results. When a user makes a certan search request, the search system wll always return the relevant documents systematcally to the user. For such search behavour, we can dvde a document collecton nto four dsjont subsets accordng to two dmensons, as s shown n Fgure 3. DOI 0.503/IJSSST.a ISSN: x onlne, prnt
4 On the bass of dvdng the document set nto 4 subsets, we can quanttatvely descrbe the precson rate, recall rate and F value. The followng three formulas are the calculaton methods of these three ndexes. pr ec s on N N M recall N N K Fgure 3 Understandng the two dmensons of document collecton In fgure 3, ) N represents the document whch s n the results of ths search and related to the search request. 2) M represents the document whch s n the results of ths search but not related to the search request. 3) K represents the document whch s out of the results of ths search but related to the search request. 4) L represents the document whch s out of the results of ths search and not related to the search request.. F 2 pr ec s on r ecal l pr ec s on r ecal l We can use the above three formulas to calculate those three ndexes of dfferent categores of the documents n data set. Fgure 4 s a performance comparson between the old classfer and the new one, n whch, the horzontal axs represents the dfferent category numbers, and table 3 shows the correspondng meanng to each category number n Fgure 4. Fgure 4 Comparson of classfcaton results before and after web page cleanng Category Numbers Class Names Category Numbers Class Names TABLE 3 THE CHECK LIST OF CATEGORY NUMBERS Humanty News Meda Busness Economy Entertanment and Lesure IT Educaton Toursm Natural Scence Government Poltcs Socal Scence Health Care Socal Culture Through Fgure 4 we can see that all the classfcaton results of categores get mproved than that before. In addton, when those webpages n tranng set and testng set are selected manually, they are supposed to be the DOI 0.503/IJSSST.a ISSN: x onlne, prnt
5 pages as far as possble wth more text nformaton and less nose nformaton. Therefore, the purfcaton effect of web page n the practcal applcaton s more obvous than the results of ths experment. VI. CONCLUSION Ths paper optmzes the tradtonal Translaton Retreval model based on Network searchng engne and fnds an effectve soluton to the problems of data sparsty and equalty of weghts of dfferent domans n tradtonal model. The mproved model can effectvely promote the precson and recall rate of search engne, whch provdes a method and a theoretcal prncple for the development of search engne. REFERENCES [] Z.J. Yang. Research and applcaton of personalzed query expanson technology of search engne. Natonal Unversty of Defense Technology. Changsha Chna(200) [2] J. Guo, H. Guo, and Z. Wang. An Actvaton Force-based Affnty Measure foranalyzng Complex Networks. Sc. Rep. Vol., No.7,9-2(20) [3] H. Zhao, C.S. Ba and S.Zhu. Automatc keyword extracton algorthm and mplementaton. App. Mech. Mater. Vol.44, (20) [4] X.Q.Ja. Topc nformaton acquston system based on an mproved ant-spoofng topc crawler algorthm. Int. J. Dg. Con. Tech. App. Vol. 6, No.6, (202) [5] Saraswath D, Kathravan A V, Kavtha R. A new enhanced technque for lnk farm detecton. Info. Med. Eng. (PRIME). Vol.2, 74-8(202) [6] Z.M. He, L.H. Wang, G. Zhang. An mproved pagerank algorthm wth ant-lnk spam. J. Chn. Inf. Vol.26,No.5,0-06(202) [7] D.X. Lu, X. Yan, W. Xe. Improved pagerank algorthm based on the resdence tme of the webste. Int. Comput. Appl. Vol.4, No.5, (202) [8] H. Huang, L. Qan and Y. Wang. A SVM-based technque to detect phshng URLs. Inf. Tec. J. Vol., No.7, (202) [9] X.He, Z.X.Nu, J.Y.Sun.The effect of context on user search behavor. J. Int. Vol.3, No.0, 22-25(202) [0] L.Dong, H.W.Xe. Study on optmzaton of rank fuson algorthm n meta search engne. Comput. Appl. Software. Vol. 29, No.0, 88-90(202) [] Parra A J, Forne M J, Rebollo M D. Prvacy protecton of user profles n personalzed nformaton systems. U. Polt. Catal. Vol. 33, No.2, 53-63(203) [2] L. Shou, H. Ba, K. Chen. Supportng prvacy protecton n personalzed Web search. IEEE. T. Knowl. Data. En. Vol.26, No.2, (204) [3] C.Z.L. Research on the personalzed servce of search engne and ts models under Web2.0 envronment. Inform. Sc. Vol.35, No.3,75-79(205) [4] H.W.Wang, W. Wang, M. Yuan. Counterng page rankng spam based on text content and lnk structure analyss. Syst. Eng. Th. Pract. Vol.35, No.2, (205) DOI 0.503/IJSSST.a ISSN: x onlne, prnt
Performance Evaluation of Information Retrieval Systems
Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence
More informationQuery Clustering Using a Hybrid Query Similarity Measure
Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan
More informationNUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS
ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana
More informationUB at GeoCLEF Department of Geography Abstract
UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department
More informationA Fast Content-Based Multimedia Retrieval Technique Using Compressed Data
A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,
More informationBioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.
[Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented
More informationTerm Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task
Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto
More informationCluster Analysis of Electrical Behavior
Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School
More informationAn Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc.
[Type text] [Type text] [Type text] ISSN : 97-735 Volume Issue 9 BoTechnology An Indan Journal FULL PAPER BTAIJ, (9), [333-3] Matlab mult-dmensonal model-based - 3 Chnese football assocaton super league
More informationDetermining the Optimal Bandwidth Based on Multi-criterion Fusion
Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn
More informationA Fast Visual Tracking Algorithm Based on Circle Pixels Matching
A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng
More informationAn Improved Image Segmentation Algorithm Based on the Otsu Method
3th ACIS Internatonal Conference on Software Engneerng, Artfcal Intellgence, Networkng arallel/dstrbuted Computng An Improved Image Segmentaton Algorthm Based on the Otsu Method Mengxng Huang, enjao Yu,
More informationMULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION
MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and
More informationApplication of Clustering Algorithm in Big Data Sample Set Optimization
Applcaton of Clusterng Algorthm n Bg Data Sample Set Optmzaton Yutang Lu 1, Qn Zhang 2 1 Department of Basc Subjects, Henan Insttute of Technology, Xnxang 453002, Chna 2 School of Mathematcs and Informaton
More informationMaximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation
Internatonal Conference on Logstcs Engneerng, Management and Computer Scence (LEMCS 5) Maxmum Varance Combned wth Adaptve Genetc Algorthm for Infrared Image Segmentaton Huxuan Fu College of Automaton Harbn
More informationBIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING
An Improved K-means Algorthm based on Cloud Platform for Data Mnng Bn Xa *, Yan Lu 2. School of nformaton and management scence, Henan Agrcultural Unversty, Zhengzhou, Henan 450002, P.R. Chna 2. College
More informationThe Research of Support Vector Machine in Agricultural Data Classification
The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou
More informationContent Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers
IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth
More informationImproving Web Image Search using Meta Re-rankers
VOLUME-1, ISSUE-V (Aug-Sep 2013) IS NOW AVAILABLE AT: www.dcst.com Improvng Web Image Search usng Meta Re-rankers B.Kavtha 1, N. Suata 2 1 Department of Computer Scence and Engneerng, Chtanya Bharath Insttute
More informationOptimizing Document Scoring for Query Retrieval
Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng
More informationSmoothing Spline ANOVA for variable screening
Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory
More informationTN348: Openlab Module - Colocalization
TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages
More informationDescription of NTU Approach to NTCIR3 Multilingual Information Retrieval
Proceedngs of the Thrd NTCIR Workshop Descrpton of NTU Approach to NTCIR3 Multlngual Informaton Retreval Wen-Cheng Ln and Hsn-Hs Chen Department of Computer Scence and Informaton Engneerng Natonal Tawan
More informationClassifier Selection Based on Data Complexity Measures *
Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.
More informationRemote Sensing Image Retrieval Algorithm based on MapReduce and Characteristic Information
Remote Sensng Image Retreval Algorthm based on MapReduce and Characterstc Informaton Zhang Meng 1, 1 Computer School, Wuhan Unversty Hube, Wuhan430097 Informaton Center, Wuhan Unversty Hube, Wuhan430097
More informationLobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide
Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.
More informationA Binarization Algorithm specialized on Document Images and Photos
A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a
More informationAn Image Fusion Approach Based on Segmentation Region
Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua
More informationA Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures
A Novel Adaptve Descrptor Algorthm for Ternary Pattern Textures Fahuan Hu 1,2, Guopng Lu 1 *, Zengwen Dong 1 1.School of Mechancal & Electrcal Engneerng, Nanchang Unversty, Nanchang, 330031, Chna; 2. School
More informationResearch of Dynamic Access to Cloud Database Based on Improved Pheromone Algorithm
, pp.197-202 http://dx.do.org/10.14257/dta.2016.9.5.20 Research of Dynamc Access to Cloud Database Based on Improved Pheromone Algorthm Yongqang L 1 and Jn Pan 2 1 (Software Technology Vocatonal College,
More informationA Method of Hot Topic Detection in Blogs Using N-gram Model
84 JOURNAL OF SOFTWARE, VOL. 8, NO., JANUARY 203 A Method of Hot Topc Detecton n Blogs Usng N-gram Model Xaodong Wang College of Computer and Informaton Technology, Henan Normal Unversty, Xnxang, Chna
More informationAn Optimal Algorithm for Prufer Codes *
J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,
More informationA Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems
A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty
More informationTsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance
Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for
More informationModular PCA Face Recognition Based on Weighted Average
odern Appled Scence odular PCA Face Recognton Based on Weghted Average Chengmao Han (Correspondng author) Department of athematcs, Lny Normal Unversty Lny 76005, Chna E-mal: hanchengmao@163.com Abstract
More informationThe Comparison of Calibration Method of Binocular Stereo Vision System Ke Zhang a *, Zhao Gao b
3rd Internatonal Conference on Materal, Mechancal and Manufacturng Engneerng (IC3ME 2015) The Comparson of Calbraton Method of Bnocular Stereo Vson System Ke Zhang a *, Zhao Gao b College of Engneerng,
More informationPerformance Assessment and Fault Diagnosis for Hydraulic Pump Based on WPT and SOM
Performance Assessment and Fault Dagnoss for Hydraulc Pump Based on WPT and SOM Be Jkun, Lu Chen and Wang Zl PERFORMANCE ASSESSMENT AND FAULT DIAGNOSIS FOR HYDRAULIC PUMP BASED ON WPT AND SOM. Be Jkun,
More informationBackpropagation: In Search of Performance Parameters
Bacpropagaton: In Search of Performance Parameters ANIL KUMAR ENUMULAPALLY, LINGGUO BU, and KHOSROW KAIKHAH, Ph.D. Computer Scence Department Texas State Unversty-San Marcos San Marcos, TX-78666 USA ae049@txstate.edu,
More informationQuerying by sketch geographical databases. Yu Han 1, a *
4th Internatonal Conference on Sensors, Measurement and Intellgent Materals (ICSMIM 2015) Queryng by sketch geographcal databases Yu Han 1, a * 1 Department of Basc Courses, Shenyang Insttute of Artllery,
More informationA Model Based on Multi-agent for Dynamic Bandwidth Allocation in Networks Guang LU, Jian-Wen QI
216 Jont Internatonal Conference on Artfcal Intellgence and Computer Engneerng (AICE 216) and Internatonal Conference on etwork and Communcaton Securty (CS 216) ISB: 978-1-6595-362-5 A Model Based on Mult-agent
More informationSolving two-person zero-sum game by Matlab
Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by
More informationSequential search. Building Java Programs Chapter 13. Sequential search. Sequential search
Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to
More informationModel Research on the Optimized and Improved Design of Lucene Search Engine Based on Big Data
Model Research on the Optmzed and Improved Desgn of Lucene Search Engne Based on Bg Data Shaoyu Lang 1, Syn Lang 2 1 Guangzhou Huashang Vocatonal College, Guangdong 511300, Chna 2 Guangdong Huashang Techncal
More informationSteps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices
Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between
More informationLearning-Based Top-N Selection Query Evaluation over Relational Databases
Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **
More informationTHE PATH PLANNING ALGORITHM AND SIMULATION FOR MOBILE ROBOT
Journal of Theoretcal and Appled Informaton Technology 30 th Aprl 013. Vol. 50 No.3 005-013 JATIT & LLS. All rghts reserved. ISSN: 199-8645 www.jatt.org E-ISSN: 1817-3195 THE PATH PLANNING ALGORITHM AND
More informationS1 Note. Basis functions.
S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type
More informationCompiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz
Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster
More informationProblem Definitions and Evaluation Criteria for Computational Expensive Optimization
Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty
More informationNetwork Intrusion Detection Based on PSO-SVM
TELKOMNIKA Indonesan Journal of Electrcal Engneerng Vol.1, No., February 014, pp. 150 ~ 1508 DOI: http://dx.do.org/10.11591/telkomnka.v1.386 150 Network Intruson Detecton Based on PSO-SVM Changsheng Xang*
More informationA CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION
A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION 1 FENG YONG, DANG XIAO-WAN, 3 XU HONG-YAN School of Informaton, Laonng Unversty, Shenyang Laonng E-mal: 1 fyxuhy@163.com, dangxaowan@163.com, 3 xuhongyan_lndx@163.com
More informationAn Entropy-Based Approach to Integrated Information Needs Assessment
Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology
More informationAn IPv6-Oriented IDS Framework and Solutions of Two Problems
An IPv6-Orented IDS Framework and Solutons of Two Problems We LI, Zhy FANG, Peng XU and ayang SI,2 School of Computer Scence and Technology, Jln Unversty Changchun, 3002, P.R.Chna 2 Graduate Unversty of
More informationA Generation Model to Unify Topic Relevance and Lexicon-based Sentiment for Opinion Retrieval
A Generaton Model to Unfy Topc Relevance and Lexcon-based Sentment for Opnon Retreval Mn Zhang State key lab of Intellgent Tech.& Sys, Dept. of Computer Scence, Tsnghua Unversty, Bejng, 00084, Chna 86-0-6279-2595
More informationPRÉSENTATIONS DE PROJETS
PRÉSENTATIONS DE PROJETS Rex Onlne (V. Atanasu) What s Rex? Rex s an onlne browser for collectons of wrtten documents [1]. Asde ths core functon t has however many other applcatons that make t nterestng
More informationClustering Algorithm of Similarity Segmentation based on Point Sorting
Internatonal onference on Logstcs Engneerng, Management and omputer Scence (LEMS 2015) lusterng Algorthm of Smlarty Segmentaton based on Pont Sortng Hanbng L, Yan Wang*, Lan Huang, Mngda L, Yng Sun, Hanyuan
More informationChinese Word Segmentation based on the Improved Particle Swarm Optimization Neural Networks
Chnese Word Segmentaton based on the Improved Partcle Swarm Optmzaton Neural Networks Ja He Computatonal Intellgence Laboratory School of Computer Scence and Engneerng, UESTC Chengdu, Chna Department of
More informationKIDS Lab at ImageCLEF 2012 Personal Photo Retrieval
KD Lab at mageclef 2012 Personal Photo Retreval Cha-We Ku, Been-Chan Chen, Guan-Bn Chen, L-J Gaou, Rong-ng Huang, and ao-en Wang Knowledge, nformaton, and Database ystem Laboratory Department of Computer
More informationImage Emotional Semantic Retrieval Based on ELM
Internatonal Conference on Logstcs Engneerng, Management and Computer Scence (LEMCS 2014) Image Emotonal Semantc Retreval Based on ELM Pele Zhang, Mn Yao, Shenzhang La College of computer scence & Technology
More informationAssignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.
Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton
More informationPrivate Information Retrieval (PIR)
2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market
More informationFEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur
FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents
More informationCourse Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms
Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques
More informationAvailable online at Available online at Advanced in Control Engineering and Information Science
Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced
More informationA Robust Method for Estimating the Fundamental Matrix
Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.
More informationRecommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm
Recommended Items Ratng Predcton based on RBF Neural Network Optmzed by PSO Algorthm Chengfang Tan, Cayn Wang, Yuln L and Xx Q Abstract In order to mtgate the data sparsty and cold-start problems of recommendaton
More informationAssociation Rule Mining with Parallel Frequent Pattern Growth Algorithm on Hadoop
Assocaton Rule Mnng wth Parallel Frequent Pattern Growth Algorthm on Hadoop Zhgang Wang 1,2, Guqong Luo 3,*,Yong Hu 1,2, ZhenZhen Wang 1 1 School of Software Engneerng Jnlng Insttute of Technology Nanng,
More informationOptimal Workload-based Weighted Wavelet Synopses
Optmal Workload-based Weghted Wavelet Synopses Yoss Matas School of Computer Scence Tel Avv Unversty Tel Avv 69978, Israel matas@tau.ac.l Danel Urel School of Computer Scence Tel Avv Unversty Tel Avv 69978,
More informationTECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.
TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of
More informationSome material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier
Some materal adapted from Mohamed Youns, UMBC CMSC 611 Spr 2003 course sldes Some materal adapted from Hennessy & Patterson / 2003 Elsever Scence Performance = 1 Executon tme Speedup = Performance (B)
More informationCross-Language Information Retrieval
Feature Artcle: Cross-Language Informaton Retreval 19 Cross-Language Informaton Retreval Jan-Yun Ne 1 Abstract A research group n Unversty of Montreal has worked on the problem of cross-language nformaton
More informationFace Recognition University at Buffalo CSE666 Lecture Slides Resources:
Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural
More informationThe Effect of Similarity Measures on The Quality of Query Clusters
The effect of smlarty measures on the qualty of query clusters. Fu. L., Goh, D.H., Foo, S., & Na, J.C. (2004). Journal of Informaton Scence, 30(5) 396-407 The Effect of Smlarty Measures on The Qualty of
More informationDeep Classification in Large-scale Text Hierarchies
Deep Classfcaton n Large-scale Text Herarches Gu-Rong Xue Dkan Xng Qang Yang 2 Yong Yu Dept. of Computer Scence and Engneerng Shangha Jao-Tong Unversty {grxue, dkxng, yyu}@apex.sjtu.edu.cn 2 Hong Kong
More informationMachine Learning: Algorithms and Applications
14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of
More informationImprovement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration
Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,
More informationA Clustering Algorithm for Key Frame Extraction Based on Density Peak
Journal of Computer and Communcatons, 2018, 6, 118-128 http://www.scrp.org/ournal/cc ISSN Onlne: 2327-5227 ISSN Prnt: 2327-5219 A Clusterng Algorthm for Key Frame Extracton Based on Densty Peak Hong Zhao
More informationParallel Implementation of Classification Algorithms Based on Cloud Computing Environment
TELKOMNIKA, Vol.10, No.5, September 2012, pp. 1087~1092 e-issn: 2087-278X accredted by DGHE (DIKTI), Decree No: 51/Dkt/Kep/2010 1087 Parallel Implementaton of Classfcaton Algorthms Based on Cloud Computng
More informationHelsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)
Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute
More informationProfessional competences training path for an e-commerce major, based on the ISM method
World Transactons on Engneerng and Technology Educaton Vol.14, No.4, 2016 2016 WIETE Professonal competences tranng path for an e-commerce maor, based on the ISM method Ru Wang, Pn Peng, L-gang Lu & Lng
More informationFINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK
FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,
More informationFederated Search of Text Search Engines in Uncooperative Environments
1 Federated Search of Text Search Engnes n Uncooperatve Envronments Luo S Thess Proposal Language Technology Insttute School of Computer Scence Carnege Mellon Unversty ls@cs.cmu.edu Thess Commttee: Jame
More informationCSCI 5417 Information Retrieval Systems Jim Martin!
CSCI 5417 Informaton Retreval Systems Jm Martn! Lecture 11 9/29/2011 Today 9/29 Classfcaton Naïve Bayes classfcaton Ungram LM 1 Where we are... Bascs of ad hoc retreval Indexng Term weghtng/scorng Cosne
More informationFederated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks
Federated Search of Text-Based Dgtal Lbrares n Herarchcal Peer-to-Peer Networks Je Lu School of Computer Scence Carnege Mellon Unversty Pttsburgh, PA 15213 jelu@cs.cmu.edu Jame Callan School of Computer
More informationBOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET
1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School
More informationDiscriminative Dictionary Learning with Pairwise Constraints
Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse
More informationPruning Training Corpus to Speedup Text Classification 1
Prunng Tranng Corpus to Speedup Text Classfcaton Jhong Guan and Shugeng Zhou School of Computer Scence, Wuhan Unversty, Wuhan, 430079, Chna hguan@wtusm.edu.cn State Key Lab of Software Engneerng, Wuhan
More informationCHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION
24 CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION The present chapter proposes an IPSO approach for multprocessor task schedulng problem wth two classfcatons, namely, statc ndependent tasks and
More informationProblem Set 3 Solutions
Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,
More informationX- Chart Using ANOM Approach
ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are
More informationObject-Based Techniques for Image Retrieval
54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the
More informationFeature Selection as an Improving Step for Decision Tree Construction
2009 Internatonal Conference on Machne Learnng and Computng IPCSIT vol.3 (2011) (2011) IACSIT Press, Sngapore Feature Selecton as an Improvng Step for Decson Tree Constructon Mahd Esmael 1, Fazekas Gabor
More informationOn-line Hot Topic Recommendation Using Tolerance Rough Set Based Topic Clustering
JOURNAL OF COMPUTERS, VOL. 5, NO. 4, APRIL 2010 549 On-lne Hot Topc Recommendaton Usng Tolerance Rough Set Based Topc Clusterng Yonghu Wu, Yuxn Dng, Xaolong Wang, Jun Xu Intellgence Computng Research Center
More informationCombining Multiple Resources, Evidence and Criteria for Genomic Information Retrieval
Combnng Multple Resources, Evdence and Crtera for Genomc Informaton Retreval Luo S 1, Je Lu 2 and Jame Callan 2 1 Department of Computer Scence, Purdue Unversty, West Lafayette, IN 47907, USA ls@cs.purdue.edu
More informationLocal Quaternary Patterns and Feature Local Quaternary Patterns
Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents
More informationA PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION
1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute
More informationThe Study of Remote Sensing Image Classification Based on Support Vector Machine
Sensors & Transducers 03 by IFSA http://www.sensorsportal.com The Study of Remote Sensng Image Classfcaton Based on Support Vector Machne, ZHANG Jan-Hua Key Research Insttute of Yellow Rver Cvlzaton and
More informationRobust visual tracking based on Informative random fern
5th Internatonal Conference on Computer Scences and Automaton Engneerng (ICCSAE 205) Robust vsual trackng based on Informatve random fern Hao Dong, a, Ru Wang, b School of Instrumentaton Scence and Opto-electroncs
More informationParallelism for Nested Loops with Non-uniform and Flow Dependences
Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr
More informationHybrid Non-Blind Color Image Watermarking
Hybrd Non-Blnd Color Image Watermarkng Ms C.N.Sujatha 1, Dr. P. Satyanarayana 2 1 Assocate Professor, Dept. of ECE, SNIST, Yamnampet, Ghatkesar Hyderabad-501301, Telangana 2 Professor, Dept. of ECE, AITS,
More information