NTCIR-6 CLIR-J-J Experiments at Yahoo! Japan
|
|
- Maude Wilkerson
- 5 years ago
- Views:
Transcription
1 NTCIR-6 CLIR-J-J Experiments at Yahoo! Japan Sumio FUJITA Yahoo Japan Corporation, Roppongi Hills Mori Tower, , Roppongi, Minato-ku, Tokyo , Japan sufujita AT yahoo-corp DOT jp Abstract This paper escribes NTCIR-6 experiments of the CLIR- J-J task, i.e. Japanese monolingual retrieval subtask, at the Yahoo group, focusing on the parameter optimization in information retrieval (IR). Unlike regression approaches, we optimize parameters completely inepenent from retrieval moels so that the optimize parameter set can illustrate the characteristics of the target test collections. We aopte the genetic algorithm as optimization tools an cross-valiate with 4 test collections, namely NTCIR- 3,4,5, an 6 CLIR-J-J. Keywors: Information retrieval, Test collections, Parameter calibration, Genetic algorithm. 1. Introuction The choice of scoring function is crucial for better search effectiveness. In our past NTCIR-4 CLIR-J-J an NTCIR-5 CLIR-J-J experiments, we ene by choosing BM25TF*IDF runs for official submissions. In fact, the choice was foun to be goo for previous NTCIRs. Our choices of NTCIR-X were base upon the pre-submission experiments using the test collection of NTCIR-(X-1). The scoring functions have some coefficient parameters to be etermine uring the pre-submission experiments. Failing to well calibrate these parameters results in poor effectiveness in the official evaluation even the scoring function is goo enough. Then the calibration of parameters becomes a major work in pre-submission experiments of the NTCIR tasks. Such coefficient parameters make the scoring function aaptable to iverse environments, where requiring the efforts of calibration. Given available four test collections in the NTCIR-6 CLIR task, we try to evaluate if an automatic calibration oes work for such limite number of training ata. In orer to compare with retrospective runs, we aopte the same system as for the NTCIR-5 experiments. Optimization of scoring function is stuie by several regression approaches to information retrieval [3][6]. We consier the calibration process as a simple optimization problem an we aopte the genetic algorithm (GA) to optimize the parameters to given test collections. The process is challenging because: 1) The optimization process is easily fallen into local maximum points an it faile to fin the global maximum. 2) The optimize parameters might be overfitte to the training collection an they o not perform well for other collections. Aopting the GA, the main concern seems to be the secon issue rather than the first one. In this paper, we present our NTCIR-6 CLIR-J-J task experiments, i.e. Japanese monolingual runs, focusing on the possibility of automatic calibration of search parameters. The rest of the paper is organize as follows: Section 2 escribes our experiment environment an retrieval system. Section 3 explains briefly the genetic algorithm an Section 4 reports our official runs an post submission experiments. Section 5 conclues the paper. 2. System Description Our evaluation environment: YLMS system is implemente base on Lemur toolkit 4.0 for inexing system [12], which is being evelope by the Lemur project. 2.1 Inexing language Chasen version Japanese morphological analyzer with IPADIC ictionary version are utilize for Japanese text segmentation an output single wors are use as inexing units. Stop wor lists for newspaper ocumentation are prepare. 2.2 BM25TF*IDF A Retrieval Status Value between a ocument an a query q is calculate as a ot prouct between the ocument term vector an the query term vector, where each term is weighte by TF*IDF [15]. Okapi BM25 TF [13][14] is use. RSV ( q, ) = w( q, w( t q w( = TF( IDF( w( q, = TF( q, IDF(
2 ( k1+ 1) freq( TF( = l k1((1 b) + b ) + freq( avl N IDF( = ( k4 + log ) f ( : ocument or query t : term N : total number of f ( : number of ocuments where t appears freq( l : number of : ocument length of ocuments in the collection occurrences of t in avl : average ocument length in the collection 2.3 Feeback strategies The strategy of feeback from top k ocuments in a pilot search is applie. The Rocchio feeback for TF*IDF is aopte as term extraction metho where the term precision measure to select salient terms is calculate as an element of the centroi vector of pseuo-relevant ocuments. Finally, an upate query vector Q was compute from the original query vector Q an a set of (pseuo) relevant ocument vectors R. 1 Q' = Q + poscoeff R R Instea of using a linear mean to average through feeback ocument vectors, we also use an alpha average, which is introuce by information geometry society to mixture probability istributions [1]. Alpha averaging is also use by an experimental IR system [8][9] to mixture ocument feature vectors. 1 Q' = Q + poscoeff R R 2 1 α 1 α 2 Giving -1 to alpha, this is equivalent to a linear mean. 2.4 Parameter calibration In our past experiences in NTCIR-4 an 5 [4][5], where we compare BM25TF*IDF with KL- Divergence approach. The BM25TF*IDF scoring metho is usually a very goo function to achieve better effectiveness against iverse test collections but it is subject to some coefficients to be calibrate base on training collections, namely k1, b, k4. As for k1 an b of the query sie TF, these are fixe to 1000 an 0 respectively, reucing the query TF function to a simple count of in-text term occurrences. Feeback effectiveness is also subject to the feeback parameters, namely feeback ocument number (fbdoccn, feeback term number (fbtermcn an feeback term coefficient (fbposcoeff). 3. Genetic Algorithm for IR GA is applie to information retrieval systems mainly on relevance feeback contexts as has been note by Lopez-Pujalte et al.[11]. Fan et al. propose to irectly learn ranking functions by applying genetic programming, i.e. an extension of GA [2]. 3.1 Genetic algorithm at a glance From the metaphor of the organic reprouction systems, the genetic algorithm is generally applie to optimization problems in iverse omains, consiering each trial point in the search space as an iniviual. Iniviuals to be examine are generate by applying genetic operations on each chromosome, representative of an iniviual, on which parameters to generate a particular iniviual are encoe. Given a population of iniviuals, genetic operations are applie iteratively in orer to prouce a new generation of the population. For each generation, each iniviual is evaluate by the given fitness function an the process terminates when a targete Parameter Range K B K fbdoccnt (Integer) fbtermcnt (Integer) fbposcoeff Table 1: Parameter range of GA search space fitness value is achieve or the preefine number of generations are processe. On top of the traitional genetic operations, namely selection, crossover an mutation, the istribute genetic algorithm aopts the migration operation: a population is ivie into several islans an GA is performe in each islan where the migration operation moves a certain number of iniviuals to another islan as escribe in Hiroyasu et al. [7]. We use their implementation. 3.2 GA process an operations Unlike many applications of GA to IR in literature, we o not encoe term vectors irectly to chromosomes. Instea we encoe the parameters of our retrieval moel, namely k1, b, k4, fbdocnum, fbtermnum, fbposcoeff. These integer or real numbers are encoe on 45 bits of Boolean strings in the ranges shown in Table 1. We carrie out GA optimization on a 8-noe cluster of Xeon 3.00GHz Dual CPU machines running a Free BSD operating system. The process is ivie into 8 islans an each islan contains 10 iniviuals. Initial
3 T T-01-N T-01-N4 T-01-N Table 2: Effectiveness of CLIR-J-J 1 st stage, the official title only run an corresponing 2 n stage runs D D-02-N D-02-N4 D-02-N Table 3: Effectiveness of CLIR-J-J 1 st stage, the first official escription only run an corresponing 2 n stage runs D D-03-N D-03-N4 D-03-N Table 4: Effectiveness of CLIR-J-J 1 st stage, the secon official escription only run (wor + char-bigram fusion) an corresponing 2 n stage runs T-01 D-02 D-03 bigram K1 b K4 #fb #fb Fb pos FB Inexing Docs Terms coeff Alpha Wors Wors Bigrams/ Wors Table 5: Parameters of CLIR-J-J official runs Fusion 0.5:0.5 populations are ranomly generate an the following operations are sequentially applie in each islan Migration The migration operation moves ranomly chosen iniviuals from an islan to another islan. This operation continues from the islan to the next islan an finally returns to the first islan so that the population in each islan remains the same.
4 Recall-precision curves Sensitivity of (feebackposcoefficien Precision No feeback T-03 KL-Dir No feeback KL-Dir 5 BM25TF*IDF Recall Feeback positive coefficient Figure 1 (Lef: Recall-precision curves of NTCIR-5 T-03, its no feeback baseline, KL-Dir run an its no feeback baseline Figure 2 (Righ: Sensitivity of to feeback positive coefficient (coefficient of positive term weigh in NTCIR-5 CLIR-J-J Sensitivity of to feeback positive coefficient 4 Sensitivity of to feeback Alpha alpha=-1.2 alpha= feeback positive coefficient Figure 3 (Lef: Sensitivity of to feeback positive coefficient (coefficient of positive term weigh in NTCIR-6 CLIR-J-J, T-01 Figure 4 (Righ: Sensitivity of to feeback alpha (coefficient of alpha averaging operation in positive ocument vector aggregation) in NTCIR-6 CLIR-J-J, T-01 Alpha Crossover We use two point crossover operator with the crossover ratio 1.0. It takes couples of iniviuals, chooses ranomly two positions an exchanges the part between positions of each couple Mutation The mutation operation consists of reversing ranomly chosen one bit on each chromosome Evaluation by fitness function Each iniviual is evaluate by the fitness function, i.e. the that the retrieval system using the ecoe parameters achieve against the training collection. As we are trying to maximize the against test collections, this fitness function is naturally aopte. However there may be other strategies, e.g. combining several measures in orer to avoi overfitting to the training collection Elitism Given the number of elite, the elite group of the previous generation an the same number of the best fitte iniviuals in the current generation are merge. The best fitte iniviuals as the same number as elites in this group are save in the current generation Selection As recommene in [7], we use tournament selection with the tournament size 4, i.e. select ranomly 4 iniviuals from the islan an take the best fitte iniviual to the next generation an repeat this until the next generation is complete. 4. CLIR Experiments The etails of the NTCIR-6 CLIR task are escribe in Kishia et al.[10].
5 Collections Test collections N3 N4 N5 N6 N N N N3,N4,N N Table 6: of GA parameter optimization of CLIR-J-J runs, the training collection an the test collection are the same in shaowe cells collectio n Parameters K1 b K4 fdoccnt fbtermcnt fbposcoeff N N N N3,N4,N N Avg Offic. run Table 7: Coefficient parameter sets optimize by 4 test collections of CLIR-J-J Population 80 Chromosome length 45 or 24 Number of Islans 8 Mutation rate 1/45 or 1/24 Population / Islan 10 Tournament size 4 Elite/Islan 5 Migration rate 0.5 Crossover rate 1.0 Migration interval 5 Table 8: Control parameters of GA process an GA operations Collections Test collections N3 N4 N5 N6 N N N N3,N4,N N Table 9: of GA parameter optimization of CLIR-J-J runs without feeback, the training collection an the test collection are the same in shaowe cells collectio n Parameters K1 b K4 fdoccnt fbtermcnt fbposcoeff N N/A N/A N N/A N/A N N/A N/A N3,N4,N N/A N/A N N/A N/A Avg N/A N/A Table 10: Coefficient parameter sets optimize by 4 test collections of CLIR-J-J without feeback parameters for the moels are calibrate by using 4.1 CLIR official runs for J-J SLIR NTCIR-5 CLIR-J-J test collections. The exactly We submitte a title only run, two escription only same parameter sets are applie for corresponing runs for the 1 st stage of Japanese monolingual 2 n stage runs. retrieval subtask. All the official runs are using the T-01 is our title only run an YLMS-J- TF*IDF metho with BM25 TF an a Rocchio J-D-02 is our escription run. D-03 is the feeback with a top k ocuments strategy. The fusion of D-02 an other escription only
6 run using a bigram inexing language. Table 2,3 an 4 show effectiveness of these three official runs in 1 st stage an their corresponing 2 n stage runs. Table 5 shows parameters use in the official runs. 4.2 Sensitivity of feeback effectiveness to test collection characteristics We emphasize that the top k ocument feeback strategy was exceptionally successful with the NTCIR-5 CLIR-J-J test collection as shown in Figure 1. Figure 2 illustrates the situation where the higher the fbposcoeff parameter, the better the results obtaine. Because our official submission T-01 is calibrate to NTCIR-5 CLIR-J-J, the feeback positive coefficient is ajuste to 0.8, a comparatively higher value. Figure 3 shows the sensitivity of to feeback positive coefficient (coefficient of positive term weigh in NTCIR-6 CLIR-J-J (T-01). The value arrives at its best when feeback poscoeff is 0.6. The situation is quite ifferent from NTCIR-5 CLIR-J-J. From Figure 4, we can see that giving any value other than -1.0 to feeback alpha causes the egraation. For the following experiments, we fixe the feeback alpha parameter to GA optimization of NTCIR-6 runs We trie to re-optimize the parameters of the title runs of NTCIR-6 CLIR-J-J by applying a istribute GA on a 8 noe cluster servers. NTCIR-3,4,5 an 6 CLIR-J-J test collections (N3,N4,N5,N6) are use for cross-valiation. Each of these test collections an combination of N3,N4 an N5 are use for training. Table 6 shows the of runs where coefficient parameters are optimize by the training collection. Table 7 shows optimize parameter set against each training collection. In our GA optimize run to N5, the feebackposcoeff parameter is ajuste to , whereas in GA optimize to N3,N4,N5. This is consistent with our observation that the impact of feeback is comparatively large in N5. Unlike regression approaches, where optimize coefficients are ifficult to be interprete or genetic programming approaches, which generate increibly complicate formulae, optimize parameters are interprete by referring to our experiences of past empirical experiments. See the column of N5, using N5 itself for training, the of as high as is achieve whereas using N4 for training, the of 616 is even much worse than our best official title run in NTCIR- 5, which achieve the of In effect, this NTCIR-5 official run was manually calibrate with N4 [5]. GA optimization processes seem to be fining approximately the maximum fitness against the training collection. The problem is rather overfitting to the training collection. Human calibrators o much work than simply looking for the best fitness setting in the search space but they carefully o avoi overfit problems. 4.4 GA optimization without feeback In orer to illustrate ifferent characteristics of test collections, we carrie out optimizations by GA without feeback as well. As shown in Table 9, overfitting to the training set seems to be alleviate. Overfitting is mainly cause by the feeback parameters like the number of terms/ocuments to use for feeback or feeback coefficient, which are presumably collection epenent. 4.5 Optimization costs As can be seen from Table 6, GA optimize N6 runs are very close to our official title only run, which is set manually. The GA optimization using N5 took 36 hours for 20 generation reiteration operate on a 8 noe cluster of Xeon 3.00GHz Dual CPU machines. These harware environments are not at all cheap even toay. But it presumably pays for efforts to optimize these parameters manually from scratch. We stoppe the iteration at the 20 th generation because no rastic improvement is observe even at the 100th generation. 5. Conclusions We reporte our NTCIR-6 evaluation experiments of the CLIR-J-J task. We aopte a TF*IDF approach an applie GA optimization of coefficient parameters of the retrieval moel. GA optimize runs achieve almost the same effectiveness as human calibrate runs, although GA runs overfitte to the training collections. Automatic calibration also illustrates ifferent characteristics of the test collections especially such as feeback effectiveness. These optimize parameters can be utilize to help experts in system calibration. Acknowlegments We thank NTCIR management/secretariat members an CLIR task organizers for proviing us of the ata. References [1] Amari, S. Alpha-integration of Stochastic Eviences. In Proceeings of 2 n International Symposium on Information Geometry an its Applications, Tokyo, [2] Fan, W., Goron, M.D. an Pathak, P., A generic ranking function iscovery framework by genetic programming for information retrieval, Information Processing an Management, vol. 40, issue 4, , July [3] Fuhr, N. an Pfeifer, U. Probabilistic information retrieval as combination of abstraction, inuctive learning an probabilistic assumptions. ACM Transactions on Information Systems, 12(1), , 1994.
7 [4] Fujita, S. Revisiting the Document Length Hypotheses --NTCIR-4 CLIR an Patent Experiments at Patolis. In Working notes of the fourth NTCIR workshop meeting, , [5] Fujita, S. A Decae after TREC-4 NTCIR-5 CLIR- J-J Experiments at Yahoo!Japan. In Proceeings of NTCIR Workshop 5 Meeting, , [6] Gey, F. C. Inferring probability of relevance using the metho of logistic regression. In The proceeings of seventeenth annual international ACM SIGIR conference on research an evelopment in information retrieval, , [7] Hiroyasu, T., Yoshia, J., Sano, M., Fukunaga, T. an Kataura, T. Distribute Genetic Algorithms ga2k (ver. 1.3) Specification, Intelligent System Design Laboratory, Doshisha University, Sep [8] Kanazawa, T., Takasu, A., an Aachi, J. The effects of the Relevance-base Superimposition Moel for Effective Information Retrieval. IEICE Transactions, Vol. E83-D, No. 12, pp , Dec [9] Kano, N., Kanazawa, T., an Miyazawa A. Retrieval of Web Resource Using a Fusion of Ontology-base an Content-base Retrieval with the RS Vector Space Moel on a Portal for Japanese Universities an Acaemic Institutes. In Proceeings of the 39th Hawaii International Conference on System Sciences (HICSS 39), [10] Kishia, K., Chen, K.-H., Lee, S., Kuriyama, K., Kano, N., Chen, H. -H. an Myaeng., S. -H. Overview of CLIR task at the sixth NTCIR workshop. In Proceeings of the Sixth NTCIR Workshop, [11] Lopez-Pujalte, C., Guerrero-Bote, V.P. an Moya- Anegon, F. e, Genetic algorithm in relevance feeback: a secon test an new contributions, Information Processing an Management, vol 39, issue 5, , September [12] Ogilvie,O. an Callan,J. Experiments Using the Lemur Toolkit, In NIST Special Publication : The Tenth Text REtrieval Conference (TREC 2001), , [13] Robertson, S. an Walker S. Some Simple Effective Approximations to the 2-Poisson Moel for Probabilistic Weighte Retrieval. In Proceeings of the 1994 ACM SIGIR Conference on Research an Development in Information Retrieval, , [14] Robertson, S. E., Walker, S., Jones, S., M.Hancock- Beaulieu, M., an Gatfor M. Okapi at TREC-3. In NIST Special Publication :Overview of the Thir Text REtrieval Conference (TREC-3), , [15] Salton, G. Automatic Text Processing The Transformation, Analysis, an Retrieval of Information by Computer, Aison-Wesley publishing company, Massachusetts,1988.
APPLYING GENETIC ALGORITHM IN QUERY IMPROVEMENT PROBLEM. Abdelmgeid A. Aly
International Journal "Information Technologies an Knowlege" Vol. / 2007 309 [Project MINERVAEUROPE] Project MINERVAEUROPE: Ministerial Network for Valorising Activities in igitalisation -
More informationMultilevel Linear Dimensionality Reduction using Hypergraphs for Data Analysis
Multilevel Linear Dimensionality Reuction using Hypergraphs for Data Analysis Haw-ren Fang Department of Computer Science an Engineering University of Minnesota; Minneapolis, MN 55455 hrfang@csumneu ABSTRACT
More informationClassification and clustering methods for documents. by probabilistic latent semantic indexing model
A Short Course at amang University aipei, aiwan, R.O.C., March 7-9, 2006 Classification an clustering methos for ocuments by probabilistic latent semantic inexing moel Shigeichi Hirasawa Department of
More informationYet Another Parallel Hypothesis Search for Inverse Entailment Hiroyuki Nishiyama and Hayato Ohwada Faculty of Sci. and Tech. Tokyo University of Scien
Yet Another Parallel Hypothesis Search for Inverse Entailment Hiroyuki Nishiyama an Hayato Ohwaa Faculty of Sci. an Tech. Tokyo University of Science, 2641 Yamazaki, Noa-shi, CHIBA, 278-8510, Japan hiroyuki@rs.noa.tus.ac.jp,
More informationEvolutionary Optimisation Methods for Template Based Image Registration
Evolutionary Optimisation Methos for Template Base Image Registration Lukasz A Machowski, Tshilizi Marwala School of Electrical an Information Engineering University of Witwatersran, Johannesburg, South
More informationFast Fractal Image Compression using PSO Based Optimization Techniques
Fast Fractal Compression using PSO Base Optimization Techniques A.Krishnamoorthy Visiting faculty Department Of ECE University College of Engineering panruti rishpci89@gmail.com S.Buvaneswari Visiting
More informationFrequency Domain Parameter Estimation of a Synchronous Generator Using Bi-objective Genetic Algorithms
Proceeings of the 7th WSEAS International Conference on Simulation, Moelling an Optimization, Beijing, China, September 15-17, 2007 429 Frequenc Domain Parameter Estimation of a Snchronous Generator Using
More informationRandom Clustering for Multiple Sampling Units to Speed Up Run-time Sample Generation
DEIM Forum 2018 I4-4 Abstract Ranom Clustering for Multiple Sampling Units to Spee Up Run-time Sample Generation uzuru OKAJIMA an Koichi MARUAMA NEC Solution Innovators, Lt. 1-18-7 Shinkiba, Koto-ku, Tokyo,
More informationLearning Polynomial Functions. by Feature Construction
I Proceeings of the Eighth International Workshop on Machine Learning Chicago, Illinois, June 27-29 1991 Learning Polynomial Functions by Feature Construction Richar S. Sutton GTE Laboratories Incorporate
More informationNon-homogeneous Generalization in Privacy Preserving Data Publishing
Non-homogeneous Generalization in Privacy Preserving Data Publishing W. K. Wong, Nios Mamoulis an Davi W. Cheung Department of Computer Science, The University of Hong Kong Pofulam Roa, Hong Kong {wwong2,nios,cheung}@cs.hu.h
More informationClassifying Facial Expression with Radial Basis Function Networks, using Gradient Descent and K-means
Classifying Facial Expression with Raial Basis Function Networks, using Graient Descent an K-means Neil Allrin Department of Computer Science University of California, San Diego La Jolla, CA 9237 nallrin@cs.ucs.eu
More informationImproving Web Search Ranking by Incorporating User Behavior Information
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Microsoft Research eugeneag@microsoft.com Eric Brill Microsoft Research brill@microsoft.com Susan Dumais Microsoft
More informationModifying ROC Curves to Incorporate Predicted Probabilities
Moifying ROC Curves to Incorporate Preicte Probabilities Cèsar Ferri DSIC, Universitat Politècnica e València Peter Flach Department of Computer Science, University of Bristol José Hernánez-Orallo DSIC,
More informationSoftware Reliability Modeling and Cost Estimation Incorporating Testing-Effort and Efficiency
Software Reliability Moeling an Cost Estimation Incorporating esting-effort an Efficiency Chin-Yu Huang, Jung-Hua Lo, Sy-Yen Kuo, an Michael R. Lyu -+ Department of Electrical Engineering Computer Science
More informationfiltering LETTER An Improved Neighbor Selection Algorithm in Collaborative Taek-Hun KIM a), Student Member and Sung-Bong YANG b), Nonmember
107 IEICE TRANS INF & SYST, VOLE88 D, NO5 MAY 005 LETTER An Improve Neighbor Selection Algorithm in Collaborative Filtering Taek-Hun KIM a), Stuent Member an Sung-Bong YANG b), Nonmember SUMMARY Nowaays,
More informationCoupling the User Interfaces of a Multiuser Program
Coupling the User Interfaces of a Multiuser Program PRASUN DEWAN University of North Carolina at Chapel Hill RAJIV CHOUDHARY Intel Corporation We have evelope a new moel for coupling the user-interfaces
More informationImage Segmentation using K-means clustering and Thresholding
Image Segmentation using Kmeans clustering an Thresholing Preeti Panwar 1, Girhar Gopal 2, Rakesh Kumar 3 1M.Tech Stuent, Department of Computer Science & Applications, Kurukshetra University, Kurukshetra,
More informationShort-term prediction of photovoltaic power based on GWPA - BP neural network model
Short-term preiction of photovoltaic power base on GWPA - BP neural networ moel Jian Di an Shanshan Meng School of orth China Electric Power University, Baoing. China Abstract In recent years, ue to China's
More informationTHE BAYESIAN RECEIVER OPERATING CHARACTERISTIC CURVE AN EFFECTIVE APPROACH TO EVALUATE THE IDS PERFORMANCE
БСУ Международна конференция - 2 THE BAYESIAN RECEIVER OPERATING CHARACTERISTIC CURVE AN EFFECTIVE APPROACH TO EVALUATE THE IDS PERFORMANCE Evgeniya Nikolova, Veselina Jecheva Burgas Free University Abstract:
More informationParticle Swarm Optimization Based on Smoothing Approach for Solving a Class of Bi-Level Multiobjective Programming Problem
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 17, No 3 Sofia 017 Print ISSN: 1311-970; Online ISSN: 1314-4081 DOI: 10.1515/cait-017-0030 Particle Swarm Optimization Base
More informationGeneralized Edge Coloring for Channel Assignment in Wireless Networks
Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu Institute of Information Science Acaemia Sinica Taipei, Taiwan Da-wei Wang Jan-Jan Wu Institute of Information Science
More informationA Practical Passage-based Approach for Chinese Document Retrieval
A Practical Passage-based Approach for Chinese Document Retrieval Szu-Yuan Chi 1, Chung-Li Hsiao 1, Lee-Feng Chien 1,2 1. Department of Information Management, National Taiwan University 2. Institute of
More informationClustering using Particle Swarm Optimization. Nuria Gómez Blas, Octavio López Tolic
24 International Journal Information Theories an Applications, Vol. 23, Number 1, (c) 2016 Clustering using Particle Swarm Optimization Nuria Gómez Blas, Octavio López Tolic Abstract: Data clustering has
More informationSemantic internet search engine with focus on Arabic language
Semantic internet search engine with focus on Arabic language Naima Tazit, El Hossin Bouyakhf, Soua Sabri Faculty of Sciences Agal, Rabat, Morocco Abellah Yousfi Faculty of Economy Souissi, Rabat, Morocco
More informationInterior Permanent Magnet Synchronous Motor (IPMSM) Adaptive Genetic Parameter Estimation
Interior Permanent Magnet Synchronous Motor (IPMSM) Aaptive Genetic Parameter Estimation Java Rezaie, Mehi Gholami, Reza Firouzi, Tohi Alizaeh, Karim Salashoor Abstract - Interior permanent magnet synchronous
More informationGeneralized Edge Coloring for Channel Assignment in Wireless Networks
TR-IIS-05-021 Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu, Pangfeng Liu, Da-Wei Wang, Jan-Jan Wu December 2005 Technical Report No. TR-IIS-05-021 http://www.iis.sinica.eu.tw/lib/techreport/tr2005/tr05.html
More informationDocument Structure Analysis in Associative Patent Retrieval
Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,
More informationCloud Search Service Product Introduction. Issue 01 Date HUAWEI TECHNOLOGIES CO., LTD.
1.3.15 Issue 01 Date 2018-11-21 HUAWEI TECHNOLOGIES CO., LTD. Copyright Huawei Technologies Co., Lt. 2019. All rights reserve. No part of this ocument may be reprouce or transmitte in any form or by any
More informationSolution Representation for Job Shop Scheduling Problems in Ant Colony Optimisation
Solution Representation for Job Shop Scheuling Problems in Ant Colony Optimisation James Montgomery, Carole Faya 2, an Sana Petrovic 2 Faculty of Information & Communication Technologies, Swinburne University
More informationOn the Role of Multiply Sectioned Bayesian Networks to Cooperative Multiagent Systems
On the Role of Multiply Sectione Bayesian Networks to Cooperative Multiagent Systems Y. Xiang University of Guelph, Canaa, yxiang@cis.uoguelph.ca V. Lesser University of Massachusetts at Amherst, USA,
More informationLoop Scheduling and Partitions for Hiding Memory Latencies
Loop Scheuling an Partitions for Hiing Memory Latencies Fei Chen Ewin Hsing-Mean Sha Dept. of Computer Science an Engineering University of Notre Dame Notre Dame, IN 46556 Email: fchen,esha @cse.n.eu Tel:
More informationCluster Center Initialization Method for K-means Algorithm Over Data Sets with Two Clusters
Available online at www.scienceirect.com Proceia Engineering 4 (011 ) 34 38 011 International Conference on Avances in Engineering Cluster Center Initialization Metho for K-means Algorithm Over Data Sets
More informationAn Algorithm for Building an Enterprise Network Topology Using Widespread Data Sources
An Algorithm for Builing an Enterprise Network Topology Using Wiesprea Data Sources Anton Anreev, Iurii Bogoiavlenskii Petrozavosk State University Petrozavosk, Russia {anreev, ybgv}@cs.petrsu.ru Abstract
More informationShift-map Image Registration
Shift-map Image Registration Svärm, Linus; Stranmark, Petter Unpublishe: 2010-01-01 Link to publication Citation for publishe version (APA): Svärm, L., & Stranmark, P. (2010). Shift-map Image Registration.
More informationTerm Frequency Normalisation Tuning for BM25 and DFR Models
Term Frequency Normalisation Tuning for BM25 and DFR Models Ben He and Iadh Ounis Department of Computing Science University of Glasgow United Kingdom Abstract. The term frequency normalisation parameter
More informationJournal of Cybernetics and Informatics. Slovak Society for Cybernetics and Informatics
Journal of Cybernetics an Informatics publishe by Slovak Society for Cybernetics an Informatics Volume 13, 1 http://www.sski.sk/casopis/inex.php (home page) ISSN: 1336-4774 Journal of Cybernetics an Informatics
More informationR 2 D 2 at NTCIR-4 Web Retrieval Task
R 2 D 2 at NTCIR-4 Web Retrieval Task Teruhito Kanazawa KYA group Corporation 5 29 7 Koishikawa, Bunkyo-ku, Tokyo 112 0002, Japan tkana@kyagroup.com Tomonari Masada University of Tokyo 7 3 1 Hongo, Bunkyo-ku,
More informationStudy of Network Optimization Method Based on ACL
Available online at www.scienceirect.com Proceia Engineering 5 (20) 3959 3963 Avance in Control Engineering an Information Science Stuy of Network Optimization Metho Base on ACL Liu Zhian * Department
More informationA Revised Simplex Search Procedure for Stochastic Simulation Response Surface Optimization
272 INFORMS Journal on Computing 0899-1499 100 1204-0272 $05.00 Vol. 12, No. 4, Fall 2000 2000 INFORMS A Revise Simplex Search Proceure for Stochastic Simulation Response Surface Optimization DAVID G.
More informationOnline Appendix to: Generalizing Database Forensics
Online Appenix to: Generalizing Database Forensics KYRIACOS E. PAVLOU an RICHARD T. SNODGRASS, University of Arizona This appenix presents a step-by-step iscussion of the forensic analysis protocol that
More informationStudy on Merging Multiple Results from Information Retrieval System
Proceedings of the Third NTCIR Workshop Study on Merging Multiple Results from Information Retrieval System Hiromi itoh OZAKU, Masao UTIAMA, Hitoshi ISAHARA Communications Research Laboratory 2-2-2 Hikaridai,
More informationExploring Context with Deep Structured models for Semantic Segmentation
1 Exploring Context with Deep Structure moels for Semantic Segmentation Guosheng Lin, Chunhua Shen, Anton van en Hengel, Ian Rei between an image patch an a large backgroun image region. Explicitly moeling
More informationComparison of Methods for Increasing the Performance of a DUA Computation
Comparison of Methos for Increasing the Performance of a DUA Computation Michael Behrisch, Daniel Krajzewicz, Peter Wagner an Yun-Pang Wang Institute of Transportation Systems, German Aerospace Center,
More informationMORA: a Movement-Based Routing Algorithm for Vehicle Ad Hoc Networks
: a Movement-Base Routing Algorithm for Vehicle A Hoc Networks Fabrizio Granelli, Senior Member, Giulia Boato, Member, an Dzmitry Kliazovich, Stuent Member Abstract Recent interest in car-to-car communications
More informationAdjacency Matrix Based Full-Text Indexing Models
1000-9825/2002/13(10)1933-10 2002 Journal of Software Vol.13, No.10 Ajacency Matrix Base Full-Text Inexing Moels ZHOU Shui-geng 1, HU Yun-fa 2, GUAN Ji-hong 3 1 (Department of Computer Science an Engineering,
More informationOverview of Patent Retrieval Task at NTCIR-5
Overview of Patent Retrieval Task at NTCIR-5 Atsushi Fujii, Makoto Iwayama, Noriko Kando Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550, Japan
More informationA shortest path algorithm in multimodal networks: a case study with time varying costs
A shortest path algorithm in multimoal networks: a case stuy with time varying costs Daniela Ambrosino*, Anna Sciomachen* * Department of Economics an Quantitative Methos (DIEM), University of Genoa Via
More informationQueueing Model and Optimization of Packet Dropping in Real-Time Wireless Sensor Networks
Queueing Moel an Optimization of Packet Dropping in Real-Time Wireless Sensor Networks Marc Aoun, Antonios Argyriou, Philips Research, Einhoven, 66AE, The Netherlans Department of Computer an Communication
More informationindexing and query processing. The inverted le was constructed for the retrieval target collection which contains full texts of two years' Japanese pa
Term Distillation in Patent Retrieval Hideo Itoh Hiroko Mano Yasushi Ogawa Software R&D Group, RICOH Co., Ltd. 1-1-17 Koishikawa, Bunkyo-ku, Tokyo 112-0002, JAPAN fhideo,mano,yogawag@src.ricoh.co.jp Abstract
More informationFeature Extraction and Rule Classification Algorithm of Digital Mammography based on Rough Set Theory
Feature Extraction an Rule Classification Algorithm of Digital Mammography base on Rough Set Theory Aboul Ella Hassanien Jafar M. H. Ali. Kuwait University, Faculty of Aministrative Science, Quantitative
More informationElement-based
Proceeings of te 12t NTCIR Conference on Evaluation of Information Access Tecnologies June 7-10 2016 Tokyo Japan Element-base Retrieval@MobileClick-2 Atsusi eyaki Tokyo Institute of Tecnology keyaki@lsc.cs.titec.ac.jp
More informationA Comparative Evaluation of Iris and Ocular Recognition Methods on Challenging Ocular Images
A Comparative Evaluation of Iris an Ocular Recognition Methos on Challenging Ocular Images Vishnu Naresh Boeti Carnegie Mellon University Pittsburgh, PA 523 naresh@cmu.eu Jonathon M Smereka Carnegie Mellon
More informationRough Set Approach for Classification of Breast Cancer Mammogram Images
Rough Set Approach for Classification of Breast Cancer Mammogram Images Aboul Ella Hassanien Jafar M. H. Ali. Kuwait University, Faculty of Aministrative Science, Quantitative Methos an Information Systems
More informationAnalysis of half-space range search using the k-d search skip list. Here we analyse the expected time for half-space
Analysis of half-space range search using the k- search skip list Mario A. Lopez Brafor G. Nickerson y 1 Abstract We analyse the average cost of half-space range reporting for the k- search skip list.
More informationA New Search Algorithm for Solving Symmetric Traveling Salesman Problem Based on Gravity
Worl Applie Sciences Journal 16 (10): 1387-1392, 2012 ISSN 1818-4952 IDOSI Publications, 2012 A New Search Algorithm for Solving Symmetric Traveling Salesman Problem Base on Gravity Aliasghar Rahmani Hosseinabai,
More informationTransient analysis of wave propagation in 3D soil by using the scaled boundary finite element method
Southern Cross University epublications@scu 23r Australasian Conference on the Mechanics of Structures an Materials 214 Transient analysis of wave propagation in 3D soil by using the scale bounary finite
More informationLearning to Rank: A New Technology for Text Processing
TFANT 07 Tokyo Univ. March 2, 2007 Learning to Rank: A New Technology for Text Processing Hang Li Microsoft Research Asia Talk Outline What is Learning to Rank? Ranking SVM Definition Search Ranking SVM
More informationUninformed search methods
CS 1571 Introuction to AI Lecture 4 Uninforme search methos Milos Hauskrecht milos@cs.pitt.eu 539 Sennott Square Announcements Homework assignment 1 is out Due on Thursay, September 11, 014 before the
More informationLearning convex bodies is hard
Learning convex boies is har Navin Goyal Microsoft Research Inia navingo@microsoftcom Luis Raemacher Georgia Tech lraemac@ccgatecheu Abstract We show that learning a convex boy in R, given ranom samples
More informationSkyline Community Search in Multi-valued Networks
Syline Community Search in Multi-value Networs Rong-Hua Li Beijing Institute of Technology Beijing, China lironghuascut@gmail.com Jeffrey Xu Yu Chinese University of Hong Kong Hong Kong, China yu@se.cuh.eu.h
More informationSURVIVABLE IP OVER WDM: GUARANTEEEING MINIMUM NETWORK BANDWIDTH
SURVIVABLE IP OVER WDM: GUARANTEEEING MINIMUM NETWORK BANDWIDTH Galen H Sasaki Dept Elec Engg, U Hawaii 2540 Dole Street Honolul HI 96822 USA Ching-Fong Su Fuitsu Laboratories of America 595 Lawrence Expressway
More informationHuman recognition based on ear shape images using PCA-Wavelets and different classification methods
Meical Devices an Diagnostic Engineering Review Article ISSN: 2399-6854 Human recognition base on ear shape images using PCA-Wavelets an ifferent classification methos Ali Mahmou Mayya1* an Mariam Mohamma
More informationLecture 1 September 4, 2013
CS 84r: Incentives an Information in Networks Fall 013 Prof. Yaron Singer Lecture 1 September 4, 013 Scribe: Bo Waggoner 1 Overview In this course we will try to evelop a mathematical unerstaning for the
More informationProbabilistic Medium Access Control for. Full-Duplex Networks with Half-Duplex Clients
Probabilistic Meium Access Control for 1 Full-Duplex Networks with Half-Duplex Clients arxiv:1608.08729v1 [cs.ni] 31 Aug 2016 Shih-Ying Chen, Ting-Feng Huang, Kate Ching-Ju Lin, Member, IEEE, Y.-W. Peter
More informationMaking Retrieval Faster Through Document Clustering
R E S E A R C H R E P O R T I D I A P Making Retrieval Faster Through Document Clustering David Grangier 1 Alessandro Vinciarelli 2 IDIAP RR 04-02 January 23, 2004 D a l l e M o l l e I n s t i t u t e
More informationOpen Access Adaptive Image Enhancement Algorithm with Complex Background
Sen Orers for Reprints to reprints@benthamscience.ae 594 The Open Cybernetics & Systemics Journal, 205, 9, 594-600 Open Access Aaptive Image Enhancement Algorithm with Complex Bacgroun Zhang Pai * epartment
More informationCharacterizing Decoding Robustness under Parametric Channel Uncertainty
Characterizing Decoing Robustness uner Parametric Channel Uncertainty Jay D. Wierer, Wahee U. Bajwa, Nigel Boston, an Robert D. Nowak Abstract This paper characterizes the robustness of ecoing uner parametric
More informationNew Version of Davies-Bouldin Index for Clustering Validation Based on Cylindrical Distance
New Version of Davies-Boulin Inex for lustering Valiation Base on ylinrical Distance Juan arlos Roas Thomas Faculta e Informática Universia omplutense e Mari Mari, España correoroas@gmail.com Abstract
More informationChalmers Publication Library
Chalmers Publication Library All-to-all Broacast for Vehicular Networks Base on Coe Slotte ALOHA This ocument has been ownloae from Chalmers Publication Library (CPL). It is the author s version of a work
More informationWilliam S. Law. Erik K. Antonsson. Engineering Design Research Laboratory. California Institute of Technology. Abstract
Optimization Methos for Calculating Design Imprecision y William S. Law Eri K. Antonsson Engineering Design Research Laboratory Division of Engineering an Applie Science California Institute of Technology
More informationA Modified Immune Network Optimization Algorithm
IAENG International Journal of Computer Science, 4:4, IJCS_4_4_3 A Moifie Immune Network Optimization Algorithm Lu Hong, Joarer Kamruzzaman Abstract This stuy proposes a moifie artificial immune network
More informationAn Energy Efficient Routing for Wireless Sensor Networks: Hierarchical Approach
An Energy Efficient Routing for Wireless Sensor Networks: Hierarchical Approach Nishi Sharma, Vanna Verma Abstract Wireless sensor networks (WSNs) is one of the emerging fiel of research in recent era
More informationExploring Context with Deep Structured models for Semantic Segmentation
APPEARING IN IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, APRIL 2017. 1 Exploring Context with Deep Structure moels for Semantic Segmentation Guosheng Lin, Chunhua Shen, Anton van en
More informationNon-Uniform Sensor Deployment in Mobile Wireless Sensor Networks
01 01 01 01 01 00 01 01 Non-Uniform Sensor Deployment in Mobile Wireless Sensor Networks Mihaela Carei, Yinying Yang, an Jie Wu Department of Computer Science an Engineering Floria Atlantic University
More informationAdvanced method of NC programming for 5-axis machining
Available online at www.scienceirect.com Proceia CIRP (0 ) 0 07 5 th CIRP Conference on High Performance Cutting 0 Avance metho of NC programming for 5-axis machining Sergej N. Grigoriev a, A.A. Kutin
More informationFuzzy Learning Variable Admittance Control for Human-Robot Cooperation
Fuzzy Learning ariable Amittance Control for Human-Robot Cooperation Fotios Dimeas an Nikos Aspragathos Abstract This paper presents a metho for variable amittance control in human-robot cooperation tasks,
More informationResearch Article Hydrological Cycle Algorithm for Continuous Optimization Problems
Hinawi Journal of Optimization Volume 217, Article ID 382842, 25 pages https://oi.org/1.1155/217/382842 Research Article Hyrological Cycle Algorithm for Continuous Optimization Problems Ahma Weyan, Jacqueline
More informationOptimal Oblivious Path Selection on the Mesh
Optimal Oblivious Path Selection on the Mesh Costas Busch Malik Magon-Ismail Jing Xi Department of Computer Science Rensselaer Polytechnic Institute Troy, NY 280, USA {buschc,magon,xij2}@cs.rpi.eu Abstract
More informationThreshold Based Data Aggregation Algorithm To Detect Rainfall Induced Landslides
Threshol Base Data Aggregation Algorithm To Detect Rainfall Inuce Lanslies Maneesha V. Ramesh P. V. Ushakumari Department of Computer Science Department of Mathematics Amrita School of Engineering Amrita
More informationMessage Transport With The User Datagram Protocol
Message Transport With The User Datagram Protocol User Datagram Protocol (UDP) Use During startup For VoIP an some vieo applications Accounts for less than 10% of Internet traffic Blocke by some ISPs Computer
More informationDATABASE MERGING STRATEGY BASED ON LOGISTIC REGRESSION
DATABASE MERGING STRATEGY BASED ON LOGISTIC REGRESSION Anne Le Calvé, Jacques Savoy Institut interfacultaire d'informatique Université de Neuchâtel (Switzerland) e-mail: {Anne.Lecalve, Jacques.Savoy}@seco.unine.ch
More informationNET Institute*
NET Institute* www.netinst.org Working Paper #08-24 October 2008 Computer Virus Propagation in a Network Organization: The Interplay between Social an Technological Networks Hsing Kenny Cheng an Hong Guo
More informationA Neural Network Model Based on Graph Matching and Annealing :Application to Hand-Written Digits Recognition
ITERATIOAL JOURAL OF MATHEMATICS AD COMPUTERS I SIMULATIO A eural etwork Moel Base on Graph Matching an Annealing :Application to Han-Written Digits Recognition Kyunghee Lee Abstract We present a neural
More informationDivide-and-Conquer Algorithms
Supplment to A Practical Guie to Data Structures an Algorithms Using Java Divie-an-Conquer Algorithms Sally A Golman an Kenneth J Golman Hanout Divie-an-conquer algorithms use the following three phases:
More informationMEASURING THE PERFORMANCE OF SIMILARITY PROPAGATION IN AN SEMANTIC SEARCH ENGINE
ISSN: 9-6956 (ONLINE) DOI: 10.1917/isc.013.0096 ICTACT JOURNAL ON SOFT COMPUTING, OCTOBER 013, VOLUME: 04, ISSUE: 01 MEASURING THE PERFORMANCE OF SIMILARITY PROPAGATION IN AN SEMANTIC SEARCH ENGINE S.
More informationA PSO Optimized Layered Approach for Parametric Clustering on Weather Dataset
Vol.3, Issue.1, Jan-Feb. 013 pp-504-508 ISSN: 49-6645 A PSO Optimize Layere Approach for Parametric Clustering on Weather Dataset Shikha Verma, 1 Kiran Jyoti 1 Stuent, Guru Nanak Dev Engineering College
More informationExperion PKS R500 Migration Planning Guide
Experion PKS R500 Migration Planning Guie EPDOC-XX70-en-500E May 2018 Release 500 Document Release Issue Date EPDOC-XX70- en-500e 500 0 May 2018 Disclaimer This ocument contains Honeywell proprietary information.
More informationComputer Organization
Computer Organization Douglas Comer Computer Science Department Purue University 250 N. University Street West Lafayette, IN 47907-2066 http://www.cs.purue.eu/people/comer Copyright 2006. All rights reserve.
More informationUsing Vector and Raster-Based Techniques in Categorical Map Generalization
Thir ICA Workshop on Progress in Automate Map Generalization, Ottawa, 12-14 August 1999 1 Using Vector an Raster-Base Techniques in Categorical Map Generalization Beat Peter an Robert Weibel Department
More informationInformation Granulation for the Design of Granular Information Retrieval Systems
Association for Information Systems AIS Electronic Library (AISeL) ICIS 008 Proceeings International Conference on Information Systems (ICIS) 008 Information Granulation for the Design of Granular Information
More informationImage compression predicated on recurrent iterated function systems
2n International Conference on Mathematics & Statistics 16-19 June, 2008, Athens, Greece Image compression preicate on recurrent iterate function systems Chol-Hui Yun *, Metzler W. a an Barski M. a * Faculty
More informationRMIT University at TREC 2006: Terabyte Track
RMIT University at TREC 2006: Terabyte Track Steven Garcia Falk Scholer Nicholas Lester Milad Shokouhi School of Computer Science and IT RMIT University, GPO Box 2476V Melbourne 3001, Australia 1 Introduction
More informationResearch Article REALFLOW: Reliable Real-Time Flooding-Based Routing Protocol for Industrial Wireless Sensor Networks
Hinawi Publishing Corporation International Journal of Distribute Sensor Networks Volume 2014, Article ID 936379, 17 pages http://x.oi.org/10.1155/2014/936379 Research Article REALFLOW: Reliable Real-Time
More informationAll-to-all Broadcast for Vehicular Networks Based on Coded Slotted ALOHA
Preprint, August 5, 2018. 1 All-to-all Broacast for Vehicular Networks Base on Coe Slotte ALOHA Mikhail Ivanov, Frerik Brännström, Alexanre Graell i Amat, an Petar Popovski Department of Signals an Systems,
More informationSupporting Fully Adaptive Routing in InfiniBand Networks
XIV JORNADAS DE PARALELISMO - LEGANES, SEPTIEMBRE 200 1 Supporting Fully Aaptive Routing in InfiniBan Networks J.C. Martínez, J. Flich, A. Robles, P. López an J. Duato Resumen InfiniBan is a new stanar
More informationProvisioning Virtualized Cloud Services in IP/MPLS-over-EON Networks
Provisioning Virtualize Clou Services in IP/MPLS-over-EON Networks Pan Yi an Byrav Ramamurthy Department of Computer Science an Engineering, University of Nebraska-Lincoln Lincoln, Nebraska 68588 USA Email:
More informationKinematic Analysis of a Family of 3R Manipulators
Kinematic Analysis of a Family of R Manipulators Maher Baili, Philippe Wenger an Damien Chablat Institut e Recherche en Communications et Cybernétique e Nantes, UMR C.N.R.S. 6597 1, rue e la Noë, BP 92101,
More informationAnimated Surface Pasting
Animate Surface Pasting Clara Tsang an Stephen Mann Computing Science Department University of Waterloo 200 University Ave W. Waterloo, Ontario Canaa N2L 3G1 e-mail: clftsang@cgl.uwaterloo.ca, smann@cgl.uwaterloo.ca
More informationBlind Data Classification using Hyper-Dimensional Convex Polytopes
Blin Data Classification using Hyper-Dimensional Convex Polytopes Brent T. McBrie an Gilbert L. Peterson Department of Electrical an Computer Engineering Air Force Institute of Technology 9 Hobson Way
More informationAnalyzing Document Retrievability in Patent Retrieval Settings
Analyzing Document Retrievability in Patent Retrieval Settings Shariq Bashir and Andreas Rauber Institute of Software Technology and Interactive Systems, Vienna University of Technology, Austria {bashir,rauber}@ifs.tuwien.ac.at
More information