Research Article. August 2017
|
|
- Kory Edwards
- 5 years ago
- Views:
Transcription
1 International Journals of Advanced Research in Computer Science and Software Engineering ISSN: X (Volume-7, Issue-8) a Research Article August 2017 English-Marathi Cross Language Information Retrieval System Kalyani Lokhande Research Scholar, Department of Computer Engineering, SSBT s COET, Jalgaon, Maharashtra, India DOI: /ijarcsse/V7I8/0127 Dhanashree Tayade Assistant Professor, Department of Computer Engineering, SSBT s COET, Jalgaon, Maharashtra, India Abstract Nowadays, diff erent types of content in diff erent languages are available on World Wide Web and their usage is increasing rapidly. Cross Language Information Retrieval (CLIR) deals with retrieval of documents in another language than the language of the requested query. Various researchers worked on Cross Language Information Retrieval systems for Indian languages using diff erent translation approaches. There is still CLIR system to be developed which allow user to retrieve Marathi documents when English query is given. In the proposed English to Marathi Cross Language Information Retrieval system, translation is based on query translation approach. The proposed system retrieves Marathi documents depending on matching terms in query. The performance of the proposed system is improved by query pre-processing and query expansion using WordNet. Keywords Cross Language Information Retrieval; WordNet; FIRE 2010 Dataset; Query Pre-processing; Query Expansion. I. INTRODUCTION Information Retrieval (IR) systems since developed has opened doors of information across the world. Initially IR systems were mostly developed for English language. Language has been barrier for users. Introduction of Cross Language Information Retrieval systems has opened new paradigm for efficient and easy retrieval of information. The evaluation of Cross Language Information Retrieval for Indian languages started recently. After the highly successful CLEF and NTCIR campaigns, since 2008, the Forum for Information Retrieval Evaluation [FIRE], modeled focused specifically on Indian languages and English. Document collections have been developed for some Indian Languages namely, Hindi, Bangla, Marathi and English. The number of Internet users increasing day to day accessing any kind of required information at any time. Information Retrieval (IR) mainly refers to a process that the finding required information. With 100 million internet users, India is at third place globally in usage of internet. Though the internet has shrunken the geographical boundaries, the language diversification is a big barrier to get full benefit of the internet. Hence there is a need to develop a technique like Cross Language Information Retrieval which is used to retrieve documents in a language other than the user used to specify the query. Therefore, Internet is no longer monolingual and non-english contents are accessed rapidly. In this aspect, Information retrieved is mainly considered in text form [1]. A. Cross Language Information Retrieval System for Indian Languages The number of Web users accessing the Internet become increasing day to day because people can access any kind of required information at any time. Information Retrieval (IR) mainly refers to a process that the user can find required information should be easily accessible and digestible. Though the network shrank the globe, the language diversification is a great barrier to attain full benefit of the digital life. Hence there is a need to develop a technique like Cross Language Text Retrieval which is used to retrieve text documents in a language other than the user used to specify the query. Therefore Internet is now becoming multilingual and non-english contents are accessed rapidly [1]. CLIR is an area of IR which is interesting and having much scope of development. The aim of CLIR is to allow users to input queries in one language and retrieve documents in one or more other languages. The resulting documents can then be translated into the language used for the query to allow the user to get the sense about the information retrieved For example, user wants information about Marathi Abhangas if the query fired in English (like Abhang) the retrieved documents shows poor result compared when query fired in Marathi language itself. Cross Lingual Information Retrieval provides the solution for language barrier, by allowing the user to ask the query in the local language and then to get the documents in another language (English) and vice versa [2]. In India plenty of people are speaking diversified local languages. In world population, very few people know English language and can right the queries in English in a right way. The language diversification is a great barrier to get the benefits of the web. Cross Lingual Information Retrieval provides the solution for that barrier, by allowing the user to ask the query in the local language and then to get the documents in another language [3]. II. RELATED WORK D.Mandal et al., in [4], proposed a Bengali and Hindi to English CLIR system. The cross language task includes the retrieval of English documents on giving queries in two languages Bengali and Hindi. Automatic query generation All Rights Reserved Page 112
2 and Machine Translation approach is used. Other language-specific resources included a Hindi stemmer, a Bengali morphological analyser, and a set of 200 Hindi and 273 Bengali stop words. Lucene framework was used for retrieval, indexing, stemming and scoring of the documents. The experiment pointed out the necessity of good linguistic resources, mainly a rich bilingual lexicon. Saurabh Varshney and Jyoti Bajpai, in [5], proposed an algorithm for improving the performance of the English- Hindi CLIR system. Use of all possible combination of Hindi translated query using transliteration of English query terms and choosing the best query among them for retrieval of documents is done. The experimental results show that the proposed approach helps to resolve ambiguity in English-Hindi CLIR system and gives more relevant information as compared English monolingual. The pre and post query expansion helps to improving the performance of the English- Hindi CLIR system and based upon past experiences the proposed approach retrieves more relevant information. Eva Katta and Anuj Arora, in [6], proposed an improved searching and ranking approach for English- Hindi based CLIR system. The approach focuses on proper searching and ranking of documents by using algorithms such as Particle Swarm Optimization and Nave Bayes. The n-gram matching of query terms to that of document is done. The proposed system retrieves more relevant documents as compared to other systems. Chaware and Srikantha Rao, in [7], proposed Domain Specific Information Retrieval in Multilingual environment by considering a shopping mall as the domain. The user can pose the query in Hindi, Marathi or Guajarati and the back end data is stored in English. Using Character-by-Character mapping, the query is converted to English. When there was an exact matching exist, keywords were converted to local language by doing Character-to-ASCII mapping. The efficiency of the Information retrieval depends on the minimum number of keys to be mapped. Rajendra et al., in [8], proposed a Cross Language Information Retrieval (CLIR) approach using corpus driven query suggestion. Use of corpus statistics to gather a clue on selecting the right query terms when the translation of a specific query is missing or incorrect is done. The top ranked queries are used to perform query formulation. Using the re-formulated weighted query, CLIR is performed. Comparison is done between the results of CLIR system with Google translation of user queries and CLIR system with the proposed query suggestion approach. English and Tamil corpus of FIRE 2012 dataset used for analysis. The experimental results show that the proposed approach performs well while dealing with incorrect translation of the queries. As per study of previous work, it is found that for CLIR systems are developed for many of the Indian languages. English-Marathi CLIR will be new invention in CLIR field of Indian languages. Among translation approaches, Query translation approach has been adapted by most of the authors. For query translation, bilingual dictionary and machine translation systems are widely used being easier approach. However, new approaches like Corpus and Ontology proves promising if it used for specific domain. Experiment setup is mostly on standard dataset. In this paper, propose English-Marathi CLIR system which makes use of basic techniques of query preprocessing. The performance will be improved by using query expansion technique using WordNet. The experiment will be performed on FIRE 2010 dataset in Marathi. The performance of the system will be evaluated using precision and recall. III. PROPOSED WORK The framework of proposed approach is described in Fig.1.The proposed framework shows the working of English-Marathi CLIR system in which user gives their query in English language and the relevant documents are retrieved in Marathi language. The documents will be used from FIRE 2010 Dataset of Marathi news corpus. The steps involved in flow of the proposed system is given below: Fig.1 Proposed System Architecture All Rights Reserved Page 113
3 1. Firstly user enters the query in English language. 2. By using pre-query expansion, the English query by using various stools like English WordNet is expanded. 3. Query translation translate refined English query to Marathi query with query translation approach by using Google translator. 4. This expanded Marathi query is fired to retrieve Marathi relevant documents based on similarity between query and documents. 1) Query Pre-processing: In the process of pre-processing, tokenization is applied to query to separate each term in a query. It usually involves separation and segmentation of words as well as isolating them from punctuation marks or other related formatting and mark-up symbols. Query pre-processing is first and basic step which is done whenever query is given. The input query is English. When inputted the query each word in query is separated and keywords are identified by process of tokenization. 2) Query Expansion: Query Expansion (QE) is a widely used technique that attempts to increase the likelihood of a match between the query and relevant documents by adding semantically related terms (called expansion terms) to a user s query [4]. Here, in this work, WordNet is used as source of query expansion. First English WordNet is used to expand user query. Here, WordNet not only used to get the synonyms for each keyword but also to make combinations of these synonyms to get similar queries. 3) Query Translation: In proposed system, for query translation among diff erent types of translation approaches Google translator is used. From these set of queries, each query is submitted for translation one by one. The query translation is done via Google Translator. The original query along with expanded query words which stored in array of strings are translated using Google Translator. 4) Information retrieval system : The information retrieval process involves searching and ranking of documents. Documents are searched for matching terms. The documents retrieved which satisfy matching condition. The translated Marathi query is fired to database to get the desired documents. The documents are retrieved based matching terms using Boolean logic. IV. RESULT AND DISCUSSION Building a successful and sustainable CLIR requires suitable methodologies and metrics for assessing its effectiveness. Among the major evaluation measures that are mostly used for determining the eff ectiveness CLIR system include precision and recall. The Equation 4.1 and Equation 4.2 are used to calculate precision and recall of the system respectively. Precision: It refers to the ratio of the relevant retrieved documents and the set of retrieved documents. It is defined by the amount of relevant documents retrieved compared to all documents retrieved. Where α= relevant documents, β= retrieved documents Precision = (4.1) Recall: It refers to the ratio of the relevant retrieved documents and the set of relevant documents. It is defined by the amount of relevant documents retrieved compared to all documents relevant. Where α= relevant documents, β= retrieved documents Recall = (4.2) The proposed system performed experiments on FIRE 2010 datasets. The system uses WordNet to generate similar multiple queries. For analysing the performance of the system the queries from diff erent TEN topics are chosen. Here rather than title field or the description field generalized domain of queries is used because the structure of document in dataset is not formulated in terms such field. Before firing the query the documents are analyzed and distinguished in term of relevance to the query topic. The query topics are given to English-Marathi CLIR system, the precision and recall is calculated for each one. All Rights Reserved Page 114
4 Fig. 2 Comparative Precision-Recall Chart Fig. 3 Comparative Precision-Recall Graph From the analysis of results of experiment, it is depicted as the precision and recall values lies between 0.5 to 1. Hence the objective the proposed system is achieved which is to maintain good precision recall. When compared to most of the CLIR system in Indian language, the precision and recall values are better one. V. CONCLUSION Cross-language IR made new prototypes in searching documents through various languages across the world. CLIR for Indian languages has gained importance in last decade and there is scope to explore much in this field. Crosslingual IR provides new paradigms in searching documents through varieties of languages across the world. CLIR for Indian languages has gained importance in last decade and there is scope to explore much in this field. The proposed English-Marathi CLIR uses WordNet to improve results of retrieval. The proposed system uses WordNet not just to expand queries but to create similar multiple queries. The results show that the performance of the system is good with maintaining promising precision and recall. The performance of any CLIR depends mainly on linguistic resources used for the development of the system. In future, the linguistic resources such as standard translation dictionaries and other tools can be developed. The research work can be extended by adding these linguistic resources in Marathi to proposed architecture for better performance. ACKNOWLEDGMENT It is a great pleasure and moment of immense satisfaction for me to express my profound gratitude to Head of Department of Computer engineering, Dr. Prof. G. K. Patnaik for his valuable advice and guidance. I acknowledge all the staff members of the department of Computer Engineering, SSBT's College of Engineering & Technology for their help and suggestions during various phases of this project work. I would also like to extent my gratitude towards my family members and friends who always encouraged me in every deed. REFERENCES [1] P. Iswarya, Dr. V. Radha, International Journal Of Engineering Research And Applications, "Cross Language Text Retrieval: A Review" (IJERA) ISSN: Vol.2, Issue 5, September- October 2012, pp [2] L. Kalyani and T. Dhanashree, English-marathi cross language information retrieval system based on query translation approach," International Journal of Computer Science & Communication Networks, vol. 6, pp , All Rights Reserved Page 115
5 [3] A. Nagarathinam and S. Saraswathi, State of art: Cross lingual information retrieval system for Indian languages," International Journal of Computer Applications, vol. 35, no. 13, [4] D. Mandal, S. Dandapat, M. Gupta, P. Banerjee, and S. Sarkar, Bengali and hindi to English cross-language text retrieval under limited resources." in CLEF (Working Notes), [5] S. Varshney, J. Bajpai, Improving performance of English-Hindi Cross Language Information Retrieval using Transliteration of query terms 2013 IEEE International Conference in MOOC, Innovation and Technology in Education (MITE), /13/2013 IEEE. [6] E. Katta and A. Arora, \An improved approach to english-hindi based cross language information retrieval system," in Eighth International Conference on Contemporary Computing (IC3), IEEE, 2015, pp [7] S. Chaware and S. Rao, \Ontology approach for cross-language information retrieval," Published in International Journal of Computer Technology and Application, vol. 2, pp. 379{384, [8] P. Rajendra and S. Sudeshna, \Cross-language information retrieval with incorrect query translations," Polibits,. 54, pp. 33{42, 2016.FLEXChip Signal Processor (MC68175/D), Motorola, All Rights Reserved Page 116
DCU at FIRE 2013: Cross-Language!ndian News Story Search
DCU at FIRE 2013: Cross-Language!ndian News Story Search Piyush Arora, Jennifer Foster, and Gareth J. F. Jones CNGL Centre for Global Intelligent Content School of Computing, Dublin City University Glasnevin,
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationOntology Approach for Cross-Language Information Retrieval
Ontology Approach for Cross-Language Information Retrieval S.M.Chaware 1, MPSTME, Mumbai Srikantha Rao 2, TIMSCDR, Mumbai { 1 smchaware@gmail.com, 2 dr_s_rao@yahoo.com} Abstract Information retrieval is
More informationImplementation of Smart Question Answering System using IoT and Cognitive Computing
Implementation of Smart Question Answering System using IoT and Cognitive Computing Omkar Anandrao Salgar, Sumedh Belsare, Sonali Hire, Mayuri Patil omkarsalgar@gmail.com, sumedhbelsare@gmail.com, hiresoni278@gmail.com,
More informationDeveloping Focused Crawlers for Genre Specific Search Engines
Developing Focused Crawlers for Genre Specific Search Engines Nikhil Priyatam Thesis Advisor: Prof. Vasudeva Varma IIIT Hyderabad July 7, 2014 Examples of Genre Specific Search Engines MedlinePlus Naukri.com
More informationEnhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm
Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm K.Parimala, Assistant Professor, MCA Department, NMS.S.Vellaichamy Nadar College, Madurai, Dr.V.Palanisamy,
More informationCROSS LANGUAGE INFORMATION ACCESS IN TELUGU
CROSS LANGUAGE INFORMATION ACCESS IN TELUGU by Vasudeva Varma, Aditya Mogadala Mogadala, V. Srikanth Reddy, Ram Bhupal Reddy in Siliconandhrconference (Global Internet forum for Telugu) Report No: IIIT/TR/2011/-1
More informationSQL Generation and PL/SQL Execution from Natural Language Processing
SQL Generation and PL/SQL Execution from Natural Language Processing Swapnil Kanhe Pramod Bodke Vaibhav Udawant Akshay Chikhale Abstract In this paper we proposes a method of executing query with the databases
More informationA Document Graph Based Query Focused Multi- Document Summarizer
A Document Graph Based Query Focused Multi- Document Summarizer By Sibabrata Paladhi and Dr. Sivaji Bandyopadhyay Department of Computer Science and Engineering Jadavpur University Jadavpur, Kolkata India
More informationMobile Application with Optical Character Recognition Using Neural Network
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 1, January 2015,
More informationAN UNSUPERVISED APPROACH TO DEVELOP IR SYSTEM: THE CASE OF URDU
AN UNSUPERVISED APPROACH TO DEVELOP IR SYSTEM: THE CASE OF URDU ABSTRACT Mohd. Shahid Husain Integral University, Lucknow Web Search Engines are best gifts to the mankind by Information and Communication
More informationDesign and Implementation of Search Engine Using Vector Space Model for Personalized Search
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 1, January 2014,
More informationData Analytics Framework and Methodology for WhatsApp Chats
Data Analytics Framework and Methodology for WhatsApp Chats Transliteration of Thanglish and Short WhatsApp Messages P. Sudhandradevi Department of Computer Applications Bharathiar University Coimbatore,
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationSelf Introduction. Presentation Outline. College of Information 3/31/2016. Multilingual Information Access to Digital Collections
College of Information Multilingual Information Access to Digital Collections Jiangping Chen Http://coolt.lis.unt.edu/ Jiangping.chen@unt.edu April 20, 2016 Self Introduction An Associate Professor at
More informationMIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion
MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion Sara Lana-Serrano 1,3, Julio Villena-Román 2,3, José C. González-Cristóbal 1,3 1 Universidad Politécnica de Madrid 2 Universidad
More informationMultilingual Information Retrieval
Proposal for Tutorial on Multilingual Information Retrieval Proposed by Arjun Atreya V Shehzaad Dhuliawala ShivaKarthik S Swapnil Chaudhari Under the direction of Prof. Pushpak Bhattacharyya Department
More informationEnhanced Retrieval of Web Pages using Improved Page Rank Algorithm
Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm Rekha Jain 1, Sulochana Nathawat 2, Dr. G.N. Purohit 3 1 Department of Computer Science, Banasthali University, Jaipur, Rajasthan ABSTRACT
More informationAutomatically Generating Queries for Prior Art Search
Automatically Generating Queries for Prior Art Search Erik Graf, Leif Azzopardi, Keith van Rijsbergen University of Glasgow {graf,leif,keith}@dcs.gla.ac.uk Abstract This report outlines our participation
More informationWeb Query Translation with Representative Synonyms in Cross Language Information Retrieval
Web Query Translation with Representative Synonyms in Cross Language Information Retrieval August 25, 2005 Bo-Young Kang, Qing Li, Yun Jin, Sung Hyon Myaeng Information Retrieval and Natural Language Processing
More informationInfluence of Word Normalization on Text Classification
Influence of Word Normalization on Text Classification Michal Toman a, Roman Tesar a and Karel Jezek a a University of West Bohemia, Faculty of Applied Sciences, Plzen, Czech Republic In this paper we
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationDocument Structure Analysis in Associative Patent Retrieval
Document Structure Analysis in Associative Patent Retrieval Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550,
More informationChapter 27 Introduction to Information Retrieval and Web Search
Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval
More informationOverview of FIRE 2011 Prasenjit Majumder on behalf of the FIRE team
Overview of FIRE 2011 Prasenjit Majumder on behalf of the FIRE team Overview of FIRE 2011 p. 1/21 Overview Background Tasks Data Results Problems and prospects People Overview of FIRE 2011 p. 2/21 Background
More informationCLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval
DCU @ CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval Walid Magdy, Johannes Leveling, Gareth J.F. Jones Centre for Next Generation Localization School of Computing Dublin City University,
More informationLearning to find transliteration on the Web
Learning to find transliteration on the Web Chien-Cheng Wu Department of Computer Science National Tsing Hua University 101 Kuang Fu Road, Hsin chu, Taiwan d9283228@cs.nthu.edu.tw Jason S. Chang Department
More informationINTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) CONTEXT SENSITIVE TEXT SUMMARIZATION USING HIERARCHICAL CLUSTERING ALGORITHM
INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & 6367(Print), ISSN 0976 6375(Online) Volume 3, Issue 1, January- June (2012), TECHNOLOGY (IJCET) IAEME ISSN 0976 6367(Print) ISSN 0976 6375(Online) Volume
More informationACCESSING DATABASE USING NLP
ACCESSING DATABASE USING NLP Pooja A.Dhomne 1, Sheetal R.Gajbhiye 2, Tejaswini S.Warambhe 3, Vaishali B.Bhagat 4 1 Student, Computer Science and Engineering, SRMCEW, Maharashtra, India, poojadhomne@yahoo.com
More informationAutomatic Bangla Corpus Creation
Automatic Bangla Corpus Creation Asif Iqbal Sarkar, Dewan Shahriar Hossain Pavel and Mumit Khan BRAC University, Dhaka, Bangladesh asif@bracuniversity.net, pavel@bracuniversity.net, mumit@bracuniversity.net
More informationTHE WEB SEARCH ENGINE
International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR) Vol.1, Issue 2 Dec 2011 54-60 TJPRC Pvt. Ltd., THE WEB SEARCH ENGINE Mr.G. HANUMANTHA RAO hanu.abc@gmail.com
More informationEnhanced Web Log Based Recommendation by Personalized Retrieval
Enhanced Web Log Based Recommendation by Personalized Retrieval Xueping Peng FACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY UNIVERSITY OF TECHNOLOGY, SYDNEY A thesis submitted for the degree of Doctor
More informationPatent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF
Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF Julia Jürgens, Sebastian Kastner, Christa Womser-Hacker, and Thomas Mandl University of Hildesheim,
More informationMEASUREMENT OF SEMANTIC SIMILARITY BETWEEN WORDS: A SURVEY
MEASUREMENT OF SEMANTIC SIMILARITY BETWEEN WORDS: A SURVEY Ankush Maind 1, Prof. Anil Deorankar 2 and Dr. Prashant Chatur 3 1 M.Tech. Scholar, Department of Computer Science and Engineering, Government
More informationR. R. Badre Associate Professor Department of Computer Engineering MIT Academy of Engineering, Pune, Maharashtra, India
Volume 7, Issue 4, April 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Web Service Ranking
More informationImprovement of Web Search Results using Genetic Algorithm on Word Sense Disambiguation
Volume 3, No.5, May 24 International Journal of Advances in Computer Science and Technology Pooja Bassin et al., International Journal of Advances in Computer Science and Technology, 3(5), May 24, 33-336
More informationEmpirical Analysis of Single and Multi Document Summarization using Clustering Algorithms
Engineering, Technology & Applied Science Research Vol. 8, No. 1, 2018, 2562-2567 2562 Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms Mrunal S. Bewoor Department
More informationEffect of log-based Query Term Expansion on Retrieval Effectiveness in Patent Searching
Effect of log-based Query Term Expansion on Retrieval Effectiveness in Patent Searching Wolfgang Tannebaum, Parvaz Madabi and Andreas Rauber Institute of Software Technology and Interactive Systems, Vienna
More informationA Frequent Max Substring Technique for. Thai Text Indexing. School of Information Technology. Todsanai Chumwatana
School of Information Technology A Frequent Max Substring Technique for Thai Text Indexing Todsanai Chumwatana This thesis is presented for the Degree of Doctor of Philosophy of Murdoch University May
More informationA Novel Image Retrieval Method Using Segmentation and Color Moments
A Novel Image Retrieval Method Using Segmentation and Color Moments T.V. Saikrishna 1, Dr.A.Yesubabu 2, Dr.A.Anandarao 3, T.Sudha Rani 4 1 Assoc. Professor, Computer Science Department, QIS College of
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK POSSIBLE USE OF OCR FOR RECOGNITION OF KORKU LANGUAGE TEXT ARVIND ARJUNRAO TAYADE,
More informationAn Efficient Methodology for Image Rich Information Retrieval
An Efficient Methodology for Image Rich Information Retrieval 56 Ashwini Jaid, 2 Komal Savant, 3 Sonali Varma, 4 Pushpa Jat, 5 Prof. Sushama Shinde,2,3,4 Computer Department, Siddhant College of Engineering,
More informationMultilingual Image Search from a user s perspective
Multilingual Image Search from a user s perspective Julio Gonzalo, Paul Clough, Jussi Karlgren QUAERO-Image CLEF workshop, 16/09/08 Finding is a matter of two fast stupid smart slow great potential for
More information1.
* 390/0/2 : 389/07/20 : 2 25-8223 ( ) 2 25-823 ( ) ISC SCOPUS L ISA http://jist.irandoc.ac.ir 390 22-97 - :. aminnezarat@gmail.com mosavit@pnu.ac.ir : ( ).... 00.. : 390... " ". ( )...2 2. 3. 4 Google..
More informationISSN: [Sugumar * et al., 7(4): April, 2018] Impact Factor: 5.164
IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY IMPROVED PERFORMANCE OF STEMMING USING ENHANCED PORTER STEMMER ALGORITHM FOR INFORMATION RETRIEVAL Ramalingam Sugumar & 2 M.Rama
More informationTansu Alpcan C. Bauckhage S. Agarwal
1 / 16 C. Bauckhage S. Agarwal Deutsche Telekom Laboratories GBR 2007 2 / 16 Outline 3 / 16 Overview A novel expert peering system for community-based information exchange A graph-based scheme consisting
More informationA Retrieval Mechanism for Multi-versioned Digital Collection Using TAG
A Retrieval Mechanism for Multi-versioned Digital Collection Using Dr M Thangaraj #1, V Gayathri *2 # Associate Professor, Department of Computer Science, Madurai Kamaraj University, Madurai, TN, India
More informationAdaptive Model of Personalized Searches using Query Expansion and Ant Colony Optimization in the Digital Library
International Conference on Information Systems for Business Competitiveness (ICISBC 2013) 90 Adaptive Model of Personalized Searches using and Ant Colony Optimization in the Digital Library Wahyu Sulistiyo
More informationGRID SIMULATION FOR DYNAMIC LOAD BALANCING
GRID SIMULATION FOR DYNAMIC LOAD BALANCING Kapil B. Morey 1, Prof. A. S. Kapse 2, Prof. Y. B. Jadhao 3 1 Research Scholar, Computer Engineering Dept., Padm. Dr. V. B. Kolte College of Engineering, Malkapur,
More informationSense-based Information Retrieval System by using Jaccard Coefficient Based WSD Algorithm
ISBN 978-93-84468-0-0 Proceedings of 015 International Conference on Future Computational Technologies (ICFCT'015 Singapore, March 9-30, 015, pp. 197-03 Sense-based Information Retrieval System by using
More informationOverview of iclef 2008: search log analysis for Multilingual Image Retrieval
Overview of iclef 2008: search log analysis for Multilingual Image Retrieval Julio Gonzalo Paul Clough Jussi Karlgren UNED U. Sheffield SICS Spain United Kingdom Sweden julio@lsi.uned.es p.d.clough@sheffield.ac.uk
More informationCACAO PROJECT AT THE 2009 TASK
CACAO PROJECT AT THE TEL@CLEF 2009 TASK Alessio Bosca, Luca Dini Celi s.r.l. - 10131 Torino - C. Moncalieri, 21 alessio.bosca, dini@celi.it Abstract This paper presents the participation of the CACAO prototype
More informationText Document Clustering Using DPM with Concept and Feature Analysis
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 10, October 2013,
More informationDepartment of Electronic Engineering FINAL YEAR PROJECT REPORT
Department of Electronic Engineering FINAL YEAR PROJECT REPORT BEngCE-2007/08-HCS-HCS-03-BECE Natural Language Understanding for Query in Web Search 1 Student Name: Sit Wing Sum Student ID: Supervisor:
More informationCANDIDATE LINK GENERATION USING SEMANTIC PHEROMONE SWARM
CANDIDATE LINK GENERATION USING SEMANTIC PHEROMONE SWARM Ms.Susan Geethu.D.K 1, Ms. R.Subha 2, Dr.S.Palaniswami 3 1, 2 Assistant Professor 1,2 Department of Computer Science and Engineering, Sri Krishna
More informationKeywords APSE: Advanced Preferred Search Engine, Google Android Platform, Search Engine, Click-through data, Location and Content Concepts.
Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Advanced Preferred
More informationWorkshop On Empowering The Poor Through Rural Information Centers:
Workshop On Empowering The Poor Through Rural Information Centers: What Works and What is Sustainable? Monday, December 2, 2002. Presentation on Shortage of the Relevant Contents in Indian & Regional Context
More informationObtaining Rough Set Approximation using MapReduce Technique in Data Mining
Obtaining Rough Set Approximation using MapReduce Technique in Data Mining Varda Dhande 1, Dr. B. K. Sarkar 2 1 M.E II yr student, Dept of Computer Engg, P.V.P.I.T Collage of Engineering Pune, Maharashtra,
More informationContext Based Web Indexing For Semantic Web
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 12, Issue 4 (Jul. - Aug. 2013), PP 89-93 Anchal Jain 1 Nidhi Tyagi 2 Lecturer(JPIEAS) Asst. Professor(SHOBHIT
More informationPERSONALIZED MOBILE SEARCH ENGINE BASED ON MULTIPLE PREFERENCE, USER PROFILE AND ANDROID PLATFORM
PERSONALIZED MOBILE SEARCH ENGINE BASED ON MULTIPLE PREFERENCE, USER PROFILE AND ANDROID PLATFORM Ajit Aher, Rahul Rohokale, Asst. Prof. Nemade S.B. B.E. (computer) student, Govt. college of engg. & research
More informationInternational Journal of Scientific & Engineering Research Volume 2, Issue 12, December ISSN Web Search Engine
International Journal of Scientific & Engineering Research Volume 2, Issue 12, December-2011 1 Web Search Engine G.Hanumantha Rao*, G.NarenderΨ, B.Srinivasa Rao+, M.Srilatha* Abstract This paper explains
More informationICT (Information and Communication Technologies)
Towards a Generic Framework for the Development of Unicode Based Digital Sindhi Dictionaries IMDAD ALI ISMAILI*, ZEESHAN BHATTI** AND AZHAR ALI SHAH** RECISED ON 15.07.2011 ACCEPTED ON 01.10.2011 ABSTRACT
More informationWikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia
WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia Dong Nguyen, Arnold Overwijk, Claudia Hauff, Dolf R.B. Trieschnigg, Djoerd Hiemstra, and Franciska M.G. de
More informationAN EFFECTIVE SEARCH ON WEB LOG FROM MOST POPULAR DOWNLOADED CONTENT
AN EFFECTIVE SEARCH ON WEB LOG FROM MOST POPULAR DOWNLOADED CONTENT Brindha.S 1 and Sabarinathan.P 2 1 PG Scholar, Department of Computer Science and Engineering, PABCET, Trichy 2 Assistant Professor,
More informationIJRIM Volume 2, Issue 2 (February 2012) (ISSN )
AN ENHANCED APPROACH TO OPTIMIZE WEB SEARCH BASED ON PROVENANCE USING FUZZY EQUIVALENCE RELATION BY LEMMATIZATION Divya* Tanvi Gupta* ABSTRACT In this paper, the focus is on one of the pre-processing technique
More informationHyperlink-Extended Pseudo Relevance Feedback for Improved. Microblog Retrieval
THE AMERICAN UNIVERSITY IN CAIRO SCHOOL OF SCIENCES AND ENGINEERING Hyperlink-Extended Pseudo Relevance Feedback for Improved Microblog Retrieval A thesis submitted to Department of Computer Science and
More informationCS377: Database Systems Text data and information. Li Xiong Department of Mathematics and Computer Science Emory University
CS377: Database Systems Text data and information retrieval Li Xiong Department of Mathematics and Computer Science Emory University Outline Information Retrieval (IR) Concepts Text Preprocessing Inverted
More informationOntology Based Prediction of Difficult Keyword Queries
Ontology Based Prediction of Difficult Keyword Queries Lubna.C*, Kasim K Pursuing M.Tech (CSE)*, Associate Professor (CSE) MEA Engineering College, Perinthalmanna Kerala, India lubna9990@gmail.com, kasim_mlp@gmail.com
More informationQuery Expansion using Wikipedia and DBpedia
Query Expansion using Wikipedia and DBpedia Nitish Aggarwal and Paul Buitelaar Unit for Natural Language Processing, Digital Enterprise Research Institute, National University of Ireland, Galway firstname.lastname@deri.org
More informationPrior Art Retrieval Using Various Patent Document Fields Contents
Prior Art Retrieval Using Various Patent Document Fields Contents Metti Zakaria Wanagiri and Mirna Adriani Fakultas Ilmu Komputer, Universitas Indonesia Depok 16424, Indonesia metti.zakaria@ui.edu, mirna@cs.ui.ac.id
More informationKeywords Data alignment, Data annotation, Web database, Search Result Record
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web
More informationImage Retrieval Based on Quad Chain Code and Standard Deviation
Vol3 Issue12, December- 2014, pg 466-473 Available Online at wwwijcsmccom International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology
More informationTEXT CHAPTER 5. W. Bruce Croft BACKGROUND
41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia
More informationApplying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task
Applying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task Walid Magdy, Gareth J.F. Jones Centre for Next Generation Localisation School of Computing Dublin City University,
More informationHELP ON THE VIRTUAL LIBRARY
HELP ON THE VIRTUAL LIBRARY The Virtual Library search system allows accessing in a quick way to the information the students are interested in and that are available in the Didactic Cyberspace. In its
More informationOCR For Handwritten Marathi Script
International Journal of Scientific & Engineering Research Volume 3, Issue 8, August-2012 1 OCR For Handwritten Marathi Script Mrs.Vinaya. S. Tapkir 1, Mrs.Sushma.D.Shelke 2 1 Maharashtra Academy Of Engineering,
More informationInformation Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More informationInformation Retrieval
Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have
More informationCross Lingual Information Retrieval Using Data Mining Methods
Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2009 Proceedings Americas Conference on Information Systems (AMCIS) 2009 Cross Lingual Information Retrieval Using Data Mining Methods
More informationA Survey on improving performance of Information Retrieval System using Adaptive Genetic Algorithm
A Survey on improving performance of Information Retrieval System using Adaptive Genetic Algorithm Prajakta Mitkal 1, Prof. Ms. D.V. Gore 2 1 Modern College of Engineering Shivajinagar, Pune 2 Modern College
More informationImproving the Performance of the Peer to Peer Network by Introducing an Assortment of Methods
Journal of Computer Science 7 (1): 32-38, 2011 ISSN 1549-3636 2011 Science Publications Improving the Performance of the Peer to Peer Network by Introducing an Assortment of Methods 1 M. Sadish Sendil
More informationInformation Retrieval Using Context Based Document Indexing and Term Graph
Information Retrieval Using Context Based Document Indexing and Term Graph Mr. Mandar Donge ME Student, Department of Computer Engineering, P.V.P.I.T, Bavdhan, Savitribai Phule Pune University, Pune, Maharashtra,
More informationInternational Journal of Scientific & Engineering Research, Volume 6, Issue 10, October ISSN
International Journal of Scientific & Engineering Research, Volume 6, Issue 10, October-2015 726 Performance Validation of the Modified K- Means Clustering Algorithm Clusters Data S. Govinda Rao Associate
More informationCross-Lingual Word Sense Disambiguation
Cross-Lingual Word Sense Disambiguation Priyank Jaini Ankit Agrawal pjaini@iitk.ac.in ankitag@iitk.ac.in Department of Mathematics and Statistics Department of Mathematics and Statistics.. Mentor: Prof.
More informationChapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationChrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO
Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO INDEX Proposal Recap Implementation Evaluation Future Works Proposal Recap Keyword Visualizer (chrome
More informationQUERY RECOMMENDATION SYSTEM USING USERS QUERYING BEHAVIOR
International Journal of Emerging Technology and Innovative Engineering QUERY RECOMMENDATION SYSTEM USING USERS QUERYING BEHAVIOR V.Megha Dept of Computer science and Engineering College Of Engineering
More informationDiscovery of Agricultural Patterns Using Parallel Hybrid Clustering Paradigm
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 PP 10-15 www.iosrjen.org Discovery of Agricultural Patterns Using Parallel Hybrid Clustering Paradigm P.Arun, M.Phil, Dr.A.Senthilkumar
More informationCHAPTER 3 INFORMATION RETRIEVAL BASED ON QUERY EXPANSION AND LATENT SEMANTIC INDEXING
43 CHAPTER 3 INFORMATION RETRIEVAL BASED ON QUERY EXPANSION AND LATENT SEMANTIC INDEXING 3.1 INTRODUCTION This chapter emphasizes the Information Retrieval based on Query Expansion (QE) and Latent Semantic
More informationA Survey On Diversification Techniques For Unabmiguous But Under- Specified Queries
J. Appl. Environ. Biol. Sci., 4(7S)271-276, 2014 2014, TextRoad Publication ISSN: 2090-4274 Journal of Applied Environmental and Biological Sciences www.textroad.com A Survey On Diversification Techniques
More informationFull Text Search in Multi-lingual Documents - A Case Study describing Evolution of the Technology At Spectrum Business Support Ltd.
Full Text Search in Multi-lingual Documents - A Case Study describing Evolution of the Technology At Spectrum Business Support Ltd. This paper was presented at the ICADL conference December 2001 by Spectrum
More informationResearch and Design of Key Technology of Vertical Search Engine for Educational Resources
2017 International Conference on Arts and Design, Education and Social Sciences (ADESS 2017) ISBN: 978-1-60595-511-7 Research and Design of Key Technology of Vertical Search Engine for Educational Resources
More informationExtensible Dynamic Form Approach for Supplier Discovery
Extensible Dynamic Form Approach for Supplier Discovery Yan Kang, Jaewook Kim, and Yun Peng Department of Computer Science and Electrical Engineering University of Maryland, Baltimore County {kangyan1,
More informationA Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task
A Text-Based Approach to the ImageCLEF 2010 Photo Annotation Task Wei Li, Jinming Min, Gareth J. F. Jones Center for Next Generation Localisation School of Computing, Dublin City University Dublin 9, Ireland
More informationAn Ontology-Based Information Retrieval Model for Domesticated Plants
An Ontology-Based Information Retrieval Model for Domesticated Plants Ruban S 1, Kedar Tendolkar 2, Austin Peter Rodrigues 2, Niriksha Shetty 2 Assistant Professor, Department of IT, AIMIT, St Aloysius
More informationText Mining: A Burgeoning technology for knowledge extraction
Text Mining: A Burgeoning technology for knowledge extraction 1 Anshika Singh, 2 Dr. Udayan Ghosh 1 HCL Technologies Ltd., Noida, 2 University School of Information &Communication Technology, Dwarka, Delhi.
More informationDublin City University at CLEF 2005: Multi-8 Two-Years-On Merging Experiments
Dublin City University at CLEF 2005: Multi-8 Two-Years-On Merging Experiments Adenike M. Lam-Adesina Gareth J. F. Jones School of Computing, Dublin City University, Dublin 9, Ireland {adenike,gjones}@computing.dcu.ie
More informationRefinement of Web Search using Word Sense Disambiguation and Intent Mining
International Journal of Information and Computation Technology. ISSN 974-2239 Volume 4, Number 3 (24), pp. 22-23 International Research Publications House http://www. irphouse.com /ijict.htm Refinement
More informationDomain-specific Concept-based Information Retrieval System
Domain-specific Concept-based Information Retrieval System L. Shen 1, Y. K. Lim 1, H. T. Loh 2 1 Design Technology Institute Ltd, National University of Singapore, Singapore 2 Department of Mechanical
More informationMultilingual Information Access for Digital Libraries The Metadata Records Translation Project
Multilingual Information Access for Digital Libraries The Metadata Records Translation Project Jiangping Chen Http://max.lis.unt.edu/ Jiangping.chen@unt.edu July 2011 Presentation Outline About Me Current
More information