The Research of A multi-language supporting description-oriented Clustering Algorithm on Meta-Search Engine Result Wuling Ren 1, a and Lijuan Liu 2,b
|
|
- Alaina Small
- 6 years ago
- Views:
Transcription
1 Applied Mechanics and Materials Online: ISSN: , Vol. 151, pp doi: / Trans Tech Publications, Switzerland The Research of A multi-language supporting description-oriented Clustering Algorithm on Meta-Search Engine Result Wuling Ren 1, a and Lijuan Liu 2,b 1 Zhejiang Gongshang University, Hangzhou,China 2 Zhejiang Gongshang University, Hangzhou,China a rwl@zjgsu.edu.cn l, b liulijuan2012@163.com Keywords: Meta-Search Engine; Chinese Segmentation;Text Clustering; DCFC Clustering. Abstract. Search engine has adopted a variety of techniques to improve the accuracy of information retrieval, but the way of a linear list of search engine results, which mixes unrelated documents with relevant documents, has brought user great burden. This article commits to build clustering of search results, which is based on meta search engine techniques. We use all the popular search engine as a data source, then after a certain pre-processing of the source search engine, hierarchical clustering results is formed and returned to the query users. we propose a multi-language supporting, label first clustering algorithm, which we named DCFC algorithm. This algorithm supports both Chinese and English query, focuses on generating human readable labels, shows search results in hierarchical structure. 1 Introduction Given the scale of the Internet, and search engines has become an important means of access to information, which can give the search engine operators to bring huge economic benefits. Meanwhile, the search engine as an emerging cross-disciplinary, integrated search engine can be a text mining, database theory, natural language understanding, etc. Can be seen from the theoretical and practical aspects with a high research value. Currently, the field of data mining are at home and abroad to study, research institute and a hot area[1]. Search results clustering in the study, can be divided into pre-and post-clustering clustering two. Pre-clustering of web documents in the search before clustering, a large amount of data the Internet, this approach requires high computing resources. After the clustering is a clustering search results, greatly reducing the number of documents clustering, such a program for real-time requirements. Given that the Internet is huge, and the dynamic characteristics of the Internet, often used after clustering. The main content of this study include the following: Analysis of the search engines and data mining related research status, highlights the relevant principles of meta-search engines, as well as the theory of text clustering. The design of a meta-search engine DCFC based on the data source, on the one hand the use of search engines to obtain the source data, on the other hand the results of user queries can be indexed and stored, so as to improve the efficiency of the next query. segmentation and other operations, by calculating the maximum frequent candidate set as the classlabel name, and then by hierarchical latent semantic analysis to generate class labels, and finally by calculating the relevant data and class labels of the data into the corresponding class[2]. 2 Meta-search engines and text mining research Search through the user interface for the user's query, and then store the data from the index database and return relevant information to end users. In order to obtain enough and relevant user data, search engines often need to maintain a large inventory to put the index data. A typical search engine, mainly by the web spider, Indexer, searcher and the user interface components. All rights reserved. No part of contents of this paper may be reproduced or transmitted in any form or by any means without the written permission of Trans Tech Publications, (# , Pennsylvania State University, University Park, USA-18/09/16,15:46:03)
2 550 New Trends in Mechatronics and Materials Engineering The characteristics of meta-search engines Compared to the independent search engine, meta search engine has the following characteristics:data from multiple independent secondary result of the search engine results; no independent web library, save storage space and network bandwidth; provides a single query interface to submit a query to multiple search engines;of multiple independent search engine results to re-combined, scoring, sorting. Meta Search Engine(MSE) works as follows[3]: The user enters a query keywords, MSE certain keyword query processing, such as multiple query terms translated into all members of the Boolean search engine supports the format of the query to determine the theme and so on. MSE source search engine based on the scheduling of its way, a number of sources from which you select the search engine, users can set their own members through the search engine list. MSE according to their individual source queries search engine support, through pre-configured mapping will query the source assembly into the URL string search engine support. via an HTTP request method, the queries submitted to various search engines and receive the source returns the result. If it exceeds the allotted time, the source did not receive a search engine results, then resubmit the request, or to abandon the source of search engine results. receiving each source search engine results, and all results to the weight and eliminate reproduced page, and query results and query the relevance of query results. According to the user's individual characteristics, the results presented to the user. Such as through a linear list, graphical, and automatic classification methods. 3 Multi-layer label priority text clustering algorithm 3.1 Multi-label text clustering algorithm DCFC priority Search results clustering in the existing studies, mostly based on the search engine returns a list of documents to provide the URL, and content of the full-text crawl down the cluster. Label priority text clustering algorithm (Description Comes First Clustering), based on user input keywords into the search engine queries, and the results (a summary of the search engine returned) after clustering to the user. Characterized on the one hand to support Chinese and English queries, the other is different from the traditional clustering algorithm is to emphasize the name of readability clustering. Fig. 1. Clustering algorithm with the traditional label priority difference between clustering process This section of the DCFC complete description of the algorithm, where necessary, given the program's pseudo-code and some examples. DCFC includes the following five steps: data preprocessing, segmentation, frequent phrase generation, the generation of multi-class label, the data go to the appropriate category under.
3 Applied Mechanics and Materials Vol Generate frequent phrase Through the data pre-processing to extract the title and summary information, we have obtained the required data source clustering. Frequent phrase generation is mainly used to find documents in a phrase used to describe the document content, the tag name as a candidate. In this paper, the classic frequent itemsets algorithm Apriori, to find frequent phrases Mining frequent itemsets algorithm described as follows: (1) L1 = find_frequent_1-itemsets(d); // mining frequent 1 - itemsets, it is easier (2) for (k=2; Lk 1 Φ ;k++) { C _ ( (3) k = apriori gen Lk 1, min_ sup) ; // call apriori_gen method to generate candidate frequent k-itemsets (4) for each transaction t D { // Scan the transaction database D C (5) Ct = subset( k,t); C (6) for each candidate c t (7) c.count++; // statistics candidate frequent k-itemset count(8) } L (9) k C ={c k c.count min_sup} // satisfy the minimum support of the k-itemset is frequent k-itemsets (10) } (11) return L= L k ; // merge frequent k-itemsets (k> 0) 3.3 Generation of multi-class label Frequent phrase in the generation stage, we get to represent the semantic information of the document as a frequent phrase candidate class labels. In the following, we will first generate the class labels, and organized according to their semantic relations hierarchy. Here, we use latent semantic analysis (Latent Semantic Indexing, LSI), to achieve the extraction of abstract concepts. LSI matrix of singular value decomposition (SVD) method as its mathematical basis, the use of SVD method the results of the U matrix to obtain the document containing the concept. U matrix of any column vector represents an abstraction of the document, as the column vector U contains the matrix concept is expressed in the form of vectors, so that users can not directly understand this. First, we calculated based on frequent word document word frequency matrix A, where the weights we use TF-IDF calculation. SVD decomposition of matrix A, we can get the matrix U, on behalf of its column vector abstraction. Secondly, the formation of abstract concepts and the SVD generated by frequent phrase match, about abstract concepts concrete, to find documents that summarize the contents of the tag name of summary. Frequent phrases in the candidate generation phase to obtain labels, who have been in
4 552 New Trends in Mechatronics and Materials Engineering accordance with easy-to-understand way of screening, it is a good label readability. This process is similar with the query process, we will name the candidate labels as a query q, all the candidate labels were composed of vector and vector matrix of the candidate word phrase which is frequently a column vector is a column vector of the candidate word. Calculate P for each column vector and the distance between the column vectors, so that C = P, C means that each column vector P with the abstract concept of the distance value of each component of the matrix. Matrix C can be selected from the biggest match of abstract phrases[4]. At this point, we take the matrix C in each row corresponding to the maximum frequent candidate phrases, frequent words were selected from all angles the maximum value. If the difference between the threshold value in a range, meaning that the two are very similar, can be used to express the same concept; if the difference between the two is greater than the threshold value, that value is smaller word or phrase and abstract concept distance, thus leaving only the larger value of a (word or phrase is possible) to express abstract concepts. Hierarchical: the tag name of a single word or phrase as a child tree node, the node child node does not exist. The label of the same word or phrase into the tree under the same node[5]. 4 Prototype system running example and conclusions 4.1Development Environment The system development and operating environment are as follows: TABLE 1. DESCRIBES HARDWARE AND SOFTWARE ENVIRONMENT Hardware environment GATEWAY T6832C Operating system Windows XP Development language Java Database MYSQL(5.0.18) Application Server Tomcat(6.0.18) 4.2 System Architecture View annotation tool for collaborative product design in the lace for the practical application process, designers and technicians are being spent to design and geometry-based discussion. Designers view as the server-side start the collaborative annotation tools, process design staff provide the assembly model to browse examination, collaborative exchange, customers can flower pattern already in the database annotation and the need to modify their own program, a total collaboration platform designers and customers in different reference pattern to speed up the design of new development. 5 Conclusion This traditional search engines as an improvement by bringing together the results to achieve higher data coverage, data mining clustering techniques to achieve results in order to shorten the time the user location information. This approach has some viability, to a certain extent reduce the burden on the user query information to quickly find the information really needed. However, the proposed method, there are some shortcomings need to further improve system accuracy. In addition, the system implementation process, many functions can be optimized. Acknowledgment This project is supported by the Science and Technology Research Programs of Zhejiang Province, China (No.2009C , 2009C11159) and by Zhejiang GongShang University Graduate Science andtechnology innovative projects (NO.1130XJ ,3070JQ ).
5 Applied Mechanics and Materials Vol References [1] Stanis law Osi nski, Jerzy Stefanowski, and Dawid Weiss. Lingo: Search results clustering algorithm based on Singular Value Decomposition. In K_lopotek, M.A., Wierzcho n, S.T., Trojanowski, K., eds.: Proceedings of the International IIS: Intelligent Information Processing and Web Mining Conference. Advances in Soft Computing, Zakopane, Poland, Springer (2004) [2] Chien L.F., PAT-Tree-Based Adaptive Key phrase Extraction for Intelligent Chinese Information Retrieval. In Proceedings of the 20m Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval(SIGIR 93), pages , Pittsburgh, PA, [3] Lan Huang. A Survey on Web Information Retrieval Technologies [ EB/ OL ]. ECSL Technical Report, State University of New York, [4] L Ding, T Finin, A Joshi, R Pan, RS Cost, Y Peng. Swoogle: a search and metadata engine for the semantic web. In CIKM [5] Wu L, Mcelean S. Result merging methods in distributed information retrieval with overlapping databases.information Retrieval, 2007,10(3): [6] Carrot2 Framework. Carrot2: Design of a Flexible and Efficient Web Information Retrieval Framework. Third International Atlantic Web Intelligence Conference (AWIC2005), Łodź, Poland, 2005, [7] P. Ferragina, A. Gulli. A personalized search engine based on web-snippet hierarchical clustering. www14, 2005.
Chapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationImproving Suffix Tree Clustering Algorithm for Web Documents
International Conference on Logistics Engineering, Management and Computer Science (LEMCS 2015) Improving Suffix Tree Clustering Algorithm for Web Documents Yan Zhuang Computer Center East China Normal
More informationResearch and Application of E-Commerce Recommendation System Based on Association Rules Algorithm
Research and Application of E-Commerce Recommendation System Based on Association Rules Algorithm Qingting Zhu 1*, Haifeng Lu 2 and Xinliang Xu 3 1 School of Computer Science and Software Engineering,
More informationResearch and implementation of search engine based on Lucene Wan Pu, Wang Lisha
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha Physics Institute,
More informationConstruction of the Library Management System Based on Data Warehouse and OLAP Maoli Xu 1, a, Xiuying Li 2,b
Applied Mechanics and Materials Online: 2013-08-30 ISSN: 1662-7482, Vols. 380-384, pp 4796-4799 doi:10.4028/www.scientific.net/amm.380-384.4796 2013 Trans Tech Publications, Switzerland Construction of
More informationWeb Page Classification using FP Growth Algorithm Akansha Garg,Computer Science Department Swami Vivekanad Subharti University,Meerut, India
Web Page Classification using FP Growth Algorithm Akansha Garg,Computer Science Department Swami Vivekanad Subharti University,Meerut, India Abstract - The primary goal of the web site is to provide the
More informationRepresentation/Indexing (fig 1.2) IR models - overview (fig 2.1) IR models - vector space. Weighting TF*IDF. U s e r. T a s k s
Summary agenda Summary: EITN01 Web Intelligence and Information Retrieval Anders Ardö EIT Electrical and Information Technology, Lund University March 13, 2013 A Ardö, EIT Summary: EITN01 Web Intelligence
More informationThe Design of Distributed File System Based on HDFS Yannan Wang 1, a, Shudong Zhang 2, b, Hui Liu 3, c
Applied Mechanics and Materials Online: 2013-09-27 ISSN: 1662-7482, Vols. 423-426, pp 2733-2736 doi:10.4028/www.scientific.net/amm.423-426.2733 2013 Trans Tech Publications, Switzerland The Design of Distributed
More informationResearch Institute of Uranium Geology,Beijing , China a
Advanced Materials Research Online: 2014-06-25 ISSN: 1662-8985, Vols. 971-973, pp 1607-1610 doi:10.4028/www.scientific.net/amr.971-973.1607 2014 Trans Tech Publications, Switzerland Discussion on Development
More informationAutomated Online News Classification with Personalization
Automated Online News Classification with Personalization Chee-Hong Chan Aixin Sun Ee-Peng Lim Center for Advanced Information Systems, Nanyang Technological University Nanyang Avenue, Singapore, 639798
More informationResearch and Improvement of Apriori Algorithm Based on Hadoop
Research and Improvement of Apriori Algorithm Based on Hadoop Gao Pengfei a, Wang Jianguo b and Liu Pengcheng c School of Computer Science and Engineering Xi'an Technological University Xi'an, 710021,
More informationConstructing an University Scientific Research Management Information System of NET Platform Jianhua Xie 1, a, Jian-hua Xiao 2, b
Applied Mechanics and Materials Online: 2013-12-04 ISSN: 1662-7482, Vol. 441, pp 984-988 doi:10.4028/www.scientific.net/amm.441.984 2014 Trans Tech Publications, Switzerland Constructing an University
More informationResearch on Full-text Retrieval based on Lucene in Enterprise Content Management System Lixin Xu 1, a, XiaoLin Fu 2, b, Chunhua Zhang 1, c
Applied Mechanics and Materials Submitted: 2014-07-18 ISSN: 1662-7482, Vols. 644-650, pp 1950-1953 Accepted: 2014-07-21 doi:10.4028/www.scientific.net/amm.644-650.1950 Online: 2014-09-22 2014 Trans Tech
More informationDesign and Implementation of Search Engine Using Vector Space Model for Personalized Search
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 1, January 2014,
More informationIntroduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.
Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How
More informationTALP at WePS Daniel Ferrés and Horacio Rodríguez
TALP at WePS-3 2010 Daniel Ferrés and Horacio Rodríguez TALP Research Center, Software Department Universitat Politècnica de Catalunya Jordi Girona 1-3, 08043 Barcelona, Spain {dferres, horacio}@lsi.upc.edu
More informationThe Analysis of the Loss Rate of Information Packet of Double Queue Single Server in Bi-directional Cable TV Network
Applied Mechanics and Materials Submitted: 2014-06-18 ISSN: 1662-7482, Vol. 665, pp 674-678 Accepted: 2014-07-31 doi:10.4028/www.scientific.net/amm.665.674 Online: 2014-10-01 2014 Trans Tech Publications,
More informationA New Technique to Optimize User s Browsing Session using Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
More informationBing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web
More informationSearch Results Clustering in Polish: Evaluation of Carrot
Search Results Clustering in Polish: Evaluation of Carrot DAWID WEISS JERZY STEFANOWSKI Institute of Computing Science Poznań University of Technology Introduction search engines tools of everyday use
More informationResearch Of Data Model In Engineering Flight Simulation Platform Based On Meta-Data Liu Jinxin 1,a, Xu Hong 1,b, Shen Weiqun 2,c
Applied Mechanics and Materials Online: 2013-06-13 ISSN: 1662-7482, Vols. 325-326, pp 1750-1753 doi:10.4028/www.scientific.net/amm.325-326.1750 2013 Trans Tech Publications, Switzerland Research Of Data
More informationMining of Web Server Logs using Extended Apriori Algorithm
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More informationMultimodal Information Spaces for Content-based Image Retrieval
Research Proposal Multimodal Information Spaces for Content-based Image Retrieval Abstract Currently, image retrieval by content is a research problem of great interest in academia and the industry, due
More informationIntroducing Usability Practices to OSS: The Insiders Experience
Introducing Usability Practices to OSS: The Insiders Experience Stanis law Osiński 1 and Dawid Weiss 2 1 Poznan Supercomputing and Networking Center, stanislaw.osinski@man.poznan.pl 2 Institute of Computing
More informationClustering Analysis based on Data Mining Applications Xuedong Fan
Applied Mechanics and Materials Online: 203-02-3 ISSN: 662-7482, Vols. 303-306, pp 026-029 doi:0.4028/www.scientific.net/amm.303-306.026 203 Trans Tech Publications, Switzerland Clustering Analysis based
More informationResearch Article Apriori Association Rule Algorithms using VMware Environment
Research Journal of Applied Sciences, Engineering and Technology 8(2): 16-166, 214 DOI:1.1926/rjaset.8.955 ISSN: 24-7459; e-issn: 24-7467 214 Maxwell Scientific Publication Corp. Submitted: January 2,
More informationThe Application of Programmable Controller to Chip Design. Shihong Lan 1, Jian Zhang 2
Applied Mechanics and Materials Online: 2013-01-11 ISSN: 1662-7482, Vol. 273, pp 722-725 doi:10.4028/www.scientific.net/amm.273.722 2013 Trans Tech Publications, Switzerland The Application of Programmable
More informationOntology based Model and Procedure Creation for Topic Analysis in Chinese Language
Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language Dong Han and Kilian Stoffel Information Management Institute, University of Neuchâtel Pierre-à-Mazel 7, CH-2000 Neuchâtel,
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationAn Improved Frequent Pattern-growth Algorithm Based on Decomposition of the Transaction Database
Algorithm Based on Decomposition of the Transaction Database 1 School of Management Science and Engineering, Shandong Normal University,Jinan, 250014,China E-mail:459132653@qq.com Fei Wei 2 School of Management
More informationDesign and Implementation of Agricultural Information Resources Vertical Search Engine Based on Nutch
619 A publication of CHEMICAL ENGINEERING TRANSACTIONS VOL. 51, 2016 Guest Editors: Tichun Wang, Hongyang Zhang, Lei Tian Copyright 2016, AIDIC Servizi S.r.l., ISBN 978-88-95608-43-3; ISSN 2283-9216 The
More informationSearching the Deep Web
Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index
More informationA Finite State Mobile Agent Computation Model
A Finite State Mobile Agent Computation Model Yong Liu, Congfu Xu, Zhaohui Wu, Weidong Chen, and Yunhe Pan College of Computer Science, Zhejiang University Hangzhou 310027, PR China Abstract In this paper,
More informationProject Report on winter
Project Report on 01-60-538-winter Yaxin Li, Xiaofeng Liu October 17, 2017 Li, Liu October 17, 2017 1 / 31 Outline Introduction a Basic Search Engine with Improvements Features PageRank Classification
More informationClustering of Web Search Results Based on Document Segmentation
Computer and Information Science; Vol. 6, No. 3; 23 ISSN 93-8989 E-ISSN 93-8997 Published by Canadian Center of Science and Education Clustering of Web Search Results Based on Document Segmentation Mohammad
More informationStorage Model of Graph Based on Variable Collection
Advanced Materials Research Online: 2013-09-04 ISSN: 1662-8985, Vols. 765-767, pp 1456-1460 doi:10.4028/www.scientific.net/amr.765-767.1456 2013 Trans Tech Publications, Switzerland Storage Model of Graph
More informationPart I: Data Mining Foundations
Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?
More informationApplication of Individualized Service System for Scientific and Technical Literature In Colleges and Universities
Journal of Applied Science and Engineering Innovation, Vol.6, No.1, 2019, pp.26-30 ISSN (Print): 2331-9062 ISSN (Online): 2331-9070 Application of Individualized Service System for Scientific and Technical
More informationRANKING AND SUGGESTING POPULAR ITEMSETS IN MOBILE STORES USING MODIFIED APRIORI ALGORITHM
Vol.2, Issue.1, Jan-Feb 2012 pp-431-435 ISSN: 2249-6645 RANKING AND SUGGESTING POPULAR ITEMSETS IN MOBILE STORES USING MODIFIED APRIORI ALGORITHM P V Vara Prasad #1,Sayempu Sushmitha *2, Badduri Divya
More informationShape Optimization Design of Gravity Buttress of Arch Dam Based on Asynchronous Particle Swarm Optimization Method. Lei Xu
Applied Mechanics and Materials Submitted: 2014-08-26 ISSN: 1662-7482, Vol. 662, pp 160-163 Accepted: 2014-08-31 doi:10.4028/www.scientific.net/amm.662.160 Online: 2014-10-01 2014 Trans Tech Publications,
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationInternational Journal of Advanced Research in Computer Science and Software Engineering
Volume 3, Issue 3, March 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue:
More informationThe Analysis and Research of IPTV Set-top Box System. Fangyan Bai 1, Qi Sun 2
Applied Mechanics and Materials Online: 2012-12-13 ISSN: 1662-7482, Vols. 256-259, pp 2898-2901 doi:10.4028/www.scientific.net/amm.256-259.2898 2013 Trans Tech Publications, Switzerland The Analysis and
More informationResearch Article Semantic Clustering of Search Engine Results
Hindawi Publishing Corporation e Scientific World Journal Volume 25, Article ID 93258, 9 pages http://dx.doi.org/.55/25/93258 Research Article Semantic Clustering of Search Engine Results Sara Saad Soliman,
More informationResearch on the Application of Digital Images Based on the Computer Graphics. Jing Li 1, Bin Hu 2
Applied Mechanics and Materials Online: 2014-05-23 ISSN: 1662-7482, Vols. 556-562, pp 4998-5002 doi:10.4028/www.scientific.net/amm.556-562.4998 2014 Trans Tech Publications, Switzerland Research on the
More informationINTRODUCTION. Chapter GENERAL
Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which
More informationAPRIORI ALGORITHM FOR MINING FREQUENT ITEMSETS A REVIEW
International Journal of Computer Application and Engineering Technology Volume 3-Issue 3, July 2014. Pp. 232-236 www.ijcaet.net APRIORI ALGORITHM FOR MINING FREQUENT ITEMSETS A REVIEW Priyanka 1 *, Er.
More informationStudy on A Recommendation Algorithm of Crossing Ranking in E- commerce
International Journal of u-and e-service, Science and Technology, pp.53-62 http://dx.doi.org/10.14257/ijunnesst2014.7.4.6 Study on A Recommendation Algorithm of Crossing Ranking in E- commerce Duan Xueying
More informationPersonalized Search Engine using Social Networking Activity
Indian Journal of Science and Technology, Vol 8(4), 301 306, February 2015 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 DOI : 10.17485/ijst/2015/v8i4/60376 Personalized Search Engine using Social
More informationA Template-Matching-Based Fast Algorithm for PCB Components Detection Haiming Yin
Advanced Materials Research Online: 2013-05-14 ISSN: 1662-8985, Vols. 690-693, pp 3205-3208 doi:10.4028/www.scientific.net/amr.690-693.3205 2013 Trans Tech Publications, Switzerland A Template-Matching-Based
More informationPersonalized Search for TV Programs Based on Software Man
Personalized Search for TV Programs Based on Software Man 12 Department of Computer Science, Zhengzhou College of Science &Technology Zhengzhou, China 450064 E-mail: 492590002@qq.com Bao-long Zhang 3 Department
More informationSocial Network Recommendation Algorithm based on ICIP
Social Network Recommendation Algorithm based on ICIP 1 School of Computer Science and Technology, Changchun University of Science and Technology E-mail: bilin7080@163.com Xiaoqiang Di 2 School of Computer
More informationA New Model of Search Engine based on Cloud Computing
A New Model of Search Engine based on Cloud Computing DING Jian-li 1,2, YANG Bo 1 1. College of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China 2. Tianjin Key
More informationWeb People Search using Ontology Based Decision Tree Mrunal Patil 1, Sonam Khomane 2, Varsha Saykar, 3 Kavita Moholkar 4
Web People Search using Ontology Based Decision Tree Mrunal Patil 1, Sonam Khomane 2, Varsha Saykar, 3 Kavita Moholkar 4 * (Department of Computer Engineering, Rajarshi Shahu College of Engineering, Pune-411033,
More informationCS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University
CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and
More informationResearch on 3G Terminal-Based Agricultural Information Service
Research on 3G Terminal-Based Agricultural Information Service Neng-fu Xie and Xuefu Zhang Agricultural Information Institute, The Chinese Academy of Agricultural Sciences Key Laboratory of Digital Agricultural
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval Mohsen Kamyar چهارمین کارگاه ساالنه آزمایشگاه فناوری و وب بهمن ماه 1391 Outline Outline in classic categorization Information vs. Data Retrieval IR Models Evaluation
More informationKeywords: Interactive electronic technical manuals; GJB6600; XML markup language; Automatic control equipment
Applied Mechanics and Materials Submitted: 2014-06-11 ISSN: 1662-7482, Vols. 602-605, pp 1165-1168 Accepted: 2014-06-11 doi:10.4028/www.scientific.net/amm.602-605.1165 Online: 2014-08-11 2014 Trans Tech
More informationAn Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data
An Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data Xiaorong Yang 1,2, Wensheng Wang 1,2, Qingtian Zeng 3, and Nengfu Xie 1,2 1 Agriculture Information Institute,
More informationEnhanced Web Log Based Recommendation by Personalized Retrieval
Enhanced Web Log Based Recommendation by Personalized Retrieval Xueping Peng FACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY UNIVERSITY OF TECHNOLOGY, SYDNEY A thesis submitted for the degree of Doctor
More informationMining Distributed Frequent Itemset with Hadoop
Mining Distributed Frequent Itemset with Hadoop Ms. Poonam Modgi, PG student, Parul Institute of Technology, GTU. Prof. Dinesh Vaghela, Parul Institute of Technology, GTU. Abstract: In the current scenario
More informationDesign of the Software for Wirelessly Intercepting Voices
Advanced Materials Research Online: 2014-05-23 ISSN: 1662-8985, Vols. 926-930, pp 2470-2473 doi:10.4028/www.scientific.net/amr.926-930.2470 2014 Trans Tech Publications, Switzerland Design of the Software
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationData Mining Part 3. Associations Rules
Data Mining Part 3. Associations Rules 3.2 Efficient Frequent Itemset Mining Methods Fall 2009 Instructor: Dr. Masoud Yaghini Outline Apriori Algorithm Generating Association Rules from Frequent Itemsets
More informationSerial Communication Based on LabVIEW for the Development of an ECG Monitor
Advanced Materials Research Online: 2013-08-16 ISSN: 1662-8985, Vols. 734-737, pp 3003-3006 doi:10.4028/www.scientific.net/amr.734-737.3003 2013 Trans Tech Publications, Switzerland Serial Communication
More informationA Data Classification Algorithm of Internet of Things Based on Neural Network
A Data Classification Algorithm of Internet of Things Based on Neural Network https://doi.org/10.3991/ijoe.v13i09.7587 Zhenjun Li Hunan Radio and TV University, Hunan, China 278060389@qq.com Abstract To
More informationAN IMPROVED APRIORI BASED ALGORITHM FOR ASSOCIATION RULE MINING
AN IMPROVED APRIORI BASED ALGORITHM FOR ASSOCIATION RULE MINING 1NEESHA SHARMA, 2 DR. CHANDER KANT VERMA 1 M.Tech Student, DCSA, Kurukshetra University, Kurukshetra, India 2 Assistant Professor, DCSA,
More informationThe Application Analysis and Network Design of wireless VPN for power grid. Wang Yirong,Tong Dali,Deng Wei
Applied Mechanics and Materials Online: 2013-09-27 ISSN: 1662-7482, Vols. 427-429, pp 2130-2133 doi:10.4028/www.scientific.net/amm.427-429.2130 2013 Trans Tech Publications, Switzerland The Application
More informationStudy and Design of CAN / LIN Hybrid Network of Automotive Body. Peng Huang
Advanced Materials Research Online: 2014-06-30 ISSN: 1662-8985, Vol. 940, pp 469-474 doi:10.4028/www.scientific.net/amr.940.469 2014 Trans Tech Publications, Switzerland Study and Design of CAN / LIN Hybrid
More informationExperience of Developing a Meta-Semantic Search Engine
2013 International Conference on Cloud & Ubiquitous Computing & Emerging Technologies Experience of Developing a Meta-Semantic Search Engine Debajyoti Mukhopadhyay 1, Manoj Sharma 1, Gajanan Joshi 1, Trupti
More informationInformation Gathering Support Interface by the Overview Presentation of Web Search Results
Information Gathering Support Interface by the Overview Presentation of Web Search Results Takumi Kobayashi Kazuo Misue Buntarou Shizuki Jiro Tanaka Graduate School of Systems and Information Engineering
More informationQuery Languages. Berlin Chen Reference: 1. Modern Information Retrieval, chapter 4
Query Languages Berlin Chen 2005 Reference: 1. Modern Information Retrieval, chapter 4 Data retrieval Pattern-based querying The Kinds of Queries Retrieve docs that contains (or exactly match) the objects
More informationResearch on Computer Network Virtual Laboratory based on ASP.NET. JIA Xuebin 1, a
International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) Research on Computer Network Virtual Laboratory based on ASP.NET JIA Xuebin 1, a 1 Department of Computer,
More informationInternational Journal of Science Engineering and Advance Technology, IJSEAT, Vol 2, Issue 11, November ISSN
International Journal of Science Engineering and Advance Technology, IJSEAT, Vol 2, Issue 11, November - 2014 ISSN 2321-6905 Unique value disintegration for probing results using clustering algorithm 1
More informationDesigning a Data Warehouse for an ERP Using Business Intelligence
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Volume 2, PP 70-74 www.iosrjen.org Designing a Data Warehouse for an ERP Using Business Intelligence Sanket Masurkar 1,Aishwarya
More informationTutorial on Association Rule Mining
Tutorial on Association Rule Mining Yang Yang yang.yang@itee.uq.edu.au DKE Group, 78-625 August 13, 2010 Outline 1 Quick Review 2 Apriori Algorithm 3 FP-Growth Algorithm 4 Mining Flickr and Tag Recommendation
More informationSemantic Website Clustering
Semantic Website Clustering I-Hsuan Yang, Yu-tsun Huang, Yen-Ling Huang 1. Abstract We propose a new approach to cluster the web pages. Utilizing an iterative reinforced algorithm, the model extracts semantic
More informationAnalysis on the technology improvement of the library network information retrieval efficiency
Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(6):2198-2202 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Analysis on the technology improvement of the
More informationFrequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management
Frequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management Kranti Patil 1, Jayashree Fegade 2, Diksha Chiramade 3, Srujan Patil 4, Pradnya A. Vikhar 5 1,2,3,4,5 KCES
More informationChapter 2. Architecture of a Search Engine
Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them
More informationIMPROVING APRIORI ALGORITHM USING PAFI AND TDFI
IMPROVING APRIORI ALGORITHM USING PAFI AND TDFI Manali Patekar 1, Chirag Pujari 2, Juee Save 3 1,2,3 Computer Engineering, St. John College of Engineering And Technology, Palghar Mumbai, (India) ABSTRACT
More informationEfficient Indexing and Searching Framework for Unstructured Data
Efficient Indexing and Searching Framework for Unstructured Data Kyar Nyo Aye, Ni Lar Thein University of Computer Studies, Yangon kyarnyoaye@gmail.com, nilarthein@gmail.com ABSTRACT The proliferation
More informationLRLW-LSI: An Improved Latent Semantic Indexing (LSI) Text Classifier
LRLW-LSI: An Improved Latent Semantic Indexing (LSI) Text Classifier Wang Ding, Songnian Yu, Shanqing Yu, Wei Wei, and Qianfeng Wang School of Computer Engineering and Science, Shanghai University, 200072
More informationChapter 27 Introduction to Information Retrieval and Web Search
Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval
More informationDepartment of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _
COURSE DELIVERY PLAN - THEORY Page 1 of 6 Department of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _ LP: CS6007 Rev. No: 01 Date: 27/06/2017 Sub.
More informationInternational Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015)
International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) The Improved Apriori Algorithm was Applied in the System of Elective Courses in Colleges and Universities
More informationApplication of CAD/CAE/CAM Technology in Plastics Injection Mould Design and Manufacture. Ming He Dai,Zhi Dong Yun
Advanced Materials Research Vols. 399-401 (2012) pp 2271-2275 Online available since 2011/Nov/22 at www.scientific.net (2012) Trans Tech Publications, Switzerland doi:10.4028/www.scientific.net/amr.399-401.2271
More informationCemetery Navigation and Information Query System Based on Android and Java Web
2017 3rd International Conference on Computational Systems and Communications (ICCSC 2017) Cemetery Navigation and Information Query System Based on Android and Java Web Chao Ding1, a, Yongjie Yang1, b,
More informationContext Based Web Indexing For Semantic Web
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 12, Issue 4 (Jul. - Aug. 2013), PP 89-93 Anchal Jain 1 Nidhi Tyagi 2 Lecturer(JPIEAS) Asst. Professor(SHOBHIT
More informationDATA MINING II - 1DL460. Spring 2014"
DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationSTUDY ON 3D SOLID RECONSTRUCTION FROM 2D VIEWS BASED ON INTELLIGENT UNDERSTANDING OF MECHANICAL ENGINEERING DRAWINGS
STUDY ON 3D SOLID RECONSTRUCTION FROM 2D VIEWS BASED ON INTELLIGENT UNDERSTANDING OF MECHANICAL ENGINEERING DRAWINGS Jianping Liu^'^, Bangyan YQ\ Xiaohong Wu^, Miaoan Ouyang^ ^College of Mechanical Engineering,
More informationWeighted Suffix Tree Document Model for Web Documents Clustering
ISBN 978-952-5726-09-1 (Print) Proceedings of the Second International Symposium on Networking and Network Security (ISNNS 10) Jinggangshan, P. R. China, 2-4, April. 2010, pp. 165-169 Weighted Suffix Tree
More informationAssociation Rule Mining among web pages for Discovering Usage Patterns in Web Log Data L.Mohan 1
Volume 4, No. 5, May 2013 (Special Issue) International Journal of Advanced Research in Computer Science RESEARCH PAPER Available Online at www.ijarcs.info Association Rule Mining among web pages for Discovering
More informationImgSeek: Capturing User s Intent For Internet Image Search
ImgSeek: Capturing User s Intent For Internet Image Search Abstract - Internet image search engines (e.g. Bing Image Search) frequently lean on adjacent text features. It is difficult for them to illustrate
More informationDesign and Implementation of unified Identity Authentication System Based on LDAP in Digital Campus
Advanced Materials Research Online: 2014-04-09 ISSN: 1662-8985, Vols. 912-914, pp 1213-1217 doi:10.4028/www.scientific.net/amr.912-914.1213 2014 Trans Tech Publications, Switzerland Design and Implementation
More informationText Analytics (Text Mining)
CSE 6242 / CX 4242 Text Analytics (Text Mining) Concepts and Algorithms Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko,
More informationRealization of Automatic Keystone Correction for Smart mini Projector Projection Screen
Applied Mechanics and Materials Online: 2014-02-06 ISSN: 1662-7482, Vols. 519-520, pp 504-509 doi:10.4028/www.scientific.net/amm.519-520.504 2014 Trans Tech Publications, Switzerland Realization of Automatic
More informationInternational Journal of Advanced Computer Technology (IJACT) ISSN: CLUSTERING OF WEB QUERY RESULTS USING ENHANCED K-MEANS ALGORITHM
CLUSTERING OF WEB QUERY RESULTS USING ENHANCED K-MEANS ALGORITHM M.Manikantan, Assistant Professor (Senior Grade), Department of MCA, Kumaraguru College of Technology, Coimbatore, Tamilnadu. Abstract :
More informationMedical Data Mining Based on Association Rules
Medical Data Mining Based on Association Rules Ruijuan Hu Dep of Foundation, PLA University of Foreign Languages, Luoyang 471003, China E-mail: huruijuan01@126.com Abstract Detailed elaborations are presented
More informationPrivacy-Preserving of Check-in Services in MSNS Based on a Bit Matrix
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 15, No 2 Sofia 2015 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.1515/cait-2015-0032 Privacy-Preserving of Check-in
More information