Query Optimization Using Fuzzy Logic in Integrated Database

Similar documents
CHAPTER 6 HYBRID AI BASED IMAGE CLASSIFICATION TECHNIQUES

Intelligent Control. 4^ Springer. A Hybrid Approach Based on Fuzzy Logic, Neural Networks and Genetic Algorithms. Nazmul Siddique.

Identification of Vehicle Class and Speed for Mixed Sensor Technology using Fuzzy- Neural & Genetic Algorithm : A Design Approach

Optimization of Makespan and Mean Flow Time for Job Shop Scheduling Problem FT06 Using ACO

Fuzzy-Genetic Approach to Optimize Machining Process Parameters of AWJM

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

Reduce Total Distance and Time Using Genetic Algorithm in Traveling Salesman Problem

GENETIC ALGORITHM BASED COLLABORATIVE FILTERING MODEL FOR PERSONALIZED RECOMMENDER SYSTEM

Optimization of fuzzy multi-company workers assignment problem with penalty using genetic algorithm

Family Relationship Identification by Using Extract Feature of Gray Level Co-occurrence Matrix (GLCM) Based on Parents and Children Fingerprint

Applying genetic algorithm on power system stabilizer for stabilization of power system

Query Optimization of A Distributed Database System Using Fuzzy Logic

INCREASING CLASSIFICATION QUALITY BY USING FUZZY LOGIC

JHPCSN: Volume 4, Number 1, 2012, pp. 1-7

Hybrid AFS Algorithm and k-nn Classification for Detection of Diseases

Scheduling of Independent Tasks in Cloud Computing Using Modified Genetic Algorithm (FUZZY LOGIC)

AUTOMATIC CLUSTERING AND OPTIMIZED FUZZY LOGICAL RELATIONSHIPS FOR MINIMUM LIVING NEEDS FORECASTING

Solving A Nonlinear Side Constrained Transportation Problem. by Using Spanning Tree-based Genetic Algorithm. with Fuzzy Logic Controller

A Network Intrusion Detection System Architecture Based on Snort and. Computational Intelligence

Design of Different Fuzzy Controllers for Delayed Systems

A Web Page Recommendation system using GA based biclustering of web usage data

Approach Using Genetic Algorithm for Intrusion Detection System

Cosine Similarity Measurement for Indonesian Publication Recommender System

Optimized Watermarking Using Swarm-Based Bacterial Foraging

Crew Scheduling Problem: A Column Generation Approach Improved by a Genetic Algorithm. Santos and Mateus (2007)

Optimization of Cutting Parameters for Milling Operation using Genetic Algorithm technique through MATLAB

Final Exam. Controller, F. Expert Sys.., Solving F. Ineq.} {Hopefield, SVM, Comptetive Learning,

1. Introduction. 2. Motivation and Problem Definition. Volume 8 Issue 2, February Susmita Mohapatra

Experiment of Query Optimization Techniques to get the Efficient Query

ROBUST-HYBRID GENETIC ALGORITHM FOR A FLOW-SHOP SCHEDULING PROBLEM (A Case Study at PT FSCM Manufacturing Indonesia)

Cross Layer Detection of Wormhole In MANET Using FIS

SIMULATION APPROACH OF CUTTING TOOL MOVEMENT USING ARTIFICIAL INTELLIGENCE METHOD

A study of hybridizing Population based Meta heuristics

Optimization Technique for Maximization Problem in Evolutionary Programming of Genetic Algorithm in Data Mining

Traffic Signal Control Based On Fuzzy Artificial Neural Networks With Particle Swarm Optimization

CLUSTERING CATEGORICAL DATA USING k-modes BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM

Route Optimization in MANET using FIGA

GA is the most popular population based heuristic algorithm since it was developed by Holland in 1975 [1]. This algorithm runs faster and requires les

Path Planning of Mobile Robots Via Fuzzy Logic in Unknown Dynamic Environments with Different Complexities

Selection of Defuzzification Method to Obtain Crisp Value for Representing Uncertain Data in a Modified Sweep Algorithm

A Framework for adaptive focused web crawling and information retrieval using genetic algorithms

Hybrid of Ant Colony Optimization and Gravitational Emulation Based Load Balancing Strategy in Cloud Computing

Argha Roy* Dept. of CSE Netaji Subhash Engg. College West Bengal, India.

Cell-to-switch assignment in. cellular networks. barebones particle swarm optimization

Reconfiguration Optimization for Loss Reduction in Distribution Networks using Hybrid PSO algorithm and Fuzzy logic

Query Processing and Interlinking of Fuzzy Object-Oriented Database

QUANTUM BASED PSO TECHNIQUE FOR IMAGE SEGMENTATION

CHAPTER 6 ORTHOGONAL PARTICLE SWARM OPTIMIZATION

Simple Bacteria Cooperative Optimization with Rank-based Perturbation

Modified K-Means Algorithm for Genetic Clustering

Design of Obstacle Avoidance System for Mobile Robot using Fuzzy Logic Systems

Comparison of Genetic Algorithm and Hill Climbing for Shortest Path Optimization Mapping

NETWORK FAULT DETECTION - A CASE FOR DATA MINING

Fuzzy Inspired Hybrid Genetic Approach to Optimize Travelling Salesman Problem

Duelist Algorithm: An Algorithm in Stochastic Optimization Method

International Journal of Digital Application & Contemporary research Website: (Volume 1, Issue 7, February 2013)

1 Lab 5: Particle Swarm Optimization

Optimization of Benchmark Functions Using Artificial Bee Colony (ABC) Algorithm

PATH PLANNING OF ROBOT IN STATIC ENVIRONMENT USING GENETIC ALGORITHM (GA) TECHNIQUE

Takagi-Sugeno Fuzzy System Accuracy Improvement with A Two Stage Tuning

METHODOLOGY FOR SOLVING TWO-SIDED ASSEMBLY LINE BALANCING IN SPREADSHEET

Fuzzy time series forecasting of wheat production

A Technique for Design Patterns Detection

Abstract Keyword Searching with Knuth Morris Pratt Algorithm

A NEW APPROACH TO FULLY FUZZY

Comparison of segmentation using fast marching and geodesic active contours methods for bone

LATENT SEMANTIC ANALYSIS AND WEIGHTED TREE SIMILARITY FOR SEMANTIC SEARCH IN DIGITAL LIBRARY

GENETIC ALGORITHM VERSUS PARTICLE SWARM OPTIMIZATION IN N-QUEEN PROBLEM

ERASURE-CODING DEPENDENT STORAGE AWARE ROUTING

Relevancy Measurement of Retrieved Webpages Using Ruzicka Similarity Measure

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2

AUTOMATED INFORMATION RETRIEVAL MODEL USING FP GROWTH BASED FUZZY PARTICLE SWARM OPTIMIZATION

Various Strategies of Load Balancing Techniques and Challenges in Distributed Systems

Metaheuristic Development Methodology. Fall 2009 Instructor: Dr. Masoud Yaghini

Risk Factor Assessment of Software Usability Using Fuzzy-Analytic Hierarchy Process Method

Regression Test Case Prioritization using Genetic Algorithm

AN EVOLUTIONARY APPROACH TO DISTANCE VECTOR ROUTING

Interval Type 2 Fuzzy Logic System: Construction and Applications

A New RR Scheduling Approach for Real Time Systems using Fuzzy Logic

Innovative Systems Design and Engineering ISSN (Paper) ISSN (Online) Vol.5, No.1, 2014

LOCAL CONTRAST ENHANCEMENT USING INTUITIONISTIC FUZZY SETS OPTIMIZED BY ARTIFICIAL BEE COLONY ALGORITHM

Search Of Favorite Books As A Visitor Recommendation of The Fmipa Library Using CT-Pro Algorithm

1 Lab + Hwk 5: Particle Swarm Optimization

CHAPTER 5 ENERGY MANAGEMENT USING FUZZY GENETIC APPROACH IN WSN

Classification and Optimization using RF and Genetic Algorithm

Background Fuzzy control enables noncontrol-specialists. A fuzzy controller works with verbal rules rather than mathematical relationships.

FUZZY LOGIC CONTROL. Helsinki University of Technology Control Engineering Laboratory

Effect of Cleaning Level on Topology Optimization of Permanent Magnet Synchronous Generator

European Journal of Science and Engineering Vol. 1, Issue 1, 2013 ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM IDENTIFICATION OF AN INDUCTION MOTOR

Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding

Aristoteles University of Thessaloniki, GREECE.

Multi-Objective Discrete Particle Swarm Optimisation Algorithm for. Integrated Assembly Sequence Planning and Assembly Line Balancing

CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL

Genetic Tuning for Improving Wang and Mendel s Fuzzy Database

Hybrid of Genetic Algorithm and Continuous Ant Colony Optimization for Optimum Solution

Advance Clustering Technique for Query Optimization in Distributed Database

Fast Efficient Clustering Algorithm for Balanced Data

Fuzzy Region Merging Using Fuzzy Similarity Measurement on Image Segmentation

The Travelling Salesman Problem. in Fuzzy Membership Functions 1. Abstract

Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest

Transcription:

Indonesian Journal of Electrical Engineering and Computer Science Vol. 4, No. 3, December 2016, pp. 637 ~ 642 DOI: 10.11591/ijeecs.v4.i3.pp637-642 637 Query Optimization Using Fuzzy Logic in Integrated Database Susana*, Suharjito Magister in Information Technology, Binus Graduate Program, Bina Nusantara University, Jakarta, Indonesia *Corresponding author, e-mail: elf.shinee@gmail.com Abstract Query optimization in integrated database can t be separated from data processing method. In order to have faster query response time, a method to optimize queries is required. One of many methods that can be used for query optimization is using fuzzy logic with Tsukamoto inference system. Value set on each variable is defined membership functions and Tsukamoto inference system used in determining these rules or the terms of query results, then apply it into query method or query line structure. The application of fuzzy logic inference systems with Tsukamoto can accelerate query response time, and will have more significant difference when the amount of selected data is greater. Keywords: query optimization, fuzzy logic, database Copyright 2016 Institute of Advanced Engineering and Science. All rights reserved. 1. Introduction Technology and data is becoming a very important part in the survival of a company's business. As the company grows, the greater the (amount of) data that must be processed. Such data must be analyzed to be used to support business decision making. Analysis for any type of decisions must also be done by considering different factors. For certain types of transactions made by specific customers, negotiation must be done with the grantor, which slows down the transaction process, to accelerate this process; a categorical list of consumers is made. With the list of categories of consumers, trading decisions can be directly carried out by the system. But the list of categories of consumers obtained from querying the current data provide considerable comparative results with a list of categories created by the grantor's decision. This means that the decision must still be made manually, because the results from the system cannot be used to make decisions. The significant difference between the results of the list of categories of consumers of the system and a list of categories of consumers of the decision maker occurs because the system makes decisions based only on the exact figure (firmly) from the factors used as reference, while decision maker (human) makes a decision based on estimates mainly on factors that are considered more important. Therefore, it needs to be optimized in the query that is used to obtain the list of categories of consumers. Use factors that affect query response time, among others are: missing indexes, inexact statistics, badly written queries, deadlocks, cursors, excessive fragmentation of indexes, and frequent recompilation of queries [1]. Optimization of query response time, especially on a large database with the number of joins that a lot can be done by using a genetic algorithm to obtain the best selection methods and the fitness function used in processing and the process of chromosomal mutations will shorten the response time [2]. In this study, query optimization will be carried out by using fuzzy logic. The use (of) query with fuzzy logic will accelerate query processing time because of the limitations use on the data that will be selected [3]. The use of fuzzy logic can not only improve the response time, but can also improve the selection results data obtained primarily related to the fuzzy logic so as to produce a system that is flexible and can handle general human demand [4]. The use of fuzzy logic will be a solution to problems in this study because fuzzy logic provides results that are can also use uncertain or it can be said to be approaching the human mind [5]. Received September 16, 2016; Revised November 10, 2016; Accepted November 29, 2016

638 ISSN: 2502-4752 2. Related Works Siallagan utilizes genetic algorithms to optimize the database query by using the virtual tables represented in the query tree to generate new parse tree used left-deep strategy with two methods of crossover, namely: M2S crossover and crossover Chunk [6]. Raipukar perform optimization in distributed databases by using fuzzy logic role. Queries on distributed data are expressed in the form of relational calculus relations resulting fuzzy queries [3]. Tiwari optimizes queries in a distributed database with some evolutionary algorithms such as Ant Colony Optimization (ACO), Genetic Algorithms (GA), and Particle Swarm Optimization (PSO). Each optimization results obtained are then compared, in order to obtain that ACO provides the most optimum results [7]. Syaifudin implement online analitycal Hybrid processing (HOLAP) for a distributed database query optimization by using genetic algorithms on database clustering process [8]. Barbhuiya create architectural models of optimal query using a genetic algorithm with genetic programming approach to optimize complex queries [9]. 3. Research Method The first step of this research is literature study, collect data into the test sample, analyze the condition of the current database, determine variables, create membership function, and determine the rules, process optimization, evaluation of optimization results and then conclusions and suggestions. The research will begin by describing the background, and then determine the purpose and scope of this study. The study of literature becomes necessary to deepen the understanding of the optimization of fuzzy membership functions, fuzzy logic and Fuzzy Inference System Tsukamoto. Second step, collect the data needed to conduct trials, and ensure the company is granted permission to use the data. Third step, analyze the condition of the existing database. The next is determined variables into a factor in query optimization, making the membership function in each set at a predetermined variable, then define the rules in query optimization. After that, using the fuzzification membership functions that have been made and to do query optimization by inference using Fuzzy Inference System Tsukamoto. From the results of inference, defuzzification will be conducted to get the result in crisp set. Fourth step, query optimization results will be evaluated to see a comparison of the query before and after optimization. In the process of fuzzification Fuzzy Inference System Tsukamoto, use the following steps: 1) Fuzzification the crisp data. Fuzzification carried out in accordance with the membership function of each variable predetermined. 2) Results fuzzification inference will be conducted by the rules (rules) that have been determined. 3) After the process of inference, the next step is to do defuzzification of the results of the inference obtained, in order to obtain a query that will produce data category of consumers which is the output variable in this study. 4. Results and Analysis 4.1. Comparison before and after Optimization The test results are made will be compared to determine the difference between the response time before optimization with fuzzy logic and the response time after optimization with fuzzy logic. Comparison of average response time required for query can be seen in the Figure 1. Based on Figure1 it can be seen that the response time after the optimization with fuzzy logic is faster than the response time prior to optimization. This is because with the use of fuzzy logic in modeling rules then there is a limitation of data query region so that the query is run on a more specific area. In addition, with the use of valuable linguistic variables, the search query is easier because the query only search the data by comparing the word on every decision variables. Response time difference obvious when doing quite a lot of data selection. The lesser amount of data to be selected, the fewer differences in response time. IJEECS Vol. 4, No. 3, December 2016: 637 642

IJEECS ISSN: 2502-4752 639 Figure 1. Comparison of Before and After Optimization Furthermore, the T-Test determines whether there are significant differences between the response time before optimization and response time after optimization. Here are the results Paired T-Test before and after optimization with fuzzy logic: Figure 2. Paired T-Test between Response Time before Optimization and After Optimization In testing using Paired T-Test between response time before optimization and response time after optimization, with a significance level of 5%, the value of the t-table obtained amounted to 2.1315 and t value by 8.528. T count> t table, it shows that there is a significant difference between the response time before and after optimization with fuzzy logic. 4.2. Comparison of Optimization with Fuzzy Logic and Genetic Algorithms Comparisons are also made between the query optimization using fuzzy logic with optimizations performed by using a genetic algorithm. Comparison of average response time between optimization with genetic algorithms and fuzzy logic can be seen in the Figure 2. Figure 3 shows that the response time comparison of fuzzy logic and genetic algorithms is approximately 0.763 to 1.065. The amount of data that is selected does not significantly affect the query that is optimized with genetic algorithm, but the results of optimization with genetic algorithm similar to results obtained from queries prior to optimization, while the number of data selected on queries that are optimized with fuzzy logic is more optimal for close to the results determined manually. Table 1 compares of the number of records that are selected manually (by humans), before the query optimization, query-optimized fuzzy logic and queries are optimized with genetic algorithm. Query Optimization Using Fuzzy Logic in Integrated Database (Susana)

640 ISSN: 2502-4752 Figure 3. Comparison of Optimization with Fuzzy Logic and Genetic Algorithm Furthermore, the Paired T-Test to determine whether there are significant differences between the response time optimized with genetic algorithm and response time optimized with fuzzy logic (Figure 4). Here are the results Paired T-Test query with genetic algorithms and fuzzy logic: Figure 4. Paired T-Test between Response time with Genetic Algorithm and Fuzzy Logic In testing using Paired T-Test between the response time of queries that are optimized with genetic algorithm and the response time of queries that are optimized with fuzzy logic, with the same test that the significance level of 5%, the value t-table obtainedamounted to 2.1315 and the value of t amounted to 4.644. T count> t table, it shows that there are significant differences between the response time before and after optimization with fuzzy logic. The next evaluation is to compare the selection results obtained from the man-made selection, before the query optimization, query after optimization with fuzzy logic and query with genetic algorithms. The following table is a comparison of the number of records that are selected manually (by humans), before the query optimization, query-optimized fuzzy logic and queries are optimized with genetic algorithm. Table 1 show that the amount of data the result of a query that is optimized using a fuzzy logic approach results manually determined by a ratio of 0.706 to 1.136. This is because the results of fuzzy logic approach using human logic membership functions and rules given. The next step is to do Precision, Recall and Accuracy Test to determine the level of accuracy of the information requested using the answers given by the system (precision), the success rate in finding information back (recall), as well as the degree of closeness between the predicted value compare with the actual value (accuracy). The data that will be used to IJEECS Vol. 4, No. 3, December 2016: 637 642

IJEECS ISSN: 2502-4752 641 determine the accuracy would be priority consumer data from each month. The formula used for calculating the value of precision and recall can be described as follows Table 1. Month May June July Table 1. Comparison of the Number of Records Number of Records Consumer Data Fuzzy Logic Genetic Algorithms Manual Before Optimization Optimization Optimization Priority 19 4 17 4 Rekommended 81 38 73 38 Considered 1127 909 1149 909 Not Recommended 885 1161 873 1161 Priority 17 7 12 7 Rekommended 109 34 94 34 Considered 1603 1224 1531 1224 Not Recommended 691 1155 783 1155 Priority 11 4 8 4 Rekommended 63 29 50 29 Considered 1488 1525 1690 1525 Not Recommended 1104 1108 918 1108 Fuzzy Logic Table 2. Confusion Matrix Precision, Recall and Accuracy Test Human (Manually) Not a Priority Priority Not a TN (True Negative) Correct Absence of FN (False Negative) Missing Priority Result Result Priority FP (False Positive) Unexpected Result TP (True Positive) Correct Result Precision = Recall = Accuracy = Precision, recall and accuracy test will be conducted on data obtained from the query before optimization and after optimization. Table 3. Comparison of the Average Percentage of Precision, Recall and Accuracy Test Before Optimization with Fuzzy Logic After Optimization with Fuzzy Logic Precision 100% 100% Recall 32.86% 77.60% Accuracy 99.54% 99.86% Table 3 shows the significant difference lies in the degree of success in finding the back of information (recall). Query that is optimized using fuzzy logic provides a higher yield compared to the prior query optimization, while the results of precision and accuracy tests did not have significant difference. This test shows that the query with fuzzy logic can be used on the system to accelerate the processes in the system to provide accurate results. 5. Conclusion In this study, it appears that the query optimization with fuzzy logic can improve the speed of query response time. The amount of data selected still affect the speed of query response time, but the time of selection of the data with the same number of records, it appears that query with fuzzy logic faster than the query without fuzzy logic. A significant difference occurred while processing the query for selecting data in a large number. The number and resulting data s are selected from the query using fuzzy logic is closer to the result of selection by humans with an accuracy rate of over 90%. Thus the results of Query Optimization Using Fuzzy Logic in Integrated Database (Susana)

642 ISSN: 2502-4752 query optimizations are useable decision making without going through the process of analysis so as to shorten the processing time. References [1] Corlatan CG, Lazar MM, Luca V, Petricica OT. Query Optimization Techniques in Microsoft SQL Server. Database System Journal. 2014; 5(2): 33-48. [2] Butey PK, Meshram S, Sonolikar RL. Query Optimization by Genetic Algorithm. Journal of Information Technology and Engineering. 2012; 3(1). [3] Raipurkar AR, Bamnote GR. Fuzzy Logic Based Query Optimization in Distributed Database. International Journal of Innovatibe Research in Computer and Communication Engineering. 2013; 1(2): 422-426. [4] Sharma P. Retrieval of Information Using Fuzzy Queries. International Journal of Engineering and Techniques. 2016; 2(3). [5] Mohiuddin MS. Performance comparison of conventional controller with Fuzzy logic controller using chopper circuit and Fuzzy tuned PID controller. Indonesian Journal of Electrical Engineering and Informatics (IJEEI). 2014; 2(4): 149-164. [6] Siallagan M, Sabariah MK, Sontya M. Optimasi Query Database Menggunakan Algoritma Genetik. In Seminar Nasional Aplikasi Teknologi Informasi. 2008: 53-57. [7] Tiwari MP, Chande SV. Query Optimization Strategies in Distributed Databases. International Journal of Advanced in Engineering Sciences. 2013; 3: 23-29. [8] Syaifudin R, S, Hartanto R. Review: Implementasi Holap Untuk Optimasi Query Sistem Basis Data Terdistribusi Dengan Pendekatan Algoritma Genetik. Jurnal Sistem Informasi Bisnis. 03. 2014: 164-171. [9] Barbhuiya SZ, Nandi MG. Query Optimization Using a Genetic Programming Approach. ADBU- Journal of Engineering Technology. 2016: 158-163. IJEECS Vol. 4, No. 3, December 2016: 637 642