AN OPTIMIZATION GENETIC ALGORITHM FOR IMAGE DATABASES IN AGRICULTURE

Similar documents
An Application of Genetic Algorithm for Auto-body Panel Die-design Case Library Based on Grid

Optimization of Association Rule Mining through Genetic Algorithm

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

A Hybrid Genetic Algorithm for the Distributed Permutation Flowshop Scheduling Problem Yan Li 1, a*, Zhigang Chen 2, b

DETERMINING MAXIMUM/MINIMUM VALUES FOR TWO- DIMENTIONAL MATHMATICLE FUNCTIONS USING RANDOM CREOSSOVER TECHNIQUES

Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest

The k-means Algorithm and Genetic Algorithm

Approach Using Genetic Algorithm for Intrusion Detection System

Aero-engine PID parameters Optimization based on Adaptive Genetic Algorithm. Yinling Wang, Huacong Li

A Modified Genetic Algorithm for Process Scheduling in Distributed System

Research on time optimal trajectory planning of 7-DOF manipulator based on genetic algorithm

Classification of Concept-Drifting Data Streams using Optimized Genetic Algorithm

Study on GA-based matching method of railway vehicle wheels

Scheme of Big-Data Supported Interactive Evolutionary Computation

THE DECISION OF THE OPTIMAL PARAMETERS IN MARKOV RANDOM FIELDS OF IMAGES BY GENETIC ALGORITHM

BP Neural Network Based On Genetic Algorithm Applied In Text Classification

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2

Santa Fe Trail Problem Solution Using Grammatical Evolution

Genetic algorithm based on number of children and height task for multiprocessor task Scheduling

5th International Conference on Information Engineering for Mechanics and Materials (ICIMM 2015)

Mining Interesting Positive and Negative Association Rule Based on Improved Genetic Algorithm (MIPNAR_GA)

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

Fast Efficient Clustering Algorithm for Balanced Data

Segmentation of Noisy Binary Images Containing Circular and Elliptical Objects using Genetic Algorithms

Application of Wang-Yu Algorithm in the Geometric Constraint Problem

Genetic-PSO Fuzzy Data Mining With Divide and Conquer Strategy

A Genetic Algorithm Approach for Clustering

ET-based Test Data Generation for Multiple-path Testing

Combinatorial Double Auction Winner Determination in Cloud Computing using Hybrid Genetic and Simulated Annealing Algorithm

Test Data Generation with A Hybrid Genetic Tabu Search Algorithm for Decision Coverage Criteria

Using The Heuristic Genetic Algorithm in Multi-runway Aircraft Landing Scheduling

M.Kannan et al IJCSET Feb 2011 Vol 1, Issue 1,30-34

Hybridization of Genetic Algorithm and Linear Programming for Solving Cell Formation Problem with Alternative Process Routings

C 1 Modified Genetic Algorithm to Solve Time-varying Lot Sizes Economic Lot Scheduling Problem

A Clustering Method with Efficient Number of Clusters Selected Automatically Based on Shortest Path

International Journal of Digital Application & Contemporary research Website: (Volume 1, Issue 7, February 2013)

Genetic Algorithms. Kang Zheng Karl Schober

Mining High Average-Utility Itemsets

A Parallel Evolutionary Algorithm for Discovery of Decision Rules

1. Introduction. 2. Motivation and Problem Definition. Volume 8 Issue 2, February Susmita Mohapatra

Design of student information system based on association algorithm and data mining technology. CaiYan, ChenHua

Multiprocessor Task Scheduling Using Hybrid Genetic Algorithm

Immune Optimization Design of Diesel Engine Valve Spring Based on the Artificial Fish Swarm

Using Genetic Algorithms in Integer Programming for Decision Support

Genetic Algorithms Variations and Implementation Issues

The Genetic Algorithm for finding the maxima of single-variable functions

Optimization Technique for Maximization Problem in Evolutionary Programming of Genetic Algorithm in Data Mining

A Novel Method for Activity Place Sensing Based on Behavior Pattern Mining Using Crowdsourcing Trajectory Data

How to Apply the Geospatial Data Abstraction Library (GDAL) Properly to Parallel Geospatial Raster I/O?

A Power Grid Comprehensive Evaluation Based on BP Neural Network

Solving A Nonlinear Side Constrained Transportation Problem. by Using Spanning Tree-based Genetic Algorithm. with Fuzzy Logic Controller

Time Complexity Analysis of the Genetic Algorithm Clustering Method

A Genetic Algorithm-Based Approach for Energy- Efficient Clustering of Wireless Sensor Networks

Optimization of Association Rule Mining Using Genetic Algorithm

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Research on Design and Application of Computer Database Quality Evaluation Model

Chapter 28. Outline. Definitions of Data Mining. Data Mining Concepts

AN EVOLUTIONARY APPROACH TO DISTANCE VECTOR ROUTING

Towards New Heterogeneous Data Stream Clustering based on Density

Genetic Algorithm for Finding Shortest Path in a Network

Towards Automatic Recognition of Fonts using Genetic Approach

A Technique for Design Patterns Detection

An experimental evaluation of a parallel genetic algorithm using MPI

An Evolutionary Algorithm for the Multi-objective Shortest Path Problem

A GENETIC ALGORITHM FOR CLUSTERING ON VERY LARGE DATA SETS

Solving Traveling Salesman Problem Using Parallel Genetic. Algorithm and Simulated Annealing

FSRM Feedback Algorithm based on Learning Theory

A Kind of Wireless Sensor Network Coverage Optimization Algorithm Based on Genetic PSO

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

A hierarchical network model for network topology design using genetic algorithm

A Method Based Genetic Algorithm for Pipe Routing Design

Literature Review On Implementing Binary Knapsack problem

A novel genetic algorithm based on all spanning trees of undirected graph for distribution network reconfiguration

Monika Maharishi Dayanand University Rohtak

A LOCAL SEARCH GENETIC ALGORITHM FOR THE JOB SHOP SCHEDULING PROBLEM

Hybrid Bionic Algorithms for Solving Problems of Parametric Optimization

Using a genetic algorithm for editing k-nearest neighbor classifiers

A Structural Optimization Method of Genetic Network Programming. for Enhancing Generalization Ability

Chapter 14 Global Search Algorithms

Web page recommendation using a stochastic process model

Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding

A Genetic Algorithm for Multiprocessor Task Scheduling

Role of Genetic Algorithm in Routing for Large Network

Comparison Study of Multiple Traveling Salesmen Problem using Genetic Algorithm

Optimizing the Coordinates Position for Optimal Placement of ZigBee for Power Monitoring From Remote Location

Reducing Graphic Conflict In Scale Reduced Maps Using A Genetic Algorithm

A NOVEL APPROACH FOR PRIORTIZATION OF OPTIMIZED TEST CASES

Genetic Model Optimization for Hausdorff Distance-Based Face Localization

Performance Assessment of DMOEA-DD with CEC 2009 MOEA Competition Test Instances

CHAPTER 6 HYBRID AI BASED IMAGE CLASSIFICATION TECHNIQUES

Neural Network Weight Selection Using Genetic Algorithms

THE MULTI-TARGET FIRE DISTRIBUTION STRATEGY RESEARCH OF THE ANTI-AIR FIRE BASED ON THE GENETIC ALGORITHM. Received January 2011; revised May 2011

Genetic Fourier Descriptor for the Detection of Rotational Symmetry

Feature Selection for Multi-Class Imbalanced Data Sets Based on Genetic Algorithm

Robot Path Planning Method Based on Improved Genetic Algorithm

A New Selection Operator - CSM in Genetic Algorithms for Solving the TSP

A Method of Hyper-sphere Cover in Multidimensional Space for Human Mocap Data Retrieval

Sequences Modeling and Analysis Based on Complex Network

Paul POCATILU, PhD Economics Informatics and Cybernetics Department The Bucharest Academy of Economic Studies

RESOLVING AMBIGUITIES IN PREPOSITION PHRASE USING GENETIC ALGORITHM

Transcription:

AN OPTIMIZATION GENETIC ALGORITHM FOR IMAGE DATABASES IN AGRICULTURE Changwu Zhu 1, Guanxiang Yan 2, Zhi Liu 3, Li Gao 1,* 1 Department of Computer Science, Hua Zhong Normal University, Wuhan 430079, China 2 School of Information Management, Wuhan University, Wuhan 430072,China 3 Wuhan Junxie Shiguan School, Wuhan 430079, China * Corresponding author, Tel: 027-62265691, E-mail: lgao@mail.ccnu.edu.cn Abstract: Data Mining is rapidly evolving areas of research that are at the intersection of several disciplines, including statistics, databases, pattern recognition, and high-performance and parallel computing. In this paper, we propose a novel mining algorithm, called ARMAGA (Association rules mining Algorithm based on a novel Genetic Algorithm), to mine the association rules from an image database, where every image is represented by the ARMAGA representation. We first take advantage of the genetic algorithm designed specifically for discovering association rules. Second we propose the Algorithm Compared to the algorithm in, and the ARMAGA algorithm avoids generating impossible candidates, and therefore is more efficient in terms of the execution time. 1. INTRODUCTION The image databases contain an enormous amount of information, and it is becoming more and more complex while its size continues to grow at a remarkable rate. So it can be exceedingly difficult for users to locate resources that are both relevant to their information needs and high in quality. Vast numbers of images have accumulated on the Internet and in entertainment, agriculture, education, and other multimedia applications. Therefore, how to mine interesting patterns from image databases has attracted more and more attention in recent years (G.chen, 2002). Many additional algorithms have been proposed for association rule mining. Also, the concept of association rule has been extended in many different ways, such as

1296 Changwu Zhu, Guanxiang Yan, Zhi Liu, Li Gao generalized association rules, association rules with item constraints, sequence rules etc. Apart from the earlier analysis of market basket data, these algorithms have been widely used in many other practical applications such as customer profiling, analysis of products and so on(gaoli, 2003). Genetic Algorithm (GA) is one self-adaptive optimization searching algorithm. GA obtains the best solution, or the most satisfactory solution through generations of chromosomes constant evolution includes the reproduce, crossover and mutation etc. operation, until a certain termination condition is coincident (K. Koperski, 1995). Association rules mining Algorithm Based on a novel Genetic Algorithm (ARMAGA) is an optimal algorithm combing GA with ARMA. In this paper we first take advantage of the genetic algorithm designed specifically for discovering association rules. Second we propose a novel spatial mining algorithm, called ARMAGA, Compared to the algorithm in (G.chen, 2002), and the ARMAGA algorithm avoids generating impossible candidates, and therefore is more efficient in terms of the execution time. 2. ASSOCIATION RULES Definition 1 confidence Set up I = i, i, i } for items of collection, for item in ( 1 j m), ( 1 j m) { 1 2 m { T 1, T N for lasting item, D = ) it is a trade collection, I( 1 i N) here T is the trade. Rule X Y is probability that X U Y concentrates on including in the trade. The association rule here is an implication of the form X Y where X is the conjunction of conditions, and Y is the type of classification. The rule X Y has to satisfy specified minimum support and minimum confidence measure (Shijue Zheng,2006). The support of Rule X Y is the measure of frequency both X and Y in D S ( = xy D (1) The confidence measure of Rule the bargain descend, in the meantime includes Y C ( x y) = S( S( (2) Definition 2 Weighting support T i i j X Y is for the premise that includes X in Designated ones project to collect I = {i 1, i2, i m }, each project i j is composed with the value w j of right (0 wj 1, 1 j m). If the rule is X Y, the weighting support is 1 S w( = w js( (3) k i xy And, the K is the size of the Set XY of the project. When the right value Wj is the same as i j, we calculating the weighting including rule to have the same support.

An Optimization Genetic Algorithm for Image Databases in Agriculture 1297 3. GENETIC ALGORITHM (GA) Genetic Algorithm (GA) is a self-adaptive optimization searching algorithm. GA obtains the best solution, or the most satisfactory solution through generations of chromosomes constant evolution includes reproduction, crossover and mutation etc. Here is the general description of this problem: F ( = a S( + b C( (4) a, b is constants, a 0, b 0, S( is the support, and C( is the confidence. 4. ASSOCIATION RULES MINING BASED ON A NOVEL GENETIC ALGORITHM 4.1 Encoding This paper employs natural numbers to encode the variable A ij. That is, the number of the lines of every range in the matrix A in which the element 1 exists is regarded as a gene. The genes are independent of each other. They are marked by A 1, A 2 A j,, A n, in which 1, ], [ 1, and A n may be a repeatedly A [ m j n] j equal natural number. When the distributive method at random is employed to produce the initial population comprised of certain individuals, the population must be in a certain scale in order to achieve the optimal solution on the whole. The best way is the generated M individuals randomly that the length is N then the chromosome bunch encoded by the natural number is calculated as the initial population. 4.2 The Fitness Formula (3) is properly transformed into: F( S( C( = W (5) C s + Wc Smin min Here, W C+ W s =1, W c 0, Ws 0, S min, is minimum support, and C min is minimum confidence. 4.3 Reproduction Operator Reproduction is the transmission of personal information from the father generation to the son generation. Each individual in each generation determines the probability that it can reproduce the next generation according to how big or small the fitness value is. Through reproducing, the number of excellent individuals in the

1298 Changwu Zhu, Guanxiang Yan, Zhi Liu, Li Gao population increases constantly, and the whole process of evolution head for the optimal direction. We are adopting roulette selection strategy; each individual reproduction probability is proportion to fitness value. 1) Compute the reproduction probability of all the individuals P i) = M i= 1 f ( i) ( (6) f ( i) 2) Generate a number r randomly, r=random [0, 1] 3) If P(0)+ P(1)+ + P(i-1)<r< P(0)+ P(1)+ + P(i),the individual i is selected into the next generation. 4.4 Crossover Operator Crossover is the substitution between two individuals of the father generation that is to generating new individual.the crossover probability Pc directly influences the convergence of the algorithm. The larger Pc is the most likely is the genetic mode of the optimal individual to be destroyed.however, the over-small of Pc can slow down the research process (Wu zhaohui,2005).here is the definition of the crossover operator: Computing crossover probability Pc 0. 3 ( f ( f ( ) 0. 9 Pc = fmax( f ( 0. 9 In which, the average fitness value of the population. f ( f ( f ( p f ( fmax is the maximum fitness value of the population, f (X) 4.5 Mutation Operator The role of the mutation operator lies in that it enables the whole population to maintain a certain variety through the abrupt change of the mutation operator when a local convergence occurs in the population. The selection of the mutation probability P m is the vital point because it influences the action and performance of the ARMNGA. If P m is over-small, the ARMNGA will become a pure random research.here is the definition of the mutations operator, computing the mutation probability P m (7) is

An Optimization Genetic Algorithm for Image Databases in Agriculture 1299 0. 009 ( f ( x ) f ( x ) ) 0. 1 f ( x ) f ( x ) Pm = f max ( x ) f ( x ) 0. 1 f ( x ) p f ( x ) f In which, max is the maximum fitness value of the population, is the average fitness value of the population. (8) f (X) 4.6 Termination Condition When the matching error ε 0 or the condition is not coincident, the process will naturally stop. 5. EXPERIMENTS ON SYNTHETIC DATA To check the research capability of the operator and its operational efficiency, such a simulation result is given compared with the GA in [2],The platform of the simulation experiment is a Dell power Edge2600 server (double Intel Xeon 1.8GHz CPU,1G memory, Redhat Linux 9.0). We first compare the performance of our proposed method with the algorithm in Fig. 1 shows the runtime vs. the size of an image for both algorithms, where the size of the image varies for the synthetic dataset. As the size of the image increases, the runtimes of both algorithms decrease; nevertheless, the runtime of the ARMAGA algorithm does not change very much. Fig.1 Runtime vs. image size Fig.2 shows the runtime vs. number of objects for both algorithm, where the number of objects varies from 25 to 100 for the synthetic dataset.since the average size of process and number of transaction are both fixed, the average support for the item sets decreases as the number of objects increases. Thus, the runtimes of both algorithms decrease slightly when the

1300 Changwu Zhu, Guanxiang Yan, Zhi Liu, Li Gao number of objects increases. Nevertheless, our proposed algorithm is faster than the algorithm in. Fig.2 Runtime vs. number of objects. From Fig.1 and Fig.2, we can educe that ARMAGA has a higher convergence speed and more reasonable selective scheme which guarantees the non-reduction performance of the optimal solution. Therefore, it is better than GA and ARMA through the theoretic analysis and the experimental results. 6. CONCLUSIONS The image data mining in agriculture is a newly researching hot point in database area. But general data mining get knowledge from large quantities of data. We propose an Association rules mining based on a novel Genetic Algorithm, designed specifically for discovering association rules. We compare the results of the ARMAGA with the results of (G.chen,2002), and, it is better than GA and ARMA through the theoretic analysis and the experimental results. REFERENCES G. Chen, Q. Wei. Fuzzy association rules and the extended mining algorithms, Information Sciences 147 (2002) 201 228. Gao Li, Li Dan, Dai Shangping. A mining Algorithm of constraint based association rules, journal of Henan University Vol.33 (2003) pp.55-58 K. Koperski, J. Han. Discovery of spatial association rules in geographic information databases, in: Proc. of International Symposium on Advance in Spatial Databases, SSD, LNCS, vol. 951, Springer Verlag, 1995, pp. 47 66. P.Y. Hu, Y.L. Chen, C.C. Ling. Algorithms for mining association rules in bag databases, Information Sciences 166 (2004) 31 47. Shijue Zheng, Wanneng Shu, Li Gao.Task Scheduling Using Parallel Genetic Simulated Annealing Algorithm,2006 IEEE International Conference on Service Operations and Logistics, and Informatics Proceedings June 21-23, 2006, Shanghai, pp46-50 Wu zhaohui. Association rule mining based on simulated annealing genetic algorithm, Computer Applications Vol.25 (2005) pp.1009-1011