Scheme of Big-Data Supported Interactive Evolutionary Computation

Similar documents
Interactive Computational Intelligence: A Framework and Applications

Paired Comparisons-based Interactive Differential Evolution

AN OPTIMIZATION GENETIC ALGORITHM FOR IMAGE DATABASES IN AGRICULTURE

Immune Optimization Design of Diesel Engine Valve Spring Based on the Artificial Fish Swarm

Santa Fe Trail Problem Solution Using Grammatical Evolution

Distributed Optimization of Feature Mining Using Evolutionary Techniques

Feature Selection for Multi-Class Imbalanced Data Sets Based on Genetic Algorithm

Heuristic Optimisation

Optimization of Association Rule Mining through Genetic Algorithm

Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding

Aero-engine PID parameters Optimization based on Adaptive Genetic Algorithm. Yinling Wang, Huacong Li

Traffic Signal Control Based On Fuzzy Artificial Neural Networks With Particle Swarm Optimization

Performance Assessment of DMOEA-DD with CEC 2009 MOEA Competition Test Instances

Genetic Algorithms Variations and Implementation Issues

Time Complexity Analysis of the Genetic Algorithm Clustering Method

Literature Review On Implementing Binary Knapsack problem

DETERMINING MAXIMUM/MINIMUM VALUES FOR TWO- DIMENTIONAL MATHMATICLE FUNCTIONS USING RANDOM CREOSSOVER TECHNIQUES

CutLeader Nesting Technology

Evolving SQL Queries for Data Mining

Structural topology optimization based on improved genetic algorithm

Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest

The Genetic Algorithm for finding the maxima of single-variable functions

A Naïve Soft Computing based Approach for Gene Expression Data Analysis

A THREAD BUILDING BLOCKS BASED PARALLEL GENETIC ALGORITHM

Evolutionary Multi-objective Optimization of Business Process Designs with Pre-processing

Thresholds Determination for Probabilistic Rough Sets with Genetic Algorithms

4/22/2014. Genetic Algorithms. Diwakar Yagyasen Department of Computer Science BBDNITM. Introduction

Genetic-PSO Fuzzy Data Mining With Divide and Conquer Strategy

Structural Optimizations of a 12/8 Switched Reluctance Motor using a Genetic Algorithm

Job Shop Scheduling Problem (JSSP) Genetic Algorithms Critical Block and DG distance Neighbourhood Search

Genetic Algorithm Performance with Different Selection Methods in Solving Multi-Objective Network Design Problem

ISSN: [Keswani* et al., 7(1): January, 2018] Impact Factor: 4.116

QUANTUM BASED PSO TECHNIQUE FOR IMAGE SEGMENTATION

Genetic Model Optimization for Hausdorff Distance-Based Face Localization

Robot Path Planning Method Based on Improved Genetic Algorithm

3D Face Modeling Support System for Avatar by Using Interactive Genetic Algorithm

An Evolutionary Algorithm for the Multi-objective Shortest Path Problem

Research on friction parameter identification under the influence of vibration and collision

Meta- Heuristic based Optimization Algorithms: A Comparative Study of Genetic Algorithm and Particle Swarm Optimization

DERIVATIVE-FREE OPTIMIZATION

The k-means Algorithm and Genetic Algorithm

Handling Multi Objectives of with Multi Objective Dynamic Particle Swarm Optimization

An Application of Genetic Algorithm for Auto-body Panel Die-design Case Library Based on Grid

Multi-objective Optimization

Multi-Objective Optimization Using Genetic Algorithms

Solving the Traveling Salesman Problem using Reinforced Ant Colony Optimization techniques

A Hybrid Genetic Algorithm for the Distributed Permutation Flowshop Scheduling Problem Yan Li 1, a*, Zhigang Chen 2, b

A Search Method with User s Preference Direction using Reference Lines

Creating Time-Varying Fuzzy Control Rules Based on Data Mining

A genetic algorithms approach to optimization parameter space of Geant-V prototype

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

CHAPTER 6 HYBRID AI BASED IMAGE CLASSIFICATION TECHNIQUES

NSGA-II for Biological Graph Compression

CS5401 FS2015 Exam 1 Key

Parameter Control of Genetic Algorithms by Learning and Simulation of Bayesian Networks

An Introduction to Evolutionary Algorithms

Inducing Parameters of a Decision Tree for Expert System Shell McESE by Genetic Algorithm

CHAPTER 6 REAL-VALUED GENETIC ALGORITHMS

Combinational Circuit Design Using Genetic Algorithms

5th International Conference on Information Engineering for Mechanics and Materials (ICIMM 2015)

Machine Learning: Algorithms and Applications Mockup Examination

Particle Swarm Optimization applied to Pattern Recognition

CHAPTER 4 GENETIC ALGORITHM

Research on urban landscape design using the interactive genetic algorithm and 3D images

March 19, Heuristics for Optimization. Outline. Problem formulation. Genetic algorithms

Introduction to Evolutionary Computation

A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data

Study on the Application Analysis and Future Development of Data Mining Technology

Improving Image Segmentation Quality Via Graph Theory

Deriving Trading Rules Using Gene Expression Programming

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Information Fusion Dr. B. K. Panigrahi

System of Systems Architecture Generation and Evaluation using Evolutionary Algorithms

GENETIC ALGORITHM VERSUS PARTICLE SWARM OPTIMIZATION IN N-QUEEN PROBLEM

Simulation of Robot Manipulator Trajectory Optimization Design

Genetic Algorithm for Dynamic Capacitated Minimum Spanning Tree

Clustering Analysis of Simple K Means Algorithm for Various Data Sets in Function Optimization Problem (Fop) of Evolutionary Programming

The Memetic Algorithm for The Minimum Spanning Tree Problem with Degree and Delay Constraints

Introduction (7.1) Genetic Algorithms (GA) (7.2) Simulated Annealing (SA) (7.3) Random Search (7.4) Downhill Simplex Search (DSS) (7.

A Recommender System Based on Improvised K- Means Clustering Algorithm

A New Selection Operator - CSM in Genetic Algorithms for Solving the TSP

Minimal Test Cost Feature Selection with Positive Region Constraint

Research and Design of Key Technology of Vertical Search Engine for Educational Resources

Monika Maharishi Dayanand University Rohtak

Department of Computer Science & Engineering The Graduate School, Chung-Ang University. CAU Artificial Intelligence LAB

Fuzzy C-means Clustering with Temporal-based Membership Function

Mechanical Component Design for Multiple Objectives Using Elitist Non-Dominated Sorting GA

FSRM Feedback Algorithm based on Learning Theory

An Efficient Algorithm for Finding the Support Count of Frequent 1-Itemsets in Frequent Pattern Mining

Learning Adaptive Parameters with Restricted Genetic Optimization Method

Genetic Algorithms. Kang Zheng Karl Schober

JEvolution: Evolutionary Algorithms in Java

Visualization of Crowd-Powered Impression Evaluation Results

Hybrid Differential Evolution Algorithm for Traveling Salesman Problem

Hybridization EVOLUTIONARY COMPUTING. Reasons for Hybridization - 1. Naming. Reasons for Hybridization - 3. Reasons for Hybridization - 2

A New Modified Binary Differential Evolution Algorithm and its Applications

Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical application of VOD

Dejong Function Optimization by means of a Parallel Approach to Fuzzified Genetic Algorithm

A Parallel Evolutionary Algorithm for Discovery of Decision Rules

Lamarckian Repair and Darwinian Repair in EMO Algorithms for Multiobjective 0/1 Knapsack Problems

Transcription:

2017 2nd International Conference on Information Technology and Management Engineering (ITME 2017) ISBN: 978-1-60595-415-8 Scheme of Big-Data Supported Interactive Evolutionary Computation Guo-sheng HAO 1,*, Na GUO 1, Gai-ge WANG 1, Zhao-jun ZHANG 2 and De-xuan ZOU 2 1 School of Computer Science and Technology, Jiangsu Normal University, Xuzhou, 221116, China 2 School of Electrical Engineering and Automation, Jiangsu Normal University, Xuzhou, 221116, China *Corresponding author Keywords: Interactive evolutionary computation, Big-data, Models repository, Transfer learning. Abstract. On the one hand, for the limitation that a user in Interactive Evolutionary Computation (IEC) is apt to feel tired during the solutions evaluation, the result of IEC is not so desirable. On the other hand, the many-years and many-users accumulated data of the evaluation in IEC is enormous, which could be a pool of information about users preference. By mining the big data of accumulated evaluation information, preference models can be established to form a model repository. When a new coming customer tries to get his/her favorite design aided by IEC, the nearest model in the repository can be selected as the result of model matching and the corresponding optimum can be directly transferred to accelerate the optimization. For this purpose, transfer learning is adopted. The simulated experiments validated this scheme and showed that the performance of big-data supported IEC is more related with volume of the models in big data, but less related with problems difficulty. Introduction Evolutionary Computation (EC) is problem independent and has been reported to perform relatively well on problems such as: search, optimization, and artificial intelligence [1]. However, in order to obtain the most favorable outputs from interactive systems that create or retrieve graphics or music, the outputs must be subjectively evaluated. It is difficult or even impossible to design explicit functions of human s evaluation. Interactive Evolutionary Computation (IEC) is an optimization method by incorporating human s preference as fitness for solutions. Therefore, IEC has been applied to many art designs such as personalized music, popular software menu and so on. The main problem in IEC is users fatigue [2], because a user in IEC is apt to feel tired after evaluating many solutions. Aiming at this problem, many studies to estimate fitness [3] based on knowledge learning have been reported. If fitness can be estimated, the user can be replaced by the machine [4]. For this reason, in IEC, the information transferred from the user to system is generally insufficient for the system to well model the user s preference. This is shown in Figure 1, where the information about the user s preference is labeled as and the information that the system gets from the user at time is labels as. Then the information that the system has not attained is:. (1) Therefore, in order to well model the user s preference and to increase the quality of the optimum, one of methods is to decrease. This is the purpose for us to use big data in this paper. Nowadays, the big data has been applied in many areas, such as education, business and so on, to understand the users better. In the study of IEC, a situation is that accumulated evaluation data is enormous, and they were not jointly utilized. Taking them as a pool of users preference information, big data technique can be used to improve the performance of IEC without worsening the main problem in IEC, i.e. the user s fatigue. Therefore, the scheme of big-data supported IEC is put forward in this paper. 14

Figure 1. Insufficient information that the system attained from the user. Scheme of Big Data Supported Interactive Evolutionary Computation In the past years, many users and volunteers took part in IEC experiments or applications, and multi-dimensional preference evaluation data, including fuzzy type evaluation [5], accurately numeric type evaluation [6], rank type evaluation [7], eye-tracked evaluation [8], interval type evaluation [9], best-selection type evaluation [2] and so on, are stored. Label the set composed by these data as, where is the preference information about the user. Therefore, by taking the above dataset as a pool of users preference, valuable knowledge can be obtained and then be made use to accelerate the algorithm and to improve the optimum quality. The scheme of BD_IEC is shown in Fig. 2. In order to decrease, the information difference between the user and the system, the accumulated users preference, which composed of the big data, are utilized. First, according to the information, the user s preference model is established. After that, the model feature is abstracted. Secondly, compare with the sorted elements in preference features repository. Thirdly, features matching algorithms are used to get the optimum of. Fourthly, transfer the corresponding information to compensate the user s preference or to estimate the necessary absent information. Fifthly, the system actively select solutions, which could maximally eliminate the uncertainty for the system, to participate in the evolutionary process to get the user s preference. By BD_IEC, the information that evolution system can be used is not only, but also. Figure 2. Scheme of big data supported interactive evolutionary computation. The proposal BD_IEC is based on the hypotheses that under the same condition, the quality of the optimum attained is proportional to both the amount and the quality of dataset about users preference. It is easy to understand that problems with similar features also have similar optimum with a high probability. Those previously conquered problems with their optima stored in the big data can be 15

made use to accelerate the evolution and to increase the optimum quality for the current optimization problem. The optima stored in the big data can be calculated according to:, (2) where is the corresponding optimum to the problem feature which is nearest to that of the current optimized problem and is to be transferred to the current optimization. This is feasible for the reason that the optima of those problems with similar features also has similar optimum. With transfer learning aided BD_IEC, the system can make use both the current user s preference dataset and the other similar users preference data. Denote the users whose preference are stored in big data repository as, population in generation as, the evolutionary operators as, then there is: This means that the population is generated based on both evolutionary operators and TL aided BD. In IEC, it is well known that the amount of the user s preference information is limited by users fatigue, therefore, the quality of the information about the search space is vital. If the information is noisy[10], then the model of the users preference is not accurate. In IEC, the information that the system attains is not determined by the information quality but by the promising fitness, i.e. the solutions in new population proposed to user is determined by evolutionary operators, which obey the survival of fittest and other evolutionary rules. But in a long run, those solutions which can eliminate the uncertainty of information that system attains about the user s preference is not always less promising than those solutions that evolutionary operators generate. This is true especially under the condition that, the information of users preference is limited by users fatigue. Therefore, the higher quality the information about users preference, the better the algorithm performs. This is also the base for active learning based BD_IEC. Here the quality of the information is measured by the uncertainty can be most eliminated. Label the certainty of the system about the user s preference in generation as, the improvement of the certainty that an individual in in generation will bring is. Then active learning will select the individual: where is the search space as candidate individual, which is expected to accelerate the algorithm by decreasing the uncertainty. In implementation, the uncertainty is generally measured by entropy, i.e. the more entropy, the more uncertainty is. An actively selected solution will decrease the entropy for the user s preference in the system. For active learning, those history solutions that the user has evaluated compos of labeled dataset and other solutions in the search space belongs to the unlabeled dataset. Those solutions, that are important for the system to determine which model in repository is more close to the one of the current user, are filtered out. Those actively select solutions are located in search space and are embedded in the new population for the user s evaluation. With AL aided BD_IEC, there is: (3) (4) where is the information that stored in big data as the set of users preference. This means that the population is generated based on both evolutionary operators and TL aided BD. (5) Scheme of Big Data Supported IEC The scheme of BD_IEC includes includes transfer learning and active learning. Transfer learning carries out the optimum transfer to accelerate the optimization and active learning carries out 16

solutions recommendation for establishment of the user s model by which the system can generate promising solutions and improve the users satisfaction for the user. For transfer learning and active learning equipped BD_IEC, the solutions in next generation are from the results of the evolutionary operators, the transferred optimum and the actively selected individuals by active learning, therefore, there is: Experiments We carried out experiments by developing system based on jmetal[24], which is an object-oriented Java-based framework for single and multi-objective optimization and a number of classic and modern optimization algorithms were implemented. The design of web page is a time consuming, expensive and complex process. Web page is generally coded with HTML. The presentation semantics of HTML is generally defined by Cascading Style Sheets (CSS), which defines the elements' properties, such as positioning, layout, color and fonts[12]. Therefore, the optimization of HTML page style is based on CSS. The purpose of the page optimization is to get ones' satisfactory style. Different users have different preference for the style of the same web page. According to a user's subjectively satisfactory style, his/her preferred property values in CSS can be figure out and the values can be further used in other pages. In our experiments, we only consider the case that different users are faced with the same search space and to optimize the same web page for different style. The task is to optimize the values of properties in CSS. For the users who participated in the optimization, their preference as the evolution history information is stored in the big data repository. We code CSS value, which determine the style of the pages (or part of the pages), as genotype in genetic algorithm. Figure 3 illustrates the mechanism. (6) Figure 3. Mechanism of the optimization for web pages. The parameters of the experiments are as follows. Two algorithms being compared are: traditional Interactive Genetic Algorithm (IGA) and BD_IGA, and the population size for both algorithms is 20. The crossover operator for both algorithms is simulated-binary-crossover [13] with probability of 0.9 and the mutation operator for both algorithms is polynomial mutation [14] with probability of 0.1. The selection operator for both algorithms is the binary tournament selection [15]. The algorithm termination criterion for both algorithms is: the satisfactory solution is attained. The length of the string that encodes the property value of CSS is 8, and the search space is 256. This is the case that the problem is not large scale and optimized with big data based technique. The experiments were independently carried by 42 students. 17

The generation numbers for users to get the satisfactory design style are shorten by BD_IGA. Table 1 gives the comparison of the generation number of the two algorithms. One can see that BD_IGA performes better that traditional IGA. What more, from the subjective evaluation from user shows that by BD_IGA, they get more satisfactory solutions than traditional IGA. The subjective evaluation for the solutions are also better than that in traditional IGA. Table 1. Number of Generations to Get the Satisfactory Solutions. min max mean standard deviation Traditional IGA 5 25 19.3 5 3.54 BD_IGA 3 14 8.34 2.38 Conclusion We take the many-years accumulated dataset of the evaluation in IEC as the pool of information about users preference, and the big-data supported IEC, which is equipped with TL and AL were studied. The simulated experiments validate the efficiency and it shows that BD_EC and BD_IEC are more related with volume of the model in big data, but less related with problems difficulty. When the similar or the same problem is collected in big data before the optimization, the performance of the algorithm will be highly improved. Acknowledgement This research was financially supported by the National Science Foundation of China under Grants No. 61673196, 61503165, 61305149, 61403174. References [1] G.-S. Hao, C.-S. Chen, G.-G. Wang, Y.-Q. Huang, D.-X. Zou, and Z.-J. Zhang, Efficiency and Effectiveness Metrics in Evolutionary Algorithms and Their Application, in International Conference on Intelligent Computing, Fuzhou, China, (2015)1-12. [2] H. Takagi, Interactive evolutionary computation: Fusion of the capabilities of EC optimization and human evaluation, Proceedings of the IEEE, 89(2001)1275-1296. [3] G.-S. Hao, D.-W. Gong, and Y.-Q. Huang, Interactive Genetic Algorithms Based on Estimation of User's Most Satisfactory Individuals, in Sixth International Conference on Intelligent Systems Design and Applications, (2006)132-137. [4] G.-S. Hao, D.-W. Gong, Y.-Q. Shi, and X.-Y. Sun, Method of replacing the user with Machine in interactive genetic algorithm, Pattern Recognition & Artificial Intelligence, 19(2006)111-115. [5] G. S. Hao, X. J. Zhao, and Y. Q. Huang, Users' Fuzzy Cognition Knowledge Learning in Interactive Evolutionary Computation and its Application, Advanced Materials Research, 204-210(2011) 245-250. [6] D. Gong, J. Chen, X. Sun, and J. Sun, Evaluating individuals in interactive genetic algorithms using variational granularity, Soft Computing, vol. 19, no. 3, pp. 621-635, 2014. [7] G.-S. Hao, User's cognition reules in interactive genetic algorithm and their application, Ph. D, China University of Mining and Technology, 2009. [8] D. Pallez, P. Collard, T. Baccino, and L. Dumercy, Eye-Tracking Evolutionary Algorithm to minimize user fatigue in IEC applied to Interactive One-Max problem, in Proceedings of the 9th annual conference companion on Genetic and evolutionary computation, (2007)2883-2886. 18

[9] X.-Y. Sun, J. Ren, and D.-W. Gong, Interval-fitness interactive genetic algorithms with varying population size based on semi-supervised learning, Control Theory & Applications, 28(2011)610-618. [10] G.-S. Hao, Y.-q. Huang, D.-W. Gong, G.-s. Guo, and Y. Zhang, Fitness Noise in Interactive Evolutionary Computation and the Convergence Robustness, in Intelligent Systems Design and Applications, 2006. ISDA '06. Sixth International Conference on, (2006)429-434. [11] J. J. D. A.J. Nebro, M. Vergne, Redesigning the jmetal Multi-Objective Optimization Framework. (2015)1093-1100. [12] A. Mesbah, and S. Mirshokraie, Automated analysis of CSS rules to support style maintenance, in 34th International Conference on Software Engineering (ICSE), (2012)408-418. [13] R. B. Agrawal, K. Deb, and R. B. Agrawal, Simulated binary crossover for continuous search space, Department of Mechanical Engineering, Indian Institute of Technology, 1994. [14] K. Deb, and M. Goyal, A Combined Genetic Adaptive Search (Geneas) for Engineering Design, Computer Science and Informatics, 26(1996)30-45. [15] M. Chakraborty, and U. K. Chakraborty, An analysis of linear ranking and binary tournament selection in genetic algorithms, Proceedings of 1997 International Conference on Information, Communications and Signal Processing, (1997):407-411. 19