Keywords: Case-Base, Software Effort Estimation, Bees Algorithm

Similar documents
Detection of Aberrant Data Points for an effective Effort Estimation using an Enhanced Algorithm with Adaptive Features

The Artificial Bee Colony Algorithm for Unsupervised Classification of Meteorological Satellite Images

An Optimized Analogy-Based Project Effort Estimation

Artificial bee colony algorithm with multiple onlookers for constrained optimization problems

Power Load Forecasting Based on ABC-SA Neural Network Model

Hybrid Bee Ant Colony Algorithm for Effective Load Balancing And Job Scheduling In Cloud Computing

Solving Travelling Salesman Problem Using Variants of ABC Algorithm

Enhanced ABC Algorithm for Optimization of Multiple Traveling Salesman Problem

The Evaluation of Useful Method of Effort Estimation in Software Projects

Optimization of Benchmark Functions Using Artificial Bee Colony (ABC) Algorithm

CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM

Toward Part-based Document Image Decoding

Solving the Scheduling Problem in Computational Grid using Artificial Bee Colony Algorithm

Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest

FITTING PIECEWISE LINEAR FUNCTIONS USING PARTICLE SWARM OPTIMIZATION

Ahmed T. Sadiq. Ali Makki Sagheer* Mohammed Salah Ibrahim

ABC Optimization: A Co-Operative Learning Approach to Complex Routing Problems

Load Balancing in Cloud Computing Priya Bag 1 Rakesh Patel 2 Vivek Yadav 3

A study of classification algorithms using Rapidminer

Particle Swarm Optimization Artificial Bee Colony Chain (PSOABCC): A Hybrid Meteahuristic Algorithm

NOVEL CONSTRAINED SEARCH-TACTIC FOR OPTIMAL DYNAMIC ECONOMIC DISPATCH USING MODERN META-HEURISTIC OPTIMIZATION ALGORITHMS

Bootstrap Confidence Intervals for Regression Error Characteristic Curves Evaluating the Prediction Error of Software Cost Estimation Models

Keywords Fuzzy, Set Theory, KDD, Data Base, Transformed Database.

Keywords: Cloud, Load balancing, Servers, Nodes, Resources

Centroid Neural Network based clustering technique using competetive learning

IMPLEMENTATION OF A HYBRID LOAD BALANCING ALGORITHM FOR CLOUD COMPUTING

International Journal of Information Technology and Knowledge Management (ISSN: ) July-December 2012, Volume 5, No. 2, pp.

Iteration Reduction K Means Clustering Algorithm

COLLEGE TIMETABLE SCHEDULING USING OPTIMISTIC HYBRID SCHEDULING ALGORITHM

An Efficient Analysis for High Dimensional Dataset Using K-Means Hybridization with Ant Colony Optimization Algorithm

Adaptive Filtering using Steepest Descent and LMS Algorithm

Enhanced Artificial Bees Colony Algorithm for Robot Path Planning

A NOVEL APPROACH FOR TEST SUITE PRIORITIZATION

International Journal of Digital Application & Contemporary research Website: (Volume 1, Issue 7, February 2013)

Metaheuristic Optimization with Evolver, Genocop and OptQuest

An Evolutionary Computation Approach for Project Selection in Analogy based Software Effort Estimation

Hybrid Approach for Energy Optimization in Wireless Sensor Networks

Character Recognition

Fast Efficient Clustering Algorithm for Balanced Data

International Journal of Advanced Research in Computer Science and Software Engineering

6.867 Machine Learning

3 No-Wait Job Shops with Variable Processing Times

Type-2 Fuzzy Logic Approach To Increase The Accuracy Of Software Development Effort Estimation

A MULTI-SWARM PARTICLE SWARM OPTIMIZATION WITH LOCAL SEARCH ON MULTI-ROBOT SEARCH SYSTEM

Journal of Asian Scientific Research FEATURES COMPOSITION FOR PROFICIENT AND REAL TIME RETRIEVAL IN CBIR SYSTEM. Tohid Sedghi

Solving Constraint Satisfaction Problems by Artificial Bee Colony with Greedy Scouts

CHAPTER 4 FUZZY LOGIC, K-MEANS, FUZZY C-MEANS AND BAYESIAN METHODS

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.

Optimized Watermarking Using Swarm-Based Bacterial Foraging

Comparison of Some Evolutionary Algorithms for Approximate Solutions of Optimal Control Problems

ENHANCED BEE COLONY ALGORITHM FOR SOLVING TRAVELLING SALESPERSON PROBLEM

Solving the Traveling Salesman Problem using Reinforced Ant Colony Optimization techniques

A Modified Black hole-based Task Scheduling Technique for Cloud Computing Environment

A NEW APPROACH TO SOLVE ECONOMIC LOAD DISPATCH USING PARTICLE SWARM OPTIMIZATION

A Firework Algorithm for Solving Capacitated Vehicle Routing Problem

ABCRNG - Swarm Intelligence in Public key Cryptography for Random Number Generation

R. R. Badre Associate Professor Department of Computer Engineering MIT Academy of Engineering, Pune, Maharashtra, India

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

Effects of Weight Approximation Methods on Performance of Digital Beamforming Using Least Mean Squares Algorithm

Exponential functions and equations Student Activity Sheet 3; use with Exploring Growth and decay

IMPROVING THE PARTICLE SWARM OPTIMIZATION ALGORITHM USING THE SIMPLEX METHOD AT LATE STAGE

Image Edge Detection Using Ant Colony Optimization

I. INTRODUCTION II. RELATED WORK.

A Deterministic Dynamic Programming Approach for Optimization Problem with Quadratic Objective Function and Linear Constraints

Module 1 Lecture Notes 2. Optimization Problem and Model Formulation

Clustering Algorithms for Data Stream

Unsupervised Learning

IMPROVED BUTTERFLY OPTIMIZATION ALGORITHM USING LOCAL SEARCH OPERATOR FOR CAPACITATED VEHICLE ROUTING PROBLEM

Ant Colonies, Self-Organizing Maps, and A Hybrid Classification Model

Curriculum Outcome Assesment using Subject Matter on the FE Examination.

CLUSTERING BIG DATA USING NORMALIZATION BASED k-means ALGORITHM

Sci.Int.(Lahore),28(1), ,2016 ISSN ; CODEN: SINTE 8 201

International Journal of Advanced Research in Computer Science and Software Engineering

Performance Analysis of Min-Min, Max-Min and Artificial Bee Colony Load Balancing Algorithms in Cloud Computing.

With data-based models and design of experiments towards successful products - Concept of the product design workbench

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2

Research Article QOS Based Web Service Ranking Using Fuzzy C-means Clusters

A Survey on Honey Bee Foraging Behavior and Its Improvised Load Balancing Technique

Artificial Bee Colony Based Load Balancing in Cloud Computing

Various Strategies of Load Balancing Techniques and Challenges in Distributed Systems

Artificial Bee Colony (ABC) Optimization Algorithm for Solving Constrained Optimization Problems

Predicting Porosity through Fuzzy Logic from Well Log Data

Traffic Signal Control Based On Fuzzy Artificial Neural Networks With Particle Swarm Optimization

ACONM: A hybrid of Ant Colony Optimization and Nelder-Mead Simplex Search

POWER AWARE BASED WORKFLOW MODEL OPTIMIZATION USING FUZZY BEE COLONY METHOD

Optimization of process parameter for maximizing Material removal rate in turning of EN8 (45C8) material on CNC Lathe machine using Taguchi method

Application of Fuzzy and ABC Algorithm for DG Placement for Minimum Loss in Radial Distribution System

Seismic regionalization based on an artificial neural network

Improving Results and Performance of Collaborative Filtering-based Recommender Systems using Cuckoo Optimization Algorithm

REAL-TIME SCHEDULING OF SOFT PERIODIC TASKS ON MULTIPROCESSOR SYSTEMS: A FUZZY MODEL

Handling Multi Objectives of with Multi Objective Dynamic Particle Swarm Optimization

On the Value of Ranked Voting Methods for Estimation by Analogy

A Novel Hybrid Self Organizing Migrating Algorithm with Mutation for Global Optimization

Argha Roy* Dept. of CSE Netaji Subhash Engg. College West Bengal, India.

Distribution System Network Reconfiguration by using Artificial Bee Colony Algorithm.

Available online at ScienceDirect. Procedia Computer Science 85 (2016 )

New Method for Accurate Parameter Estimation of Induction Motors Based on Artificial Bee Colony Algorithm

Genetic Model Optimization for Hausdorff Distance-Based Face Localization

ESTIMATION BY ANALOGY IN THE PERSPECTIVE EDUCATIONAL WEB HYPERMEDIA APPLICATIONS

Adaptive Spiral Optimization Algorithm for Benchmark Problems

Transcription:

Volume 5, Issue 6, June 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Bees Algorithm A Tool for Case Based Software Effort Estimation S. Bhavani Assistant Professor (Senior) VIT University, Vellore, India Abstract Software effort estimation has been one of the major challenge tasks in software engineering and has achieved a considerable interest from both industry and the research community. The case-base consists of several of software projects which contains the target case for which the effort has to be calculated and the history of successful projects. The analogy that has to be compared between different projects has to be found out. This paper proposed a method to retrieve project effort and find optimal number of analogies using project management activity network and bees algorithm. The bees algorithm will be used to search for analogies and features values that will be used to reduce the estimate errors. Keywords: Case-Base, Software Effort Estimation, Bees Algorithm I. INTRODUCTION One of the major challenge tasks in software engineering is software effort estimation and it has been one of the interest from both researchers and interest. So it maintains its dignity and been the soft cornered topic in software engineering. The importance of software effort estimation is because of its feasibility study, project bedding during software development for progress monitoring,risk evaluation and resource allocation. The most investigated models for solving the software effort estimationproblem is Machine Learning based prediction algorithms. Among them, Case- Based Reasoning has achieved reasonable identity because of its performance in prediction. CBR is a knowledge management technology(kmt) based on the premise that history almost repeats with itself which leads to the problem solving that can be based upon retrieval by similarities [1]. CBR has been used for several software engineering problems to solveill-defined problems or non-trivial or such as those identified by project managers including its support for software project management in predictions and lesson being learned [1]. The Figure1 illustrates the process of case based reasoning in software effort estimation. The case base consists of several numbers of cases described by a bunch of features that is divided into the two parts: problem description and the solutions. The problem description is a set of features whereas the solution can bedetermined by the effort required to accomplish the software project. However, software effort datasets that are characteristically and dramatically noisy datasets and case based reasoning methods which are more capable of handling the noisy datasets than the regression based technologies. But, like other machine learning methods, the performance of case based reasoning is dataset dependent so it has enormous space of configuration possibilities and the design options which are induced for each and every individual datasets [2]. So it is least surprise to see contradictory results and different performance figures when developing this model. Such design methodologies have much stronger impact on its accuracy and reliability of case based reasoning. The adjustment method is one of the most important parameters on case based reasoning as it fits the case in hand and minimizes the variations and deviations between a new founded case and retrieved historical cases. The adjustmentmethodology requires some of the parameters to be set such as number of analogy (i.e. Retrieved similar solutions) and the methods that are used to make the adjustment. This paper claims that, avoiding sticking to a fixed best number of performing analogies that should change from datasets to datasets which varies with the retrieved values. We can also use an alternative method to adjust case based reasoning by optimizing the number of analogies and features similarity coefficients. This paper employs the Bees algorithm (BA) to optimize number of analogy (K) and the weights that are used to adjust the features similarities values between the new case and the other K analogies. The Bees Algorithm employs the model of population based search algorithm, which was first described inthe year of 2005. This algorithm depicts the food foraging and searching behavior of swarms of honey bees. In its zeroth version, the algorithm employs a kind of neighborhood search which will be later combined with the random search and can be used for analogy optimization. This present paper calculatesthe effect on the improvement in the effort estimation accuracy in case based reasoning when the bees algorithm method is employed to find out the optimal number of analogy as well as the coefficient values which can be used to adjust the retrieved project efforts. With respect to our knowledge the bees algorithm has not yet been used in software effort estimation and this paper depicts its significance to provide more reliable results. II. SYSTEM OVERVIEW Case based software effort estimation makes a newprediction for a newly started project by retrieving the previously successful completed projects that has been figured out and remembered as successful historical projects. Even though case based reasoning shows the reliable performance figures for high rated datasets, it is still being suffered from local performance tuning problems when they are to be applied over another environment. 2015, IJARCSSE All Rights Reserved Page 1245

Fig.1. Case Based Reasoning- The overview process. Finding appropriate analogies which fits the procedure of adjustment and depicts the dataset characteristics is done by local tuning, where this process is a hectic process on its own kind of environment. The application characterizes the project with input as project tasks with duration. The Project Management Activity Network will output the three dimensions such as duration, distance and Fitness value. Apart from this it also gives the critical path value for the given input project. From this three dimensional outputs the Bees Algorithm will find the analogy. Using the various information provided the user can make the analysis.the memory used for this application must be in order that it is used in an optimized way. The various other applications might also be running in the system, it must see that the application developed takes a less time and uses the memory available in the system in an optimized way that it does not slow down the process of the other applications. Fig.2. Activity of project management activity network III. MODULAR DESCRIPTION A. Proposed Method Of Case Based Reasoning The adjustment of case based reasoning is a technique which is used to derive a newly founded value by minimizing the gap between the differences between the retrieved number of analogies and close target project for which the effort estimation is required to be calculated. Now It must be noted that this is an important and foremost step in Case based effort estimation as it showsthe structure and strength of analogies on the target case project. This technique should be represented mathematically as a function formula that identifies the differences between problem description of the identified target project and its number of analogies and also it shows the description in an attempt to generate the more reasonable and reliable solution set. In Case based effort predication the function is formulated as shown in Eq. 1: wherept is the project used for target and P1 to PN is the top most familiar K similar projects close to the target one. The similarity degree is calculated using Euclidean distance. F is the adjustment function procedure that is used to view the differences between Pt and all other top similar projects, and then convert these identified differences into the amount of changes in the effort value set. The adjustment function shown above is illustrated in equations 2 and 3, where wj is the optimization coefficient value. The proposed case based reasoning method exploits the searching capability of the bees algorithm to overcome the local tuning problems of effort estimation adjustment. More specifically, the job is to search for appropriate number of weights (wj) and K number of values such that the difference between the performance measures is minimized. 2015, IJARCSSE All Rights Reserved Page 1246

where M is the feature number, ftj is the jth feature value of the close target project. fij is the jth feature value of the analogy project Pi. Before initiating, the bees algorithm parameters must be set carefully, these parameters are as follows: size of the problem is represented as Q, scout bees is represented as n, sites selected out of n visited sites is represented as s, the best sites out of s selected sites is represented ase, bees recruited for best e sites is represented asnep, bees recruited for the other selected sites is represented asnsp, other bees number is represented asosp and patch initial size is represented asngh which includes its site and its neighborhood in addition to Stopping condition in which our study is to minimize MMRE value. The algorithm starts with an initial population of N scout bees which are to be employed to search for food. Each and every bee represents functional solution as a set of K analogy coefficient set of values. The scout bees are placed randomly in the given initial search space. The fitness computation process is computed, using Leave one out by cross validation, for each site which is visited by a scout bee by calculating the Mean Magnitude Relative Error (MMRE). This step is very essential for colony communication which shows the direction of flower patch which will be found, then its distance from the bee hive and its fitness value. This information helps the beescolony to send its bees to flower patches exactly, without using guides or maps. So, the best sites which isvisited by the highest fittest bees are being selected for its neighborhood search. The neighborhood search area is calculated by identifying the radius or diameter of the search area from best site which is considered as the important step in bees algorithm. The algorithm still continues in searching the neighborhood of the selected sites, so therefore the need to recruit more bees to search the best sites which may have promising results. The bees that can be selected directly according to the fitness value associated with the sites that are being visited. Alternatively, the fitness values that are used to calculate the probability of the bees that are being selected. Therefore the fittest and healthiest bee from each and every patch is targeted to form the next bee population value. Our focus is to reduce the number of points to be visited with more promising value. Finally the remaining number of bees are assigned to search for randomly new potential solutions. These steps are repeated and circulated until the end condition is met or iteration number has been finished. At the end of each and every iteration, the colony of bees hive will have two parts of its new population those that were the fittest representatives bees from a patch and those that have been sent out randomly from bees hive which failed to satisfy the fitness value. The pseudo code of bees algorithm is shown in Figure 3. Fig. 3.Bees algorithm workflow B. Methodology The prediction accuracy of several techniques is assessed using MMRE, PREDperformance measure. MMRE computes the mean of the percentage of error between actual and calculated project effort values as shown in Equation 4. PRED(0.25) is used for complementary condition to count the percentage of estimates that is calculated within less than 0.25 of the actual values. where λ is the number of projects that have magnitude relative error less than the value of 0.25 and N is the number of observations. 2015, IJARCSSE All Rights Reserved Page 1247

IV. OBSERVATION Before stepping into case based reasoning the parameter values of the bees algorithm must be set ina order such that it yields optimal analogy. Table 2.Bees algorithm sample values But, there is no way to identify the best number of parameters values of bees algorithm therefore we did some of the investigations to select the best number of values for all datasets used in this environment. The parameter values used in this paper are presented in Table 2: The obtained results show that there is some evidence that isfound using adjustment techniques which is better than that without adjustment as original case based reasoning. Fig. 4.Bees algorithm - pseudo code V. CONCLUSION This paper presents us a new and efficient method for calculating software effort estimation using combination of case based reasoning and Bees algorithm. We have used the bees algorithm to automatically specify the appropriate set of K analogy coefficient value set that are being used to adjust the retrieved closest number of targeted analogies. The results that we obtained shows much improvements on the prediction accuracy for Case based effort estimation. One of the most important advantages of the proposed method is that it does not prone to locally optimal solutions. This is due to the strength of the bees algorithm to perform local and global search randomly. While we are not yet guaranteed that the obtained performance values are the global optimum, the results obtainedis the best performance ever when all features are used without repetitions. The results of the current study proves the scope for further investigation, through better performance when adjustment to Case based effort estimation is experimented when compared with other set of results. Nevertheless, theresults without adjustment are significant as it will be useful for the further research to investigate and find out whether the use of feature weights can also viable in obtaining accurate estimated values. 2015, IJARCSSE All Rights Reserved Page 1248

REFERENCES [1] Mohammad Azzeh. "A replicated assessment and comparison of adaptation techniques for analogy-based effort estimation", EmpiricalSoftware Engineering, 09/23/2011 [2] Mohammad Azzeh. "Adjusted Case-Based Software Effort Estimation Using Bees Optimization Algorithm", Lecture Notes in Computer Science, 2011 [3] Auer M, Trendowicz A, Graser B, Haunschmid E, Biffl S (2006) Optimal project feature weights in analogybased cost estimation: Improvement and limitations. IEEE Trans SoftwEng 32:83 92. doi:10.1109/ TSE.2006.1599418 [4] Azzeh, M., Neagu, D., Cowling, P., 2010. Fuzzy grey relational analysis for software effort estimation, EmpirsoftwEng, 60-90 DOI. 10.1007/s10664-009-9113-0. [5] Azzeh, M., Neagu, D. & Cowling, P. (2008a), Fuzzy Feature Subset Selection for Software Effort Estimation, in proceedings of International workshop on software predictors PROMISE'08 (part of ICSE'08), Leipzig, Germany, 71-78. [6] Boehm, B. (1981), Software Engineering Economics, Prentice-Hall, NY. 2015, IJARCSSE All Rights Reserved Page 1249