Data- and Rule-Based Integrated Mechanism for Job Shop Scheduling

Similar documents
The Establishment of Large Data Mining Platform Based on Cloud Computing. Wei CAI

International Journal of Current Trends in Engineering & Technology Volume: 02, Issue: 01 (JAN-FAB 2016)

Yunfeng Zhang 1, Huan Wang 2, Jie Zhu 1 1 Computer Science & Engineering Department, North China Institute of Aerospace

Web Data mining-a Research area in Web usage mining

A Balancing Algorithm in Wireless Sensor Network Based on the Assistance of Approaching Nodes

EFFICIENT ATTRIBUTE REDUCTION ALGORITHM

The Memetic Algorithm for The Minimum Spanning Tree Problem with Degree and Delay Constraints

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2

Related Work The Concept of the Signaling. In the mobile communication system, in addition to transmit the necessary user information (usually voice

Multisource Remote Sensing Data Mining System Construction in Cloud Computing Environment Dong YinDi 1, Liu ChengJun 1

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

Classification with Diffuse or Incomplete Information

Creating Time-Varying Fuzzy Control Rules Based on Data Mining

Improvements and Implementation of Hierarchical Clustering based on Hadoop Jun Zhang1, a, Chunxiao Fan1, Yuexin Wu2,b, Ao Xiao1

An Efficient Approach for Color Pattern Matching Using Image Mining

An Efficient Approach for Model Based Test Path Generation

Research on friction parameter identification under the influence of vibration and collision

Integration of information security and network data mining technology in the era of big data

Classification Using Unstructured Rules and Ant Colony Optimization

Research on QR Code Image Pre-processing Algorithm under Complex Background

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS

CHAPTER 5 ANT-FUZZY META HEURISTIC GENETIC SENSOR NETWORK SYSTEM FOR MULTI - SINK AGGREGATED DATA TRANSMISSION

Obtaining Rough Set Approximation using MapReduce Technique in Data Mining

Analysis on the technology improvement of the library network information retrieval efficiency

Web Usage Mining: A Research Area in Web Mining

Exploration of Fault Diagnosis Technology for Air Compressor Based on Internet of Things

SOLVING THE JOB-SHOP SCHEDULING PROBLEM WITH A SIMPLE GENETIC ALGORITHM

INTELLIGENT SUPERMARKET USING APRIORI

Construction of the Library Management System Based on Data Warehouse and OLAP Maoli Xu 1, a, Xiuying Li 2,b

Metric and Identification of Spatial Objects Based on Data Fields

Research on Technologies in Smart Substation

Open Access A Sequence List Algorithm For The Job Shop Scheduling Problem

Information Push Service of University Library in Network and Information Age

Hybrid ant colony optimization algorithm for two echelon vehicle routing problem

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

Intrusion Detection Using Data Mining Technique (Classification)

A Framework for Source Code metrics

The Load Balancing Research of SDN based on Ant Colony Algorithm with Job Classification Wucai Lin1,a, Lichen Zhang2,b

A Row-and-Column Generation Method to a Batch Machine Scheduling Problem

A Novel Data Mining Platform Design with Dynamic Algorithm Base

Surrogate Gradient Algorithm for Lagrangian Relaxation 1,2

Study on the Application Analysis and Future Development of Data Mining Technology

Research Article Modeling and Simulation Based on the Hybrid System of Leasing Equipment Optimal Allocation

Genetic Algorithm for Job Shop Scheduling

Data Cleaning for Power Quality Monitoring

FSRM Feedback Algorithm based on Learning Theory

Test Cases Generation from UML Activity Diagrams

Parallel Computing in Combinatorial Optimization

Design and Implementation of High-Speed Real-Time Data Acquisition and Processing System based on FPGA

Research on Design and Application of Computer Database Quality Evaluation Model

A Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

A New Approach to Ant Colony to Load Balancing in Cloud Computing Environment

A hard integer program made easy by lexicography

Research and Application of Unstructured Data Acquisition and Retrieval Technology

Prediction of traffic flow based on the EMD and wavelet neural network Teng Feng 1,a,Xiaohong Wang 1,b,Yunlai He 1,c

Indoor optimal path planning based on Dijkstra Algorithm Yicheng Xu 1,a, Zhigang Wen 1,2,b, Xiaoying Zhang 1

3 The standard grid. N ode(0.0001,0.0004) Longitude

Mobile Cloud Multimedia Services Using Enhance Blind Online Scheduling Algorithm

A motion planning method for mobile robot considering rotational motion in area coverage task

Proceedings of the 2012 International Conference on Industrial Engineering and Operations Management Istanbul, Turkey, July 3 6, 2012

Dynamic Scheduling Based on Simulation of Workflow

Preprocessing of Stream Data using Attribute Selection based on Survival of the Fittest

Categorizing Migrations

BI-OBJECTIVE EVOLUTIONARY ALGORITHM FOR FLEXIBLE JOB-SHOP SCHEDULING PROBLEM. Minimizing Make Span and the Total Workload of Machines

Rapid Modeling of Digital City Based on Sketchup

Priority rule-based reconstruction for total weighted tardiness minimization of job-shop scheduling problem

Unit 2 : Computer and Operating System Structure

A Comparative Study of Data Mining Process Models (KDD, CRISP-DM and SEMMA)

2. Data Preprocessing

Binju Bentex *1, Shandry K. K 2. PG Student, Department of Computer Science, College Of Engineering, Kidangoor, Kottayam, Kerala, India

Application of Wang-Yu Algorithm in the Geometric Constraint Problem

On the Expansion of Access Bandwidth of Manufacturing Cloud Core Network

An Improved Frequent Pattern-growth Algorithm Based on Decomposition of the Transaction Database

Comparative analyses for the performance of Rational Rose and Visio in software engineering teaching

Research Article An Improved Topology-Potential-Based Community Detection Algorithm for Complex Network

A LOCAL SEARCH GENETIC ALGORITHM FOR THE JOB SHOP SCHEDULING PROBLEM

A priority based dynamic bandwidth scheduling in SDN networks 1

A REASONING COMPONENT S CONSTRUCTION FOR PLANNING REGIONAL AGRICULTURAL ADVANTAGEOUS INDUSTRY DEVELOPMENT

Multi-dimensional database design and implementation of dam safety monitoring system

New research on Key Technologies of unstructured data cloud storage

An Ant Colony Optimization Implementation for Dynamic Routing and Wavelength Assignment in Optical Networks

Dynamic Obstacle Detection Based on Background Compensation in Robot s Movement Space

ANT COLONY OPTIMIZATION FOR MANUFACTURING RESOURCE SCHEDULING PROBLEM

Log System Based on Software Testing System Design And Implementation

Design and Realization of Data Mining System based on Web HE Defu1, a

Distributed Service Discovery Algorithm Based on Ant Colony Algorithm

Research on Parallelized Stream Data Micro Clustering Algorithm Ke Ma 1, Lingjuan Li 1, Yimu Ji 1, Shengmei Luo 1, Tao Wen 2

Image Segmentation Techniques

Chongqing, China. *Corresponding author. Keywords: Wireless body area network, Privacy protection, Data aggregation.

An Architecture Model of Distributed Simulation System Based on Quotient Space

Design of Fault Diagnosis System of FPSO Production Process Based on MSPCA

UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA

Summary of Last Chapter. Course Content. Chapter 3 Objectives. Chapter 3: Data Preprocessing. Dr. Osmar R. Zaïane. University of Alberta 4

D DAVID PUBLISHING. Big Data; Definition and Challenges. 1. Introduction. Shirin Abbasi

Image Edge Detection Using Ant Colony Optimization

CHAOTIC ANT SYSTEM OPTIMIZATION FOR PATH PLANNING OF THE MOBILE ROBOTS

Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

Iterative Removing Salt and Pepper Noise based on Neighbourhood Information

Transcription:

Data- and Rule-Based Integrated Mechanism for Job Shop Scheduling Yanhong Wang*, Dandan Ji Department of Information Science and Engineering, Shenyang University of Technology, Shenyang 187, China. * Corresponding author. Email: wangyh@sut.edu.cn Manuscript submitted July, 214; accepted January 17, 215. doi:.1776/ijcce.215.4.3.18-186 Abstract: In this paper, a new data-based mechanism has been proposed to solve job shop scheduling problems (JSPs). The proposed method is an intelligent-algorithm-based approach implanted with dataand rule-based scheduling algorithm. The proposed data-based framework is based on decision tree algorithm and rule mining technique to solve the scheduling in job shop with abundant information and massive unforeseen events. A data- and rule- based scheduling system framework for JSP was developed, and then, the knowledge discovery and rule-based adaptive scheduling algorithm were developed. The simulation result of a simplified scheduling problem is presented as a preliminary validation to show the feasibility of the proposed integrated scheduling mechanism. The state-of-the art research on the key technologies of data-based scheduling is introduced together with some related research results of information based or rule based ones. The results show that data- and rule-based scheduling offers better solutions. Key words: Data-based, integrated scheduling mechanism, job shop, rule-based. 1. Introduction Scheduling is a decision making process which deals with the allocation of limited resources to tasks over time, which plays an important role in achieving the goal of system optimization. The job shop scheduling problems (JSPs) is one of the most particular hardest combinatorial optimization problems that have been proven to be NP-hard. JSPs presents a simplified model of many real production scheduling problems [1] in manufacturing practice, therefore it has attracted the attention of many researchers due to its wide applicability and inherent difficulty. However, in today s business environment, products need to be manufactured with huge product variety and large volume of small batches to achieving productivity and profitability, which make the actual manufacturing processes becoming increasingly complex. The uncertain and unforeseeable dynamic manufacturing environment presents new challenges for the existing JSPs research results. In such a complex manufacturing environments described above, vast amounts of data are collected in information system or database management systems from all involved areas in manufacturing management, using barcodes, RFID, sensors, vision systems etc. The collected manufacturing data contains valuable information that could be integrated within the manufacturing system to improve decision making and enhance productivity (Elovici and Braha 23). Recently, data-based scheduling has emerged as a new way to solving JSPs in such a manufacturing environment supporting by on- and off-line data accumulated 18

and stored in data base. It has drawn great attentions from both academia and industry to the application of data-based methods to the scheduling problems in complex manufacturing systems [2]. Data-based scheduling method has a distinct advantage in solving complex JSPs, however how to extract useful information and knowledge for scheduling from a large number of data is still an intractable problems. In this paper, we propose our preliminary research results in data-based JSP with a data-based and rule-based integrated scheduling mechanism is discussed. The remainder of this paper is organized as follows. Section 2 reviews some recent works related to the data-based scheduling. Then a data-based scheduling framework is developed and explained in Section 3. Details of our novel knowledge discovery approach as well as the rule-based scheduling algorithm are given in Section 4 and Section 5. A simple example is discussed in Section 6 to illustrate the efficient of the proposed method and this is followed by conclusions in Section 7. 2. Literature Review JSPs is a complex combinatorial optimization problem full of numerous constraints as well as uncertain or unpredictable changed condition factors especially in today s challenge manufacturing environment. Data-based scheduling gives a feasible pathway to deal with the above troublesome in JSPs. Recently, several data-based scheduling models or algorithms have been proposed, which integrated with data processing algorithm, simulation technique or other technical measures [3]. Since there are necessary relation between data and knowledge as well as rule, those knowledge or rule based scheduling approach would give references or lessons for data-based scheduling. We now review some of the literature pertaining to the field of data-based scheduling as well as the knowledge or rule based scheduling. 2.1. Data-Based Scheduling Data accumulated in the production process for scheduling can be classified into on- and off-line data, which includes information related with part, device, as well as other relevant aspects. Choudhary et al. [4] clearly indicated that it is necessary to adapt effective data processing methods, due to the concealment character of data, and data mining approach might become an important tool to obtain knowledge from data. To solve the data-based scheduling, Li et al. [2] proposed a data-based scheduling framework composed of data layer, model layer, and scheduling layer. Meanwhile, it indicated that integrating data-based scheduling with conventional scheduling would bridge the gap between research and application. For dynamic manufacturing system scheduling, Shahzad et al. [5] proposed a data mining based approach framework integrated the data mining optimization and simulation, for the dynamic job shop scheduling problems. The framework applied a tabu-search algorithm to find optimal scheduling solutions, and then these solutions are approximated with a decision-tree based learning algorithm. Compared with traditional scheduling methods, data-based scheduling has the characteristics of complexity. Consequently, the combination of data-based scheduling with existing scheduling methods and algorithm would be a realistic way. 2.2. Knowledge or Rule Based Scheduling Knowledge discovery is a process to obtain potential, novel, and useful knowledge from data, by using some or certain kinds of data processing techniques. The process of knowledge discovery can be regarded as a process of data processing that might be a specific step for data-based scheduling. Panwalkar and Iskander [6] presented a summary of over priority dispatching rules, and classified them into simple priority rules, heuristic scheduling rules and other rules. Each rule had been description in detail, with its features. Koonce and Tsai [7] used data mining methods to discover useful patterns from 181

data for JSPs. The training data were generated by the genetic algorithm firstly, and then the attribute-oriented induction method was used to obtain useful patterns of knowledge. The motivation of research on data-based scheduling is the limits of traditional scheduling facing to complex manufacturing systems [2]. It is naturally a new idea to solve complex scheduling problems, however, how to extract useful knowledge from related on- and off-line data to improve the operational performance of the JSPs is still a novel and unsolved issues. The aim of this paper is therefore to present a trying to use knowledge discovery and rule-based decision tree for data processing and usage in JSPs. 3. System Framework A new generation of methods and tools are required to assist humans in intelligently managing modern manufacturing system. Taking into account features on shop floor related to scheduling, a data-based scheduling framework for job shop is proposed as shown in Fig. 1. In the data-based jobshop scheduling, data processing is one of the basic and critical sections. This framework gives a platform for extracting knowledge or rules by processing on- and off-line data, which would be used for the decision making in scheduling. In this framework, data processing model as well as the followed rule-based algorithm would be a selective use. If there are no obvious changes emerging in manufacturing environment, the JSPs can be optimally solved with existing scheduling methods, and the data-based scheduling will not be triggered. On the other hand, on-line data as well as related off-line data would be processing to digging out rules and to make rational decisions under changed environment. In the proposed framework, there are several key points need to be interpreted to show the train of our thought. The following offered several explanations. 3.1. Data Sources Data are the basis and premise of the realization of data-based scheduling. These data may include both from different information systems in the enterprises, such as ERP, MES, ACP, SCADA, and from production scheduling process. The large amounts of data can be classified into two categories: off-line data and on-line data. The former includes related information of product, order, equipment, and other key element in shop. The latter is reflection of the real-time status of production scenes and their environment, including state information of equipment, WIP status, and so forth. Data Sources ERP MES APC Data Interface Product Data Order Data Equipment Status WIP Status Equipment Data Off-line Data Existing Scheduling Algorithms On-line Data Data Preprocessing Decision 按钮Tree Rule-based Optimization Algorithm Dispatching Rule Data-based Scheduling Fig. 1. Data-based scheduling framework. 182

3.2. Data Preprocessing The obtained data from manufacturing processes might be generally incomplete, inconsistent, and with noisy, due to the lag of updating or inaccurate transmission. Therefore, these data should be pre-processed, for the purpose of data-purification and noise reduction. Since these data would be used as training data to discover scheduling rules, aggregation and attribute selection need to be further carried out to ensure the completeness and reliability of data pre-processing. 1) Data cleaning is designed to eliminate data noise and handle missing data, where Data Smoothing Technique is used to get solutions. 2) Data aggregation is about to merge two or more objects into a single object. Techniques adopted might be support vector machines or fuzzy K-means methods. 3) Data feature subset selection is characteristic selection, which eliminating some irrelevant and redundant attributes. Commonly, technologies been used are computational intelligence methods. 4) Discretion will transfer continuous attributes into discrete attributes to facilitate certain algorithms using discrete value. Decision tree, fuzzy aggregation and other methods are might be used. 3.3. Decision Tree After the data have been pre-processed, data mining algorithms are used to obtain effective knowledge rules. Decision tree is one of the common used data mining algorithms [4], which has relatively small calculation and high intelligibility. It could derive rules from the training or sample data for scheduling. The branch nodes of the tree are made up of attribute values of training data, while the root node is made up of the most informative one among the training data. Further scheduling rules as well as knowledge will be acquired via constructing trees and pruning trees. The proposed framework also provided a flexible switching mechanism for fitting the actual shop floor environment. It usually runs in the regular mode, which making scheduling decision directly by these derived rules. However, if there are any non-routine incidents happened on the floor, the data-based scheduling modeler will be awaken to making a occasional decision using the rule-base intelligent scheduling algorithm as well as online data. Besides, the results of scheduling will be record as resources for acquiring new rules for future use under the similar occasion. 4. Process of Knowledge Discovery Knowledge discovery is a process of discovering interesting and useful knowledge from the objective data set [8]. For data-based scheduling, knowledge discovery can help for making helpful knowledge to making further suitable scheduling decision. Main steps of the process are specialized now. Step 1. Data selection. Determine the scope of objective data to be processed, and select corresponding data set related to the objective task. Step 2. Data pre-processing. This step include several operations such as cleaning data noise, eliminating duplicate data, deriving and repairing missing data, completing the data type conversion and et al. The object is to ensure the completeness, reliability and operability of the data using processes. Step 3. Data transformation. Data transformation will convert the data into a unified form which might suitable for mining. Operations include data summarizing or aggregating, standardization. Step 4. Data mining. Suitable data mining algorithms will be selected to obtain useful patterns and make further efforts to achieve effective knowledge acquisition. Step 5. Interpretation and knowledge presentation. This process will run iteratively until a rational and meaningful pattern has been gained. If a meaningful pattern has been identified, visualization and knowledge representation technology will be adopted to present the obtained knowledge for use. 183

5. Rule-Based Adaptive Scheduling Algorithm To solve the problem under the proposed data-based scheduling framework, an intelligent scheduling algorithm should be roused with the help of related rule from rule base. In this paper, A rule-based ant colony algorithm is elaborated to make the optimal decision where scheduling rules, such as the longest processing time (LPT), which are derived from the decision tree algorithm as the suitable rule for this occasion, are applied to the ant colony algorithm for the optimal scheduling decision making. Details of the algorithm include the following steps. Step 1. Initialization parameters, such as pheromone, number of ants, the influence of pheromone heuristic factor and pheromone self-heuristic factor. Step 2. Create a generation of ants, and construct paths to search the feasible solution in this iteration. Step 3. Optimize the local or global feasible solution, according to the dispatching rule in rule base Step 4. Adjust and update the pheromones, meanwhile, update the influence of pheromone heuristic factor and pheromone self-heuristic factor dynamically based on the ant colony algorithm [9]. Step 5. Judging whether the result meets the stopping criterion. If the answer is yes, export the best solution and go to Step 6, otherwise return to Step 1. Step 6. End the program. 6. Simulation and Results A preliminary simulation research is given to verify the effective of the proposed approach. To demonstrate the viability of the developed method, a miniature JSPs with two-job and three-machine has been considered. The process planning characteristics of the instance are described in the Table 1. There are two jobs in the example, and each has three different operations to be processed according to a given sequence. Table 1. Parameters of a Two-Job and Three-Machine Problem Jobs Operation sequence Machine alterative Processing time (s) O11 O12 M3 18 O13 O21 2 O22 2 O23 M3 J1 J2 Fig. 2 gives the result which adopts the conventional ant colony optimization algorithm [9], and the rule-based adaptive ant colony algorithm is adopted in Fig. 3. The minimum make-span of the conventional ant colony algorithm is 6, and the proposed data- and rule-based algorithm is 58, which gives a shorter minimum make-span. It suggests that the dispatching rule has a great impact on the total processing time. M3 1 12 28 6 5 11 3 2 5 6 3 Fig. 2. Gantt chart of a conventional algorithm. 184

1 3 11 2 12 48 58 2 4 48 58 2 3 Fig. 3. Gantt chart of the proposed data-based algorithm. 7. Conclusion In this paper, many attempts have been made to construct effective novel data- and rule-based scheduling algorithm with special decision tree implanted rule mining for solving flexible job shop scheduling problems. The data-based scheduling framework as well as accompanying algorithms is proposed, after a brief overview of scheduling strategy related to the data-based scheduling research is provided. The performance of the proposed algorithm has been compared with conventional algorithms, such as ant cloning scheduling. From the results of simulations, we can see, our approach, novel scheduling algorithm with data-based decision tree and rule mining uses ant cloning intelligent, take into account the current state of the scheduling environment, can make better scheduling decisions. However, the research and simulation is only preliminary work. Future work includes applying the proposed method to other scheduling problems with different constraints, especially for the scenario with machine disruption, job released delay, processing time prolonging or other similar environment. Acknowledgment This work is supported by Natural Science Foundation of Liaoning Province, China, under Grant 213222. References [1] Ziaee, M. (214). A heuristic algorithm for solving flexible job shop scheduling problem. International Journal of Advanced Manufacturing Technology, 71(1-4), 519-528. [2] Li, L., Sun, Z. J., Ni, J. C., & Qiao, F. (213). Data-based scheduling framework and adaptive dispatching rule of complex manufacturing systems. International Journal of Advanced Manufacturing Technology, 66, 1891-195. [3] Liu, M. (29). A survey of data-based production scheduling methods. Acta Automatica Sinica, 35(6), 785-86. [4] Choudhary, A. K., Harding, J. A., & Tiwari, M. K. (29). Data mining in manufacturing: a review based on the kind of knowledge. Journal of Intelligent Manufacturing, 2(5), 51-521. [5] Shahzad, A., Mebarki, N. & Irccyn, I. (2). Discovering dispatching rules for job shop scheduling problem through data mining. Proceedings of the 8th International Conference of Modeling and Simulation. [6] Panwalkar, S. S., & Iskander, W. (1977). A survey of scheduling rules. Operations Research, 25(1), 45-61. [7] Koonce, D. A., & Tsai, S. C. (2). Using data mining to find patterns in genetic algorithm solutions to a job shop schedule. Computers & Industrial Engineering, 38(3), 361-374. [8] Kurgan, L. A., & Musilek, P. (26). A survey of knowledge discovery and data mining process models. Knowledge Engineering Review, 21(1), 1-24. [9] Wang, W. X., Wang, Y. H., Yu H. X., & Zhang, C. Y. (213). Dynamic-balance-adaptive ant colony 185

optimization algorithm for job-shop scheduling. Proceedings of the Fifth Conference on Measuring Technology and Mechatronics Automation. Yanhong Wang received the PhD degree in control theory and control engineering in 23 from Jilin University, Changchun, China. She is currently a professor in the Department of Information Science and Engineering, Shenyang University of Technology, Shenyang, China. Her current research interests include production scheduling, distributed information processing, and enterprise information system. Dandan Ji is a graduate student in the Department of Information Science and Engineering, Shenyang University of Technology, Shenyang, China. Her research interests include data-based scheduling, data mining, and distributed information processing. She intends to expand research issues to big data and data-based information systems in the future. 186