Application of Data Mining in Manufacturing Industry

Similar documents
Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

ISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies

CHAPTER 4 METHODOLOGY AND TOOLS

Classifying Twitter Data in Multiple Classes Based On Sentiment Class Labels

Improving Quality of Products in Hard Drive Manufacturing by Decision Tree Technique

A SURVEY- WEB MINING TOOLS AND TECHNIQUE

PREDICTION OF POPULAR SMARTPHONE COMPANIES IN THE SOCIETY

The Data Mining usage in Production System Management

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

A Comparative Study of Data Mining Process Models (KDD, CRISP-DM and SEMMA)

DATA MINING AND ITS TECHNIQUE: AN OVERVIEW

An Effective Performance of Feature Selection with Classification of Data Mining Using SVM Algorithm

COMPARISON OF DIFFERENT CLASSIFICATION TECHNIQUES

Community edition(open-source) Enterprise edition

Data Mining. Introduction. Piotr Paszek. (Piotr Paszek) Data Mining DM KDD 1 / 44

A Comparative Study of Selected Classification Algorithms of Data Mining

Improving Quality of Products in Hard Drive Manufacturing by Decision Tree Technique

A Survey On Different Text Clustering Techniques For Patent Analysis

TIM 50 - Business Information Systems

Iteration Reduction K Means Clustering Algorithm

Frequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management

AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE

IJMIE Volume 2, Issue 9 ISSN:

IMPLEMENTATION OF CLASSIFICATION ALGORITHMS USING WEKA NAÏVE BAYES CLASSIFIER

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

DATA WAREHOUSING IN LIBRARIES FOR MANAGING DATABASE

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online):

WEKA: Practical Machine Learning Tools and Techniques in Java. Seminar A.I. Tools WS 2006/07 Rossen Dimov

Clustering of Data with Mixed Attributes based on Unified Similarity Metric

Comparative Study on Classification Meta Algorithms

An Improved Document Clustering Approach Using Weighted K-Means Algorithm

A Data Classification Algorithm of Internet of Things Based on Neural Network

Comparative Study of Clustering Algorithms using R

International Journal of Software and Web Sciences (IJSWS)

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

DATA MINING AND WAREHOUSING

An Efficient Clustering for Crime Analysis

International Journal of Advanced Research in Computer Science and Software Engineering

Dr. Prof. El-Bahlul Emhemed Fgee Supervisor, Computer Department, Libyan Academy, Libya

R07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis.

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

AMOL MUKUND LONDHE, DR.CHELPA LINGAM

Data Mining Techniques Methods Algorithms and Tools

Chapter 3. Foundations of Business Intelligence: Databases and Information Management

Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005

Correlation Based Feature Selection with Irrelevant Feature Removal

SK International Journal of Multidisciplinary Research Hub Research Article / Survey Paper / Case Study Published By: SK Publisher

TIM 50 - Business Information Systems

Effect of Principle Component Analysis and Support Vector Machine in Software Fault Prediction

Keywords: clustering algorithms, unsupervised learning, cluster validity

A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES

International Journal of Advance Engineering and Research Development. A Survey on Data Mining Methods and its Applications

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

A Framework for Detecting Unnecessary Industrial Data in ETL Processes

Research on Data Mining Technology Based on Business Intelligence. Yang WANG

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Data Mining Practical Machine Learning Tools And Techniques With Java Implementations The Morgan Kaufmann Series In Data Management Systems

Analysis of a Population of Diabetic Patients Databases in Weka Tool P.Yasodha, M. Kannan

CoE CENTRE of EXCELLENCE ON DATA WAREHOUSING

Index Terms Data Mining, Classification, Rapid Miner. Fig.1. RapidMiner User Interface

The Procedure Proposal of Manufacturing Systems Management by Using of Gained Knowledge from Production Data

NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM

Introduction to Data Mining

Keywords Clustering, Goals of clustering, clustering techniques, clustering algorithms.

Estimating Missing Attribute Values Using Dynamically-Ordered Attribute Trees

Data Mining Practical Machine Learning Tools and Techniques

Fraud Detection Using Random Forest Algorithm

Database and Knowledge-Base Systems: Data Mining. Martin Ester

An Empirical Study of Lazy Multilabel Classification Algorithms

RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE

Summary. RapidMiner Project 12/13/2011 RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE

Time: 3 hours. Full Marks: 70. The figures in the margin indicate full marks. Answers from all the Groups as directed. Group A.

Data mining with sparse grids

Performance Analysis of an After Cooler Used In High Power Engine Using CFD

Evaluation of Neural Networks in the Subject of Prognostics As Compared To Linear Regression Model

Different Classification Technique for Data mining in Insurance Industry using weka

DATA ANALYSIS WITH WEKA. Author: Nagamani Mutteni Asst.Professor MERI

Web Data mining-a Research area in Web usage mining

Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing

An Algorithm for user Identification for Web Usage Mining

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

Advance analytics and Comparison study of Data & Data Mining

Web Mining Evolution & Comparative Study with Data Mining

CSE 626: Data mining. Instructor: Sargur N. Srihari. Phone: , ext. 113

Relevance Feature Discovery for Text Mining

Web Mining Using Cloud Computing Technology

Analysis of Dendrogram Tree for Identifying and Visualizing Trends in Multi-attribute Transactional Data

Management Information Systems Review Questions. Chapter 6 Foundations of Business Intelligence: Databases and Information Management

A REVIEW PAPER ON DATA MINING TECHNIQUES, ALGORITHMS AND TOOLS

Keywords Hadoop, Map Reduce, K-Means, Data Analysis, Storage, Clusters.

ADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA

A Comparison of Text-Categorization Methods applied to N-Gram Frequency Statistics

ABSTRACT I. INTRODUCTION II. METHODS AND MATERIAL

Graph Sampling Approach for Reducing. Computational Complexity of. Large-Scale Social Network

Prototyping DM Techniques with WEKA and YALE Open-Source Software

Review on Data Mining Techniques for Intrusion Detection System

Analyzing Outlier Detection Techniques with Hybrid Method

Implementation of JIT Technique in a Mixed Model Assembly Line: A Case Study

An Efficient Algorithm for Finding the Support Count of Frequent 1-Itemsets in Frequent Pattern Mining

Security analytics: From data to action Visual and analytical approaches to detecting modern adversaries

Transcription:

International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 3, Number 2 (2011), pp. 59-64 International Research Publication House http://www.irphouse.com Application of Data Mining in Manufacturing Industry 1 V.K. Jha and 2 R.K. Singh 1 Assistant Professor, Department of Information Technology, BIT Mesra, Ranchi, India 2 Professor & Head, Department of Information Technology, MIT, Muzaffarpur, India Abstract The paper reviews applications of data mining in manufacturing engineering, in particular production processes, operations, fault detection, decision support, and product quality improvement. Data mining offers tools for discovery of relationships, patterns, and knowledge in large databases. The knowledge extraction process is computationally complex and therefore a subset of all data is normally considered for mining. In the case study reported in this paper, a data mining approach is applied to extract knowledge from a data set. The extracted knowledge is helpful for the prediction and prevention of manufacturing faults in engines. Keywords: Data Mining, Manufacturing, Rapid Miner, JIT, Product development, Data interpretation, case study of automobile industry, Product Development, Software based Data analysis. Introduction Many organizations have based their aggressive strategies around automation and new production technology by adopting different applicable tools. Amongst these tools is Data Mining. The advancement in the computing technologies has made owning and running knowledge management systems and data warehouses or data marts easier and more cost efficient than it has ever been. This is particularly desired in order to keep pace with the changing customer s and business s needs. Managers are required to identify that the workforce knowledge and know-how collection must be strategically positioned to progress. An organization must have the ability to perform daily operations and continuous improvements. Different technologies can be used to track and monitor the organization s performance along several dimensions to ensure that the trends and values are on track. If not, critical signals can be inferred and

60 V.K. Jha and R.K. Singh actions can be initiated to improve the physical or resource value chains and their interactions with associated information and knowledge value chains. Data Mining can be used in the following manufacturing domain: Data Mining in product design Data Mining in manufacturing lead time estimation Data Mining in quality Data Mining in supply chain management Data Mining in Just In Time manufacturing environment Methodology Data Collection Preparing input for a data mining investigation usually consumes the bulk of the effort and time invested in the entire data mining process. When beginning work on a data mining problem, it is first necessary to bring all the data together into a set of instances. The data must be assembled, integrated, and cleaned up. Data Mining tool The Data Mining tool used on this paper is Rapid Miner (formerly YALE). Rapid Miner (fig 1)is the most comprehensive open-source software for intelligent data analysis, data mining, knowledge discovery, machine learning, predictive analytics, forecasting, and analytics in business intelligence (BI). Rapid Miner provides more than 400 data mining operators, a graphical user interface (GUI), an online tutorial with hands-on data mining applications, a comprehensive PDF tutorial, many visualization schemes (fig 2, fig 3) for data sets and data mining results, many different learning and meta-learning schemes ranging from decision tree and rule learners to neural networks, SVMs, ensemble methods, etc. Rapid Miner is implemented in Java and available under GPL (GNU General public License) as well as under a developer license (OEM license) for closed-source developers. The Data representation obtained are illustrated in fig 3 a,b,c and d. Figure 1: Rapid Miner.

Application of Data Mining in Manufacturing Industry 61 Figure 2: 3D Plot. Dia (mm) Bore Dimension Distribution 38 37 Series1 0 10 20 Figure 3(a-d): Data Representation. Inferring Rules and Knowledge Representation Most of the techniques used to present the output produce easily comprehensible descriptions of the structural patterns in the data. There are many different ways for representing the patterns that can be discovered by machine learning, and each one dictates the kind of technique that can be used to infer that output structure from data. The different types of techniques include: Decision Tables Decision Trees

62 V.K. Jha and R.K. Singh Classification rules Association rules Clusters Algorithms Used Decision Tables The simplest, most rudimentary way of representing the output from machine learning is to make it just the same as the input a decision table. creating a decision table might involve selecting some of the attributes. The problem is, of course, to decide which attributes to leave out without affecting the final decision. (fig 4). The no of iteration involved is dependent on available data and higher the iterations higher is the accuracy of results obtained. Figure 4(a): Decision Table. Figure 4(b): Rule iteration. Decision Trees A divide-and-conquer approach to the problem of learning from a set of independent instances leads naturally to a style of representation called a decision tree (fig 5). Nodes in a decision tree involve testing a particular attribute. Leaf nodes give a classification that applies to all instances that reach the leaf. If the attribute that is tested at a node is a nominal one, the number of children is usually the number of possible values of the attribute.

Application of Data Mining in Manufacturing Industry 63 Figure 5: Decision Tree. Case Study Engine specifications were collected from a company from the quality department. The table-i below shows the data. The data shows the specification of 15 engines, which were tested for quality. The NOGO implies that those engines did not satisfy the quality tests. BOR STROK ENGIN ROCKER E E (mm) E ARM to (mm) WEIGH SHAFT T (kg) CLEARAN Table I: Data Set. VALVES CLEARAN CE INTERNAL (mm) PISTON OUTER DIAMETE R (mm) ALTERNAT OR CAPACITY (Kw) REMAR K CE (mm) 37.6 44 70 0.012 0.26 37.75 0.1 GO 37.3 43 68 0.014 0.19 37.78 0.099 NOGO 37.9 46 73 0.034 0.18 37 0.097 GO 37.7 43 72 0.035 0.23 37.8 0.104 GO 37.5 42 70 0.044 0.3 37.79 0.106 GO 37.8 46 68 0.037 0.29 37.74 0.103 GO 37.6 45 69 0.038 0.27 37.54 0.098 GO 37.7 43 70 0.027 0.18 37.69 0.104 NOGO 37.9 44 72 0.028 0.31 37.86 0.106 GO 37.8 45 71 0.031 0.2 37.78 0.109 GO 37.7 46 75 0.018 0.25 37.77 0.11 GO 37.9 44 76 0.019 0.24 37.79 0.095 NOGO 37.8 47 70 0.027 0.18 37.75 0.102 GO 37.3 42 72 0.028 0.23 37.68 0.103 GO 37.5 47 73 0.016 0.22 37.73 0.099 NOGO

64 V.K. Jha and R.K. Singh Rule Sets p1 = (37.3, 37.9) p1 >37.4 && p2>43 p3>75 && p7>0.1 p5>2 p6<37.86 p2>42 p6<p1 p5<0.3 && p5<0.16 p4>0.012 && p4<0.043 p5<p4 Rapid Miner was used to learn pattern from the data set given above. Since the volume of data was small the results could not be found properly. So if- then analysis was used to fine the trend in the data. Excel was used to learn the variation in the data on altering some of the parameters. The rule sets shown above were thus found out. (Where p1 is parameter 1, p2 is parameter 2 and so on as shown in the table.) Conclusion This paper outlines a framework for decision making based on the knowledge provided by the data mining techniques. Basic control limits and boundary conditions required for different attributes of an engine are specified. The origin of error can be found out using this method. It was also realized that the population density representation of the data needs to be much more. The data should be filtered and error in the data should be minimized. The growing volume of data in manufacturing and service industries is a challenge that needs research on tools that discover unique properties of the data. Data mining is a discipline that offers tools for data analysis and knowledge discovery. References [1] Morgan Kaufmann Data Mining Practical Machine Learning Tools And Techniques - Ian H. Witten Department of Computer Science, University of Waikato and Eibe Frank Department of Computer Science, University of Waikato [2] Data Mining: Techniques and Applications - Thaer Al-Nimer [3] Data mining: manufacturing and service applications - A. KUSIAK, International Journal of Production Research, Vol. 44, Nos. 18 19, 15 September 1 October 2006, 4175 4191 [4] Survey of the Computational Intelligence Methods for Data Mining -Zdenˇek Buk [5] Data mining - Wikipedia (http://en.wikipedia.org/wiki/data_mining) [6] http://www.the-data-mine.com/