Data Mining for Fault Diagnosis and Machine Learning. for Rotating Machinery
|
|
- MargaretMargaret Blair
- 5 years ago
- Views:
Transcription
1 Key Engineering Materials Vols (2005) pp online at (2005) Trans Tech Publications, Switzerland Online available since 2005/Sep/15 Data Mining for Fault Diagnosis and Machine Learning for Rotating Machinery Gang Zhao 1, DongXiang Jiang 2, Kai Li 3, JinHui Diao 4 1, 2, 3, 4 Department of Thermal Engineering, Tsinghua University Beijing , P.R. China 1 zhaog02@mails.tsinghua.edu.cn, 2 jiangdx@tsinghua.edu.cn Keywords: faults diagnosis, machinery learning, data mining, rotating machinery Abstract. Data mining is used not only for database analyses, but also for machine learning. The data mining technique described in this paper was used for steam turbine fault diagnostics based on continuous data measurements. The classification rules are based on standardized vibration frequency data for steam turbines and field experts analyses of turbine vibration problems. The expert knowledge enables the steam turbine fault diagnosis system to be more powerful and accurate. The system can identify twenty types of standard steam turbine faults. The system was developed using 2000 simulated data sets. The data mining methods were then used to identify 20 explicit rules for the turbine faults. The method was also used with actual power plant data to successfully diagnose real faults. The results indicate that data mining can be effectively applied to diagnosis of rotating machinery by giving useful rules to interpret the data. Introduction Beginning in around 1985, the goal of rotating machinery fault diagnostics was primarily to store the vibration spectra and to provide graphical tools so that the analyst could quickly access the data and determine what might be wrong with the machine. But as the data collection devices (originally spectrum analyzers) became smaller, faster, and more portable, the amount of data to be analyzed rapidly grew. The data acquisition system could soon store hundreds of spectra. As the data acquisition systems and measurement techniques improved, the analyst was faced with mountains of data. Similar problem was developing in other area with large data warehouses with the rapid developments in digital data acquisition and storage technology. Although valuable information may be hiding in the data, the overwhelming data volume makes it difficult, if not impossible, for human beings to extract the information without powerful tools. The overwhelming amount of data resulted in the new technique of data mining, which seeks to extract knowledge from huge volumes of data through numerical analysis of the data. Data mining is not only database analysis method, but also an important machine learning tool. For machine learning, data mining is defined by Witten and Frank as the extraction of implicit, previously unknown, and potentially useful information from data [1]. This paper describes the application of data mining techniques to steam turbine fault diagnostics. Many methods have been used for data mining, with the decision tree often shown to be the most valuable form of data mining. The decision tree classifier (DTC) has been used successfully in many diverse areas such as radar signal classification, character recognition, remote sensing, medical diagnosis, expert systems, and speech recognition, to name only a few [2]. Perhaps, the All rights reserved. No part of contents of this paper may be reproduced or transmitted in any form or by any means without the written permission of the publisher: Trans Tech Publications Ltd, Switzerland, (ID: /04/08,11:10:38)
2 176 Damage Assessment of Structures VI most important feature of DTC's is their capability to break down a complex decision-making process into a collection of simpler decisions, thus providing a solution which is often easier to interpret. Fault diagnosis is based on pattern identification and classification. The first step in steam turbine fault diagnostics is pattern identification from the measured data. The next step is to interpret what the patterns indicate about the machine, but proper interpretation requires some knowledge about the machine. Decision trees provide a good approach to supervised classification and prediction in artificial intelligence and statistical pattern recognition. A tree is "grown" from data using a recursive partitioning algorithm to create a tree which (hopefully) accurately predicts classes on new data. Neural networks provide another valuable method for fault classification for rotating machinery. Crupi et al. [3] describe the use of neural networks to evaluate vibration signatures in rotating machinery and recognize the occurrence of faults. The procedure can be used to diagnose fault not considered in the training set. However, the neural networks knowledge is hidden in the network, so the rules can not be easily extracted and interpreted. Decision trees can be more effectively applied to steam turbine fault diagnosis because the fault diagnosis requires not only pattern classification, but also rule extraction and knowledge interpretation. Steam turbine fault diagnostics Steam turbine faults are generally classified into twenty types listed in Table 1 based on field experts experience and theoretical analyses. Table 1. Steam turbine fault classification. Fault No. Description Fault No. Description F0 normal F10 pedestal looseness F1 imbalance F11 foundation looseness F2 components missing F12 worn coupling F3 bent shaft F13 electricity magnet excited F4 shaft-seal rubbing F14 sub-harmonic vibration F5 axial rubbing F15 oil whirl F6 axial misalignment F16 oil whip F7 eccentricity faults F17 steam excited vibration F8 rotor crack F18 valve vibration F9 shrunk-on-disc failure F19 power disturbance In any faults diagnosis, feature extraction is an important step for detecting steam turbine faults. Features can be extracted from the frequency domain of a typical steam turbine vibration analysis. However, analysis of the steam turbine data requires a detailed understanding of the steam turbine design, operation, and maintenance. Vibration spectrum analysis is a practical and powerful tool for steam turbine fault diagnosis because it is based on a great deal of engineering experience. Although there have recently been many new methods applied to fault diagnosis, most approaches are based on or related to the vibration spectrum data. However, the fault can not be easily related to the spectrum data because the steam turbine system is very complex and influenced by numerous process parameters. The best method is to use the feature-fault relationship matrices in well-established machining reference databases, expert intelligence for the reasoning and decision-making and experimental results of signal characteristics for various working conditions. Table 3 show a fuzzy feature-faults relationship matrix for a steam turbine developed using fuzzy
3 Key Engineering Materials Vols mathematics. The table relates the typical twenty steam turbine faults with ten vibration spectrum features. The alphabetic symbols used to describe the spectrum and process features are listed in Table 2. The notation n X in the second column of table 2 denotes a frequency component (or range) in the spectrum at n times the turbine s rotational speed. Table 2. Symbols for vibration frequency and process feature description. Frequency Description Process Description feature feature f1 0.01~0.39X P1 Amplitude jump during operation f2 0.4~0.49X P2 vibrations at various power load f3 0.50X P3 axial vibration f4 0.51~0.99X P4 shaft average centerline f5 1 X P5 critical speed spectrum f6 2 X P6 stable at various running speeds f7 3~5 X P7 vibration level increase during running up f8 odd of X P8 level jump during run up f9 high X P9 3x at 1/3 critical speed f10 Power line P10 half-speed whirl Table 3. Spectrum feature-fault relationship chart. Faults f1 f2 f3 f4 f5 f6 f7 f8 f9 f10 F F F F F F F F F F F F F F F F F F F F The relationships listed in Table 3 show that some faults such as an imbalance, F1, and a bent shaft, F3, can not be distinguished since they have similar spectrum features. Therefore, a second relationship matrix given in Table 4 is used to relate the process features to the steam turbine faults. Table 4 was derived directly from the author and other field expert experience, so it can be used to efficiently diagnose faults. The two relationship charts in Tables 3 and 4 provide the basis for steam
4 178 Damage Assessment of Structures VI turbine fault diagnosis. Table 4. Process feature- fault relation chart. Fault P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 F0 N N L L N N Y N N N F1 N N L L N N Y N N N F2 Y N L L N N N N N N F3 N Y L L N N N N N N F4 N N L L N N N N N N F5 N N H L N N N N N N F6 N N H L N N N N N N F7 N N L H N N N N N N F8 N N L L N N N N Y N F9 N N L L N N N Y N N F10 N N M L N N N P N N F11 N N L L N N N Y N N F12 N N L L N N N N N N F13 N N L L N N N Y N N F14 N N L L N N N N N N F15 N N L L N N N N N Y F16 N N L L Y N N N N N F17 N P L L P N N N N N F18 N N L L N Y N N N N F19 N N L L N N N N N N Y=Yes, N=No, L=Low, M=Middle, H=High, P=Possible Decision Tree Classifier Methods Decision trees are based on the following terminology. (1) A decision tree is a flow chart or diagram representing a classification system or a predictive model. The tree is structured as a sequence of simple questions with the answers to those questions tracing a path down the tree. (2) The end product is a collection of hierarchical rules that segment the data into groups, where a decision (classification or prediction) is made for each group. (3) The hierarchy is called a tree, and each segment is called a node. (4) The original segment containing the entire data set is referred to as the root node of the tree. (5) A node with all of its successors forms a branch of the node that created it. (6) The final nodes are called leaves. A decision made at each leaf is applied to all observations in the leaf. The depth of a node in a tree is the path length from the root to the node. The height of a node in a tree is the largest path length from the node to a leaf. The height of a tree is the height of its root. The Ctree software [4] was used for the fault diagnostics because the software is easy to use and the data preparation is quite straight forward. Ctree is based on C4.5 algorithm. The Node Splitting Criterion calculated the entropy to select the split between branches. While growing the tree, a predictor is chosen at any point to split a node so that the information gain is maximized after the split. The C4.5 algorithm actually uses the gain ratio (= Gain / Split Information) to select the split.
5 Key Engineering Materials Vols The Stopping Criteria stops the node splitting and identifies the node as a leaf node if any one of the following criterion is met: (1) The number of records in the node is less than some pre-specified limit. (2) The node purity is more than some pre-specified limit p which means that the proportion of records in the node with class equal to the majority class is p or more. (3) The node depth is more than some pre-specified limit. (4) The predictor values for all the records are identical. Tree Pruning is based on the pessimistic error rate at the node. Each node has a 50% error rate confidence interval with its upper limit taken as the pessimistic error rate. If the pessimistic error rate of a node is less than that of the subtree rooted at that node, the node is pruned. Rule Generation is based on final tree geometry. The path from root to each leaf node gives a rule for that leaf node. Thus, a tree with k leaf nodes has a set of k rules. Then individual rules are pruned by dropping clauses one by one from that rule. The decision to drop a clause is based on the outcome of a statistical independence test. The test evaluates whether keeping a clause is independent of the final decision of the rule. If it is independent, then the clause is NOT contributing towards the final decision and it is dropped to simplify the rule. The independence tests are based on the Chi-square test and Fisher's exact test. Table 5. Classification results for various purities. Maximum purity Training set Test set misclassification rate misclassification rate 100% 0% 1.8% 95% 0% 1.73% 90% 0% 1.6% 80% 0% 1.33% 75% 0% 0.50% 70% 0% 1.19% 65% 0% 1.9% Application of data mining to steam turbine fault diagnosis A numerical simulation was developed based on the two relationship matrices in Tables 3 and 4 to test the decision tree classifier. The simulation firstly generated one hundred data points including spectrum features and process features for each type of steam turbine fault for a total of 2000 data points. Then, the data set was randomly divided into training and test sets. Next, the Ctree software was used to analyze the data set and to grow the decision tree. The pruning technique was used to generate a stable tree. The maximum purity of the tree was adjusted to get better results. For example, for a maximum purity rate of 100%, the misclassification rate for the training set was 0% and for the test set was 1.8%. However, when maximum purity was reduced to 75%, the misclassification rate for test set was reduced to 0.50%. Table 5 lists the results for various purities.
6 180 Damage Assessment of Structures VI Table 6. Classification Tree Information for a purity of 75%. Tree Information Item Value Tree Information Item Value Number of Training observations 993 Total Number of Nodes 40 Number of Test observations 1007 Number of Leaf Nodes 22 Number of Predictors 20 Number of Levels 12 Class Variable Faults Training Data Misclassification rate 0.00% Number of Classes 20 Test data Misclassification rate 0.50% Table 7. Test set results. Rule ID Fault Class Length Support Confidence Capture 1 F % 100.0% 100.0% 2 F % 100.0% 100.0% 3 F % 100.0% 100.0% 4 F % 100.0% 100.0% 5 F % 100.0% 100.0% 6 F % 100.0% 100.0% 7 F % 100.0% 100.0% 8 F % 100.0% 100.0% 9 F % 100.0% 100.0% 10 F % 100.0% 100.0% 11 F % 100.0% 100.0% 12 F % 93.8% 100.0% 13 F % 100.0% 100.0% 14 F % 98.2% 100.0% 15 F % 100.0% 100.0% 16 F % 94.3% 100.0% 17 F % 100.0% 100.0% 18 F % 100.0% 100.0% 19 F % 100.0% 100.0% 20 F % 100.0% 100.0% Simulation Results Tables 6, 7, and 8 list the classification results for the simulated steam turbine faults data. Table 6 describes the resulting decision tree for a maximum purity of 75%. The misclassification rate is sufficiently low for common engineering applications. The decision tree was then used to develop the if-then rules used by engineers to analyze and interpret the fault diagnosis results. The method can automatically extract the knowledge from the data as part of a fault diagnosis expert system. Table 7 summarizes the rule results for the test set, including the support, confidence and capture rates. The support rate measures how widely applicable the rule is in the training set. The confidence rate measures the accuracy of the rule. The capture indicates how many records of a fault were correctly captured by the rule. The twenty rules after pruning correspond to the twenty types of faults. Most of the confidence rates were 100%, with only 3 confidence rates less than 100% due to misclassification of the test data. Table 8 lists the specific rules for each fault type. The rules agree well with spectrum analysis theory. In addition, many process features from the field
7 Key Engineering Materials Vols experts experience are integrated into the rules to improve the classification process. Table 8. Rules derived from the classification tree. Rule IF Then 1 f F14 2 f3 <.1051, P9 = Y F8 3 f3 <.1051, f , P9 = N F6 4 f , f3 <.1051, f6 <.47008, P7 = Y, P9 = N F1 5 f1 <.84369, f3 <.1051, f , f6 <.47008, P6 = N, P9 = N F10 6 f1 < ,f3 <.1051, f6 <.47008, P6 = Y, P9 = N F18 7 f1 <.84369, f , f3 <.1051, f6 <.47008, P6 = Y, P9 = N F0 8 f , f3 <.1051, f6 <.47008, P1 = N, P7 = N,P9 = N F3 9 f , f3 <.1051,f6 <.47008,P1 = Y,P7 = N,P9 = N F2 10 f1<.18795,f2< ,f3<.1051,f5<.47041,f6<.47008, P6=N, P9=N F19 11 f1<.8437,f1.188,f2.186,f3<.105,f5<.47041,f6<.47008,p6=n,p9=n F12 12 f2 <.37561, f , f5 < F17 13 f1<.188,.009 f2<.20,f3<.105, f5<.470,f6<.470,p6=n,p9=n F11 14 f , f5 <.47041, f F f1<.395,f2 <.0947, f3<.105,f5<.47041, f6<.47008, P6=N,P9=N F13 16 f1 <.84369, f F f1<.84,f2<.19,f2.095,f3<.11,f5<.470,f6<.47,p3=h,p6=n,p9=n F f1<.84,f2 <.186,f2.095,f3<.11,f5<.47,f6<.47,P3=L,P6=N,P9=N F4 19 f1<.18795,f ,f3<.1051,f5< ,f6<.470,p5=n,p6=n,p9=n F15 20 f1<.18795,f2.3756,f3<.105,f5<.09472,f6<.470,p5=y,p6=n,p9=n F16 Application to power plant data The rules automatically extracted using the simulated data mining accurately summarize the rules for faults in rotating machinery such as steam turbines. The method was tested with real data by implementing the program as part of a power plant steam turbine diagnostic system. The original data is from remote online monitors and diagnostic systems installed in several power plants of the Shandong Electricity Power Company, China [5]. The data was measured by vibration transducers from the Bently Nevada Company. The data was then transformed to the frequency domain using Fast Fourier Transforms. Cases 1-4 are for various bearings at two different 300 MW units. The data extracted from the spectra was analyzed using the fuzzy logic algorithm used to generate Table 3. After the process data was added into the data sets, the data was analyzed using the rules extracted from the training set. The diagnostic results are listed in Table 9 which lists the faults identified from the spectrum data by the fuzzy logical algorithm. Table 10 lists the process data and diagnosis results. Case 1 is due to from steam excited vibration. Case 2 is due to axial misalignment. Case 3 is due to electrical-magnet excitation. Case 4 indicates a normal operating condition. The diagnostic results were then used to identify measures to suppress the vibrations. The results demonstrate that these methods can be effectively applied steam turbine fault diagnosis in actual power plants.
8 182 Damage Assessment of Structures VI Table 9. Spectrum feature data. Case No. f1 f2 f3 f4 f5 f6 f7 f8 f9 f10 Case Case Case Case Table 10. Process data and classification results. Case No. P1 P2 P3 P4 P5 P6 P7 P8 P9 P10 Predicted faults Case 1 N Y L L N N N N N N F17 Case 2 N N H L N N N N N N F6 Case 3 N N L L N N N N N N F13 Case 4 N N L L N N Y N N N F0 Conclusions Data mining method was used to classify simulated data and real data into known classes for steam turbine fault diagnostics. The use of the simulated data enabled the system to directly capture the field experts knowledge into the resulting classification rules. The classification rules were automatically extracted from the data sets for use by engineers to diagnose and interpret steam turbine faults. The simulation results and the results using actual data from operating power plants shows that the data mining methods can be effectively applied to steam turbine fault diagnostics. The automatic extraction of the classification rules shows that these machine learning methods can be applied to large turbo-machinery databases and can include engineering knowledge and field experience. The results can then be used for fault diagnosis of large rotating machines, such as steam turbines. Reference [1] I.H. Witten and E. Frank: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations (Academic Press. USA, 2000). [2] S.R. Safavian and D. Landgrebe: IEEE Transactions on Systems, Man, and Cybernetics Vol. 21 (1991), p. 660 [3] V. Crupi, E. Guglielmino and G. Milazzo G: Journal of Vibration and Control Vol. 10 (2004), p [4] A. Saha: Ctree in Excel. http: // [5] D. Jiang, H. Sun and X. Zhan: 5th International Conference Acoustical and Vibratory Surveillance Methods and Diagnostic Techniques, France (2004)
AMSE JOURNALS 2014-Series: Advances B; Vol. 57; N 2 ; pp 1-21 Submitted April 2013; Revised Oct. 30, 2013; Accepted July 20, 2014
AMSE JOURNALS 2014-Series: Advances B; Vol. 57; N 2 ; pp 1-21 Submitted April 2013; Revised Oct. 30, 2013; Accepted July 20, 2014 Development of Soft Decision Tree Classifier using Interval Type-2 Fuzzy
More informationInternet/Intranet Based Remote Condition Monitoring and Fault. Diagnosis Scheme and System for Steam Turboset
Key Engineering Materials Vols. 293-294 (2005) pp 365-372 online at http://www.scientific.net (2005) Trans Tech Publications, Switzerland Online available since 2005/Sep/15 Internet/Intranet Based Remote
More informationClustering Analysis based on Data Mining Applications Xuedong Fan
Applied Mechanics and Materials Online: 203-02-3 ISSN: 662-7482, Vols. 303-306, pp 026-029 doi:0.4028/www.scientific.net/amm.303-306.026 203 Trans Tech Publications, Switzerland Clustering Analysis based
More informationDecision Trees Dr. G. Bharadwaja Kumar VIT Chennai
Decision Trees Decision Tree Decision Trees (DTs) are a nonparametric supervised learning method used for classification and regression. The goal is to create a model that predicts the value of a target
More informationVibration analysis goes mainstream
Vibration analysis goes mainstream With advances in sensor, recording, and analysis technology, vibration analysis is now within the reach of even small organizations Fast Forward Measuring the vibrations
More informationData Mining. 3.2 Decision Tree Classifier. Fall Instructor: Dr. Masoud Yaghini. Chapter 5: Decision Tree Classifier
Data Mining 3.2 Decision Tree Classifier Fall 2008 Instructor: Dr. Masoud Yaghini Outline Introduction Basic Algorithm for Decision Tree Induction Attribute Selection Measures Information Gain Gain Ratio
More informationA Comparative Study of Selected Classification Algorithms of Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 6, June 2015, pg.220
More informationNeural network based vibration analysis with novelty in data detection for a large steam turbine
Shock and Vibration 19 (2012) 25 35 25 DOI 10.3233/SAV-2012-0614 IOS Press Neural network based vibration analysis with novelty in data detection for a large steam turbine K. P. Kumar a, K.V.N.S. Rao c,,
More informationText Categorization. Foundations of Statistic Natural Language Processing The MIT Press1999
Text Categorization Foundations of Statistic Natural Language Processing The MIT Press1999 Outline Introduction Decision Trees Maximum Entropy Modeling (optional) Perceptrons K Nearest Neighbor Classification
More informationFault Diagnosis of Wind Turbine Based on ELMD and FCM
Send Orders for Reprints to reprints@benthamscience.ae 76 The Open Mechanical Engineering Journal, 24, 8, 76-72 Fault Diagnosis of Wind Turbine Based on ELMD and FCM Open Access Xianjin Luo * and Xiumei
More informationAutomatic Machinery Fault Detection and Diagnosis Using Fuzzy Logic
Automatic Machinery Fault Detection and Diagnosis Using Fuzzy Logic Chris K. Mechefske Department of Mechanical and Materials Engineering The University of Western Ontario London, Ontario, Canada N6A5B9
More informationLecture outline. Decision-tree classification
Lecture outline Decision-tree classification Decision Trees Decision tree A flow-chart-like tree structure Internal node denotes a test on an attribute Branch represents an outcome of the test Leaf nodes
More informationFAULT DETECTION AND ISOLATION USING SPECTRAL ANALYSIS. Eugen Iancu
FAULT DETECTION AND ISOLATION USING SPECTRAL ANALYSIS Eugen Iancu Automation and Mechatronics Department University of Craiova Eugen.Iancu@automation.ucv.ro Abstract: In this work, spectral signal analyses
More informationHEALTH MONITORING OF INDUCTION MOTOR FOR VIBRATION ANALYSIS
HEALTH MONITORING OF INDUCTION MOTOR FOR VIBRATION ANALYSIS Chockalingam ARAVIND VAITHILINGAM aravind_147@yahoo.com UCSI University Kualalumpur Gilbert THIO gthio@ucsi.edu.my UCSI University Kualalumpur
More informationData Mining Practical Machine Learning Tools and Techniques
Decision trees Extending previous approach: Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 6 of Data Mining by I. H. Witten and E. Frank to permit numeric s: straightforward
More informationBusiness Club. Decision Trees
Business Club Decision Trees Business Club Analytics Team December 2017 Index 1. Motivation- A Case Study 2. The Trees a. What is a decision tree b. Representation 3. Regression v/s Classification 4. Building
More informationLecture 19: Decision trees
Lecture 19: Decision trees Reading: Section 8.1 STATS 202: Data mining and analysis November 10, 2017 1 / 17 Decision trees, 10,000 foot view R2 R5 t4 1. Find a partition of the space of predictors. X2
More informationBuilding Intelligent Learning Database Systems
Building Intelligent Learning Database Systems 1. Intelligent Learning Database Systems: A Definition (Wu 1995, Wu 2000) 2. Induction: Mining Knowledge from Data Decision tree construction (ID3 and C4.5)
More informationDecision trees. Decision trees are useful to a large degree because of their simplicity and interpretability
Decision trees A decision tree is a method for classification/regression that aims to ask a few relatively simple questions about an input and then predicts the associated output Decision trees are useful
More informationISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies
ISSN: 2321-7782 (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
More informationThe Comparative Study of Machine Learning Algorithms in Text Data Classification*
The Comparative Study of Machine Learning Algorithms in Text Data Classification* Wang Xin School of Science, Beijing Information Science and Technology University Beijing, China Abstract Classification
More informationBig Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1
Big Data Methods Chapter 5: Machine learning Big Data Methods, Chapter 5, Slide 1 5.1 Introduction to machine learning What is machine learning? Concerned with the study and development of algorithms that
More informationAnalysis of Process and biological data using support vector machines
Analysis of Process and biological data using support vector machines Sankar Mahadevan, PhD student Supervisor : Dr. Sirish Shah Department of Chemical and Materials Engineering University of Alberta Outline
More informationTree-based methods for classification and regression
Tree-based methods for classification and regression Ryan Tibshirani Data Mining: 36-462/36-662 April 11 2013 Optional reading: ISL 8.1, ESL 9.2 1 Tree-based methods Tree-based based methods for predicting
More informationData Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.4. Spring 2010 Instructor: Dr. Masoud Yaghini Outline Using IF-THEN Rules for Classification Rule Extraction from a Decision Tree 1R Algorithm Sequential Covering Algorithms
More informationsolid model rotor dynamics
solid model rotor dynamics A traditional practice in the rotor dynamics analysis is to use beam models for both the lateral and the torsion analysis. Such an analysis limits the capabilities for the modern
More informationVIBRATION TRAINING BROCHURE
VIBRATION TRAINING BROCHURE - 2018 - ISO Vibration Training Courses - ISO Vibration Analyst Certifications - Short Courses on Machinery Vibrations - Turbo Machinery and Advanced Vibration Courses INTRODUCTION
More informationAMOL MUKUND LONDHE, DR.CHELPA LINGAM
International Journal of Advances in Applied Science and Engineering (IJAEAS) ISSN (P): 2348-1811; ISSN (E): 2348-182X Vol. 2, Issue 4, Dec 2015, 53-58 IIST COMPARATIVE ANALYSIS OF ANN WITH TRADITIONAL
More informationMIT 801. Machine Learning I. [Presented by Anna Bosman] 16 February 2018
MIT 801 [Presented by Anna Bosman] 16 February 2018 Machine Learning What is machine learning? Artificial Intelligence? Yes as we know it. What is intelligence? The ability to acquire and apply knowledge
More informationHydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit
Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit Zihan Chen 1, Chen Lu 2, Hang Yuan 3 School of Reliability and Systems Engineering, Beihang University,
More informationAnalytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.
Glossary of data mining terms: Accuracy Accuracy is an important factor in assessing the success of data mining. When applied to data, accuracy refers to the rate of correct values in the data. When applied
More informationDeep Learning Based Diagnostics of Orbit Patterns in Rotating Machinery
Deep Learning Based Diagnostics of Orbit Patterns in Rotating Machinery Haedong Jeong, Sunhee Woo, Suhyun Kim, Seungtae Park, Heechang Kim, and Seungchul Lee Ulsan National Institute of Science and echnology,
More informationANN-Based Modeling for Load and Main Steam Pressure Characteristics of a 600MW Supercritical Power Generating Unit
ANN-Based Modeling for Load and Main Steam Pressure Characteristics of a 600MW Supercritical Power Generating Unit Liangyu Ma, Zhiyuan Gao Automation Department, School of Control and Computer Engineering
More information1) Give decision trees to represent the following Boolean functions:
1) Give decision trees to represent the following Boolean functions: 1) A B 2) A [B C] 3) A XOR B 4) [A B] [C Dl Answer: 1) A B 2) A [B C] 1 3) A XOR B = (A B) ( A B) 4) [A B] [C D] 2 2) Consider the following
More informationPerformance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM
Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Lu Chen and Yuan Hang PERFORMANCE DEGRADATION ASSESSMENT AND FAULT DIAGNOSIS OF BEARING BASED ON EMD AND PCA-SOM.
More informationIntroduction to Machine Learning
Introduction to Machine Learning Decision Tree Example Three variables: Attribute 1: Hair = {blond, dark} Attribute 2: Height = {tall, short} Class: Country = {Gromland, Polvia} CS4375 --- Fall 2018 a
More informationPattern Recognition. Kjell Elenius. Speech, Music and Hearing KTH. March 29, 2007 Speech recognition
Pattern Recognition Kjell Elenius Speech, Music and Hearing KTH March 29, 2007 Speech recognition 2007 1 Ch 4. Pattern Recognition 1(3) Bayes Decision Theory Minimum-Error-Rate Decision Rules Discriminant
More informationGraph Matching: Fast Candidate Elimination Using Machine Learning Techniques
Graph Matching: Fast Candidate Elimination Using Machine Learning Techniques M. Lazarescu 1,2, H. Bunke 1, and S. Venkatesh 2 1 Computer Science Department, University of Bern, Switzerland 2 School of
More informationEnhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques
24 Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques Ruxandra PETRE
More informationData Mining. Decision Tree. Hamid Beigy. Sharif University of Technology. Fall 1396
Data Mining Decision Tree Hamid Beigy Sharif University of Technology Fall 1396 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1396 1 / 24 Table of contents 1 Introduction 2 Decision tree
More informationA Systematic Overview of Data Mining Algorithms. Sargur Srihari University at Buffalo The State University of New York
A Systematic Overview of Data Mining Algorithms Sargur Srihari University at Buffalo The State University of New York 1 Topics Data Mining Algorithm Definition Example of CART Classification Iris, Wine
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Lecture 10 - Classification trees Tom Kelsey School of Computer Science University of St Andrews http://tom.home.cs.st-andrews.ac.uk twk@st-andrews.ac.uk Tom Kelsey
More information17. SEISMIC ANALYSIS MODELING TO SATISFY BUILDING CODES
17. SEISMIC ANALYSIS MODELING TO SATISFY BUILDING CODES The Current Building Codes Use the Terminology: Principal Direction without a Unique Definition 17.1 INTRODUCTION { XE "Building Codes" }Currently
More informationInternational Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X
Analysis about Classification Techniques on Categorical Data in Data Mining Assistant Professor P. Meena Department of Computer Science Adhiyaman Arts and Science College for Women Uthangarai, Krishnagiri,
More informationCS Machine Learning
CS 60050 Machine Learning Decision Tree Classifier Slides taken from course materials of Tan, Steinbach, Kumar 10 10 Illustrating Classification Task Tid Attrib1 Attrib2 Attrib3 Class 1 Yes Large 125K
More informationConverter station water-cooled pump vibration monitoring and. condition assessment system based on LabVIEW
International Journal of Research in Engineering and Science (IJRES) ISSN (Online): 2320-9364, ISSN (Print): 2320-9356 Volume 3 Issue 5 ǁ May 2015 ǁ PP.08-13 Converter station water-cooled pump vibration
More informationFabric Defect Detection Based on Computer Vision
Fabric Defect Detection Based on Computer Vision Jing Sun and Zhiyu Zhou College of Information and Electronics, Zhejiang Sci-Tech University, Hangzhou, China {jings531,zhouzhiyu1993}@163.com Abstract.
More informationImplementierungstechniken für Hauptspeicherdatenbanksysteme Classification: Decision Trees
Implementierungstechniken für Hauptspeicherdatenbanksysteme Classification: Decision Trees Dominik Vinan February 6, 2018 Abstract Decision Trees are a well-known part of most modern Machine Learning toolboxes.
More informationFuzzy Logic Intelligent Control System of Magnetic Bearings
Fuzzy Logic Intelligent Control System of Magnetic Bearings Shuliang Lei, Alan Palazzolo and Albert Kascak Abstract-This paper presents a fuzzy logic based intelligent control system applied to magnetic
More informationClassification/Regression Trees and Random Forests
Classification/Regression Trees and Random Forests Fabio G. Cozman - fgcozman@usp.br November 6, 2018 Classification tree Consider binary class variable Y and features X 1,..., X n. Decide Ŷ after a series
More informationData Mining Practical Machine Learning Tools and Techniques. Slides for Chapter 6 of Data Mining by I. H. Witten and E. Frank
Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 6 of Data Mining by I. H. Witten and E. Frank Implementation: Real machine learning schemes Decision trees Classification
More informationMark Bitto / ABB Inc / PSPG / CIBO Technical Focus Group / June Instrumented and Actuated Systems Condition Monitoring of Rotating Equipment
Mark Bitto / ABB Inc / PSPG / CIBO Technical Focus Group / June 2015 Instrumented and Actuated Systems Condition Monitoring of Rotating Equipment About the speaker Speaker name: Speaker title: Company
More informationA Novel Criterion Function in Feature Evaluation. Application to the Classification of Corks.
A Novel Criterion Function in Feature Evaluation. Application to the Classification of Corks. X. Lladó, J. Martí, J. Freixenet, Ll. Pacheco Computer Vision and Robotics Group Institute of Informatics and
More informationClassification and Regression Trees
Classification and Regression Trees Matthew S. Shotwell, Ph.D. Department of Biostatistics Vanderbilt University School of Medicine Nashville, TN, USA March 16, 2018 Introduction trees partition feature
More informationA Systematic Overview of Data Mining Algorithms
A Systematic Overview of Data Mining Algorithms 1 Data Mining Algorithm A well-defined procedure that takes data as input and produces output as models or patterns well-defined: precisely encoded as a
More informationFuzzy Partitioning with FID3.1
Fuzzy Partitioning with FID3.1 Cezary Z. Janikow Dept. of Mathematics and Computer Science University of Missouri St. Louis St. Louis, Missouri 63121 janikow@umsl.edu Maciej Fajfer Institute of Computing
More informationCredit card Fraud Detection using Predictive Modeling: a Review
February 207 IJIRT Volume 3 Issue 9 ISSN: 2396002 Credit card Fraud Detection using Predictive Modeling: a Review Varre.Perantalu, K. BhargavKiran 2 PG Scholar, CSE, Vishnu Institute of Technology, Bhimavaram,
More informationData Mining and Knowledge Discovery: Practice Notes
Data Mining and Knowledge Discovery: Practice Notes Petra Kralj Novak Petra.Kralj.Novak@ijs.si 2016/01/12 1 Keywords Data Attribute, example, attribute-value data, target variable, class, discretization
More informationDecision Tree CE-717 : Machine Learning Sharif University of Technology
Decision Tree CE-717 : Machine Learning Sharif University of Technology M. Soleymani Fall 2012 Some slides have been adapted from: Prof. Tom Mitchell Decision tree Approximating functions of usually discrete
More informationA Combined Method for On-Line Signature Verification
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 14, No 2 Sofia 2014 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2014-0022 A Combined Method for On-Line
More informationWeka ( )
Weka ( http://www.cs.waikato.ac.nz/ml/weka/ ) The phases in which classifier s design can be divided are reflected in WEKA s Explorer structure: Data pre-processing (filtering) and representation Supervised
More informationEE 589 INTRODUCTION TO ARTIFICIAL NETWORK REPORT OF THE TERM PROJECT REAL TIME ODOR RECOGNATION SYSTEM FATMA ÖZYURT SANCAR
EE 589 INTRODUCTION TO ARTIFICIAL NETWORK REPORT OF THE TERM PROJECT REAL TIME ODOR RECOGNATION SYSTEM FATMA ÖZYURT SANCAR 1.Introductıon. 2.Multi Layer Perception.. 3.Fuzzy C-Means Clustering.. 4.Real
More informationA Data Classification Algorithm of Internet of Things Based on Neural Network
A Data Classification Algorithm of Internet of Things Based on Neural Network https://doi.org/10.3991/ijoe.v13i09.7587 Zhenjun Li Hunan Radio and TV University, Hunan, China 278060389@qq.com Abstract To
More informationThe digital copy of this thesis is protected by the Copyright Act 1994 (New Zealand).
http://waikato.researchgateway.ac.nz/ Research Commons at the University of Waikato Copyright Statement: The digital copy of this thesis is protected by the Copyright Act 1994 (New Zealand). The thesis
More informationRandom Forest A. Fornaser
Random Forest A. Fornaser alberto.fornaser@unitn.it Sources Lecture 15: decision trees, information theory and random forests, Dr. Richard E. Turner Trees and Random Forests, Adele Cutler, Utah State University
More informationNetwork Traffic Classification Based on Deep Learning
Journal of Physics: Conference Series PAPER OPEN ACCESS Network Traffic Classification Based on Deep Learning To cite this article: Jun Hua Shu et al 2018 J. Phys.: Conf. Ser. 1087 062021 View the article
More informationMachine protection and monitoring system
System Overview Application The machine protection and condition monitoring system provides continuous operational supervision of rotating machinery such as turbosets, compressors, blowers, exhaust fans,
More informationDIA TECH Monitoring and Diagnosis System
Monitoring and Diagnosis System www.andritz.com Monitoring and Diagnosis System Top left: turbine operation chart and cavitation indication; top right: frequency spectrum and CEPSTRUM for structure borne
More informationLecture 7: Decision Trees
Lecture 7: Decision Trees Instructor: Outline 1 Geometric Perspective of Classification 2 Decision Trees Geometric Perspective of Classification Perspective of Classification Algorithmic Geometric Probabilistic...
More informationNetwork Traffic Measurements and Analysis
DEIB - Politecnico di Milano Fall, 2017 Sources Hastie, Tibshirani, Friedman: The Elements of Statistical Learning James, Witten, Hastie, Tibshirani: An Introduction to Statistical Learning Andrew Ng:
More informationData Mining. 3.3 Rule-Based Classification. Fall Instructor: Dr. Masoud Yaghini. Rule-Based Classification
Data Mining 3.3 Fall 2008 Instructor: Dr. Masoud Yaghini Outline Using IF-THEN Rules for Classification Rules With Exceptions Rule Extraction from a Decision Tree 1R Algorithm Sequential Covering Algorithms
More informationSTUDY PAPER ON CLASSIFICATION TECHIQUE IN DATA MINING
Journal of Analysis and Computation (JAC) (An International Peer Reviewed Journal), www.ijaconline.com, ISSN 0973-2861 International Conference on Emerging Trends in IOT & Machine Learning, 2018 STUDY
More informationBest First and Greedy Search Based CFS and Naïve Bayes Algorithms for Hepatitis Diagnosis
Best First and Greedy Search Based CFS and Naïve Bayes Algorithms for Hepatitis Diagnosis CHAPTER 3 BEST FIRST AND GREEDY SEARCH BASED CFS AND NAÏVE BAYES ALGORITHMS FOR HEPATITIS DIAGNOSIS 3.1 Introduction
More informationCSI 6500 Machinery Health TM Monitor
0-6500-031010 CSI 6500 Machinery Health TM Monitor Extend PlantWeb with API 670 machinery protection monitoring Integrate machinery protection monitoring with DeltaV and Ovation in three easy steps Monitor
More informationFinding Dominant Parameters For Fault Diagnosis Of a Single Bearing System Using Back Propagation Neural Network
International Journal of Mechanical & Mechatronics Engineering IJMME-IJENS Vol:13 No:01 40 Finding Dominant Parameters For Fault Diagnosis Of a Single Bearing System Using Back Propagation Neural Network
More informationMachine Learning. Decision Trees. Le Song /15-781, Spring Lecture 6, September 6, 2012 Based on slides from Eric Xing, CMU
Machine Learning 10-701/15-781, Spring 2008 Decision Trees Le Song Lecture 6, September 6, 2012 Based on slides from Eric Xing, CMU Reading: Chap. 1.6, CB & Chap 3, TM Learning non-linear functions f:
More informationCHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION
CHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION 4.1. Introduction Indian economy is highly dependent of agricultural productivity. Therefore, in field of agriculture, detection of
More informationResearch on the Application of Digital Images Based on the Computer Graphics. Jing Li 1, Bin Hu 2
Applied Mechanics and Materials Online: 2014-05-23 ISSN: 1662-7482, Vols. 556-562, pp 4998-5002 doi:10.4028/www.scientific.net/amm.556-562.4998 2014 Trans Tech Publications, Switzerland Research on the
More informationAN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE
AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE A.P. Ruhil and Tara Chand National Dairy Research Institute, Karnal-132001 JMP commonly pronounced as Jump is a statistical software
More informationLars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany
Syllabus Fri. 27.10. (1) 0. Introduction A. Supervised Learning: Linear Models & Fundamentals Fri. 3.11. (2) A.1 Linear Regression Fri. 10.11. (3) A.2 Linear Classification Fri. 17.11. (4) A.3 Regularization
More informationInducer: a Rule Induction Workbench for Data Mining
Inducer: a Rule Induction Workbench for Data Mining Max Bramer Faculty of Technology University of Portsmouth Portsmouth, UK Email: Max.Bramer@port.ac.uk Fax: +44-2392-843030 Abstract One of the key technologies
More informationKeywords hierarchic clustering, distance-determination, adaptation of quality threshold algorithm, depth-search, the best first search.
Volume 4, Issue 3, March 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Distance-based
More informationAlgorithms: Decision Trees
Algorithms: Decision Trees A small dataset: Miles Per Gallon Suppose we want to predict MPG From the UCI repository A Decision Stump Recursion Step Records in which cylinders = 4 Records in which cylinders
More informationISSN (Online) Volume 2, Number 1, May October (2011), IAEME
International Journal Journal of Design of Design and Manufacturing and Manufacturing Technology (IJDMT), ISSN 0976 6995(Print), Technology (IJDMT), ISSN 0976 6995(Print), ISSN 0976 7002(Online) Volume
More informationData Mining Lecture 8: Decision Trees
Data Mining Lecture 8: Decision Trees Jo Houghton ECS Southampton March 8, 2019 1 / 30 Decision Trees - Introduction A decision tree is like a flow chart. E. g. I need to buy a new car Can I afford it?
More informationReview on Methods of Selecting Number of Hidden Nodes in Artificial Neural Network
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 11, November 2014,
More informationRONDS Product Catalog : 2017
RONDS Product Catalog : 2017 C Tech Inter Co., Ltd We are authorized dealer. About RONDS RONDS is a high-tech enterprise which owns core technology and provides turnkey solutions for machinery Condition
More information7. Decision or classification trees
7. Decision or classification trees Next we are going to consider a rather different approach from those presented so far to machine learning that use one of the most common and important data structure,
More informationFUZZY C-MEANS ALGORITHM BASED ON PRETREATMENT OF SIMILARITY RELATIONTP
Dynamics of Continuous, Discrete and Impulsive Systems Series B: Applications & Algorithms 14 (2007) 103-111 Copyright c 2007 Watam Press FUZZY C-MEANS ALGORITHM BASED ON PRETREATMENT OF SIMILARITY RELATIONTP
More informationData Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1395
Data Mining Introduction Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1395 1 / 21 Table of contents 1 Introduction 2 Data mining
More information8. Tree-based approaches
Foundations of Machine Learning École Centrale Paris Fall 2015 8. Tree-based approaches Chloé-Agathe Azencott Centre for Computational Biology, Mines ParisTech chloe agathe.azencott@mines paristech.fr
More informationAn Information-Theoretic Approach to the Prepruning of Classification Rules
An Information-Theoretic Approach to the Prepruning of Classification Rules Max Bramer University of Portsmouth, Portsmouth, UK Abstract: Keywords: The automatic induction of classification rules from
More informationStructural and Syntactic Pattern Recognition
Structural and Syntactic Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Fall 2017 CS 551, Fall 2017 c 2017, Selim Aksoy (Bilkent
More informationLecture 2 :: Decision Trees Learning
Lecture 2 :: Decision Trees Learning 1 / 62 Designing a learning system What to learn? Learning setting. Learning mechanism. Evaluation. 2 / 62 Prediction task Figure 1: Prediction task :: Supervised learning
More informationLecture 5: Decision Trees (Part II)
Lecture 5: Decision Trees (Part II) Dealing with noise in the data Overfitting Pruning Dealing with missing attribute values Dealing with attributes with multiple values Integrating costs into node choice
More informationModel Selection and Assessment
Model Selection and Assessment CS4780/5780 Machine Learning Fall 2014 Thorsten Joachims Cornell University Reading: Mitchell Chapter 5 Dietterich, T. G., (1998). Approximate Statistical Tests for Comparing
More informationFraud Detection Using Random Forest Algorithm
Fraud Detection Using Random Forest Algorithm Eesha Goel Computer Science Engineering and Technology, GZSCCET, Bhatinda, India eesha1992@rediffmail.com Abhilasha Computer Science Engineering and Technology,
More informationEfficient SQL-Querying Method for Data Mining in Large Data Bases
Efficient SQL-Querying Method for Data Mining in Large Data Bases Nguyen Hung Son Institute of Mathematics Warsaw University Banacha 2, 02095, Warsaw, Poland Abstract Data mining can be understood as a
More informationExtra readings beyond the lecture slides are important:
1 Notes To preview next lecture: Check the lecture notes, if slides are not available: http://web.cse.ohio-state.edu/~sun.397/courses/au2017/cse5243-new.html Check UIUC course on the same topic. All their
More informationStudy on the Application Analysis and Future Development of Data Mining Technology
Study on the Application Analysis and Future Development of Data Mining Technology Ge ZHU 1, Feng LIN 2,* 1 Department of Information Science and Technology, Heilongjiang University, Harbin 150080, China
More information