Data Mining Approaches to Characterize Batch Process Operations

Size: px
Start display at page:

Download "Data Mining Approaches to Characterize Batch Process Operations"

Transcription

1 Data Mining Approaches to Characterize Batch Process Operations Rodolfo V. Tona V., Antonio Espuña and Luis Puigjaner * Universitat Politècnica de Catalunya, Chemical Engineering Department. Diagonal 647, Barcelona, Spain. Abstract n this work, an approach to mine data from batch process operations is presented. The aim is to extract knowledge from data and to support the design or redesign of monitoring systems. Multivariate models at recipe and lower levels are obtained by a multiscale PCA (MsPCA) approach. Then, fuzzy clustering is used to help to identify operational conditions by each product recipe. Cluster membership information is used to define effective rules that aid to characterise the operation of the plant for future productions. How to handle time-varying trajectories and how to catch their associated dynamics with the multiscale PCA is specially considered. An example based on a real pilot plant is used for illustrative purposes. Keywords: Data Mining, Batch Process, MsPCA, Fuzzy Clustering. 1. ntroduction Nowadays, large amounts of process operational data are recorded in Chemical Plants. t has been recognised that these data have a great potential to provide insight into the process (Stockill, 2002). So, developments of advanced data analysis tools and methods are required. Particularly, adoptions and applications of Data Mining approaches that aid to extract useful knowledge from data are claimed (Stockill, 2002, Wang, 2001). Some recent proposals have been made to support monitoring of continuous and batch processes. For batch, existing multivariate methods like Multiway PCA (MPCA) and PLS (Nomikos et al, 1994) has been proposed to obtain reduced characterisations of productions. However, in these methods, issues like time varying operations, outliers, production by recipes and the transitory dynamics of trend variables are not well treated or solved. Also, issues like the multiscale nature of the data have been separately considered. n the case of MPCA it is assumed that operating times of all batches are equal. However, this is not true in many real applications. Extensions to solve the timevarying problem have been proposed by use of aligning of variables (resampling) by reference to a variable indicator with dynamic time warping techniques (Kassidas et al, 1998). The disadvantage of this is that multiscale feature of variables are not taken into account. Additionally, some variables cannot be available at the corresponding resampling intervals. Chen et al (2000) uses orthonormal basis functions to represent the variables profiles and then build an MPCA model over the coefficients of the * To whom correspondence should be addressed mail: luis.puigjaner@upc.es

2 orthonormal functions. This last approach is more appropriate to try the time varying problem, but important issues like the selection of the functions and the multiscale nature of the data are not considered. Clustering has also been proposed for batch operation (Yuan et al., 2001). t is used together with PCA to support the identification of operating conditions and to the design of the monitoring system. The issue of operation by recipes into the analysis is also considered. Nevertheless, time varying, outliers and multiscale are not considered here. As a consequence, the identification and the obtained monitoring model will be suboptimal. n this work an alternative approach is proposed to explore batch operational data and to assist in the design or redesign of monitoring system. The integration of orthonormal bases functions with PCA is adopted by using Wavelet. The resulting MsPCA is combined with Fuzzy Clustering. The identification with clustering allows the generation of operation rules that improves the knowledge of the process and serves as a monitoring system together with the MsPCA. 2. Proposed Approach 2.1 Multiscale Modelling of Data Multivariate Statistical techniques have been extensively used for monitoring Batch Processes. MPCA is one of the most known methods. To apply this method it is assumed that experimental data form a three-dimensional array. The resulting matrix, Xo, is of xjxk, where J variables are measured at K times in each one of the batches. Then, Xo can be unfolded into a large two-dimensional matrix, X, of xjk (figure 1). Then, PCA can be over this unfolded matrix. n the method, it is supposed equal operation time (K) for all the batches which limits the application to the cases where batches are different in time. Observations K Batches Vari ables J J xk Variables x Observations Figure 1. Unfolding of the Three-way Batch data set. To overcome this problem we adopt an approach based on function approximation (Chen et al, 2001). n this approach, the matrix X is obtained like in MPCA, but the resulting matrix is ordered as follows: [ X ] xjk k = 1) k = 2)... k = K ) = M k = 1) k = 2)... k = K ) L L L x J ( k = 1) x J ( k = 2)... x J ( k = K ) M k = 1) k = 2)... k = K ) (1)

3 n the above matrix, each element is the profile of variable x j in batch run i. These profiles can be represented by ƒ i,j (t) functions. Chen (2001) proposed the use of approximation functions to obtain ƒ i,j (t). Approximation functions constitute sets of orthonormal bases with very good properties for represent signals. They allow representing ƒ(x) as: N 1 n= 0 f ( x) c n φ ( t) (2) n where, C N = {c n } n=0,1,,n-1., and {φ n (t)}, represent a set of square integral functions. Then, based on Lagrange polynomial functions, equation (1) becomes: [ X ] M L M = [] c [] c L [] c xjk f1,1 ( t) L f1, J ( t) = [ xn xn xnj ] (3) 11 2 f,1( t) L f, J ( t) where N j is the required number of bases in ƒ i,j (t) to approximate the measurements j and [c] xnj is the trajectory coefficient matrix of measurement j which is spanned by N j. N j is always the same on normal operation. So, by applying PCA on X is obtained a good representation of batches with different time duration. The above solution strategy is proposed over one scale. However, it has been widely recognised the multiscale nature of chemical data (Bakshi, 1998). Also, the selection of Lagrange polynomials is not an obvious alternative. n this work, all this is solved by using Wavelets. Wavelets are families of functions with very good properties as orthonormal functions. By combining approximations, at different scales, they are able to catch fine details and trends with very good accuracy. So, the resulting approximation is expressed as: f ( x) = d mk ψ ( t) (4) m= 1 k= mk where d m represents the c n coefficients in equation 2 at the m th scale and ψ m define the k th basis functions at the m th scale. n relation to the selection of the function, Daubechies wavelets are used because of their very good capabilities to represent polynomial behaviour. The extraction of functions with wavelets is achieved through de-noising (Nounou et al,1999). t allows to eliminate the effect of noise with clear advantages over subsequent analysis. So, the matrix X is built by way of the d m denoised coefficients and PCA is applied on it. There is a clear difference with the multiscale PCA proposed by Bakshi (1998). Here, PCA is only built over the complete wavelets coefficients matrix and the de-noising is ensured before PCA. 2.2 Fuzzy Clustering Clustering are techniques that attempts to assess the relationships among data patterns belonging to different groups. n this work, fuzzy clustering is adopted with the purpose to identify operating region patterns and as a base for rules generation. Fuzzy-c-mean is used as the clustering technique. t is an algorithm that can automatically identify the centre of each cluster and calculate the membership values of each data case to each cluster. t is based on the minimization of the sum of squared Euclidean distances between data (X k, k=1 n) and cluster centres (v i, i =1 c):

4 Min J m c n m ( U, V ) = ( µ ) x v (5) 1 k= 1 ik k i 2 where 1 m is the fuzziness index, c is the number of clusters, and µ ik denotes the matrix of a fuzzy c-partition. The last fuzzy c-partition is constrained as follows: µ ik [0,1] i, k, µ ik = 1 k, C i = 1 n k = 1 µ ik < n, i. (6) n other words, each X k could belong to more than one cluster with each membership (µ) taking a fractional value between 0 and 1. The details of the algorithm are not shown for space reasons (Bezdeck, 1981). However, it is noted that the algorithm is dependently of C. Also, because the objective function is based on Euclidian distances, the method tends to identify clusters with only spherical forms. n this work, the Mahalanobis distance is used (equation 5) to extend the identification up to spherical and ellipsoids forms. Additionally, a simple algorithm, the mountain method (Yager et al, 1994), is used as a pre-estimator of C, for cases when it is not known a priori. Rules generation n the above methods, each cluster centre is in essence a prototypical data point that exemplifies characteristics behaviour of a system. Then, the membership information, µ, of each data point allows associating it to a pattern of the system. So, simple rules can be generated by extract the larger membership of each point and relate it to a pattern as follow: f µ J1 is A then {C 1 = pattern 1 } 1 is produced. f µ J2 is B and µ J2 is D then {C = pattern} 1 is produced. Here, A, B and D, can represent values of variables like Temperature, etc. and patterns can express an associated operating condition for a recipe. As a consequence, a more insight into the process can be obtained and used to design a monitoring system. 2.3 Global procedure. Data mining approaches. The above methods are combined to extract knowledge from batch process data. The way by which data is analysed, is determined by two important issues, the production by recipes and the operation development by stages. Two levels of analysis are proposed: One at the recipe level (entire batch) and another at the stage level (sub-step). At the first level, the overall data from different batches are used. A matrix, X c, with all process variables, is built together with a separate matrix of important quality variables, X Qi. Wavelet coefficient matrices for each matrix (X c, X Qi ) are obtained. The resulting matrices are processed with PCA. Reduced representations (patterns) of batches and the profile of each X Qi are obtained. Next, clustering is applied. Groups of patterns and its memberships µ to groups are obtained. Groups of one or two objects, with their respective coefficients into X c and X Qi are rejected as outliers and PCA models are obtained again. This rejection step allows to eliminate some possible abnormal batches and to select data of good batches. When no rejections are registered, groups of operations are defined. t can occur that some recipes will be grouped in a same cluster

5 which suggests similar operating conditions and, possibly, a single model for these recipes. n a second step, the data set obtained in the above step is used and the recipes identified as similar are analysed together. A pre-processing step with wavelet is applied over the variables profiles to identify stages. Then, MsPCA is applied over individuals X pi matrices and by groups of stages in profiles. The obtained information, µ, is mapped onto the µ information at the level recipe. So, simple rules about conditions in each stage that can conduct to a product grade of a recipe are derived. Finally, data rejected at the recipe level is analysed to identify the potentially abnormal operation. So, the knowledge about the process is expanded. t should be noted that a similar analysis (by levels) has already been proposed by Yuan et al, (2001). They first apply the stage level analysis to obtain a reduced representation of variables with the most significant principal components (PC s). Then, PCA is applied over PC s. However, PC s are not well suited to catch the dynamic trend information of variables. Our proposed use of wavelets is much more appropriate for this task. Checking about abnormal data is no made and the presence of outliers is not considered by Yuan et al., which can mask the identification of groups. Additionally, their approach can not be applied over time varying processes. 3. Batch Pilot Plant Application A Real pilot plant at UPC has been selected as the scenario for testing purposes. t contains three reactors, heat exchangers and a highly flexible connectivity between them that is achieved via a network of pipes, pumps and valves. t has been used to generate data for several products recipes. A total of 24 experiments are generated with different length in operation time. First, the analysis over the recipe level is made. Figure 2a. µ values with a bad batch Figure 2b. µ values without bad batches Membership values (µ) of batches to different groups are obtained (figure 2a). The groups are verified as representing specific recipes. Also, it is noted the effect of a bad batch with a low µ in recipe 3. When it is rejected, the definition of the groups is improved (figure 2b). Subsequent analysis help to identify different operating conditions associated to the existing recipes. Then, operating rules about each one of the recipes are obtained. t can be noted that in the application of this analysis the process variables were recorded with a frequency of one minute while the quality

6 variables were recorded at a frequency of 5-10 minutes. Because PCA is applied over the coefficients of wavelet approximations of each signal, the low difference in sampling is not limiting. However, it should be noted that the method can not be applied in cases with larger differences between sampling frequencies. 4. Conclusions A new methodology to explore batch processes has been proposed. The methodology is capable of deal with important issues like time varying trajectories and outliers. Also, it is appropriate to represent operations, by stages and by recipes, with rules. This last capability is useful to improve the understanding of the process. Also, it is shown as very useful to the design or redesign of monitoring systems. Data about the pilot plant have served to illustrate the methodology. Nevertheless, additional applications over this and/or other real scenarios should be made to establish the generalization of the method. Also, the problem of different sample frequencies must be additionally studied. Finally, the approach is observed as potentially useful to obtain a root cause analysis databases to support tasks like equipment maintenance or scheduling. All this, will be explored in future works. Acknowledgment Financial support from the Generalitat de Catalunya (F research grant for Tona, R.V.) and from the European Community (projects VPNET-GRD-CT and CHEM-GRD-CT ) are gratefully acknowledged. References Bakshi, B. R., 1998, AChE Journal, 44, 7, Bezdek, J., Pattern recognition with fuzzy objective function algorithms, Plenum, N.Y. Chen, J., and Liu, J., 2001, Chem. Eng. Sci., 56, 10, Kassidas, A., MacGregor, J.F., and Taylor, P., 1998, AChE J, 44, 4, pp Nomikos, P. and MacGregor, J.F., 1995, Technometrics, 37, 1, Nounou, M. N. and Bakshi, B. R., 1999, AChE Journal, 45, 5, Stockill, D., 2002, ESCAPE-12 (Ed. Grievink, J., Schjindel, J.,), Elsevier, 70-77, Amsterdam, Netherlands. Wang, X.Z., Application of Neural Networks and other Learning Technologies in Process Engineering. London : mperial College Press. Yager, R.R, and Filev, D.P., 1994, EEE Trans. on Syst., Man, & Cyb., 24, 8, pp Yuan, and Wang, X.Z., 2001, Chem. Eng. Comm., 185,

Combining Complementary Scheduling Approaches into an Enhanced Modular Software

Combining Complementary Scheduling Approaches into an Enhanced Modular Software Combining Complementary Scheduling Approaches into an Enhanced Modular Software Jordi Cantón 1, Moisès Graells 1, Antonio Espuña 1, Luis Puigjaner 1* Wesley Alvarenga 2, Maria Teresa Rodrígues 2, Luis

More information

HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging

HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging 007 International Conference on Convergence Information Technology HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging Lixin Han,, Guihai Chen Department of Computer Science and Engineering,

More information

A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data

A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data Journal of Computational Information Systems 11: 6 (2015) 2139 2146 Available at http://www.jofcis.com A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data

More information

Design of Fault Diagnosis System of FPSO Production Process Based on MSPCA

Design of Fault Diagnosis System of FPSO Production Process Based on MSPCA 2009 Fifth International Conference on Information Assurance and Security Design of Fault Diagnosis System of FPSO Production Process Based on MSPCA GAO Qiang, HAN Miao, HU Shu-liang, DONG Hai-jie ianjin

More information

HARD, SOFT AND FUZZY C-MEANS CLUSTERING TECHNIQUES FOR TEXT CLASSIFICATION

HARD, SOFT AND FUZZY C-MEANS CLUSTERING TECHNIQUES FOR TEXT CLASSIFICATION HARD, SOFT AND FUZZY C-MEANS CLUSTERING TECHNIQUES FOR TEXT CLASSIFICATION 1 M.S.Rekha, 2 S.G.Nawaz 1 PG SCALOR, CSE, SRI KRISHNADEVARAYA ENGINEERING COLLEGE, GOOTY 2 ASSOCIATE PROFESSOR, SRI KRISHNADEVARAYA

More information

Fuzzy-Kernel Learning Vector Quantization

Fuzzy-Kernel Learning Vector Quantization Fuzzy-Kernel Learning Vector Quantization Daoqiang Zhang 1, Songcan Chen 1 and Zhi-Hua Zhou 2 1 Department of Computer Science and Engineering Nanjing University of Aeronautics and Astronautics Nanjing

More information

A Brief Overview of Robust Clustering Techniques

A Brief Overview of Robust Clustering Techniques A Brief Overview of Robust Clustering Techniques Robust Clustering Olfa Nasraoui Department of Computer Engineering & Computer Science University of Louisville, olfa.nasraoui_at_louisville.edu There are

More information

SELECTION OF A MULTIVARIATE CALIBRATION METHOD

SELECTION OF A MULTIVARIATE CALIBRATION METHOD SELECTION OF A MULTIVARIATE CALIBRATION METHOD 0. Aim of this document Different types of multivariate calibration methods are available. The aim of this document is to help the user select the proper

More information

To be presented at the American Control Conference, Denver, CO, June 4 6, Data Compression Issues with Pattern Matching in Historical Data

To be presented at the American Control Conference, Denver, CO, June 4 6, Data Compression Issues with Pattern Matching in Historical Data To be presented at the American Control Conference, Denver, CO, June 4 6, 2003 Data Compression Issues with Pattern Matching in Historical Data Ashish Singhal Dale E. Seborg Department of Chemical Engineering

More information

Improved Version of Kernelized Fuzzy C-Means using Credibility

Improved Version of Kernelized Fuzzy C-Means using Credibility 50 Improved Version of Kernelized Fuzzy C-Means using Credibility Prabhjot Kaur Maharaja Surajmal Institute of Technology (MSIT) New Delhi, 110058, INDIA Abstract - Fuzzy c-means is a clustering algorithm

More information

Novel Intuitionistic Fuzzy C-Means Clustering for Linearly and Nonlinearly Separable Data

Novel Intuitionistic Fuzzy C-Means Clustering for Linearly and Nonlinearly Separable Data Novel Intuitionistic Fuzzy C-Means Clustering for Linearly and Nonlinearly Separable Data PRABHJOT KAUR DR. A. K. SONI DR. ANJANA GOSAIN Department of IT, MSIT Department of Computers University School

More information

Real-time Monitoring of Multi-mode Industrial Processes using Feature-extraction Tools

Real-time Monitoring of Multi-mode Industrial Processes using Feature-extraction Tools Real-time Monitoring of Multi-mode Industrial Processes using Feature-extraction Tools Y. S. Manjili *, M. Niknamfar, M. Jamshidi Department of Electrical and Computer Engineering The University of Texas

More information

A Fuzzy Rule Based Clustering

A Fuzzy Rule Based Clustering A Fuzzy Rule Based Clustering Sachin Ashok Shinde 1, Asst.Prof.Y.R.Nagargoje 2 Student, Computer Science & Engineering Department, Everest College of Engineering, Aurangabad, India 1 Asst.Prof, Computer

More information

OPTIMIZATION. Optimization. Derivative-based optimization. Derivative-free optimization. Steepest descent (gradient) methods Newton s method

OPTIMIZATION. Optimization. Derivative-based optimization. Derivative-free optimization. Steepest descent (gradient) methods Newton s method OPTIMIZATION Optimization Derivative-based optimization Steepest descent (gradient) methods Newton s method Derivative-free optimization Simplex Simulated Annealing Genetic Algorithms Ant Colony Optimization...

More information

Data-driven fault detection with process topology for fault identification

Data-driven fault detection with process topology for fault identification Preprints of the 19th World Congress The International Federation of Automatic Control Data-driven fault detection with process topology for fault identification Brian S. Lindner*. Lidia Auret** * Department

More information

A MODIFIED FUZZY C-REGRESSION MODEL CLUSTERING ALGORITHM FOR T-S FUZZY MODEL IDENTIFICATION

A MODIFIED FUZZY C-REGRESSION MODEL CLUSTERING ALGORITHM FOR T-S FUZZY MODEL IDENTIFICATION 20 8th International Multi-Conference on Systems, Signals & Devices A MODIFIED FUZZY C-REGRESSION MODEL CLUSTERING ALGORITHM FOR T-S FUZZY MODEL IDENTIFICATION Moêz. Soltani, Borhen. Aissaoui 2, Abdelader.

More information

Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit

Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit Zihan Chen 1, Chen Lu 2, Hang Yuan 3 School of Reliability and Systems Engineering, Beihang University,

More information

European Journal of Science and Engineering Vol. 1, Issue 1, 2013 ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM IDENTIFICATION OF AN INDUCTION MOTOR

European Journal of Science and Engineering Vol. 1, Issue 1, 2013 ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM IDENTIFICATION OF AN INDUCTION MOTOR ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM IDENTIFICATION OF AN INDUCTION MOTOR Ahmed A. M. Emam College of Engineering Karrary University SUDAN ahmedimam1965@yahoo.co.in Eisa Bashier M. Tayeb College of Engineering

More information

A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering

A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering Nghiem Van Tinh 1, Vu Viet Vu 1, Tran Thi Ngoc Linh 1 1 Thai Nguyen University of

More information

Multivariate Analysis

Multivariate Analysis Multivariate Analysis Cluster Analysis Prof. Dr. Anselmo E de Oliveira anselmo.quimica.ufg.br anselmo.disciplinas@gmail.com Unsupervised Learning Cluster Analysis Natural grouping Patterns in the data

More information

ECM A Novel On-line, Evolving Clustering Method and Its Applications

ECM A Novel On-line, Evolving Clustering Method and Its Applications ECM A Novel On-line, Evolving Clustering Method and Its Applications Qun Song 1 and Nikola Kasabov 2 1, 2 Department of Information Science, University of Otago P.O Box 56, Dunedin, New Zealand (E-mail:

More information

Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude

Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude Fuzzy k-nn applied to moulds detection. Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude Communication presented at ISOEN 2003 RIGA- Latvia Introduction Excessive

More information

A robust optimization based approach to the general solution of mp-milp problems

A robust optimization based approach to the general solution of mp-milp problems 21 st European Symposium on Computer Aided Process Engineering ESCAPE 21 E.N. Pistikopoulos, M.C. Georgiadis and A. Kokossis (Editors) 2011 Elsevier B.V. All rights reserved. A robust optimization based

More information

Cluster Analysis. Ying Shen, SSE, Tongji University

Cluster Analysis. Ying Shen, SSE, Tongji University Cluster Analysis Ying Shen, SSE, Tongji University Cluster analysis Cluster analysis groups data objects based only on the attributes in the data. The main objective is that The objects within a group

More information

FAULT DIAGNOSIS BASED ON MULTI-SCALE CLASSIFICATION USING KERNEL FISHER DISCRIMINANT ANALYSIS AND GAUSSIAN MIXTURE MODEL AND K-NEAREST NEIGHBOR METHOD

FAULT DIAGNOSIS BASED ON MULTI-SCALE CLASSIFICATION USING KERNEL FISHER DISCRIMINANT ANALYSIS AND GAUSSIAN MIXTURE MODEL AND K-NEAREST NEIGHBOR METHOD Universiti Kebangsaan Malaysia FAULT DIAGNOSIS BASED ON MULTI-SCALE CLASSIFICATION USING KERNEL FISHER DISCRIMINANT ANALYSIS AND GAUSSIAN MIXTURE MODEL AND K-NEAREST NEIGHBOR METHOD NORAZWAN M. NOR*, MOHD

More information

Graph Embedding in Vector Spaces

Graph Embedding in Vector Spaces Graph Embedding in Vector Spaces GbR 2011 Mini-tutorial Jaume Gibert, Ernest Valveny Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain Horst Bunke Institute of Computer Science

More information

Fuzzy Segmentation. Chapter Introduction. 4.2 Unsupervised Clustering.

Fuzzy Segmentation. Chapter Introduction. 4.2 Unsupervised Clustering. Chapter 4 Fuzzy Segmentation 4. Introduction. The segmentation of objects whose color-composition is not common represents a difficult task, due to the illumination and the appropriate threshold selection

More information

A NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS

A NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS A NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS Ahmed Abdelfattah Saleh University of Brasilia, Brasil ahmdsalh@yahoo.com

More information

Learning a Manifold as an Atlas Supplementary Material

Learning a Manifold as an Atlas Supplementary Material Learning a Manifold as an Atlas Supplementary Material Nikolaos Pitelis Chris Russell School of EECS, Queen Mary, University of London [nikolaos.pitelis,chrisr,lourdes]@eecs.qmul.ac.uk Lourdes Agapito

More information

An adjustable p-exponential clustering algorithm

An adjustable p-exponential clustering algorithm An adjustable p-exponential clustering algorithm Valmir Macario 12 and Francisco de A. T. de Carvalho 2 1- Universidade Federal Rural de Pernambuco - Deinfo Rua Dom Manoel de Medeiros, s/n - Campus Dois

More information

QUALITATIVE MODELING FOR MAGNETIZATION CURVE

QUALITATIVE MODELING FOR MAGNETIZATION CURVE Journal of Marine Science and Technology, Vol. 8, No. 2, pp. 65-70 (2000) 65 QUALITATIVE MODELING FOR MAGNETIZATION CURVE Pei-Hwa Huang and Yu-Shuo Chang Keywords: Magnetization curve, Qualitative modeling,

More information

Outlier Detection and Removal Algorithm in K-Means and Hierarchical Clustering

Outlier Detection and Removal Algorithm in K-Means and Hierarchical Clustering World Journal of Computer Application and Technology 5(2): 24-29, 2017 DOI: 10.13189/wjcat.2017.050202 http://www.hrpub.org Outlier Detection and Removal Algorithm in K-Means and Hierarchical Clustering

More information

Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms

Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms Binoda Nand Prasad*, Mohit Rathore**, Geeta Gupta***, Tarandeep Singh**** *Guru Gobind Singh Indraprastha University,

More information

EVALUATION FUZZY NUMBERS BASED ON RMS

EVALUATION FUZZY NUMBERS BASED ON RMS EVALUATION FUZZY NUMBERS BASED ON RMS *Adel Asgari Safdar Young Researchers and Elite Club, Baft Branch, Islamic Azad University, Baft, Iran *Author for Correspondence ABSTRACT We suggest a new approach

More information

A New Fuzzy Neural System with Applications

A New Fuzzy Neural System with Applications A New Fuzzy Neural System with Applications Yuanyuan Chai 1, Jun Chen 1 and Wei Luo 1 1-China Defense Science and Technology Information Center -Network Center Fucheng Road 26#, Haidian district, Beijing

More information

Rotation Perturbation Technique for Privacy Preserving in Data Stream Mining

Rotation Perturbation Technique for Privacy Preserving in Data Stream Mining 218 IJSRSET Volume 4 Issue 8 Print ISSN: 2395-199 Online ISSN : 2394-499 Themed Section : Engineering and Technology Rotation Perturbation Technique for Privacy Preserving in Data Stream Mining Kalyani

More information

NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM

NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM Saroj 1, Ms. Kavita2 1 Student of Masters of Technology, 2 Assistant Professor Department of Computer Science and Engineering JCDM college

More information

Component grouping for GT applications - a fuzzy clustering approach with validity measure

Component grouping for GT applications - a fuzzy clustering approach with validity measure Loughborough University Institutional Repository Component grouping for GT applications - a fuzzy clustering approach with validity measure This item was submitted to Loughborough University's Institutional

More information

Machine Learning. B. Unsupervised Learning B.1 Cluster Analysis. Lars Schmidt-Thieme

Machine Learning. B. Unsupervised Learning B.1 Cluster Analysis. Lars Schmidt-Thieme Machine Learning B. Unsupervised Learning B.1 Cluster Analysis Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University of Hildesheim, Germany

More information

Cluster analysis of 3D seismic data for oil and gas exploration

Cluster analysis of 3D seismic data for oil and gas exploration Data Mining VII: Data, Text and Web Mining and their Business Applications 63 Cluster analysis of 3D seismic data for oil and gas exploration D. R. S. Moraes, R. P. Espíndola, A. G. Evsukoff & N. F. F.

More information

Pattern Recognition Methods for Object Boundary Detection

Pattern Recognition Methods for Object Boundary Detection Pattern Recognition Methods for Object Boundary Detection Arnaldo J. Abrantesy and Jorge S. Marquesz yelectronic and Communication Engineering Instituto Superior Eng. de Lisboa R. Conselheiro Emídio Navarror

More information

On-Line Monitoring of Particle Shape and Size Distribution in Crystallization Processes through Image Analysis

On-Line Monitoring of Particle Shape and Size Distribution in Crystallization Processes through Image Analysis 17 th European Symposium on Computer Aided Process Engineering ESCAPE17 V. Plesu and P.S. Agachi (Editors) 2007 Elsevier B.V. All rights reserved. 1 On-Line Monitoring of Particle Shape and Size Distribution

More information

CHAPTER 4 AN IMPROVED INITIALIZATION METHOD FOR FUZZY C-MEANS CLUSTERING USING DENSITY BASED APPROACH

CHAPTER 4 AN IMPROVED INITIALIZATION METHOD FOR FUZZY C-MEANS CLUSTERING USING DENSITY BASED APPROACH 37 CHAPTER 4 AN IMPROVED INITIALIZATION METHOD FOR FUZZY C-MEANS CLUSTERING USING DENSITY BASED APPROACH 4.1 INTRODUCTION Genes can belong to any genetic network and are also coordinated by many regulatory

More information

Matrix Inference in Fuzzy Decision Trees

Matrix Inference in Fuzzy Decision Trees Matrix Inference in Fuzzy Decision Trees Santiago Aja-Fernández LPI, ETSIT Telecomunicación University of Valladolid, Spain sanaja@tel.uva.es Carlos Alberola-López LPI, ETSIT Telecomunicación University

More information

High Resolution Remote Sensing Image Classification based on SVM and FCM Qin LI a, Wenxing BAO b, Xing LI c, Bin LI d

High Resolution Remote Sensing Image Classification based on SVM and FCM Qin LI a, Wenxing BAO b, Xing LI c, Bin LI d 2nd International Conference on Electrical, Computer Engineering and Electronics (ICECEE 2015) High Resolution Remote Sensing Image Classification based on SVM and FCM Qin LI a, Wenxing BAO b, Xing LI

More information

Prediction-based diagnosis and loss prevention using qualitative multi-scale models

Prediction-based diagnosis and loss prevention using qualitative multi-scale models European Symposium on Computer Arded Aided Process Engineering 15 L. Puigjaner and A. Espuña (Editors) 2005 Elsevier Science B.V. All rights reserved. Prediction-based diagnosis and loss prevention using

More information

An Endowed Takagi-Sugeno-type Fuzzy Model for Classification Problems

An Endowed Takagi-Sugeno-type Fuzzy Model for Classification Problems Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 11, November 2014,

More information

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION Supervised and unsupervised learning are the two prominent machine learning algorithms used in pattern recognition and classification. In this

More information

Texture Segmentation and Classification in Biomedical Image Processing

Texture Segmentation and Classification in Biomedical Image Processing Texture Segmentation and Classification in Biomedical Image Processing Aleš Procházka and Andrea Gavlasová Department of Computing and Control Engineering Institute of Chemical Technology in Prague Technická

More information

Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method

Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs

More information

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING SECOND EDITION IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING ith Algorithms for ENVI/IDL Morton J. Canty с*' Q\ CRC Press Taylor &. Francis Group Boca Raton London New York CRC

More information

Change Detection in Remotely Sensed Images Based on Image Fusion and Fuzzy Clustering

Change Detection in Remotely Sensed Images Based on Image Fusion and Fuzzy Clustering International Journal of Electronics Engineering Research. ISSN 0975-6450 Volume 9, Number 1 (2017) pp. 141-150 Research India Publications http://www.ripublication.com Change Detection in Remotely Sensed

More information

Integrated management of hierarchical levels: towards a CAPE tool

Integrated management of hierarchical levels: towards a CAPE tool http://dx.doi.org/10.1016/b978-0-444-63428-3.50006-0 Integrated management of hierarchical levels: towards a CAPE tool Canan Dombayci a, Sergio Medina a, Moisès Graells b, Antonio Espuña a * a Chemical

More information

Methods for Intelligent Systems

Methods for Intelligent Systems Methods for Intelligent Systems Lecture Notes on Clustering (II) Davide Eynard eynard@elet.polimi.it Department of Electronics and Information Politecnico di Milano Davide Eynard - Lecture Notes on Clustering

More information

Semi-Supervised Clustering with Partial Background Information

Semi-Supervised Clustering with Partial Background Information Semi-Supervised Clustering with Partial Background Information Jing Gao Pang-Ning Tan Haibin Cheng Abstract Incorporating background knowledge into unsupervised clustering algorithms has been the subject

More information

A novel firing rule for training Kohonen selforganising

A novel firing rule for training Kohonen selforganising A novel firing rule for training Kohonen selforganising maps D. T. Pham & A. B. Chan Manufacturing Engineering Centre, School of Engineering, University of Wales Cardiff, P.O. Box 688, Queen's Buildings,

More information

Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data

Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data EUSFLAT-LFA 2011 July 2011 Aix-les-Bains, France Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data Ludmila Himmelspach 1 Daniel Hommers 1 Stefan Conrad 1 1 Institute of Computer Science,

More information

Dynamic Clustering of Data with Modified K-Means Algorithm

Dynamic Clustering of Data with Modified K-Means Algorithm 2012 International Conference on Information and Computer Networks (ICICN 2012) IPCSIT vol. 27 (2012) (2012) IACSIT Press, Singapore Dynamic Clustering of Data with Modified K-Means Algorithm Ahamed Shafeeq

More information

SYDE Winter 2011 Introduction to Pattern Recognition. Clustering

SYDE Winter 2011 Introduction to Pattern Recognition. Clustering SYDE 372 - Winter 2011 Introduction to Pattern Recognition Clustering Alexander Wong Department of Systems Design Engineering University of Waterloo Outline 1 2 3 4 5 All the approaches we have learned

More information

A Distance-Based Classifier Using Dissimilarity Based on Class Conditional Probability and Within-Class Variation. Kwanyong Lee 1 and Hyeyoung Park 2

A Distance-Based Classifier Using Dissimilarity Based on Class Conditional Probability and Within-Class Variation. Kwanyong Lee 1 and Hyeyoung Park 2 A Distance-Based Classifier Using Dissimilarity Based on Class Conditional Probability and Within-Class Variation Kwanyong Lee 1 and Hyeyoung Park 2 1. Department of Computer Science, Korea National Open

More information

Multi-Phase Analysis Framework for Handling Batch Process Data

Multi-Phase Analysis Framework for Handling Batch Process Data Multi-Phase Analysis for Handling Batch Process Data J. Camacho, J. Picó Department of Systems Engineering and Control. A. Ferrer Department of Applied Statistics, Operations Research and Quality. Technical

More information

Replacement of Missing Data and Outliers Using Wavelet Transform Methods

Replacement of Missing Data and Outliers Using Wavelet Transform Methods Replacement of Missing Data and Outliers Using Wavelet Transform Methods Liqian Zhang, Research Associate Department of Chemical and Materials Engineering University of Alberta Outline 2 1. Motivation

More information

MultiGrid-Based Fuzzy Systems for Function Approximation

MultiGrid-Based Fuzzy Systems for Function Approximation MultiGrid-Based Fuzzy Systems for Function Approximation Luis Javier Herrera 1,Héctor Pomares 1, Ignacio Rojas 1, Olga Valenzuela 2, and Mohammed Awad 1 1 University of Granada, Department of Computer

More information

Unsupervised learning in Vision

Unsupervised learning in Vision Chapter 7 Unsupervised learning in Vision The fields of Computer Vision and Machine Learning complement each other in a very natural way: the aim of the former is to extract useful information from visual

More information

Computational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions

Computational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions Computational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions Thomas Giraud Simon Chabot October 12, 2013 Contents 1 Discriminant analysis 3 1.1 Main idea................................

More information

Temperature Calculation of Pellet Rotary Kiln Based on Texture

Temperature Calculation of Pellet Rotary Kiln Based on Texture Intelligent Control and Automation, 2017, 8, 67-74 http://www.scirp.org/journal/ica ISSN Online: 2153-0661 ISSN Print: 2153-0653 Temperature Calculation of Pellet Rotary Kiln Based on Texture Chunli Lin,

More information

Multidirectional 2DPCA Based Face Recognition System

Multidirectional 2DPCA Based Face Recognition System Multidirectional 2DPCA Based Face Recognition System Shilpi Soni 1, Raj Kumar Sahu 2 1 M.E. Scholar, Department of E&Tc Engg, CSIT, Durg 2 Associate Professor, Department of E&Tc Engg, CSIT, Durg Email:

More information

An indirect tire identification method based on a two-layered fuzzy scheme

An indirect tire identification method based on a two-layered fuzzy scheme Journal of Intelligent & Fuzzy Systems 29 (2015) 2795 2800 DOI:10.3233/IFS-151984 IOS Press 2795 An indirect tire identification method based on a two-layered fuzzy scheme Dailin Zhang, Dengming Zhang,

More information

Texture Image Segmentation using FCM

Texture Image Segmentation using FCM Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Texture Image Segmentation using FCM Kanchan S. Deshmukh + M.G.M

More information

Machine Learning. B. Unsupervised Learning B.1 Cluster Analysis. Lars Schmidt-Thieme, Nicolas Schilling

Machine Learning. B. Unsupervised Learning B.1 Cluster Analysis. Lars Schmidt-Thieme, Nicolas Schilling Machine Learning B. Unsupervised Learning B.1 Cluster Analysis Lars Schmidt-Thieme, Nicolas Schilling Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University of Hildesheim,

More information

Automatic basis selection for RBF networks using Stein s unbiased risk estimator

Automatic basis selection for RBF networks using Stein s unbiased risk estimator Automatic basis selection for RBF networks using Stein s unbiased risk estimator Ali Ghodsi School of omputer Science University of Waterloo University Avenue West NL G anada Email: aghodsib@cs.uwaterloo.ca

More information

Mixture Models and the EM Algorithm

Mixture Models and the EM Algorithm Mixture Models and the EM Algorithm Padhraic Smyth, Department of Computer Science University of California, Irvine c 2017 1 Finite Mixture Models Say we have a data set D = {x 1,..., x N } where x i is

More information

Robust Kernel Methods in Clustering and Dimensionality Reduction Problems

Robust Kernel Methods in Clustering and Dimensionality Reduction Problems Robust Kernel Methods in Clustering and Dimensionality Reduction Problems Jian Guo, Debadyuti Roy, Jing Wang University of Michigan, Department of Statistics Introduction In this report we propose robust

More information

9.1. K-means Clustering

9.1. K-means Clustering 424 9. MIXTURE MODELS AND EM Section 9.2 Section 9.3 Section 9.4 view of mixture distributions in which the discrete latent variables can be interpreted as defining assignments of data points to specific

More information

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract 19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms

More information

Improving the Wang and Mendel s Fuzzy Rule Learning Method by Inducing Cooperation Among Rules 1

Improving the Wang and Mendel s Fuzzy Rule Learning Method by Inducing Cooperation Among Rules 1 Improving the Wang and Mendel s Fuzzy Rule Learning Method by Inducing Cooperation Among Rules 1 J. Casillas DECSAI, University of Granada 18071 Granada, Spain casillas@decsai.ugr.es O. Cordón DECSAI,

More information

Web Based Fuzzy Clustering Analysis

Web Based Fuzzy Clustering Analysis Research Inventy: International Journal Of Engineering And Science Vol.4, Issue 11 (November2014), PP 51-57 Issn (e): 2278-4721, Issn (p):2319-6483, www.researchinventy.com Web Based Fuzzy Clustering Analysis

More information

CSE 547: Machine Learning for Big Data Spring Problem Set 2. Please read the homework submission policies.

CSE 547: Machine Learning for Big Data Spring Problem Set 2. Please read the homework submission policies. CSE 547: Machine Learning for Big Data Spring 2019 Problem Set 2 Please read the homework submission policies. 1 Principal Component Analysis and Reconstruction (25 points) Let s do PCA and reconstruct

More information

Image Analysis, Classification and Change Detection in Remote Sensing

Image Analysis, Classification and Change Detection in Remote Sensing Image Analysis, Classification and Change Detection in Remote Sensing WITH ALGORITHMS FOR ENVI/IDL Morton J. Canty Taylor &. Francis Taylor & Francis Group Boca Raton London New York CRC is an imprint

More information

Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei i Han 1 University of Illinois, IBM TJ Watson.

Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei i Han 1 University of Illinois, IBM TJ Watson. Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei i Han 1 University of Illinois, IBM TJ Watson Debapriya Basu Determine outliers in information networks Compare various algorithms

More information

Modeling VM Performance Interference with Fuzzy MIMO Model

Modeling VM Performance Interference with Fuzzy MIMO Model Modeling VM Performance Interference with Fuzzy MIMO Model ABSTRACT Virtual machines (VM) can be a powerful platform for multiplexing resources for applications workloads on demand in datacenters and cloud

More information

Combining Gabor Features: Summing vs.voting in Human Face Recognition *

Combining Gabor Features: Summing vs.voting in Human Face Recognition * Combining Gabor Features: Summing vs.voting in Human Face Recognition * Xiaoyan Mu and Mohamad H. Hassoun Department of Electrical and Computer Engineering Wayne State University Detroit, MI 4822 muxiaoyan@wayne.edu

More information

Creating Time-Varying Fuzzy Control Rules Based on Data Mining

Creating Time-Varying Fuzzy Control Rules Based on Data Mining Research Journal of Applied Sciences, Engineering and Technology 4(18): 3533-3538, 01 ISSN: 040-7467 Maxwell Scientific Organization, 01 Submitted: April 16, 01 Accepted: May 18, 01 Published: September

More information

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,

More information

10701 Machine Learning. Clustering

10701 Machine Learning. Clustering 171 Machine Learning Clustering What is Clustering? Organizing data into clusters such that there is high intra-cluster similarity low inter-cluster similarity Informally, finding natural groupings among

More information

Mostafa Naghizadeh and Mauricio D. Sacchi

Mostafa Naghizadeh and Mauricio D. Sacchi Ground-roll elimination by scale and direction guided curvelet transform Mostafa Naghizadeh and Mauricio D. Sacchi Summary We propose a curvelet domain strategy to subtract ground-roll from seismic records.

More information

FACE RECOGNITION USING INDEPENDENT COMPONENT

FACE RECOGNITION USING INDEPENDENT COMPONENT Chapter 5 FACE RECOGNITION USING INDEPENDENT COMPONENT ANALYSIS OF GABORJET (GABORJET-ICA) 5.1 INTRODUCTION PCA is probably the most widely used subspace projection technique for face recognition. A major

More information

Applied Fuzzy C-means Clustering to Operation Evaluation for Gastric Cancer Patients

Applied Fuzzy C-means Clustering to Operation Evaluation for Gastric Cancer Patients Applied Fuzzy C-means Clustering to Operation Evaluation for Gastric Cancer Patients Hang Zettervall, Elisabeth Rakus-Andersson Department of Mathematics and Science Blekinge Institute of Technology 3779

More information

FUZZY KERNEL K-MEDOIDS ALGORITHM FOR MULTICLASS MULTIDIMENSIONAL DATA CLASSIFICATION

FUZZY KERNEL K-MEDOIDS ALGORITHM FOR MULTICLASS MULTIDIMENSIONAL DATA CLASSIFICATION FUZZY KERNEL K-MEDOIDS ALGORITHM FOR MULTICLASS MULTIDIMENSIONAL DATA CLASSIFICATION 1 ZUHERMAN RUSTAM, 2 AINI SURI TALITA 1 Senior Lecturer, Department of Mathematics, Faculty of Mathematics and Natural

More information

An Approach for Fuzzy Modeling based on Self-Organizing Feature Maps Neural Network

An Approach for Fuzzy Modeling based on Self-Organizing Feature Maps Neural Network Appl. Math. Inf. Sci. 8, No. 3, 27-2 (24) 27 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/.278/amis/8334 An Approach for Fuzzy Modeling based on Self-Organizing

More information

Self-Organized Similarity based Kernel Fuzzy Clustering Model and Its Applications

Self-Organized Similarity based Kernel Fuzzy Clustering Model and Its Applications Fifth International Workshop on Computational Intelligence & Applications IEEE SMC Hiroshima Chapter, Hiroshima University, Japan, November 10, 11 & 12, 2009 Self-Organized Similarity based Kernel Fuzzy

More information

Introduction to digital image classification

Introduction to digital image classification Introduction to digital image classification Dr. Norman Kerle, Wan Bakx MSc a.o. INTERNATIONAL INSTITUTE FOR GEO-INFORMATION SCIENCE AND EARTH OBSERVATION Purpose of lecture Main lecture topics Review

More information

CHAPTER-6 WEB USAGE MINING USING CLUSTERING

CHAPTER-6 WEB USAGE MINING USING CLUSTERING CHAPTER-6 WEB USAGE MINING USING CLUSTERING 6.1 Related work in Clustering Technique 6.2 Quantifiable Analysis of Distance Measurement Techniques 6.3 Approaches to Formation of Clusters 6.4 Conclusion

More information

CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS

CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS 38 CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS 3.1 PRINCIPAL COMPONENT ANALYSIS (PCA) 3.1.1 Introduction In the previous chapter, a brief literature review on conventional

More information

An Adaptive Threshold LBP Algorithm for Face Recognition

An Adaptive Threshold LBP Algorithm for Face Recognition An Adaptive Threshold LBP Algorithm for Face Recognition Xiaoping Jiang 1, Chuyu Guo 1,*, Hua Zhang 1, and Chenghua Li 1 1 College of Electronics and Information Engineering, Hubei Key Laboratory of Intelligent

More information

AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION

AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION WILLIAM ROBSON SCHWARTZ University of Maryland, Department of Computer Science College Park, MD, USA, 20742-327, schwartz@cs.umd.edu RICARDO

More information

Optimization Under Fuzzy If-Then Rules Using Stochastic Algorithms

Optimization Under Fuzzy If-Then Rules Using Stochastic Algorithms European Symposium on Computer Arded Aided Process Engineering 5 L. Puigjaner and A. Espuña (Editors) 25 Elsevier Science B.V. All rights reserved. Optimization Under Fuzzy If-Then Rules Using Stochastic

More information

Keywords - Fuzzy rule-based systems, clustering, system design

Keywords - Fuzzy rule-based systems, clustering, system design CHAPTER 7 Application of Fuzzy Rule Base Design Method Peter Grabusts In many classification tasks the final goal is usually to determine classes of objects. The final goal of fuzzy clustering is also

More information

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a International Conference on Education Technology, Management and Humanities Science (ETMHS 2015) Research on Applications of Data Mining in Electronic Commerce Xiuping YANG 1, a 1 Computer Science Department,

More information

Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM

Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Lu Chen and Yuan Hang PERFORMANCE DEGRADATION ASSESSMENT AND FAULT DIAGNOSIS OF BEARING BASED ON EMD AND PCA-SOM.

More information