Data Mining Approaches to Characterize Batch Process Operations
|
|
- Martin McCoy
- 5 years ago
- Views:
Transcription
1 Data Mining Approaches to Characterize Batch Process Operations Rodolfo V. Tona V., Antonio Espuña and Luis Puigjaner * Universitat Politècnica de Catalunya, Chemical Engineering Department. Diagonal 647, Barcelona, Spain. Abstract n this work, an approach to mine data from batch process operations is presented. The aim is to extract knowledge from data and to support the design or redesign of monitoring systems. Multivariate models at recipe and lower levels are obtained by a multiscale PCA (MsPCA) approach. Then, fuzzy clustering is used to help to identify operational conditions by each product recipe. Cluster membership information is used to define effective rules that aid to characterise the operation of the plant for future productions. How to handle time-varying trajectories and how to catch their associated dynamics with the multiscale PCA is specially considered. An example based on a real pilot plant is used for illustrative purposes. Keywords: Data Mining, Batch Process, MsPCA, Fuzzy Clustering. 1. ntroduction Nowadays, large amounts of process operational data are recorded in Chemical Plants. t has been recognised that these data have a great potential to provide insight into the process (Stockill, 2002). So, developments of advanced data analysis tools and methods are required. Particularly, adoptions and applications of Data Mining approaches that aid to extract useful knowledge from data are claimed (Stockill, 2002, Wang, 2001). Some recent proposals have been made to support monitoring of continuous and batch processes. For batch, existing multivariate methods like Multiway PCA (MPCA) and PLS (Nomikos et al, 1994) has been proposed to obtain reduced characterisations of productions. However, in these methods, issues like time varying operations, outliers, production by recipes and the transitory dynamics of trend variables are not well treated or solved. Also, issues like the multiscale nature of the data have been separately considered. n the case of MPCA it is assumed that operating times of all batches are equal. However, this is not true in many real applications. Extensions to solve the timevarying problem have been proposed by use of aligning of variables (resampling) by reference to a variable indicator with dynamic time warping techniques (Kassidas et al, 1998). The disadvantage of this is that multiscale feature of variables are not taken into account. Additionally, some variables cannot be available at the corresponding resampling intervals. Chen et al (2000) uses orthonormal basis functions to represent the variables profiles and then build an MPCA model over the coefficients of the * To whom correspondence should be addressed mail: luis.puigjaner@upc.es
2 orthonormal functions. This last approach is more appropriate to try the time varying problem, but important issues like the selection of the functions and the multiscale nature of the data are not considered. Clustering has also been proposed for batch operation (Yuan et al., 2001). t is used together with PCA to support the identification of operating conditions and to the design of the monitoring system. The issue of operation by recipes into the analysis is also considered. Nevertheless, time varying, outliers and multiscale are not considered here. As a consequence, the identification and the obtained monitoring model will be suboptimal. n this work an alternative approach is proposed to explore batch operational data and to assist in the design or redesign of monitoring system. The integration of orthonormal bases functions with PCA is adopted by using Wavelet. The resulting MsPCA is combined with Fuzzy Clustering. The identification with clustering allows the generation of operation rules that improves the knowledge of the process and serves as a monitoring system together with the MsPCA. 2. Proposed Approach 2.1 Multiscale Modelling of Data Multivariate Statistical techniques have been extensively used for monitoring Batch Processes. MPCA is one of the most known methods. To apply this method it is assumed that experimental data form a three-dimensional array. The resulting matrix, Xo, is of xjxk, where J variables are measured at K times in each one of the batches. Then, Xo can be unfolded into a large two-dimensional matrix, X, of xjk (figure 1). Then, PCA can be over this unfolded matrix. n the method, it is supposed equal operation time (K) for all the batches which limits the application to the cases where batches are different in time. Observations K Batches Vari ables J J xk Variables x Observations Figure 1. Unfolding of the Three-way Batch data set. To overcome this problem we adopt an approach based on function approximation (Chen et al, 2001). n this approach, the matrix X is obtained like in MPCA, but the resulting matrix is ordered as follows: [ X ] xjk k = 1) k = 2)... k = K ) = M k = 1) k = 2)... k = K ) L L L x J ( k = 1) x J ( k = 2)... x J ( k = K ) M k = 1) k = 2)... k = K ) (1)
3 n the above matrix, each element is the profile of variable x j in batch run i. These profiles can be represented by ƒ i,j (t) functions. Chen (2001) proposed the use of approximation functions to obtain ƒ i,j (t). Approximation functions constitute sets of orthonormal bases with very good properties for represent signals. They allow representing ƒ(x) as: N 1 n= 0 f ( x) c n φ ( t) (2) n where, C N = {c n } n=0,1,,n-1., and {φ n (t)}, represent a set of square integral functions. Then, based on Lagrange polynomial functions, equation (1) becomes: [ X ] M L M = [] c [] c L [] c xjk f1,1 ( t) L f1, J ( t) = [ xn xn xnj ] (3) 11 2 f,1( t) L f, J ( t) where N j is the required number of bases in ƒ i,j (t) to approximate the measurements j and [c] xnj is the trajectory coefficient matrix of measurement j which is spanned by N j. N j is always the same on normal operation. So, by applying PCA on X is obtained a good representation of batches with different time duration. The above solution strategy is proposed over one scale. However, it has been widely recognised the multiscale nature of chemical data (Bakshi, 1998). Also, the selection of Lagrange polynomials is not an obvious alternative. n this work, all this is solved by using Wavelets. Wavelets are families of functions with very good properties as orthonormal functions. By combining approximations, at different scales, they are able to catch fine details and trends with very good accuracy. So, the resulting approximation is expressed as: f ( x) = d mk ψ ( t) (4) m= 1 k= mk where d m represents the c n coefficients in equation 2 at the m th scale and ψ m define the k th basis functions at the m th scale. n relation to the selection of the function, Daubechies wavelets are used because of their very good capabilities to represent polynomial behaviour. The extraction of functions with wavelets is achieved through de-noising (Nounou et al,1999). t allows to eliminate the effect of noise with clear advantages over subsequent analysis. So, the matrix X is built by way of the d m denoised coefficients and PCA is applied on it. There is a clear difference with the multiscale PCA proposed by Bakshi (1998). Here, PCA is only built over the complete wavelets coefficients matrix and the de-noising is ensured before PCA. 2.2 Fuzzy Clustering Clustering are techniques that attempts to assess the relationships among data patterns belonging to different groups. n this work, fuzzy clustering is adopted with the purpose to identify operating region patterns and as a base for rules generation. Fuzzy-c-mean is used as the clustering technique. t is an algorithm that can automatically identify the centre of each cluster and calculate the membership values of each data case to each cluster. t is based on the minimization of the sum of squared Euclidean distances between data (X k, k=1 n) and cluster centres (v i, i =1 c):
4 Min J m c n m ( U, V ) = ( µ ) x v (5) 1 k= 1 ik k i 2 where 1 m is the fuzziness index, c is the number of clusters, and µ ik denotes the matrix of a fuzzy c-partition. The last fuzzy c-partition is constrained as follows: µ ik [0,1] i, k, µ ik = 1 k, C i = 1 n k = 1 µ ik < n, i. (6) n other words, each X k could belong to more than one cluster with each membership (µ) taking a fractional value between 0 and 1. The details of the algorithm are not shown for space reasons (Bezdeck, 1981). However, it is noted that the algorithm is dependently of C. Also, because the objective function is based on Euclidian distances, the method tends to identify clusters with only spherical forms. n this work, the Mahalanobis distance is used (equation 5) to extend the identification up to spherical and ellipsoids forms. Additionally, a simple algorithm, the mountain method (Yager et al, 1994), is used as a pre-estimator of C, for cases when it is not known a priori. Rules generation n the above methods, each cluster centre is in essence a prototypical data point that exemplifies characteristics behaviour of a system. Then, the membership information, µ, of each data point allows associating it to a pattern of the system. So, simple rules can be generated by extract the larger membership of each point and relate it to a pattern as follow: f µ J1 is A then {C 1 = pattern 1 } 1 is produced. f µ J2 is B and µ J2 is D then {C = pattern} 1 is produced. Here, A, B and D, can represent values of variables like Temperature, etc. and patterns can express an associated operating condition for a recipe. As a consequence, a more insight into the process can be obtained and used to design a monitoring system. 2.3 Global procedure. Data mining approaches. The above methods are combined to extract knowledge from batch process data. The way by which data is analysed, is determined by two important issues, the production by recipes and the operation development by stages. Two levels of analysis are proposed: One at the recipe level (entire batch) and another at the stage level (sub-step). At the first level, the overall data from different batches are used. A matrix, X c, with all process variables, is built together with a separate matrix of important quality variables, X Qi. Wavelet coefficient matrices for each matrix (X c, X Qi ) are obtained. The resulting matrices are processed with PCA. Reduced representations (patterns) of batches and the profile of each X Qi are obtained. Next, clustering is applied. Groups of patterns and its memberships µ to groups are obtained. Groups of one or two objects, with their respective coefficients into X c and X Qi are rejected as outliers and PCA models are obtained again. This rejection step allows to eliminate some possible abnormal batches and to select data of good batches. When no rejections are registered, groups of operations are defined. t can occur that some recipes will be grouped in a same cluster
5 which suggests similar operating conditions and, possibly, a single model for these recipes. n a second step, the data set obtained in the above step is used and the recipes identified as similar are analysed together. A pre-processing step with wavelet is applied over the variables profiles to identify stages. Then, MsPCA is applied over individuals X pi matrices and by groups of stages in profiles. The obtained information, µ, is mapped onto the µ information at the level recipe. So, simple rules about conditions in each stage that can conduct to a product grade of a recipe are derived. Finally, data rejected at the recipe level is analysed to identify the potentially abnormal operation. So, the knowledge about the process is expanded. t should be noted that a similar analysis (by levels) has already been proposed by Yuan et al, (2001). They first apply the stage level analysis to obtain a reduced representation of variables with the most significant principal components (PC s). Then, PCA is applied over PC s. However, PC s are not well suited to catch the dynamic trend information of variables. Our proposed use of wavelets is much more appropriate for this task. Checking about abnormal data is no made and the presence of outliers is not considered by Yuan et al., which can mask the identification of groups. Additionally, their approach can not be applied over time varying processes. 3. Batch Pilot Plant Application A Real pilot plant at UPC has been selected as the scenario for testing purposes. t contains three reactors, heat exchangers and a highly flexible connectivity between them that is achieved via a network of pipes, pumps and valves. t has been used to generate data for several products recipes. A total of 24 experiments are generated with different length in operation time. First, the analysis over the recipe level is made. Figure 2a. µ values with a bad batch Figure 2b. µ values without bad batches Membership values (µ) of batches to different groups are obtained (figure 2a). The groups are verified as representing specific recipes. Also, it is noted the effect of a bad batch with a low µ in recipe 3. When it is rejected, the definition of the groups is improved (figure 2b). Subsequent analysis help to identify different operating conditions associated to the existing recipes. Then, operating rules about each one of the recipes are obtained. t can be noted that in the application of this analysis the process variables were recorded with a frequency of one minute while the quality
6 variables were recorded at a frequency of 5-10 minutes. Because PCA is applied over the coefficients of wavelet approximations of each signal, the low difference in sampling is not limiting. However, it should be noted that the method can not be applied in cases with larger differences between sampling frequencies. 4. Conclusions A new methodology to explore batch processes has been proposed. The methodology is capable of deal with important issues like time varying trajectories and outliers. Also, it is appropriate to represent operations, by stages and by recipes, with rules. This last capability is useful to improve the understanding of the process. Also, it is shown as very useful to the design or redesign of monitoring systems. Data about the pilot plant have served to illustrate the methodology. Nevertheless, additional applications over this and/or other real scenarios should be made to establish the generalization of the method. Also, the problem of different sample frequencies must be additionally studied. Finally, the approach is observed as potentially useful to obtain a root cause analysis databases to support tasks like equipment maintenance or scheduling. All this, will be explored in future works. Acknowledgment Financial support from the Generalitat de Catalunya (F research grant for Tona, R.V.) and from the European Community (projects VPNET-GRD-CT and CHEM-GRD-CT ) are gratefully acknowledged. References Bakshi, B. R., 1998, AChE Journal, 44, 7, Bezdek, J., Pattern recognition with fuzzy objective function algorithms, Plenum, N.Y. Chen, J., and Liu, J., 2001, Chem. Eng. Sci., 56, 10, Kassidas, A., MacGregor, J.F., and Taylor, P., 1998, AChE J, 44, 4, pp Nomikos, P. and MacGregor, J.F., 1995, Technometrics, 37, 1, Nounou, M. N. and Bakshi, B. R., 1999, AChE Journal, 45, 5, Stockill, D., 2002, ESCAPE-12 (Ed. Grievink, J., Schjindel, J.,), Elsevier, 70-77, Amsterdam, Netherlands. Wang, X.Z., Application of Neural Networks and other Learning Technologies in Process Engineering. London : mperial College Press. Yager, R.R, and Filev, D.P., 1994, EEE Trans. on Syst., Man, & Cyb., 24, 8, pp Yuan, and Wang, X.Z., 2001, Chem. Eng. Comm., 185,
Combining Complementary Scheduling Approaches into an Enhanced Modular Software
Combining Complementary Scheduling Approaches into an Enhanced Modular Software Jordi Cantón 1, Moisès Graells 1, Antonio Espuña 1, Luis Puigjaner 1* Wesley Alvarenga 2, Maria Teresa Rodrígues 2, Luis
More informationHFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging
007 International Conference on Convergence Information Technology HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging Lixin Han,, Guihai Chen Department of Computer Science and Engineering,
More informationA Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data
Journal of Computational Information Systems 11: 6 (2015) 2139 2146 Available at http://www.jofcis.com A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data
More informationDesign of Fault Diagnosis System of FPSO Production Process Based on MSPCA
2009 Fifth International Conference on Information Assurance and Security Design of Fault Diagnosis System of FPSO Production Process Based on MSPCA GAO Qiang, HAN Miao, HU Shu-liang, DONG Hai-jie ianjin
More informationHARD, SOFT AND FUZZY C-MEANS CLUSTERING TECHNIQUES FOR TEXT CLASSIFICATION
HARD, SOFT AND FUZZY C-MEANS CLUSTERING TECHNIQUES FOR TEXT CLASSIFICATION 1 M.S.Rekha, 2 S.G.Nawaz 1 PG SCALOR, CSE, SRI KRISHNADEVARAYA ENGINEERING COLLEGE, GOOTY 2 ASSOCIATE PROFESSOR, SRI KRISHNADEVARAYA
More informationFuzzy-Kernel Learning Vector Quantization
Fuzzy-Kernel Learning Vector Quantization Daoqiang Zhang 1, Songcan Chen 1 and Zhi-Hua Zhou 2 1 Department of Computer Science and Engineering Nanjing University of Aeronautics and Astronautics Nanjing
More informationA Brief Overview of Robust Clustering Techniques
A Brief Overview of Robust Clustering Techniques Robust Clustering Olfa Nasraoui Department of Computer Engineering & Computer Science University of Louisville, olfa.nasraoui_at_louisville.edu There are
More informationSELECTION OF A MULTIVARIATE CALIBRATION METHOD
SELECTION OF A MULTIVARIATE CALIBRATION METHOD 0. Aim of this document Different types of multivariate calibration methods are available. The aim of this document is to help the user select the proper
More informationTo be presented at the American Control Conference, Denver, CO, June 4 6, Data Compression Issues with Pattern Matching in Historical Data
To be presented at the American Control Conference, Denver, CO, June 4 6, 2003 Data Compression Issues with Pattern Matching in Historical Data Ashish Singhal Dale E. Seborg Department of Chemical Engineering
More informationImproved Version of Kernelized Fuzzy C-Means using Credibility
50 Improved Version of Kernelized Fuzzy C-Means using Credibility Prabhjot Kaur Maharaja Surajmal Institute of Technology (MSIT) New Delhi, 110058, INDIA Abstract - Fuzzy c-means is a clustering algorithm
More informationNovel Intuitionistic Fuzzy C-Means Clustering for Linearly and Nonlinearly Separable Data
Novel Intuitionistic Fuzzy C-Means Clustering for Linearly and Nonlinearly Separable Data PRABHJOT KAUR DR. A. K. SONI DR. ANJANA GOSAIN Department of IT, MSIT Department of Computers University School
More informationReal-time Monitoring of Multi-mode Industrial Processes using Feature-extraction Tools
Real-time Monitoring of Multi-mode Industrial Processes using Feature-extraction Tools Y. S. Manjili *, M. Niknamfar, M. Jamshidi Department of Electrical and Computer Engineering The University of Texas
More informationA Fuzzy Rule Based Clustering
A Fuzzy Rule Based Clustering Sachin Ashok Shinde 1, Asst.Prof.Y.R.Nagargoje 2 Student, Computer Science & Engineering Department, Everest College of Engineering, Aurangabad, India 1 Asst.Prof, Computer
More informationOPTIMIZATION. Optimization. Derivative-based optimization. Derivative-free optimization. Steepest descent (gradient) methods Newton s method
OPTIMIZATION Optimization Derivative-based optimization Steepest descent (gradient) methods Newton s method Derivative-free optimization Simplex Simulated Annealing Genetic Algorithms Ant Colony Optimization...
More informationData-driven fault detection with process topology for fault identification
Preprints of the 19th World Congress The International Federation of Automatic Control Data-driven fault detection with process topology for fault identification Brian S. Lindner*. Lidia Auret** * Department
More informationA MODIFIED FUZZY C-REGRESSION MODEL CLUSTERING ALGORITHM FOR T-S FUZZY MODEL IDENTIFICATION
20 8th International Multi-Conference on Systems, Signals & Devices A MODIFIED FUZZY C-REGRESSION MODEL CLUSTERING ALGORITHM FOR T-S FUZZY MODEL IDENTIFICATION Moêz. Soltani, Borhen. Aissaoui 2, Abdelader.
More informationHydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit
Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit Zihan Chen 1, Chen Lu 2, Hang Yuan 3 School of Reliability and Systems Engineering, Beihang University,
More informationEuropean Journal of Science and Engineering Vol. 1, Issue 1, 2013 ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM IDENTIFICATION OF AN INDUCTION MOTOR
ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM IDENTIFICATION OF AN INDUCTION MOTOR Ahmed A. M. Emam College of Engineering Karrary University SUDAN ahmedimam1965@yahoo.co.in Eisa Bashier M. Tayeb College of Engineering
More informationA New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering
A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering Nghiem Van Tinh 1, Vu Viet Vu 1, Tran Thi Ngoc Linh 1 1 Thai Nguyen University of
More informationMultivariate Analysis
Multivariate Analysis Cluster Analysis Prof. Dr. Anselmo E de Oliveira anselmo.quimica.ufg.br anselmo.disciplinas@gmail.com Unsupervised Learning Cluster Analysis Natural grouping Patterns in the data
More informationECM A Novel On-line, Evolving Clustering Method and Its Applications
ECM A Novel On-line, Evolving Clustering Method and Its Applications Qun Song 1 and Nikola Kasabov 2 1, 2 Department of Information Science, University of Otago P.O Box 56, Dunedin, New Zealand (E-mail:
More informationKuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude
Fuzzy k-nn applied to moulds detection. Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude Communication presented at ISOEN 2003 RIGA- Latvia Introduction Excessive
More informationA robust optimization based approach to the general solution of mp-milp problems
21 st European Symposium on Computer Aided Process Engineering ESCAPE 21 E.N. Pistikopoulos, M.C. Georgiadis and A. Kokossis (Editors) 2011 Elsevier B.V. All rights reserved. A robust optimization based
More informationCluster Analysis. Ying Shen, SSE, Tongji University
Cluster Analysis Ying Shen, SSE, Tongji University Cluster analysis Cluster analysis groups data objects based only on the attributes in the data. The main objective is that The objects within a group
More informationFAULT DIAGNOSIS BASED ON MULTI-SCALE CLASSIFICATION USING KERNEL FISHER DISCRIMINANT ANALYSIS AND GAUSSIAN MIXTURE MODEL AND K-NEAREST NEIGHBOR METHOD
Universiti Kebangsaan Malaysia FAULT DIAGNOSIS BASED ON MULTI-SCALE CLASSIFICATION USING KERNEL FISHER DISCRIMINANT ANALYSIS AND GAUSSIAN MIXTURE MODEL AND K-NEAREST NEIGHBOR METHOD NORAZWAN M. NOR*, MOHD
More informationGraph Embedding in Vector Spaces
Graph Embedding in Vector Spaces GbR 2011 Mini-tutorial Jaume Gibert, Ernest Valveny Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain Horst Bunke Institute of Computer Science
More informationFuzzy Segmentation. Chapter Introduction. 4.2 Unsupervised Clustering.
Chapter 4 Fuzzy Segmentation 4. Introduction. The segmentation of objects whose color-composition is not common represents a difficult task, due to the illumination and the appropriate threshold selection
More informationA NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS
A NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS Ahmed Abdelfattah Saleh University of Brasilia, Brasil ahmdsalh@yahoo.com
More informationLearning a Manifold as an Atlas Supplementary Material
Learning a Manifold as an Atlas Supplementary Material Nikolaos Pitelis Chris Russell School of EECS, Queen Mary, University of London [nikolaos.pitelis,chrisr,lourdes]@eecs.qmul.ac.uk Lourdes Agapito
More informationAn adjustable p-exponential clustering algorithm
An adjustable p-exponential clustering algorithm Valmir Macario 12 and Francisco de A. T. de Carvalho 2 1- Universidade Federal Rural de Pernambuco - Deinfo Rua Dom Manoel de Medeiros, s/n - Campus Dois
More informationQUALITATIVE MODELING FOR MAGNETIZATION CURVE
Journal of Marine Science and Technology, Vol. 8, No. 2, pp. 65-70 (2000) 65 QUALITATIVE MODELING FOR MAGNETIZATION CURVE Pei-Hwa Huang and Yu-Shuo Chang Keywords: Magnetization curve, Qualitative modeling,
More informationOutlier Detection and Removal Algorithm in K-Means and Hierarchical Clustering
World Journal of Computer Application and Technology 5(2): 24-29, 2017 DOI: 10.13189/wjcat.2017.050202 http://www.hrpub.org Outlier Detection and Removal Algorithm in K-Means and Hierarchical Clustering
More informationPerformance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms
Performance Measure of Hard c-means,fuzzy c-means and Alternative c-means Algorithms Binoda Nand Prasad*, Mohit Rathore**, Geeta Gupta***, Tarandeep Singh**** *Guru Gobind Singh Indraprastha University,
More informationEVALUATION FUZZY NUMBERS BASED ON RMS
EVALUATION FUZZY NUMBERS BASED ON RMS *Adel Asgari Safdar Young Researchers and Elite Club, Baft Branch, Islamic Azad University, Baft, Iran *Author for Correspondence ABSTRACT We suggest a new approach
More informationA New Fuzzy Neural System with Applications
A New Fuzzy Neural System with Applications Yuanyuan Chai 1, Jun Chen 1 and Wei Luo 1 1-China Defense Science and Technology Information Center -Network Center Fucheng Road 26#, Haidian district, Beijing
More informationRotation Perturbation Technique for Privacy Preserving in Data Stream Mining
218 IJSRSET Volume 4 Issue 8 Print ISSN: 2395-199 Online ISSN : 2394-499 Themed Section : Engineering and Technology Rotation Perturbation Technique for Privacy Preserving in Data Stream Mining Kalyani
More informationNORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM
NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM Saroj 1, Ms. Kavita2 1 Student of Masters of Technology, 2 Assistant Professor Department of Computer Science and Engineering JCDM college
More informationComponent grouping for GT applications - a fuzzy clustering approach with validity measure
Loughborough University Institutional Repository Component grouping for GT applications - a fuzzy clustering approach with validity measure This item was submitted to Loughborough University's Institutional
More informationMachine Learning. B. Unsupervised Learning B.1 Cluster Analysis. Lars Schmidt-Thieme
Machine Learning B. Unsupervised Learning B.1 Cluster Analysis Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University of Hildesheim, Germany
More informationCluster analysis of 3D seismic data for oil and gas exploration
Data Mining VII: Data, Text and Web Mining and their Business Applications 63 Cluster analysis of 3D seismic data for oil and gas exploration D. R. S. Moraes, R. P. Espíndola, A. G. Evsukoff & N. F. F.
More informationPattern Recognition Methods for Object Boundary Detection
Pattern Recognition Methods for Object Boundary Detection Arnaldo J. Abrantesy and Jorge S. Marquesz yelectronic and Communication Engineering Instituto Superior Eng. de Lisboa R. Conselheiro Emídio Navarror
More informationOn-Line Monitoring of Particle Shape and Size Distribution in Crystallization Processes through Image Analysis
17 th European Symposium on Computer Aided Process Engineering ESCAPE17 V. Plesu and P.S. Agachi (Editors) 2007 Elsevier B.V. All rights reserved. 1 On-Line Monitoring of Particle Shape and Size Distribution
More informationCHAPTER 4 AN IMPROVED INITIALIZATION METHOD FOR FUZZY C-MEANS CLUSTERING USING DENSITY BASED APPROACH
37 CHAPTER 4 AN IMPROVED INITIALIZATION METHOD FOR FUZZY C-MEANS CLUSTERING USING DENSITY BASED APPROACH 4.1 INTRODUCTION Genes can belong to any genetic network and are also coordinated by many regulatory
More informationMatrix Inference in Fuzzy Decision Trees
Matrix Inference in Fuzzy Decision Trees Santiago Aja-Fernández LPI, ETSIT Telecomunicación University of Valladolid, Spain sanaja@tel.uva.es Carlos Alberola-López LPI, ETSIT Telecomunicación University
More informationHigh Resolution Remote Sensing Image Classification based on SVM and FCM Qin LI a, Wenxing BAO b, Xing LI c, Bin LI d
2nd International Conference on Electrical, Computer Engineering and Electronics (ICECEE 2015) High Resolution Remote Sensing Image Classification based on SVM and FCM Qin LI a, Wenxing BAO b, Xing LI
More informationPrediction-based diagnosis and loss prevention using qualitative multi-scale models
European Symposium on Computer Arded Aided Process Engineering 15 L. Puigjaner and A. Espuña (Editors) 2005 Elsevier Science B.V. All rights reserved. Prediction-based diagnosis and loss prevention using
More informationAn Endowed Takagi-Sugeno-type Fuzzy Model for Classification Problems
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 11, November 2014,
More informationChapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION
UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION Supervised and unsupervised learning are the two prominent machine learning algorithms used in pattern recognition and classification. In this
More informationTexture Segmentation and Classification in Biomedical Image Processing
Texture Segmentation and Classification in Biomedical Image Processing Aleš Procházka and Andrea Gavlasová Department of Computing and Control Engineering Institute of Chemical Technology in Prague Technická
More informationNon-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method
Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs
More informationIMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING
SECOND EDITION IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING ith Algorithms for ENVI/IDL Morton J. Canty с*' Q\ CRC Press Taylor &. Francis Group Boca Raton London New York CRC
More informationChange Detection in Remotely Sensed Images Based on Image Fusion and Fuzzy Clustering
International Journal of Electronics Engineering Research. ISSN 0975-6450 Volume 9, Number 1 (2017) pp. 141-150 Research India Publications http://www.ripublication.com Change Detection in Remotely Sensed
More informationIntegrated management of hierarchical levels: towards a CAPE tool
http://dx.doi.org/10.1016/b978-0-444-63428-3.50006-0 Integrated management of hierarchical levels: towards a CAPE tool Canan Dombayci a, Sergio Medina a, Moisès Graells b, Antonio Espuña a * a Chemical
More informationMethods for Intelligent Systems
Methods for Intelligent Systems Lecture Notes on Clustering (II) Davide Eynard eynard@elet.polimi.it Department of Electronics and Information Politecnico di Milano Davide Eynard - Lecture Notes on Clustering
More informationSemi-Supervised Clustering with Partial Background Information
Semi-Supervised Clustering with Partial Background Information Jing Gao Pang-Ning Tan Haibin Cheng Abstract Incorporating background knowledge into unsupervised clustering algorithms has been the subject
More informationA novel firing rule for training Kohonen selforganising
A novel firing rule for training Kohonen selforganising maps D. T. Pham & A. B. Chan Manufacturing Engineering Centre, School of Engineering, University of Wales Cardiff, P.O. Box 688, Queen's Buildings,
More informationCluster Tendency Assessment for Fuzzy Clustering of Incomplete Data
EUSFLAT-LFA 2011 July 2011 Aix-les-Bains, France Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data Ludmila Himmelspach 1 Daniel Hommers 1 Stefan Conrad 1 1 Institute of Computer Science,
More informationDynamic Clustering of Data with Modified K-Means Algorithm
2012 International Conference on Information and Computer Networks (ICICN 2012) IPCSIT vol. 27 (2012) (2012) IACSIT Press, Singapore Dynamic Clustering of Data with Modified K-Means Algorithm Ahamed Shafeeq
More informationSYDE Winter 2011 Introduction to Pattern Recognition. Clustering
SYDE 372 - Winter 2011 Introduction to Pattern Recognition Clustering Alexander Wong Department of Systems Design Engineering University of Waterloo Outline 1 2 3 4 5 All the approaches we have learned
More informationA Distance-Based Classifier Using Dissimilarity Based on Class Conditional Probability and Within-Class Variation. Kwanyong Lee 1 and Hyeyoung Park 2
A Distance-Based Classifier Using Dissimilarity Based on Class Conditional Probability and Within-Class Variation Kwanyong Lee 1 and Hyeyoung Park 2 1. Department of Computer Science, Korea National Open
More informationMulti-Phase Analysis Framework for Handling Batch Process Data
Multi-Phase Analysis for Handling Batch Process Data J. Camacho, J. Picó Department of Systems Engineering and Control. A. Ferrer Department of Applied Statistics, Operations Research and Quality. Technical
More informationReplacement of Missing Data and Outliers Using Wavelet Transform Methods
Replacement of Missing Data and Outliers Using Wavelet Transform Methods Liqian Zhang, Research Associate Department of Chemical and Materials Engineering University of Alberta Outline 2 1. Motivation
More informationMultiGrid-Based Fuzzy Systems for Function Approximation
MultiGrid-Based Fuzzy Systems for Function Approximation Luis Javier Herrera 1,Héctor Pomares 1, Ignacio Rojas 1, Olga Valenzuela 2, and Mohammed Awad 1 1 University of Granada, Department of Computer
More informationUnsupervised learning in Vision
Chapter 7 Unsupervised learning in Vision The fields of Computer Vision and Machine Learning complement each other in a very natural way: the aim of the former is to extract useful information from visual
More informationComputational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions
Computational Statistics The basics of maximum likelihood estimation, Bayesian estimation, object recognitions Thomas Giraud Simon Chabot October 12, 2013 Contents 1 Discriminant analysis 3 1.1 Main idea................................
More informationTemperature Calculation of Pellet Rotary Kiln Based on Texture
Intelligent Control and Automation, 2017, 8, 67-74 http://www.scirp.org/journal/ica ISSN Online: 2153-0661 ISSN Print: 2153-0653 Temperature Calculation of Pellet Rotary Kiln Based on Texture Chunli Lin,
More informationMultidirectional 2DPCA Based Face Recognition System
Multidirectional 2DPCA Based Face Recognition System Shilpi Soni 1, Raj Kumar Sahu 2 1 M.E. Scholar, Department of E&Tc Engg, CSIT, Durg 2 Associate Professor, Department of E&Tc Engg, CSIT, Durg Email:
More informationAn indirect tire identification method based on a two-layered fuzzy scheme
Journal of Intelligent & Fuzzy Systems 29 (2015) 2795 2800 DOI:10.3233/IFS-151984 IOS Press 2795 An indirect tire identification method based on a two-layered fuzzy scheme Dailin Zhang, Dengming Zhang,
More informationTexture Image Segmentation using FCM
Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Texture Image Segmentation using FCM Kanchan S. Deshmukh + M.G.M
More informationMachine Learning. B. Unsupervised Learning B.1 Cluster Analysis. Lars Schmidt-Thieme, Nicolas Schilling
Machine Learning B. Unsupervised Learning B.1 Cluster Analysis Lars Schmidt-Thieme, Nicolas Schilling Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University of Hildesheim,
More informationAutomatic basis selection for RBF networks using Stein s unbiased risk estimator
Automatic basis selection for RBF networks using Stein s unbiased risk estimator Ali Ghodsi School of omputer Science University of Waterloo University Avenue West NL G anada Email: aghodsib@cs.uwaterloo.ca
More informationMixture Models and the EM Algorithm
Mixture Models and the EM Algorithm Padhraic Smyth, Department of Computer Science University of California, Irvine c 2017 1 Finite Mixture Models Say we have a data set D = {x 1,..., x N } where x i is
More informationRobust Kernel Methods in Clustering and Dimensionality Reduction Problems
Robust Kernel Methods in Clustering and Dimensionality Reduction Problems Jian Guo, Debadyuti Roy, Jing Wang University of Michigan, Department of Statistics Introduction In this report we propose robust
More information9.1. K-means Clustering
424 9. MIXTURE MODELS AND EM Section 9.2 Section 9.3 Section 9.4 view of mixture distributions in which the discrete latent variables can be interpreted as defining assignments of data points to specific
More informationChemometrics. Description of Pirouette Algorithms. Technical Note. Abstract
19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms
More informationImproving the Wang and Mendel s Fuzzy Rule Learning Method by Inducing Cooperation Among Rules 1
Improving the Wang and Mendel s Fuzzy Rule Learning Method by Inducing Cooperation Among Rules 1 J. Casillas DECSAI, University of Granada 18071 Granada, Spain casillas@decsai.ugr.es O. Cordón DECSAI,
More informationWeb Based Fuzzy Clustering Analysis
Research Inventy: International Journal Of Engineering And Science Vol.4, Issue 11 (November2014), PP 51-57 Issn (e): 2278-4721, Issn (p):2319-6483, www.researchinventy.com Web Based Fuzzy Clustering Analysis
More informationCSE 547: Machine Learning for Big Data Spring Problem Set 2. Please read the homework submission policies.
CSE 547: Machine Learning for Big Data Spring 2019 Problem Set 2 Please read the homework submission policies. 1 Principal Component Analysis and Reconstruction (25 points) Let s do PCA and reconstruct
More informationImage Analysis, Classification and Change Detection in Remote Sensing
Image Analysis, Classification and Change Detection in Remote Sensing WITH ALGORITHMS FOR ENVI/IDL Morton J. Canty Taylor &. Francis Taylor & Francis Group Boca Raton London New York CRC is an imprint
More informationJing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei i Han 1 University of Illinois, IBM TJ Watson.
Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei i Han 1 University of Illinois, IBM TJ Watson Debapriya Basu Determine outliers in information networks Compare various algorithms
More informationModeling VM Performance Interference with Fuzzy MIMO Model
Modeling VM Performance Interference with Fuzzy MIMO Model ABSTRACT Virtual machines (VM) can be a powerful platform for multiplexing resources for applications workloads on demand in datacenters and cloud
More informationCombining Gabor Features: Summing vs.voting in Human Face Recognition *
Combining Gabor Features: Summing vs.voting in Human Face Recognition * Xiaoyan Mu and Mohamad H. Hassoun Department of Electrical and Computer Engineering Wayne State University Detroit, MI 4822 muxiaoyan@wayne.edu
More informationCreating Time-Varying Fuzzy Control Rules Based on Data Mining
Research Journal of Applied Sciences, Engineering and Technology 4(18): 3533-3538, 01 ISSN: 040-7467 Maxwell Scientific Organization, 01 Submitted: April 16, 01 Accepted: May 18, 01 Published: September
More informationStatistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte
Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,
More information10701 Machine Learning. Clustering
171 Machine Learning Clustering What is Clustering? Organizing data into clusters such that there is high intra-cluster similarity low inter-cluster similarity Informally, finding natural groupings among
More informationMostafa Naghizadeh and Mauricio D. Sacchi
Ground-roll elimination by scale and direction guided curvelet transform Mostafa Naghizadeh and Mauricio D. Sacchi Summary We propose a curvelet domain strategy to subtract ground-roll from seismic records.
More informationFACE RECOGNITION USING INDEPENDENT COMPONENT
Chapter 5 FACE RECOGNITION USING INDEPENDENT COMPONENT ANALYSIS OF GABORJET (GABORJET-ICA) 5.1 INTRODUCTION PCA is probably the most widely used subspace projection technique for face recognition. A major
More informationApplied Fuzzy C-means Clustering to Operation Evaluation for Gastric Cancer Patients
Applied Fuzzy C-means Clustering to Operation Evaluation for Gastric Cancer Patients Hang Zettervall, Elisabeth Rakus-Andersson Department of Mathematics and Science Blekinge Institute of Technology 3779
More informationFUZZY KERNEL K-MEDOIDS ALGORITHM FOR MULTICLASS MULTIDIMENSIONAL DATA CLASSIFICATION
FUZZY KERNEL K-MEDOIDS ALGORITHM FOR MULTICLASS MULTIDIMENSIONAL DATA CLASSIFICATION 1 ZUHERMAN RUSTAM, 2 AINI SURI TALITA 1 Senior Lecturer, Department of Mathematics, Faculty of Mathematics and Natural
More informationAn Approach for Fuzzy Modeling based on Self-Organizing Feature Maps Neural Network
Appl. Math. Inf. Sci. 8, No. 3, 27-2 (24) 27 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/.278/amis/8334 An Approach for Fuzzy Modeling based on Self-Organizing
More informationSelf-Organized Similarity based Kernel Fuzzy Clustering Model and Its Applications
Fifth International Workshop on Computational Intelligence & Applications IEEE SMC Hiroshima Chapter, Hiroshima University, Japan, November 10, 11 & 12, 2009 Self-Organized Similarity based Kernel Fuzzy
More informationIntroduction to digital image classification
Introduction to digital image classification Dr. Norman Kerle, Wan Bakx MSc a.o. INTERNATIONAL INSTITUTE FOR GEO-INFORMATION SCIENCE AND EARTH OBSERVATION Purpose of lecture Main lecture topics Review
More informationCHAPTER-6 WEB USAGE MINING USING CLUSTERING
CHAPTER-6 WEB USAGE MINING USING CLUSTERING 6.1 Related work in Clustering Technique 6.2 Quantifiable Analysis of Distance Measurement Techniques 6.3 Approaches to Formation of Clusters 6.4 Conclusion
More informationCHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS
38 CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS 3.1 PRINCIPAL COMPONENT ANALYSIS (PCA) 3.1.1 Introduction In the previous chapter, a brief literature review on conventional
More informationAn Adaptive Threshold LBP Algorithm for Face Recognition
An Adaptive Threshold LBP Algorithm for Face Recognition Xiaoping Jiang 1, Chuyu Guo 1,*, Hua Zhang 1, and Chenghua Li 1 1 College of Electronics and Information Engineering, Hubei Key Laboratory of Intelligent
More informationAN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION
AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION WILLIAM ROBSON SCHWARTZ University of Maryland, Department of Computer Science College Park, MD, USA, 20742-327, schwartz@cs.umd.edu RICARDO
More informationOptimization Under Fuzzy If-Then Rules Using Stochastic Algorithms
European Symposium on Computer Arded Aided Process Engineering 5 L. Puigjaner and A. Espuña (Editors) 25 Elsevier Science B.V. All rights reserved. Optimization Under Fuzzy If-Then Rules Using Stochastic
More informationKeywords - Fuzzy rule-based systems, clustering, system design
CHAPTER 7 Application of Fuzzy Rule Base Design Method Peter Grabusts In many classification tasks the final goal is usually to determine classes of objects. The final goal of fuzzy clustering is also
More informationResearch on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a
International Conference on Education Technology, Management and Humanities Science (ETMHS 2015) Research on Applications of Data Mining in Electronic Commerce Xiuping YANG 1, a 1 Computer Science Department,
More informationPerformance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM
Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Lu Chen and Yuan Hang PERFORMANCE DEGRADATION ASSESSMENT AND FAULT DIAGNOSIS OF BEARING BASED ON EMD AND PCA-SOM.
More information