Multivariate Data Visualisation
|
|
- Emory Armstrong
- 6 years ago
- Views:
Transcription
1 A Survey of Methods for Multivariate Data Projection, Visualisation and Interactive Analysis Andreas K ONIG Chair of Electronic Devices and Integrated Circuits Dresden University of Technology Mommsenstr. 13, Dresden, Germany Tel.: Fax.: koenig@iee.et.tu-dresden.de Key Words: data structure analysis, visual exploratory data analysis, topology preserving mapping, distance preserving mapping, Gestalt-Theory, Gestalt preserving mapping Abstract In this paper, algorithms for multivariate data projection, based on topology or distance preserving mappings, as well as tools and techniques for projection display and user interaction are briey reviewed and compared in an unifying approach. Advanced mapping algorithms, that focuse on improved data structure preservation, following laws of perception as given by Gestalt-theory, as well as advanced features of data visualisation and navigation are introduced. These methods help to exploit the remarkable human perceiptive and associative capabilities in man/computer dialog, e.g. for visual exploratory data analysis. 1. Introduction Projection of multivariate data by dimension reducing mapping and ensuing visualisation and interactive analysis is a topic of interest for more than three decades [1] [2]. Recently, there is strong renewed interest in this topic incented by, e.g. data-mining/dataware-house and knowledge discovery applications. To cope with todays ood of data from rapidly growing data bases and related computational resources, especially to discover salient structures and interesting correlations in data requires advanced methods of machine learning, pattern recognition, data analysis and visualisation. The remarkable ability of human observers to perceive clusters and correlations, and thus structure in data, is of great interest and can be well exploited by eective systems for data projection and interactive visualisation. Methods from various domains, e.g. pattern recognition and neural networks were developed and individually applied in conjunction with various visualisation techniques. A survey will be given in section 2 and Multivariate Data Projection signal- preserving linear methods Visualisation of multivariate data requires a dimension reduction to a two or three dimensional representation. Fig. 1 gives a taxonomy of state-of-the-art projection methods in a unied presentation. The focus of this paper is on unsupervised mapping procedures, that work without any a priori information. For demon- Multivariate Data Visualisation discriminance based nonlinear methods topology preserving discriminance based manual PCA scatter NLM Visor Triang. TOPAS Koontz und selection Factor Analysis matrices Method Fukunaga Backpropagation Network (BP) (Autoassociative) distance preserving Kohonen Feature Map BP (Discr.An.) Figure 1: Taxonomy of Projection Methods stration of some mapping properties, an articial data set, denoted as Cube-data will be used in the following. A cube with points only on eight edges of two opposite sides was generated. The cube was rotated by 45 o with regard to the coordinate axes. A mapping can most simply be achieved by a priori selection of two (or three) salient components or factors. However, context knowledge is required for such a selection. The simulataneous display of mul-
2 tiple pairwise plots of the data set is proposed in some tool kits. However, the combinatorial explosion ( M(M?1) 2 plots for M-dimensional data) limits practical application of this approach. A mapping can also be achieved by the rst two (three) principal components of Principal Component Analysis (PCA) [3] If the PCA-assumption of gaussion distribution is met and most variance is presented by the rst two principal components, then suitable plots can be achieved by this linear method [4]. Backpropagation networks Figure 3: U-Matrix principle (left) and plot for Cube-data (right) Figure 4: SOM component planes for cube data Figure 2: SOM trained with Cube-data: grid (left) and NLM-projection (right) have been applied in autoassociative mode and a bottleneck topology to achieve mappings comparable to PCA. With a ve layer topology a non-linear, signal preserving mapping is computed from the input to the middle layer. However, thus achieved projections are hard to interpret and strongly depend on other network parameters. The most promising mapping methods in terms of structure preservation use the criteria of either topology or distance preservation for the nonlinear mapping process. Kohonen's Self-Organzing Map (SOM) [5] is perhaps the most popular method for data visualisation. During training, the SOM unfolds in pattern space and creates a topology preserving mapping of the multivariate data on the xed neuron grid (or cube for 3D-SOM). Though this mapping, as given in Fig. 2, is interesting concerning neighborhood relations, no information is given on intra/inter-cluster distance. Researchers working on exploratory data analysis thus complemented the SOM with this distance information by a method denoted as Unied-Distance-Matrix (U- Matrix) [6]. The U-matrix method exploits the third dimension to plot interpoint distance information as a landscape on the SOM-grid, i.e. a mountain range implicates a large distance between clusters. Kohonen proposed Sammon's Non-Linear-Mapping [1] as means to include distance information in SOM visualisation (s. Fig. 2 right). In addition to the missing distance information, several other practical problems will be met using SOM displays. First, due to the quantisation carried out by SOM along with the topology preserving mapping the SOM is not suited for identifying the position of individual data vectors in the map visualisation. All data vectors falling in the voronoi cell of a certain SOM weight vector are represented by the same point on the SOM-grid. Second, SOM interpolation properties cause the placement of weight vectors in pattern space regions actually void of data vectors. Third, if the training data does not only possess a high absolute dimension but also a high intrinsic dimension (larger than two or three dimensions) [7], the SOM starts to fold and twist in the attempt to establish a mapping to the plane of the neuron grid. This can lead to the scattered representation of an intrinsically high-dimensional cluster all over the map and consequently to misinterpretations by human observers. The Growing-Cells of Fritzke [8] oer a remedy to the second SOM problem, however no improvement is offered for the other issues. The NLM [1] in contrast is a distance preserving nonlinear mapping. Interpoint distances dxij, and thus implicitly the data structure, shall be preserved in the mapping according to the cost function E(m): NX jx E(m) = 1 (dxij? dy ij(m)) 2 (1) c j=1 i=1 dxij q Pd Here dy ij(m) = q=1 (y iq(m)? yjq(m)) 2 denotes the distance of the respective data points in the visu-
3 alisation plane and dxij = q PM q=1 (v iq? vjq) 2 in the original data space and c = P N j=1 P j i=1 d Xij. Based on a gradient descent approach, the new coordinates of the N pivot vectors in the visualisation plane ~yi are determined by: yiq(m + 1) = yiq(m)? MF yiq(m) (2) with 2 iq(m) 2 and 0 < M F 1. It was shown that for large data sets the computational eort is considerable and that the gradient procedure does not always achieve an accurate projection [9]. Lee, Slaggle and Blum [10] presented a fast distance preserving mapping, that focuses on the exact preservation of only a limited number of distances (2(N?3)+3). For this mapping, the Minimal- Spanning-Tree (MST) of the data distance graph is computed. Then the MST is traversed and points are mapped by a triangulation method, based on the previously mapped MST-neighbors (s. Fig. 5). This Figure 5: MST and triangulation mapping achieves a fast and quite accurate projection. They also introduced the idea of a global reference point to be used in the triangulation step. However, MST computation has O(N 2 ) complexity. Thus, in prior work, a mapping algorithm with fast determination of global pivot points for the triangulation step was developed [11]. This algorithm has O(N) complexity and thus provides data projections with a very short response time. As shown by prior investigations with a quantitative mapping quality measure, achievable mapping quality is similar to the NLM [11] (s.fig 6). In com- P2 P1 P3 Figure 7: Four Iris NLM component cards parison with other mapping techniques, distance preserving mappings are esteemed as the most convenient and powerful alternative [4]. 3. Data Visualisation After data projection, the next step is visualisation and user interaction. Some examples of SOM visualisation have already been shown in Fig. 2, 3, and 4. Abstraction to other mapping techniques, e.g. NLM, LSB or Visor will be demonstrated here. In prior work, the WeightWatcher (WW), initially devised to analyse neural networks, was developed. WW oers numerous visualisation aids as, e.g. SOM/NLM display, component planes for SOM/NLM, Voronoi tesselation, SOM orthogonal mesh and weight icons (all component values are plotted contiguously at the projection point, e.g. as grey-value image block). Additional attribute display complements the projection points by class labels, pattern index or textual description in the plot. Further, navigation in the projection Figure 8: Actual local neighborhood display Figure 6: Two-dimensional projections of well known Iris data by NLM (left) and Visor (right) is supported by zoom and pan functions and an additional overview or navigation window. This feature allows to interactively explore the data from global to local aspects. Also, here is already a link to our recent work on more powerful, hierarchical mapping schemes that closely connect mapping and visualisa-
4 tion. The achieved mappings of NLM, LSB or Visor provide mappings that give a fair impression of the underlying global data structure. However, with growing intrinsic dimensionality, mapping faults occur and lead to distortions and twists in the 2D-representation. For accurate interpretation of displays WW provides a Actual local neighborhood display demonstrated in Fig. 8. The user can nd, traverse, and analyse the actual k-nearest neighbors of selected points (data entries) of interest. Thus, mapping faults can be identied and largely overcome by this interactive feature. Currently, search functions are implemented that direct navigation according to simple and in the future more complex search cues. 4. Advanced Projection Methods Advance from the state-of-the-art described in the last two sections is highly desirable concerning improved structure preservation of the mapping without impairing mapping speed. Also, concepts to cope with the often unavoidable mapping error caused by high intrinsic dimensionality are of interest. For instance, a mapper should aim on preserving and accurately map structures of perceptional relevance for the human operator, focusing on the current region of interest (ROI). The unavoidable mapping error should be relocated by the mapping algorithm between perceptionally relevant structures and out of the ROI. Linking mapping and visualisation more closely opens the way to improved data displays and more lucent analysis. For instance, the reference point in the LSBmethod can be interactively selected and a new projection on this ROI-like selection can be carried out in the advanced visualisation tool. In a similar multistep approach, the rapidly computed LSB or Visor mappings [11] give an idea of the basic data structure. Zooming in, using WW, either the rst, simple mapping is just scaled, or a more demanding mapping is started for the data subset enclosed in the current ROI. Higher complexity of this second mapping could be accepted, as the ROI typically comprises only a limited number of data entries. Following this idea, in prior work we developed an experimental mapping (TOpology Preserving mapping of sample Sets, TOPAS) to achieve mappings of high quality and reliability. In brief, this method is based on rank order evaluation of data vectors X in the original space and Y in the mapping space. If the respective rank positions in X- and Y-space are not identical, a correction is required to obtain the proper rank order. To achieve gradual corrections in the iterative mapping process, as in the NLM, distance information is used in the correction rule with and yij(t + 1) = yij(t) + yij(t) (3) yij(t) = p(t; Y (~yi)) [y ij? ypj] dy ip (4) dy ip = vu u t X d [yij? ypj] 2 (5) j=1 Similar to SOM, p(t; Y (~yi)) denotes a time and position dependent learning rate. The temporal factor decays with time. In addition, the regarded neighborhood shrinks as in SOM, so that the mapping focuses more and more on close neighbors. Fig. 9 shows some achieved results in comparison to NLM. Obvi- Figure 9: NLM and TOPAS mapping of Cube ously, TOPAS provides a much better structure preservation than NLM. However, besides a scaling problem associated with the current learning rule, the complexity of O(N 3 ) is a disadvantage of this algorithm. The salient features of TOPAS can be easily integrated in an Enhanced NLM (ENLM). Replacing Sammons Magic Factor MF, the basic correction rule is enhanced to yiq(m + 1) = yiq(m)? (m) yiq(m; (r( ~ Xi); m)) (6)
5 by a time and position dependent learn rate. For the neigborhood function (r( ~ Xi); m), a gaussian function with decaying (m) is chosen, so that after achieving a global arrangement, long distances dxij are more and more neglected in the mapping and a better ne tuning of local data structures is achieved. The idea of the reference point introduced in [10] can be incorporated in TOPAS and ENLM by adding an additional weighting factor in the correction rule: gr (dx ri ) = 1? d X ri? dx rimin dx rimax (7) By this factor, computed for each data vector before the actual mapping step, emphasis is on accurately preserving distances (rank order) close to the reference point, while accuracy of distance preservation may decay with growing displacement from the reference point. Another hierarchical approach, targeting on the preservation of perceptually relevant structures, is pursued in our current work. Data partitioning in clusters takes place as a rst step according to the work of Zahn [12]. This clustering process exploits simple laws from Gestalt theory. After this identication of perceptually relevant structures, mapping will take place in a hierarchical scheme, e.g. using variants of the ENLM or the presented triangulation techniques. 5. Conclusions and Future Work A review and a brief assessment of state-of-the-art data projection and visualisation techniques was given. Salient SOM visualisation techniques were generalized to distance preserving mappings. Advanced mapping techniques were introduced in conjunction with improved visualisation techniques and tools. The presented work based on the experience gathered with our PD SUN-tool-package ( koeniga) ~ that incorporates several mapping methods and the WW. In autumn 1998 our QuickCog-System on PC (Windows'95, NT) becomes commercially available which comprises most of the described mappers and the WW as part of an environment for image/signal processing and rapid cognitive systems design. Our new mapping algorithms and visualisation techniques will be implemented on this platform to be instantly available for applications, ranging from pattern recognition, medical data analysis to data base navigation tasks. References [1] J. W. Sammon. A Nonlinear Mapping for Data Structure Analysis. In IEEE Transactions on Computers C-18, No.5, pages 401{409, [2] J. W. Sammon. Interactive Pattern Analysis and Classication. In IEEE Transactions on Computers C-19, No.7, pages 594{616, [3] K. Fukunaga. Introduction to Statistical Pattern Recognition. ACADEMIC PRESS, INC. Harcourt Brace Jovanovich, Publishers Boston San Diego New York London Sydney Tokyo Toronto, [4] W. Siedlecki, K. Siedlecka, and J. Sklansky. An Overview of Mapping Techniques for Exploratory Pattern Analysis. In Pattern Recognition, Vol. 21, No.5, Pergammon Press plc, pages 411 { 429, [5] T. Kohonen. Self-Organization and Associative Memory. Springer Verlag Berlin Heidelberg London Paris Tokyo Hong Kong, [6] A. Ultsch and H. P. Siemon. Exploratory Data Analysis: Using Kohonen Networks on Transputers. In Interner Bericht Nr. 329 Universitat Dortmund, Dezember 1989, [7] K. Fukunaga and D. R. Olsen. An Algorithm for Finding Intrinsic Dimensionality of Data. In IEEE Transactions on Computers C-20, No.2, pages 176{183, [8] B. Fritzke. Growing Cell Structures { A Self- Organizing Network for Unsupervised and Supervised Learning. In Neural Networks, Vol. 7, No. 9, pages 1441{1460. Pergammon Press, [9] W. Dzwinel. How to make Sammon's Mapping useful for Multidimensional Data Structure Analysis. In Pattern Recognition, Vol. 27, No.7, Elsevier Science Ltd, pages 949 { 959, [10] R. C. T. Lee, J. R. Slagle, and H. Blum. A Triangulation Method for the Sequential Mapping of Points from N-Space to Two-Space. In IEEE Transactions on Computers C-26, pages 288{292, [11] A. Konig, O. Bulmahn, and M. Glesner. Systematic Methods for Multivariate Data Visualization and Numerical Assessment of Class Separability and Overlap in Automated Visual Industrial Quality Control. In Proceedings of the 5th British Machine Vision Conference BMVC'94, pages 195 { 204, September [12] C. T. Zahn. Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters. In IEEE Transactions on Computers, Vol. C-20, pages 68 { 86, 1971.
Cluster Analysis and Visualization. Workshop on Statistics and Machine Learning 2004/2/6
Cluster Analysis and Visualization Workshop on Statistics and Machine Learning 2004/2/6 Outlines Introduction Stages in Clustering Clustering Analysis and Visualization One/two-dimensional Data Histogram,
More informationCOMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS
COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS Toomas Kirt Supervisor: Leo Võhandu Tallinn Technical University Toomas.Kirt@mail.ee Abstract: Key words: For the visualisation
More informationDimension reduction : PCA and Clustering
Dimension reduction : PCA and Clustering By Hanne Jarmer Slides by Christopher Workman Center for Biological Sequence Analysis DTU The DNA Array Analysis Pipeline Array design Probe design Question Experimental
More informationNonlinear dimensionality reduction of large datasets for data exploration
Data Mining VII: Data, Text and Web Mining and their Business Applications 3 Nonlinear dimensionality reduction of large datasets for data exploration V. Tomenko & V. Popov Wessex Institute of Technology,
More informationESANN'2006 proceedings - European Symposium on Artificial Neural Networks Bruges (Belgium), April 2006, d-side publi., ISBN
ESANN'26 proceedings - European Symposium on Artificial Neural Networks Bruges (Belgium), 26-28 April 26, d-side publi., ISBN 2-9337-6-4. Visualizing the trustworthiness of a projection Michaël Aupetit
More informationGraph projection techniques for Self-Organizing Maps
Graph projection techniques for Self-Organizing Maps Georg Pölzlbauer 1, Andreas Rauber 1, Michael Dittenbach 2 1- Vienna University of Technology - Department of Software Technology Favoritenstr. 9 11
More informationNonlinear projections. Motivation. High-dimensional. data are. Perceptron) ) or RBFN. Multi-Layer. Example: : MLP (Multi(
Nonlinear projections Université catholique de Louvain (Belgium) Machine Learning Group http://www.dice.ucl ucl.ac.be/.ac.be/mlg/ 1 Motivation High-dimensional data are difficult to represent difficult
More informationCluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1
Cluster Analysis Mu-Chun Su Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Introduction Cluster analysis is the formal study of algorithms and methods
More informationDynamic Analysis of Structures Using Neural Networks
Dynamic Analysis of Structures Using Neural Networks Alireza Lavaei Academic member, Islamic Azad University, Boroujerd Branch, Iran Alireza Lohrasbi Academic member, Islamic Azad University, Boroujerd
More informationSelecting Models from Videos for Appearance-Based Face Recognition
Selecting Models from Videos for Appearance-Based Face Recognition Abdenour Hadid and Matti Pietikäinen Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P.O.
More informationWeek 7 Picturing Network. Vahe and Bethany
Week 7 Picturing Network Vahe and Bethany Freeman (2005) - Graphic Techniques for Exploring Social Network Data The two main goals of analyzing social network data are identification of cohesive groups
More informationImages Reconstruction using an iterative SOM based algorithm.
Images Reconstruction using an iterative SOM based algorithm. M.Jouini 1, S.Thiria 2 and M.Crépon 3 * 1- LOCEAN, MMSA team, CNAM University, Paris, France 2- LOCEAN, MMSA team, UVSQ University Paris, France
More informationCSE 6242 A / CS 4803 DVA. Feb 12, Dimension Reduction. Guest Lecturer: Jaegul Choo
CSE 6242 A / CS 4803 DVA Feb 12, 2013 Dimension Reduction Guest Lecturer: Jaegul Choo CSE 6242 A / CS 4803 DVA Feb 12, 2013 Dimension Reduction Guest Lecturer: Jaegul Choo Data is Too Big To Do Something..
More informationLocality Preserving Projections (LPP) Abstract
Locality Preserving Projections (LPP) Xiaofei He Partha Niyogi Computer Science Department Computer Science Department The University of Chicago The University of Chicago Chicago, IL 60615 Chicago, IL
More informationModular network SOM : Theory, algorithm and applications
Modular network SOM : Theory, algorithm and applications Kazuhiro Tokunaga and Tetsuo Furukawa Kyushu Institute of Technology, Kitakyushu 88-96, Japan {tokunaga, furukawa}@brain.kyutech.ac.jp Abstract.
More informationRoad Sign Visualization with Principal Component Analysis and Emergent Self-Organizing Map
Road Sign Visualization with Principal Component Analysis and Emergent Self-Organizing Map H6429: Computational Intelligence, Method and Applications Assignment One report Written By Nguwi Yok Yen (nguw0001@ntu.edu.sg)
More informationStability Assessment of Electric Power Systems using Growing Neural Gas and Self-Organizing Maps
Stability Assessment of Electric Power Systems using Growing Gas and Self-Organizing Maps Christian Rehtanz, Carsten Leder University of Dortmund, 44221 Dortmund, Germany Abstract. Liberalized competitive
More informationUnsupervised Learning
Unsupervised Learning Unsupervised learning Until now, we have assumed our training samples are labeled by their category membership. Methods that use labeled samples are said to be supervised. However,
More informationTexture Image Segmentation using FCM
Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Texture Image Segmentation using FCM Kanchan S. Deshmukh + M.G.M
More informationContextual priming for artificial visual perception
Contextual priming for artificial visual perception Hervé Guillaume 1, Nathalie Denquive 1, Philippe Tarroux 1,2 1 LIMSI-CNRS BP 133 F-91403 Orsay cedex France 2 ENS 45 rue d Ulm F-75230 Paris cedex 05
More informationt 1 y(x;w) x 2 t 2 t 3 x 1
Neural Computing Research Group Dept of Computer Science & Applied Mathematics Aston University Birmingham B4 7ET United Kingdom Tel: +44 (0)121 333 4631 Fax: +44 (0)121 333 4586 http://www.ncrg.aston.ac.uk/
More informationData Mining Chapter 9: Descriptive Modeling Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University
Data Mining Chapter 9: Descriptive Modeling Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Descriptive model A descriptive model presents the main features of the data
More informationImage Coding with Active Appearance Models
Image Coding with Active Appearance Models Simon Baker, Iain Matthews, and Jeff Schneider CMU-RI-TR-03-13 The Robotics Institute Carnegie Mellon University Abstract Image coding is the task of representing
More informationMTTTS17 Dimensionality Reduction and Visualization. Spring 2018 Jaakko Peltonen. Lecture 11: Neighbor Embedding Methods continued
MTTTS17 Dimensionality Reduction and Visualization Spring 2018 Jaakko Peltonen Lecture 11: Neighbor Embedding Methods continued This Lecture Neighbor embedding by generative modeling Some supervised neighbor
More information3. Multidimensional Information Visualization II Concepts for visualizing univariate to hypervariate data
3. Multidimensional Information Visualization II Concepts for visualizing univariate to hypervariate data Vorlesung Informationsvisualisierung Prof. Dr. Andreas Butz, WS 2009/10 Konzept und Basis für n:
More informationLocality Preserving Projections (LPP) Abstract
Locality Preserving Projections (LPP) Xiaofei He Partha Niyogi Computer Science Department Computer Science Department The University of Chicago The University of Chicago Chicago, IL 60615 Chicago, IL
More informationAdvanced visualization techniques for Self-Organizing Maps with graph-based methods
Advanced visualization techniques for Self-Organizing Maps with graph-based methods Georg Pölzlbauer 1, Andreas Rauber 1, and Michael Dittenbach 2 1 Department of Software Technology Vienna University
More informationLecture Topic Projects
Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis techniques for non-spatial data Project #1 out 4 Data
More informationMachine Learning : Clustering, Self-Organizing Maps
Machine Learning Clustering, Self-Organizing Maps 12/12/2013 Machine Learning : Clustering, Self-Organizing Maps Clustering The task: partition a set of objects into meaningful subsets (clusters). The
More informationMineral Exploation Using Neural Netowrks
ABSTRACT I S S N 2277-3061 Mineral Exploation Using Neural Netowrks Aysar A. Abdulrahman University of Sulaimani, Computer Science, Kurdistan Region of Iraq aysser.abdulrahman@univsul.edu.iq Establishing
More informationSELECTION OF THE OPTIMAL PARAMETER VALUE FOR THE LOCALLY LINEAR EMBEDDING ALGORITHM. Olga Kouropteva, Oleg Okun and Matti Pietikäinen
SELECTION OF THE OPTIMAL PARAMETER VALUE FOR THE LOCALLY LINEAR EMBEDDING ALGORITHM Olga Kouropteva, Oleg Okun and Matti Pietikäinen Machine Vision Group, Infotech Oulu and Department of Electrical and
More information5.6 Self-organizing maps (SOM) [Book, Sect. 10.3]
Ch.5 Classification and Clustering 5.6 Self-organizing maps (SOM) [Book, Sect. 10.3] The self-organizing map (SOM) method, introduced by Kohonen (1982, 2001), approximates a dataset in multidimensional
More informationRelative Constraints as Features
Relative Constraints as Features Piotr Lasek 1 and Krzysztof Lasek 2 1 Chair of Computer Science, University of Rzeszow, ul. Prof. Pigonia 1, 35-510 Rzeszow, Poland, lasek@ur.edu.pl 2 Institute of Computer
More informationControlling the spread of dynamic self-organising maps
Neural Comput & Applic (2004) 13: 168 174 DOI 10.1007/s00521-004-0419-y ORIGINAL ARTICLE L. D. Alahakoon Controlling the spread of dynamic self-organising maps Received: 7 April 2004 / Accepted: 20 April
More informationUnsupervised learning
Unsupervised learning Enrique Muñoz Ballester Dipartimento di Informatica via Bramante 65, 26013 Crema (CR), Italy enrique.munoz@unimi.it Enrique Muñoz Ballester 2017 1 Download slides data and scripts:
More informationFunction approximation using RBF network. 10 basis functions and 25 data points.
1 Function approximation using RBF network F (x j ) = m 1 w i ϕ( x j t i ) i=1 j = 1... N, m 1 = 10, N = 25 10 basis functions and 25 data points. Basis function centers are plotted with circles and data
More informationData Compression. The Encoder and PCA
Data Compression The Encoder and PCA Neural network techniques have been shown useful in the area of data compression. In general, data compression can be lossless compression or lossy compression. In
More informationManifold Learning for Video-to-Video Face Recognition
Manifold Learning for Video-to-Video Face Recognition Abstract. We look in this work at the problem of video-based face recognition in which both training and test sets are video sequences, and propose
More informationGradient visualization of grouped component planes on the SOM lattice
Gradient visualization of grouped component planes on the SOM lattice Gradient visualization of grouped component planes on the SOM lattice Georg Pölzlbauer 1, Michael Dittenbach 2, Andreas Rauber 1 1
More informationUnsupervised Learning
Networks for Pattern Recognition, 2014 Networks for Single Linkage K-Means Soft DBSCAN PCA Networks for Kohonen Maps Linear Vector Quantization Networks for Problems/Approaches in Machine Learning Supervised
More informationABSTRACT. Keywords: visual training, unsupervised learning, lumber inspection, projection 1. INTRODUCTION
Comparison of Dimensionality Reduction Methods for Wood Surface Inspection Matti Niskanen and Olli Silvén Machine Vision Group, Infotech Oulu, University of Oulu, Finland ABSTRACT Dimensionality reduction
More informationPerformance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM
Performance Degradation Assessment and Fault Diagnosis of Bearing Based on EMD and PCA-SOM Lu Chen and Yuan Hang PERFORMANCE DEGRADATION ASSESSMENT AND FAULT DIAGNOSIS OF BEARING BASED ON EMD AND PCA-SOM.
More informationSOM+EOF for Finding Missing Values
SOM+EOF for Finding Missing Values Antti Sorjamaa 1, Paul Merlin 2, Bertrand Maillet 2 and Amaury Lendasse 1 1- Helsinki University of Technology - CIS P.O. Box 5400, 02015 HUT - Finland 2- Variances and
More informationCIE L*a*b* color model
CIE L*a*b* color model To further strengthen the correlation between the color model and human perception, we apply the following non-linear transformation: with where (X n,y n,z n ) are the tristimulus
More information/00/$10.00 (C) 2000 IEEE
A SOM based cluster visualization and its application for false coloring Johan Himberg Helsinki University of Technology Laboratory of Computer and Information Science P.O. Box 54, FIN-215 HUT, Finland
More informationFast Associative Memory
Fast Associative Memory Ricardo Miguel Matos Vieira Instituto Superior Técnico ricardo.vieira@tagus.ist.utl.pt ABSTRACT The associative memory concept presents important advantages over the more common
More informationSelf-Organizing Feature Map. Kazuhiro MINAMIMOTO Kazushi IKEDA Kenji NAKAYAMA
Topology Analysis of Data Space Using Self-Organizing Feature Map Kazuhiro MINAMIMOTO Kazushi IKEDA Kenji NAKAYAMA Department of Electrical and Computer Eng., Faculty of Eng., Kanazawa Univ. 2-4-2, Kodatsuno,
More informationTreeGNG - Hierarchical Topological Clustering
TreeGNG - Hierarchical Topological lustering K..J.Doherty,.G.dams, N.Davey Department of omputer Science, University of Hertfordshire, Hatfield, Hertfordshire, L10 9, United Kingdom {K..J.Doherty,.G.dams,
More informationSimilarity Image Retrieval System Using Hierarchical Classification
Similarity Image Retrieval System Using Hierarchical Classification Experimental System on Mobile Internet with Cellular Phone Masahiro Tada 1, Toshikazu Kato 1, and Isao Shinohara 2 1 Department of Industrial
More informationPoints Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked
Plotting Menu: QCExpert Plotting Module graphs offers various tools for visualization of uni- and multivariate data. Settings and options in different types of graphs allow for modifications and customizations
More informationA Comparative Study of Conventional and Neural Network Classification of Multispectral Data
A Comparative Study of Conventional and Neural Network Classification of Multispectral Data B.Solaiman & M.C.Mouchot Ecole Nationale Supérieure des Télécommunications de Bretagne B.P. 832, 29285 BREST
More informationNon-Bayesian Classifiers Part I: k-nearest Neighbor Classifier and Distance Functions
Non-Bayesian Classifiers Part I: k-nearest Neighbor Classifier and Distance Functions Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Fall 2017 CS 551,
More informationProcess. Measurement vector (Feature vector) Map training and labeling. Self-Organizing Map. Input measurements 4. Output measurements.
Analysis of Complex Systems using the Self-Organizing Map Esa Alhoniemi, Olli Simula and Juha Vesanto Helsinki University of Technology Laboratory of Computer and Information Science P.O. Box 2200, FIN-02015
More informationPATTERN CLASSIFICATION AND SCENE ANALYSIS
PATTERN CLASSIFICATION AND SCENE ANALYSIS RICHARD O. DUDA PETER E. HART Stanford Research Institute, Menlo Park, California A WILEY-INTERSCIENCE PUBLICATION JOHN WILEY & SONS New York Chichester Brisbane
More informationFacial Expression Classification with Random Filters Feature Extraction
Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle
More informationSelf-Organizing Maps for cyclic and unbounded graphs
Self-Organizing Maps for cyclic and unbounded graphs M. Hagenbuchner 1, A. Sperduti 2, A.C. Tsoi 3 1- University of Wollongong, Wollongong, Australia. 2- University of Padova, Padova, Italy. 3- Hong Kong
More informationOBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT
OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT Joost Broekens Tim Cocx Walter A. Kosters Leiden Institute of Advanced Computer Science Leiden University, The Netherlands Email: {broekens,
More informationTexture Classification by Combining Local Binary Pattern Features and a Self-Organizing Map
Texture Classification by Combining Local Binary Pattern Features and a Self-Organizing Map Markus Turtinen, Topi Mäenpää, and Matti Pietikäinen Machine Vision Group, P.O.Box 4500, FIN-90014 University
More informationComparison of supervised self-organizing maps using Euclidian or Mahalanobis distance in classification context
6 th. International Work Conference on Artificial and Natural Neural Networks (IWANN2001), Granada, June 13-15 2001 Comparison of supervised self-organizing maps using Euclidian or Mahalanobis distance
More informationClustering and Visualisation of Data
Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some
More informationArtificial Neural Networks Unsupervised learning: SOM
Artificial Neural Networks Unsupervised learning: SOM 01001110 01100101 01110101 01110010 01101111 01101110 01101111 01110110 01100001 00100000 01110011 01101011 01110101 01110000 01101001 01101110 01100001
More informationPattern Clustering with Similarity Measures
Pattern Clustering with Similarity Measures Akula Ratna Babu 1, Miriyala Markandeyulu 2, Bussa V R R Nagarjuna 3 1 Pursuing M.Tech(CSE), Vignan s Lara Institute of Technology and Science, Vadlamudi, Guntur,
More informationA Topography-Preserving Latent Variable Model with Learning Metrics
A Topography-Preserving Latent Variable Model with Learning Metrics Samuel Kaski and Janne Sinkkonen Helsinki University of Technology Neural Networks Research Centre P.O. Box 5400, FIN-02015 HUT, Finland
More informationHartmut Pohlheim. DaimlerChrysler AG, Research and Technology Alt-Moabit 96a, Berlin, Germany.
Multidimensional Scaling for Evolutionary Algorithms - Visualization of the Path through Search Space and Solution Space using SAMMON Mapping Hartmut Pohlheim DaimlerChrysler AG, Research and Technology
More informationUnsupervised Learning
Harvard-MIT Division of Health Sciences and Technology HST.951J: Medical Decision Support, Fall 2005 Instructors: Professor Lucila Ohno-Machado and Professor Staal Vinterbo 6.873/HST.951 Medical Decision
More informationMachine Learning Methods in Visualisation for Big Data 2018
Machine Learning Methods in Visualisation for Big Data 2018 Daniel Archambault1 Ian Nabney2 Jaakko Peltonen3 1 Swansea University 2 University of Bristol 3 University of Tampere, Aalto University Evaluating
More informationESANN'2001 proceedings - European Symposium on Artificial Neural Networks Bruges (Belgium), April 2001, D-Facto public., ISBN ,
Interpretation and Comparison of Multidimensional Data Partitions Esa Alhoniemi and Olli Simula Neural Networks Research Centre Helsinki University of Technology P. O.Box 5400 FIN-02015 HUT, Finland esa.alhoniemi@hut.fi
More informationCSE 6242 A / CX 4242 DVA. March 6, Dimension Reduction. Guest Lecturer: Jaegul Choo
CSE 6242 A / CX 4242 DVA March 6, 2014 Dimension Reduction Guest Lecturer: Jaegul Choo Data is Too Big To Analyze! Limited memory size! Data may not be fitted to the memory of your machine! Slow computation!
More informationECG782: Multidimensional Digital Signal Processing
ECG782: Multidimensional Digital Signal Processing Object Recognition http://www.ee.unlv.edu/~b1morris/ecg782/ 2 Outline Knowledge Representation Statistical Pattern Recognition Neural Networks Boosting
More informationScalar Visualization
Scalar Visualization Visualizing scalar data Popular scalar visualization techniques Color mapping Contouring Height plots outline Recap of Chap 4: Visualization Pipeline 1. Data Importing 2. Data Filtering
More informationGrowing Neural Gas A Parallel Approach
Growing Neural Gas A Parallel Approach Lukáš Vojáček 1 and JiříDvorský 2 1 IT4Innovations Centre of Excellence Ostrava, Czech Republic lukas.vojacek@vsb.cz 2 Department of Computer Science, VŠB Technical
More informationPreface to the Second Edition. Preface to the First Edition. 1 Introduction 1
Preface to the Second Edition Preface to the First Edition vii xi 1 Introduction 1 2 Overview of Supervised Learning 9 2.1 Introduction... 9 2.2 Variable Types and Terminology... 9 2.3 Two Simple Approaches
More informationThree-Dimensional Computer Vision
\bshiaki Shirai Three-Dimensional Computer Vision With 313 Figures ' Springer-Verlag Berlin Heidelberg New York London Paris Tokyo Table of Contents 1 Introduction 1 1.1 Three-Dimensional Computer Vision
More informationCurvilinear Distance Analysis versus Isomap
Curvilinear Distance Analysis versus Isomap John Aldo Lee, Amaury Lendasse, Michel Verleysen Université catholique de Louvain Place du Levant, 3, B-1348 Louvain-la-Neuve, Belgium {lee,verleysen}@dice.ucl.ac.be,
More informationoutput vector controller y s... y 2 y 1
A Neuro-Fuzzy Solution for Fine-Motion Control Based on Vision and Force Sensors Yorck von Collani, Jianwei Zhang, Alois Knoll Technical Computer Science, Faculty of Technology, University of Bielefeld,
More informationAlgorithm That Mimics Human Perceptual Grouping of Dot Patterns
Algorithm That Mimics Human Perceptual Grouping of Dot Patterns G. Papari and N. Petkov Institute of Mathematics and Computing Science, University of Groningen, P.O.Box 800, 9700 AV Groningen, The Netherlands
More informationVisualizing the quality of dimensionality reduction
Visualizing the quality of dimensionality reduction Bassam Mokbel 1, Wouter Lueks 2, Andrej Gisbrecht 1, Michael Biehl 2, Barbara Hammer 1 1) Bielefeld University - CITEC Centre of Excellence, Germany
More informationData mining with sparse grids
Data mining with sparse grids Jochen Garcke and Michael Griebel Institut für Angewandte Mathematik Universität Bonn Data mining with sparse grids p.1/40 Overview What is Data mining? Regularization networks
More informationEstimating the Intrinsic Dimensionality of. Jorg Bruske, Erzsebet Merenyi y
Estimating the Intrinsic Dimensionality of Hyperspectral Images Jorg Bruske, Erzsebet Merenyi y Abstract. Estimating the intrinsic dimensionality (ID) of an intrinsically low (d-) dimensional data set
More informationEvgeny Maksakov Advantages and disadvantages: Advantages and disadvantages: Advantages and disadvantages: Advantages and disadvantages:
Today Problems with visualizing high dimensional data Problem Overview Direct Visualization Approaches High dimensionality Visual cluttering Clarity of representation Visualization is time consuming Dimensional
More informationClassifier C-Net. 2D Projected Images of 3D Objects. 2D Projected Images of 3D Objects. Model I. Model II
Advances in Neural Information Processing Systems 7. (99) The MIT Press, Cambridge, MA. pp.949-96 Unsupervised Classication of 3D Objects from D Views Satoshi Suzuki Hiroshi Ando ATR Human Information
More informationHigh throughput Data Analysis 2. Cluster Analysis
High throughput Data Analysis 2 Cluster Analysis Overview Why clustering? Hierarchical clustering K means clustering Issues with above two Other methods Quality of clustering results Introduction WHY DO
More informationComparing Self-Organizing Maps Samuel Kaski and Krista Lagus Helsinki University of Technology Neural Networks Research Centre Rakentajanaukio 2C, FIN
Kaski, S. and Lagus, K. (1996) Comparing Self-Organizing Maps. In C. von der Malsburg, W. von Seelen, J. C. Vorbruggen, and B. Sendho (Eds.) Proceedings of ICANN96, International Conference on Articial
More informationRobust Pose Estimation using the SwissRanger SR-3000 Camera
Robust Pose Estimation using the SwissRanger SR- Camera Sigurjón Árni Guðmundsson, Rasmus Larsen and Bjarne K. Ersbøll Technical University of Denmark, Informatics and Mathematical Modelling. Building,
More informationVisual object classification by sparse convolutional neural networks
Visual object classification by sparse convolutional neural networks Alexander Gepperth 1 1- Ruhr-Universität Bochum - Institute for Neural Dynamics Universitätsstraße 150, 44801 Bochum - Germany Abstract.
More informationData Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University
Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Exploratory data analysis tasks Examine the data, in search of structures
More informationClustering with Reinforcement Learning
Clustering with Reinforcement Learning Wesam Barbakh and Colin Fyfe, The University of Paisley, Scotland. email:wesam.barbakh,colin.fyfe@paisley.ac.uk Abstract We show how a previously derived method of
More informationTopological Correlation
Topological Correlation K.A.J. Doherty, R.G. Adams and and N. Davey University of Hertfordshire, Department of Computer Science College Lane, Hatfield, Hertfordshire, UK Abstract. Quantifying the success
More informationPreface Here come the golden words Place, Month 1999 First Name Hagen First Name Rodrian
Hagen Rodrian (Eds.) Please write here your book title Springer Internal Project number if known January 12, 1999 Springer-Verlag Berlin Heidelberg New York London Paris Tokyo Hong Kong Barcelona Budapest
More informationUnsupervised learning in Vision
Chapter 7 Unsupervised learning in Vision The fields of Computer Vision and Machine Learning complement each other in a very natural way: the aim of the former is to extract useful information from visual
More informationSlide07 Haykin Chapter 9: Self-Organizing Maps
Slide07 Haykin Chapter 9: Self-Organizing Maps CPSC 636-600 Instructor: Yoonsuck Choe Spring 2012 Introduction Self-organizing maps (SOM) is based on competitive learning, where output neurons compete
More informationINTERNATIONAL CONFERENCE ON ENGINEERING DESIGN ICED 05 MELBOURNE, AUGUST 15-18, 2005
INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN ICED MELBOURNE, AUGUST -, METHOD USING A SELF-ORGANISING MAP FOR DRIVER CLASSIFI- CATION AS A PRECONDITION FOR CUSTOMER ORIENTED DESIGN Albert Albers and
More informationChemometrics. Description of Pirouette Algorithms. Technical Note. Abstract
19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms
More informationCHAPTER 6 PERCEPTUAL ORGANIZATION BASED ON TEMPORAL DYNAMICS
CHAPTER 6 PERCEPTUAL ORGANIZATION BASED ON TEMPORAL DYNAMICS This chapter presents a computational model for perceptual organization. A figure-ground segregation network is proposed based on a novel boundary
More informationSOMSN: An Effective Self Organizing Map for Clustering of Social Networks
SOMSN: An Effective Self Organizing Map for Clustering of Social Networks Fatemeh Ghaemmaghami Research Scholar, CSE and IT Dept. Shiraz University, Shiraz, Iran Reza Manouchehri Sarhadi Research Scholar,
More informationView Reconstruction by Linear Combination of Sample Views
in: Proceedings of the 12th British Machine Vision Conference (BMVC:2001), Vol. 1, pp. 223-232, Manchester, UK, 2001. View Reconstruction by Linear Combination of Sample Views Gabriele Peters & Christoph
More informationSequential Nonlinear Mapping versus Simultaneous One
INFORMATICA, 2002, Vol. 13, No. 3, 333 344 333 2002 Institute of Mathematics and Informatics, Vilnius Sequential Nonlinear Mapping versus Simultaneous One Algirdas Mykolas MONTVILAS Institute of Mathematics
More informationData Preprocessing. Javier Béjar. URL - Spring 2018 CS - MAI 1/78 BY: $\
Data Preprocessing Javier Béjar BY: $\ URL - Spring 2018 C CS - MAI 1/78 Introduction Data representation Unstructured datasets: Examples described by a flat set of attributes: attribute-value matrix Structured
More informationUsing Decision Boundary to Analyze Classifiers
Using Decision Boundary to Analyze Classifiers Zhiyong Yan Congfu Xu College of Computer Science, Zhejiang University, Hangzhou, China yanzhiyong@zju.edu.cn Abstract In this paper we propose to use decision
More informationPATTERN RECOGNITION USING NEURAL NETWORKS
PATTERN RECOGNITION USING NEURAL NETWORKS Santaji Ghorpade 1, Jayshree Ghorpade 2 and Shamla Mantri 3 1 Department of Information Technology Engineering, Pune University, India santaji_11jan@yahoo.co.in,
More information