Projection Techniques. Problems PCA. Y R p={1,2,3} X R m. Multi-dimensional Visualization based on Bidimensional Mapping

Size: px
Start display at page:

Download "Projection Techniques. Problems PCA. Y R p={1,2,3} X R m. Multi-dimensional Visualization based on Bidimensional Mapping"

Transcription

1 Projection Techniques Multi-dimensional Visualization based on Bidimensional Mapping X R m f Y R p={1,2,3} Rosane Minghim Instituto de Ciências Matemáticas e de Computação USP-São Carlos 1 : x i, x j R, x i,x j X d: y i, y j R, y i,y j Y f: X Y, (x i,x j ) d(f(x i ), f(x j )) 0, x i,x j X 2 Ex: Mapping to plane of patents surgery, drugs, molecular bio Problems PCA 390 dimensions 3 4 1

2 Problems PCA Problems PCA 5 6 Ex: Sammon Mapping Force Based Point Placement Let X be the points in the original space R n, we apply a distance measure d ij * between Xi an Xj., and find Y, the projected point, ex. R 2 and d ij the Euclidean distance between them. Sammon s method applies an error function to measure the target

3 Force Scheme [Tejada et al., 2003] Force Based Point Placement q' x' 9 10 Force Scheme [Tejada et al., 2003] q' Force Scheme [Tejada et al., 2003] q' x' x'

4 Force Scheme [Tejada et al., 2003] Force Scheme [Tejada et al., 2003] < 0 q' < 0 < 0 x' x' > 0 < 0 > LSP: Laplacian Matrix Let V i = {p i1,,p iki } be a neighborhood of a point p i and let c i be the coordinates of p i em R p 1 c 0 i c j ki p j V i Each p i is the centroid of points in V i p i4 p i5 p i p i3 p i1 p i2 LSP: Laplacian Matrix Lx 1 =0, Lx 2 =0,, Lx p =0 Where x 1, x 2,, x p are vectors containing the coordinates of the points and L is the matrix given by: x 1 1 i j 0 x2 1 Lij p j Vi L = ki 0 otherwise x n

5 LSP: Adicionando os Pontos de Controle L 1 A Cij C 0 p j is otherwise a control point LSP: Overview Escolher os Pontos pontos de em R m controle 0 bi x p c i i n n i n nc L x1 x2 x n c1 c 2 Projetar os pontos de controle Determinar a vizinhança dos pontos Resolver um sistema linear esparso Pontos em R p Choosing the Control Points In order to select the control points the space R m is split into nc clusters using k-medoids. the control points are the medoids of each cluster Choosing the Control Points Once the control points are chosen, these points are projected onto R d through a fast dimensionality reduction method Fast Projection (Fastmap or NNP) Force Placement

6 Content based by Projections Control points in blue Example Projection Example: IDH

7 Projection Example: voting in US Senate Point Placement by Phylogenetic Tree Construction Algorithms (N-J Trees) Point Placement by Phylogenetic Tree Construction Algorithms (N-J Trees) NJ similarity Tree

8 Alternate view (N-J Tree) Exploration Exploration Finding Relationships

9 Building a Surface RSS News Flash Palestinian Bird and Flu

10 Bush Iraq Bush Iraq Application 1: Visual Text Mapping Relationships : Topic Bursts and co-word Approach 1: Relationship Based (Metadata) (Mane and Borner) 2004 Approach 2: Content based

11 Relationships : Citation and Co-citation Content-based Text Mapping Approach 1: Pre-clustering & View (Borner) (2003) Approach 2: Dimension reduction (Projections) Content - based Content - based (Skupin) (2002) (abstracts) SOM (Dimensional Reduction) News flash IN-SPIRE (PNL)

12 Content - based SOM based Self-Organization Maps (SOMs) cartográficos (ex. Skurpin 2002) (Surface View) IN-SPIRE Mapeamento para o plano permitindo a exploração. Ex: Patents surgery, drugs, molecular bio Exemplos de Mapas

13 Exemplos de Mapas Detailing topics

14 Time Series Streamflow in Hidroelectrics Text from attributes Cattle performance data Translated to text from categorical information, e.g., Ranges of weight to words such as: {weight_below_fifty_percent; weight_between_fifty_seventy_five; etc..} 9135 individuals Cattle performance data Cattle performance data Colored by word top

15 Cattle performance data Cattle performance data Colored by female Colored by farm Images? Pipeline Image Data Set Feature Acquisition Feature Selection Interaction Classification Visualization Similarity Calculation

16 PEx-Image Sample Content PEx-Image Group Content PEx-Image Image as Visual Mark PEx-Image Coordination

17 PEx-Image Coordination Comparison of Distance Metrics Euclidean City Block Cosine 512 MRI medical images 12 classes Comparison of Distance Metrics Euclidean City Block Cosine Comparison of Feature Space (1) 16 Gabor Filters Fourier, Mean and Deviation 72 co-ocurrence matrices All combined 512 MRI medical images 12 classes 512 MRI medical images 12 classes

18 Comparison of Feature Space (1) 16 Gabor Filters Fourier, Mean and Deviation 72 co-ocurrence matrices All combined Comparison of Feature Space (2) All combined 1024 Wavelet Features 512 MRI medical images 12 classes X-Ray images from ImageCLEF 116 classes 70 Comparison of Feature Space (2) Detailed Inspection All combined 1024 Wavelet Features 1000 X-Ray images from ImageCLEF 116 classes

19 Detailed Inspection ImageCLEF Training Data Set (1) X-Ray images 116 classes 74 ImageCLEF Training Data Set (2) Class 108 Class 111 Further Examples on Text RSS Patent Data, recovered from the Web Case 1: 170 files Graphics processing, printer, database, document, ai

20 Further Examples Further Examples Further Examples (ink jet, document)

21 81 82 Patents case 2 Patents surgery, drugs, molecular bio files surgery (2), drugs(2), molecular biology

22 Patents surgery, drugs, molecular bio stopwords selection Patents surgery, drugs, molecular bio topics Patents surgery, drugs, molecular bio Patents surgery, drugs, molecular bio

23 Projection Explorer (PEx) Collaborators Alneu de Andrade Lopes Mineração de textos Haim Levkowitz Visualization João E. S. Batista Neto Imaging Visualization Group Maria Cristina F. Oliveira Fernando Vieira Paulovich Visualização/Projeções Rosane Minghim Luis Gustavo Nonato malhas Doutorandos Danilo Medeiros Eler Aretha Barbosa Kátia Felizardo Mestrandos Jorge Poco Medina Christian Tácito Neves Renato Oliveira Gabrial Andery

24 Other Partnerships Link Sérgio Furuie (Poli USP), Brazil infoserver.lcad.icmc.usp.br (Pex, Pex-WEB, Pex-Temporal, Pex-Image). Lars Linsen (Jacobs University Bremen), Germany Charl Botha (TU Delft ); Anton Heijs (Treparel Inc.), The Netherlands Referências Cuadros, A. M, Paulovich, F. V., Minghim, R., Telles, G. P - Point Placement by Phylogenetic Trees and its Application to Visual Analysis of Document Collections IEEE VAST 2007, Sacramento, CA, USA, IEEE CS Press, pp Paulovih, F. V., Oliveira, M.C.F., Minghim, R. - The Projection Explorer: A Flexible Tool for Projection-based Multidimensional Visualization, IEEE Sibgrapi 2007, IEEE CS Press, Belo Horizonte, Brazil,pp Lopes, A. A., Minghim, R., Melo, V., Paulovich, F.V.; Mapping texts through dimensionality reduction and visualization techniques for interactive exploration of document collections, SPIE Conference on Visualization and Data Analysis, San Jose, CA, USA Jan. 2006, 6060T-11. Minghim, R., Paulovich, F.V., Lopes, A. A.; Content-based text mapping using multidimensional projections for exploration of document collections, SPIE Conference on Visualization and Data Analysis, San Jose, CA, USA Jan. 2006, 6060T-11. Referências Pinho, R. D. ; Oliveira, M. C. F. ; Minghim, R. ; Andrade, M. G.. Voromap: A Voronoi-based Tool for Visual Exploration of Multidimensional Data. In: 10th International Conference on Information Visualization, 2006, Londres. Proceedings of Information Visualisation 2006, v. 1. p Paulovich, F. V. ; Minghim, R.. Text Map Explorer: a Tool to Create and Explore Document Maps. In: Information Visualisation 2006 (IV06) 10th International Conference on Information Visualisation, 2006, Londres. Proceedings of Information Visualisation 2006, v. 1. p Paulovich, F. V. ; Nonato, L. G. ; MINGHIM, R. ; Levkowitz, H.. Least Square Projection: a fast high precision multidimensional projection technique and its application to document mapping. IEEE Transactions on Visualization and Computer Graphics, Minghim, R. ; Levkowitz, H. ; Nonato, L. G. ; Watanabe, L. S. ; Salvador, V. C. L. ; Lopes, H. ; Pesco, S. ; Tavares, G.. Spider Cursor: A simple versatile interaction tool for data visualization and exploration. In: ACM GRAPHITE'05-3rd International Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia, 2005, Dunedin. Proceedings of Graphite 2005, p

Rosane Minghim + The team. Instituto de Ciências Matemáticas e de Computação USP-São Carlos

Rosane Minghim + The team. Instituto de Ciências Matemáticas e de Computação USP-São Carlos Rosane Minghim + The team Instituto de Ciências Matemáticas e de Computação USP-São Carlos Applications Visual Exploration of Image Retrieval results Photograph Visual exploration Support for development

More information

FEATURE SPACE UNIDIMENSIONAL PROJECTIONS FOR SCATTERPLOTS

FEATURE SPACE UNIDIMENSIONAL PROJECTIONS FOR SCATTERPLOTS 58 FEATURE SPACE UNIDIMENSIONAL PROJECTIONS FOR SCATTERPLOTS PROJEÇÕES DE ESPAÇOS DE CARACTERÍSTICAS UNIDIMENSIONAIS PARA GRÁFICOS DE DISPERSÃO Danilo Medeiros Eler 1 ; Alex C. de Almeida 2 ; Jaqueline

More information

On improved projection techniques to support visual exploration of multi-dimensional data sets

On improved projection techniques to support visual exploration of multi-dimensional data sets On improved projection techniques to support visual exploration of multi-dimensional data sets Eduardo Tejada-Gamero, Rosane Minghim and Luis Gustavo Nonato {eduardo,rminghim,gnonato}@icmc.usp.br High

More information

Mapping Texts Through Dimensionality Reduction and Visualization Techniques for Interactive Exploration of Document Collections

Mapping Texts Through Dimensionality Reduction and Visualization Techniques for Interactive Exploration of Document Collections Mapping Texts Through Dimensionality Reduction and Visualization Techniques for Interactive Exploration of Document Collections Alneu de Andrade Lopes, Rosane Minghim, Vinícius Melo and Fernando Vieira

More information

UNIVERSIDADE DE SÃO PAULO Instituto de Ciências Matemáticas e de Computação

UNIVERSIDADE DE SÃO PAULO Instituto de Ciências Matemáticas e de Computação UNIVERSIDADE DE SÃO PAULO Instituto de Ciências Matemáticas e de Computação ISSN 0103-2569 RULE-BASED TOPIC MINING TO ASSIST USER-CENTERED VISUAL EXPLORATION OF DOCUMENT COLLECTIONS ROBERTO PINHO MARIA

More information

Universidade de São Paulo Instituto de Ciências Matemáticas e de Computação ISSN

Universidade de São Paulo Instituto de Ciências Matemáticas e de Computação ISSN Universidade de São Paulo Instituto de Ciências Matemáticas e de Computação ISSN - 0103-2569 Fast Content-based Visual Mapping for Interactive Exploration of Document Collections Rosane Minghim Fernando

More information

Visual text mining using association rules

Visual text mining using association rules Computers & Graphics 31 (2007) 316 326 www.elsevier.com/locate/cag Visual text mining using association rules A.A. Lopes, R. Pinho, F.V. Paulovich, R. Minghim Universidade de São Paulo, Instituto de Ciências

More information

Combining diversity queries and visual mining to improve content-based image retrieval systems: the DiVI method

Combining diversity queries and visual mining to improve content-based image retrieval systems: the DiVI method Universidade de São Paulo Biblioteca Digital da Produção Intelectual - BDPI Departamento de Ciências de Computação - ICMC/SCC Comunicações em Eventos - ICMC/SCC 2015-12 Combining diversity queries and

More information

Visualization and text mining of patent and non-patent data

Visualization and text mining of patent and non-patent data of patent and non-patent data Anton Heijs Information Solutions Delft, The Netherlands http://www.treparel.com/ ICIC conference, Nice, France, 2008 Outline Introduction Applications on patent and non-patent

More information

Luis Gustavo Nonato Complete List of Publications November 2017

Luis Gustavo Nonato Complete List of Publications November 2017 Journal Papers Luis Gustavo Nonato Complete List of Publications November 2017 1. SAGRISTA, A.; JORDAN, S.; JUST, A.; DIAS, F.; NONATO, L. G.; SADLO, F. Topological Analysis of Inertial Dynamics. IEEE

More information

Multidimensional Projection with Radial Basis Function and Control Points Selection

Multidimensional Projection with Radial Basis Function and Control Points Selection Multidimensional Projection with Radial Basis Function and Control Points Selection Elisa Amorim 1 Emilio Vital Brazil 1 Luis Gustavo Nonato 2 Faramarz Samavati 1 Mario Costa Sousa 1 1 University of Calgary

More information

STATS306B STATS306B. Clustering. Jonathan Taylor Department of Statistics Stanford University. June 3, 2010

STATS306B STATS306B. Clustering. Jonathan Taylor Department of Statistics Stanford University. June 3, 2010 STATS306B Jonathan Taylor Department of Statistics Stanford University June 3, 2010 Spring 2010 Outline K-means, K-medoids, EM algorithm choosing number of clusters: Gap test hierarchical clustering spectral

More information

CSE 6242 A / CS 4803 DVA. Feb 12, Dimension Reduction. Guest Lecturer: Jaegul Choo

CSE 6242 A / CS 4803 DVA. Feb 12, Dimension Reduction. Guest Lecturer: Jaegul Choo CSE 6242 A / CS 4803 DVA Feb 12, 2013 Dimension Reduction Guest Lecturer: Jaegul Choo CSE 6242 A / CS 4803 DVA Feb 12, 2013 Dimension Reduction Guest Lecturer: Jaegul Choo Data is Too Big To Do Something..

More information

Colorization by Multidimensional Projection

Colorization by Multidimensional Projection Colorization by Multidimensional Projection Wallace Casaca, Erick Gomez-Nieto, Cynthia O. L. Ferreira, Geovan Tavares Paulo Pagliosa, Fernando Paulovich, Luis Gustavo Nonato and Afonso Paiva ICMC, USP,

More information

Exploratory data analysis for microarrays

Exploratory data analysis for microarrays Exploratory data analysis for microarrays Jörg Rahnenführer Computational Biology and Applied Algorithmics Max Planck Institute for Informatics D-66123 Saarbrücken Germany NGFN - Courses in Practical DNA

More information

Machine Learning Methods in Visualisation for Big Data 2018

Machine Learning Methods in Visualisation for Big Data 2018 Machine Learning Methods in Visualisation for Big Data 2018 Daniel Archambault1 Ian Nabney2 Jaakko Peltonen3 1 Swansea University 2 University of Bristol 3 University of Tampere, Aalto University Evaluating

More information

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Cluster Analysis Mu-Chun Su Department of Computer Science and Information Engineering National Central University 2003/3/11 1 Introduction Cluster analysis is the formal study of algorithms and methods

More information

Exploratory Data Analysis using Self-Organizing Maps. Madhumanti Ray

Exploratory Data Analysis using Self-Organizing Maps. Madhumanti Ray Exploratory Data Analysis using Self-Organizing Maps Madhumanti Ray Content Introduction Data Analysis methods Self-Organizing Maps Conclusion Visualization of high-dimensional data items Exploratory data

More information

Machine Learning for OR & FE

Machine Learning for OR & FE Machine Learning for OR & FE Unsupervised Learning: Clustering Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com (Some material

More information

Link prediction in graph construction for supervised and semi-supervised learning

Link prediction in graph construction for supervised and semi-supervised learning Link prediction in graph construction for supervised and semi-supervised learning Lilian Berton, Jorge Valverde-Rebaza and Alneu de Andrade Lopes Laboratory of Computational Intelligence (LABIC) University

More information

CS 1675 Introduction to Machine Learning Lecture 18. Clustering. Clustering. Groups together similar instances in the data sample

CS 1675 Introduction to Machine Learning Lecture 18. Clustering. Clustering. Groups together similar instances in the data sample CS 1675 Introduction to Machine Learning Lecture 18 Clustering Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square Clustering Groups together similar instances in the data sample Basic clustering problem:

More information

Distance-based Phylogenetic Methods Near a Polytomy

Distance-based Phylogenetic Methods Near a Polytomy Distance-based Phylogenetic Methods Near a Polytomy Ruth Davidson and Seth Sullivant NCSU UIUC May 21, 2014 2 Phylogenetic trees model the common evolutionary history of a group of species Leaves = extant

More information

Overview. Spectral Processing of Point- Sampled Geometry. Introduction. Introduction. Fourier Transform. Fourier Transform

Overview. Spectral Processing of Point- Sampled Geometry. Introduction. Introduction. Fourier Transform. Fourier Transform Overview Spectral Processing of Point- Sampled Geometry Introduction Fourier transform Spectral processing pipeline Spectral filtering Adaptive subsampling Summary Point-Based Computer Graphics Markus

More information

CIE L*a*b* color model

CIE L*a*b* color model CIE L*a*b* color model To further strengthen the correlation between the color model and human perception, we apply the following non-linear transformation: with where (X n,y n,z n ) are the tristimulus

More information

1. Discovering Important Nodes through Graph Entropy The Case of Enron Database

1. Discovering Important Nodes through Graph Entropy The Case of Enron  Database 1. Discovering Important Nodes through Graph Entropy The Case of Enron Email Database ACM KDD 2005 Chicago, Illinois. 2. Optimizing Video Search Reranking Via Minimum Incremental Information Loss ACM MIR

More information

The Pre-Image Problem in Kernel Methods

The Pre-Image Problem in Kernel Methods The Pre-Image Problem in Kernel Methods James Kwok Ivor Tsang Department of Computer Science Hong Kong University of Science and Technology Hong Kong The Pre-Image Problem in Kernel Methods ICML-2003 1

More information

Figure (5) Kohonen Self-Organized Map

Figure (5) Kohonen Self-Organized Map 2- KOHONEN SELF-ORGANIZING MAPS (SOM) - The self-organizing neural networks assume a topological structure among the cluster units. - There are m cluster units, arranged in a one- or two-dimensional array;

More information

Lecture Topic Projects

Lecture Topic Projects Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis techniques for non-spatial data Project #1 out 4 Data

More information

OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT

OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT Joost Broekens Tim Cocx Walter A. Kosters Leiden Institute of Advanced Computer Science Leiden University, The Netherlands Email: {broekens,

More information

Clustering. Sandrien van Ommen

Clustering. Sandrien van Ommen Clustering Sandrien van Ommen Overview Why clustering When clustering Types of clustering Dialects Distances Dutch towns Buldialect Conclusion Why clustering To find similarity in your data T-test & Anova

More information

K-means clustering Based in part on slides from textbook, slides of Susan Holmes. December 2, Statistics 202: Data Mining.

K-means clustering Based in part on slides from textbook, slides of Susan Holmes. December 2, Statistics 202: Data Mining. K-means clustering Based in part on slides from textbook, slides of Susan Holmes December 2, 2012 1 / 1 K-means Outline K-means, K-medoids Choosing the number of clusters: Gap test, silhouette plot. Mixture

More information

Cluster Analysis and Visualization. Workshop on Statistics and Machine Learning 2004/2/6

Cluster Analysis and Visualization. Workshop on Statistics and Machine Learning 2004/2/6 Cluster Analysis and Visualization Workshop on Statistics and Machine Learning 2004/2/6 Outlines Introduction Stages in Clustering Clustering Analysis and Visualization One/two-dimensional Data Histogram,

More information

Non-linear dimension reduction

Non-linear dimension reduction Sta306b May 23, 2011 Dimension Reduction: 1 Non-linear dimension reduction ISOMAP: Tenenbaum, de Silva & Langford (2000) Local linear embedding: Roweis & Saul (2000) Local MDS: Chen (2006) all three methods

More information

Methods for Intelligent Systems

Methods for Intelligent Systems Methods for Intelligent Systems Lecture Notes on Clustering (II) Davide Eynard eynard@elet.polimi.it Department of Electronics and Information Politecnico di Milano Davide Eynard - Lecture Notes on Clustering

More information

1 Introduction. 3 Data Preprocessing. 2 Literature Review

1 Introduction. 3 Data Preprocessing. 2 Literature Review Rock or not? This sure does. [Category] Audio & Music CS 229 Project Report Anand Venkatesan(anand95), Arjun Parthipan(arjun777), Lakshmi Manoharan(mlakshmi) 1 Introduction Music Genre Classification continues

More information

CS 2750 Machine Learning. Lecture 19. Clustering. CS 2750 Machine Learning. Clustering. Groups together similar instances in the data sample

CS 2750 Machine Learning. Lecture 19. Clustering. CS 2750 Machine Learning. Clustering. Groups together similar instances in the data sample Lecture 9 Clustering Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square Clustering Groups together similar instances in the data sample Basic clustering problem: distribute data into k different groups

More information

Unsupervised learning

Unsupervised learning Unsupervised learning Enrique Muñoz Ballester Dipartimento di Informatica via Bramante 65, 26013 Crema (CR), Italy enrique.munoz@unimi.it Enrique Muñoz Ballester 2017 1 Download slides data and scripts:

More information

What to come. There will be a few more topics we will cover on supervised learning

What to come. There will be a few more topics we will cover on supervised learning Summary so far Supervised learning learn to predict Continuous target regression; Categorical target classification Linear Regression Classification Discriminative models Perceptron (linear) Logistic regression

More information

Clustering: Overview and K-means algorithm

Clustering: Overview and K-means algorithm Clustering: Overview and K-means algorithm Informal goal Given set of objects and measure of similarity between them, group similar objects together K-Means illustrations thanks to 2006 student Martin

More information

CS7267 MACHINE LEARNING NEAREST NEIGHBOR ALGORITHM. Mingon Kang, PhD Computer Science, Kennesaw State University

CS7267 MACHINE LEARNING NEAREST NEIGHBOR ALGORITHM. Mingon Kang, PhD Computer Science, Kennesaw State University CS7267 MACHINE LEARNING NEAREST NEIGHBOR ALGORITHM Mingon Kang, PhD Computer Science, Kennesaw State University KNN K-Nearest Neighbors (KNN) Simple, but very powerful classification algorithm Classifies

More information

Dimension reduction : PCA and Clustering

Dimension reduction : PCA and Clustering Dimension reduction : PCA and Clustering By Hanne Jarmer Slides by Christopher Workman Center for Biological Sequence Analysis DTU The DNA Array Analysis Pipeline Array design Probe design Question Experimental

More information

Histogram Equalization and Specification for High-dimensional Data Visualization using RadViz

Histogram Equalization and Specification for High-dimensional Data Visualization using RadViz Histogram Equalization and Specification for High-dimensional Data Visualization using RadViz Yan-Chao Wang Rolls-Royce@NTU Corporate Lab, ywang054@e.ntu.edu.sg Chi-Keong Goh Advanced Technology Centre,

More information

Texture Segmentation by Windowed Projection

Texture Segmentation by Windowed Projection Texture Segmentation by Windowed Projection 1, 2 Fan-Chen Tseng, 2 Ching-Chi Hsu, 2 Chiou-Shann Fuh 1 Department of Electronic Engineering National I-Lan Institute of Technology e-mail : fctseng@ccmail.ilantech.edu.tw

More information

Vehicle Formations Based on Formations Based on Ranges and Ranges and Bearings Bearings

Vehicle Formations Based on Formations Based on Ranges and Ranges and Bearings Bearings Collaborative Collaborative Localization Localization of Vehicle of Vehicle Formations Based on Formations Based on Ranges and Ranges and Bearings Bearings Beatriz Q. Ferreira, João Gomes, Cláudia Soares,

More information

Clustering and Visualisation of Data

Clustering and Visualisation of Data Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some

More information

Gene Clustering & Classification

Gene Clustering & Classification BINF, Introduction to Computational Biology Gene Clustering & Classification Young-Rae Cho Associate Professor Department of Computer Science Baylor University Overview Introduction to Gene Clustering

More information

Content-Aware Image Resizing

Content-Aware Image Resizing Content-Aware Image Resizing EE368 Project Report Parnian Zargham Stanford University Electrical Engineering Department Stanford, CA pzargham@stanford.edu Sahar Nassirpour Stanford University Electrical

More information

Shape spaces. Shape usually defined explicitly to be residual: independent of size.

Shape spaces. Shape usually defined explicitly to be residual: independent of size. Shape spaces Before define a shape space, must define shape. Numerous definitions of shape in relation to size. Many definitions of size (which we will explicitly define later). Shape usually defined explicitly

More information

IFSC/USP at ImageCLEF 2011: Plant identification task

IFSC/USP at ImageCLEF 2011: Plant identification task IFSC/USP at ImageCLEF 2011: Plant identification task Dalcimar Casanova, João Batista Florindo, and Odemir Martinez Bruno USP - Universidade de São Paulo IFSC - Instituto de Física de São Carlos, São Carlos,

More information

Multidimensional scaling Based in part on slides from textbook, slides of Susan Holmes. October 10, Statistics 202: Data Mining

Multidimensional scaling Based in part on slides from textbook, slides of Susan Holmes. October 10, Statistics 202: Data Mining Multidimensional scaling Based in part on slides from textbook, slides of Susan Holmes October 10, 2012 1 / 1 Multidimensional scaling A visual tool Recall the PCA scores were X V = U where X = HX S 1/2

More information

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD 10 Text Mining Munawar, PhD Definition Text mining also is known as Text Data Mining (TDM) and Knowledge Discovery in Textual Database (KDT).[1] A process of identifying novel information from a collection

More information

Unsupervised Learning

Unsupervised Learning Unsupervised Learning Learning without Class Labels (or correct outputs) Density Estimation Learn P(X) given training data for X Clustering Partition data into clusters Dimensionality Reduction Discover

More information

Clustering: Overview and K-means algorithm

Clustering: Overview and K-means algorithm Clustering: Overview and K-means algorithm Informal goal Given set of objects and measure of similarity between them, group similar objects together K-Means illustrations thanks to 2006 student Martin

More information

Non-Bayesian Classifiers Part I: k-nearest Neighbor Classifier and Distance Functions

Non-Bayesian Classifiers Part I: k-nearest Neighbor Classifier and Distance Functions Non-Bayesian Classifiers Part I: k-nearest Neighbor Classifier and Distance Functions Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Fall 2017 CS 551,

More information

Lecture 6: Unsupervised Machine Learning Dagmar Gromann International Center For Computational Logic

Lecture 6: Unsupervised Machine Learning Dagmar Gromann International Center For Computational Logic SEMANTIC COMPUTING Lecture 6: Unsupervised Machine Learning Dagmar Gromann International Center For Computational Logic TU Dresden, 23 November 2018 Overview Unsupervised Machine Learning overview Association

More information

Cluster Analysis. Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX April 2008 April 2010

Cluster Analysis. Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX April 2008 April 2010 Cluster Analysis Prof. Thomas B. Fomby Department of Economics Southern Methodist University Dallas, TX 7575 April 008 April 010 Cluster Analysis, sometimes called data segmentation or customer segmentation,

More information

CURRICULUM VITAE. Personal Data. Education. Research Projects. Name Silvia Cristina Dias Pinto Birth January 2, 1974 Nationality Brazilian

CURRICULUM VITAE. Personal Data. Education. Research Projects. Name Silvia Cristina Dias Pinto Birth January 2, 1974 Nationality Brazilian CURRICULUM VITAE Personal Data Name Silvia Cristina Dias Pinto Birth January 2, 1974 Nationality Brazilian Address Rua Araraquara, 673 Jardim Iguatemi - Sorocaba - SP 18085-470, Brazil Phone (home) +55

More information

Unsupervised Learning

Unsupervised Learning Unsupervised Learning Unsupervised learning Until now, we have assumed our training samples are labeled by their category membership. Methods that use labeled samples are said to be supervised. However,

More information

Many application fields produce large amounts of. Beyond the Third Dimension: Visualizing High-dimensional Data with Projections

Many application fields produce large amounts of. Beyond the Third Dimension: Visualizing High-dimensional Data with Projections SECTion ViSualizaTion TiTlE CornEr Editors: Konrad Joao Comba, Hinsen, UFRGS, hinsen@cnrs-orleans.fr comba@inf.ufrgs.br, Konstantin and Daniel Läufer, Weiskopf, laufer@cs.luc.edu weiskopf@visus.uni-stuttgart.de

More information

Unsupervised Learning

Unsupervised Learning Unsupervised Learning Chapter 14: The Elements of Statistical Learning Presented for 540 by Len Tanaka Objectives Introduction Techniques: Association Rules Cluster Analysis Self-Organizing Maps Projective

More information

Visual Analysis of Classification Scheme

Visual Analysis of Classification Scheme Visual Analysis of Classification Scheme Veslava Osińska Institute of Information Science and Book Studies Nicolaus Copernicus University Toruń, Poland 1/26 Information Visualization (INFOVIS) Visualization

More information

Robot Manifolds for Direct and Inverse Kinematics Solutions

Robot Manifolds for Direct and Inverse Kinematics Solutions Robot Manifolds for Direct and Inverse Kinematics Solutions Bruno Damas Manuel Lopes Abstract We present a novel algorithm to estimate robot kinematic manifolds incrementally. We relate manifold learning

More information

Abstractacceptedforpresentationatthe2018SEGConventionatAnaheim,California.Presentationtobemadeinsesion

Abstractacceptedforpresentationatthe2018SEGConventionatAnaheim,California.Presentationtobemadeinsesion Abstractacceptedforpresentationatthe2018SEGConventionatAnaheim,California.Presentationtobemadeinsesion MLDA3:FaciesClasificationandReservoirProperties2,onOctober17,2018from 11:25am to11:50am inroom 204B

More information

Feature Extraction and Image Processing, 2 nd Edition. Contents. Preface

Feature Extraction and Image Processing, 2 nd Edition. Contents. Preface , 2 nd Edition Preface ix 1 Introduction 1 1.1 Overview 1 1.2 Human and Computer Vision 1 1.3 The Human Vision System 3 1.3.1 The Eye 4 1.3.2 The Neural System 7 1.3.3 Processing 7 1.4 Computer Vision

More information

MEDICAL IMAGE ANALYSIS

MEDICAL IMAGE ANALYSIS SECOND EDITION MEDICAL IMAGE ANALYSIS ATAM P. DHAWAN g, A B IEEE Engineering in Medicine and Biology Society, Sponsor IEEE Press Series in Biomedical Engineering Metin Akay, Series Editor +IEEE IEEE PRESS

More information

CHAPTER 3 TUMOR DETECTION BASED ON NEURO-FUZZY TECHNIQUE

CHAPTER 3 TUMOR DETECTION BASED ON NEURO-FUZZY TECHNIQUE 32 CHAPTER 3 TUMOR DETECTION BASED ON NEURO-FUZZY TECHNIQUE 3.1 INTRODUCTION In this chapter we present the real time implementation of an artificial neural network based on fuzzy segmentation process

More information

Biomedical Image Analysis based on Computational Registration Methods. João Manuel R. S. Tavares

Biomedical Image Analysis based on Computational Registration Methods. João Manuel R. S. Tavares Biomedical Image Analysis based on Computational Registration Methods João Manuel R. S. Tavares tavares@fe.up.pt, www.fe.up.pt/~tavares Outline 1. Introduction 2. Methods a) Spatial Registration of (2D

More information

Network Traffic Measurements and Analysis

Network Traffic Measurements and Analysis DEIB - Politecnico di Milano Fall, 2017 Introduction Often, we have only a set of features x = x 1, x 2,, x n, but no associated response y. Therefore we are not interested in prediction nor classification,

More information

The K-modes and Laplacian K-modes algorithms for clustering

The K-modes and Laplacian K-modes algorithms for clustering The K-modes and Laplacian K-modes algorithms for clustering Miguel Á. Carreira-Perpiñán Electrical Engineering and Computer Science University of California, Merced http://faculty.ucmerced.edu/mcarreira-perpinan

More information

Unsupervised learning: Clustering & Dimensionality reduction. Theo Knijnenburg Jorma de Ronde

Unsupervised learning: Clustering & Dimensionality reduction. Theo Knijnenburg Jorma de Ronde Unsupervised learning: Clustering & Dimensionality reduction Theo Knijnenburg Jorma de Ronde Source of slides Marcel Reinders TU Delft Lodewyk Wessels NKI Bioalgorithms.info Jeffrey D. Ullman Stanford

More information

Understanding Clustering Supervising the unsupervised

Understanding Clustering Supervising the unsupervised Understanding Clustering Supervising the unsupervised Janu Verma IBM T.J. Watson Research Center, New York http://jverma.github.io/ jverma@us.ibm.com @januverma Clustering Grouping together similar data

More information

THE COMPUTER MODELLING OF GLUING FLAT IMAGES ALGORITHMS. Alekseí Yu. Chekunov. 1. Introduction

THE COMPUTER MODELLING OF GLUING FLAT IMAGES ALGORITHMS. Alekseí Yu. Chekunov. 1. Introduction MATEMATIQKI VESNIK Corrected proof Available online 01.10.2016 originalni nauqni rad research paper THE COMPUTER MODELLING OF GLUING FLAT IMAGES ALGORITHMS Alekseí Yu. Chekunov Abstract. In this paper

More information

Searching non-text information objects

Searching non-text information objects Non-text digital objects Searching non-text information objects Music Speech Images 3D models Video? 1 2 Ways to query for something Query by describing content 1. Query by category/ theme easiest - work

More information

CLUSTERING. JELENA JOVANOVIĆ Web:

CLUSTERING. JELENA JOVANOVIĆ   Web: CLUSTERING JELENA JOVANOVIĆ Email: jeljov@gmail.com Web: http://jelenajovanovic.net OUTLINE What is clustering? Application domains K-Means clustering Understanding it through an example The K-Means algorithm

More information

Clustering. Bruno Martins. 1 st Semester 2012/2013

Clustering. Bruno Martins. 1 st Semester 2012/2013 Departamento de Engenharia Informática Instituto Superior Técnico 1 st Semester 2012/2013 Slides baseados nos slides oficiais do livro Mining the Web c Soumen Chakrabarti. Outline 1 Motivation Basic Concepts

More information

Generalized Principal Component Analysis CVPR 2007

Generalized Principal Component Analysis CVPR 2007 Generalized Principal Component Analysis Tutorial @ CVPR 2007 Yi Ma ECE Department University of Illinois Urbana Champaign René Vidal Center for Imaging Science Institute for Computational Medicine Johns

More information

Linear and Non-linear Dimentionality Reduction Applied to Gene Expression Data of Cancer Tissue Samples

Linear and Non-linear Dimentionality Reduction Applied to Gene Expression Data of Cancer Tissue Samples Linear and Non-linear Dimentionality Reduction Applied to Gene Expression Data of Cancer Tissue Samples Franck Olivier Ndjakou Njeunje Applied Mathematics, Statistics, and Scientific Computation University

More information

Clustering and Dimensionality Reduction. Stony Brook University CSE545, Fall 2017

Clustering and Dimensionality Reduction. Stony Brook University CSE545, Fall 2017 Clustering and Dimensionality Reduction Stony Brook University CSE545, Fall 2017 Goal: Generalize to new data Model New Data? Original Data Does the model accurately reflect new data? Supervised vs. Unsupervised

More information

Viz3D: Effective Exploratory Visualization of Large Multidimensional Data Sets

Viz3D: Effective Exploratory Visualization of Large Multidimensional Data Sets Viz3D: Effective Exploratory Visualization of Large Multidimensional Data Sets Almir Olivette Artero Maria Cristina Ferreira de Oliveira Instituto de Ciências Matemáticas e Computação Universidade de São

More information

COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS

COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS Toomas Kirt Supervisor: Leo Võhandu Tallinn Technical University Toomas.Kirt@mail.ee Abstract: Key words: For the visualisation

More information

GEMINI GEneric Multimedia INdexIng

GEMINI GEneric Multimedia INdexIng GEMINI GEneric Multimedia INdexIng GEneric Multimedia INdexIng distance measure Sub-pattern Match quick and dirty test Lower bounding lemma 1-D Time Sequences Color histograms Color auto-correlogram Shapes

More information

DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING DS7201 ADVANCED DIGITAL IMAGE PROCESSING II M.E (C.S) QUESTION BANK UNIT I 1. Write the differences between photopic and scotopic vision? 2. What

More information

An Improvement of Centroid-Based Classification Algorithm for Text Classification

An Improvement of Centroid-Based Classification Algorithm for Text Classification An Improvement of Centroid-Based Classification Algorithm for Text Classification Zehra Cataltepe, Eser Aygun Istanbul Technical Un. Computer Engineering Dept. Ayazaga, Sariyer, Istanbul, Turkey cataltepe@itu.edu.tr,

More information

Linear Discriminant Analysis for 3D Face Recognition System

Linear Discriminant Analysis for 3D Face Recognition System Linear Discriminant Analysis for 3D Face Recognition System 3.1 Introduction Face recognition and verification have been at the top of the research agenda of the computer vision community in recent times.

More information

Olmo S. Zavala Romero. Clustering Hierarchical Distance Group Dist. K-means. Center of Atmospheric Sciences, UNAM.

Olmo S. Zavala Romero. Clustering Hierarchical Distance Group Dist. K-means. Center of Atmospheric Sciences, UNAM. Center of Atmospheric Sciences, UNAM November 16, 2016 Cluster Analisis Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster)

More information

Contents. List of Figures. List of Tables. List of Algorithms. I Clustering, Data, and Similarity Measures 1

Contents. List of Figures. List of Tables. List of Algorithms. I Clustering, Data, and Similarity Measures 1 Contents List of Figures List of Tables List of Algorithms Preface xiii xv xvii xix I Clustering, Data, and Similarity Measures 1 1 Data Clustering 3 1.1 Definition of Data Clustering... 3 1.2 The Vocabulary

More information

Overview. Efficient Simplification of Point-sampled Surfaces. Introduction. Introduction. Neighborhood. Local Surface Analysis

Overview. Efficient Simplification of Point-sampled Surfaces. Introduction. Introduction. Neighborhood. Local Surface Analysis Overview Efficient Simplification of Pointsampled Surfaces Introduction Local surface analysis Simplification methods Error measurement Comparison PointBased Computer Graphics Mark Pauly PointBased Computer

More information

Fast Document Clustering Based on Weighted Comparative Advantage

Fast Document Clustering Based on Weighted Comparative Advantage Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Fast Document Clustering Based on Weighted Comparative Advantage Jie Ji Intelligent

More information

9.1. K-means Clustering

9.1. K-means Clustering 424 9. MIXTURE MODELS AND EM Section 9.2 Section 9.3 Section 9.4 view of mixture distributions in which the discrete latent variables can be interpreted as defining assignments of data points to specific

More information

Unsupervised Feature Selection for Sparse Data

Unsupervised Feature Selection for Sparse Data Unsupervised Feature Selection for Sparse Data Artur Ferreira 1,3 Mário Figueiredo 2,3 1- Instituto Superior de Engenharia de Lisboa, Lisboa, PORTUGAL 2- Instituto Superior Técnico, Lisboa, PORTUGAL 3-

More information

Department of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _

Department of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _ COURSE DELIVERY PLAN - THEORY Page 1 of 6 Department of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _ LP: CS6007 Rev. No: 01 Date: 27/06/2017 Sub.

More information

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering

INF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering INF4820 Algorithms for AI and NLP Evaluating Classifiers Clustering Erik Velldal & Stephan Oepen Language Technology Group (LTG) September 23, 2015 Agenda Last week Supervised vs unsupervised learning.

More information

THE COMPUTER MODELLING OF GLUING FLAT IMAGES ALGORITHMS. Alekseí Yu. Chekunov. 1. Introduction

THE COMPUTER MODELLING OF GLUING FLAT IMAGES ALGORITHMS. Alekseí Yu. Chekunov. 1. Introduction MATEMATIČKI VESNIK MATEMATIQKI VESNIK 69, 1 (2017), 12 22 March 2017 research paper originalni nauqni rad THE COMPUTER MODELLING OF GLUING FLAT IMAGES ALGORITHMS Alekseí Yu. Chekunov Abstract. In this

More information

ECLT 5810 Clustering

ECLT 5810 Clustering ECLT 5810 Clustering What is Cluster Analysis? Cluster: a collection of data objects Similar to one another within the same cluster Dissimilar to the objects in other clusters Cluster analysis Grouping

More information

Machine Learning for Signal Processing Clustering. Bhiksha Raj Class Oct 2016

Machine Learning for Signal Processing Clustering. Bhiksha Raj Class Oct 2016 Machine Learning for Signal Processing Clustering Bhiksha Raj Class 11. 13 Oct 2016 1 Statistical Modelling and Latent Structure Much of statistical modelling attempts to identify latent structure in the

More information

VC 16/17 TP5 Single Pixel Manipulation

VC 16/17 TP5 Single Pixel Manipulation VC 16/17 TP5 Single Pixel Manipulation Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos Hélder Filipe Pinto de Oliveira Outline Dynamic Range Manipulation

More information

Content Based Image Retrieval Using Color and Texture Feature with Distance Matrices

Content Based Image Retrieval Using Color and Texture Feature with Distance Matrices International Journal of Scientific and Research Publications, Volume 7, Issue 8, August 2017 512 Content Based Image Retrieval Using Color and Texture Feature with Distance Matrices Manisha Rajput Department

More information

Today. Gradient descent for minimization of functions of real variables. Multi-dimensional scaling. Self-organizing maps

Today. Gradient descent for minimization of functions of real variables. Multi-dimensional scaling. Self-organizing maps Today Gradient descent for minimization of functions of real variables. Multi-dimensional scaling Self-organizing maps Gradient Descent Derivatives Consider function f(x) : R R. The derivative w.r.t. x

More information

Unsupervised Learning. Presenter: Anil Sharma, PhD Scholar, IIIT-Delhi

Unsupervised Learning. Presenter: Anil Sharma, PhD Scholar, IIIT-Delhi Unsupervised Learning Presenter: Anil Sharma, PhD Scholar, IIIT-Delhi Content Motivation Introduction Applications Types of clustering Clustering criterion functions Distance functions Normalization Which

More information

Iterative random projections for high-dimensional data clustering

Iterative random projections for high-dimensional data clustering Iterative random projections for high-dimensional data clustering Ângelo Cardoso, Andreas Wichert INESC-ID Lisboa and Instituto Superior Técnico, Technical University of Lisbon Av. Prof. Dr. Aníbal Cavaco

More information