Generalized Transitive Distance with Minimum Spanning Random Forest

Similar documents
On Order-Constrained Transitive Distance

Visual Representations for Machine Learning

Subspace Clustering. Weiwei Feng. December 11, 2015

Understanding Clustering Supervising the unsupervised

Subspace Clustering with Global Dimension Minimization And Application to Motion Segmentation

Segmentation Computer Vision Spring 2018, Lecture 27

Hard clustering. Each object is assigned to one and only one cluster. Hierarchical clustering is usually hard. Soft (fuzzy) clustering

Random Forest Clustering and Application to Video Segmentation

Statistics 202: Data Mining. c Jonathan Taylor. Week 8 Based in part on slides from textbook, slides of Susan Holmes. December 2, / 1

Clustering. CE-717: Machine Learning Sharif University of Technology Spring Soleymani

Cluster Analysis. Angela Montanari and Laura Anderlucci

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Slides From Lecture Notes for Chapter 8. Introduction to Data Mining

Clustering CS 550: Machine Learning

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining

Applications. Foreground / background segmentation Finding skin-colored regions. Finding the moving objects. Intelligent scissors

Low Rank Representation Theories, Algorithms, Applications. 林宙辰 北京大学 May 3, 2012

Discrete geometry. Lecture 2. Alexander & Michael Bronstein tosca.cs.technion.ac.il/book

Clustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search

Introduction to spectral clustering

Spectral clustering Lecture 3

Unsupervised Learning : Clustering

Explore Co-clustering on Job Applications. Qingyun Wan SUNet ID:qywan

Lecture 7: Segmentation. Thursday, Sept 20

Clustering algorithms and introduction to persistent homology

MSA220 - Statistical Learning for Big Data

STATS306B STATS306B. Clustering. Jonathan Taylor Department of Statistics Stanford University. June 3, 2010

Clustering Algorithms for general similarity measures

BBS654 Data Mining. Pinar Duygulu. Slides are adapted from Nazli Ikizler

Segmentation: Clustering, Graph Cut and EM

Clustering. CS294 Practical Machine Learning Junming Yin 10/09/06

Principal Coordinate Clustering

Clustering and Dissimilarity Measures. Clustering. Dissimilarity Measures. Cluster Analysis. Perceptually-Inspired Measures

Introduction to Machine Learning

NULL SPACE CLUSTERING WITH APPLICATIONS TO MOTION SEGMENTATION AND FACE CLUSTERING

THE key to graph-based learning algorithms is the sparse. Constructing the L2-Graph for Robust Subspace Learning and Subspace Clustering

Enhancing Clustering Results In Hierarchical Approach By Mvs Measures

DATA MINING LECTURE 7. Hierarchical Clustering, DBSCAN The EM Algorithm

Spectral Clustering. Presented by Eldad Rubinstein Based on a Tutorial by Ulrike von Luxburg TAU Big Data Processing Seminar December 14, 2014

Graph Connectivity in Sparse Subspace Clustering

Analysis of high dimensional data via Topology. Louis Xiang. Oak Ridge National Laboratory. Oak Ridge, Tennessee

CS 534: Computer Vision Segmentation and Perceptual Grouping

Hierarchical Clustering

Theoretical Foundations of Clustering. Margareta Ackerman

Learning Low-rank Transformations: Algorithms and Applications. Qiang Qiu Guillermo Sapiro

Image Segmentation continued Graph Based Methods

Supplementary Materials for Salient Object Detection: A

CS522: Advanced Algorithms

Lecture 2 September 3

A Spectral-based Clustering Algorithm for Categorical Data Using Data Summaries (SCCADDS)

Segmentation. Bottom Up Segmentation

CSE 5243 INTRO. TO DATA MINING

Machine Learning (BSMC-GA 4439) Wenke Liu

Kapitel 4: Clustering

Behavioral Data Mining. Lecture 18 Clustering

Unsupervised Learning

Clustering Lecture 3: Hierarchical Methods

Data Mining Concepts & Techniques

Hierarchical Clustering: Objectives & Algorithms. École normale supérieure & CNRS

Machine Learning (BSMC-GA 4439) Wenke Liu

HW4 VINH NGUYEN. Q1 (6 points). Chapter 8 Exercise 20

Statistics 202: Data Mining. c Jonathan Taylor. Clustering Based in part on slides from textbook, slides of Susan Holmes.

Introduction to Computer Science

Types of general clustering methods. Clustering Algorithms for general similarity measures. Similarity between clusters

Lesson 3. Prof. Enza Messina

Lecture 6: Unsupervised Machine Learning Dagmar Gromann International Center For Computational Logic

However, this is not always true! For example, this fails if both A and B are closed and unbounded (find an example).

Joint Unsupervised Learning of Deep Representations and Image Clusters Supplementary materials

Network Traffic Measurements and Analysis

Data-Driven Geometry Processing Map Synchronization I. Qixing Huang Nov. 28 th 2018

Classification. Vladimir Curic. Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University

Clustering. SC4/SM4 Data Mining and Machine Learning, Hilary Term 2017 Dino Sejdinovic

K-means clustering Based in part on slides from textbook, slides of Susan Holmes. December 2, Statistics 202: Data Mining.

Lecture Notes for Chapter 7. Introduction to Data Mining, 2 nd Edition. by Tan, Steinbach, Karpatne, Kumar

10701 Machine Learning. Clustering

CSE 6242 A / CS 4803 DVA. Feb 12, Dimension Reduction. Guest Lecturer: Jaegul Choo

Stability of closedness of convex cones under linear mappings

Clustering: Classic Methods and Modern Views

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Segmentation (continued)

Graphs: Introduction. Ali Shokoufandeh, Department of Computer Science, Drexel University

Grouping and Segmentation

CS 664 Slides #11 Image Segmentation. Prof. Dan Huttenlocher Fall 2003

Lifting Transform, Voronoi, Delaunay, Convex Hulls

Nonparametric Clustering of High Dimensional Data

A Geometric Analysis of Subspace Clustering with Outliers

Clustering. Chapter 10 in Introduction to statistical learning

Lecture 16 Segmentation and Scene understanding

Small Libraries of Protein Fragments through Clustering

On Information-Maximization Clustering: Tuning Parameter Selection and Analytic Solution

Improving Image Segmentation Quality Via Graph Theory

Robust Subspace Clustering via Thresholding Ridge Regression

The goals of segmentation

Gene Clustering & Classification

SYDE Winter 2011 Introduction to Pattern Recognition. Clustering

Unsupervised Learning

Using the Kolmogorov-Smirnov Test for Image Segmentation

Optic Cup Segmentation for Glaucoma Detection Using Low-Rank Superpixel Representation

Unsupervised Learning and Clustering

Introduction to Clustering

Summer School in Statistics for Astronomers & Physicists June 15-17, Cluster Analysis

Transcription:

Generalized Transitive Distance with Minimum Spanning Random Forest Author: Zhiding Yu and B. V. K. Vijaya Kumar, et al. Dept of Electrical and Computer Engineering Carnegie Mellon University 1

Clustering Problem Important Issues: Maximally reveal intra-cluster similarity Maximally reveal inter-cluster dis-similarity Discover clusters with non-convex shape Robust to noise 2

Existing Methods & Literatures Early Methods Centroid-Based K-Means (Lloyd 1982); Fuzzy Methods (Bezdek 1981) Connectivity-Based Hierarchical Clustering (Sibson 1973; Defays 1977) Distribution-Based More Recent Developments Mixture Models + EM Density-Based Mean Shift (Cheng 1995; Comaniciu and Meer 2002) Spectral-Based Transitive Distance (Path-Based) Subspace Clustering Spectral Clustering (Ng et al. 2002); Self-Tuning SC (Zelnik-Manor and Perona 2004); Normalized Cuts (Shi and Malik 2000); Path-Based Clustering (Fischer and Buhmann 2003b); Connectivity Kernel (Fischer, Roth, and Buhmann 2004); Transitive Dist Closure (Ding et al. 2006); Transitive Affinity (Chang and Yeung 2005; 2008) SSC (Elhamifar and Vidal 2009); LSR (Lu et al. 2012); LRR (Liu et al. 2013); L1-Graph (Cheng et al., 2010); L2-Graph (Peng et al, 2015); L0- Graph (Yang et al, 2015); SMR (Hu et al., 2014); 3

Addressing Non-Convex Clusters K-means Spectral Clustering Transitive Distance (Path-based) Clustering 4

Background: Transitive Distance 5

Transitive Distance x q x s Ideally, we want: x p Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 6

Transitive Distance x q x s Euclidean Distance: x p Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 7

Transitive Distance x q P 1 P 2 x s Intuition: Far away samples can belong to the same class, because there is strong evidence of a path connecting them. x p Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 8

Transitive Distance x q P 1 P 2 x p x s The size of the maximum gap on the path decides how strong the path evidence is. It is therefore a better measure of point distances than Euclidean distance. Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 9

Transitive Distance x q P 3 x s But there could exist many other path combinations x p P 4 Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 10

Transitive Distance x q P 1 P 2 Transitive Edge Transitive Edge x p x s Just select the path with the minimum max gap from all possible paths. The max gaps on the selected path are called transitive edges and defines the final distance. Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 11

Transitive Distance x q P 1 P 2 Transitive Edge x s Transitive Distance: Transitive Edge x p Transitive Distance: Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 12

Transitive Distance x q P 1 P 2 Transitive Edge x s Transitive Distance: Transitive Edge x p Transitive Distance: Theorem 1: Given a weighted graph with edge weights, each transitive edge lies on the minimum spanning tree (MST). Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 13

A Distance Embedding Point of View Lemma 1: The Transitive Distance is an ultrametric (metric with strong triangle property). Original Space Projected Space Lemma 2: Every finite ultrametric space with n distinct points can be embedded into an n 1 dim Euclidean space. Theorem 2: If a labeling scheme of a dataset is consistent with the original distance, then given the derived transitive distance, the convex hulls of the projected images in the TD embedded space do not intersect with each other. Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 14

A Distance Embedding Point of View Lemma 1: The Transitive Distance is an ultrametric (metric with strong triangle property). Original Space Projected Space Lemma 2: Every finite ultrametric space with n distinct points can be embedded into an n 1 dim Euclidean space. Remarks: TD can be embedded into an Euclidean space. Intuitively, for manifold or path cluster structures, TD drags far away intra-cluster data to be closer. The projected data show nice and compact clusters. It is very desirable to perform k-means clustering in the embedded space. Here, TD is doing a similar job as spectral embedding. Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 15

Top-Down Methods for Clustering with TD Algorithm 1: (Non-SVD) Given a computed TD pairwise distance matrix D, treat each row as a data sample Perform k-means on the rows to generate final clustering labels. Algorithm 2: (SVD) Given a computed TD pairwise distance matrix D, perform SVD: Extract the top several columns of U with the largest singular values. Treat each row of the columns a data sample. Perform k-means on the rows to generate final clustering labels. Zhiding Yu et al., Transitive Distance Clustering with K-Means Duality, CVPR 2014. 16

Generalized Transitive Distance 17

Robustness: Short Link Problem MST is an over-simplified representation of data. Therefore, clustering with transitive distance is sensitive to noise. (but still much better than single linkage algorithm) Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 18

Generalized Transitive Distance (GTD): Introducing Robustness from Bagging Definition: Notes: Function gmin denotes the generalized min returning a set of minimum values from multiple sets. denotes multiple sets of paths, each containing a set of all possible paths from one configuration (realization) of perturbed (bagged) graph. Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 19

Generalized Transitive Distance (GTD): Introducing Robustness from Bagging MST-1 MST-2 MST-N TD Dist Mat 1 TD Dist Mat N Element-Wise Max Pooling Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 20

Theoretical Properties Theorem 1: The generalized transitive distance is also an ultrametric, and can also be embedded into a finite dimensional Euclidean space. Theorem 2: Given a set of bagged graphs, the transitive distance edges lie on the minimum spanning random forest formed by MSTs extracted from these bagged graphs. Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 21

Bagging Algorithms Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 22

Experiments 23

Experiment: Toy Example TD + SVD GTD (Seq Kruskal / Perturb) + SVD Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 24

Experiment: Toy Example Col 1: TD + SVD. Col 2: Spectral clustering. Col 3: Self-tuning spectral clustering (Auto scale & cluster num). Col 4: Normalized cuts. Col 5: GTD (Seq Kruskal) + Non-SVD. Col 6: GTD (Perturb) + Non-SVD. Col 7: GTD (Seq Kruskal) + SVD. Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 25

Image Segmentation Algorithm RAG Input Superpixelization GTD+ Non-SVD GTD Mat Texton Feature Edge Detetection 26

Experiment: Image Segmentation Qualitative result on BSDS300 Normalized Cuts TD + Non-SVD GTD + Non-SVD Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 27

Experiment: Image Segmentation Qualitative result on BSDS300 Normalized Cuts TD + Non-SVD GTD + Non-SVD Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 28

Experiment: Image Segmentation Quantitative result on BSDS300 Zhiding Yu et al., Generalized Transitive Distance with Minimum Spanning Random Forest, IJCAI 2015. 29

Thank You! 30