Image Similarities for Learning Video Manifolds. Selen Atasoy MICCAI 2011 Tutorial

Image Similarities for Learning Video Manifolds Selen Atasoy MICCAI 2011 Tutorial

Image Spaces

Image Manifolds Tenenbaum2000 Roweis2000 Tenenbaum2000 [Tenenbaum2000: J. B. Tenenbaum, V. Silva, J. C. Langford: A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500), 2000.] [Roweis2000: S. T. Roweis, L. K. Saul: Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500), 2000]

Video Manifolds Pless2003 Atasoy2010 [Pless2003: R. Pless: Using Isomap to Explore Video Sequences: ICCV, 2003.] [Atasoy2010: S. Atasoy, D. Mateus, J. Lallemand, A. Meining, G.Z. Yang, N. Navab: Endoscopic Video Manifolds, MICCAI, 2010.] [Atasoy2011: S. Atasoy, D. Mateus, A. Meining, G.Z. Yang, N. Navab: Targeted Optical Biopsies for Surveillance Endoscopies, MICCAI, 2011.]

Theoretical Background

High dimensional data points lying on or near a manifold Manifold Learning Low dimensional representation Find a mapping that best preserves???...

1. Define a matrix based on the relations between data points Manifold Learning A General Recipe 2. Compute the eigenvectors & eigenvalues 3. Embed each sample

Manifold Learning A General Recipe Method Operator/Matrix Preserved Objective Function PCA Covariance matrix Variance of the dataset / Euclidean distances between data points Laplacian Eigenmaps Graph Laplacian Distances within the local neighbourhood of each data point ISOMAP Geodesic distance matrix Geodesic distances between data points LLE Reconstruction weights Reconstruction weights within the local neighbourhood of each data point

Rayleigh-Ritz Theorem: Manifold Learning Why does it work? eigenvalues eigenvectors Recall: Scalar product: Scalar product in H: Norm: Norm in H:

Manifold Learning Why does it work? Discrete Domain vectors Continuous Domain functions Schwarz s Kernel Theorem: Each linear operator is given as an integration against a unique kernel. That kernel is the impulse response of the linear system to an impulse (a delta function).

Manifold Learning Why does it work? Discrete Domain vectors Continuous Domain functions

Manifold Learning Why does it work? Discrete Domain vectors Continuous Domain functions The matrix H defines: which operator is applied which (Hilbert) space we are working in which quantity will be conserved

Laplacian Eigenmaps

Solve Manifold Learning Laplacian Eigenmaps Find the eigenvectors of the graph Laplacian Equivalent to solving the Helmholtz Equation [Belkin2003: M. Belkin, P. Niyogi: Laplacian eigenmaps for dimensionality reduction and data representation. Neural computation, 15(6), 1373-1396. MIT Press, 2003]

Manifold Learning Laplacian Eigenmaps - Interpretation [Chladni1787] [Levy2010] [Levy2010] [Chladni1787: E. Chladni: Discoveries in the Theory of Sound, 1787.] [Levy2010: B. Levy: Spectral Geometry Processing: ACM SIGGRAPH Course Notes, 2010.]

Non-linear Manifold Learning Laplacian Eigenmaps - Interpretation Manifold learning as bending, stretching without cutting or creating wholes Vibrational modes are preserved while bending the manifold

Endoscopic Video Manifolds (EVMs)

Endoscopic Video Manifolds Challenges Clustering Uninformative Frames

Endoscopic Video Manifolds Clustering Uninformative Frames

Endoscopic Video Manifolds Clustering Uninformative Frames Informative frame & power spectrum Uninformative frame & power spectrum Informative frame Uninformative frame [Atasoy2010: S. Atasoy, D. Mateus, J. Lallemand, A. Meining, G.Z. Yang, N. Navab: Endoscopic Video Manifolds, MICCAI, 2010.]

Endoscopic Video Manifolds Clustering Uninformative Frames [Atasoy2010: S. Atasoy, D. Mateus, J. Lallemand, A. Meining, G.Z. Yang, N. Navab: Endoscopic Video Manifolds, MICCAI, 2010.]

Significant change in endoscope viewpoint Endoscopic Video Manifolds Challenges Small overlap between frames showing the same scene Scenes do not necessarily contain distinctive features

Endoscopic Video Manifolds Clustering Endoscopic Scenes Euclidean Distance Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10 0.04 0.02 0-0.02-0.04-0.06-0.08 0.1 0.05 Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10-0.1-0.12-0.06-0.04-0.02 0 0.02 0.04 0.06 0.08-0.1 0-0.05 338 frames 150 frames 137 frames 102 frames 98 frames 78 frames 71 frames 71 frames 44 frames 38 frames [Belkin2003: M. Belkin, P. Niyogi: Laplacian eigenmaps for dimensionality reduction and data representation. Neural computation, 15(6), 1373-1396. MIT Press, 2003] [Atasoy2010: S. Atasoy, D. Mateus, J. Lallemand, A. Meining, G.Z. Yang, N. Navab: Endoscopic Video Manifolds, MICCAI, 2010.]

Endoscopic Video Manifolds Clustering Endoscopic Scenes Euclidean Distances Cluster 3 0.04 0.02 0-0.02-0.04-0.06-0.08 0.1 0.05 Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10-0.1-0.12-0.06-0.04-0.02 0 0.02 0.04 0.06 0.08-0.1 0-0.05

Endoscopic Video Manifolds Clustering Endoscopic Scenes - NCC Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10 0.04 0.02 0-0.02-0.04-0.06-0.08-0.1-0.12 0.1 Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10 0.05 0-0.05-0.1-0.08-0.06-0.04-0.02 0 0.02 0.04 389 frames 137 frames 103 frames 98 frames 85 frames 82 frames 81 frames 64 frames 44 frames 44 frames [Atasoy2010: S. Atasoy, D. Mateus, J. Lallemand, A. Meining, G.Z. Yang, N. Navab: Endoscopic Video Manifolds, MICCAI, 2010.]

Endoscopic Video Manifolds Clustering Endoscopic Scenes - NCC Euclidean Distance Normalized Cross Correlation [Atasoy2010: S. Atasoy, D. Mateus, J. Lallemand, A. Meining, G.Z. Yang, N. Navab: Endoscopic Video Manifolds, MICCAI, 2010.]

Endoscopic Video Manifolds Clustering Endoscopic Scenes - NCC

Endoscopic Video Manifolds Clustering Endoscopic Scenes with Temporal Constraints Change the adjacency matrix to include temporal constraints [Atasoy2011: S. Atasoy, D. Mateus, A. Meining, G.Z. Yang, N. Navab: Targeted Optical Biopsies for Surveillance Endoscopies, MICCAI, 2011]

Endoscopic Video Manifolds Clustering Endoscopic Scenes with Temporal Constraints Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10 0.04 0.02 0-0.02-0.04-0.06 Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10-0.08 0-0.05-0.1 0.1 0.08 0.06 0.04 0.02 0-0.02-0.04 0.15 0.1 0.05 344 frames 143 frames 126 frames 120 frames 112 frames 88 frames 55 frames 53 frames 43 frames 43 frames [Atasoy2011: S. Atasoy, D. Mateus, A. Meining, G.Z. Yang, N. Navab: Targeted Optical Biopsies for Surveillance Endoscopies, MICCAI, 2011]

Acknowledgements Prof. Nassir Navab Prof. Guang-Zhong Yang Prof. Alexander Meining Dr. Diana Mateus Thank you for your attention!!!