From processing to learning on graphs

Similar documents
Deep Learning on Graphs

Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250 Hz Supplemental Material

Computational Design. Stelian Coros

Lecture 10 CNNs on Graphs

Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

Edge Detection (with a sidelight introduction to linear, associative operators). Images

Single Image Super Resolution of Textures via CNNs. Andrew Palmer

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis

Deep Learning on Graphs

Paper Motivation. Fixed geometric structures of CNN models. CNNs are inherently limited to model geometric transformations

Multimedia Computing: Algorithms, Systems, and Applications: Edge Detection

Photo-realistic Renderings for Machines Seong-heum Kim

Broad field that includes low-level operations as well as complex high-level algorithms

Generalized Tree-Based Wavelet Transform and Applications to Patch-Based Image Processing

CHAPTER 3 DIFFERENT DOMAINS OF WATERMARKING. domain. In spatial domain the watermark bits directly added to the pixels of the cover

Data mining with sparse grids

Edge detection. Convert a 2D image into a set of curves. Extracts salient features of the scene More compact than pixels

Specification and Computation of Warping and Morphing Transformations. Bruno Costa da Silva Microsoft Corp.

An M-Channel Critically Sampled Graph Filter Bank

Edge and Texture. CS 554 Computer Vision Pinar Duygulu Bilkent University

LEARNING TO INFER GRAPHICS PROGRAMS FROM HAND DRAWN IMAGES

Edge and corner detection

Semantic Segmentation. Zhongang Qi

Translation Symmetry Detection: A Repetitive Pattern Analysis Approach

Point Lattices in Computer Graphics and Visualization how signal processing may help computer graphics

Parametric. Practices. Patrick Cunningham. CAE Associates Inc. and ANSYS Inc. Proprietary 2012 CAE Associates Inc. and ANSYS Inc. All rights reserved.

Edge detection. Goal: Identify sudden. an image. Ideal: artist s line drawing. object-level knowledge)

Randomized sampling strategies

DIGITAL IMAGE PROCESSING

Surfaces, meshes, and topology

Multi-level Partition of Unity Implicits

Rotation Invariance Neural Network

One Network to Solve Them All Solving Linear Inverse Problems using Deep Projection Models

Deep 3D Machine Learning for Reconstruction and Repair of 3D Surfaces

Markov Networks in Computer Vision

3.5 Filtering with the 2D Fourier Transform Basic Low Pass and High Pass Filtering using 2D DFT Other Low Pass Filters

Markov Networks in Computer Vision. Sargur Srihari

Frequency analysis, pyramids, texture analysis, applications (face detection, category recognition)

Neural Networks and Deep Learning

Geometric Modeling and Processing

Cluster Analysis (b) Lijun Zhang

Scanning Real World Objects without Worries 3D Reconstruction

Advanced phase retrieval: maximum likelihood technique with sparse regularization of phase and amplitude

Data mining with sparse grids using simplicial basis functions

Edge Detection. Computer Vision Shiv Ram Dubey, IIIT Sri City

Su et al. Shape Descriptors - III

A Wavelet Tour of Signal Processing The Sparse Way

Final Exam Schedule. Final exam has been scheduled. 12:30 pm 3:00 pm, May 7. Location: INNOVA It will cover all the topics discussed in class

Cluster EMD and its Statistical Application

CoE4TN3 Image Processing. Wavelet and Multiresolution Processing. Image Pyramids. Image pyramids. Introduction. Multiresolution.

CS468: 3D Deep Learning on Point Cloud Data. class label part label. Hao Su. image. May 10, 2017

Markov Random Fields and Gibbs Sampling for Image Denoising

A Deep Learning Framework for Authorship Classification of Paintings

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS

Neural style transfer

Algorithms for 3D Isometric Shape Correspondence

Locally Weighted Learning for Control. Alexander Skoglund Machine Learning Course AASS, June 2005

Machine Problem 8 - Mean Field Inference on Boltzman Machine

CS201: Computer Vision Introduction to Tracking

Unsupervised learning in Vision

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines

03 - Reconstruction. Acknowledgements: Olga Sorkine-Hornung. CSCI-GA Geometric Modeling - Spring 17 - Daniele Panozzo

Mesh Processing Pipeline

Lecture 13 Segmentation and Scene Understanding Chris Choy, Ph.D. candidate Stanford Vision and Learning Lab (SVL)

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Lecture 8 Object Descriptors

Index. Umberto Michelucci 2018 U. Michelucci, Applied Deep Learning,

Schedule for Rest of Semester

Unsupervised Learning of Spatiotemporally Coherent Metrics

Bilevel Sparse Coding

Supplemental Document for Deep Photo Style Transfer

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction

CS231N Project Final Report - Fast Mixed Style Transfer

Image Processing. Filtering. Slide 1

Computer Vision I. Announcements. Fourier Tansform. Efficient Implementation. Edge and Corner Detection. CSE252A Lecture 13.

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

WAVELET USE FOR IMAGE RESTORATION

Gabor and Wavelet transforms for signals defined on graphs An invitation to signal processing on graphs

Graph Wavelets and Filter Banks Designs in Graph Spectral Domain

Edge Detection. Today s reading. Cipolla & Gee on edge detection (available online) From Sandlot Science

List of Accepted Papers for ICVGIP 2018

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17

Non-Differentiable Image Manifolds

Geometric VLAD for Large Scale Image Search. Zixuan Wang 1, Wei Di 2, Anurag Bhardwaj 2, Vignesh Jagadesh 2, Robinson Piramuthu 2

Robotics Programming Laboratory

Image Enhancement Techniques for Fingerprint Identification

Digital Image Processing

Sparsity and image processing

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Introduction to Computer Graphics. Animation (1) May 19, 2016 Kenshi Takayama

Comparison of Default Patient Surface Model Estimation Methods

EE795: Computer Vision and Intelligent Systems

Geodesics in heat: A new approach to computing distance

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Facial Expression Classification with Random Filters Feature Extraction

Transcription:

From processing to learning on graphs Patrick Pérez Maths and Images in Paris IHP, 2 March 2017

Signals on graphs Natural graph: mesh, network, etc., related to a real structure, various signals can live on it Instrumental graph: derived from a collection or a signal, captures its structure, other signals leverage it 2

Playing with graph signals Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer 3

Playing with graph signals Puy 2016-2017 Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer 4

Playing with graph signals Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer Puy 2017 5

Playing with graph signals Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer Garrido 2016 6

7 Undirected weighted graph

Graph Laplacian(s) Vertex degree and degree matrix Symmetric p.s.d. Laplacians Combinatorial Laplacian Normalized Laplacian 8

Graph signal and smoothness Signals / functions on graph Scalar Multi-dim. Graph smoothness Scalar Multi-dimensional 9

Spectral graph analysis Laplacian diagonalization and graph harmonics of increasing frequencies Graph Fourier transform and its inverse Smooth (k-bandlimited) signals 10

11 Spectral graph analysis

Spectral vertex embedding Rows of truncated Fourier basis k-dim embedding of vertices Clustered with k-means in spectral clustering 12

Linear filters and convolutions Filtering in the spectral domain With filter Fourier transform Through frequency filtering Issues locality on graph computational complexity Polynomial filtering: from spectral to vertex domain Controlled locality and complexity 13

Sampling graph signals Random sampling Define vertex sampling distribution Draw signal samples accordingly Problems Reconstruction of smooth signals Performance as function of m Best sampling distribution [Puy et al. 2016] 14

Reconstructing smooth signals from samples Smooth interpolation / approximation (noisy measures) k-bandlimited approximation: exact or approximate 15 [Puy et al. 2016]

Reconstruction quality (1) Assuming RIP* Noisy measurements: Noiseless measurements: exact recovery * m large enough, for now 16

Reconstruction quality (2) Assuming RIP* * m large enough, for now 17

Optimizing sampling Some vertices are more important Norm of spectral embedding: max. energy fraction on vertex from k-bandlimited signal Exists a k-bandlimited signal concentrated on this node; should be sampled Exists no k-bandlimited signal concentrated on this node; can be ignored Graph weighted coherence of distribution should be as small as possible 18

Restricted Isometry Property (RIP) vertices are enough to sample all k-bandlimited signals In best case, suffice Once selected, vertices can be used to sample all k-bandlimited signals 19

20 Empirical RIP

Optimal and practical sampling Optimal sampling distribution measurements suffice, but requires computation of harmonics Efficient approximation Rapid computation of alternative vertex embedding of similar norms with columns of R obtained by polynomial filtering of suitable Gaussian signals Can serve also for efficient spectral clustering [Tremblay et al. 2016] 21

22 Optimal and practical sampling

Extension to group sampling Given a suitable partition of vertices Smooth graph signals almost piece-wise constant on groups [Puy and Pérez 2017] under submission Random sampling? Reconstruction? Interest Speed and memory gains (working on reduced signal versions) Interactive systems: propose sampled groups for user to annotate 23

Extension to group sampling Given a suitable partition of vertices Smooth graph signals almost piece-wise constant on groups [Puy and Pérez 2017] under submission Random sampling? Reconstruction? Interest Speed and memory gains (working on reduced signal versions) Interactive systems: propose sampled groups for user to annotate 24

Group sampling and group coherence Reasoning at group level Group sampling Local group coherence: max energy fraction in group from a k-bandlimited signal* Group coherence: * 25

Restricted Isometry Property (RIP) In best case, groups are enough to sample all k-bandlimited signals groups suffice 26

27 Smooth piece-wise constant reconstruction

Smooth piece-wise constant reconstruction Assuming RIP 28

29 Empirical RIP

30 Group sampling distributions

31

Convolutional Neural Nets (CNNs) on graph CNNs Immensely successful for image-related task (recognition, prediction, processing, editing) Layers: Convolutions, non-linearities and pooling Extension to graph signals? No natural convolution and pooling Graph structure may vary (not only size as with lattices) Computational complexity A simple proposal [Puy et al. 2017] 32

Graph-CNNs Convolution in spectral domain [Bruna et al. 2013] Computation and use of Fourier basis not scalable Difficult handling of graph changes across inputs Convolution with polynomial filters [Defferrard et al. 2016, Kipf et al. 2016] Better control of complexity and locality Not clear handling of graph changes across inputs Lack of filter diversity (e.g., rotation invariance on 2D lattice) Direct convolutions [Monti et al. 2016, Niepert et al. 2016, Puy et al. 2017] Local or global pseudo-coordinates Include convolution on regular grid as special case 33

Direct convolution on weighted graph At each vertex Extract a fixed-size signal patch Dot product with filter kernel Order, [Puy et al. 2017] Weigh, Assemble 5 4 2 1 3 6 34

Direct convolution on weighted graph Back to classic convolution Weight-based ordering and weighting Lexicographical order, no weighting 1 4 2 3 2 3 4 5 1 6 5 35

Non-local weighted pixel graph Feature-based nearest neighbor graph Given an image, one feature vector at each pixel Connect each pixel to its d nearest neighbor in feature space Weigh with exponential of feature similarity 36

One graph convolutional layer ReLU 37

Style transfer Neural example-based stylization [Gatys et al. 2015] Iterative modification of noise to fit statistics of style image and content of target image Neural statistics: Gram matrix of feature maps at a layer of a pre-trained deep CNN 38

Style transfer Using only a single random graph convolution layer Input image only used to build the graph 39

Style transfer Using only a single random graph convolution layer Input image only used to build the graph 40 Non-local graph only

Style transfer Using only a single random graph convolution layer Input image only used to build the graph 41 Non-local graph + Local graph

XXX XXX 42

Color palette transfer Using only a single random graph convolution layer target image proposed optimal transport source palette 43

Signal denoising Trained 3-layer graph CNN Local and non-local graphs from noisy input loc. weighted graph soft thresholding local or not no non-linearity loc. weighted graph no non-linearity 1 20 20 1 44

Image denoising non local 2 nd layer local 2 nd layer Noisy Trained Local Trained Non-local Haar soft thresh. 23.10dB 29.13dB 29.42dB 26.78dB 45

Triangular 3D mesh Graph Vertices: points in 3D space Edges: forming triangulated graph Weights (if any): associated to local 3D shape Signals Colors Normals Mesh deformations 46

Face capture from single video [Cao et al., 2015] [Suwajanakorn et al., 2014] [Garrido et al., 2016] Detailed 3D face rig 47

Parametric face model Two-level coarse linear modelling Inter-individual variations: linear space around average neutral face (AAM) Expressions: linear space of main modes of deformations around neutral (blendshapes) Reconstruction and tracking from raw measurements Extract person s neutral shape (morphology) Extract/track main deformations (expression/performance) Mitigate model limitations through smooth corrections Recover person-specific fine scale details 48

Parametric face model Two-level coarse linear modelling Inter-individual variations: linear space around average neutral face (AAM) Expressions: linear space of main modes of deformations around neutral (blendshapes) Reconstruction and tracking from raw measurements Extract person s neutral shape (morphology) Extract/track main deformations (expression/performance) Mitigate model limitations through smooth corrections Recover person-specific fine scale details 49

Parametric face model Two-level coarse linear modelling Inter-individual variations: linear space around average neutral face (AAM) Expressions: linear space of main modes of deformations around neutral (blendshapes) Reconstruction and tracking from raw measurements Extract person s neutral shape (morphology) Extract/track main deformations (expression/performance) Mitigate model limitations through smooth corrections Recover person-specific fine scale details 50

Parametric face model Two-level coarse linear modelling Inter-individual variations: linear space around average neutral face (AAM) Expressions: linear space of main modes of deformations around neutral (blendshapes) Reconstruction and tracking from raw measurements Extract person s neutral shape (morphology) Extract/track main deformations (expression/performance) Mitigate model limitations through smooth corrections Recover person-specific fine scale details 51

Smooth correction Layered mesh model [Garrido et al. 2016] Graph harmonics on each coordinate [Vallet and Levy 2008][Li et al. 2013] 52

Model personalization and tracking in single video Multi-layer performance capture Generic face prior Coarse Medium Fine Monocular video 53

54 Multi-layer performance capture

From capture to animation Multi-layer performance capture Generic face prior Coarse Medium Fine Monocular video Fine 55 Personalized face rig Medium X Detail learning Coarse

Personalized face rig Turn model into a face rig (puppet) Ridge regression 56

Personalized face rig Turn model into a face rig (puppet) regression fixed editable Ridge regression 57

58 Rig animation from capture

59 Rig animation from capture

From processing to learning on graphs Patrick Pérez Maths and Images in Paris IHP, 2 March 2017 G. Puy, P. Pérez. Structured sampling and fast reconstruction of smooth graph signals. Submitted to Information and Inference G. Puy, S. Kitic, P. Pérez. Unifying local and non-local signal processing with graph CNNs. arxiv:1702.07759 P. Garrido, M. Zollhoefer, D. Casas, L. Valgaerts, K. Varanasi, P. Pérez, Ch. Theobalt. Reconstruction of personalized 3D face rigs from monocular video. ACM Trans. on Graghics, 35(3), 2016