Photo-realistic Renderings for Machines Seong-heum Kim

Similar documents
When Big Datasets are Not Enough: The need for visual virtual worlds.

Final Project: Real-Time Global Illumination with Radiance Regression Functions

Physically Realistic Ray Tracing

Seeing the unseen. Data-driven 3D Understanding from Single Images. Hao Su

Distribution Ray Tracing. University of Texas at Austin CS384G - Computer Graphics Fall 2010 Don Fussell

Computer Graphics. Sampling Theory & Anti-Aliasing. Philipp Slusallek

Volumetric and Multi-View CNNs for Object Classification on 3D Data Supplementary Material

Reading. 8. Distribution Ray Tracing. Required: Watt, sections 10.6,14.8. Further reading:

Today. Anti-aliasing Surface Parametrization Soft Shadows Global Illumination. Exercise 2. Path Tracing Radiosity

CS468: 3D Deep Learning on Point Cloud Data. class label part label. Hao Su. image. May 10, 2017

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Deep Learning for Computer Vision II

Synscapes A photorealistic syntehtic dataset for street scene parsing Jonas Unger Department of Science and Technology Linköpings Universitet.

Deep Learning with Tensorflow AlexNet

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Global Illumination The Game of Light Transport. Jian Huang

Topic 12: Texture Mapping. Motivation Sources of texture Texture coordinates Bump mapping, mip-mapping & env mapping

Monte Carlo Integration

3D Deep Learning on Geometric Forms. Hao Su

Topic 11: Texture Mapping 11/13/2017. Texture sources: Solid textures. Texture sources: Synthesized

High Performance GPU-Based Preprocessing for Time-of-Flight Imaging in Medical Applications

Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking

PROBLEM FORMULATION AND RESEARCH METHODOLOGY

DeepIM: Deep Iterative Matching for 6D Pose Estimation - Supplementary Material

Topic 11: Texture Mapping 10/21/2015. Photographs. Solid textures. Procedural

Deep Learning-driven Depth from Defocus via Active Multispectral Quasi-random Projections with Complex Subpatterns

Halftoning and quasi-monte Carlo

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Charles R. Qi* Hao Su* Kaichun Mo Leonidas J. Guibas

Monte Carlo Integration

Probabilistic PPM. Claude Knaus Matthias Zwicker University of Bern. State of the Art in Photon Density Estimation

MULTI-DIMENSIONAL MONTE CARLO INTEGRATION

Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey

Transfer Learning. Style Transfer in Deep Learning

Advanced Computer Graphics CS 563: Screen Space GI Techniques: Real Time

782 Schedule & Notes

Structured Prediction using Convolutional Neural Networks

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

GAMES Webinar: Rendering Tutorial 2. Monte Carlo Methods. Shuang Zhao

ELEC Dr Reji Mathew Electrical Engineering UNSW

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017

Ray tracing. EECS 487 March 19,

Photorealism vs. Non-Photorealism in Computer Graphics

Numerical Integration

ECCV Presented by: Boris Ivanovic and Yolanda Wang CS 331B - November 16, 2016

Sung-Eui Yoon ( 윤성의 )

Hide-and-Seek: Forcing a network to be Meticulous for Weakly-supervised Object and Action Localization

Monte Carlo Ray Tracing. Computer Graphics CMU /15-662

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

AUTOMATIC 3D HUMAN ACTION RECOGNITION Ajmal Mian Associate Professor Computer Science & Software Engineering

3D Deep Learning

Sampling, Aliasing, & Mipmaps

Accelerating Ray-Tracing

3/1/2010. Acceleration Techniques V1.2. Goals. Overview. Based on slides from Celine Loscos (v1.0)

Learning Photographic Image Synthesis With Cascaded Refinement Networks. Jonathan Louie Huy Doan Siavash Motalebi

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

A Frequency Analysis of Light Transport

Supplementary A. Overview. C. Time and Space Complexity. B. Shape Retrieval. D. Permutation Invariant SOM. B.1. Dataset

6.837 Introduction to Computer Graphics Quiz 2 Thursday November 20, :40-4pm One hand-written sheet of notes allowed

Distributed Ray Tracing

Learning 6D Object Pose Estimation and Tracking

Articulated Pose Estimation with Flexible Mixtures-of-Parts

Georgios Tziritas Computer Science Department

Structured light 3D reconstruction

Deep Learning for Visual Manipulation and Synthesis

Rendering Algorithms: Real-time indirect illumination. Spring 2010 Matthias Zwicker

Anti-aliasing. Images and Aliasing

INFOGR Computer Graphics. J. Bikker - April-July Lecture 10: Ground Truth. Welcome!

Scientific Visualization Example exam questions with commented answers

ImageNet Classification with Deep Convolutional Neural Networks

Constrained Convolutional Neural Networks for Weakly Supervised Segmentation. Deepak Pathak, Philipp Krähenbühl and Trevor Darrell

From processing to learning on graphs

POINT CLOUD DEEP LEARNING

Applications: Sampling/Reconstruction. Sampling and Reconstruction of Visual Appearance. Motivation. Monte Carlo Path Tracing.

Object Detection on Self-Driving Cars in China. Lingyun Li

Interactive Methods in Scientific Visualization

EE640 FINAL PROJECT HEADS OR TAILS

Multiple target detection in video using quadratic multi-frame correlation filtering

CS 354R: Computer Game Technology

An Effective Stratified Sampling Scheme for Environment Maps with Median Cut Method

TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK

Computer Vision. Fourier Transform. 20 January Copyright by NHL Hogeschool and Van de Loosdrecht Machine Vision BV All rights reserved

A Survey of Adaptive Sampling in Realistic Image Synthesis

Sampling and Monte-Carlo Integration

Alternatives to Direct Supervision

Visual cues to 3D geometry. Light Reflection and Advanced Shading. Shading. Recognizing materials. size (perspective) occlusion shading

ECG782: Multidimensional Digital Signal Processing

What was Monet seeing while painting? Translating artworks to photo-realistic images M. Tomei, L. Baraldi, M. Cornia, R. Cucchiara

CS770/870 Spring 2017 Radiosity

Conemarching in VR. Johannes Saam Mariano Merchante FRAMESTORE. Developing a Fractal experience at 90 FPS. / Framestore

Towards Interactive Global Illumination Effects via Sequential Monte Carlo Adaptation. Carson Brownlee Peter S. Shirley Steven G.

Inception Network Overview. David White CS793

3D Pose Estimation using Synthetic Data over Monocular Depth Images

3D Shape Segmentation with Projective Convolutional Networks

Anti-aliasing and sampling

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

Scalable many-light methods

Research on the New Image De-Noising Methodology Based on Neural Network and HMM-Hidden Markov Models

Multi-view stereo. Many slides adapted from S. Seitz

CAP 6412 Advanced Computer Vision

Image Base Rendering: An Introduction

Transcription:

Photo-realistic Renderings for Machines 20105034 Seong-heum Kim CS580 Student Presentations 2016.04.28

Photo-realistic Renderings for Machines Scene radiances Model descriptions (Light, Shape, Material, Camera) Light Transport Physically-based rendering (Rendering equation with Monte Carlo Integration) Part 1. Analysis on stochastic sampling errors Pixel values + Simulated annotation database. Machine Learning Artificial Intelligence (Convolutional Neural Network) Part 2. One application of this approach Self-training? AlphaGo vs AlphaGo 2

Presentation Papers Part 1. Analysis on stochastic sampling errors Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration (SIGGRAPH 2013) Kartic Subr, Jan Kautz To study useful indicators for evaluating sampling patterns To analyze Gaussian jittered sampling Part 2. One application of this approach 3 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views (ICCV 2015) Hao Su*, Charles R. Qi*, Yangyan Li, Leonidas J. Guibas To find a good application of utilizing CG renderings To use PBRT results for learning camera viewpoints

Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Shiny ball in motion Pixel value = multi-dim. integral sampled integrand. But, High variance High bias Analysis is non-trivial! 4 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Overview of this Paper Stochastic sampling strategies involves random variables. Accuracy and precision of my estimations? Spectral analysis reduces aliasing effects in estimators Direct insight for predicting into the first and second order statistics of the integrators. (theoretical contribution) Let s apply it to analyse simple variants of jittered sampling. Experimental results to show trade-off relationships between random and regular sampling patterns. Quantitative, qualitative comparisons with other strategies (PBRT). 5 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Monte Carlo Estimator sampled integrand 6 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Strategies to Improve Estimators 1. modify weights 2. modify locations eg. quadrature rules eg. importance sampling * Key idea PDF!! Amplitude (sampling spectrum) DC Phase (sampling spectrum) Frequency 7 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Assessing Estimators using Sampling Spectrum High-level Messages from Fourier domain 8 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Bias and variance of secondary MC estimators Error definition: Bias: (Expected error) Variance: Band-limited *Point: Ideal sampling spectrum has no energy in sampling spectrum at frequencies where integrand has high energy 9 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Bias and variance of secondary MC estimators Unbiased estimator: General sampling function: ( PDF ) Expected Fourier spectrum Weighting scheme (unbiased): PDF Random sampling: g(x) constant PDF 10 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Bias and variance of secondary MC estimators Unbiased sampling, Imaginary centroid Key idea: Jitter in position manifests as phase jitter Real Complex plane 11 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Case study: Gaussian jittered sampling Overall, jitters affect expected centroid. (More derivations in Appendix.) 12 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Case study: Gaussian jittered sampling Frequency expected centroid (relative phase is key!) Imaginary Imaginary Amplitude Phase Real 4 1 2 3 centroid variance 13 Time Fourier Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Quantitative tests Three types of synthetic datasets 1) Four random samples to form a white quad (binary planes) 2) Delaunay triangulation of random samples (mesh) with random weights 3) Similar to Case 2. with linearly interpolated weights 14 Relative errors of mean, variance for methods in PBRT-v2 and other implem. (50 iterations of the secondary estimator with up to 1024 primary samples) Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Quantitative tests: Bias-variance trade-off using Gaussian jitter Random Gaussianjittered No jitter at all Fast convergence 15 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Qualitative tests Gaussian jitter allows trade-offs between random and grid. - High variation Trade-offs High bias 16 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Conclusion A study of the spectral characteristics of stochastic sampling patterns Two measures for the quality of sampling strategies in terms of their accuracy and precision in integration The amplitude of the expected sampling spectrum The variance of the sampling spectrum Applied these measures to assess Gaussian jittered sampling and compared it with the box-jittered case. Performed quantitative and qualitative evaluations of various sampling methods in this new framework. 17 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Appendix 1. Additional derivations Gaussian jitter of stochastic samples: Expected spectrum Gaussian jitter of fixed-location samples: Variance of spectrum Spectrum for uniform jitter (1D): 18 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Bias and variance of secondary MC estimators Statistics of Fourier spectrum over several sampling functions 19-512x512 grid, 256 2D samples, 20 iterations - 5 sampling strategies (Gaussian jittered sampling, Halton sequences, Poisson-disk sampling, Random sampling) - Amplitude of expected sampling spectrum, variance of sampling spectrum are more informative indicators for predicting stochastic sampling errors. Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Bias and variance of secondary MC estimators Statistics of Fourier spectrum over several sampling functions - Simple representations in each instance of the sampling patterns (more expressive & informative than the conventional Periodogram) * Shifted Dirac deltas are expressed as Euler formulas. 20 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Case study: Gaussian jittered sampling 21 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Case study: Gaussian jittered sampling Expectation and variance of FT for Box-jittered samples Comparison of box-jitter and Gaussian jitter Simple alternative to Gaussian Box jitter is more biased if the integrand contains energy at frequencies where the ratio is greater than 1. Variance of box filter is lower. 22 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Quantitative tests: Gaussian jitter converges rapidly - - 23 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Qualitative tests: Blur, Soft shadow - 24 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration

Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Wants to know this viewpoint from a photo (One of classical CV problems) If 2D renders = 2D photos, our high-level descriptions naturally leads to high-level understandings for machines. For example, let s take a look at estimating camera viewpoints. 25 Fourier Analysis of Stochastic Sampling Strategies for Assessing Bias and Variance in Integration Author s Slides from http://home.eps.hw.ac.uk/~ks400/research.html

Convolutional Neural Network (CNN) ImageNet: Millions of images + Human annotations (2009) ILSVRC Image Classification Top-5 Error (%) Deep Learning! CPU GPU 26 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Introduction Go beyond 2D image classification Cropped, Background clutters Occlusion Natural illumination 27 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Introduction Our challenge here is that human annotation is expensive. 28 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Introduction 29 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Key idea: Render for CNN CG renderings are generated by known model descriptions *All data is already annotated when created. Renderings from 3D models 30 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Rendering pipeline Rendering pipeline for the training stage 31 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Rendering pipeline Self-generated data collections for machine learning 32 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Rendering pipeline Exp. 1. 80K rendered chair images with fixed lighting sources 33 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Rendering pipeline Exp. 2. Randomize lighting 34 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Rendering pipeline Exp. 3. Composite them with random backgrounds 35 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Rendering pipeline Exp. 4. Apply bounding boxes with proper texture 36 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Rendering pipeline 4M synthesized images for 12 categories 37 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Experimental results Real test images from PASCAL3D+ dataset Metric: median angle error (lower the better) 38

Experimental results Azimuth Viewpoint Estimation 39 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Experimental results Failure cases 40 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Conclusion Images rendered from 3D models can be effectively used to train CNNs, especially for 3D tasks. State-of-the-art results has been achieved. Key to success * Self-imaging from descriptions Quantity: Large scale 3D model collection (ShapeNet) + Augmentation Quality: Overfit-resistant, scalable image synthesis pipeline 41 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

Appendix 2. Convolutional Neural Network (CNN) Multiple layers of small neuron collections * Parametric model for simulating a neural network: Weighted inputs (Parameters) Limiter (non-linear step-function) Output to Other neurons * Application: Image classification any semantic inferences 42 * Observations: 1. Activation energy with for a threshold barrier 2. Sensitive certain electric stimulus 3. Activation pattern changes from repeated inputs True? False? Require a huge number of labelled data for training! Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

Network details Loss function: Network structure (Based on AlexNet): 43 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

More experimental results (Supp.) Azimuth Viewpoint Estimation 44 Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views Author s Slides from http://ai.stanford.edu/~haosu/slides/iccv_renderforcnn_website.pdf (ICCV15 Oral)

45 Quiz