CS 395T Numerical Optimization for Graphics and AI (3D Vision) Qixing Huang August 29 th 2018

Similar documents
Signed Distance Function Representation, Tracking, and Mapping. Tanner Schmidt

CS354 Computer Graphics Introduction. Qixing Huang Januray 17 8h 2017

3D Deep Learning

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Two-view geometry Computer Vision Spring 2018, Lecture 10

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Monocular Tracking and Reconstruction in Non-Rigid Environments

Course 23: Multiple-View Geometry For Image-Based Modeling

Mobile Robotics. Mathematics, Models, and Methods. HI Cambridge. Alonzo Kelly. Carnegie Mellon University UNIVERSITY PRESS

Processing 3D Surface Data

Depth Estimation Using Collection of Models

ECS289: Scalable Machine Learning

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

Multiview Stereo COSC450. Lecture 8

Structured light 3D reconstruction

Introduction to Computer Vision

3D Object Representations. COS 526, Fall 2016 Princeton University

3D model classification using convolutional neural network

URBAN STRUCTURE ESTIMATION USING PARALLEL AND ORTHOGONAL LINES

Project Updates Short lecture Volumetric Modeling +2 papers

Stereo Vision. MAN-522 Computer Vision

Multiple View Geometry

CSE 4392/5369. Dr. Gian Luca Mariottini, Ph.D.

3D Object Recognition and Scene Understanding from RGB-D Videos. Yu Xiang Postdoctoral Researcher University of Washington

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

Processing 3D Surface Data

Geometry of Multiple views

ME 555: Distributed Optimization

Making Machines See. Roberto Cipolla Department of Engineering. Research team

Camera Registration in a 3D City Model. Min Ding CS294-6 Final Presentation Dec 13, 2006

Epipolar Geometry Prof. D. Stricker. With slides from A. Zisserman, S. Lazebnik, Seitz

Learning from 3D Data

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

What is computer vision?

CV: 3D to 2D mathematics. Perspective transformation; camera calibration; stereo computation; and more

CS201 Computer Vision Camera Geometry

Step-by-Step Model Buidling

3D Photography: Active Ranging, Structured Light, ICP

VISION FOR AUTOMOTIVE DRIVING

Perceiving the 3D World from Images and Videos. Yu Xiang Postdoctoral Researcher University of Washington

08 An Introduction to Dense Continuous Robotic Mapping

Learning to generate 3D shapes

Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting

Rigid ICP registration with Kinect

Finding Surface Correspondences With Shape Analysis

LASERDATA LIS build your own bundle! LIS Pro 3D LIS 3.0 NEW! BETA AVAILABLE! LIS Road Modeller. LIS Orientation. LIS Geology.

Using Faster-RCNN to Improve Shape Detection in LIDAR

W4. Perception & Situation Awareness & Decision making

Contents. 1 Introduction Background Organization Features... 7

Reconstruction of complete 3D object model from multi-view range images.

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Charles R. Qi* Hao Su* Kaichun Mo Leonidas J. Guibas

Simultaneous Localization and Mapping (SLAM)

Block Diagram. Physical World. Image Acquisition. Enhancement and Restoration. Segmentation. Feature Selection/Extraction.

Department of Computer Science & Engineering University of Kalyani. Syllabus for Ph.D. Coursework

PART IV: RS & the Kinect

Application questions. Theoretical questions

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Grasp Recognition using a 3D Articulated Model and Infrared Images

1 Projective Geometry

Hierarchical Volumetric Fusion of Depth Images

Assignment 2: Stereo and 3D Reconstruction from Disparity

Computer Vision I - Algorithms and Applications: Multi-View 3D reconstruction

3D Reconstruction from Scene Knowledge

COMPUTER AND ROBOT VISION

Multiple View Geometry in Computer Vision

Correspondence. CS 468 Geometry Processing Algorithms. Maks Ovsjanikov

Outline. 1 Why we re interested in Real-Time tracking and mapping. 3 Kinect Fusion System Overview. 4 Real-time Surface Mapping

Prof. Fanny Ficuciello Robotics for Bioengineering Visual Servoing

Miniature faking. In close-up photo, the depth of field is limited.

Polygonal Meshes. Thomas Funkhouser Princeton University COS 526, Fall 2016

3D Computer Vision. Structure from Motion. Prof. Didier Stricker

Neue Verfahren der Bildverarbeitung auch zur Erfassung von Schäden in Abwasserkanälen?

Contents I IMAGE FORMATION 1

A NEW AUTOMATIC SYSTEM CALIBRATION OF MULTI-CAMERAS AND LIDAR SENSORS

The Kinect Sensor. Luís Carriço FCUL 2014/15

3D Photography: Stereo

CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING

Nonlinear State Estimation for Robotics and Computer Vision Applications: An Overview

Announcements. Stereo

Object Reconstruction

Fundamentals of Digital Image Processing

Multiple View Geometry in Computer Vision Second Edition

Multi-view Stereo. Ivo Boyadzhiev CS7670: September 13, 2011

Computer Vision at Cambridge: Reconstruction,Registration and Recognition

Lecture 9: Epipolar Geometry

Reconstructing Reflective and Transparent Surfaces from Epipolar Plane Images

Learning Probabilistic Models from Collections of 3D Meshes

Image Processing, Analysis and Machine Vision

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc.

Incremental Real-time Bundle Adjustment for Multi-camera Systems with Points at Infinity

Welcome to CS 4/57101 Computer Graphics

Registration of Dynamic Range Images

3D Scanning. Lecture courtesy of Szymon Rusinkiewicz Princeton University

CS-235 Computational Geometry

Dense 3D Reconstruction. Christiano Gava

Computer Vision. Introduction

3D Computer Vision. Depth Cameras. Prof. Didier Stricker. Oliver Wasenmüller

CSCD18: Computer Graphics. Instructor: Leonid Sigal

Transcription:

CS 395T Numerical Optimization for Graphics and AI (3D Vision) Qixing Huang August 29 th 2018

3D Vision Understanding geometric relations between images and the 3D world between images Obtaining 3D information describing our 3D world from images From dedicated sensors Geometry understanding (new addition to 3D Vision) Due to the emergence of big 3D data Semantics for example

Virtual tourism

3D Urban Modeling

Kinect Scanner

KinectFusion

DynamicFusion

Reconstructing of the Octagon Monument

Finding matching fragments is a tedious job

3D reconstruction of fragments

3D reconstruction and completion

Visual impact of the 3D restoration

3D ShapeNets [Wu et al. 15]

PointNet [R.Qi et al. 17]

Many other applications Augmented Reality Self-Driving Cars Human-machine interface Autonomous micro-helicopter navigation Performance capture Forensics Interactive 3D modeling

Image-based modeling

Interdisciplinary aspect Vision Biology Geometry Processing 3D Vision Architecture Robotics Medicine

Topics to be Covered

Topic I: 3D Reconstruction Geometry Epipolar geometry Fundamental matrix Extrinsic/Intrinsic camera parameters Camera calibration Vanishing points Homogeneous coordinates

Topic I: 3D Reconstruction Algorithms Feature extraction Feature correspondences Relative camera pose Structure-from-motion Multiview stereo Bundle adjustment ICP

Textbook

Topic II: How to represent 3D Data Triangular mesh Part-based models Implicit surface Light Field Representation Point cloud

Conversion between different representations Implicit -> mesh (Marching Cube)

Conversion between different representations Point cloud -> triangular mesh (Delaunay triangulation)

Conversion between different representations Pointcloud -> Implicit -> Mesh [Kazhdan et al. 06]

Two recommended books

Topic III: How to understand 3D Data Design algorithms to extract semantic information from one or a collection of shapes [van Kaick et al. 11] Matching [Funkhouser et al. 05] Retrieval [Karz and Tal 03] [Mitra et al. 06] Segmentation Classification & Clustering

Big 3D Data

Princeton Shape Benchmark 1800 models in 90 categories Shilane et al. 04

Large-scale online repositories 3D Warehouse Yobi3D 3M models in more than 4K categories

Image-based shape retrieval 20 years ago 10 years ago now

Trends

Trends in Geometry Understanding Single Shape/Scene Analysis Joint Data Analysis (Unsupervised) ML on 3D Data (Supervised)

Numerical Optimization

Optimization problems We will see time and time again: This course: how to formulate P, how to solve P, and what are the guarantees Many similar domains: Graphics/Robotics/Vision

Why bother how to solve P and what are the guarantees There are plenty of optimization softwares Almost all algorithms are data-dependent and can perform better or worse on different problems and data sets In many cases, studying P leads to new algorithms and research papers

Categories of optimization models Linear vs. Nonlinear Convex vs. Nonconvex Continuous vs. Discrete Deterministic vs. Stochastic We see all of them in 3D Vision tasks

Example I: Iterative Closest Point (or ICP) Multi-view Coordinate system alignment Optimization problem Performance Local/global convergence Convergence rate Uncertainty ICP [Besel and Mckay 92]

Example II: MRF Inference

Trends in optimization Non-linear optimization Convex optimization Non-convex optimization

Trends in optimization Second-order methods First-order methods Distributed optimization

Grading

Five assignments (70%) Theory + practice (Choose one of them) Theory 3D geometry (for formulating numerical optimization problems) Proof of local/global convergence Practice Programming assignments of real computer vision tasks

Final Project (30%) Work in groups of 2-4 people Publishable results ICCV/ICML/NIPS/SIGGRAPH/SIGGRAPH ASIA Written report 8 pages Introduction/related works/approach/results/conclusions Final presentation

Potential project topic I Potential topic I: Image-based modeling from internet images A few images Non-identical objects Symmetries Machine learning

Potential project topics Potential topic II: single-view reconstruction from real images 3D Representation? Network architecture Domain adaptation Depth/semantic labels Human Hand [Song et al. 17]

Potential project topics Potential topic III: neural networks for 3D representations Implicit surface Multi-view Point cloud Mesh? Arrangement? Scene graph? Hybrid? [Wu et al. 16]

Potential project topics Potential topic IV: Geometry understanding Task Normal/Curvature Feature extraction Segmentation Part/object decomposition Correspondence Affordance Data Thousands of categories

Potential project topics Potential topic V: Reconstruction from hybrid sensors Low-res depth scan + High-res stereo Depth + shading Web cameras + internet images 360 images + internet images Street views + images from drones

Potential project topics Potential topic VI: Robot 3D Vision Salient views Next-best-view Model-based View planning Active vision Policies for exploration

Potential project topics Potential topic VII: structure recovery via optimization 2D human pose -> 3D human pose Structure from symmetry Structure from template Shape from shading Local/global convergence Exact recovery condition [Zhou et al. 16]

Potential project topics Potential topic VIII: Synchronization Multiview structure from motion Pose Pose + Point cloud Geometric alignment of point clouds Pose Pose + shape Global/Local convergence Exact recovery condition

Potential project topics Potential topic IX: uncertainties Human pose estimation Depth prediction Camera poses in MVSFM Geometry reconstruction Geometry understanding Quantification/Visualization

Logistics Office hour: Wednesdays 5:00 pm 7:00 pm TA: Xiangru Huang Office hour: TBD Homeworks Six late days Final project No late day

Questions?