Video Texture. A.A. Efros

Similar documents
Data-driven methods: Video & Texture. A.A. Efros

Data-driven methods: Video & Texture. A.A. Efros

Topics. Image Processing Techniques and Smart Image Manipulation. Texture Synthesis. Topics. Markov Chain. Weather Forecasting for Dummies

Texture Synthesis. Fourier Transform. F(ω) f(x) To understand frequency ω let s reparametrize the signal by ω: Fourier Transform

Image Processing Techniques and Smart Image Manipulation : Texture Synthesis

Video Textures. Arno Schödl Richard Szeliski David H. Salesin Irfan Essa. presented by Marco Meyer. Video Textures

+ = The Goal of Texture Synthesis. Image Quilting for Texture Synthesis & Transfer. The Challenge. Texture Synthesis for Graphics

Image Composition. COS 526 Princeton University

Texture. The Challenge. Texture Synthesis. Statistical modeling of texture. Some History. COS526: Advanced Computer Graphics

Texture. COS 429 Princeton University

More details on presentations

Machine Learning for Video-Based Rendering

Lecture 6: Texture. Tuesday, Sept 18

Machine Learning for Video-Based Rendering

Texture Synthesis and Manipulation Project Proposal. Douglas Lanman EN 256: Computer Vision 19 October 2006

Texture and Other Uses of Filters

CSCI 1290: Comp Photo

Motion Texture. Harriet Pashley Advisor: Yanxi Liu Ph.D. Student: James Hays. 1. Introduction

Motion Capture. Motion Capture in Movies. Motion Capture in Games

Topics to be Covered in the Rest of the Semester. CSci 4968 and 6270 Computational Vision Lecture 15 Overview of Remainder of the Semester

Texture April 17 th, 2018

Automatic Image Alignment (feature-based)

To Do. Advanced Computer Graphics. The Story So Far. Course Outline. Rendering (Creating, shading images from geometry, lighting, materials)

Consider a partially transparent object that is illuminated with two lights, one visible from each side of the object. Start with a ray from the eye

Texture Synthesis. Darren Green (

Modeling Light. Michal Havlik : Computational Photography Alexei Efros, CMU, Fall 2007

BCC Optical Stabilizer Filter

Synthesizing Realistic Facial Expressions from Photographs

Texture. Announcements. Markov Chains. Modeling Texture. Guest lecture next Tuesday. Evals at the end of class today

Texture Synthesis. Darren Green (

Hidden Markov Model for Sequential Data

Modeling Light. Michal Havlik

Image Warping and Morphing. Alexey Tikhonov : Computational Photography Alexei Efros, CMU, Fall 2007

Texture April 14 th, 2015

Inferring 3D People from 2D Images

Epitomic Analysis of Human Motion

Human body animation. Computer Animation. Human Body Animation. Skeletal Animation

Motion and Tracking. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Hidden Markov Models. Slides adapted from Joyce Ho, David Sontag, Geoffrey Hinton, Eric Xing, and Nicholas Ruozzi

Synthesis by Example. Connecting Motion Planning and Example based Movement. Michael Gleicher

Regularization and Markov Random Fields (MRF) CS 664 Spring 2008

Modeling Light. Michal Havlik

Style-based Inverse Kinematics

Tracking Pedestrians using Local Spatio-temporal Motion Patterns in Extremely Crowded Scenes

The SIFT (Scale Invariant Feature

Modeling Light. Michal Havlik : Computational Photography Alexei Efros, CMU, Fall 2011

Lecture 18: Human Motion Recognition

Announcements. Texture. Review: last time. Texture 9/15/2009. Write your CS login ID on the pset hardcopy. Tuesday, Sept 15 Kristen Grauman UT-Austin

Light Field Occlusion Removal

New Features in BCC AVX 4.0

Problem Set 4. Assigned: March 23, 2006 Due: April 17, (6.882) Belief Propagation for Segmentation

IMA Preprint Series # 2016

Ping Tan. Simon Fraser University

COMPUTER VISION FOR VISUAL EFFECTS

Course Outline. Advanced Computer Graphics. Animation. The Story So Far. Animation. To Do

Beyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University

Interactive Video Object Extraction & Inpainting 清華大學電機系. Networked Video Lab, Department of Electrical Engineering, National Tsing Hua University

Modeling Light. Slides from Alexei A. Efros and others

What have we leaned so far?

3D Human Motion Analysis and Manifolds

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

Leveraging Textural Features for Recognizing Actions in Low Quality Videos

Channels & Keyframes. CSE169: Computer Animation Instructor: Steve Rotenberg UCSD, Winter 2017

Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions

Animation. Itinerary. What is Animation? What is Animation? Animation Methods. Modeling vs. Animation Computer Graphics Lecture 22

Motion Estimation and Optical Flow Tracking

Image Compositing and Blending

Bayesian Reconstruction of 3D Human Motion from Single-Camera Video

Animation. Itinerary Computer Graphics Lecture 22

Computer Vision 2. SS 18 Dr. Benjamin Guthier Professur für Bildverarbeitung. Computer Vision 2 Dr. Benjamin Guthier

Motion Synthesis and Editing. Yisheng Chen

Texture Sensitive Image Inpainting after Object Morphing

The Light Field and Image-Based Rendering

Lecture 15: Image-Based Rendering and the Light Field. Kayvon Fatahalian CMU : Graphics and Imaging Architectures (Fall 2011)

Today s lecture. Image Alignment and Stitching. Readings. Motion models

Recap. DoF Constraint Solver. translation. affine. homography. 3D rotation

Image-Based Modeling and Rendering

Announcements. Texture. Review. Today: Texture 9/14/2015. Reminder: A1 due this Friday. Tues, Sept 15. Kristen Grauman UT Austin

Motion Interpretation and Synthesis by ICA

Panoramic Video Texture

Photo Tourism: Exploring Photo Collections in 3D

CSCI 1290: Comp Photo

Drawing Fast The Graphics Pipeline

Multiple View Geometry

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Global Flow Estimation. Lecture 9

arxiv: v1 [cs.cv] 22 Feb 2017

3D Editing System for Captured Real Scenes

Local Feature Detectors

Quality Guided Image Denoising for Low-Cost Fundus Imaging

Representing Moving Images with Layers. J. Y. Wang and E. H. Adelson MIT Media Lab

Image Processing. Overview. Trade spatial resolution for intensity resolution Reduce visual artifacts due to quantization. Sampling and Reconstruction

Velocity adaptation of space-time interest points

LAB 1: INTRODUCTION TO DATA STUDIO AND ONE-DIMENSIONAL MOTION

Spatio-Temporal Stereo Disparity Integration

Markov Random Fields and Segmentation with Graph Cuts

Normalized cuts and image segmentation

Lecture 20: Tracking. Tuesday, Nov 27

Interpolation, extrapolation, etc.

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

Transcription:

Video Texture A.A. Efros 15-463: Computational Photography Alexei Efros, CMU, Fall 2005

Weather Forecasting for Dummies Let s predict weather: Given today s weather only, we want to know tomorrow s Suppose weather can only be {Sunny, Cloudy, Raining} The Weather Channel algorithm: Over a long period of time, record: How often S followed by R How often S followed by S Etc. Compute percentages for each state: P(R S), P(S S), etc. Predict the state with highest probability! It s a Markov Chain

Markov Chain 0.3 0.4 0.2 0.6 0.3 0.4 0.1 0.3 0.4 What if we know today and yestarday s weather?

Text Synthesis [Shannon, 48] proposed a way to generate English-looking text using N-grams: Assume a generalized Markov model Use a large text to compute prob. distributions of each letter given N-1 previous letters Starting from a seed repeatedly sample this Markov chain to generate new letters Also works for whole words WE NEED TO EAT CAKE

Mark V. Shaney (Bell Labs) Results (using alt.singles corpus): As I've commented before, really relating to someone involves standing next to impossible. One morning I shot an elephant in my arms and kissed him. I spent an interesting evening recently with a grain of salt

Video Textures Arno Schödl Richard Szeliski David Salesin Irfan Essa Microsoft Research, Georgia Tech

Still photos

Video clips

Video textures

Problem statement video clip video texture

Our approach How do we find good transitions?

Finding good transitions Compute L 2 distance D i, j between all frames vs. frame i frame j Similar frames make good transitions

Markov chain representation 1 2 3 4 Similar frames make good transitions

Transition costs Transition from i to j if successor of i is similar to j Cost function: C i j = D i+1, j i i+1 i j D i+1, j j-1 j

Transition probabilities Probability for transition P i j inversely related to cost: P i j ~exp ( C i j / σ 2 ) high σ low σ

Preserving dynamics

Preserving dynamics

Preserving dynamics Cost for transition i j N-1 C i j = Σ w k D i+k+1, j+k k = -N i-1 i i+1 i+2 D i-1, j-2 D i, j-1 i j D D i+1, j i+2, j+1 j-2 j-1 j j+1

Preserving dynamics effect Cost for transition i j N-1 C i j = Σ w k D i+k+1, j+k k = -N

Dead ends No good transition at the end of sequence 1 2 3 4

Future cost Propagate future transition costs backward Iteratively compute new cost F i j = C i j + α min k F j k 1 2 3 4

Future cost Propagate future transition costs backward Iteratively compute new cost F i j = C i j + α min k F j k 1 2 3 4

Future cost Propagate future transition costs backward Iteratively compute new cost F i j = C i j + α min k F j k 1 2 3 4

Future cost Propagate future transition costs backward Iteratively compute new cost F i j = C i j + α min k F j k 1 2 3 4

Future cost Propagate future transition costs backward Iteratively compute new cost F i j = C i j + α min k F j k Q-learning 1 2 3 4

Future cost effect

Finding good loops Alternative to random transitions Precompute set of loops up front

Visual discontinuities Problem: Visible Jumps

Crossfading Solution: Crossfade from one sequence to the other. A i-2 3 4 A i-1 2 1 A 4 i 4 A i+1 1 2 3 + B 4 j-2 + B + 4 j-1 4 B j B j+1 A i-2 15 A i-1 /B j-2 A i-1 /B j-2 A i-1 /B j-2 B j+1

Morphing Interpolation task: 2 A B 5 2 1 C 5 + + 5

Morphing Interpolation task: 2 A B 5 2 1 C 5 + + 5 Compute correspondence between pixels of all frames

Morphing Interpolation task: 2 A B 5 2 1 C 5 + + 5 Compute correspondence between pixels of all frames Interpolate pixel position and color in morphed frame based on [Shum 2000]

Results crossfading/morphing

Results crossfading/morphing Jump Cut Crossfade Morph

Crossfading

Frequent jump & crossfading

Video portrait Useful for web pages

Region-based analysis Divide video up into regions Generate a video texture for each region

Automatic region analysis

Video-based animation Like sprites computer games Extract sprites from real video Interactively control desired motion 1985 Nintendo of America Inc.

Video sprite extraction blue screen matting and velocity estimation

Video sprite control Augmented transition cost: Animation C i j = α + β angle C i j Similarity term Control term vector to mouse pointer velocity vector

Interactive fish

Lord of the Flies

Summary Video clips video textures define Markov process preserve dynamics avoid dead-ends disguise visual discontinuities

Motion Analysis & Synthesis [Efros 03] What are they doing? Activity recognition, surveillance, anti-terrorism Can we do the same? Motion retargeting, movies, video games, etc.

Gathering action data Low resolution, noisy data Moving camera Occlusions

Figure-centric Representation Stabilized spatio-temporal volume No translation information All motion caused by person s limbs Good news: indifferent to camera motion Bad news: hard! Good test to see if actions, not just translation, are being captured

Remembrance of Things Past Explain novel motion sequence with bits and pieces of previously seen video clips input sequence run walk left motion analysis swing walk right motion synthesis jog synthetic sequence Challenge: how to compare motions?

How to describe motion? Appearance Not preserved across different clothing Gradients (spatial, temporal) same (e.g. contrast reversal) Edges Too unreliable Optical flow Explicitly encodes motion Least affected by appearance but too noisy

Motion Descriptor Image frame Optical flow F x, y F x, F y F, F, F, F blurred x + x y + y F, F, F, F x + x y + y

Comparing motion descriptors Σ t I matrix frame-to-frame similarity matrix blurry I motion-to-motion similarity matrix

Recognizing Tennis Red bars show classification results

Do as I Do Motion Synthesis input sequence synthetic sequence Matching two things: Motion similarity across sequences Appearance similarity within sequence Dynamic Programming

Smoothness for Synthesis is similarity between source and target frames W act is appearance similarity within target frames W app For every source frame i, find best target frame by maximizing following cost function: π i n i= 1 α act W act ( i, π ) i + n i= 2 α app W app ( π, π i i 1 + 1) Optimize using dynamic programming

Do as I Do Source Motion Source Appearance 3400 Frames Result

Do as I Say Synthesis run walk left swing walk right jog run walk left swing walk right jog synthetic sequence Synthesize given action labels e.g. video game control

Do as I Say Red box shows when constraint is applied

Application: Motion Retargeting Rendering new character into existing footage Algorithm Track original character Find matches from new character Erase original character Render in new character Need to worry about occlusions

Context-based Image Correction Input sequence 3 closest frames median images

Actor Replacement SHOW VIDEO