CAP 5415 Computer Vision. Fall 2011

Similar documents
Computer Vision EE837, CS867, CE803

Global Flow Estimation. Lecture 9

Global Flow Estimation. Lecture 9

Camera Model and Calibration. Lecture-12

Course Name: Computer Vision Course Code: IT444

Computer Vision. Introduction

What is Computer Vision? Introduction. We all make mistakes. Why is this hard? What was happening. What do you see? Intro Computer Vision

Miniature faking. In close-up photo, the depth of field is limited.

Digital Image Processing COSC 6380/4393

Final Exam Study Guide

Computer Vision Lecture 17

Computer Vision Lecture 17

Why study Computer Vision?

CSc I6716 Spring D Computer Vision. Introduction. Instructor: Zhigang Zhu City College of New York

Structure from Motion. Lecture-15

Face Recognition At-a-Distance Based on Sparse-Stereo Reconstruction

Lecture 24: More on Reflectance CAP 5415

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

CS4670: Computer Vision

3D Computer Vision. Introduction. Introduction. CSc I6716 Fall Instructor: Zhigang Zhu City College of New York

Computer and Machine Vision

Image formation - About the course. Grading & Project. Tentative Schedule. Course Content. Students introduction

Large-Scale 3D Point Cloud Processing Tutorial 2013

CS595:Introduction to Computer Vision

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Introduction to Computer Vision

Complex Sensors: Cameras, Visual Sensing. The Robotics Primer (Ch. 9) ECE 497: Introduction to Mobile Robotics -Visual Sensors

Image Based Rendering. D.A. Forsyth, with slides from John Hart

Topics and things to know about them:

Video Mosaics for Virtual Environments, R. Szeliski. Review by: Christopher Rasmussen

COMPUTER VISION. Dr. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai

Introduction to Computer Vision

MERGING POINT CLOUDS FROM MULTIPLE KINECTS. Nishant Rai 13th July, 2016 CARIS Lab University of British Columbia

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Contents I IMAGE FORMATION 1

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Passive 3D Photography

Image Rectification (Stereo) (New book: 7.2.1, old book: 11.1)

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

3D Sensing. 3D Shape from X. Perspective Geometry. Camera Model. Camera Calibration. General Stereo Triangulation.

Why is computer vision difficult?

Thanks to Chris Bregler. COS 429: Computer Vision

3D Fusion of Infrared Images with Dense RGB Reconstruction from Multiple Views - with Application to Fire-fighting Robots

Passive 3D Photography

Vision is inferential. (

Multiple View Geometry

Step-by-Step Model Buidling

Announcements. Introduction. Why is this hard? What is Computer Vision? We all make mistakes. What do you see? Class Web Page is up:

Assignment 2: Stereo and 3D Reconstruction from Disparity

Vision Review: Image Formation. Course web page:

Announcements. Stereo Vision Wrapup & Intro Recognition

Perception, Part 2 Gleitman et al. (2011), Chapter 5

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

Making Machines See. Roberto Cipolla Department of Engineering. Research team

Topics to be Covered in the Rest of the Semester. CSci 4968 and 6270 Computational Vision Lecture 15 Overview of Remainder of the Semester

Epipolar Geometry and Stereo Vision

(Sample) Final Exam with brief answers

Light source estimation using feature points from specular highlights and cast shadows

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Computer Vision. Alexandra Branzan Albu Spring 2009

DEPTH AND GEOMETRY FROM A SINGLE 2D IMAGE USING TRIANGULATION

EECS 442 Computer Vision fall 2011

Prof. Fanny Ficuciello Robotics for Bioengineering Visual Servoing

Other approaches to obtaining 3D structure

Prof. Feng Liu. Spring /27/2014

An Introduc+on to Mathema+cal Image Processing IAS, Park City Mathema2cs Ins2tute, Utah Undergraduate Summer School 2010

Projector Calibration for Pattern Projection Systems

Announcements. Hough Transform [ Patented 1962 ] Generalized Hough Transform, line fitting. Assignment 2: Due today Midterm: Thursday, May 5 in class

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

All good things must...

CV: 3D to 2D mathematics. Perspective transformation; camera calibration; stereo computation; and more

Computer Vision. Geometric Camera Calibration. Samer M Abdallah, PhD

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

Epipolar Geometry and Stereo Vision

Announcements. Motion. Structure-from-Motion (SFM) Motion. Discrete Motion: Some Counting

CSE 527: Intro. to Computer

A New Representation for Video Inspection. Fabio Viola

AUTOMATED UAV-BASED VIDEO EXPLOITATION FOR MAPPING AND SURVEILLANCE

Computer and Machine Vision

Introductory Techniques For 3-D Computer Vision Unknown Edition By Emanuele Trucco, Alessandro Verri (1998) READ ONLINE

Web site. Introduction to Computer Vision. Computer Vision. Text Book. Computer Vision. Relation to other fields

Computer Vision. I-Chen Lin, Assistant Professor Dept. of CS, National Chiao Tung University

Lecture 14: Computer Vision

Computer Vision. Alexandra Branzan Albu Spring 2010

Homework 4 Computer Vision CS 4731, Fall 2011 Due Date: Nov. 15, 2011 Total Points: 40

Introduction. Prof. Kyoung Mu Lee SoEECS, Seoul National University

Motion Tracking and Event Understanding in Video Sequences

Course 23: Multiple-View Geometry For Image-Based Modeling

3D Computer Vision 1

Announcements. Motion. Structure-from-Motion (SFM) Motion. Discrete Motion: Some Counting

Introductory Techniques For 3-D Computer Vision By Emanuele Trucco, Alessandro Verri READ ONLINE

Expectations. Computer Vision. Grading. Grading. Our Goal. Our Goal

CS4733 Class Notes, Computer Vision

COSC579: Scene Geometry. Jeremy Bolton, PhD Assistant Teaching Professor

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

EECS 442 Computer vision. Announcements

Introducing Robotics Vision System to a Manufacturing Robotics Course

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

Stereo CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Computer Vision: Making machines see

Transcription:

CAP 5415 Computer Vision Fall 2011

General Instructor: Dr. Mubarak Shah Email: shah@eecs.ucf.edu Office: 247-F HEC

Course Class Time Tuesdays, Thursdays 12 Noon to 1:15PM 383 ENGR Office hours Tuesdays 1:15 PM to 2:00 PM Thursdays 11 AM to 12 Noon And by appointment Grading Midterm 20% Final 30% Assignments 10% Programs 40% Grading Policy: 90-100 = A; 80-89 = B; 70-79 = C;

Course Reference Texts: Mubarak Shah, "Fundamentals of Computer Vision". Richard Szeliski, "Computer Vision: Algorithms and Application, Springer. Emanuele Trucco, Alessandro Verri, "Introductory Techniques for 3-D Computer Vision", Prentice Hall, 1998. Course Slides from Previous Years http://www.cs.ucf.edu/courses/cap5415/fall2004/ http://www.cs.ucf.edu/courses/cap6411/cap5415/

Topics We ll Cover Image Filtering, Edge Detection, Interest Point Detectors Motion and Optical Flow Region Segmentation Object Detection and tracking Line and Curve Detection Shape Analysis Stereopsis Imaging Geometry, Camera Modeling and Calibration We may change order

Computer Vision The ability of computers to see. Image Understanding Machine Vision Robot Vision Image Analysis Video Understanding

A picture is worth a thousand words.

A word is worth a thousand pictures. A HUNT

Image 2-D array of numbers (intensity values, gray levels) Gray levels 0 (black) to 255 (white) Color image is 3 2-D arrays of numbers Red Green Blue Resolution (number of rows and columns) 128X128 256X256 512X512 640X480

Image Formats TIF PGM PBM GIF JPEG

Video Sequence of frames 30 frames per second Formats AVI MPEG Quick Time

Video Clip

Sequence of Images

Image Formation Light Source Camera (extrinsic and intrinsic parameters) Scene (Surface reflectance, Surface shape )

Perspective Projection (Pin Hole) Image Plane f Lens (X,Y,Z) World point image y Z

Orthographic Projection Image Plane (X,Y,Z) World point image y

Shape from X Recover 3-D shape from 2-D image(s) Stereo Motion Shading Texture Contours

Stereo

http://www.vision3d.com/stereo.html

Renault Stereo Pair

Depth Map

Stereo Pair

Shape from Shading

Lambertian Model S=L, light source I=S.N

Vase (1, 0, 1) (-1, 1, 1) (-1,-1, 1)

Shape from Texture

Visual Motion

Hamburg Taxi seq (Optical Flow)

Optical Flow Field Examples

Sequence Raw Optical flow 31

Video Clip & Mosaic

Structure From Motion Reconstructed Shape

Applications of Computer Vision Face Recognition Object Recognition Video Surveillance and Monitoring Object detection, tracking and behavior analysis Remote Sensing: UAVs Robotics Computer Graphics

Face Recognition

Object Recognition Finding People in images Problem 1: Given an image I Question: Does I contain an image of a person?

Yes Instances

No Instances

Localize People (Human Detection)

Human Detection

Airplanes

Motor Cycles

FACIAL EXPRESSIONS RAISE EYE BROWS SMILE

Detecting Driver Alertness

Lipreading

Video Surveillance and Monitoring Object detection Object tracking Object categorization and classification Event or Activities Recognition Automated Surveillance System (Detection & Tracking)

COCOA COCOA System Flow Aerial Video Telemetry* Ego Motion Compensation Feature based + Gradient Based Motion Detection Accumulative Frame Differencing + Background Modeling + Object Segmentation Object Tracking Kernel Tracking + Blob Tracking + Occlusion Handling Registered Images Motion Detection Tracks Event Detection & Indexing

Ego Motion Compensation Results - I Aerial Video - EO Mosaic Alignment Mask

Ego Motion Compensation Results - II Aerial Video - IR Mosaic Alignment Mask

Detection Result

Tracking Results

Tracking Results

UCF YouTube Action Dataset Cycling Diving Golf Swinging Riding Juggling Basketball Shooting Swinging Tennis Swinging Volleyball Spiking Trampoline Jumping Walking Dog

Making A Sandwich

Human Behavior Recognition

Key Frames Sequence 1 (350 frames), Part 1

Robot Vision (Unmanned Ground Vehicle) UGV

Geo-registration

Geo-registration

Layer Based Video Composition

Results of Doll

Results of Mom-Daughter