CS595:Introduction to Computer Vision

Similar documents
What is Computer Vision? Introduction. We all make mistakes. Why is this hard? What was happening. What do you see? Intro Computer Vision

EECS 442 Computer Vision fall 2011

Computer Vision EE837, CS867, CE803

EECS 442 Computer Vision fall 2012

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Introduction to Computer Vision

Introduction to Computer Vision. CSE 152, Spring Introduction to Computer Vision. CSE 152, Spring Introduction to Computer Vision

CSc I6716 Spring D Computer Vision. Introduction. Instructor: Zhigang Zhu City College of New York

CSCD18: Computer Graphics. Instructor: Leonid Sigal

Introduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah

3D Computer Vision. Introduction. Introduction. CSc I6716 Fall Instructor: Zhigang Zhu City College of New York

Thanks to Chris Bregler. COS 429: Computer Vision

3D Computer Vision Introduction. Guido Gerig CS 6320, Spring 2012

ECG782: Multidimensional Digital Signal Processing

CAP 5415 Computer Vision. Fall 2011

Computer Vision, CS766. Staff. Instructor: Li Zhang TA: Jake Rosin

CSE 455: Computer Vision Winter 2007

Tecnologie per la ricostruzione di modelli 3D da immagini. Marco Callieri ISTI-CNR, Pisa, Italy

Digital Images. Kyungim Baek. Department of Information and Computer Sciences. ICS 101 (November 1, 2016) Digital Images 1

An Automatic Method for Adjustment of a Camera Calibration Room

Overview of Computer Vision. CS308 Data Structures

Computer Vision. CS664 Computer Vision. 1. Introduction. Course Requirements. Preparation. Applications. Applications of Computer Vision

Announcements. Introduction. Why is this hard? What is Computer Vision? We all make mistakes. What do you see? Class Web Page is up:

Multiple View Geometry

Using infrared proximity sensors for close 2D localization and object size recognition. Richard Berglind Neonode

3D from Images - Assisted Modeling, Photogrammetry. Marco Callieri ISTI-CNR, Pisa, Italy

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Introduction. What is Computer Vision? Why is this hard? Underestimates. What is computer vision? We ll begin with some introductory material

Tecnologie per la ricostruzione di modelli 3D da immagini. Marco Callieri ISTI-CNR, Pisa, Italy

Course 23: Multiple-View Geometry For Image-Based Modeling

Why study Computer Vision?

Topics to be Covered in the Rest of the Semester. CSci 4968 and 6270 Computational Vision Lecture 15 Overview of Remainder of the Semester

Overview. Introduction to Computer Vision. CSE 152 Lecture 1. Introduction to Computer Vision. CSE 152, Spring 2018

CS4495/6495 Introduction to Computer Vision. 1A-L1 Introduction

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

Computer Vision Lecture 17

Computer Vision Lecture 17

Introduction. Computer Vision. Computer Vision. What is computer vision? Related Fields. We ll begin with some introductory material.

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

Introductions. Today. Computer Vision Jan 19, What is computer vision? 1. Vision for measurement. Computer Vision

Discuss Proven technologies that addresses

Miniature faking. In close-up photo, the depth of field is limited.

Human Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg

CS 534: Computer Vision Segmentation and Perceptual Grouping

COS Lecture 10 Autonomous Robot Navigation

3D Computer Vision Introduction. Guido Gerig CS 6320, Spring 2012

Marcel Worring Intelligent Sensory Information Systems

Motion in 2D image sequences

Image Registration Lecture 1: Introduction

3D Computer Vision Introduction

CSc Topics in Computer Graphics 3D Photography

Chapter 9 Object Tracking an Overview

Why study Computer Vision?

COMP 102: Computers and Computing

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

Emerging Vision Technologies: Enabling a New Era of Intelligent Devices

Model-Based Stereo. Chapter Motivation. The modeling system described in Chapter 5 allows the user to create a basic model of a

What have we leaned so far?

Recap from Previous Lecture

CSE 4392/5369. Dr. Gian Luca Mariottini, Ph.D.

Photoneo's brand new PhoXi 3D Camera is the highest resolution and highest accuracy area based 3D

Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV Venus de Milo

CS201 Computer Vision Lect 4 - Image Formation

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Computer Vision. Introduction

3D Computer Vision Introduction

12/3/2009. What is Computer Vision? Applications. Application: Assisted driving Pedestrian and car detection. Application: Improving online search

AI in Computer Vision

EE795: Computer Vision and Intelligent Systems

Block Diagram. Physical World. Image Acquisition. Enhancement and Restoration. Segmentation. Feature Selection/Extraction.

Computer Vision. Alexandra Branzan Albu Spring 2009

Mobile Robotics. Mathematics, Models, and Methods. HI Cambridge. Alonzo Kelly. Carnegie Mellon University UNIVERSITY PRESS

Horus: Object Orientation and Id without Additional Markers

Real-time target tracking using a Pan and Tilt platform

Median filter. Non-linear filtering example. Degraded image. Radius 1 median filter. Today

Non-linear filtering example

Vision Review: Image Formation. Course web page:

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

Applications of Image-Based Modeling and Rendering in Art and Cinema

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

CS354 Computer Graphics Introduction. Qixing Huang Januray 17 8h 2017

Digital Image Processing Lectures 1 & 2

DEPTH AND GEOMETRY FROM A SINGLE 2D IMAGE USING TRIANGULATION

Image Based Lighting with Near Light Sources

Image Based Lighting with Near Light Sources

Computer Graphics Introduction. Taku Komura

What is computer vision?

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Lecture 1. Computer Graphics and Systems. Tuesday, January 15, 13

Photo Tourism: Exploring Photo Collections in 3D

Have you ever used computer vision? How? Where? Think-Pair-Share

Operation of machine vision system

Computer Science 175. Introduction to Computer Graphics lib175 time: m/w 2:30-4:00 pm place:md g125 section times: tba

VOLUMETRIC VIDEO // PLENOPTIC LIGHTFIELD // MULTI CAMERA METHODOLOGIES JORDAN HALSEY // VR PLAYHOUSE

COMPUTER VISION. Dr. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai

Vehicle Dimensions Estimation Scheme Using AAM on Stereoscopic Video

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Transcription:

CS595:Introduction to Computer Vision Instructor: Qi Li Instructor Course syllabus E-mail: qi.li@cs.wku.edu Office: TCCW 135 Office hours MW: 9:00-10:00, 15:00-16:00 T: 9:00-12:00, 14:00-16:00 F: 9:00-10:00 By appointment 1

Course syllabus Textbook Computer Vision by Linda Shapiro and George Stockman Grading 15%: Midterm exam (Mar, 5) 30%: Final exam (May, 6) 50%: Projects 5%: Attendance Background: Linear algebra Calculus Course syllabus Programming languages: Mathematica Additional: C/C++, Java 2

Course description An introduction to the analysis of images and video in order to recognize, reconstruct and model objects in the three-dimensional world. We will study the geometry of image formation; basic concepts in image processing such as smoothing, edge and feature detection, color, and texture; motion estimation; segmentation; stereo vision; 3-D modeling; and statistical recognition. Outline What is Vision? Course outline Applications 3

The Vision Problem How to infer salient properties of 3-D world from time-varying 2-D image projection Computer Vision Outline Image formation Low-level Single image processing Multiple views Mid-level Estimation, segmentation High-level Recognition 4

Image Formation 3-D geometry Physics of light Camera properties Focal length Distortion Sampling issues Spatial Temporal Low-level: Single Image Processing Filtering Edge Color Local pattern similarity Texture Appearance characterization from the statistics of applying multiple filters 3-D structure estimation from Shading Texture 5

Low-level: Multiple Views Stereo Structure from two views Structure from motion What can we learn in general from many views, whether they were taken simultaneously or sequentially? Mid-Level: Estimation, Segmentation Estimation: Fitting parameters to data Static (e.g., shape) Dynamic (e.g., tracking) Segmentation/clustering Breaking an image or image sequence into a few meaningful pieces with internal similarity 6

High-level: Recognition Recognition: Finding and parametrizing a known object Assignment to known categories using statistics/probability to make best choice Applications Inspection Factory monitoring: Analyze components for deviations Character recognition for mail delivery, scanning Biometrics (face recognition, etc.), surveillance Image databases: Image search on Google, etc. Medicine Segmentation for radiology Motion capture for gait analysis Entertainment 1 st down line in football, virtual advertising Matchmove, rotoscoping in movies Motion capture for movies, video games Architecture, archaeology: Image-based modeling, etc. Robot vision Obstacle avoidance, object recognition Motion compensation/image stabilization 7

Applications: Factory Inspection Cognex s CapInspect system Applications: Mosaicing courtesy of H. Shum and R. Szeliski 8

Applications: Face Detection courtesy of H. Rowley Applications: Face Recognition ORL face dataset 9

Applications: Text Detection & Recognition from J. Zhang et al. Applications: MRI Interpretation Coronal slice of brain from W. Wells et al. Segmented white matter 10

Detection and Recognition: How? Build models of the appearance characteristics (color, texture, etc.) of all objects of interest Detection: Look for areas of image with similar appearance to a targeting category of objects Recognition: Decide the identity of an object Segmentation: Categorize every pixel Applications: Football First-Down Line courtesy of Sportvision 11

Applications: Virtual Advertising courtesy of Princeton Video Image First-Down Line, Virtual Advertising: How? Sensors that measure pan, tilt, zoom and focus are attached to calibrated cameras at surveyed positions Knowledge of the 3-D position of the line, advertising rectangle, etc. can be directly translated into where in the image it should appear for a given camera The part of the image where the graphic is to be inserted is examined for occluding objects like the ball, players, and so on. These are recognized by being a sufficiently different color from the background at that point. This allows pixel-by-pixel compositing. 12

Applications: Inserting Computer Graphics with a Moving Camera Opening titles from the movie Panic Room Applications: Inserting Computer Graphics with a Moving Camera courtesy of 2d3 13

CG Insertion with a Moving Camera: How? This technique is often called matchmove Once again, we need camera calibration, but also information on how the camera is moving its egomotion. This allows the CG object to correctly move with the real scene, even if we don t know the 3-D parameters of that scene. Estimating camera motion: Much simpler if we know camera is moving sideways (e.g., some of the Panic Room shots), because then the problem is only 2-D For general motions: By identifying and following scene features over the entire length of the shot, we can solve retrospectively for what 3-D camera motion would be consistent with their 2-D image tracks. Must also make sure to ignore independently moving objects like cars and people. Applications: Rotoscoping 2d3 s Pixeldust 14

Applications: Motion Capture Vicon software: 12 cameras, 41 markers for body capture; 6 zoom cameras, 30 markers for face Applications: Motion Capture without Markers courtesy of C. Bregler 15

Motion Capture: How? Similar to matchmove in that we follow features and estimate underlying motion that explains their tracks Difference is that the motion is not of the camera but rather of the subject (though camera could be moving, too) Face/arm/person has more degrees of freedom than camera flying through space, but still constrained Special markers make feature identification and tracking considerably easier Multiple cameras gather more information Applications: Image-Based Modeling courtesy of P. Debevec Façade project: UC Berkeley Campanile 16

Image-Based Modeling: How? 3-D model constructed from manuallyselected line correspondences in images from multiple calibrated cameras Novel views generated by texturemapping selected images onto model Applications: Robotics Autonomous driving: Lane & vehicle tracking (with radar) 17