Introductions. Today. Computer Vision Jan 19, What is computer vision? 1. Vision for measurement. Computer Vision

Similar documents
Tracking wrapup Course recap. Announcements

EECS 442 Computer Vision fall 2011

EECS 442 Computer Vision fall 2012

Why is computer vision difficult?

CS 2770: Computer Vision. Introduction. Prof. Adriana Kovashka University of Pittsburgh January 9, 2018

Thanks to Chris Bregler. COS 429: Computer Vision

Computer Vision 2 Lecture 1

Miniature faking. In close-up photo, the depth of field is limited.

Local features: detection and description. Local invariant features

CSCD18: Computer Graphics. Instructor: Leonid Sigal

Introduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah

Course Administration

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

Computer Vision EE837, CS867, CE803

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

CS4495/6495 Introduction to Computer Vision. 1A-L1 Introduction

12/3/2009. What is Computer Vision? Applications. Application: Assisted driving Pedestrian and car detection. Application: Improving online search

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

Digital Images. Kyungim Baek. Department of Information and Computer Sciences. ICS 101 (November 1, 2016) Digital Images 1

Stereo: Disparity and Matching

Local features: detection and description May 12 th, 2015

EE290T : 3D Reconstruction and Recognition

Computer Vision. I-Chen Lin, Assistant Professor Dept. of CS, National Chiao Tung University

Introduction to Computer Vision

Epipolar Geometry and Stereo Vision

What is Computer Vision? Introduction. We all make mistakes. Why is this hard? What was happening. What do you see? Intro Computer Vision

Computer Vision Lecture 17

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Computer Vision. Marc Pollefeys Vittorio Ferrari Luc Van Gool

CS5670: Computer Vision

Computer Vision. Introduction

Computer Vision. Lecture # 1 Introduction & Fundamentals

Computer Vision Lecture 17

CS559: Computer Graphics. Lecture 1 Introduction Li Zhang University of Wisconsin, Madison

CS535: Interactive Computer Graphics

Wikipedia - Mysid

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

CS380: Computer Graphics Introduction. Sung-Eui Yoon ( 윤성의 ) Course URL:

CSE 167: Introduction to Computer Graphics. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2013

Introduction to Computer Vision MARCH 2018

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Visual Object Recognition

3D Computer Vision Introduction. Guido Gerig CS 6320, Spring 2012

Linear Algebra Review

3D Computer Vision Introduction

Local features and image matching. Prof. Xin Yang HUST

Computer Vision I - Introduction

Local Feature Detectors

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Assignment 2: Stereo and 3D Reconstruction from Disparity

CPSC 425: Computer Vision

CS595:Introduction to Computer Vision

CSc I6716 Spring D Computer Vision. Introduction. Instructor: Zhigang Zhu City College of New York

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

CSE 167: Introduction to Computer Graphics. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2016

Image warping and stitching

Local invariant features

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Computer Vision, CS766. Staff. Instructor: Li Zhang TA: Jake Rosin

Epipolar Geometry and Stereo Vision

Lecture 12 Recognition

Topics to be Covered in the Rest of the Semester. CSci 4968 and 6270 Computational Vision Lecture 15 Overview of Remainder of the Semester

Computer Vision. Alexandra Branzan Albu Spring 2009

Computer Vision. Alexandra Branzan Albu Spring 2010

Supervised learning. y = f(x) function

Image warping and stitching

Introduction to Computer Graphics. Jürgen P. Schulze, Ph.D. University of California, San Diego Fall Quarter 2011

Course Administration

What is computer vision?

0. Introduction: What is Computer Graphics? 1. Basics of scan conversion (line drawing) 2. Representing 2D curves

Optical flow and tracking

Have you ever used computer vision? How? Where? Think-Pair-Share

Intro. to Computer Network. Course Reading. Class Resources. Important Info.

Expectations. Computer Vision. Grading. Grading. Our Goal. Our Goal

CS 564: DATABASE MANAGEMENT SYSTEMS. Spring 2018

Image warping , , Computational Photography Fall 2017, Lecture 10

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Feature Tracking and Optical Flow

CS 682: Computer Vision

Feature Tracking and Optical Flow

Chaplin, Modern Times, 1936

Homographies and RANSAC

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17

Snakes, level sets and graphcuts. (Deformable models)

2%34 #5 +,,% ! # %& ()% #% +,,%. & /%0%)( 1 ! # %&# (&)# +, %& ./01 /1&2% # /&

CSE506: Operating Systems CSE 506: Operating Systems

Supervised learning. y = f(x) function

CS4670: Computer Vision

CS 241 Data Organization. August 21, 2018

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy

CS 241 Data Organization using C

Welcome. Orientation to online CPS102 Computer Science 2 (Java 2)

3D Computer Vision. Introduction. Introduction. CSc I6716 Fall Instructor: Zhigang Zhu City College of New York

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

BOSTON UNIVERSITY Metropolitan College MET CS342 Data Structures with Java Dr. V.Shtern (Fall 2011) Course Syllabus

UNIK 4690 Maskinsyn Introduction

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Overview. Introduction to Computer Vision. CSE 152 Lecture 1. Introduction to Computer Vision. CSE 152, Spring 2018

Transcription:

Introductions Instructor: Prof. Kristen Grauman grauman@cs.utexas.edu Computer Vision Jan 19, 2011 TA: Shalini Sahoo shalini@cs.utexas.edu Today What is computer vision? Course overview Requirements, logistics Done? Computer Vision Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 1. Vision for measurement Real-time stereo Structure from motion Tracking NASA Mars Rover Snavely et al. Demirdjian et al. Wang et al. 1

Computer Vision Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 2. Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) 2. Vision for perception, interpretation The Wicked Twister ride Lake Erie deck sky tree tree bench water Ferris wheel amusement park tree Cedar Point tree ride 12 E Objects Activities Scenes Locations Text / writing Faces Gestures Motions Emotions ride people waiting in line people sitting on ride umbrellas maxair carousel pedestrians Computer Vision 3. Visual search, organization Automatic understanding of images and video 1. Computing properties of the 3D world from visual data (measurement) 2. Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) 3. Algorithms to mine, search, and interact with visual data (search and organization) Query Image or video archives Relevant content Related disciplines Vision and graphics Graphics Image processing Artificial intelligence Computer vision Algorithms Machine learning Cognitive science Images Vision Graphics Model Inverse problems: analysis and synthesis. 2

Visual data in 1963 Visual data in 2011 L. G. Roberts, Machine Perception of Three Dimensional i Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963. Personal photo albums Movies, news, sports Surveillance and security Medical and scientific images Slide credit; L. Lazebnik Why vision? Faces and digital cameras As image sources multiply, so do applications Relieve humans of boring, easy tasks Enhance human abilities Advance human-computer interaction, visualization Perception for robotics / autonomous agents Organize and give access to visual content Camera waits for everyone to smile to take a photo [Canon] Setting camera focus via face detection Linking to info with a mobile device Video-based interfaces Situated search Yh Yeh et al., MIT Human joystick, NewsBreaker Live Assistive technology systems Camera Mouse, Boston College kooaba MSR Lincoln Microsoft Kinect 3

What else? Vision for medical & neuroimages fmri data Golland et al. Image guided surgery MIT AI Vision Group Special visual effects Safety & security The Matrix Navigation, di driver safety Monitoring pool (Poseidon) What Dreams May Come Mocap for Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz Pedestrian detection MERL, Viola et al. Surveillance Obstacles? What the computer gets 4

Why is vision difficult? Challenges: many nuisance parameters Ill-posed problem: real world much more complex than what we can measure in images 3D 2D Impossible to literally invert image formation process Illumination Object pose Clutter Occlusions Intra-class appearance Viewpoint Challenges: intra-class variation Challenges: importance of context slide credit: Fei-Fei, Fergus & Torralba slide credit: Fei-Fei, Fergus & Torralba Challenges: complexity Thousands to millions of pixels in an image 3,000-30,000 human recognizable object categories 30+ degrees of freedom in the pose of articulated objects (humans) Billions of images indexed by Google Image Search 18 billion+ prints produced from digital camera images in 2004 295.5 million camera phones sold in 2005 About half of the cerebral cortex in primates is devoted to processing visual information [Felleman and van Essen 1991] Ok, vision is very challenging Yet also active research area with exciting progress! 5

Brainstorm 1. What functionality should the system have? 2. Intuitively, what are the technical sub-problems that must be solved? Goals of this course Upper division undergrad course Introduction to primary topics Hands-on experience with algorithms Views of vision as a research area Topics overview Features and filters Features & filters Grouping & fitting Multiple views and motion Recognition Video processing Focus is on algorithms, rather than specific systems. Transforming and describing images; textures, colors, edges Grouping & fitting Multiple views and motion Multi-view geometry, matching, invariant features, stereo vision Lowe Hartley and Zisserman Clustering, segmentation, fitting; what parts belong together? [fig from Shi et al] Fei-Fei Li 6

Recognition and learning Video processing Tracking objects, video analysis, low level motion, optical flow Recognizing objects and categories, learning techniques Tomas Izo Textbooks Recommended book: Computer Vision: Algorithms and Applications By Rick Szeliski http://szeliski.org/book/ And others on reserve at PCL Requirements / Grading Problem sets (50%) Midterm exam (20%) Final exam (20%) Class participation, including attendance (10%) A quote from a student evaluation: To be honest, I think without going to class, the course would be very hard. Problem sets Some short answer concept questions Programming problem Implementation Explanation, results Code in Matlab available on CS Unix machines (see course page) These assignments are substantial. They will take significant time to do. Start early. Matlab Built-in toolboxes for lowlevel image processing, visualization Compact programs Intuitive interactive debugging Widely used in engineering 7

Pset 0 Pset 0: Matlab warmup + basic image manipulation Out Fri Jan 21, Due Fri Jan 28 Images as matrices Digital images Verify CS account and Matlab access Look at the tutorial t online Intensity : [0,255] Digital images j=1 width 520 i=1 Color images, RGB color space im[176][201] has value 164 im[194][203] has value 37 500 height R G B Preview of some problem sets Preview of some problem sets Grouping Image mosaics / stitching Image from Fei-Fei Li 8

Preview of some problem sets Preview of some problem sets Object search and recognition Tracking, activity recognition Collaboration policy All responses and code must be written individually. Students submitting answers or code found to be identical or substantially similar (due to inappropriate collaboration) risk failing the course. Assignment deadlines Assignments in by11:59 PM on due date Follow submission instructions given in assignment regarding hardcopy/electronic. Deadlines are firm. We ll use turnin timestamp. 3 free late days, total for the term. Use as you want, but note that first two assignments lighter than rest. If your program doesn t work, clean up the code, comment it well, explain what you have, and still submit. Miscellaneous Check class website regularly We ll use Blackboard to send email No laptops, p phones, etc. open in class please. Use our office hours! Coming up Now: check out Matlab tutorial online Friday 21st: Pset 0 out Monday 24 th : first lecture on linear filters Friday 28 th : Pset 0 due 9