Computer Vision I - Algorithms and Applications: Introduction

Size: px
Start display at page:

Download "Computer Vision I - Algorithms and Applications: Introduction"

Transcription

1 Computer Vision I - Algorithms and Applications: Introduction Carsten Rother 22/10/2013 Computer Vision I:Introduction

2 Admin Stuff Computer Vision I: Introduction 22/10/ Language: German/English; Slides: English (all the terminology and books are in English) Lecturer: Carsten Rother Exercises: Dmitri Schlesinger Staff carsten.rother@tu-dresden.de Announcements: online (to be set up) Course Books: Main: Computer Vision: Algorithms and Applications by Rick Szeliski; Springer An earlier version of the book is online: Secondary: Multiple View Geometry; Hartley and Zisserman; Cambridge Press Second edition. Parts of book are online: Also pointers to conference and journal articles

3 Course Overview (total 14 lectures) Computer Vision I: Introduction 22/10/ VL1 (21.10): Introduction Ex1: Intro to OpenCV (H.Heidrich) VL2 (28.10): Basics of Digital Image Processing Ex2: Fast Methods for Filtering; explain home work VL3 (4.11): Image Formation Models Ex3: homework VL4 (11.11): Single View Geometry and Camera Calibration Ex4: homework VL5 (18.11): Feature Extraction and Matching Ex5: homework VL6 (25.11): Two View Geometry Ex6: Robust Panoramic Stitching; explain home work VL7 (2.12): 3D reconstruction from multiple views Ex7: homework

4 Course Overview (total 14 lectures) Computer Vision I: Introduction 22/10/ VL8 (9.12): Dense Motion Estimation Ex1: homework VL9 (16.12): Image Segmentation Ex2: homework VL10 (6.1): Object Recognition (in progress) Ex3: Object Recognition; explain homework VL11 (13.1): Part-based recognition (in progress) Ex4: homework VL12 (20.1): Model-based Vision (in progress) Ex5: homework VL13 (27.1): Having Fun with Images (in progress) Ex6: homework VL14(3.2): Wrap-Up: 100 things we have learned (in progress) Ex5: homework

5 Related Lectures Computer Vision I: Introduction 22/10/ Machine Learning (D. Schlesinger) WS 13/14 (2/2/0) Basics on machine learning from data (not neccesrily image) Computer Vision 2: Models, Inference and Learning SS 14 (4/2/0) Requirement: ML WS 13/14 and CV WS 13/14 Goes mathematically deeper For doing a Thesis/PhD in the CVLD all three classes are compulsory Computer graphics (Prof. Gumhold) (Introduction, I, II) 3D Scanning with structured light; Illumination models; Geometry

6 Exams and Exercises Computer Vision I: Introduction 22/10/ Exam: in person (maybe written exams in future) Exercises/homework: There are 3 blocks Each block has several excersises with different points You have to collect in total 10 points to sit the exam The exercises have to be handed in until end of semester (ideally after each block) Last possible date to hand in is end of semester (end of January) Collaboration: You are encouraged to discuss the topics You are not allowed to copy any code for the homework from other people

7 Computer Vision I: Introduction 22/10/ Before we start some Advertisement CVLD Overview Interactive Image and Data manipulation Applied Optimization, Models, and Learning 3D Scene Understanding Inverse rendering from moving images Benchmarking and Label collection Image Analysis for System Biology

8 Advertisement Master/Bachelor/Project/ Computer Vision I: Introduction 22/10/ We have a lot of different topics to offer: ranging from theoretical to practical ones Main emphasis: 3D Image Editing 3D Scene understanding Learning and Inference in undirected Graphical models A list of topics will be available soon on:

9 Advertisement Master/Bachelor/Project/ Computer Vision I: Introduction 22/10/ A project work in the CVLD is a good stepping stone if you: want to do a PhD in computer vision, graphics, machine learning want to become a researcher or software developer in one of the big research labs (Microsoft Research, Google, Adobe, TechniColor, etc) If you are interested in doing a start-up Other computer vision related industry Example: Master Thesis on Video Matting jointly with Adobe Seattle (e.g. for Adobe Adobe After Effects)

10 Advertisement: Komplex Praktikum; Einfuehrungspraktikum Computer Vision I: Introduction 22/10/ Komplex Praktikum: Segmentation in 3D images (3D Image Editing) real-time Pose estimation (3D scene understanding) Einfuerhungspraktium: 6 Exercises to get to know Computer Vision (ranging from physics based vision to semantic vision)

11 Before we start. Computer Vision I: Introduction 22/10/ Please give Feedback during the lecture, after the lecture or via This my first full course I give at University Feedback helps me to adjust the level.

12 Introduction to Computer Vision Computer Vision I: Introduction 22/10/ What is computer Vision? (Potential) Definition: Developing computational models and algorithms to interpret digital images and visual data in order to understand the visual world we live in.

13 Introduction to Computer Vision Computer Vision I: Introduction 22/10/ What is computer Vision? (Potential) Definition: Developing computational models and algorithms to interpret digital images and visual data in order to understand the visual world we live in.

14 What does it mean to understand? Computer Vision I: Introduction 22/10/ Physics-based vision: Geometry Segmentation Camera parameters Emitted light (sun) Surface properties: Reflectance, material (Potential) Definition: Developing computational models and algorithms to interpret digital images and visual data in order to understand the visual world we live in. Semantic-based vision: Objects: class, pose Scene: outdoor, Attributes/Properties: - old-fashioned train - A-on-top-of-B

15 Image-formation model Very many sources of variability Image [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

16 Image-formation model Scene type Scene geometry Street scene [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

17 Image-formation model Scene type Scene geometry Object classes Sky Building 3 Road Street scene Sidewalk Bicycle Tree 3 Car 5 Person 4 Bench Bollard [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

18 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Sky Building 3 Road Street scene Sidewalk Bicycle Tree 3 Car 5 Person 4 Bench Bollard [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

19 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Object shape Street scene [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

20 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Object shape Depth/occlusions [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

21 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Object shape Depth/occlusions Object appearance [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

22 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Object shape Depth/occlusions Object appearance Illumination Shadows [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

23 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Object shape Depth/occlusions Object appearance Illumination Shadows [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

24 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Object shape Depth/occlusions Object appearance Illumination Shadows Motion blur Camera effects [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

25 Image-formation model Scene type Scene geometry Object classes Object position Object orientation Object shape Depth/occlusions Object appearance Illumination Shadows Motion blur Camera effects [Slide Credits: John Winn, ICML 2008] Computer Vision I: Introduction 22/10/

26 The Scene Parsing challenge --- a grand challenge of computer vision Computer Vision I: Introduction 22/10/ (Probabilistic) Script = {Camera, Light, Geometry, Material, Objects, Scene, Attributes, Others} Single image Many applications do not have to extract the full probabilistic script but only a subset, e.g. does the image contain a car? many examples to come later

27 Why is scene parsing hard? Computer Vision I: Introduction 22/10/ Computer Graphics 3D Rich Representation, Script = {Camera, Light, Geometry, Material, Objects, Scene, Attributes, Others} 2D pixel representation Computer Vision Computer Vision can be seen as inverse graphics

28 Example of a recent work Computer Vision I: Introduction 22/10/ Scene graph Input [Gupta, Efros, Herbert, ECCV 10]

29 Why is scene parsing hard? [Xiao et al. NIPS 2012] [Sussman, Lamport, Guzman 1966] [Slide credits Andrew Blake] Computer Vision I: Introduction 22/10/

30 Introduction to Computer Vision Computer Vision I: Introduction 22/10/ What is computer Vision? (Potential) Definition: Developing computational models and algorithms to interpret digital images and visual data in order to understand the visual world we live in.

31 How can we interpret visual data? Computer Vision I: Introduction 22/10/ D pixel representation Computer Graphics 3D Rich Representation, Script = {Camera, Light, Geometry, Material, Objects, Scene, Attributes, Others} Computer Vision What general (prior) knowledge of the world (not necessarily visual) can be exploit? What properties / cues from the image can be used? Both aspects are quite well understood (a lot is based on physics) but how to use them is efficiently is open challenged (see later)

32 How can we interpret visual data? Computer Vision I: Introduction 22/10/ D pixel representation Computer Graphics 3D Rich Representation, Script = {Camera, Light, Geometry, Material, Objects, Scene, Attributes, Others} Computer Vision What general (prior) knowledge of the world (not necessarily visual) can be exploit? What properties / cues from the image can be used? Both aspects are quite well understood (a lot is based on physics) but how to use them is efficiently is open challenged (see later)

33 Prior knowledge (examples) Computer Vision I: Introduction 22/10/ Hard prior knowledge Trains do not fly in the air Objects are connected in 3D Soft prior knowledge: The camera is more likely 1.70m above ground and not 0.1m. Self-similarity: all black pixels belong to the same object

34 Prior knowledge harder to describe Computer Vision I: Introduction 22/10/ Describe Image Texture Real Image zoom Not a real Image zoom Microscopic Images. What is the true shape of these objects

35 Example: State-of-the Art Denoising Computer Vision I: Introduction 22/10/ Ground truth Dust on Lens Perceptual Error normal Error [Janscary, Nowozin, Rother, ECCV 12]

36 The importance of Prior knowledge Computer Vision I: Introduction 22/10/ Which patch is brighter: A or B? [Edward Adelson]

37 The importance of Prior knowledge Computer Vision I: Introduction 22/10/ Which patch is brighter: A or B? [Edward Adelson]

38 The importance of Prior knowledge Computer Vision I: Introduction 22/10/ D 3D 3D 2D Image - local What the computer sees A B Ambient Light An unlikely 3D representation (hard to see for a human) A B True colors in 3D world The most likely 3D representation Direct Light True colours In 3D world This is what humans see implicitly. Ideally the computer sees the sane. A B

39 The importance of Prior knowledge Computer Vision I: Introduction 22/10/ Light 2D Image 3D representation Humans see an image not as a set of 2D pixels. They understand an image as a projection of the 3D world we live in Humans have the prior knowledge about the world encoded, such as: Light cast shadows Objects do not fly in the air A car is likely to move but a table is unlikely to move We have to teach the computer this prior knowledge to understand 2D images as picture of the 3D world

40 The importance of Prior knowledge Computer Vision I: Introduction 22/10/ Which monster is bigger?

41 The importance of Prior knowledge Computer Vision I: Introduction 22/10/ In the 2D Image Which monster is bigger? 1meter 2meter In the 3D world (true)

42 Two Explanations: a) People are different height and room right shape b) People are same height but room weirdly shaped Computer Vision I: Introduction 22/10/

43 Human Vision can be fooled Computer Vision I: Introduction 22/10/

44 Male or Female Computer Vision I: Introduction 22/10/

45 How can we interpret visual data? Computer Vision I: Introduction 22/10/ D pixel representation Computer Graphics 3D Rich Representation, Script = {Camera, Light, Geometry, Material, Objects, Scene, Attributes, Others} Computer Vision What general (prior) knowledge of the world (not necessarily visual) can be exploit? What properties / cues from the image can be used? Both aspects are quite well understood (a lot is based on physics) but how to use them is efficiently is open challenged (see later)

46 Cue: Appearance (Colour, Texture) for object recognition Computer Vision I: Introduction 22/10/ To what object does the patch belong to?

47 Cue: Outlines (shape) for object recognition Computer Vision I: Introduction 22/10/

48 Guess the Object Computer Vision I: Introduction 22/10/ Colour Texture Shape [from JohnWinn ICML 2008]

49 Guess the ob ject Computer Vision I: Introduction 22/10/ ? Colour Texture Shape [from JohnWinn ICML 2008]

50 Cue: Context for object recognition Computer Vision I: Introduction 22/10/

51 Cue: Context for object recognition Computer Vision I: Introduction 22/10/

52 Cue: stereo vision (2 frames) for geometry estimation Computer Vision I: Introduction 22/10/ Ground truth Algorithmic output

53 Cue: Multiple Frames for geometry estimation Computer Vision I: Introduction 22/10/

54 Cue: Convergence for geometry estimation Computer Vision I: Introduction 22/10/ vp Lines with same vanishing point may also be parallel in 3D

55 Cue: Shading & shadows for geometry and Light estimation Computer Vision I: Introduction 22/10/

56 Texture gradient for geometry estimation Computer Vision I: Introduction 22/10/

57 The Scene Parsing challenge --- a grand challenge of computer vision Computer Vision I: Introduction 22/10/ (Probabilistic) Script = {Camera, Light, Geometry, Material, Objects, Scene, Attributes, Others} Single image Many applications do not have to extract the full probabilistic script but only a subset, e.g. does the image contain a car? many examples to come later

58 many application scenarios are in reach Computer Vision I: Introduction 22/10/ To simplify the problem: 1) Richer Input: - Modern sensing technology - Moving images - User involvement 2) Rich Data to learn from: - use the web - crowdsourcing to get labels (online games, mechanical turk) - Powerful graphics engines 3) For many practical applications: We do not have to infer the full probabilistic script

59 Kinect has simplified (revolutionized) computer vision Computer Vision I: Introduction 22/10/ [Izadi et al. 11]

60 Animate the world Computer Vision I: Introduction 22/10/ [Chen et al. UIST 12]

61 New hardware design Computer Vision I: Introduction 22/10/

62 Kinect Body Pose estimation and tracking Computer Vision I: Introduction 22/10/

63 Kinect Body Pose estimation and tracking Computer Vision I: Introduction 22/10/

64 behind the scene Computer Vision I: Introduction 22/10/ Graphics simulation Synthetic (graphics) Real (hand-labelled)

65 Body tracking and Gesture Recognition has many applications Computer Vision I: Introduction 22/10/ StartUp 2012: Try Fashion online Very large impact in many field: Gaming, Robotics, HCI, Medicine,

66 Real-time pedestrian detection Computer Vision I: Introduction 22/10/

67 Real-time Face recognition Computer Vision I: Introduction 22/10/ e.g. Canon powershot

68 General Object recognition & segmentation Computer Vision I: Introduction 22/10/ Good results [TextonBoost; Shotton et al, 06]

69 General Object recognition & segmentation Computer Vision I: Introduction 22/10/ Failure cases [TextonBoost; Shotton et al, 06]

70 Image Search Computer Vision I: Introduction 22/10/

71 Start-Up Company: Like.com Computer Vision I: Introduction 22/10/

72 Interactive Image manipulation Computer Vision I: Introduction 22/10/ [Agrawal et al 04]

73 Interactive Image manipulation Computer Vision I: Introduction 22/10/

74 Image manipulation - stitching Computer Vision I: Introduction 22/10/

75 Image manipulation - stitching Computer Vision I: Introduction 22/10/

76 Video manipulation Computer Vision I: Introduction 22/10/ Video (duplicated) Video

77 Image de-convolution Computer Vision I: Introduction 22/10/ Input Output Output kernel [Schmidt, Rother, Nowozin, Jancsary, Roth 2013] Best Student Paper award

78 Image de-convolution (other domains) Computer Vision I: Introduction 22/10/ input output

79 Video Editing Computer Vision I: Introduction 22/10/ [Rav-Acha et al. 08]

80 Automatic Video Summary (StartUp: Magisto) Computer Vision I: Introduction 22/10/

81 Automatic Photo Summary - Commercial Computer Vision I: Introduction 22/10/ AutoCollage Microsoft Research [Rother et al. Siggraph 2006]

82 Movie Industry Computer Vision I: Introduction 22/10/ Pirates of the Caribbean, Industrial Light and Magic

83 Medicine and Biology Computer Vision I: Introduction 22/10/ <tod o>

84 Robotics Computer Vision I: Introduction 22/10/ Robocup Nasa Mars exploration

85 Introduction to Computer Vision Computer Vision I: Introduction 22/10/ What is computer Vision? (Potential) Definition: Developing computational models and algorithms to interpret digital images and visual data in order to understand the visual world we live in.

86 Interactive Segmentation Computer Vision I: Introduction 22/10/

87 Model versus Algorithm Computer Vision I: Introduction 22/10/ Example: Interactive Segmentation Goal z = R, G, B n x = 0,1 n (user-specified pixels are not optimized for) Given z; derive binary x: Model: Energy function E x (implicitly models a statistical model P(x z) ) Algorithm to minimization: x = argmin x E(x)

88 Model for a starfish Computer Vision I: Introduction 22/10/ Goal: formulate E(x) such that E x = 0.01 E x = 0.05 E x = 0.05 E x = 0.1 Optimal solution x = argmin x E(x)

89 How does the energy looks like? Computer Vision I: Introduction 22/10/ Energy function (sum of terms θ): E(x) = θ i x i + i Unary terms i,j θ ij (x i, x j ) Pairwise terms

90 How does the energy looks like? Computer Vision I: Introduction 22/10/ x j θ ij (x i, x j ) pairwise terms x i θ i (x i ) unary terms Visualization: Undirected graphical models

91 Green Unary term Green Computer Vision I: Introduction 22/10/ Red User labelled pixels Red Gaussian Mixture Model Fit

92 Unary term New query image z i θ i (x i = 0) θ i (x i = 1) Dark means likely background Dark means likely foreground Optimum with unary terms only Computer Vision I: Introduction 22/10/

93 Pairwise term Computer Vision I: Introduction 22/10/ θ ij x i, x j = x i x j Ising Prior When is θ ij (x i, x j ) small, i.e. likely configuration? Most likely Most likely Intermediate likely most unlikely This models the assumption that the object is spatially coherent Next step could be: model shapes of starfishes

94 Energy minimization (optimization) Computer Vision I: Introduction 22/10/ ω = 0 ω = 10 E(x) = i ω i,j θ i x i + x i x j ω = 40 ω = 200

95 The key Questions What type of modelling language should be chosen: undirected or directed discrete Graphical models, Continuous-Domain models How does the exact model look like: What is the structure How do the terms look like Can we learn the Model from Data: Learn structure Learn potential functions This is the focus of the course (SS 14): Computer Vision II: Models, Inference and Learning This lecture: mainly about algorithms (optimization) How do we optimize the model (perform inference): fast, approximate Exactly solvable? NP-hard? Computer Vision I: Introduction 22/10/

96 Another Example: Model versus Algorithm Computer Vision I: Introduction 22/10/ Input: Image sequence [Data courtesy from Oliver Woodford] Output: New view Model: Minimize a binary 4-connected pair-wise graph (choose a colour-mode at each pixel) [Fitzgibbon et al. 03]

97 Another Example: Model versus Algorithm Computer Vision I: Introduction 22/10/ Ground Truth Graph Cut with truncation [Rother et al. 05] (approximate solution) Belief Propagation (approximate solution) ICM, Simulated Annealing (approximate solution) QPBOP [Boros et al. 06; Rother et al. 07] (exact solution) Why is the result not perfect? Model or Optimization

98 Why is computer vision interesting (to you)? Computer Vision I: Introduction 22/10/ It is a challenging problem that is far from being solved It combines insights and tools from many fields and disciplines: Mathematics and statistics Cognition and perception Engineering (signal processing) And of course, computer science

99 Why is computer vision interesting (to you)? Computer Vision I: Introduction 22/10/ Allows you to apply theoretical skills... that you may otherwise only use rarely. Quite rewarding: Often visually intuitive and encouraging results. It is a growing field: Cameras are becoming more and more popular There are a lot of companies (big, small, startups) working in vision Conferences are growing rapidly.

100 Relationship to other fields Computer Vision I: Introduction 22/10/ [Wikipedia]

101 Relationship to other fields my personal view Computer Vision I: Introduction 22/10/ Human-Computer Interaction Biology Robotics Medicine (many more) Computer Vision Applications AI

102 Reading for next class Computer Vision I: Introduction 22/10/ This lecture: Chapter 1 (in particular: 1.1) Next lecture: Chapter 3 (in particular: 3.2, 3.3) - Basics of Digital Image Processing Chapter 4.2 and Edge and Line detection

Computer Vision I - Introduction

Computer Vision I - Introduction Computer Vision I - Introduction Carsten Rother 21/10/2014 Computer Vision I:Introduction Computer Vision I: Introduction 21/10/2014 2 Admin Stuff Language: German/English; Slides: English (all the terminology

More information

Computer Vision eine Herausforderung in der Künstlichen Intelligenz

Computer Vision eine Herausforderung in der Künstlichen Intelligenz Computer Vision eine Herausforderung in der Künstlichen Intelligenz Prof. Carsten Rother Computer Vision Lab Dresden Institute of Artificial Intelligence 11/12/2013 Computer Vision a hard case for AI Roadmap

More information

Segmentation. Bottom up Segmentation Semantic Segmentation

Segmentation. Bottom up Segmentation Semantic Segmentation Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,

More information

Data-driven Depth Inference from a Single Still Image

Data-driven Depth Inference from a Single Still Image Data-driven Depth Inference from a Single Still Image Kyunghee Kim Computer Science Department Stanford University kyunghee.kim@stanford.edu Abstract Given an indoor image, how to recover its depth information

More information

Introduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah

Introduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah Introduction to Computer Vision Srikumar Ramalingam School of Computing University of Utah srikumar@cs.utah.edu Course Website http://www.eng.utah.edu/~cs6320/ What is computer vision? Light source 3D

More information

Introductions. Today. Computer Vision Jan 19, What is computer vision? 1. Vision for measurement. Computer Vision

Introductions. Today. Computer Vision Jan 19, What is computer vision? 1. Vision for measurement. Computer Vision Introductions Instructor: Prof. Kristen Grauman grauman@cs.utexas.edu Computer Vision Jan 19, 2011 TA: Shalini Sahoo shalini@cs.utexas.edu Today What is computer vision? Course overview Requirements, logistics

More information

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009 Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context

More information

What is Computer Vision? Introduction. We all make mistakes. Why is this hard? What was happening. What do you see? Intro Computer Vision

What is Computer Vision? Introduction. We all make mistakes. Why is this hard? What was happening. What do you see? Intro Computer Vision What is Computer Vision? Trucco and Verri (Text): Computing properties of the 3-D world from one or more digital images Introduction Introduction to Computer Vision CSE 152 Lecture 1 Sockman and Shapiro:

More information

Computer Vision. I-Chen Lin, Assistant Professor Dept. of CS, National Chiao Tung University

Computer Vision. I-Chen Lin, Assistant Professor Dept. of CS, National Chiao Tung University Computer Vision I-Chen Lin, Assistant Professor Dept. of CS, National Chiao Tung University About the course Course title: Computer Vision Lectures: EC016, 10:10~12:00(Tues.); 15:30~16:20(Thurs.) Pre-requisites:

More information

CS4495/6495 Introduction to Computer Vision. 1A-L1 Introduction

CS4495/6495 Introduction to Computer Vision. 1A-L1 Introduction CS4495/6495 Introduction to Computer Vision 1A-L1 Introduction Outline What is computer vision? State of the art Why is this hard? Course overview Software Why study Computer Vision? Images (and movies)

More information

Computer Vision. Introduction

Computer Vision. Introduction Computer Vision Introduction Filippo Bergamasco (filippo.bergamasco@unive.it) http://www.dais.unive.it/~bergamasco DAIS, Ca Foscari University of Venice Academic year 2016/2017 About this course Official

More information

Thanks to Chris Bregler. COS 429: Computer Vision

Thanks to Chris Bregler. COS 429: Computer Vision Thanks to Chris Bregler COS 429: Computer Vision COS 429: Computer Vision Instructor: Thomas Funkhouser funk@cs.princeton.edu Preceptors: Ohad Fried, Xinyi Fan {ohad,xinyi}@cs.princeton.edu Web page: http://www.cs.princeton.edu/courses/archive/fall13/cos429/

More information

EECS 442 Computer Vision fall 2011

EECS 442 Computer Vision fall 2011 EECS 442 Computer Vision fall 2011 Instructor Silvio Savarese silvio@eecs.umich.edu Office: ECE Building, room: 4435 Office hour: Tues 4:30-5:30pm or under appoint. (after conversation hour) GSIs: Mohit

More information

Computer Graphics Disciplines. Grading. Textbooks. Course Overview. Assignment Policies. Computer Graphics Goals I

Computer Graphics Disciplines. Grading. Textbooks. Course Overview. Assignment Policies. Computer Graphics Goals I CSCI 480 Computer Graphics Lecture 1 Course Overview January 10, 2011 Jernej Barbic University of Southern California Administrative Issues Modeling Animation Rendering OpenGL Programming Course Information

More information

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field Finally: Motion and tracking Tracking objects, video analysis, low level motion Motion Wed, April 20 Kristen Grauman UT-Austin Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys, and S. Lazebnik

More information

Why is computer vision difficult?

Why is computer vision difficult? Why is computer vision difficult? Viewpoint variation Illumination Scale Why is computer vision difficult? Intra-class variation Motion (Source: S. Lazebnik) Background clutter Occlusion Challenges: local

More information

Topics to be Covered in the Rest of the Semester. CSci 4968 and 6270 Computational Vision Lecture 15 Overview of Remainder of the Semester

Topics to be Covered in the Rest of the Semester. CSci 4968 and 6270 Computational Vision Lecture 15 Overview of Remainder of the Semester Topics to be Covered in the Rest of the Semester CSci 4968 and 6270 Computational Vision Lecture 15 Overview of Remainder of the Semester Charles Stewart Department of Computer Science Rensselaer Polytechnic

More information

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Marc Pollefeys Joined work with Nikolay Savinov, Christian Haene, Lubor Ladicky 2 Comparison to Volumetric Fusion Higher-order ray

More information

Computer Vision I - Filtering and Feature detection

Computer Vision I - Filtering and Feature detection Computer Vision I - Filtering and Feature detection Carsten Rother 30/10/2015 Computer Vision I: Basics of Image Processing Roadmap: Basics of Digital Image Processing Computer Vision I: Basics of Image

More information

Computer Vision I - Algorithms and Applications: Basics of Image Processing

Computer Vision I - Algorithms and Applications: Basics of Image Processing Computer Vision I - Algorithms and Applications: Basics of Image Processing Carsten Rother 28/10/2013 Computer Vision I: Basics of Image Processing Link to lectures Computer Vision I: Basics of Image Processing

More information

Computer Vision: Making machines see

Computer Vision: Making machines see Computer Vision: Making machines see Roberto Cipolla Department of Engineering http://www.eng.cam.ac.uk/~cipolla/people.html http://www.toshiba.eu/eu/cambridge-research- Laboratory/ Vision: what is where

More information

Contexts and 3D Scenes

Contexts and 3D Scenes Contexts and 3D Scenes Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem Administrative stuffs Final project presentation Nov 30 th 3:30 PM 4:45 PM Grading Three senior graders (30%)

More information

Making Machines See. Roberto Cipolla Department of Engineering. Research team

Making Machines See. Roberto Cipolla Department of Engineering. Research team Making Machines See Roberto Cipolla Department of Engineering Research team http://www.eng.cam.ac.uk/~cipolla/people.html Cognitive Systems Engineering Cognitive Systems Engineering Introduction Making

More information

Markov Random Fields and Segmentation with Graph Cuts

Markov Random Fields and Segmentation with Graph Cuts Markov Random Fields and Segmentation with Graph Cuts Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem Administrative stuffs Final project Proposal due Oct 27 (Thursday) HW 4 is out

More information

Computer and Machine Vision

Computer and Machine Vision Computer and Machine Vision Lecture Week 4 Part-2 February 5, 2014 Sam Siewert Outline of Week 4 Practical Methods for Dealing with Camera Streams, Frame by Frame and De-coding/Re-encoding for Analysis

More information

Computer Vision I - Appearance-based Matching and Projective Geometry

Computer Vision I - Appearance-based Matching and Projective Geometry Computer Vision I - Appearance-based Matching and Projective Geometry Carsten Rother 05/11/2015 Computer Vision I: Image Formation Process Roadmap for next four lectures Computer Vision I: Image Formation

More information

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa 3D Scanning Qixing Huang Feb. 9 th 2017 Slide Credit: Yasutaka Furukawa Geometry Reconstruction Pipeline This Lecture Depth Sensing ICP for Pair-wise Alignment Next Lecture Global Alignment Pairwise Multiple

More information

Automatic Photo Popup

Automatic Photo Popup Automatic Photo Popup Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University What Is Automatic Photo Popup Introduction Creating 3D models from images is a complex process Time-consuming

More information

COMP 102: Computers and Computing

COMP 102: Computers and Computing COMP 102: Computers and Computing Lecture 23: Computer Vision Instructor: Kaleem Siddiqi (siddiqi@cim.mcgill.ca) Class web page: www.cim.mcgill.ca/~siddiqi/102.html What is computer vision? Broadly speaking,

More information

Computer Vision I - Basics of Image Processing Part 2

Computer Vision I - Basics of Image Processing Part 2 Computer Vision I - Basics of Image Processing Part 2 Carsten Rother 07/11/2014 Computer Vision I: Basics of Image Processing Roadmap: Basics of Digital Image Processing Computer Vision I: Basics of Image

More information

Quadratic Pseudo-Boolean Optimization(QPBO): Theory and Applications At-a-Glance

Quadratic Pseudo-Boolean Optimization(QPBO): Theory and Applications At-a-Glance Quadratic Pseudo-Boolean Optimization(QPBO): Theory and Applications At-a-Glance Presented By: Ahmad Al-Kabbany Under the Supervision of: Prof.Eric Dubois 12 June 2012 Outline Introduction The limitations

More information

What have we leaned so far?

What have we leaned so far? What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging What have we learned so far? Image Filtering Image Warping Camera Projection Model Project 2: Panoramic

More information

Computer Vision at Cambridge: Reconstruction,Registration and Recognition

Computer Vision at Cambridge: Reconstruction,Registration and Recognition Computer Vision at Cambridge: Reconstruction,Registration and Recognition Roberto Cipolla Research team http://www.eng.cam.ac.uk/~cipolla/people.html Cognitive Systems Engineering Cognitive Systems Engineering

More information

3D Spatial Layout Propagation in a Video Sequence

3D Spatial Layout Propagation in a Video Sequence 3D Spatial Layout Propagation in a Video Sequence Alejandro Rituerto 1, Roberto Manduchi 2, Ana C. Murillo 1 and J. J. Guerrero 1 arituerto@unizar.es, manduchi@soe.ucsc.edu, acm@unizar.es, and josechu.guerrero@unizar.es

More information

Prof. Feng Liu. Spring /17/2017. With slides by F. Durand, Y.Y. Chuang, R. Raskar, and C.

Prof. Feng Liu. Spring /17/2017. With slides by F. Durand, Y.Y. Chuang, R. Raskar, and C. Prof. Feng Liu Spring 2017 http://www.cs.pdx.edu/~fliu/courses/cs510/ 05/17/2017 With slides by F. Durand, Y.Y. Chuang, R. Raskar, and C. Rother Last Time Image segmentation Normalized cut and segmentation

More information

Jamie Shotton, Andrew Fitzgibbon, Mat Cook, Toby Sharp, Mark Finocchio, Richard Moore, Alex Kipman, Andrew Blake CVPR 2011

Jamie Shotton, Andrew Fitzgibbon, Mat Cook, Toby Sharp, Mark Finocchio, Richard Moore, Alex Kipman, Andrew Blake CVPR 2011 Jamie Shotton, Andrew Fitzgibbon, Mat Cook, Toby Sharp, Mark Finocchio, Richard Moore, Alex Kipman, Andrew Blake CVPR 2011 Auto-initialize a tracking algorithm & recover from failures All human poses,

More information

EE795: Computer Vision and Intelligent Systems

EE795: Computer Vision and Intelligent Systems EE795: Computer Vision and Intelligent Systems Spring 2012 TTh 17:30-18:45 FDH 204 Lecture 14 130307 http://www.ee.unlv.edu/~b1morris/ecg795/ 2 Outline Review Stereo Dense Motion Estimation Translational

More information

Markov Networks in Computer Vision

Markov Networks in Computer Vision Markov Networks in Computer Vision Sargur Srihari srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Some applications: 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction

More information

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos Image as Markov Random Field and Applications 1 Tomáš Svoboda, svoboda@cmp.felk.cvut.cz Czech Technical University in Prague, Center for Machine Perception http://cmp.felk.cvut.cz Talk Outline Last update:

More information

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013 Lecture 19: Depth Cameras Visual Computing Systems Continuing theme: computational photography Cameras capture light, then extensive processing produces the desired image Today: - Capturing scene depth

More information

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision What Happened Last Time? Human 3D perception (3D cinema) Computational stereo Intuitive explanation of what is meant by disparity Stereo matching

More information

Chapter 7. Conclusions and Future Work

Chapter 7. Conclusions and Future Work Chapter 7 Conclusions and Future Work In this dissertation, we have presented a new way of analyzing a basic building block in computer graphics rendering algorithms the computational interaction between

More information

CSc I6716 Spring D Computer Vision. Introduction. Instructor: Zhigang Zhu City College of New York

CSc I6716 Spring D Computer Vision. Introduction. Instructor: Zhigang Zhu City College of New York Introduction CSc I6716 Spring 2012 Introduction Instructor: Zhigang Zhu City College of New York zzhu@ccny.cuny.edu Course Information Basic Information: Course participation p Books, notes, etc. Web page

More information

Markov Random Fields and Gibbs Sampling for Image Denoising

Markov Random Fields and Gibbs Sampling for Image Denoising Markov Random Fields and Gibbs Sampling for Image Denoising Chang Yue Electrical Engineering Stanford University changyue@stanfoed.edu Abstract This project applies Gibbs Sampling based on different Markov

More information

Markov Networks in Computer Vision. Sargur Srihari

Markov Networks in Computer Vision. Sargur Srihari Markov Networks in Computer Vision Sargur srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Important application area for MNs 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction

More information

CS 558: Computer Vision 13 th Set of Notes

CS 558: Computer Vision 13 th Set of Notes CS 558: Computer Vision 13 th Set of Notes Instructor: Philippos Mordohai Webpage: www.cs.stevens.edu/~mordohai E-mail: Philippos.Mordohai@stevens.edu Office: Lieb 215 Overview Context and Spatial Layout

More information

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33 Stereo Epipolar Geometry for General Cameras Sanja Fidler CSC420: Intro to Image Understanding 1 / 33 Stereo Epipolar geometry Case with two cameras with parallel optical axes General case Now this Sanja

More information

Lecture 13 Segmentation and Scene Understanding Chris Choy, Ph.D. candidate Stanford Vision and Learning Lab (SVL)

Lecture 13 Segmentation and Scene Understanding Chris Choy, Ph.D. candidate Stanford Vision and Learning Lab (SVL) Lecture 13 Segmentation and Scene Understanding Chris Choy, Ph.D. candidate Stanford Vision and Learning Lab (SVL) http://chrischoy.org Stanford CS231A 1 Understanding a Scene Objects Chairs, Cups, Tables,

More information

Supervised texture detection in images

Supervised texture detection in images Supervised texture detection in images Branislav Mičušík and Allan Hanbury Pattern Recognition and Image Processing Group, Institute of Computer Aided Automation, Vienna University of Technology Favoritenstraße

More information

Decomposing a Scene into Geometric and Semantically Consistent Regions

Decomposing a Scene into Geometric and Semantically Consistent Regions Decomposing a Scene into Geometric and Semantically Consistent Regions Stephen Gould sgould@stanford.edu Richard Fulton rafulton@cs.stanford.edu Daphne Koller koller@cs.stanford.edu IEEE International

More information

ART OF 3D MODELLING & ANIMATION

ART OF 3D MODELLING & ANIMATION ART OF 3D MODELLING & ANIMATION COURSE OVERVIEW This is a three year long course for secondary students covering on the various aspects of Drawing, 2D animation and 3D animation. At the end of each year,

More information

human vision: grouping k-means clustering graph-theoretic clustering Hough transform line fitting RANSAC

human vision: grouping k-means clustering graph-theoretic clustering Hough transform line fitting RANSAC COS 429: COMPUTER VISON Segmentation human vision: grouping k-means clustering graph-theoretic clustering Hough transform line fitting RANSAC Reading: Chapters 14, 15 Some of the slides are credited to:

More information

Computer Vision, CS766. Staff. Instructor: Li Zhang TA: Jake Rosin

Computer Vision, CS766. Staff. Instructor: Li Zhang TA: Jake Rosin Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Jake Rosin rosin@cs.wisc.edu Today Introduction Administrative Stuff Overview of the Course About Me Li Zhang ( 张力 ) Last name

More information

Computer Vision Lecture 17

Computer Vision Lecture 17 Computer Vision Lecture 17 Epipolar Geometry & Stereo Basics 13.01.2015 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar in the summer semester

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 6 Role of context in object detection Objects and cognition Ambiguous objects Unfavorable viewing condition Context helps in object

More information

Data driven 3D shape analysis and synthesis

Data driven 3D shape analysis and synthesis Data driven 3D shape analysis and synthesis Head Neck Torso Leg Tail Ear Evangelos Kalogerakis UMass Amherst 3D shapes for computer aided design Architecture Interior design 3D shapes for information visualization

More information

Computer Vision Lecture 17

Computer Vision Lecture 17 Announcements Computer Vision Lecture 17 Epipolar Geometry & Stereo Basics Seminar in the summer semester Current Topics in Computer Vision and Machine Learning Block seminar, presentations in 1 st week

More information

ECE-161C Cameras. Nuno Vasconcelos ECE Department, UCSD

ECE-161C Cameras. Nuno Vasconcelos ECE Department, UCSD ECE-161C Cameras Nuno Vasconcelos ECE Department, UCSD Image formation all image understanding starts with understanding of image formation: projection of a scene from 3D world into image on 2D plane 2

More information

Local cues and global constraints in image understanding

Local cues and global constraints in image understanding Local cues and global constraints in image understanding Olga Barinova Lomonosov Moscow State University *Many slides adopted from the courses of Anton Konushin Image understanding «To see means to know

More information

Computer Vision I - Basics of Image Processing Part 1

Computer Vision I - Basics of Image Processing Part 1 Computer Vision I - Basics of Image Processing Part 1 Carsten Rother 28/10/2014 Computer Vision I: Basics of Image Processing Link to lectures Computer Vision I: Basics of Image Processing 28/10/2014 2

More information

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth Common Classification Tasks Recognition of individual objects/faces Analyze object-specific features (e.g., key points) Train with images from different viewing angles Recognition of object classes Analyze

More information

Tracking wrapup Course recap. Announcements

Tracking wrapup Course recap. Announcements Tracking wrapup Course recap Tuesda, Dec 1 Announcements Pset 4 grades and solutions available toda Reminder: Pset 5 due 12/4, extended to 12/8 if needed Choose between Section I (short answers) and II

More information

Lecture 12 Recognition. Davide Scaramuzza

Lecture 12 Recognition. Davide Scaramuzza Lecture 12 Recognition Davide Scaramuzza Oral exam dates UZH January 19-20 ETH 30.01 to 9.02 2017 (schedule handled by ETH) Exam location Davide Scaramuzza s office: Andreasstrasse 15, 2.10, 8050 Zurich

More information

Deep Incremental Scene Understanding. Federico Tombari & Christian Rupprecht Technical University of Munich, Germany

Deep Incremental Scene Understanding. Federico Tombari & Christian Rupprecht Technical University of Munich, Germany Deep Incremental Scene Understanding Federico Tombari & Christian Rupprecht Technical University of Munich, Germany C. Couprie et al. "Toward Real-time Indoor Semantic Segmentation Using Depth Information"

More information

Image-Based Modeling and Rendering

Image-Based Modeling and Rendering Image-Based Modeling and Rendering Richard Szeliski Microsoft Research IPAM Graduate Summer School: Computer Vision July 26, 2013 How far have we come? Light Fields / Lumigraph - 1996 Richard Szeliski

More information

12/3/2009. What is Computer Vision? Applications. Application: Assisted driving Pedestrian and car detection. Application: Improving online search

12/3/2009. What is Computer Vision? Applications. Application: Assisted driving Pedestrian and car detection. Application: Improving online search Introduction to Artificial Intelligence V22.0472-001 Fall 2009 Lecture 26: Computer Vision Rob Fergus Dept of Computer Science, Courant Institute, NYU Slides from Andrew Zisserman What is Computer Vision?

More information

CS5670: Computer Vision

CS5670: Computer Vision CS5670: Computer Vision Noah Snavely Lecture 4: Harris corner detection Szeliski: 4.1 Reading Announcements Project 1 (Hybrid Images) code due next Wednesday, Feb 14, by 11:59pm Artifacts due Friday, Feb

More information

Computer Vision I - Appearance-based Matching and Projective Geometry

Computer Vision I - Appearance-based Matching and Projective Geometry Computer Vision I - Appearance-based Matching and Projective Geometry Carsten Rother 01/11/2016 Computer Vision I: Image Formation Process Roadmap for next four lectures Computer Vision I: Image Formation

More information

Shadows in the graphics pipeline

Shadows in the graphics pipeline Shadows in the graphics pipeline Steve Marschner Cornell University CS 569 Spring 2008, 19 February There are a number of visual cues that help let the viewer know about the 3D relationships between objects

More information

Miniature faking. In close-up photo, the depth of field is limited.

Miniature faking. In close-up photo, the depth of field is limited. Miniature faking In close-up photo, the depth of field is limited. http://en.wikipedia.org/wiki/file:jodhpur_tilt_shift.jpg Miniature faking Miniature faking http://en.wikipedia.org/wiki/file:oregon_state_beavers_tilt-shift_miniature_greg_keene.jpg

More information

3D Computer Vision. Introduction. Introduction. CSc I6716 Fall Instructor: Zhigang Zhu City College of New York

3D Computer Vision. Introduction. Introduction. CSc I6716 Fall Instructor: Zhigang Zhu City College of New York Introduction CSc I6716 Fall 2010 3D Computer Vision Introduction Instructor: Zhigang Zhu City College of New York zzhu@ccny.cuny.edu Course Information Basic Information: Course participation Books, notes,

More information

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview Human Body Recognition and Tracking: How the Kinect Works Kinect RGB-D Camera Microsoft Kinect (Nov. 2010) Color video camera + laser-projected IR dot pattern + IR camera $120 (April 2012) Kinect 1.5 due

More information

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006,

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006, School of Computer Science and Communication, KTH Danica Kragic EXAM SOLUTIONS Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006, 14.00 19.00 Grade table 0-25 U 26-35 3 36-45

More information

Digital Images. Kyungim Baek. Department of Information and Computer Sciences. ICS 101 (November 1, 2016) Digital Images 1

Digital Images. Kyungim Baek. Department of Information and Computer Sciences. ICS 101 (November 1, 2016) Digital Images 1 Digital Images Kyungim Baek Department of Information and Computer Sciences ICS 101 (November 1, 2016) Digital Images 1 iclicker Question I know a lot about how digital images are represented, stored,

More information

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5) Announcements Stereo Project 2 due today Project 3 out today Single image stereogram, by Niklas Een Readings Szeliski, Chapter 10 (through 10.5) Public Library, Stereoscopic Looking Room, Chicago, by Phillips,

More information

CS 534: Computer Vision Segmentation and Perceptual Grouping

CS 534: Computer Vision Segmentation and Perceptual Grouping CS 534: Computer Vision Segmentation and Perceptual Grouping Spring 2005 Ahmed Elgammal Dept of Computer Science CS 534 Segmentation - 1 Where are we? Image Formation Human vision Cameras Geometric Camera

More information

Visualization 2D-to-3D Photo Rendering for 3D Displays

Visualization 2D-to-3D Photo Rendering for 3D Displays Visualization 2D-to-3D Photo Rendering for 3D Displays Sumit K Chauhan 1, Divyesh R Bajpai 2, Vatsal H Shah 3 1 Information Technology, Birla Vishvakarma mahavidhyalaya,sumitskc51@gmail.com 2 Information

More information

Segmentation by Clustering Reading: Chapter 14 (skip 14.5)

Segmentation by Clustering Reading: Chapter 14 (skip 14.5) Segmentation by Clustering Reading: Chapter 14 (skip 14.5) Data reduction - obtain a compact representation for interesting image data in terms of a set of components Find components that belong together

More information

Announcements. Introduction. Why is this hard? What is Computer Vision? We all make mistakes. What do you see? Class Web Page is up:

Announcements. Introduction. Why is this hard? What is Computer Vision? We all make mistakes. What do you see? Class Web Page is up: Announcements Introduction Computer Vision I CSE 252A Lecture 1 Class Web Page is up: http://www.cs.ucsd.edu/classes/wi05/cse252a/ Assignment 0: Getting Started with Matlab is posted to web page, due 1/13/04

More information

Computer Vision and Virtual Reality. Introduction

Computer Vision and Virtual Reality. Introduction Computer Vision and Virtual Reality Introduction Tomáš Svoboda, svoboda@cmp.felk.cvut.cz Czech Technical University in Prague, Center for Machine Perception http://cmp.felk.cvut.cz Last update: October

More information

Segmentation by Clustering. Segmentation by Clustering Reading: Chapter 14 (skip 14.5) General ideas

Segmentation by Clustering. Segmentation by Clustering Reading: Chapter 14 (skip 14.5) General ideas Reading: Chapter 14 (skip 14.5) Data reduction - obtain a compact representation for interesting image data in terms of a set of components Find components that belong together (form clusters) Frame differencing

More information

Stereo: Disparity and Matching

Stereo: Disparity and Matching CS 4495 Computer Vision Aaron Bobick School of Interactive Computing Administrivia PS2 is out. But I was late. So we pushed the due date to Wed Sept 24 th, 11:55pm. There is still *no* grace period. To

More information

Combining Appearance and Structure from Motion Features for Road Scene Understanding

Combining Appearance and Structure from Motion Features for Road Scene Understanding STURGESS et al.: COMBINING APPEARANCE AND SFM FEATURES 1 Combining Appearance and Structure from Motion Features for Road Scene Understanding Paul Sturgess paul.sturgess@brookes.ac.uk Karteek Alahari karteek.alahari@brookes.ac.uk

More information

a real convergence computer vision finally becoming useful

a real convergence computer vision finally becoming useful Introduce the field a real convergence computer vision finally becoming useful Pikachu Nintendo - http://www.freelargeimages.com/wpcontent/uploads/2014/12/pikachu_04.png HoloLens Microsoft - http://www.glassappsource.com/wpcontent/uploads/2016/01/hololens.jpg

More information

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science. Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description

More information

Lecture 16 Segmentation and Scene understanding

Lecture 16 Segmentation and Scene understanding Lecture 16 Segmentation and Scene understanding Introduction! Mean-shift! Graph-based segmentation! Top-down segmentation! Silvio Savarese Lecture 15 -! 3-Mar-14 Segmentation Silvio Savarese Lecture 15

More information

Computer Vision : Exercise 4 Labelling Problems

Computer Vision : Exercise 4 Labelling Problems Computer Vision Exercise 4 Labelling Problems 13/01/2014 Computer Vision : Exercise 4 Labelling Problems Outline 1. Energy Minimization (example segmentation) 2. Iterated Conditional Modes 3. Dynamic Programming

More information

CSCD18: Computer Graphics. Instructor: Leonid Sigal

CSCD18: Computer Graphics. Instructor: Leonid Sigal CSCD18: Computer Graphics Instructor: Leonid Sigal CSCD18: Computer Graphics Instructor: Leonid Sigal (call me Leon) lsigal@utsc.utoronto.ca www.cs.toronto.edu/~ls/ Office: SW626 Office Hour: M, 12-1pm?

More information

Contexts and 3D Scenes

Contexts and 3D Scenes Contexts and 3D Scenes Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem Administrative stuffs Final project presentation Dec 1 st 3:30 PM 4:45 PM Goodwin Hall Atrium Grading Three

More information

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc.

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc. The Hilbert Problems of Computer Vision Jitendra Malik UC Berkeley & Google, Inc. This talk The computational power of the human brain Research is the art of the soluble Hilbert problems, circa 2004 Hilbert

More information

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching Stereo Matching Fundamental matrix Let p be a point in left image, p in right image l l Epipolar relation p maps to epipolar line l p maps to epipolar line l p p Epipolar mapping described by a 3x3 matrix

More information

CS595:Introduction to Computer Vision

CS595:Introduction to Computer Vision CS595:Introduction to Computer Vision Instructor: Qi Li Instructor Course syllabus E-mail: qi.li@cs.wku.edu Office: TCCW 135 Office hours MW: 9:00-10:00, 15:00-16:00 T: 9:00-12:00, 14:00-16:00 F: 9:00-10:00

More information

Dense 3D Reconstruction. Christiano Gava

Dense 3D Reconstruction. Christiano Gava Dense 3D Reconstruction Christiano Gava christiano.gava@dfki.de Outline Previous lecture: structure and motion II Structure and motion loop Triangulation Wide baseline matching (SIFT) Today: dense 3D reconstruction

More information

Why study Computer Vision?

Why study Computer Vision? Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications building representations of the 3D world from pictures automated surveillance (who s doing what)

More information

Capturing, Modeling, Rendering 3D Structures

Capturing, Modeling, Rendering 3D Structures Computer Vision Approach Capturing, Modeling, Rendering 3D Structures Calculate pixel correspondences and extract geometry Not robust Difficult to acquire illumination effects, e.g. specular highlights

More information

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman Stereo 11/02/2012 CS129, Brown James Hays Slides by Kristen Grauman Multiple views Multi-view geometry, matching, invariant features, stereo vision Lowe Hartley and Zisserman Why multiple views? Structure

More information

A Simple Vision System

A Simple Vision System Chapter 1 A Simple Vision System 1.1 Introduction In 1966, Seymour Papert wrote a proposal for building a vision system as a summer project [4]. The abstract of the proposal starts stating a simple goal:

More information

Region-based Segmentation and Object Detection

Region-based Segmentation and Object Detection Region-based Segmentation and Object Detection Stephen Gould Tianshi Gao Daphne Koller Presented at NIPS 2009 Discussion and Slides by Eric Wang April 23, 2010 Outline Introduction Model Overview Model

More information

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching Stereo Matching Fundamental matrix Let p be a point in left image, p in right image l l Epipolar relation p maps to epipolar line l p maps to epipolar line l p p Epipolar mapping described by a 3x3 matrix

More information

Multiple Model Estimation : The EM Algorithm & Applications

Multiple Model Estimation : The EM Algorithm & Applications Multiple Model Estimation : The EM Algorithm & Applications Princeton University COS 429 Lecture Nov. 13, 2007 Harpreet S. Sawhney hsawhney@sarnoff.com Recapitulation Problem of motion estimation Parametric

More information