Making Machines See. Roberto Cipolla Department of Engineering. Research team

Size: px

Start display at page:

Download "Making Machines See. Roberto Cipolla Department of Engineering. Research team"

Irma Wiggins
6 years ago
Views:

1 Making Machines See Roberto Cipolla Department of Engineering Research team

2 Cognitive Systems Engineering Cognitive Systems Engineering

3 Introduction Making machines see Vision: What, Why and How? 3Rs of computer vision: Reconstruction Registration Recognition

4 Registration? Target detection and pose estimation

5 Registration

6 Registration

7 Registration

8 Reconstruction? Recovery of 3D shape from images

9 Reconstruction

10 3D models

11 Recognition?

12 road Recognition image classification categorical object detection horses airplanes background semantic segmentation tree bicycle building grass dog car sky building road

13 Pedestrian detection

14 Introduction Existing applications Traditional research in Computer vision developed for: Visual inspection Medical imaging Remote sensing Surveillance and biometrics Target detection and tracking

15 Introduction New applications Computer vision has now found a place in consumer products Mobile phones and PDAs Games Cars Image and video search Internet and shopping

16 Smart erase on a mobile phone Introduction

17 How to make machines that see?

18 Why not study biology?

19 Perspective

20 Transformations

21 Shape

22 Machine Learning?

23 Machine Learning

24 I. Reconstruction: Recovery of accurate 3D shape from uncalibrated images Cipolla and Blake 1992 Cipolla and Giblin 1999 Mendonca, Wong and Cipolla Vogiatzis, Hernandez and Cipolla

25 Digital Pygmalion Project

26 Digital Pygmalion the myth

27 Scanning technologies Laser range finders Very accurate Very expensive Complicated to use Minolta Michelangelo project MENSI 3D Scanner

28 3D models We need a way to get them that is practical fast non-intrusive low cost

29 Stereo vision 3D point

30 Automatic 3d modeller Camera motion Segmentation of object outline Multi-view volumetric stereo 3D segmentation input images silhouettes visual hull 3D model

31 From images to model capture images of object on top of a coloured sheet

32 From images to model calibrate cameras (i.e. estimate position, pose and focal length of camera in each photo) using pattern on sheet

33 From images to model identify object of interest by using fixation constraint and simultaneous 3D segmentation

34 From images to model construct visuall hull (largest object that can fit inside silhouettes)

35 Photo-consistency Non-photo-consistent point

36 Photo-consistency Photo-consistent point

37 Finding the surface

38 3D segmentation

39 3D Models

40 Gormley - input Images

41 Recovery of camera motion Input images Feature extraction Feature matching Bundle adjustment

42 Probabilistic 3D segmentation

43 3D Models

44 Final installation

45 3D models

46 Multiview photometric stereo Vogiatzis, Hernandez and Cipolla 2006 and 2008

47 Untextured objects Almost impossible to establish correspondence Use shading cue

48 Untextured objects Changing lighting uncovers fine geometric detail Assumptions: Single, distant light-source Silhouettes can be extracted No texture, single colour

49 Image acquisition setup

50 Surface Evolution of 3D Mesh Evolve mesh until it is predicted appearance under recovered illumination matches images

51 3D Models

52 Making physical copies Real Replica

53 3D Models

54 3D Models

55 3D Models

56 3D Models

57 3D Models

58 Deformable objects: Real-time photometric stereo using colour lighting Hernandez et al 2007

59 Photometric stereo with colour

60 Deformable objects? a method for reconstructing a textureless deforming object in 2.5d setup calibration pattern

61 Shape from colour observation: 1-1 mapping between colour and surface orientation get map of surface orientations from colour image integrate orientations to get depth map do this for colour video to get 2.5d reconstruction of deforming object!

62 Photometric stereo with colour

63 shape from colour

64 II. Registration: Target detection and pose estimation

65 Image matching

66 Registration: Where am I? What am I looking at? Johansson and Cipolla 2002 Robertson and Cipolla 2004 Cipolla et al 2004

67 Where I am? Determine pose from single image by matching

68 Register database view

69 Localisation of query view

70 Image-based localisation

71 Image-based localisation

72 Image-based localisation

73 Registration: Ageing infrastructure inspection

74 Registration with concrete Can appear very repetitive to the eye However, plenty of distinguished features can be extracted Very accurate matching is possible

75 Registration with concrete

76 Finding 2D shapes and applications to HCI Stenger, Thayananthan, Torr and Cipolla 2003 Williams, Blake and Cipolla 2003 and 2006 Ramanan, Fitzgibbon and Cipolla

77 Matching shape templates Oriented Oriented Canny Distance Edge Transform Detector

78 Matching shape templates Oriented Chamfer Matching

79 Hand detection system

80 Tracking - 3D mouse

81 Real-time visual controller for Dasher

82 People and pose detection

83 People and pose detection

84 III. Object recognition and machine learning Shotton, Blake and Cipolla Kim, Kittler and Cipolla 2006 Wong and Cipolla 2007

85 road Overview image classification categorical object detection horses airplanes background semantic segmentation tree bicycle building grass dog car sky building road

86 Using interest points and visual words Johnson and Cipolla

87 Image matching

88 Using contour and shape Shotton, Blake and Cipolla

89 Cognitive Systems Engineering Learning and Adaptability : : } : :

90 Object Model F σ p c F = (T, p,,,, a, b)

91 Segmented Unsegmented Shape Exemplar Centroid votes Exemplar Sub-cluster members Centroid votes

95 Recognition in video

96 Making machines see What is vision and how to duplicate it? 3D shape: making digital copies of sculpture from photographs from multiple viewpoints Recognition of a painting/picture from a single photo using a mobile (camera) phone Detection of objects: hands, faces and people and use in novel man-machine interfaces

97 Summary Image registration and matching 3D shape from uncalibrated images. Object detection and tracking

Computer Vision at Cambridge: Reconstruction,Registration and Recognition

Computer Vision at Cambridge: Reconstruction,Registration and Recognition Roberto Cipolla Research team http://www.eng.cam.ac.uk/~cipolla/people.html Cognitive Systems Engineering Cognitive Systems Engineering