CSIS. Computer Vision. Prof. Sung-Hyuk Cha Fall of School of Computer Science & Information Systems. Artificial Intelligence CSIS

Similar documents
CSIS. Pattern Recognition. Prof. Sung-Hyuk Cha Fall of School of Computer Science & Information Systems. Artificial Intelligence CSIS

Data Mining and Analytics

2. Basic Task of Pattern Classification

Representing structural patterns: Reading Material: Chapter 3 of the textbook by Witten

Summary. Machine Learning: Introduction. Marcin Sydow

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

Machine Learning 13. week

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

Online Signature Verification Technique

Computer Graphics and Image Processing

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

CS 4510/9010 Applied Machine Learning. Neural Nets. Paula Matuszek Fall copyright Paula Matuszek 2016

CSIS Introduction to Pattern Classification. CSIS Introduction to Pattern Classification 1

Machine Learning Chapter 2. Input

Nominal Data. May not have a numerical representation Distance measures might not make sense. PR and ANN

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852

Image segmentation. Václav Hlaváč. Czech Technical University in Prague

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A.

Data Mining and Machine Learning: Techniques and Algorithms

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Basic Concepts Weka Workbench and its terminology

Digital Image Fundamentals

Practical Data Mining COMP-321B. Tutorial 1: Introduction to the WEKA Explorer

Hybrid Biometric Person Authentication Using Face and Voice Features

ECG782: Multidimensional Digital Signal Processing

Face Detection Using Convolutional Neural Networks and Gabor Filters

Image Processing, Analysis and Machine Vision

DD2423 Image Analysis and Computer Vision IMAGE FORMATION. Computational Vision and Active Perception School of Computer Science and Communication

Lecture 4: Spatial Domain Transformations

COMPUTER VISION. Dr. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A.

Introduction. Introduction. Related Research. SIFT method. SIFT method. Distinctive Image Features from Scale-Invariant. Scale.

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

Applications Video Surveillance (On-line or off-line)

Bayes Risk. Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Discriminative vs Generative Models. Loss functions in classifiers

Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks

Data Mining Practical Machine Learning Tools and Techniques

Stacked Denoising Autoencoders for Face Pose Normalization

Scale Invariant Feature Transform

Classifiers for Recognition Reading: Chapter 22 (skip 22.3)

Nominal Data. May not have a numerical representation Distance measures might not make sense PR, ANN, & ML

CS4733 Class Notes, Computer Vision

Robotics Programming Laboratory

Face Detection using Hierarchical SVM

CPSC 695. Geometric Algorithms in Biometrics. Dr. Marina L. Gavrilova

Computer Vision with MATLAB MATLAB Expo 2012 Steve Kuznicki

User Signature Identification and Image Pixel Pattern Verification

ECE 172A: Introduction to Intelligent Systems: Machine Vision, Fall Midterm Examination

Matlab Workshop 2008

Lecture 12 Recognition

Input: Concepts, Instances, Attributes

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

Local Image Registration: An Adaptive Filtering Framework

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006,

Supervised Sementation: Pixel Classification

Perception. Autonomous Mobile Robots. Sensors Vision Uncertainties, Line extraction from laser scans. Autonomous Systems Lab. Zürich.

Announcements. CS 188: Artificial Intelligence Spring Generative vs. Discriminative. Classification: Feature Vectors. Project 4: due Friday.

Naïve Bayes Classification. Material borrowed from Jonathan Huang and I. H. Witten s and E. Frank s Data Mining and Jeremy Wyatt and others

Lecture 12 Recognition. Davide Scaramuzza

Case-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.

CS 188: Artificial Intelligence Fall 2008

ENGR3390: Robotics Fall 2009

A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points

Digital Image Processing COSC 6380/4393

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Multimedia Information Retrieval

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

Fundamentals of Digital Image Processing

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

A Systematic Overview of Data Mining Algorithms. Sargur Srihari University at Buffalo The State University of New York

Spatial Frequency Domain Methods for Face and Iris Recognition

Machine Learning in Biology

CS 584 Data Mining. Classification 1

Babu Madhav Institute of Information Technology Years Integrated M.Sc.(IT)(Semester - 7)

Instance-Based Representations. k-nearest Neighbor. k-nearest Neighbor. k-nearest Neighbor. exemplars + distance measure. Challenges.

Computer and Machine Vision

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

3D object recognition used by team robotto

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

WEKA: Practical Machine Learning Tools and Techniques in Java. Seminar A.I. Tools WS 2006/07 Rossen Dimov

Detecting and Identifying Moving Objects in Real-Time

Problem definition Image acquisition Image segmentation Connected component analysis. Machine vision systems - 1

IMAGE SEGMENTATION. Václav Hlaváč

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

Computer Vision I - Filtering and Feature detection

CP467 Image Processing and Pattern Recognition

PATTERN CLASSIFICATION AND SCENE ANALYSIS

Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition

Sensor Modalities. Sensor modality: Different modalities:

A Hybrid Face Detection System using combination of Appearance-based and Feature-based methods

cse 252c Fall 2004 Project Report: A Model of Perpendicular Texture for Determining Surface Geometry

Bus Detection and recognition for visually impaired people

Chapter 3 Image Registration. Chapter 3 Image Registration

Implementing the Scale Invariant Feature Transform(SIFT) Method

Data Mining Practical Machine Learning Tools and Techniques

Motion and Tracking. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Handwritten Script Recognition at Block Level

Transcription:

Computer Vision Prof. Sung-Hyuk Cha Fall of 2002 School of Computer Science & Information Systems Artificial Intelligence 1

Perception Lena & Computer vision 2

Machine Vision Pattern Recognition Applications 3

Iris authentication 4

Face Recognition System Each person has different faces. Face Recognition System Query? Face DB 5

Head Pose Recognition left strt rght up Complex Pattern Recognition Applications Sargur N. Srihari 520 Lee Entrance STE 202 f 5 primary number f 6 street name f 2 state abbr. f 7 secondary designator abbr. f 3 5-digit ZIP Code f 8 secondary number Amherst NY 14228-2583 f 1 city name Delivery point: 142282583 f 4 4-digit ZIP+4 add-on 6

Speech Recognition System ( 1 2 x x x x x x x x x x x x x x x x x x f, f,..., f ) f, f,..., f ) f, f,..., f ) f, f,..., f ) f, f,..., f ) f, f,..., f ) d ( 1 2 d ( 1 2 Applications d ( 1 2 d ( 1 2 d ( 1 2 d LCD Pen tablet Microphone Digital Camera biomouse Fingerprint scanner Vital Sign monitor 7

Measurements brightness, length Salmon1 = ( 12, 16 ) Salmon2 = ( 11, 20 ) Salmon Bass1 = ( 7, 6 ) Bass2 = ( 3, 4 ) Truth features Bass 8

9

Decision theory (cost) 10

Distributions and Errors Salmon Decision boundary Bass identified as salmon salmon identified as bass Bass Parametric Univariate Dichotomizer (a) length (b) lightness (c) width (a) (b) (c) Type I 9 % 7 % 5 % Type II 39 % 27 % 26 % 11

Multivariate Analysis Nearest Neighbor Classifier bright ness? = salmon Salmon Bass length 12

Nearest Neighbor Classifier reference set testing set R = r1 = 5, 7 r2 = 3, 8 r3 = 10, 16 r4 = 12, 14 Bass Bass Salmon Salmon q = 4, 6 T = t1 = 4, 6 t2 = 2, 5 t3 = 11, 17 t4 = 14, 12 Bass Bass Salmon Salmon Bass Bass Salmon Bass rn = 14, 15 Salmon tn = 14, 17 Salmon Bass too slow for users to wait for the output. Performance is evaluated by using a testing set. Machine Learning (Linear function) bright ness? = salmon Salmon Bass Y > ax + b length 13

Artificial Neural Network synapse x 1 (t) nucleus axon x 2 (t) w 2 w 1 Σ a(t) y y=f(a) a O(t+1) w 0 dendrites x n (t) w n the biological neuron the artificial neuron Machine Learning (Linear function) reference training set set testing set R = r1 = 5, 7 r2 = 3, 8 r3 = 10, 16 r4 = 12, 14 Bass Bass Salmon Salmon Y > ax + b T = t1 = 4, 6 t2 = 2, 5 t3 = 11, 17 t4 = 14, 12 Bass Bass Salmon Salmon Bass Bass Salmon Bass rn = 14, 15 Salmon tn = 14, 17 Salmon Bass extremely fast. No need to load the training data during the classification. Performance is evaluated by using a testing set. Performance is not as good as the NN classifier s. 14

Non-Linear case Salmon bright ness Bass Y > ax + b length Non-Linear case Salmon bright ness Bass length NN is better. will learn artificial neural network which is non-linear function. 15

Human Brain Artificial Neural Network f 1 f 2 f 3 f 4 Class f 5 f 6 f 7 Fully Connected, feed forward, back-propagation multi-layer Artificial neural network (11-6-1) (ANN). 16

17

Purpose of Pattern Recognition Predict unseen future instance. Generalization. Inductive step. width Generalizability (statistical inferece) width training set universe length length width validating set length 18

Inferential Statistics 1. Inferential Statistics is inferring a conclusion about population of interest from a sample. - need a procedure for sampling the population. - need a measure of reliability for the inference. 2. If error rate in a random sample set is the same as in universe, then the procedure is a sound inferential statistical procedure. 3. If error rate in one random sample set is the same as in another random sample set, then the procedure is sound. Generalization δf2 δf1 Universe 19

Sampling & learning δf2 δf1 Sample 1 Testing on another sample δf2 δf1 Sample 2 20

Generalization δf2 δf1 Universe Multiple classification f2 class 1 class 3 class 2 f1 Classes = {class 1, 2, 3} 21

22

Template for PR Applications 1. Data acquisition: a. Recruit subjects. b. Modality interface (Scanning, picturing, recording, etc). 2. Feature Extraction: a. Raw data to feature vectors. b. Involves image/ voice/ signal processing techniques. 3. Training a classifier: a. Design a classifier (e.g., ANN). b. Enter the training (& validating) feature vector set(s). 4. Classification system: a. embed the ANN engine to your actual program (Java/C) b. User interface for the Final Product. Further Pattern Recognition http://www.csis.pace.edu/~scha/pr Fast Nearest Neighbor Search Algorithms Decision Tree Statistical Pattern Recognition. Artificial Neural Network. Clustering etc. 23

Decision Tree outlook temperature humidity windy play sunny hot high false no sunny hot high true no overcast hot high false yes rainy mild high false yes rainy cool normal false yes rainy cool normal true no overcast cool normal true yes sunny mild high false no sunny cool normal false yes rainy mild normal false yes sunny mild normal true yes overcast mild high true yes overcast hot normal false yes rainy mild high true no Clustering (a) d a k g e j h f i c b (b) d a k g e j c h i f b (c) 1 2 3 (d) a b c... 0.4 0.1 0.5 0.1 0.8 0.1 0.3 0.3 0.4 g a c i e d k b j f h 24

Terminology Classification: The process of assigning one of a limited set of alternative interpretations to (the generator of) a set of data. Often requires the steps of the computation of relative probabilities (or a quantity related to them) followed by the application of a decision rule. All classification processes can be evaluated in terms of "detection" and "misclassification" rates. Terminology Computer Vision: Compter Vision is the subject area which deals with the automatic analysis of images for the purposes of quantification or system control (often mimicking tasks which humans find trivial). It is to be distinguised from "Image Processing" which deals only with the computational processes applied to images, including enhancement and compression, but does not deal with abstract representation for the purposes of reasoning and interpretation. Compter Vision can be seen as the inverse of Computer Graphics, though generally the representations and methods of this area are not of use in Computer Vision due to the incomplete and therefore ambiguous nature of images. This requires prior knowledge to be used in order to obtain robust scene interpretation. 25

Terminology Machine Vision: Like "computer vision" but generally more closely associated with its use in robotics. Pattern Recognition Pattern recognition is the process of assigning a pattern classification to a particular set of measurements, normally represented as a high dimensional vector. This is normally done within the context of "probability theory", whereby a particular set of assumptions regarding the expected statistical distribution of measurements is used to compute classification probabilities which can be used as the basis for a decision such as the "Bayes decision rule". There are several popular forms of classifier including "k-nearest neighbour", "parzen windows", "mixture methods" and more recently "artificial neural networks". Terminology Images: An image is two dimensional spatial representation of a group of "objects" (or "scene") which exists in two or more dimensions. It is an intuitive way of presenting data for computer interfaces in the area of graphics, but in machine vision it may be defined as a continuous function of two variables defined within a bounded (generally rectangular) region. Histograms A histogram is an array of non negative integer counts from a set of data, which represents the frequency of occurance of values within a set of non-overlapping regions. 26

Features & Class Features class dark blob hole slant width skew ht pixel hslope nslope pslope vslope int int int real int real int int int int int int.95.49.70.71.50.10.51.92.13.47.32.21.94.49.75.70.50.11.53.84.26.54.35.18.94.49.67.74.50.10.45.85.23.48.32.22.93.72.33.47.50.21.28.30.66.60.42.10.93.74.33.48.50.22.26.30.60.59.45.10.93.79.36.54.50.18.27.32.60.59.52.09 S S S S S S.92.30.61.66.60.11.35.49.70.71.57.10.94.42.72.66.60.11.32.49.67.74.53.10.94.40.75.67.60.12.34.49.75.70.54.11.96.30.60.59.50.10.21.30.66.60.36.10.95.32.60.59.50.09.22.30.60.59.39.10.95.30.66.60.50.10.21.32.60.59.34.09 B B B B B B Representation length a = (12,6,-5) b = (16,9,10) c = (19,7,-10) 40 35 30 25 20 15 10 5 0-5 -10 10 12 width a 14 16 18 20 22 b c 24 lightness 5 6 7 8 9 10 27

Image Classification??? Image Indexing & Retrieval 28

Query by Image Content? Acute myeloid leukemia Acute myeloid leukemia Acute myeloid leukemia Dissimilarity (distance) / Similarity D( ) =?, S( ) =?, 29

Overview Image processing vs. computer vision Human vision & illusion. Basic Image Processing Machine Vision Applications. Histogram based Image Indexing & Retrieval. Digital Image Processing vs. Computer Vision There are no clear distinction Image processing Applications where humans are in the loop. Humans supply the intelligence Image Analysis - extracting quantitative info. Size of a tumor distance between objects facial expression Image restoration. Try to undo damage needs a model of how the damage was made Image enhancement. Try to improve the quality of an image Image compression. How to convey the most amount of information with the least amount of data 30

Digital Image Processing vs. Computer Vision Computer Vision Take the human out of the loop The computer supplies the intelligence Where does the computer get it s intelligence? Human Vision 31

Cerebral Cortex Human Vision Monocular Visual Field: 160 deg (w) X 175 deg (h) Binocular Visual Field: 200 deg (w) X 135 deg (h) 32

The figure-ground Problem The Bunny/Duck illusion Mouth Mouth 33

More illusions Squares or lines? More illusions: How many colors? 34

More illusions: parallel line More illusions 35

More illusions More illusions 36

More illusions Photometry Concerned with mechanisms for converting light energy into electrical energy. World Optics Sensor Signal Digitizer Digital Representation 37

. Binary image i 1 2 3 4 5 6 7 j 1 2 3 4 5 6 7 Grey image E(x,y) : Electrical video signal Optics Video Camera Image Plane Image L(x,y) A/D Converter and Sampler I(i,j) Digital Image 22 34 22 0 18 Grayscale Image Data Computer Memory 38

. Color image E(x,y) : Electrical video signal Optics Video Camera Image Plane Image L(x,y) B(i,j) G(i,j) R(i,j) Blue Channel Green A/D Channel Converter Red A/D Channel Converter A/D Converter 22 3422 0 18 Digital Image Computer Memory HSL Color Space Lightness Hue (color) Saturation (white) 39

Color Contrast Stretching 40

Linear Stretching 255 OUTPUT 0 INPUT 255 Histogram Equalization Adjust peaks and plains 41

False Color Warping http://www.doctorwarp.com/index.php?id=23&flx=world 42

Compression Mosaics 43

Stereo Stereo vision 44

Noise Removal salt pepper Zooming Important for size invariance 45

Rotation Important for rotation invariance Subtraction 46

Connected Components/ Image Labeling Goal: To find clusters of pixels that are similar and connected to each other How it works: Assign a value to each pixel Define what similar values mean e.g., 10 +/- 2 Determine if like pixels are connected 4- connected 8-connected Connected Components/ Image Labeling 1 1 1 1 1 1 1 0 0 1 1 1 1 1 1 0 1 1 1 2 2 0 0 1 1 2 2 0 0 1 A A A A A A A B B A A A A A A C A A A D D C C A A D D C C A 47

Connected Components/ Image Labeling 1 1 1 1 1 1 1 0 0 1 1 1 1 1 1 0 1 1 1 2 2 0 0 1 1 2 2 0 0 1 A A A A A A A B B A A A A A A B A A A C C B B A A C C B B A Segmentation 48

Segmentation Edge Detection convolution B = 1 1 1 mask 1 + 8 1 1 1 1 49

Handwriting Each person writes differently. Handwriting Analysis Taxonomy Analysis of Handwriting Recognition Examination Personality identification (Graphology) On-line Off-line Writer Identification Writer Verification Natural Writing Forgery Disguised Writing 50

Pattern Recognition The End See U all next week. 51