Seminar. Topic: Object and character Recognition

Similar documents
Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

HCR Using K-Means Clustering Algorithm

Handwritten Hindi Numerals Recognition System

OCR For Handwritten Marathi Script

Exploratory Data Analysis using Self-Organizing Maps. Madhumanti Ray

DATABASE DEVELOPMENT OF HISTORICAL DOCUMENTS: SKEW DETECTION AND CORRECTION

LECTURE 6 TEXT PROCESSING

6. Applications - Text recognition in videos - Semantic video analysis

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

DESIGNING A REAL TIME SYSTEM FOR CAR NUMBER DETECTION USING DISCRETE HOPFIELD NETWORK

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

Extraction and Recognition of Alphanumeric Characters from Vehicle Number Plate

IMPLEMENTING ON OPTICAL CHARACTER RECOGNITION USING MEDICAL TABLET FOR BLIND PEOPLE

INTERNATIONAL RESEARCH JOURNAL OF MULTIDISCIPLINARY STUDIES

Radial Basis Function Networks: Algorithms

II. WORKING OF PROJECT

Clustering and Visualisation of Data

A Review on Plant Disease Detection using Image Processing

2009 International Conference on Emerging Technologies

Skew Detection and Correction of Document Image using Hough Transform Method

OPTICAL CHARACTER RECOGNITION FOR VIETNAMESE SCANNED TEXT

Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity

Vision. OCR and OCV Application Guide OCR and OCV Application Guide 1/14

In this assignment, we investigated the use of neural networks for supervised classification

Indian Currency Recognition Based on ORB

Handwritten Script Recognition at Block Level

An Efficient Character Segmentation Based on VNP Algorithm

Unsupervised Learning : Clustering

Scene Text Detection Using Machine Learning Classifiers

Segmentation algorithm for monochrome images generally are based on one of two basic properties of gray level values: discontinuity and similarity.

Offline Signature verification and recognition using ART 1

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

How to...create a Video VBOX Gauge in Inkscape. So you want to create your own gauge? How about a transparent background for those text elements?

One type of these solutions is automatic license plate character recognition (ALPR).

DATA EMBEDDING IN TEXT FOR A COPIER SYSTEM

Topic 7 Machine learning

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015

CITS 4402 Computer Vision

Pixels. Orientation π. θ π/2 φ. x (i) A (i, j) height. (x, y) y(j)

[Kaur*, 5(2): February, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Comparative Study of Hand Gesture Recognition Techniques

SKEW DETECTION AND CORRECTION

Types of image feature and segmentation

Unsupervised learning in Vision

REAL TIME BRAILLE TRANSLATION. Andrew Petersen, Logan Schuelke, Marcus Turner

Feature Extraction from Wavelet Coefficients for Pattern Recognition Tasks. Rajat Aggarwal Chandu Sharvani Koteru Gopinath

Carmen Alonso Montes 23rd-27th November 2015

Automated Digital Conversion of Hand-Drawn Plots

Clustering Color/Intensity. Group together pixels of similar color/intensity.

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

What to come. There will be a few more topics we will cover on supervised learning

Robust PDF Table Locator

Practice Exam Sample Solutions

International Journal of Advance Research in Engineering, Science & Technology

A Review of Skew Detection Techniques for Document

How to use the open-access scanners 1. Basic instructions (pg 2) 2. How to scan a document and perform OCR (pg 3 7) 3. How to scan a document and

Mobile Application with Optical Character Recognition Using Neural Network

Layout Segmentation of Scanned Newspaper Documents

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

5/15/16. Computational Methods for Data Analysis. Massimo Poesio UNSUPERVISED LEARNING. Clustering. Unsupervised learning introduction

Skeletonization Algorithm for Numeral Patterns

Handwritten Text Recognition

ELEC Dr Reji Mathew Electrical Engineering UNSW

CAMERA METERS & HOW THEY WORK. Michael Kellogg

Skew Detection Technique for Binary Document Images based on Hough Transform

RULE BASED SIGNATURE VERIFICATION AND FORGERY DETECTION

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

SECTION E: DOCUMENT DIGITIZATION

An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique

Topic 6 Representation and Description

Kapitel 4: Clustering

IJIRST International Journal for Innovative Research in Science & Technology Volume 1 Issue 10 March 2015 ISSN (online):

Segmentation of Characters of Devanagari Script Documents

A Document Image Analysis System on Parallel Processors

Lecture 12 Recognition

A Technique for Classification of Printed & Handwritten text

Image Normalization and Preprocessing for Gujarati Character Recognition

3-D MRI Brain Scan Classification Using A Point Series Based Representation

LICENSE PLATE RECOGNITION FOR TOLL PAYMENT APPLICATION

A Hierarchical Pre-processing Model for Offline Handwritten Document Images

Effects Of Shadow On Canny Edge Detection through a camera

Data Mining. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of Computer Science

Automatic Detection of Change in Address Blocks for Reply Forms Processing

K S Prasanna Kumar et al,int.j.computer Techology & Applications,Vol 3 (1),

A Multimodal Framework for the Recognition of Ancient Tamil Handwritten Characters in Palm Manuscript Using Boolean Bitmap Pattern of Image Zoning

Processing of binary images

11. Image Data Analytics. Jacobs University Visualization and Computer Graphics Lab

CS231A Section 6: Problem Set 3

Case-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.

CS 188: Artificial Intelligence Fall 2008

Large Scale Data Analysis Using Deep Learning

Computer Vision 5 Segmentation by Clustering

Learning to Learn: additional notes

Countermeasure for the Protection of Face Recognition Systems Against Mask Attacks

Advanced Image Processing, TNM034 Optical Music Recognition

Information Retrieval and Web Search Engines

A Generalized Method to Solve Text-Based CAPTCHAs

DTU M.SC. - COURSE EXAM Revised Edition

In this lesson we are going to review some of the most used scanning devices.

Transcription:

Seminar Topic: Object and character Recognition Tse Ngang Akumawah Lehrstuhl für Praktische Informatik 3

Table of content What's OCR? Areas covered in OCR Procedure Where does clustering come in Neural Networks K-means etc Application Examples Briefing on Object Recognition

Motivation Scanning Payment order Name: John Mustermann KNr: xyzabc Bank: MarxBank OCR-System Output

The different areas covered in character recognition.

The different areas covered in character recognition.

The different areas covered in character recognition. 1. Mechanical character recognition 2. Magnetic character recognition 3. Optical character 1. On-line character recognition 2. Fixed font character recognition 3. Handwritten character recognition 4. Script Recognition => Seminar concentrates on OCR

What s character recognition all about? multiple layer procedure

Preprocessing!!! Scanning / photographing N.B. minimal variation from the original document. Content extraction using horizontal and vertical lines. rotation by necessity Binary conversion 0 s = black spots 1 s = white spots

Character segmentation!!! Example based on 0 for black pixels and 1 for white s Analyse line after line For each line column of 1 s => disjoint characters the lager the number of columns, clearer the demarcation (see fig)

Character segmentation!!!

Character segmentation!!! After identification of distinct character form new matrices with each character empty spaces are filled up with 1 s nominalise and resize each character matrix By necessity smoothen the characters. (see fig)

Character segmentation!!!

Feature extraction!!! Analyse each character separately. 1. Statistical Features 1. number of black pixels in a line(zoning) 2. total mass(nr of pixels) 3. etc 2. Structural Features joining points centroid end points strokes etc Other ALGORITHMS Hough Transformation Fourier Transformation

Statistical feature extraction

Structural feature extraction

Where does clustering come in? Is OCR really unsupervised? Objective of clustering: partitioning the sample set into subsets. OCR systems before the training are unsupervised Problems: 1. optimal partition 2. ideal number of partitions Character of Clustering techniques for OCR systems 1. large amount of sample data. 2. small number of prototypes

Finding the optimal partition for a given number of prototypes NEURAL NETWORKS

Finding the optimal partition for a given number of prototypes NEURAL NETWORKS curve-1 length centre

Finding the optimal partition for a given number of prototypes K-MEANS The K mean technique some kind of like iteratively process 1. for a start, one could randomly pick up such prototypes(cluster centres) 2. For each prototype clusters are build for the remaining spots in the graph 3. When the clusters have been form we then look for the ideal(mean) prototype for each cluster 4. repeat step 1 3 till the changes(variance) in prototype is below a certain range 5. In this way we get the optimal cluster

Finding the optimal partition for a given number of prototypes Others Nearest Neighbour Analysis Mean squared error. (MSE) N.B. Most commercial OCR systems don t use just one clustering technique but a combination of two or more depending on the area of usage.

What's left? So far build up clusters identify characters possible? TRAINING

Real life application of character recognition systems 1. For Data Entry. - e.g. in banks 2. For Text entry. - e.g. in newspapers 3. Process automation. -e.g. in post offices. 4. Readers for the blind

Examples of existing systems Iris Reading System(Visuaide 2000 Inc.) Arkenstone Readers(Arkenstone Inc.) Reading Advant Edge(Xerox Imaging Systems. ) etc.

Summary

What s Object recognition all about? similar to OCR Differences More complex more clusters needed demarcations of an object Input Camera Moving Other Algorithms needed colour histograms gray scale histograms Edge tracing using Graph approach Least cost Trajectory Region growing etc.

Motivation Control Traffic sign recogniton

The End