Polytechnic University of Tirana

Similar documents
Keras: Handwritten Digit Recognition using MNIST Dataset

Keras: Handwritten Digit Recognition using MNIST Dataset

Tutorial on Machine Learning Tools

ABSTRACT I. INTRODUCTION. Dr. J P Patra 1, Ajay Singh Thakur 2, Amit Jain 2. Professor, Department of CSE SSIPMT, CSVTU, Raipur, Chhattisgarh, India

ECE 5470 Classification, Machine Learning, and Neural Network Review

Machine Learning 13. week

Scaled Machine Learning at Matroid

MACHINE LEARNING CLASSIFIERS ADVANTAGES AND CHALLENGES OF SELECTED METHODS

Character Recognition from Google Street View Images

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

Dynamic Routing Between Capsules

MoonRiver: Deep Neural Network in C++

M. Sc. (Artificial Intelligence and Machine Learning)

Classifying Depositional Environments in Satellite Images

Deep Learning Workshop. Nov. 20, 2015 Andrew Fishberg, Rowan Zellers

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

Image Classification pipeline. Lecture 2-1

POINT CLOUD DEEP LEARNING

MACHINE LEARNING Example: Google search

Sony's deep learning software "Neural Network Libraries/Console and its use cases in Sony

Image Transformation via Neural Network Inversion

Two-Stream Convolutional Networks for Action Recognition in Videos

Image Classification pipeline. Lecture 2-1

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Artificial Intelligence Introduction Handwriting Recognition Kadir Eren Unal ( ), Jakob Heyder ( )

Apparel Classifier and Recommender using Deep Learning

Skin and Face Detection

Yiqi Yan. May 10, 2017

OCR and OCV. Tom Brennan Artemis Vision Artemis Vision 781 Vallejo St Denver, CO (303)

Why data science is the new frontier in software development

2. Basic Task of Pattern Classification

Classification using Weka (Brain, Computation, and Neural Learning)

DECISION SUPPORT SYSTEM USING TENSOR FLOW

A Generalized Method to Solve Text-Based CAPTCHAs

Deep Learning Photon Identification in a SuperGranular Calorimeter

CS5670: Computer Vision

Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Machine Learning Workshop

Deep Learning with Tensorflow AlexNet

INTRODUCTION TO DEEP LEARNING

Big Data Era Time Domain Astronomy

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /10/2017

Note: TAO OCR is a derivitive of the same OCR engine used in Microsoft's "Seeing AI" application!

MATLAB is a multi-paradigm numerical computing environment fourth-generation programming language. A proprietary programming language developed by

Beyond Training The next steps of Machine Learning. Chris /in/chrisparsonsdev

The OpenVX Computer Vision and Neural Network Inference

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Crossing the AI Chasm. What We Learned from building Apache PredictionIO (incubating)

Ensemble methods in machine learning. Example. Neural networks. Neural networks

Vinnie Saini Cloud Solution Architect Big Data & AI

SEMANTIC COMPUTING. Lecture 8: Introduction to Deep Learning. TU Dresden, 7 December Dagmar Gromann International Center For Computational Logic

Back propagation Algorithm:

Chapter 2 Learning Basics and Linear Models

Using Machine Learning to Optimize Storage Systems

Handwritten Hindi Numerals Recognition System

Object Detection Lecture Introduction to deep learning (CNN) Idar Dyrdal

INTRODUCTION TO ARTIFICIAL INTELLIGENCE

NVIDIA DLI HANDS-ON TRAINING COURSE CATALOG

! References: ! Computer eyesight gets a lot more accurate, NY Times. ! Stanford CS 231n. ! Christopher Olah s blog. ! Take ECS 174!

Machine Learning using Matlab. Lecture 3 Logistic regression and regularization

ML 프로그래밍 ( 보충 ) Scikit-Learn

Improving the way neural networks learn Srikumar Ramalingam School of Computing University of Utah

Detection of a Single Hand Shape in the Foreground of Still Images

Multi-Task Self-Supervised Visual Learning

Hand Written Digit Recognition Using Tensorflow and Python

The Kinect Sensor. Luís Carriço FCUL 2014/15

Review: The best frameworks for machine learning and deep learning

Some fast and compact neural network solutions for artificial intelligence applications

Case-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.

CS 188: Artificial Intelligence Fall 2008

Unsupervised Learning

Deep Learning Frameworks. COSC 7336: Advanced Natural Language Processing Fall 2017

Automation.

SHIV SHAKTI International Journal in Multidisciplinary and Academic Research (SSIJMAR) Vol. 7, No. 2, April 2018 (ISSN )

Experiments with Tensor Flow

Contents PART I: CLOUD, BIG DATA, AND COGNITIVE COMPUTING 1

Deep Generative Models Variational Autoencoders

ndpi & Machine Learning A future concrete idea

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

An Exploration of Computer Vision Techniques for Bird Species Classification

Machine Learning in WAN Research

Shape Context Matching For Efficient OCR

Deconvolution Networks

Populating the Galaxy Zoo

Deep Learning and Its Applications

Outlier detection using autoencoders

Bayes Risk. Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Discriminative vs Generative Models. Loss functions in classifiers

6.034 Quiz 2, Spring 2005

Pattern Recognition in Image Analysis

CERN openlab & IBM Research Workshop Trip Report

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

What s inside: What is deep learning Why is deep learning taking off now? Multiple applications How to implement a system.

Classifiers for Recognition Reading: Chapter 22 (skip 22.3)

Deep Learning. Deep Learning provided breakthrough results in speech recognition and image classification. Why?

APPLICATION OF SOFTMAX REGRESSION AND ITS VALIDATION FOR SPECTRAL-BASED LAND COVER MAPPING

Large Scale Data Analysis Using Deep Learning

Scene Text Recognition for Augmented Reality. Sagar G V Adviser: Prof. Bharadwaj Amrutur Indian Institute Of Science

Alternatives to Direct Supervision

FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU

Transcription:

1 Polytechnic University of Tirana Department of Computer Engineering SIBORA THEODHOR ELINDA KAJO M ECE

2 Computer Vision OCR AND BEYOND

THE PRESENTATION IS ORGANISED IN 3 PARTS : 3 Introduction, previous work on OCR Idea for the program, OCR introduction, demonstration and functionalities Sof t w ar e Engineering aspect of the program Designing, development, implementation, testing Computer Vision, Machine learning Introduction to software engineering approaches in machine learning

OCR (Optical Character Recognition) 4 T E X T T E X 00111010110010 11010010100011 10101010111011 Artificial Neural Net work 84 69 88 T E X T T 01011010110110 84 Image with text File with tekst

IDEA & technologies 5 Developing an app that captures an image with text in albanian language and translates it to english. Technologies : Android and OCR (Optical Character Recognition) Services : Hp OCR API & Mymemory Translate API

Demonstration 6 1 Capturing or add from album. IDEA

Demonstration 7 1 Capturing or add from album. 2 Image scanning and Ocr IDEA

Demonstration 8 1 Capturing or add from album. 2 Image scanning and Ocr 3 Conversion from image to char and translation IDEA

9 App functionality Flow diagram Open the application

10 Select an existing picture from album 1 OR 2 Capture a picture from camera The image is displayed on the screen and it s path is saved in a variable 3 4 HP OCR API is called with Http protocol (POST request with the picture)

Server returns jobid string which we send in Http GET request 5 11 6 Server returns the text OCR recognised MyMemory translate API is called (with the text as parameter) 7 8 MyMemory translate API returns the translated text 9 Display translated text

SOFTWARE ENGINEERING ASPECT OF THE PROGRAM : 12 PLANNING DEVELOPING VALIDATION TESTING

13 Zooming in to OCR: Software engineering in machine learning practices Developing software versus developing practices and processes

14 Observation from a software engineering prospective AI solutions have their own processes that are somewhat different from more traditional Software engineering. Two tutorials on building character recognition through the algorithms and capabilities in Azure Machine Learning and the other one through Tensor Flow.

Phases of experiment development in ML Planning Development Validation? Unknown outcome 15

Character recognition using Multiclass Decision Jungle - Experiment 16 Set of 20,000 unique letter images generated by randomly distorting pixel and preprocessed. First the data is split 50% to train the model and the other 50 % for testing. Multiclass Decision Jungle is the algorithm that does the training and the classification. After training, the model is used to score the other 50 % of the data and see how well the model predictted and classified new data.

17 Results Character recognition using Multiclass Decision Jungle The algorithm correctly classifies class A 85.1% of the time

18 Deep learning approach Building digit recognition in tensor flow

Another alternative to doing ML is using deep learning and neural networks. 19 Feature engineering (development part ) Transferred To the neural network Time allocation now shifts to modeling the neural network.

Data wrangling 20 MNIST - database of handwritten digits used for training and testing MNIST Data is split for training (55,000), testing(10,000) and validation (5,000). 28 px 28 px "x" 4 (label) "y"

Data wrangling 21 Each image is 28 pixels by 28 pixels which can be interpreted as a big array of numbers. We can flatten this array into a vector of 28x28 = 784 numbers MNIST images are just a bunch of points in a 784-dimensional vector space The result in our case is a tensor with a shape of [55000, 784].

Engineering the neural network 22 Softmax regression model used to assign probabilities to an object being one of several different things, it gives a list of values between 0 and 1 that add up to 1. It has two steps: first we add up the evidence of our input being in certain classes, and then we convert that evidence into probabilities. Previous Previous work work

To tally up the evidence that a given image is in a particular class, we do a weighted sum of the pixel intensities : 23 The weight is negative if that pixel having a high intensity is evidence against the image being in that class, and positive if it is evidence in favor. Diagram shows the weights one model learned for each of these classes. Red negative weights Blue positive weights

24 In machine learning, having more data is almost always as important as having better algorithms and the amount of effort that goes into data engineering is more than the effort that goes on algorithm picking.

References 25 Machine Learning is Fun! Deep Learning and Convolutional Neural Networks : Part 1,2,3,4 Distributed Machine Learning Pipelines in Apache Spark Train, Test, Evaluate for Multiclass Classification: Letter Recognition Dataset TFLearn: Deep learning library featuring a higher-level API for TensorFlow The MNIST Data

Thank you! 26