Smart Parking System using Deep Learning. Sheece Gardezi Supervised By: Anoop Cherian Peter Strazdins

Similar documents
Supplementary material for Analyzing Filters Toward Efficient ConvNet

Fuzzy Set Theory in Computer Vision: Example 3

Study of Residual Networks for Image Recognition

Channel Locality Block: A Variant of Squeeze-and-Excitation

Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition

Elastic Neural Networks for Classification

Deep learning for object detection. Slides from Svetlana Lazebnik and many others

Structured Prediction using Convolutional Neural Networks

Real Time Monitoring of CCTV Camera Images Using Object Detectors and Scene Classification for Retail and Surveillance Applications

Traffic Multiple Target Detection on YOLOv2

Inception and Residual Networks. Hantao Zhang. Deep Learning with Python.

Deconvolutions in Convolutional Neural Networks

Proceedings of the International MultiConference of Engineers and Computer Scientists 2018 Vol I IMECS 2018, March 14-16, 2018, Hong Kong

CNNS FROM THE BASICS TO RECENT ADVANCES. Dmytro Mishkin Center for Machine Perception Czech Technical University in Prague

Facial Key Points Detection using Deep Convolutional Neural Network - NaimishNet

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION

3D model classification using convolutional neural network

Neural Networks with Input Specified Thresholds

Real-time Object Detection CS 229 Course Project

HENet: A Highly Efficient Convolutional Neural. Networks Optimized for Accuracy, Speed and Storage

Face Recognition A Deep Learning Approach

Progressive Neural Architecture Search

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides

Object Detection and Its Implementation on Android Devices

Deeply Cascaded Networks

3D Densely Convolutional Networks for Volumetric Segmentation. Toan Duc Bui, Jitae Shin, and Taesup Moon

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

arxiv: v1 [cs.cv] 14 Jul 2017

CrescendoNet: A New Deep Convolutional Neural Network with Ensemble Behavior

BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks

Mimicking Very Efficient Network for Object Detection

Application of Convolutional Neural Network for Image Classification on Pascal VOC Challenge 2012 dataset

Evaluation of Convnets for Large-Scale Scene Classification From High-Resolution Remote Sensing Images

Convolutional Layer Pooling Layer Fully Connected Layer Regularization

Tiny ImageNet Visual Recognition Challenge

Computer Vision Lecture 16

Convolutional Neural Networks

Computer Vision Lecture 16

Keras: Handwritten Digit Recognition using MNIST Dataset

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017

Object detection with CNNs

Convolution Neural Network for Traditional Chinese Calligraphy Recognition

Kaggle Data Science Bowl 2017 Technical Report

Tiny ImageNet Challenge

Defense against Adversarial Attacks Using High-Level Representation Guided Denoiser SUPPLEMENTARY MATERIALS

R-FCN++: Towards Accurate Region-Based Fully Convolutional Networks for Object Detection

Convolutional Neural Networks: Applications and a short timeline. 7th Deep Learning Meetup Kornel Kis Vienna,

Towards Real-Time Automatic Number Plate. Detection: Dots in the Search Space

arxiv: v1 [cs.cv] 5 Oct 2015

Visual Inspection of Storm-Water Pipe Systems using Deep Convolutional Neural Networks

arxiv: v1 [cs.cv] 4 Dec 2014

arxiv: v2 [cs.cv] 2 Apr 2018

FUSION MODEL BASED ON CONVOLUTIONAL NEURAL NETWORKS WITH TWO FEATURES FOR ACOUSTIC SCENE CLASSIFICATION

A Novel Weight-Shared Multi-Stage Network Architecture of CNNs for Scale Invariance

Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta

Object Detection Based on Deep Learning

Content-Based Image Recovery

Inception Network Overview. David White CS793

KamiNet A Convolutional Neural Network for Tiny ImageNet Challenge

Residual Networks for Tiny ImageNet

Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network

Convolution Neural Networks for Chinese Handwriting Recognition

Mind the regularized GAP, for human action classification and semi-supervised localization based on visual saliency.

A Deep Learning Approach to Vehicle Speed Estimation

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

Deep learning for dense per-pixel prediction. Chunhua Shen The University of Adelaide, Australia

FCHD: A fast and accurate head detector

INTRODUCTION TO DEEP LEARNING

Exploration of the Effect of Residual Connection on top of SqueezeNet A Combination study of Inception Model and Bypass Layers

Spatial Localization and Detection. Lecture 8-1

Background-Foreground Frame Classification

Know your data - many types of networks

Fei-Fei Li & Justin Johnson & Serena Yeung

FaceNet. Florian Schroff, Dmitry Kalenichenko, James Philbin Google Inc. Presentation by Ignacio Aranguren and Rahul Rana

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

DeepBIBX: Deep Learning for Image Based Bibliographic Data Extraction

Cultural Event Recognition by Subregion Classification with Convolutional Neural Network

In Defense of Fully Connected Layers in Visual Representation Transfer

Deep Neural Network Hyperparameter Optimization with Genetic Algorithms

Mask R-CNN. Kaiming He, Georgia, Gkioxari, Piotr Dollar, Ross Girshick Presenters: Xiaokang Wang, Mengyao Shi Feb. 13, 2018

Machine Learning. Deep Learning. Eric Xing (and Pengtao Xie) , Fall Lecture 8, October 6, Eric CMU,

arxiv: v1 [cs.cv] 6 Sep 2017

arxiv: v2 [cs.cv] 26 Jan 2018

Seminars in Artifiial Intelligenie and Robotiis

ECE 5470 Classification, Machine Learning, and Neural Network Review

arxiv: v1 [cs.cv] 29 Sep 2016

LARGE-SCALE PERSON RE-IDENTIFICATION AS RETRIEVAL

arxiv: v1 [cs.cv] 15 Oct 2018

An Accurate and Real-time Self-blast Glass Insulator Location Method Based On Faster R-CNN and U-net with Aerial Images

Residual vs. Inception vs. Classical Networks for Low-Resolution Face Recognition

Final Report: Smart Trash Net: Waste Localization and Classification

Tiny ImageNet Challenge Submission

Multi-Task Self-Supervised Visual Learning

Automatic Detection of Multiple Organs Using Convolutional Neural Networks

RETRIEVAL OF FACES BASED ON SIMILARITIES Jonnadula Narasimha Rao, Keerthi Krishna Sai Viswanadh, Namani Sandeep, Allanki Upasana

arxiv: v1 [cs.cv] 30 Apr 2018

arxiv: v1 [cs.cv] 2 Jan 2019

Residual Networks And Attention Models. cs273b Recitation 11/11/2016. Anna Shcherbina

Deep neural networks II

arxiv: v2 [cs.cv] 23 May 2016

Transcription:

Smart Parking System using Deep Learning Sheece Gardezi Supervised By: Anoop Cherian Peter Strazdins

Content Labeling tool Neural Networks Visual Road Map Labeling tool Data set Vgg16 Resnet50 Inception_v3 Test Real Time App

Labelling Tool Gamification Automatic wireframe creation Ubuntu, Mac, Windows

Neural Networks Infinitely flexible function All purpose parameter fitting Fast and scalable

Visualizing Convolutional Networks Filters learn to find features from pictures and each filter activate those pixels that match the shape of the filter Ref: Zeiler MD, Fergus R. Visualizing and understanding convolutional networks. InEuropean conference on computer vision 2014 Sep 6 (pp. 818-833). Springer, Cham.

Road Map Collect data Preprocesses Data Fillter data remove pictures with people, night time pictures Create wireframe Label data create labeling tool Segment data Create test cases Create test/train/ validitation datasets for each case Implement Models Vgg16, Vgg19, resnet50 and Inception_v3 Create mini-vgg16 Train from scratch Evaluate Run tests on models Gadge their preformance

Labelling Tool It automatically create standard wireframe for each image Gamification

Dataset Dataset Free spaces Busy spaces Total Spaces CNRPark 65,658 79,307 144,965 PkLot 337,780 358,119 695,899 SSRLot 17,334 11,247 28,581

Vgg Major Features: 3x3 Convolution Layers Stacked 16 weight layers fixed image size 224 x 224 and three color channels RGB Spatial pooling was carried out by five max-pooling layers each had 4096 channels Major drawbacks: It is extremely slow to train and consumes a lot of time and resources of the machine. The network architecture s weights themselves are quite large (in terms of the disk space/ bandwidth covered) Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arxiv preprint arxiv:1409.1556. 2014 Sep 4.

Resnet As we go deeper; the training of neural network becomes difficult and also the accuracy starts saturating and then degrades also. Residual Learning tries to solve both these problems. Residual can be simply understood as subtraction of feature learned from input of that layer. ResNet does this using shortcut connections (directly connecting input of nth layer to some (n+x)th layer. It has proved that training this form of networks is easier than training simple deep convolutional neural networks and also the problem of degrading accuracy is resolved He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. InProceedings of the IEEE conference on computer vision and pattern recognition 2016 (pp. 770-778).

Inception An average pooling layer with 5 5 filter size and stride 3,resultinginan 4 4 512 output for the (4a), and 4 4 528 for the (4d) stage. A 1 1 convolution with 128 filters for dimension reduction and rectified linear activation. A fully connected layer with 1024 units and rectified linear activation. A dropout layer with 70% ratio of dropped outputs. A linear layer with softmax loss as the classifier (predicting the same 1000 classes as the main classifier, but removed at inference time). Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A. Going deeper with convolutions. InProceedings of the IEEE conference on computer vision and pattern recognition 2015 (pp. 1-9).

Purpose of conducting tests of various models Different architectures have different strengths Using stochastic gradient decent and universal approximation theorem given enough time any model could be trained for any task Evaluate which model trains fastest for car parking problem Evaluate which model is more porous/flexible to changed light conditions Evaluate which model is more flexible to orientations of cars

Evaluation Method Model Number of Epoch Training Time(s) Training Accuracy(%) Validation Accuracy(%) Error(%) Vgg16 1 3015 98.27 99.49 7.35 Vgg19 1 567 75.71 55.36 23.75 Resnet50 1 578 99.43 98.35 9.57 Inceptionv3 1 622 99.9 97.15 3.31 Results for test case PkLot-Complete

Real Time information about the parking lot in google maps.

Comercial Limitation No cost reduction for parking lot owners No added benfit added to commerical entities Places for implemention University where organization cares for its populous life improvement Government owned parkinglots where object is to work for betterment of it citizens

Questions