Enhancing Intelligent Video Analytics with Machine Learning

Similar documents
Architectures for Scalable Media Object Search

CV of Qixiang Ye. University of Chinese Academy of Sciences

FACE RECOGNITION FROM A SINGLE SAMPLE USING RLOG FILTER AND MANIFOLD ANALYSIS

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS

Survey on Recommendation of Personalized Travel Sequence

Building a Cybersecurity R&D Ecosystem in Singapore

DL Tutorial. Xudong Cao

with Deep Learning A Review of Person Re-identification Xi Li College of Computer Science, Zhejiang University

Datasets and Representation Learning

Lingxi Xie. Post-doctoral Researcher (supervised by Prof. Alan Yuille) Center for Imaging Science, the Johns Hopkins University

A Brief Introduction to CFINS

GUANGHAN NING Pinard St, Milpitas, CA, 95035

Singapore s Smart Nation Initiative

Transforming Transport Infrastructure with GPU- Accelerated Machine Learning Yang Lu and Shaun Howell

About. Established 1 September 2016 Engagement platform for cross-sector interaction and collaboration. Cybersecurity Consortium

A Study on Different Challenges in Facial Recognition Methods

Face Spoofing Detection from a Single Image using Diffusion Speed Model

2 Proposed Methodology

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

Presenter: Felix Goldberg, Ph.D. Chief AI Scientist Artificial Intelligence Cart AIC

JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA

NVIDIA DEEP LEARNING INSTITUTE

VISION FOR AUTOMOTIVE DRIVING

APPLICABILITY ANALYSIS OF CLOTH SIMULATION FILTERING ALGORITHM FOR MOBILE LIDAR POINT CLOUD

Arbee L.P. Chen ( 陳良弼 )

arxiv: v1 [cs.cv] 2 May 2017

Tracking driver actions and guiding phone usage for safer driving. Hongyu Li Jan 25, 2018

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority

3D Digitization of Human Foot Based on Computer Stereo Vision Combined with KINECT Sensor Hai-Qing YANG a,*, Li HE b, Geng-Xin GUO c and Yong-Jun XU d

LARGE-SCALE PERSON RE-IDENTIFICATION AS RETRIEVAL

International Journal of Modern Engineering and Research Technology

About Us. Funded by the National Research Foundation (NRF) and anchored at the National University of Singapore (NUS) since 1 September 2016

Learning the Three Factors of a Non-overlapping Multi-camera Network Topology

An efficient face recognition algorithm based on multi-kernel regularization learning

Saint Petersburg Electrotechnical University "LETI" (ETU "LETI") , Saint Petersburg, Russian FederationProfessoraPopova str.

Bidirectional Recurrent Convolutional Networks for Video Super-Resolution

Automatic Search Engine Evaluation with Click-through Data Analysis. Yiqun Liu State Key Lab of Intelligent Tech. & Sys Jun.

2 nd Year. Module Basket of Courses Duration Credit Offered Status. 12 Weeks 4 NPTEL Programming in Java

Internet of things that video

Lecture 12 Recognition

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

HK ASTRI FinTech Initiative

Internet Traffic Classification Using Machine Learning. Tanjila Ahmed Dec 6, 2017

The Hilbert Problems of Computer Vision. Jitendra Malik UC Berkeley & Google, Inc.

Object Tracking System Using Motion Detection and Sound Detection

Design of student information system based on association algorithm and data mining technology. CaiYan, ChenHua

Baidu-I 2 R Research Centre Launches Singapore-developed Music Search Technology

SOLUTION BRIEF AN INTELLIGENT VIDEO ANALYTICS PLATFORM FOR SMART CITIES

CONTENT ENGINEERING & VISION LABORATORY. Régis Vinciguerra

Building the Most Efficient Machine Learning System

GPU-ACCELERATED PLATFORM TRANSFORMING THE SMART CITIES LANDSCAPE PRADEEP GUPTA SENIOR SOLUTIONS ARCHITECT, NVIDIA

Research at Google: With a Case of YouTube-8M. Joonseok Lee, Google Research

FEATURE EXTRACTION IN HADOOP IMAGE PROCESSING INTERFACE

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University

Urban Scene Segmentation, Recognition and Remodeling. Part III. Jinglu Wang 11/24/2016 ACCV 2016 TUTORIAL

EECS 442 Computer Vision fall 2011

Geometry-aware Traffic Flow Analysis by Detection and Tracking

safely connecting to future smart hospital & healthcare

A FRAMEWORK OF EXTRACTING MULTI-SCALE FEATURES USING MULTIPLE CONVOLUTIONAL NEURAL NETWORKS. Kuan-Chuan Peng and Tsuhan Chen

Volume 6, Issue 12, December 2018 International Journal of Advance Research in Computer Science and Management Studies

CS595:Introduction to Computer Vision

Recognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials

Efficient Path Finding Method Based Evaluation Function in Large Scene Online Games and Its Application

Semantic Segmentation

MIOVISION DEEP LEARNING TRAFFIC ANALYTICS SYSTEM FOR REAL-WORLD DEPLOYMENT. Kurtis McBride CEO, Miovision

Lecture 12 Recognition. Davide Scaramuzza

Research on QR Code Image Pre-processing Algorithm under Complex Background

Building the Most Efficient Machine Learning System

Event Detection through Differential Pattern Mining in Internet of Things

Multi-region Bilinear Convolutional Neural Networks for Person Re-Identification

AI: A UAE Perspective. Nawaf I. Almoosa Deputy Director - EBTIC March 7 th 2018

Learning with Side Information through Modality Hallucination

arxiv: v1 [cs.lg] 12 Jul 2018

AMID BASED CROWD DENSITY ESTIMATION

Person Re-identification for Improved Multi-person Multi-camera Tracking by Continuous Entity Association

Face Liveness Detection Using Euler Method Based On Diffusion Speed Calculation

Graphics, Vision, HCI. K.P. Chan Wenping Wang Li-Yi Wei Kenneth Wong Yizhou Yu

13th Asia-Pacific Eco-Business Forum in Kawasaki

Stanford University Packing and Padding, Coupled Multi-index for Accurate Image Retrieval.

Electronic commerce recommendation mobile crowd system based on cooperative data collection and embedded control

Jinwei Gu. Ph.D. in Computer Science Dissertation: Measurement, Modeling, and Synthesis of Time-Varying Appearance of Natural

Analyzing Working of FP-Growth Algorithm for Frequent Pattern Mining

IN computer vision develop mathematical techniques in

2015 The MathWorks, Inc. 1

Summarization of Egocentric Moving Videos for Generating Walking Route Guidance

Predicting Service Outage Using Machine Learning Techniques. HPE Innovation Center

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

IoT: Here or Hype? Steve Eglash Executive Director, Secure Internet of Things Project. ǀ Secure Internet of Things Project 1

DAISY Data Analysis and Information SecuritY Lab

MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK. Wenjie Guan, YueXian Zou*, Xiaoqun Zhou

Introduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah

R-CNN Based Object Detection and Classification Methods for Complex Sceneries

Modeling and Analysis of the Support System of Space-based Anti-missile Information Based on UML

Experiments with Tensor Flow

The Research of Internet of Things in Operation and Maintenance for Distribution Grid

DOTNET Projects. DotNet Projects IEEE I. DOTNET based CLOUD COMPUTING. DOTNET based ARTIFICIAL INTELLIGENCE

Fast Hardware For AI

Human pose estimation using Active Shape Models

CAP 6412 Advanced Computer Vision

Transcription:

Enhancing Intelligent Video Analytics with Machine Learning Dennis Sng Deputy Director & Principal Scientist NVIDIA AI Conference 24 October 2017 ROSE LAB OVERVIEW 2 1

Visual Object Categorisation 2D (Planar) objects: Logos, book covers, CD covers, labels Faces: Detection, Verification, Spoofing Detection, Aging Prediction 3D rigid objects: Cars, hardware, product packages Landmark & Scenery Deformable objects: Clothes, shoes, bags, toys 3 Solution Architecture Applications Tourism Social Media Digital Archive Health & Wellness Smart Surveillance E-Commerce & Digital Advertising SDK Tools API API API API API Algorithm Visual Object Search Visual Analytics Deep Learning Biometrics Surveillance & Image Forensics Visual Object Database Infrastructure Cloud 4 2

Commercial Partners ROSE Partner Ecosystem NTU/PKU Joint-Lab Visual Object Search Large-Scale Object Dataset Media Cloud Platform Research Partners Technology Partners Supported by 5 DIRECTOR Prof. Alex KOT, EEE Biometrics, Face Spoofing, Image Forensics, Fast Text Image Detection Faculty & Key Staff DEPUTY DIRECTOR Dr Dennis Sng Systems Architecture, Research Management Prof. TAN Yap Peng, EEE Image/Video Processing, Face Recognition Prof. JIANG Xudong, EEE Biometrics, Face Recognition Prof. CHEN Tsuhan, CoE Computer Vision, Pattern Recognition, Machine Learning Prof. LAM Kwok Yan, SCSE Distrib Systems Security, Cyber-Security, Multi-modal Biometrics Prof. CAI Jianfei, SCSE Computer Vision, Image Segmentation Prof. CHAU Lap Pui, EEE Visual Signal Processing Algo, Light-field Imaging, Human Motion Analysis Prof. YAP Kim Hui, EEE Image/Video Processing, Visual Object Search Prof. YUAN Junsong, EEE Video Analytics, Visual Object Search, Anomaly Detection, Object Tracking Prof. CONG Gao, SCSE Mining Social Media, Geo-spatial Data Management Prof. LIN Weisi, SCSE Image Processing, Video Compression Prof. Adams KONG, SCSE Biometrics, Forensics, Image Processing and Pattern Recognition 3

24/10/2017 Alumni Mr. YIN Jianxiong Dr. Rahul RAMA VARIOR Prof. WANG Gang Prof. Xu Dong Dr. Amir SHAHROU Dr. LU Haifeng Dr. SHI Boxin Dr. YANG Gao Dr. WENG Renliang Dr. LI Sheng Dr. CHEN Qi Dr. MIAO Zhenwei Dr. LIN Jie Dr. ZUO Zhen Dr. WANG Yan Dr. CHEN Jie Dr. Khosro BAHRA Dr. WANG Shiqi Dr. Anirban CHAKRABORTY 7 Factors Driving Deep Learning New Algorithms & Techniques Compute Density Big Data Availability 4

Developing New Algorithms & Techniques RESEARCH PROGRAMMES 9 Industry Partners (TRL 6-7) Translational R&D (TRL 4-5) 26 RSEs Basic Research (TRL 2-3) 40 PhD students Visual Object Search Visual Search for Logo Recognition Product Detection & Recognition Compact Descriptor for Visual Search Scene & Landmark Recognition Vehicle Detection & Classification Computer Vision Pattern Recognition Core Activities Video Analytics & Deep Learning Object Detection & Classification Anomaly Detection Multiple Pedestrian Detection Cross Camera Person Re- Identification Person & Object Tracking Action Recognition Machine Learning Compact Descriptors Multimedia Forensics & Biometrics Image Forensics Face Spoofing & Liveness Detection Face Aging Transformation Fast Text Image Detection Image Quality Assessment Systems Arch & Security 5

Visual Object Search Logo Detection & Localization Handbag Recognition Shoe Retrieval WeChat Image Search Landmark Search Visual Indoor Localization 11 Verification vs Recognition Is this Barack Obama? Verification (1-to-1) Who is this person? Recognition (1-to-n) 6

Barack Obama Michelle Obama Detection PM Lee Where is PM Lee, Mr. Obama and Michelle Obama in this picture? Detection (localization) Consumer Profiling YOLOv2 Self-supervised Structure-sensitive Learning * Courtesy of Redmonetc. * Courtesy of Gong etc. 7

Consumer Profiling: Sample Parsing Results Consumer Profiling: Sample Results 8

Video Analytics & Deep Learning Object Classification Pedestrian Detection Cross Camera Person Re-ID Visual Anomaly Detection Action Recognition Person Tracking 20 Cross Camera Human Re-Identification Algorithm aims to recognize an individual Same or Different Camera Overlapping or Non- Overlapping Example Forensic Applications Searching for a suspect Evidence gathering Etc 9

Cross Camera Human Re-Identification Front-end API 7 layer Siamese CNN Light weight and fast, but less accurate. Serves as a single camera-based fast scanning, filtering out obviously irrelevant person objects. Roughly positive results can be found in top-25 ranking in most of time. Back-end API 50 layer CNN Have more computation power and more accurate. Requires more computation resources. Roughly positive results can be found in top-10 ranking in 80% of time. Multimedia Forensics & Biometrics Face Detection & Recognition Face Spoofing Detection Face Ageing Fast Text Image Detection 27 10

Face Spoofing Attack Scenarios Door Controlled Access Spoof Camera model & environment are known in advance. Spoofing detection can be easily done with deep learning or other computer vision techniques. 29 Face Spoofing Attack Scenarios Mobile Unlock/Payment Spoof Camera model, environment are not known in advance. Existing Algorithms can suffer from over-fitting problem. 30 11

Big Data Availability LARGE-SCALE DATASETS 31 ROSE Shareable Datasets Recaptured Image Video Object Instance RGB+D Action Recognition Multiple Pedestrian Detection 32 12

ROSE Large scale RGB+D Action Recognition Dataset 56880 Video Samples More than 4M frames 60 Classes 80 Views 40 Different Human Subjects Kinect V2 Sensor Amir Shahroudy, Jun Liu, Tian-TsongNg, and Gang Wang, NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analytis, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016 33 ROSE RGB+D Action Recognition Dataset: Comparison with other datasets 34 13

Industry Baidu Facebook AI Research HikVision IBM Research Tokyo Intel Labs China Microsoft Research Asia Mercedes Benz R&D India Mitsubishi Electric Panasonic R&D S pore United Technologies Research Chinese Academy of Sciences ETH Zurich INRIA Academia UC Berkeley CUHK NUS Stanford U Tsinghua U TUM Zhejiang U 35 Computing Density: Training Platform for Deep Learning GPU COMPUTING ARCHITECTURE 36 14

ROSE Network Architecture Overview 2-GPU Server Cluster Storage Server Cluster NTU Campus Network #1 #2 #3 #4 #5 #14 #15 #1 #2 #3 #4 1G Access Network 1G IPMI Network 10G Storage Network 8-GPU Server Cluster 4-GPU Server Cluster #1 #2 #3 #6 DGX-1 #1 DGX-1 #2 #1 #2 #3 #9 #10 Infiniband Compute Network Thank You rose.ntu.edu.sg 42 15