Defense Data Generation in Distributed Deep Learning System Se-Yoon Oh / ADD-IDAR

Similar documents
Deep Learning with Tensorflow AlexNet

HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads. Natalia Vassilieva, Sergey Serebryakov

Machine Learning With Python. Bin Chen Nov. 7, 2017 Research Computing Center

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

Fuzzy Set Theory in Computer Vision: Example 3

DEEP NEURAL NETWORKS CHANGING THE AUTONOMOUS VEHICLE LANDSCAPE. Dennis Lui August 2017

An introduction to Machine Learning silicon

POINT CLOUD DEEP LEARNING

Generative Modeling with Convolutional Neural Networks. Denis Dus Data Scientist at InData Labs

A NEW COMPUTING ERA JENSEN HUANG, FOUNDER & CEO GTC CHINA 2017

NVIDIA FOR DEEP LEARNING. Bill Veenhuis

Object recognition and computer vision using MATLAB and NVIDIA Deep Learning SDK

A performance comparison of Deep Learning frameworks on KNL

GPU FOR DEEP LEARNING. 周国峰 Wuhan University 2017/10/13

Deep Neural Network Hyperparameter Optimization with Genetic Algorithms

DEEP NEURAL NETWORKS AND GPUS. Julie Bernauer

All You Want To Know About CNNs. Yukun Zhu

NVIDIA DLI HANDS-ON TRAINING COURSE CATALOG

Deep Learning for Computer Vision II

Introduction to Deep Learning in Signal Processing & Communications with MATLAB

DEEP LEARNING WITH GPUS Maxim Milakov, Senior HPC DevTech Engineer, NVIDIA

IBM Deep Learning Solutions

Alternatives to Direct Supervision

Tutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY

CS 2750: Machine Learning. Neural Networks. Prof. Adriana Kovashka University of Pittsburgh April 13, 2016

Beyond Training The next steps of Machine Learning. Chris /in/chrisparsonsdev

Cisco UCS C480 ML M5 Rack Server Performance Characterization

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

Unsupervised Learning

Why data science is the new frontier in software development

Lab meeting (Paper review session) Stacked Generative Adversarial Networks

TOWARDS ACCELERATED DEEP LEARNING IN HPC AND HYPERSCALE ARCHITECTURES Environnement logiciel pour l apprentissage profond dans un contexte HPC

Pouya Kousha Fall 2018 CSE 5194 Prof. DK Panda

Deep Learning Frameworks with Spark and GPUs

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017

Know your data - many types of networks

Deep Learning Basic Lecture - Complex Systems & Artificial Intelligence 2017/18 (VO) Asan Agibetov, PhD.

Object Detection Lecture Introduction to deep learning (CNN) Idar Dyrdal

NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORKS

NVIDIA DEEP LEARNING INSTITUTE

Intro to Deep Learning. Slides Credit: Andrej Karapathy, Derek Hoiem, Marc Aurelio, Yann LeCunn

In partnership with. VelocityAI REFERENCE ARCHITECTURE WHITE PAPER

MoonRiver: Deep Neural Network in C++

Unified Deep Learning with CPU, GPU, and FPGA Technologies

ImageNet Classification with Deep Convolutional Neural Networks

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides

Deep Learning. Volker Tresp Summer 2017

INTRODUCTION TO DEEP LEARNING

Real Time Monitoring of CCTV Camera Images Using Object Detectors and Scene Classification for Retail and Surveillance Applications

Structured Prediction using Convolutional Neural Networks

Review: The best frameworks for machine learning and deep learning

DEEP LEARNING ALISON B LOWNDES. Deep Learning Solutions Architect & Community Manager EMEA

GPU-Accelerated Deep Learning

Spatial Localization and Detection. Lecture 8-1

TESLA V100 PERFORMANCE GUIDE. Life Sciences Applications

Machine Learning. MGS Lecture 3: Deep Learning

Perceptron: This is convolution!

Machine Learning. Deep Learning. Eric Xing (and Pengtao Xie) , Fall Lecture 8, October 6, Eric CMU,

DEEP LEARNING AND DIGITS DEEP LEARNING GPU TRAINING SYSTEM

GPU Coder: Automatic CUDA and TensorRT code generation from MATLAB

A NEW COMPUTING ERA. DAVID B. KIRK, FELLOW NVIDIA AI Conference Singapore 2017

Deep Learning Workshop. Nov. 20, 2015 Andrew Fishberg, Rowan Zellers

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

Deep learning in MATLAB From Concept to CUDA Code

High Performance Computing

Sony's deep learning software "Neural Network Libraries/Console and its use cases in Sony

Making Sense of Artificial Intelligence: A Practical Guide

Mocha.jl. Deep Learning in Julia. Chiyuan Zhang CSAIL, MIT

Deep Learning for Embedded Security Evaluation

Fei-Fei Li & Justin Johnson & Serena Yeung

Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks

Autoencoders. Stephen Scott. Introduction. Basic Idea. Stacked AE. Denoising AE. Sparse AE. Contractive AE. Variational AE GAN.

POWERING THE AI REVOLUTION JENSEN HUANG, FOUNDER & CEO GTC 2017

Deep Learning mit PowerAI - Ein Überblick

Convolutional Neural Networks

Efficient Large Message Broadcast using NCCL and CUDA-Aware MPI for Deep Learning

EFFICIENT INFERENCE WITH TENSORRT. Han Vanholder

Object Detection Based on Deep Learning

Deep Learning and Its Applications

High-Performance Training for Deep Learning and Computer Vision HPC

Study of Residual Networks for Image Recognition

Brainchip OCTOBER

Fine-tuning Pre-trained Large Scaled ImageNet model on smaller dataset for Detection task

Profiling DNN Workloads on a Volta-based DGX-1 System

Characterizing and Benchmarking Deep Learning Systems on Modern Data Center Architectures

Como funciona o Deep Learning

CNN optimization. Rassadin A

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

Training Deep Neural Networks (in parallel)

Deep Learning for IoT Big Data and Streaming Analytics: A Survey

PRACTICAL SCALING TECHNIQUES. Ujval Kapasi Dec 9, 2017

CENG 783. Special topics in. Deep Learning. AlchemyAPI. Week 11. Sinan Kalkan

Deep learning in action with DL4J

Joint Object Detection and Viewpoint Estimation using CNN features

Convolutional Neural Networks

Convolutional Neural Network based Medical Imaging Segmentation: Recent Progress and Challenges. Jiaxing Tan

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

Demystifying Deep Learning

Exploiting the OpenPOWER Platform for Big Data Analytics and Cognitive. Rajesh Bordawekar and Ruchir Puri IBM T. J. Watson Research Center

Transcription:

Defense Data Generation in Distributed Deep Learning System Se-Yoon Oh / 2017. 10. 31 syoh@add.re.kr Page 1/36

Overview 1. Introduction 2. Data Generation Synthesis 3. Distributed Deep Learning 4. Conclusions Page 2/36

ML Process (Supervised Learning, DNN) 1 Labeled Training Images Train Model on GPU Accelerated Cluster input conv 2 Neural Network Framework 4 Trained Network conv pool fullycon fullycon softmax 3 NN Model Definition model.add(convolution2d(128, 3, 3) CAT Page 3/36

ML Process (Supervised Learning, DNN) 1 2 Labeled Training Images Neural Network Framework Benchmark DB? Transfer learning? One-shot learning? 3 NN Model Definition model.add(convolution2d(128, 3, 3) 4 Train Model on GPU Accelerated Cluster Page 4/36

NN Hyper-Parameter Page 5/36

ImageNet Classification with Deep Convolutional Neural Networks (Alex Krizhevsky,2012) - 650,000 neurons, 60,000,000 parameters - 1.2 M images(224*224*3), 1,000 classes - 7 Hidden layers, ReLU, DropOut, 2 GPU(GTX580)/week - Overlapped max pooling, Augmentation 1 st Layer ConV L : 90~95% 연산량 (5% Para) - 290,400 개 (N) Full L : 5~10% 연산량 (95% Para) - ((11*11*3)+1) * 96 = 34,944 개 (P) - 290,400 * 364 = 105,750,600 개 (C) Page 6/36

Tech. Trends Page 7/36

Supervised learning(sl) Standard in the current state of ML Performed well on many task Ground-truth annotation/label is required in training Unsupervised learning(ul) How most learning is done in the real world Impractical to accurately annotate data UL seems to be the key to real artificial intelligence Transfer learning & fine-tuning CNN -> Training with millions of unlabeled images(ul) -> Deep understanding in images Quickly learn the new task with much smaller labelled images (?) Page 8/36

Two Things Considered Labeled data Few labeled data in defense Classification/object detection/ segmentation/tracking/autonomy/ Computing power ML/DL based on HPC & GPU Big data hub & data analytics Page 9/36

Overview 1. Introduction 2. Data Generation Synthesis 3. Distributed Deep Learning 4. Conclusions Page 10/36

Problem Statement ML demands access to very large sets of training data Significant problem may arise in real-world defense ML applications To overcome problems of small-data, novel synthetic data generation frameworks for massive ML training system are required Page 11/36

Page 12/36

Page 13/36

Page 14/36

Objectives & Motivation Develop a massive ML training dataset generation/synthesis system 1) Automatic Data Augmentation framework 2) 3D CAD modeling based synthetic/virtual image datasets generation framework 3) Game Development Engine based virtual scenes/military env. creation framework 4) Generative Models(GANs) framework for photorealistic images tuning Page 15/36

Generative Adversarial Networks(GANs) Data : a dataset as samples from a true data distribution Generator : Random noise as input, and transforms it, outputting a sample of data Discriminator : Classifies sample input data as real(from true data distribution) or fake(from the generator), x = g(z) Play a song on guitar - Song (Data) - Hands/brain (G) - Ears/brain (D) Page 16/36

GANs Latent Space Variation AYLIEN 2016 PPGN 2016 / Evolving AI Lab Yann Le Cun 2017.3 China Page 17/36

Difficulties Mode collapsing Oscillating during training No learning CycleGAN Ian Goodfellow GTC 2017 (Zhu et al., 2017) Page 18/36

GANs-based Image Generation Dataset: 1,300 images of Army-tank (ILSVRC2012) Model: DCGAN Real army-tank training images(imagenet DB) Generated synthetic armytank images (Epoch=189) Page 19/36

GANs-based Image Generation Changes of generated synthetic images along with iterations Epoch=5 Epoch=26 Epoch=99 Epoch=189 Page 20/36

Overview 1. Introduction 2. Data Generation Synthesis 3. Distributed Deep Learning 4. Conclusions Page 21/36

Grade Examination 50 EA 200 EA 10,000 EA Page 22/36

Grade Examination 10,000 EA Employ 50 P Grading/ Management Employee Mgt. : absence/medical/missing/ General Management Management/processing/copying/data missing/summary/reporting/ Distributed Computing Big Data Computing : Apache-Hadoop S/W Page 23/36

Sizes of data units ZB/2yr/2 배 /40ZB(2020) 57*Sand 24PB Google/Day/2012 1.2ZB/2%/2013 1TB (@ 80 Mb/s) 1 disk - 3.4 hr 100 disks - 2 min 1000 disks - 12 sec Page 24/36

Big Data / AI Big Data A I Page 25/36

RDBMS 1Terabyte/200$, Hadoop 1Terabyte/5$ RDB Big Data DB Library Logistics Page 26/36

Big Data Component (Hadoop) Infomatica/Talend/InfoShere/DataStage/Apach e.hadoop/spark/avro/ambari/chukwa/mahout /Tez/Yarn/Zookeeper/Pig/Hive/MapReduce/Hc atalog/impalra/kafka/hbase/singa/stinger/m apr/aster/splunk/cassandra/mongo/riak/vor mic/hana/esper/oracle/tibco/r/sas/spss/ta bleau/spotfire/d3/qliktech/microstrategy/exa data/greenplum/netizza/radian6/sma/cloudb igdata/amazon/softlayer/rackspac/cloudant/ Dynamo/Teradata cloud/ Page 27/36

Apache Spark 2012.06 : V.0.5.1 2014.05 : V.1.0.2 2015.09 : V.1.5.0 2015.10 : V.1.5.1 2015.11 : V.1.5.2 2016.01 : V.1.6.0 2016.03 : V.1.6.1 2016.07 : V.2.0.0 2017.10 : V.2.1.2 Google TensorFlow ( 15.11) Page 28/36

Distributed Computing Data/Model parallelism Full-mesh/Star topology ETRI E/T Trends ( 안신영 et al. 2016) Page 29/36

- 1,200,000 Image - 1,000 Classes [Image Big Data : ImageNet DB] Trained w, b Image(x) Label(y) Computer (Training) NN Model Page 30/36

DNN Error w.r.t Hidden Layer DNN Layer Error Year Remark - - 25.80% 2011 AlexNet 8 16.40% 2012 VGG 19 7.30% 2014 GoogleNet 22 6.70% 2014 ResNet 152 5.50% 2015 200 4.80% 2016 1,000 4.62% 2016 Benchmark DNN - ResNet-200 (2016) - ImageNet (1.2M, 1,000Class) 200 Layers Page 31/36

Computing Time w.r.t GPU GPU Petaflops (FP32) Time Remark Case 1 Case 2 2016 40 0.4 4.2 Days - 2,000 20.0 2 hrs AIST- AIRC (J) 2015 8 0.1 16.8 Days 2018 6,500 65~100 37 min 0.7 PF * 90(P) Tsubame 3(J) HPE 540 Node Tesla P100 : 2,160 HPC with 8 GPU 3 Weeks (ResNet 200 Layer) ImageNet 1.2M RGB Images with 1,000 classes 200,000 USD for 8-GPU HPC Page 32/36

BD/DL System : Open Source Platform Architecture Component Applications Vision / Speech / Behavior / DL Framework Libraries(SDK) System Software Hardware Caffe, Torch, Theano, DL4J, CNTK, Pylearn2, TensorFlow, KERAS, cudnn, NCCL, IMSL Lib., GPU VSIPL, cublas, cusparse, cufft Apache Hadoop, Spark, Ambari, HBase, Yarn, Drivers, CUDA, Clustered Computer (Linux) X86, GP-GPU Page 33/36

Overview 1. Introduction 2. Data Generation Synthesis 3. Distributed Deep Learning 4. Conclusions Page 34/36

Two Things Considered Labeled data Few labeled data in defense Classification/object detection/ segmentation/tracking/autonomy/ Synthetic/virtual data generation Computing power ML/DL based on HPC & GPU Big data hub & data analytics Distributed deep learning Page 35/36

Artificial Intelligence (Machine Learning,...) Autonomous System Learning Platform (Vision/Speech/Behavior) VR/AR Design & Analysis Analytics AI-Learning Intelligent Distributed Computing Big Data based AI Platform Page 36/36