Edge of Tomorrow Deploying Collaborative Machine Intelligence to the Edge

Similar documents
Accelerating intelligence at the edge for embedded and IoT applications

Embarquez votre Intelligence Artificielle (IA) sur CPU, GPU et FPGA

Bring Intelligence to the Edge with Intel Movidius Neural Compute Stick

Onto Petaflops with Kubernetes

Integrating NVIDIA Deep Learning Accelerator (NVDLA) with RISC-V SoC on FireSim

Deploying Deep Learning Networks to Embedded GPUs and CPUs

Edge-to-Cloud Compute with MxNet

Deep Learning Inference on Openshift with GPUs

DNNBuilder: an Automated Tool for Building High-Performance DNN Hardware Accelerators for FPGAs

Lecture 12: Model Serving. CSE599W: Spring 2018

Course Microsoft Dynamics 365 Customization and Configuration with Visual Development (CRM)

NVIDIA'S DEEP LEARNING ACCELERATOR MEETS SIFIVE'S FREEDOM PLATFORM. Frans Sijstermans (NVIDIA) & Yunsup Lee (SiFive)

The Changing Face of Edge Compute

Managing data flows. Martyn Winn Scientific Computing Dept. STFC Daresbury Laboratory Cheshire. 8th May 2014

Comprehensive Arm Solutions for Innovative Machine Learning (ML) and Computer Vision (CV) Applications

An introduction to Machine Learning silicon

Introduction to Cisco IoT Tools for Developers IoT 101

Deep Learning on Arm Cortex-M Microcontrollers. Rod Crawford Director Software Technologies, Arm

MC-SDN: Supporting Mixed-Criticality Scheduling on Switched-Ethernet Using Software-Defined Networking

Public Sensing Using Your Mobile Phone for Crowd Sourcing

What s New in MATLAB and Simulink

Embedded GPGPU and Deep Learning for Industrial Market

Advanced Cloud Infrastructures

How to Build Optimized ML Applications with Arm Software

Artificial Intelligence Enriched User Experience with ARM Technologies

NVIDIA FOR DEEP LEARNING. Bill Veenhuis

Outline. Person detection in RGB/IR images 8/11/2017. Pedestrian detection how? Pedestrian detection how? Comparing different methodologies?

Voice-controlled Home Automation Using Watson, Raspberry Pi, and Openwhisk

Copyright 2017 Samsung. All Rights Reserved. O-Hoon Kwon, Ph.D. Samsung Electronics

XPU A Programmable FPGA Accelerator for Diverse Workloads

IoT Edge within the IoT Framework

Declarative Modeling for Cloud Deployments

A Global Operating System «from the Things to the Clouds»

Clover Overview: Gambia release. April 16, 2018

NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI

THE NVIDIA DEEP LEARNING ACCELERATOR

End to End Optimization Stack for Deep Learning

Making Sense of Artificial Intelligence: A Practical Guide

Machine learning for the Internet of Things

Xilinx ML Suite Overview

The Mont-Blanc approach towards Exascale

OPNFV overview and Edge Cloud

Atlys (Xilinx Spartan-6 LX45)

S INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS

IoT Project Proposals

Accelerating your Embedded Vision / Machine Learning design with the revision Stack. Giles Peckham, Xilinx

Welcome to SUSE Expert Days 2017 Service Delivery with DevOps

Rok: Decentralized storage for the cloud native world

Building Kubernetes cloud: real world deployment examples, challenges and approaches. Alena Prokharchyk, Rancher Labs

What s New in MATLAB and Simulink

Mobile Cloud Computing Challenges and Solutions

Informix Sensor Data: End-To-End Live Demo

Open Mobile Platforms. EE 392I, Lecture-6 May 4 th, 2010

How to Build Optimized ML Applications with Arm Software

GPU Coder: Automatic CUDA and TensorRT code generation from MATLAB

DeepThings: Distributed Adaptive Deep Learning Inference on Resource-Constrained IoT Edge Clusters

MIOVISION DEEP LEARNING TRAFFIC ANALYTICS SYSTEM FOR REAL-WORLD DEPLOYMENT. Kurtis McBride CEO, Miovision

Getting Started With Serverless: Key Use Cases & Design Patterns

Mali-G72 Enabling tomorrow s technology today

What s New in MATLAB and Simulink The MathWorks, Inc. 1

Knative: Building serverless platforms on top of Kubernetes

Lesson 5 Nimbits. Chapter-6 L05: "Internet of Things ", Raj Kamal, Publs.: McGraw-Hill Education

Docker Live Hacking: From Raspberry Pi to Kubernetes

The Wearable Application Server

Fosdem Feb/2018. Frederic Marec Embedded Engineer

Deep Learning: Transforming Engineering and Science The MathWorks, Inc.

Enable AI on Mobile Devices

Persistent Memory Productization driven by AI & ML. Danny Sabour VP Marketing, Avalanche Technology

Democratizing Machine Learning on Kubernetes

EDGE / FOG COMPUTING EVOLUTIONS LAURENT REMONT, CTO KONTRON-S&T

NC STATE UNIVERSITY. Advanced Diagnosis Automation and Control (ADAC) Lab. Raleigh, NC 27695

An Introduction to Developing for Cisco Kinetic

Dr Mark Little Red Hat, Inc.

IBM Power Systems: Open innovation to put data to work Dexter Henderson Vice President IBM Power Systems

Embedded Linux With Raspberry Pi IOT

LegUp: Accelerating Memcached on Cloud FPGAs

Intelligent Video Analytics for Urban Management

Technology Overview. Gallery SIENNA London, England T

Autonomous Driving Solutions

EMBEDDED VISION AND 3D SENSORS: WHAT IT MEANS TO BE SMART

Delivering Deep Learning to Mobile Devices via Offloading

The NE010 iwarp Adapter

F28HS Hardware-Software Interface: Systems Programming

Open Source IoT. Eclipse IoT. Tim De Borger - Senior Solution Architect 13/06/2017

EYWA: a Distributed Graph Engine in the Huawei MIND Platform. Yinglong Xia Huawei Research America 02/09/2017

What s New for MATLAB David Willingham

Agenda Time (PT) 8:45 a.m. Event Platform Opening 9:00 a.m. Keynote - Java: Present and Future Java EE 7 Java SE 8 Java Embedded

Kevin Meehan Stephen Moskal Computer Architecture Winter 2012 Dr. Shaaban

CoreOS and Red Hat. Reza Shafii Joe Fernandes Brandon Philips Clayton Coleman May 2018

Citrix Ready Workspace Hub by NComputing (NComputing RX-HDX)

Realtime Object Detection and Segmentation for HD Mapping

AMM Feb/2018. Frederic Marec Embedded Engineer

EDGE COMPUTING & IOT MAKING IT SECURE AND MANAGEABLE FRANCK ROUX MARKETING MANAGER, NXP JUNE PUBLIC

Scaling Convolutional Neural Networks on Reconfigurable Logic Michaela Blott, Principal Engineer, Xilinx Research

Kuber-what?! Learn about Kubernetes

Evaluation of Real-time operating systems for FGC controls

What s inside: What is deep learning Why is deep learning taking off now? Multiple applications How to implement a system.

SIMPLE, FLEXIBLE CONNECTIONS FOR TODAY S BUSINESS. Ethernet Services from Verizon

Building an Operating System for AI

AsTeRICS Introduction Demos, Model creation, Packaging (APE)

Transcription:

Edge of Tomorrow Deploying Collaborative Machine Intelligence to the Edge Adarsh Pal Singh International Institute of Information Technology, Hyderabad Wenjing Chu Futurewei Technologies, Inc. @wenjing

Do you like this Edge of Tomorrow? Well, let s work on creating a better one

AI Delivered through a Cloud Today A camera A Raspberry Pi A storage service A deep training engine A server An inference engine + application An actuator to control physical environment

So, What s the Problem? A camera A Raspberry Pi >100 s ms, 100 s MB/s, loss of privacy... A server A storage service A deep training engine An inference engine + application An actuator to control physical environment Another >100 s ms, loss of security, reliability...

AI Delivered through an Edge Cloud of Tomorrow A camera A Raspberry Pi An image processor An inference engine An actuator to control physical environment Device Edge Local: <5-20ms Local: OK MB/s Improved privacy Improved reliability, security

AI Delivered through an Edge Cloud of Tomorrow A camera A Raspberry Pi An image processor An inference engine A model serving service An actuator to control physical environment

AI Delivered through an Edge Cloud of Tomorrow A camera A Raspberry Pi A storage service A deep training engine An image processor A server A conventional app An inference engine A model serving service An actuator to control physical environment

AI Delivered through an Edge Cloud of Tomorrow A camera A Raspberry Pi A storage service A deep training engine An image processor A server A conventional app An inference engine A model serving service An actuator to control physical environment The default path

AI Delivered through an Edge Cloud of Tomorrow A camera A Raspberry Pi A storage service A deep training engine An image processor A server A conventional app An inference engine A model serving service An actuator to control physical environment The default path Device Local: <5-20ms Local: OK MB/s Improved privacy, reliability, security Edge Less MB/s, save cost OK 100 s ms latency Data center

Let s Experiment: Real Time Object Detection *YOLOv3: https://arxiv.org/abs/1804.02767

YOLO: You Only Live Look Once Source: https://arxiv.org/pdf/1506.02640.pdf

Let s Experiment: Latency Edge vs. Cloud Aim: To trigger an action locally based on the detection of an object of interest. 720x480 image stream @ 30FPS Raw Data Stream over Internet NN trained in cloud and off-loaded to edge Raw Data Stream over Intranet 720x480 image stream @ 30FPS RPi Camera V2 RPi Camera V2 CMD Google Kubernetes Engine (Cloud) 0 or 1 Raspberry Pi 3 interfaced with sensor and actuator. Raspberry Pi Kubernetes Cluster (Edge) CMD Raspberry Pi 3 interfaced with sensor and actuator. 0 or 1 LED as a logical actuator Demonstration 1 Demonstration 2 LED as a logical actuator

Some of the Details YOLOv3-Tiny Object Detection pod runs on Edge/GKE. Darknet compiled with NNPACK and ARM Neon on Edge and CUDA/cuDNN on GKE. Source RPi does the following parallel tasks: (a) Video streaming and (b) Waiting for CMD from Edge/Cloud (for actuation). Pod workflow: Capture image stream -> Run NN -> Send CMD. Simple Socket programming used for sending/receiving CMD. Latency test: Edge vs. Cloud!

Demo: Let s See it. RPi3 on the edge vs. GKE with GPU on control loop latency running on YOLO object detection NN algorithm. *YOLOv3: https://arxiv.org/abs/1804.02767

Some Numbers Prediction Time Edge: 2-2.5s / Image vs. Cloud: 0.007-0.01s / Image. Due to high prediction time, Edge can take upto 4-5s for detection in worst case. Image Stream lag Edge: 0.009-0.02s vs. Cloud: 0.5-1s CMD lag Edge: Negligible vs. Cloud: ~0.5s

Some Observations The Edge of Tomorrow is an intelligent one. The intelligent services can be realized with cloud methodology today, but have to address some crucial challenges Latency requirement for real-time control Bandwidth cost Privacy concerns Reliability and security concerns The cloud needs to be extended to the edge to address these problems. The edge node needs to have sufficient compute resources to fulfill these potentials.

Questions? Adarsh s work is supported by an Linux Foundation Networking internship in the OPNFV Clover and Edge Cloud projects. Clover: https://wiki.opnfv.org/display/clov Edge cloud: https://wiki.opnfv.org/display/ec