How GPUs Power Comcast's X1 Voice Remote and Smart Video Analytics. Jan Neumann Comcast Labs DC May 10th, 2017

Similar documents
Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

AWS Machine Learning Transformation Day - Munich. 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

An introduction to Machine Learning silicon

USING DEEP LEARNING TECHNOLOGIES TO PROVIDE DESCRIPTIVE METADATA FOR LIVE VIDEO CONTENTS

NVIDIA PLATFORM FOR AI

Inference Optimization Using TensorRT with Use Cases. Jack Han / 한재근 Solutions Architect NVIDIA

Deep Learning mit PowerAI - Ein Überblick

NVIDIA DLI HANDS-ON TRAINING COURSE CATALOG

DEFECT INSPECTION FROM SCRATCH TO PRODUCTION. Andrew Liu, Ryan Shen Deep Learning Solution Architect

Recurrent Neural Network (RNN) Industrial AI Lab.

Leveraging AI on the Cloud to transform your business. Florida Business Analytics Forum 2018 at University of South Florida

Vinnie Saini Cloud Solution Architect Big Data & AI

Deep Learning Inference on Openshift with GPUs

Visual Concept Detection and Linked Open Data at the TIB AV- Portal. Felix Saurbier, Matthias Springstein Hamburg, November 6 SWIB 2017

Practical Applications of Machine Learning for Image and Video in the Cloud

S INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS

Wightman DIGITAL TV. Quick Reference Guide

Connecting relevant video content to audiences CREDENTIALS DECK

The Big Game on YouTube. Trends Report 2016

Presenter: Felix Goldberg, Ph.D. Chief AI Scientist Artificial Intelligence Cart AIC

Voice, Image, Video : AI in action with AWS. 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

NVIDIA DEEP LEARNING INSTITUTE

Why data science is the new frontier in software development

Welcome to Google Fiber

A NEW COMPUTING ERA JENSEN HUANG, FOUNDER & CEO GTC CHINA 2017

Embedded GPGPU and Deep Learning for Industrial Market

Features Guide. Get the most from your device.

IPTV Middleware Remote Control & DVR User Guide. Version 2.0

Internet Quick Start Guide

Using the Program Guide

How to Build Optimized ML Applications with Arm Software

Automation.

Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig

If you are watching a program and decide that you want to record the remainder of the program you can easily start the recording.

Digital TV. Quick Reference Guide

All it takes is One to experience it all.

2015 The MathWorks, Inc. 1

Click to edit Master title style

IPTV Middleware Remote Control & DVR User Guide. Version 4.0

Artificial Intelligence Powered Brand Identification and Attribution for On Screen Content

A Hybrid Neural Model for Type Classification of Entity Mentions

AI & Machine Learning at Amazon

How to Build Optimized ML Applications with Arm Software

Real-Time Content-Based Adaptive Streaming of Sports Videos

Remote Control & Cloud DVR User Guide

Demystifying Deep Learning

Application of Deep Learning Techniques in Satellite Telemetry Analysis.

Features Guide. Get the most from your phone.

MAKING SEARCH INTELLIGENT WITH AI

NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORKS

Machine Learning 13. week

Multimedia Databases. 9 Video Retrieval. 9.1 Hidden Markov Model. 9.1 Hidden Markov Model. 9.1 Evaluation. 9.1 HMM Example 12/18/2009

TELEVISION. WiFi Plans. Interactive Guide and DVR (Digital Video Recorder) Manual for the Amazon Fire TV Stick. WiFi ARVIG arvig.

Research at Google: With a Case of YouTube-8M. Joonseok Lee, Google Research

Reveal The Secrets of Your Videos. Zoltan Nemeth, Watson Media / Budapest Lab

DIGITAL TV USER GUIDE

Features Guide. Get the most from your phone.

Lucida Sirius and DjiNN Tutorial

Deep Learning Frameworks. COSC 7336: Advanced Natural Language Processing Fall 2017

Guiding your way to great entertainment

Remote Control & DVR User Guide

State of the Art and Trends in Search Engine Technology. Gerhard Weikum

BRIEF CASE STUDY COMPETITION AREA: The area for residential customers / Department for marketing TV services and content

Highlights from the TV & video. consumer. trend report 2011

An Introduction to Deep Learning with RapidMiner. Philipp Schlunder - RapidMiner Research

IPTV Middleware Remote Control & DVR User Guide. Version 5.0

Important Note: Not all features described in this guide are available to WCTA customers.

HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads. Natalia Vassilieva, Sergey Serebryakov

Pay TV solution from ADB

Features Guide. Get the most from your BlackBerry Curve 8530 smartphone.

Defense Data Generation in Distributed Deep Learning System Se-Yoon Oh / ADD-IDAR

~~ Read Deep Learning in Python: Master Data Science and Machine Learning with Modern Neural Networks written in Python... how to download books

Features Guide Get the most from your phone.

WELCOME TO KAPOST. Kapost Content Gallery: Getting Started Guide for Admins. Kapost Content Gallery

IPTV Middleware Remote Control & DVR User Guide

New Research Trends in Cybersecurity Privacy Attacks on Decentralized Deep Learning

A backward glance and a forward view

ManageMyTVs iphone & ipad App Application User Guide

TensorFlow: A System for Learning-Scale Machine Learning. Google Brain

BlueSky TV. User guide. See it. Say it. Love it. Shaw BlueSky TV Manual

Deep Learning for Recommender Systems

Demystifying Deep Learning

Building the Most Efficient Machine Learning System

Search Framework for a Large Digital Records Archive DLF SPRING 2007 April 23-25, 25, 2007 Dyung Le & Quyen Nguyen ERA Systems Engineering National Ar

H6201 SMART LED TV 46" 50" SPEC SHEET PRODUCT HIGHLIGHTS. Wi-Fi Built In. key features

MIOVISION DEEP LEARNING TRAFFIC ANALYTICS SYSTEM FOR REAL-WORLD DEPLOYMENT. Kurtis McBride CEO, Miovision

IBM Leading High Performance Computing and Deep Learning Technologies

H6203 SMART LED TV 40" 46" 50" 55" 60" 65" SPEC SHEET PRODUCT HIGHLIGHTS. Wi-Fi Built In. key features

Core ML in Depth. System Frameworks #WWDC17. Krishna Sridhar, Core ML Zach Nation, Core ML

Embarquez votre Intelligence Artificielle (IA) sur CPU, GPU et FPGA

Building the Most Efficient Machine Learning System

Transform your video services with a cloud platform: Succeed in a fragmented marketplace

Machine Learning in WAN Research

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

Features Guide Get the most from your phone.

Deep learning for music, galaxies and plankton

Contents. Remote Control Playback Controls What s on TV? Using the OK Button Using the Info Button... 6

All it takes is One to experience it all.

CSC 578 Neural Networks and Deep Learning

TO IMAGINE USER GUIDE

Transcription:

How GPUs Power Comcast's X1 Voice Remote and Smart Video Analytics Jan Neumann Comcast Labs DC May 10th, 2017

Comcast Applied Artificial Intelligence Lab Media & Video Analytics Smart TV Deep Learning Data Science Voice & NLP Smart Home Recommendations & Search Smart Internet 2

Today: How Comcast Uses AI to Evolve and Reinvent the TV Experience Media & Video Analytics Smart TV Deep Learning Data Science Voice & NLP Smart Home Recommendations & Search Smart Internet 3

Online Video Netflix AI for Content Discovery Voice Search LIVETV 4

X1 Smart TV with Voice Query: HBO Voice remote ASR NLP modules Set-top Box TV query action Answer Selector 5

Open NLP: Multiple Domains with Voice TV query Domain Selector HOME Answer Answer Selector Selector response NEWS CUSTOMER CARE...... 6

Open NLP: Multiple Domains with Voice 0.15 TV turn on the heat Domain Selector 0.80 0.02 HOME NEWS Answer Answer Selector Selector response 0.03 CUSTOMER CARE Threshold=0.10 Selected={TV, Home} Applicable={TV, Home} Precision=100% Recall=100% 7

Open NLP: Multiple Domains with Voice 0.04 TV Show me my password Domain Selector 0.03 HOME Answer Answer Selector Selector response 0.03 NEWS 0.90 CUSTOMER CARE Threshold=0.10 Selected={Customer Care} Applicable={Customer Care} Precision=100% Recall=100% 8

Domain Selector in Practice Cascade of Deep Learning Models of increasing complexity HBO Entity Detection Service YES NO NO NO Simple Model YES Complex Model YES DO NOT SEND TO DOMAIN SEND TO DOMAIN 9

Domain Selector in Practice Cascade of Deep Learning Models of increasing complexity Show me funny comedies Entity Detection Service NO YES NO NO Simple Model YES Complex Model YES DO NOT SEND TO DOMAIN SEND TO DOMAIN 10

X1 Smart TV with Voice Query: who plays the oracle in matrix Voice remote ASR Question (text) query 11 Set-top Box NLP modules QA Answer (id or text) action TV

First-order Question Answering Given: Question in natural-language form q Structured knowledge base that contains list of facts [ subject relation (attribute) object ] Tom subject Matrix Hanks 9/1/1956 Keanu object Reeves Return: Answer to q attribute Neo Assuming: q answerable by a single fact. Source entity mentioned in q. Answer is neighbor of source entity node. 12

Question Answering with Knowledge Graph Question How old is Tom Hanks? Predict Relation Extract names / titles Entities [ e 1,, e N ] Train relation r Knowledge Graph Structured Query subj rel obj Subj=e 1 Obj=? Rel=r Search e 1 r e 2 Generate Answer Text answer 13

Question Answering with Knowledge Graph Question How old is Tom Hanks? Predict Relation relation birth r Extract names Tom / titles Entities [ Hanks e 1,, e N ] Train Knowledge Graph Structured Query subj rel obj Subj=e 1 Obj=e = 2? Rel=r Subj=Tom Hanks Rel=birth Search Tom Hanks birth 1956 Tom Hanks e 1 r e is 55 years 2 old. Generate Answer Tom Hanks Text is 59 answer years old 14

Question Answering with Knowledge Graph using Recurrent Neural Networks (RNNs) Question Predict Relation relation r Entity names / titles Detection [ e 1,, e N ] Structured Query subj=e obj=? attr=? rel=r memory Entity Detection ~ Tagging NA Subj Subj NA NA memory Relation Prediction ~ Classification place of birth where Tom Hanks was born where Tom Hanks was born 15

Recurrent Neural Networks output LOC PER LOC PER 0.39 0.61 0.89 0.11 hidden memory input word washington heights 16

Online Video AI for Content Discovery Automatic NetflixContent Analysis LIVETV 17

Most metadata is at the asset level Genres Credits Synopsis Keywords 18

Much more data exists within the asset Chapters Moments Annotations Movie Frame Shot Scene Chapter 19

Why is this useful? What are the best moments on TV? In-game highlight navigation Who is in this scene? Search & Recommendations 20

How does Automatic Content Analysis work? Video Computer Vision Chaptering Audio Analysis Natural Language Processing AI & Machine Learning Scene-level Annotations Frame-level Annotations 21

Why is it possible now? Big Data Cloud/GPU Computing Better Algorithms (Deep learning) Large-scale Image recognition performance 22

Super-human accuracy in speech and image recognition! Big Data Cloud/GPU Computing Better Algorithms (Deep learning) Large-scale Image recognition performance 23

New experiences! Big Data 24 Cloud/GPU Computing Better Algorithms (Deep learning)

Example Application: In-Game Highlights Place highlights over games recorded onto customers DVRs for football, baseball, hockey, basketball and soccer. In-Game Highlights Feature for NFL has been released on Comcast X1 last fall I ll record as many games as I can. When I don t want to watch the whole game, it s a great way to do it. Customer Testimonial 25

Online Video Netflix AI for Content Discovery Personalization LIVETV 26

Personalized Entertainment Experiences What is popular right now? What do you like? + Personalized Recommendations = 27

What should I watch right now? Deep learning-based recommender system for Live TV - Training a joint embedding space to combine the scores - Channel- and Program-based recommendations - Time-dependent recommendations - Trending/popular and personal favorite channels, programs, sport teams - Rich content descriptions from automatic content analysis Favorite Channels Favorite Programs Content Descriptions Collaborative Filtering Trending Popularity Live TV Recommender System 28

Online Video Netflix Deep Learning Infrastructure LIVETV 29

Deep Learning Infrastructure Deep Learning Frameworks Keras, Tensorflow, Theano, PyTorch, Caffee (older models) All deployments using nvidia-docker Thanks to Nvidia solutions team to help with best practices All deep learning training done on multi-gpu servers Nvidia Tesla (Production) and 8xTitan X (Dev) GPUs Nvidia DGX-1 for large scale training video and nlp Next steps Container scheduler Kubernetes and Hashicorp Nomad Network compression/simplification for increased efficiency (TensorRT) 30

Deep Learning-based ML is applied everywhere at Comcast Machine Learning Data Science Big Data AI Improving Customer Experience Everywhere at Comcast/NBCU For more info see: dclabs.comcast.com High Speed Internet Video IP Telephony Home Security / Automation Universal Parks Media Properties 31