Conversational Knowledge Graphs. Larry Heck Microsoft Research

Similar documents
Leveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing. Interspeech 2013

Situated Conversational Interaction

Larry Heck Contributors

USING A KNOWLEDGE GRAPH AND QUERY CLICK LOGS FOR UNSUPERVISED LEARNING OF RELATION DETECTION. Microsoft Research Mountain View, CA, USA

Rapidly Building Domain-Specific Entity-Centric Language Models Using Semantic Web Knowledge Sources

Learning Semantic Entity Representations with Knowledge Graph and Deep Neural Networks and its Application to Named Entity Disambiguation

Introduction to Text Mining. Hongning Wang

CS249: ADVANCED DATA MINING

The Connected World and Speech Technologies: It s About Effortless

Large-Scale Syntactic Processing: Parsing the Web. JHU 2009 Summer Research Workshop

Searching complex graphs

LIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases

Searching the Deep Web

From HyperTEXT to HyperTEC

Unsupervised Semantic Parsing

Available online at ScienceDirect. Procedia Computer Science 60 (2015 )

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

Personalized Entity Recommendation: A Heterogeneous Information Network Approach

Searching the Deep Web

Using Data Mining to Determine User-Specific Movie Ratings

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

End-User Evaluations of Semantic Web Technologies

Human Robot Interaction

Logic Programming: from NLP to NLU?

VOICE ENABLING THE AUTOSCOUT24 CAR SEARCH APP. 1 Introduction. 2 Previous work

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer

5/13/2009. Introduction. Introduction. Introduction. Introduction. Introduction

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.

Information Retrieval

An Efficient Approach for Color Pattern Matching Using Image Mining

Question Answering over Knowledge Bases: Entity, Text, and System Perspectives. Wanyun Cui Fudan University

Extending Keyword Search to Metadata in Relational Database

An Integrated Framework to Enhance the Web Content Mining and Knowledge Discovery

Automatic Transcription of Speech From Applied Research to the Market

Columbia University High-Level Feature Detection: Parts-based Concept Detectors

I. Historical Person Movie Student pd

Named Entity Detection and Entity Linking in the Context of Semantic Web

Part I: Data Mining Foundations

EVALITA 2009: Loquendo Spoken Dialog System

Text Mining for Software Engineering

SQL Lab: Assignment 2 (due to )

Background. Problem Statement. Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. Deep (hidden) Web

MIND THE GOOGLE! Understanding the impact of the. Google Knowledge Graph. on your shopping center website.

KNOWLEDGE GRAPH INFERENCE FOR SPOKEN DIALOG SYSTEMS

Beyond Ten Blue Links Seven Challenges

Titanic And The Making Of James Cameron By Paula Parisi

A Simple Syntax-Directed Translator

Enhancing applications with Cognitive APIs IBM Corporation

The Latest Progress of KB-QA. Ming Zhou Microsoft Research Asia NLPCC 2018 Workshop Hohhot, 2018

ROCHESTER INSTITUTE OF TECHNOLOGY. SQL Tool Kit

Natural Language Processing as Key Component to Successful Information Products

Module 3: GATE and Social Media. Part 4. Named entities

Dragon TV Overview. TIF Workshop 24. Sept Reimund Schmald mob:

Outline. Part I. Introduction Part II. ML for DI. Part III. DI for ML Part IV. Conclusions and research direction

An Oracle White Paper October Oracle Social Cloud Platform Text Analytics

Hello. Welcome to Loqu8 ice Learn Chinese. Start understanding and learning Chinese from your mouse.

Author-Specific Sentiment Aggregation for Rating Prediction of Reviews

How to create dialog system

Conclusion and review

(12) Patent Application Publication (10) Pub. No.: US 2014/ A1

Session 2 A virtual Observatory for TerraSAR-X data

Semantics Isn t Easy Thoughts on the Way Forward

Sentiments Analysis of Users Review to Improve 5 Star Rating Method for a Recommendation System

Historical Text Mining:

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2

Voice Access to Music: Evolution from DSLI 2014 to DSLI 2016

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Online music is constantly changing. Static artist bios and old images from the archives no longer deliver an engaging experience.

Chapter 50 Tracing Related Scientific Papers by a Given Seed Paper Using Parscit

What s in a name? Rebecca Dridan -

Graph-Based Synopses for Relational Data. Alkis Polyzotis (UC Santa Cruz)

Large Scale Semantic Annotation, Indexing, and Search at The National Archives Diana Maynard Mark Greenwood

Media Retrieval (2) Prepared by. Ling Guan Jose Lay Paisarn Muneesawang Ning Zhang Rui Zhang. Outlines (revisited)

Development of Mobile Search Applications over Structured Web Data through Domain-Specific Modeling Languages. M.Sc. Thesis Atakan ARAL June 2012

DBpedia Spotlight at the MSM2013 Challenge

How to Find Your Most Cost-Effective Keywords

Papers for comprehensive viva-voce

Social Data Exploration

Hello. Quick Start Guide

Design of Ontology for The Internet Movie Database (IMDb) Sasikanth Avancha, Srikanth Kallurkar, Tapan Kamdar

Semantic Annotation for Semantic Social Networks. Using Community Resources

Fraunhofer IAIS Audio Mining Solution for Broadcast Archiving. Dr. Joachim Köhler LT-Innovate Brussels

GFI has tens of thousands of customers worldwide and distribution is served by a 10,000-strong Channel.

Entry Name: "INRIA-Perin-MC1" VAST 2013 Challenge Mini-Challenge 1: Box Office VAST

SEMANTIC INDEXING (ENTITY LINKING)

A Technical Overview: Voiyager Dynamic Application Discovery

INTRODUCTION. Chapter GENERAL

Introduction to Information Retrieval. Hongning Wang

The Anatomy of a Large-Scale Hypertextual Web Search Engine

Natural Language Processing. SoSe Question Answering

BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network

Syntax and Grammars 1 / 21

Mastering Xcode 7 and Swift

Advanced Training Guide

Semantic Annotation of Web Resources Using IdentityRank and Wikipedia

QuickView: NLP-based Tweet Search

COMMUNICATE. Advanced Training. West Corporation. 100 Enterprise Way, Suite A-300. Scotts Valley, CA

A BFS-BASED SIMILAR CONFERENCE RETRIEVAL FRAMEWORK

Content-Based Multimedia Information Retrieval

MarkLogic Server. Application Builder Developer s Guide. MarkLogic 8 February, Copyright 2015 MarkLogic Corporation. All rights reserved.

Transcription:

Conversational Knowledge Graphs Larry Heck Microsoft Research

Multi-modal systems e.g., Microsoft MiPad, Pocket PC TV Voice Search e.g., Bing on Xbox Task-specific argument extraction (e.g., Nuance, SpeechWorks) User: I want to fly from Boston to New York next week. Early 1990s Early 2000s Intent Determination (e.g. Nuance s Emily, AT&T HMIHY) User: Uh we want to move we want to change our phone line from this house to another house Keyword Spotting (e.g., AT&T) System: Please say collect, calling card, person, third number, or operator 2014 Virtual Personal Assistants: e.g., Siri, Google now DARPA CALO Project

Semantic Depth Scaling Depth and Breadth Head Strategy: Deep & Narrow Scale Strategy: Shallow & Broad Domain Breadth

Drama Kate Winslet Genre Starring Titanic Director James Cameron Award Release Year Oscar, Best director 1997

Pivot on Knowledge

Pivot on Knowledge

Link Satori entities to NL Surface forms: Bing queries Wikipedia Twitter MusicBrainz Drama IMDB etc. Kate Winslet Starring Genre Titanic Director James Cameron Award Release Year Oscar, Best director 1997 Solution Start from knowledge graph, mine data, auto-annotate

Manual Transcriptions Movie Actor Genre Director All Supervised CRF Lexical + Gazetteers 51.25% 86.29% 93.26% 64.86% 66.53% CRF Lexical only 46.44% 80.22% 92.83% 52.94% 61.72% Unsupervised Gazetteers only 69.69% 50.70% 15.76% 2.63% 51.14% CRF Lexical only 0.19% 9.67% 0.00% 62.83% 5.61% + Gazetteers 1.96% 72.35% 4.73% 79.03% 31.94% + Adaptation 71.72% 58.61% 29.55% 77.42% 60.38% + Relations 84.62% 61.02% ASR Output Movie Actor Genre Director All 45.15% 82.56% 88.58% 58.59% 60.96% 39.21% 74.86% 86.21% 45.36% 54.10% 59.66% 47.78% 11.80% 2.82% 43.88% 0.20% 9.67% 0.00% 57.14% 5.27% 1.74% 69.76% 3.57% 75.00% 30.77% 55.74% 62.70% 30.95% 73.21% 54.69% 80.67% 55.40% Unsupervised learning @ supervised (F-measure)

Relation Entity Entity

Entities anchor higher-level grammatical structure Induce grammars over high-confidence entities (anchor points) Repair missing entities Canadian born? directed titanic Titanic James Cameron Canada Ten most frequently occurring templates among entitybased queries (Pound et al., CIKM 12)

Manual Transcriptions Movie Actor Genre Director All Supervised CRF Lexical + Gazetteers 51.25% 86.29% 93.26% 64.86% 66.53% CRF Lexical only 46.44% 80.22% 92.83% 52.94% 61.72% Unsupervised Gazetteers only 69.69% 50.70% 15.76% 2.63% 51.14% CRF Lexical only 0.19% 9.67% 0.00% 62.83% 5.61% + Gazetteers 1.96% 72.35% 4.73% 79.03% 31.94% + Adaptation 71.72% 58.61% 29.55% 77.42% 60.38% + Relations 84.62% 61.02% ASR Output Movie Actor Genre Director All 45.15% 82.56% 88.58% 58.59% 60.96% 39.21% 74.86% 86.21% 45.36% 54.10% 59.66% 47.78% 11.80% 2.82% 43.88% 0.20% 9.67% 0.00% 57.14% 5.27% 1.74% 69.76% 3.57% 75.00% 30.77% 55.74% 62.70% 30.95% 73.21% 54.69% 80.67% 55.40% > +7% F-measure with induced relation grammars

Leveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing Exploiting the Semantic Web for Unsupervised Spoken Language Understanding Relation Detection Natural Language Semantic Parsing Using a Knowledge Graph and Query Click Logs for Unsupervised Learning of Exploiting the Semantic Web for Unsupervised

Unsupervised learning @ supervised from Search Query Logs A Weakly-Supervised Approach for Discovering New User Intents

Pivot on Knowledge

Idea: can we leverage Web (IE) session data combined with Knowledge Graphs Web search & browse Conversations/dialog Massive volume of interactions >100M queries/day, Millions of users Coverage of user interactions is high (broad domains across the web)

New Approach Successful Dialogs Goal 1: Q 4s RL 1s SR 53s SR 118s END Goal 2: Q 3s Q 5s SR 10s AD 44s END Goal 3: Q 4s RL 1s SR 53s SR 118s END Goal 4: Q 3s Q 5s SR 10s AD 44s END.. Goal n: Q 4s RL 1s SR 53s SR 118s END Goal n-1: Q 3s Q 5s SR 10s AD 44s END 1 2 3 8 6 7 4 5 9 Unsuccessful Dialogs Goal 1: Q 4s RL 1s SR 53s SR 118s END Goal 2: Q 3s Q 5s SR 10s AD 44s END Goal 3: Q 4s RL 1s SR 53s SR 118s END Goal 4: Q 3s Q 5s SR 10s AD 44s END.. Goal n: Q 4s RL 1s SR 53s SR 118s END Goal n-1: Q 3s Q 5s SR 10s AD 44s END 1 2 3 8 6 7 4 5 9

> 18% (rel.)

Probability of Intent Dialog Context Visual Context User Histories NL Realizations Semantic Parser Meaning

Conversational Systems with Depth & Breadth

Vision Strategy Thank you CONVERSATIONAL KNOWLEDGE GRAPHS 30