Query Intent Detection using Convolutional Neural Networks

Similar documents
Query Intent Detection using Convolutional Neural Networks

Natural Language Processing with Deep Learning CS224N/Ling284

Neural Network Models for Text Classification. Hongwei Wang 18/11/2016

Empirical Evaluation of RNN Architectures on Sentence Classification Task

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank text

Machine Learning 13. week

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

Densely Connected Bidirectional LSTM with Applications to Sentence Classification

ABC-CNN: Attention Based CNN for Visual Question Answering

Convolutional Networks for Text

Perceptron: This is convolution!

A Text Classification Model Using Convolution Neural Network and Recurrent Neural Network

End-To-End Spam Classification With Neural Networks

Domain-Aware Sentiment Classification with GRUs and CNNs

Natural Language Processing with Deep Learning CS224N/Ling284

FastText. Jon Koss, Abhishek Jindal

Combining Neural Networks and Log-linear Models to Improve Relation Extraction

JOINT INTENT DETECTION AND SLOT FILLING USING CONVOLUTIONAL NEURAL NETWORKS. Puyang Xu, Ruhi Sarikaya. Microsoft Corporation

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition

Semantic image search using queries

Clinical Name Entity Recognition using Conditional Random Field with Augmented Features

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides

SEMANTIC COMPUTING. Lecture 9: Deep Learning: Recurrent Neural Networks (RNNs) TU Dresden, 21 December 2018

Machine Learning. Deep Learning. Eric Xing (and Pengtao Xie) , Fall Lecture 8, October 6, Eric CMU,

Object Detection Lecture Introduction to deep learning (CNN) Idar Dyrdal

CIS 660. Image Searching System using CNN-LSTM. Presented by. Mayur Rumalwala Sagar Dahiwala

Deep Learning Applications

Index. Umberto Michelucci 2018 U. Michelucci, Applied Deep Learning,

Study of Residual Networks for Image Recognition

Know your data - many types of networks

A Hybrid Neural Model for Type Classification of Entity Mentions

Slide credit from Hung-Yi Lee & Richard Socher

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon

Sentiment Classification of Food Reviews

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

Neural Machine Translation In Linear Time

How GPUs Power Comcast's X1 Voice Remote and Smart Video Analytics. Jan Neumann Comcast Labs DC May 10th, 2017

Lecture: Deep Convolutional Neural Networks

All You Want To Know About CNNs. Yukun Zhu

Deep Learning with Tensorflow AlexNet

PTE : Predictive Text Embedding through Large-scale Heterogeneous Text Networks

Deep Learning for Computer Vision II

Hello Edge: Keyword Spotting on Microcontrollers

Classifying Depositional Environments in Satellite Images

Question Answering over Knowledge Bases: Entity, Text, and System Perspectives. Wanyun Cui Fudan University

Index. Springer Nature Switzerland AG 2019 B. Moons et al., Embedded Deep Learning,

LSTM: An Image Classification Model Based on Fashion-MNIST Dataset

Layerwise Interweaving Convolutional LSTM

16-785: Integrated Intelligence in Robotics: Vision, Language, and Planning. Spring 2018 Lecture 14. Image to Text

Transition-based Parsing with Neural Nets

Encoding RNNs, 48 End of sentence (EOS) token, 207 Exploding gradient, 131 Exponential function, 42 Exponential Linear Unit (ELU), 44

A Deep Learning primer

Lecture 2 Notes. Outline. Neural Networks. The Big Idea. Architecture. Instructors: Parth Shah, Riju Pahwa

Outline. Morning program Preliminaries Semantic matching Learning to rank Entities

Image-Sentence Multimodal Embedding with Instructive Objectives

Sentiment Analysis for Amazon Reviews

Unstructured Data. CS102 Winter 2019

DEEP LEARNING REVIEW. Yann LeCun, Yoshua Bengio & Geoffrey Hinton Nature Presented by Divya Chitimalla

CS231N Section. Video Understanding 6/1/2018

CS 523: Multimedia Systems

Deep Learning on Graphs

Recurrent Neural Nets II

Team UKNLP: Detecting ADRs, Classifying Medication Intake Messages, and Normalizing ADR Mentions on Twitter

Deep neural networks II

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. By Joa õ Carreira and Andrew Zisserman Presenter: Zhisheng Huang 03/02/2018

Report: Privacy-Preserving Classification on Deep Neural Network

Show, Discriminate, and Tell: A Discriminatory Image Captioning Model with Deep Neural Networks

Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction

Tutorial on Machine Learning Tools

Deep Learning on Graphs

(Multinomial) Logistic Regression + Feature Engineering

Application of Deep Learning Techniques in Satellite Telemetry Analysis.

Neural Nets & Deep Learning

CMU Lecture 18: Deep learning and Vision: Convolutional neural networks. Teacher: Gianni A. Di Caro

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

Action recognition in robot-assisted minimally invasive surgery

CSC 578 Neural Networks and Deep Learning

Deep Learning. Volker Tresp Summer 2014

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University

Sentiment Analysis using Recursive Neural Network

Word2vec and beyond. presented by Eleni Triantafillou. March 1, 2016

Predicting your Next Stop-Over from Location-Based Social Network Data with Recurrent Neural Networks RecTour workshop 2017 RecSys 2017, Como, Italy

Keras: Handwritten Digit Recognition using MNIST Dataset

Real-time Gesture Pattern Classification with IMU Data

Weighted Convolutional Neural Network. Ensemble.

Deep Learning Workshop. Nov. 20, 2015 Andrew Fishberg, Rowan Zellers

Face Recognition A Deep Learning Approach

Indoor Object Recognition of 3D Kinect Dataset with RNNs

Convolutional-Recursive Deep Learning for 3D Object Classification

Asynchronous Parallel Learning for Neural Networks and Structured Models with Dense Features

XES Tensorflow Process Prediction using the Tensorflow Deep-Learning Framework

CNN Basics. Chongruo Wu

Recurrent Neural Networks and Transfer Learning for Action Recognition

Machine Learning With Python. Bin Chen Nov. 7, 2017 Research Computing Center

C. The system is equally reliable for classifying any one of the eight logo types 78% of the time.

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

EECS 496 Statistical Language Models. Winter 2018

A Quick Guide on Training a neural network using Keras.

Learning Binary Code with Deep Learning to Detect Software Weakness

Transcription:

Query Intent Detection using Convolutional Neural Networks Homa B. Hashemi, Amir Asiaee, Reiner Kraft QRUMS workshop - February 22, 2016

Query Intent Detection michelle obama age Query Intent Detection Person Birth Date 2 Query Intent Detection using CNN

Query Intent Detection michelle obama age barack obama wife birthday Query Intent Detection Person Birth Date 3 Query Intent Detection using CNN

Roadmap Approaches Proposed Method Architecture Convolutional Neural Networks model Experiments Query Classification Query Clustering 4 Query Intent Detection using CNN

Approaches Rule-Based Ø Define new rules for a new use case Query Intent Detection Feature Engineering Ø Define features Classification-Based 5 Query Intent Detection using CNN Deep Learning Representations Ø Automatically embed queries into a vector space

Deep Learning in NLP Deep learning achieved state-of-the-art results in computer vision and speech Fast becoming popular in NLP Learning word embeddings (Word2Vec) Learning sentence and document embeddings Semantic parsing (Yih, 2014) Information retrieval (Shen 2014, Huang 2013) Sentiment analysis (RNN and RNTN, Socher et al. 2011, 2012, 2013; Kalchbrenner 2014;Dos Santos 2014; Le 2014; Kim 2014) Ø Not much work on Short Text like queries Ø Not much work on multi-class classification with high number of classes 6

Model Architecture Train Time/Offline Queries CNN Query Vectors Classifier Test Time/Online New Query CNN Query Vectors Classifier Predicted Intents 7

Query representations with Word2Vec 1) Get query word vectors 2) Sum or Average vectors barack obama wife age barack obama wife age 0.1 2.3 4.5 0.6 7 8

Convolutional Neural Networks Use pre-trained word vector representations (Google word2vec) c i = f (w.x + b) c = [c 1, c 2,..., c n ] ĉ = max{c} barack obama wife age Multiple filters 0.1 2.3 4.5 0.6 7 Word embedding - Query matrix Feature maps max pooling Query representation 9 Based on Collober and Weston (2011) and Kim (2014)

Data # of queries 10,000 # of low-level intent classes 125 # of high-level intent classes 14 10 High-level intent Movie Person Low-level intent Rating Cast Length Release data Spouse Birth date Children

Query intent detection with 125 low-level intent classes 83.00 73.00 75.06 81.26 81.57 10,000 queries 125 intent classes 10-fold CV Random Forest 63.00 53.00 43.00 46 56 62 39 45 47 Jabba Rule-based Sum w2v Average w2v Unigram Unigram+bigram CNN 11 33.00 Accuracy Avg Precision Avg Recall

Person and Movie entities 1 0.8 0.6 0.4 Precision Recall Precision Recall Precision Recall Precision Recall Precision Recall Spouse BirthDate Height Children Birth Location Rule-based Unigram CNN 1 0.8 0.6 0.4 Precision Recall Precision Recall Precision Recall Precision Recall Precision Recall Movie Cast Release Date Rating Length Watch Online 12

Query intent detection with 14 high-level intent classes 10,000 queries 14 intent classes 100 89.9 90.3 10-fold CV Random Forest 80 77 71 Jabba Rule-based Sum w2v 60 56 Unigram 40 45 34 37 43 CNN 20 Accuracy Precision Recall 13

Query Clustering K-means K= 125 10,000 query vector representations george selvie wiki james brown biography bob toski bio cast of annie 2014 big bang cast independence day cast and crew cast of hollow man lee rocker band members santana band members salif keita band members lisa bonet pictures bleona qereti photos valeen montenegro images jennifer aniston wallpapers hd alma wade pics hitler images ninja turtles youtube if i stay full movie viooz watch homeland online the outlander streaming john candy death ryan knight has died is bruce jenner dead 14

Conclusion Use CNN to get query vector representations for Query Classification Query Clustering Query vector representations Can be used in all the tasks related to query Can be combined with other methods E.g. combining with rule-based outputs and N-grams to get more discriminative features Future work: Exploit query vectors using recurrent neural networks with LSTM cells 15

Detect Emerging intents Advantage of CNN over n-grams: Embedding queries into a vector space Detect outliers as new set of queries 17

Detect Emerging intents Advantage of CNN over n-grams: Embedding queries into a vector space Detect outliers as new set of queries Possible new query intent 18

CNN hyperparameters Window filter sizes = 2, 3, 4 Each filter has 100 feature maps Mibi batch size = 50 Epochs = 40 Nonlinearity = relu Regularization = Dropout Dropout rate = 0.5 Randomly set some weights to zero to prevent overfitting 19