Liangjie Hong*, Dawei Yin*, Jian Guo, Brian D. Davison*

Similar documents
Partitioning Algorithms for Improving Efficiency of Topic Modeling Parallelization

Spatial Latent Dirichlet Allocation

Parallelism for LDA Yang Ruan, Changsi An

Latent Topic Model Based on Gaussian-LDA for Audio Retrieval

Configuring Topic Models for Software Engineering Tasks in TraceLab

A Measurement Design for the Comparison of Expert Usability Evaluation and Mobile App User Reviews

A LDA-Based Approach for Semi-Supervised Document Clustering

jldadmm: A Java package for the LDA and DMM topic models

Finding Influencers within Fuzzy Topics on Twitter

Feature LDA: a Supervised Topic Model for Automatic Detection of Web API Documentations from the Web

Score function estimator and variance reduction techniques

Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University

Integrating Image Clustering and Codebook Learning

The Open University s repository of research publications and other research outputs. Search Personalization with Embeddings

Scalable Object Classification using Range Images

Dynamic Embeddings for User Profiling in Twitter

Effective Latent Space Graph-based Re-ranking Model with Global Consistency

Multimodal topic model for texts and images utilizing their embeddings

Warped Mixture Models

Improving Difficult Queries by Leveraging Clusters in Term Graph

Factor Topographic Latent Source Analysis: Factor Analysis for Brain Images

Monitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression

Variational Inference for Non-Stationary Distributions. Arsen Mamikonyan

Continuous Time Group Discovery in Dynamic Graphs

Improving Face Recognition by Exploring Local Features with Visual Attention

Topic-Level Random Walk through Probabilistic Model

Mondrian Forests: Efficient Online Random Forests

Variational Methods for Discrete-Data Latent Gaussian Models

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Analysis of Incomplete Multivariate Data

Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting

10-701/15-781, Fall 2006, Final

Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language

Ensemble of Specialized Neural Networks for Time Series Forecasting. Slawek Smyl ISF 2017

Semantic Segmentation. Zhongang Qi

ECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov

Predictive Discrete Latent Factor Models for Large Scale Dyadic Data Deepak Agarwal, Srujana Merugu Yahoo! Research

A spatio-temporal model for extreme precipitation simulated by a climate model.

Exploiting Conversation Structure in Unsupervised Topic Segmentation for s

Tag2Word: Using Tags to Generate Words for Content Based Tag Recommendation

Statistical Matching using Fractional Imputation

Clustering Relational Data using the Infinite Relational Model

TGNet: Learning to Rank Nodes in Temporal Graphs. Qi Song 1 Bo Zong 2 Yinghui Wu 1,3 Lu-An Tang 2 Hui Zhang 2 Guofei Jiang 2 Haifeng Chen 2

Modeling pigeon behaviour using a Conditional Restricted Boltzmann Machine

Mondrian Forests: Efficient Online Random Forests

Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu

Clustering and The Expectation-Maximization Algorithm

MCMC Methods for data modeling

arxiv: v1 [cs.cl] 18 Jan 2015

Cut-And-Stitch: Efficient Parallel Learning of Linear Dynamical Systems on SMPs

RankTopic: Ranking Based Topic Modeling

Applications of admixture models

RSDC 09: Tag Recommendation Using Keywords and Association Rules

Semantic Image Retrieval Using Correspondence Topic Model with Background Distribution

Topic Modeling for Large-Scale Multimedia Analysis and Retrieval

CS281 Section 9: Graph Models and Practical MCMC

Risk Minimization and Language Modeling in Text Retrieval Thesis Summary

Case Study IV: Bayesian clustering of Alzheimer patients

19: Inference and learning in Deep Learning

Constructing Popular Routes from Uncertain Trajectories

STREAMING RANKING BASED RECOMMENDER SYSTEMS

The Pre-Image Problem in Kernel Methods

Graphical Models, Bayesian Method, Sampling, and Variational Inference

Measuring Similarity to Detect

2 E6885 Network Science Lecture 10: Analysis of Network Flow

Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky

Lecture 21 : A Hybrid: Deep Learning and Graphical Models

Comparing different interpolation methods on two-dimensional test functions

arxiv: v1 [stat.me] 29 May 2015

Hierarchical Modelling for Large Spatial Datasets

Multilabel Learning for Automatic Web Services Tagging

Logistic Regression. Abstract

CS229 Final Project: Predicting Expected Response Times

Desiging and combining kernels: some lessons learned from bioinformatics

Promoting Ranking Diversity for Biomedical Information Retrieval based on LDA

AUTOMATIC LABELING OF RSS ARTICLES USING ONLINE LATENT DIRICHLET ALLOCATION

Unusual Event Detection using Sparse Spatio- Temporal Features and Bag of Words Model

COMS 4771 Clustering. Nakul Verma

Modeling Criminal Careers as Departures From a Unimodal Population Age-Crime Curve: The Case of Marijuana Use

Deep Generative Models Variational Autoencoders

Using Semantic Similarity in Crawling-based Web Application Testing. (National Taiwan Univ.)

Text Document Clustering Using DPM with Concept and Feature Analysis

Clustering K-means. Machine Learning CSEP546 Carlos Guestrin University of Washington February 18, Carlos Guestrin

LTI Thesis Defense: Riemannian Geometry and Statistical Machine Learning

VisoLink: A User-Centric Social Relationship Mining

Ludwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer

A Shape Aware Model for semi-supervised Learning of Objects and its Context

Document Clustering and Visualization with Latent Dirichlet Allocation and Self-Organizing Maps

WebSci and Learning to Rank for IR

Clustering Lecture 5: Mixture Model

#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks

Bilevel Sparse Coding

Classification. 1 o Semestre 2007/2008

Statistical Models for Exploring Individual Communication Behavior

A Text Clustering Algorithm Using an Online Clustering Scheme for Initialization

Gradient of the lower bound

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)

Spatial Outlier Detection

Dynamic Models with R

Person Re-identification for Improved Multi-person Multi-camera Tracking by Continuous Entity Association

Transcription:

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Liangjie Hong*, Dawei Yin*, Jian Guo, Brian D. Davison* Dept. of Computer Science and Engineering, Lehigh University, Bethlehem, PA, USA * Dept. of Statistics, University of Michigan, Ann Arbor, MI, USA

Outline motivation related work our model experiments conclusion & future work

Motivation ever-growing datasets topics or interests drift over time

Motivation track the popularity of topics understand correlations between terms

Motivation track the popularity of topics understand correlations between terms

tracking might be difficult

tracking might be difficult

tracking might be difficult

Problems need to know all the keywords beforehand whether they are thematically related or not?

Problems need to know all the keywords beforehand whether they are thematically related or not? based on historical data, is it possible to do prediction?

Related Work temporal topic models

Related Work temporal topic models general-purpose models hard to evaluate

Related Work evaluation of temporal topic models Temporal Perplexity [Blei & Lafferty, 2006] [Nallapati et al., 2007] [Wang et al., 2008] [Wang et al., 2009] [Wang et al., 2010] [Ahmed et al., 2010] [Iwata et al., 2010] [N. Kawamae et al., 2010] Timestamp Prediction [Wang et al., 2006] [Wang et al., 2008] [N. Kawamae et al., 2010] Classification/Clustering [Zhang et al., 2010] Ad-hoc [Wang et al., 2006] [Wang et al., 2009] [Zhang et al., 2010]

Problem Definition Input: text documents, segmented into time epochs Output: cluster terms track term volumes

Our Approach two sub-tasks: cluster terms temporal topic models tracking volumes linear regression

Our Approach two sub-tasks: cluster terms temporal topic models tracking volumes linear regression combine two tasks reinforce with each other

Input: text documents, segmented into time epochs Output: cluster terms topics (distribution over terms) track term volumes volume regression

Input: text documents, segmented into time epochs Output: cluster terms topics (distribution over terms) track term volumes volume regression

Term Volume Tracking topics as latent features term volumes as response

Term Volume Tracking topics as latent features term volumes as response independence assumption

+ Term Volume Time

β + π Term Volume Y Time

Problem How to obtain β over time?

Problem How to obtain β over time? state-space model special proposed functions linear combination

Problem How to obtain β over time? state-space model special proposed functions linear combination

Incorporate Term Volumes with LDA θ θ z z w w β β Y Π Y

Incorporate Term Volumes with LDA

Approximate Inference Obtain per document: θ per word: z per time epoch, per topic: β per word: π

Approximate Inference Obtain per document: θ per word: z per time epoch, per topic: β per word: π Gibbs Sampling Variational Inference

Approximate Inference Obtain per document: θ per word: z per time epoch, per topic: β per word: π Gibbs Sampling Variational Inference

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Variational Inference Initialize β randomly. while relative improvement in L > 0.00001 do E step : for t = 1 to T do for i = 1 to D do Update λ d Update ϕ d M Step : for v = 1 to V do Update π v Update σ v for t = 1 to T do Update β t by using Conjugate Gradient Prediction: state-space model s common practice

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Experiments NIPS dataset: 4,360 papers with 38,029 distinct terms, 24 years. ACL dataset: 14,590 papers with 74,189 distinct terms, 37 years. Metric:

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines Univariate Autoregressive Model AR(p): Multivariate Autoregressive Model MAR(p): LDA Dynamic Topic Model [Blei & Lafferty, 2006]

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines Univariate Autoregressive Model AR(p): Multivariate Autoregressive Model MAR(p): LDA Dynamic Topic Model [Blei & Lafferty, 2006]

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Conclusion & Future work clustering terms + tracking terms topic modeling + state-space model + supervised learning latent features help prediction explore other temporal models (see [Hong et al. 2011]) capture correlations between terms explore more efficient inference algorithms

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Thank you! Contact Info: Liangjie Hong hongliangjie@lehigh.edu WUME Laboratory Computer Science and Engineering Lehigh University Bethlehem, Pennsylvania, USA

Variational Inference with Kalman Filter θ θ θ θ z z z z w w w w β β β β β β β β t t+1 t+2 t+3

Variational Inference In variational inference, we consider a simplified graphical model with variational parameters γ, φ and minimize the KL Divergence between the variational and posterior distributions.

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Variational Inference with Kalman Filter State-space Model Two basic operations: Smoothing Filtering

Tracking Trends: Incorporating Term Volume into Temporal Topic Models Variational Inference with Kalman Filter The variational parameters are: Dirichlet Multinomial Observations for Kalman Filter