Liangjie Hong*, Dawei Yin*, Jian Guo, Brian D. Davison*
|
|
- Henry Robinson
- 5 years ago
- Views:
Transcription
1 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Liangjie Hong*, Dawei Yin*, Jian Guo, Brian D. Davison* Dept. of Computer Science and Engineering, Lehigh University, Bethlehem, PA, USA * Dept. of Statistics, University of Michigan, Ann Arbor, MI, USA
2 Outline motivation related work our model experiments conclusion & future work
3 Motivation ever-growing datasets topics or interests drift over time
4 Motivation track the popularity of topics understand correlations between terms
5 Motivation track the popularity of topics understand correlations between terms
6 tracking might be difficult
7 tracking might be difficult
8 tracking might be difficult
9 Problems need to know all the keywords beforehand whether they are thematically related or not?
10 Problems need to know all the keywords beforehand whether they are thematically related or not? based on historical data, is it possible to do prediction?
11 Related Work temporal topic models
12 Related Work temporal topic models general-purpose models hard to evaluate
13 Related Work evaluation of temporal topic models Temporal Perplexity [Blei & Lafferty, 2006] [Nallapati et al., 2007] [Wang et al., 2008] [Wang et al., 2009] [Wang et al., 2010] [Ahmed et al., 2010] [Iwata et al., 2010] [N. Kawamae et al., 2010] Timestamp Prediction [Wang et al., 2006] [Wang et al., 2008] [N. Kawamae et al., 2010] Classification/Clustering [Zhang et al., 2010] Ad-hoc [Wang et al., 2006] [Wang et al., 2009] [Zhang et al., 2010]
14 Problem Definition Input: text documents, segmented into time epochs Output: cluster terms track term volumes
15 Our Approach two sub-tasks: cluster terms temporal topic models tracking volumes linear regression
16 Our Approach two sub-tasks: cluster terms temporal topic models tracking volumes linear regression combine two tasks reinforce with each other
17 Input: text documents, segmented into time epochs Output: cluster terms topics (distribution over terms) track term volumes volume regression
18 Input: text documents, segmented into time epochs Output: cluster terms topics (distribution over terms) track term volumes volume regression
19 Term Volume Tracking topics as latent features term volumes as response
20 Term Volume Tracking topics as latent features term volumes as response independence assumption
21 + Term Volume Time
22 β + π Term Volume Y Time
23 Problem How to obtain β over time?
24 Problem How to obtain β over time? state-space model special proposed functions linear combination
25 Problem How to obtain β over time? state-space model special proposed functions linear combination
26 Incorporate Term Volumes with LDA θ θ z z w w β β Y Π Y
27 Incorporate Term Volumes with LDA
28 Approximate Inference Obtain per document: θ per word: z per time epoch, per topic: β per word: π
29 Approximate Inference Obtain per document: θ per word: z per time epoch, per topic: β per word: π Gibbs Sampling Variational Inference
30 Approximate Inference Obtain per document: θ per word: z per time epoch, per topic: β per word: π Gibbs Sampling Variational Inference
31 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Variational Inference Initialize β randomly. while relative improvement in L > do E step : for t = 1 to T do for i = 1 to D do Update λ d Update ϕ d M Step : for v = 1 to V do Update π v Update σ v for t = 1 to T do Update β t by using Conjugate Gradient Prediction: state-space model s common practice
32 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Experiments NIPS dataset: 4,360 papers with 38,029 distinct terms, 24 years. ACL dataset: 14,590 papers with 74,189 distinct terms, 37 years. Metric:
33 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines Univariate Autoregressive Model AR(p): Multivariate Autoregressive Model MAR(p): LDA Dynamic Topic Model [Blei & Lafferty, 2006]
34 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines Univariate Autoregressive Model AR(p): Multivariate Autoregressive Model MAR(p): LDA Dynamic Topic Model [Blei & Lafferty, 2006]
35 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines
36 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Baselines
37
38
39
40 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Conclusion & Future work clustering terms + tracking terms topic modeling + state-space model + supervised learning latent features help prediction explore other temporal models (see [Hong et al. 2011]) capture correlations between terms explore more efficient inference algorithms
41 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Thank you! Contact Info: Liangjie Hong WUME Laboratory Computer Science and Engineering Lehigh University Bethlehem, Pennsylvania, USA
42 Variational Inference with Kalman Filter θ θ θ θ z z z z w w w w β β β β β β β β t t+1 t+2 t+3
43 Variational Inference In variational inference, we consider a simplified graphical model with variational parameters γ, φ and minimize the KL Divergence between the variational and posterior distributions.
44 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Variational Inference with Kalman Filter State-space Model Two basic operations: Smoothing Filtering
45 Tracking Trends: Incorporating Term Volume into Temporal Topic Models Variational Inference with Kalman Filter The variational parameters are: Dirichlet Multinomial Observations for Kalman Filter
Partitioning Algorithms for Improving Efficiency of Topic Modeling Parallelization
Partitioning Algorithms for Improving Efficiency of Topic Modeling Parallelization Hung Nghiep Tran University of Information Technology VNU-HCMC Vietnam Email: nghiepth@uit.edu.vn Atsuhiro Takasu National
More informationSpatial Latent Dirichlet Allocation
Spatial Latent Dirichlet Allocation Xiaogang Wang and Eric Grimson Computer Science and Computer Science and Artificial Intelligence Lab Massachusetts Tnstitute of Technology, Cambridge, MA, 02139, USA
More informationParallelism for LDA Yang Ruan, Changsi An
Parallelism for LDA Yang Ruan, Changsi An (yangruan@indiana.edu, anch@indiana.edu) 1. Overview As parallelism is very important for large scale of data, we want to use different technology to parallelize
More informationLatent Topic Model Based on Gaussian-LDA for Audio Retrieval
Latent Topic Model Based on Gaussian-LDA for Audio Retrieval Pengfei Hu, Wenju Liu, Wei Jiang, and Zhanlei Yang National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy
More informationConfiguring Topic Models for Software Engineering Tasks in TraceLab
Configuring Topic Models for Software Engineering Tasks in TraceLab Bogdan Dit Annibale Panichella Evan Moritz Rocco Oliveto Massimiliano Di Penta Denys Poshyvanyk Andrea De Lucia TEFSE 13 San Francisco,
More informationA Measurement Design for the Comparison of Expert Usability Evaluation and Mobile App User Reviews
A Measurement Design for the Comparison of Expert Usability Evaluation and Mobile App User Reviews Necmiye Genc-Nayebi and Alain Abran Department of Software Engineering and Information Technology, Ecole
More informationA LDA-Based Approach for Semi-Supervised Document Clustering
International Journal of Machine Learning and Computing, Vol. 4, o. 4, August 014 A LDA-Based Approach for Semi-Supervised Document Clustering Ruizhang Huang, Ping Zhou, and Li Zhang Abstract In this paper,
More informationjldadmm: A Java package for the LDA and DMM topic models
jldadmm: A Java package for the LDA and DMM topic models Dat Quoc Nguyen School of Computing and Information Systems The University of Melbourne, Australia dqnguyen@unimelb.edu.au Abstract: In this technical
More informationFinding Influencers within Fuzzy Topics on Twitter
1 1. Introduction Finding Influencers within Fuzzy Topics on Twitter Tal Stramer, Stanford University In recent years, there has been an explosion in usage of online social networks such as Twitter. This
More informationFeature LDA: a Supervised Topic Model for Automatic Detection of Web API Documentations from the Web
Feature LDA: a Supervised Topic Model for Automatic Detection of Web API Documentations from the Web Chenghua Lin, Yulan He, Carlos Pedrinaci, and John Domingue Knowledge Media Institute, The Open University
More informationScore function estimator and variance reduction techniques
and variance reduction techniques Wilker Aziz University of Amsterdam May 24, 2018 Wilker Aziz Discrete variables 1 Outline 1 2 3 Wilker Aziz Discrete variables 1 Variational inference for belief networks
More informationApplied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University
Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University NIPS 2008: E. Sudderth & M. Jordan, Shared Segmentation of Natural
More informationIntegrating Image Clustering and Codebook Learning
Integrating Image Clustering and Codebook Learning Pengtao Xie and Eric Xing {pengtaox,epxing}@cs.cmu.edu School of Computer Science, Carnegie Mellon University 5000 Forbes Ave, Pittsburgh, PA 15213 Abstract
More informationThe Open University s repository of research publications and other research outputs. Search Personalization with Embeddings
Open Research Online The Open University s repository of research publications and other research outputs Search Personalization with Embeddings Conference Item How to cite: Vu, Thanh; Nguyen, Dat Quoc;
More informationScalable Object Classification using Range Images
Scalable Object Classification using Range Images Eunyoung Kim and Gerard Medioni Institute for Robotics and Intelligent Systems University of Southern California 1 What is a Range Image? Depth measurement
More informationDynamic Embeddings for User Profiling in Twitter
Dynamic Embeddings for User Profiling in Twitter Shangsong Liang 1, Xiangliang Zhang 1, Zhaochun Ren 2, Evangelos Kanoulas 3 1 KAUST, Saudi Arabia 2 JD.com, China 3 University of Amsterdam, The Netherlands
More informationEffective Latent Space Graph-based Re-ranking Model with Global Consistency
Effective Latent Space Graph-based Re-ranking Model with Global Consistency Feb. 12, 2009 1 Outline Introduction Related work Methodology Graph-based re-ranking model Learning a latent space graph A case
More informationMultimodal topic model for texts and images utilizing their embeddings
Multimodal topic model for texts and images utilizing their embeddings Nikolay Smelik, smelik@rain.ifmo.ru Andrey Filchenkov, afilchenkov@corp.ifmo.ru Computer Technologies Lab IDP-16. Barcelona, Spain,
More informationWarped Mixture Models
Warped Mixture Models Tomoharu Iwata, David Duvenaud, Zoubin Ghahramani Cambridge University Computational and Biological Learning Lab March 11, 2013 OUTLINE Motivation Gaussian Process Latent Variable
More informationImproving Difficult Queries by Leveraging Clusters in Term Graph
Improving Difficult Queries by Leveraging Clusters in Term Graph Rajul Anand and Alexander Kotov Department of Computer Science, Wayne State University, Detroit MI 48226, USA {rajulanand,kotov}@wayne.edu
More informationFactor Topographic Latent Source Analysis: Factor Analysis for Brain Images
Factor Topographic Latent Source Analysis: Factor Analysis for Brain Images Jeremy R. Manning 1,2, Samuel J. Gershman 3, Kenneth A. Norman 1,3, and David M. Blei 2 1 Princeton Neuroscience Institute, Princeton
More informationMonitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression
Proceedings of the 10th International Conference on Frontiers of Design and Manufacturing June 10~1, 01 Monitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression
More informationVariational Inference for Non-Stationary Distributions. Arsen Mamikonyan
Variational Inference for Non-Stationary Distributions by Arsen Mamikonyan B.S. EECS, Massachusetts Institute of Technology (2012) B.S. Physics, Massachusetts Institute of Technology (2012) Submitted to
More informationContinuous Time Group Discovery in Dynamic Graphs
Continuous Time Group Discovery in Dynamic Graphs Kurt T. Miller 1,2 tadayuki@cs.berkeley.edu 1 EECS University of California Berkeley, CA 94720 Tina Eliassi-Rad 2 eliassi@llnl.gov 2 Larence Livermore
More informationImproving Face Recognition by Exploring Local Features with Visual Attention
Improving Face Recognition by Exploring Local Features with Visual Attention Yichun Shi and Anil K. Jain Michigan State University Difficulties of Face Recognition Large variations in unconstrained face
More informationTopic-Level Random Walk through Probabilistic Model
Topic-Level Random Walk through Probabilistic Model Zi Yang, Jie Tang, Jing Zhang, Juanzi Li, and Bo Gao Department of Computer Science & Technology, Tsinghua University, China Abstract. In this paper,
More informationMondrian Forests: Efficient Online Random Forests
Mondrian Forests: Efficient Online Random Forests Balaji Lakshminarayanan (Gatsby Unit, UCL) Daniel M. Roy (Cambridge Toronto) Yee Whye Teh (Oxford) September 4, 2014 1 Outline Background and Motivation
More informationVariational Methods for Discrete-Data Latent Gaussian Models
Variational Methods for Discrete-Data Latent Gaussian Models University of British Columbia Vancouver, Canada March 6, 2012 The Big Picture Joint density models for data with mixed data types Bayesian
More informationSmoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data
Smoothing Dissimilarities for Cluster Analysis: Binary Data and unctional Data David B. University of South Carolina Department of Statistics Joint work with Zhimin Chen University of South Carolina Current
More informationAnalysis of Incomplete Multivariate Data
Analysis of Incomplete Multivariate Data J. L. Schafer Department of Statistics The Pennsylvania State University USA CHAPMAN & HALL/CRC A CR.C Press Company Boca Raton London New York Washington, D.C.
More informationDiffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting
Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting Yaguang Li Joint work with Rose Yu, Cyrus Shahabi, Yan Liu Page 1 Introduction Traffic congesting is wasteful of time,
More information10-701/15-781, Fall 2006, Final
-7/-78, Fall 6, Final Dec, :pm-8:pm There are 9 questions in this exam ( pages including this cover sheet). If you need more room to work out your answer to a question, use the back of the page and clearly
More informationOntology based Model and Procedure Creation for Topic Analysis in Chinese Language
Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language Dong Han and Kilian Stoffel Information Management Institute, University of Neuchâtel Pierre-à-Mazel 7, CH-2000 Neuchâtel,
More informationEnsemble of Specialized Neural Networks for Time Series Forecasting. Slawek Smyl ISF 2017
Ensemble of Specialized Neural Networks for Time Series Forecasting Slawek Smyl slawek@uber.com ISF 2017 Ensemble of Predictors Ensembling a group predictors (preferably diverse) or choosing one of them
More informationSemantic Segmentation. Zhongang Qi
Semantic Segmentation Zhongang Qi qiz@oregonstate.edu Semantic Segmentation "Two men riding on a bike in front of a building on the road. And there is a car." Idea: recognizing, understanding what's in
More informationECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov
ECE521: Week 11, Lecture 20 27 March 2017: HMM learning/inference With thanks to Russ Salakhutdinov Examples of other perspectives Murphy 17.4 End of Russell & Norvig 15.2 (Artificial Intelligence: A Modern
More informationPredictive Discrete Latent Factor Models for Large Scale Dyadic Data Deepak Agarwal, Srujana Merugu Yahoo! Research
Predictive Discrete Latent Factor Models for Large Scale Dyadic Data Deepak Agarwal, Srujana Merugu Yahoo! Research March 5, 2009 Outline 1 Motivation 2 Background 3 Algorithm 4 Analysis Recommender System
More informationA spatio-temporal model for extreme precipitation simulated by a climate model.
A spatio-temporal model for extreme precipitation simulated by a climate model. Jonathan Jalbert Joint work with Anne-Catherine Favre, Claude Bélisle and Jean-François Angers STATMOS Workshop: Climate
More informationExploiting Conversation Structure in Unsupervised Topic Segmentation for s
Exploiting Conversation Structure in Unsupervised Topic Segmentation for Emails Shafiq Joty, Giuseppe Carenini, Gabriel Murray, Raymond Ng University of British Columbia Vancouver, Canada EMNLP 2010 1
More informationTag2Word: Using Tags to Generate Words for Content Based Tag Recommendation
Tag2Word: Using Tags to Generate Words for Content Based Tag Recommendation Yong Wu wy@smail.nju.edu.cn Hanghang Tong Arizona State University, USA hanghang.tong@asu.edu Yuan Yao y.yao@nju.edu.cn Jian
More informationStatistical Matching using Fractional Imputation
Statistical Matching using Fractional Imputation Jae-Kwang Kim 1 Iowa State University 1 Joint work with Emily Berg and Taesung Park 1 Introduction 2 Classical Approaches 3 Proposed method 4 Application:
More informationClustering Relational Data using the Infinite Relational Model
Clustering Relational Data using the Infinite Relational Model Ana Daglis Supervised by: Matthew Ludkin September 4, 2015 Ana Daglis Clustering Data using the Infinite Relational Model September 4, 2015
More informationTGNet: Learning to Rank Nodes in Temporal Graphs. Qi Song 1 Bo Zong 2 Yinghui Wu 1,3 Lu-An Tang 2 Hui Zhang 2 Guofei Jiang 2 Haifeng Chen 2
TGNet: Learning to Rank Nodes in Temporal Graphs Qi Song 1 Bo Zong 2 Yinghui Wu 1,3 Lu-An Tang 2 Hui Zhang 2 Guofei Jiang 2 Haifeng Chen 2 1 2 3 Node Ranking in temporal graphs Temporal graphs have been
More informationModeling pigeon behaviour using a Conditional Restricted Boltzmann Machine
Modeling pigeon behaviour using a Conditional Restricted Boltzmann Machine Matthew D. Zeiler 1,GrahamW.Taylor 1, Nikolaus F. Troje 2 and Geoffrey E. Hinton 1 1- University of Toronto - Dept. of Computer
More informationMondrian Forests: Efficient Online Random Forests
Mondrian Forests: Efficient Online Random Forests Balaji Lakshminarayanan Joint work with Daniel M. Roy and Yee Whye Teh 1 Outline Background and Motivation Mondrian Forests Randomization mechanism Online
More informationRecommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu
Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu 1 Introduction Yelp Dataset Challenge provides a large number of user, business and review data which can be used for
More informationClustering and The Expectation-Maximization Algorithm
Clustering and The Expectation-Maximization Algorithm Unsupervised Learning Marek Petrik 3/7 Some of the figures in this presentation are taken from An Introduction to Statistical Learning, with applications
More informationMCMC Methods for data modeling
MCMC Methods for data modeling Kenneth Scerri Department of Automatic Control and Systems Engineering Introduction 1. Symposium on Data Modelling 2. Outline: a. Definition and uses of MCMC b. MCMC algorithms
More informationarxiv: v1 [cs.cl] 18 Jan 2015
Workshop on Knowledge-Powered Deep Learning for Text Mining (KPDLTM-2014) arxiv:1501.04325v1 [cs.cl] 18 Jan 2015 Lars Maaloe DTU Compute, Technical University of Denmark (DTU) B322, DK-2800 Lyngby Morten
More informationCut-And-Stitch: Efficient Parallel Learning of Linear Dynamical Systems on SMPs
School of Computer Science Cut-And-Stitch: Efficient Parallel Learning of Linear Dynamical Systems on SMPs Lei Li, WenjieFu, Fan Guo, Todd C. Mowry, Christos Faloutsos Computer Science Department Carnegie
More informationRankTopic: Ranking Based Topic Modeling
RankTopic: Ranking Based Topic Modeling Dongsheng Duan, Yuhua Li, Ruixuan Li, Rui Zhang and Aiming Wen School of Computer Science and Technology, Huazhong University of Science and Technology, China Email:
More informationApplications of admixture models
Applications of admixture models CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar, Alkes Price Applications of admixture models 1 / 27
More informationRSDC 09: Tag Recommendation Using Keywords and Association Rules
RSDC 09: Tag Recommendation Using Keywords and Association Rules Jian Wang, Liangjie Hong and Brian D. Davison Department of Computer Science and Engineering Lehigh University, Bethlehem, PA 18015 USA
More informationSemantic Image Retrieval Using Correspondence Topic Model with Background Distribution
Semantic Image Retrieval Using Correspondence Topic Model with Background Distribution Nguyen Anh Tu Department of Computer Engineering Kyung Hee University, Korea Email: tunguyen@oslab.khu.ac.kr Jinsung
More informationTopic Modeling for Large-Scale Multimedia Analysis and Retrieval
C h a p t e r 19 Significant language editing was done. Please check. Topic Modeling for Large-Scale Multimedia Analysis and Retrieval Juan Hu, Yi Fang, Nam Ling, and Li Song Contents 19.1 Introduction
More informationCS281 Section 9: Graph Models and Practical MCMC
CS281 Section 9: Graph Models and Practical MCMC Scott Linderman November 11, 213 Now that we have a few MCMC inference algorithms in our toolbox, let s try them out on some random graph models. Graphs
More informationRisk Minimization and Language Modeling in Text Retrieval Thesis Summary
Risk Minimization and Language Modeling in Text Retrieval Thesis Summary ChengXiang Zhai Language Technologies Institute School of Computer Science Carnegie Mellon University July 21, 2002 Abstract This
More informationCase Study IV: Bayesian clustering of Alzheimer patients
Case Study IV: Bayesian clustering of Alzheimer patients Mike Wiper and Conchi Ausín Department of Statistics Universidad Carlos III de Madrid Advanced Statistics and Data Mining Summer School 2nd - 6th
More information19: Inference and learning in Deep Learning
10-708: Probabilistic Graphical Models 10-708, Spring 2017 19: Inference and learning in Deep Learning Lecturer: Zhiting Hu Scribes: Akash Umakantha, Ryan Williamson 1 Classes of Deep Generative Models
More informationConstructing Popular Routes from Uncertain Trajectories
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei, Yu Zheng, Wen-Chih Peng presented by Slawek Goryczka Scenarios A trajectory is a sequence of data points recording location information
More informationSTREAMING RANKING BASED RECOMMENDER SYSTEMS
STREAMING RANKING BASED RECOMMENDER SYSTEMS Weiqing Wang, Hongzhi Yin, Zi Huang, Qinyong Wang, Xingzhong Du, Quoc Viet Hung Nguyen University of Queensland, Australia & Griffith University, Australia July
More informationThe Pre-Image Problem in Kernel Methods
The Pre-Image Problem in Kernel Methods James Kwok Ivor Tsang Department of Computer Science Hong Kong University of Science and Technology Hong Kong The Pre-Image Problem in Kernel Methods ICML-2003 1
More informationGraphical Models, Bayesian Method, Sampling, and Variational Inference
Graphical Models, Bayesian Method, Sampling, and Variational Inference With Application in Function MRI Analysis and Other Imaging Problems Wei Liu Scientific Computing and Imaging Institute University
More informationMeasuring Similarity to Detect
Measuring Similarity to Detect Qualified Links Xiaoguang Qi, Lan Nie, and Brian D. Davison Dept. of Computer Science & Engineering Lehigh University Introduction Approach Experiments Discussion & Conclusion
More information2 E6885 Network Science Lecture 10: Analysis of Network Flow
E 6885 Topics in Signal Processing -- Network Science E6885 Network Science Lecture 1: Analysis of Network Flow Ching-Yung Lin, Dept. of Electrical Engineering, Columbia University November 21st, 211 Course
More informationImplementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky
Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky The Chinese University of Hong Kong Abstract Husky is a distributed computing system, achieving outstanding
More informationLecture 21 : A Hybrid: Deep Learning and Graphical Models
10-708: Probabilistic Graphical Models, Spring 2018 Lecture 21 : A Hybrid: Deep Learning and Graphical Models Lecturer: Kayhan Batmanghelich Scribes: Paul Liang, Anirudha Rayasam 1 Introduction and Motivation
More informationComparing different interpolation methods on two-dimensional test functions
Comparing different interpolation methods on two-dimensional test functions Thomas Mühlenstädt, Sonja Kuhnt May 28, 2009 Keywords: Interpolation, computer experiment, Kriging, Kernel interpolation, Thin
More informationarxiv: v1 [stat.me] 29 May 2015
MIMCA: Multiple imputation for categorical variables with multiple correspondence analysis Vincent Audigier 1, François Husson 2 and Julie Josse 2 arxiv:1505.08116v1 [stat.me] 29 May 2015 Applied Mathematics
More informationHierarchical Modelling for Large Spatial Datasets
Hierarchical Modelling for Large Spatial Datasets Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,
More informationMultilabel Learning for Automatic Web Services Tagging
Multilabel Learning for Automatic Web Services Tagging Mustapha AZNAG Aix-Marseille University, LSIS UMR 7296, France. mustapha.aznag@univ-amu.fr Mohamed QUAFAFOU Aix-Marseille University, LSIS UMR 7296,
More informationLogistic Regression. Abstract
Logistic Regression Tsung-Yi Lin, Chen-Yu Lee Department of Electrical and Computer Engineering University of California, San Diego {tsl008, chl60}@ucsd.edu January 4, 013 Abstract Logistic regression
More informationCS229 Final Project: Predicting Expected Response Times
CS229 Final Project: Predicting Expected Email Response Times Laura Cruz-Albrecht (lcruzalb), Kevin Khieu (kkhieu) December 15, 2017 1 Introduction Each day, countless emails are sent out, yet the time
More informationDesiging and combining kernels: some lessons learned from bioinformatics
Desiging and combining kernels: some lessons learned from bioinformatics Jean-Philippe Vert Jean-Philippe.Vert@mines-paristech.fr Mines ParisTech & Institut Curie NIPS MKL workshop, Dec 12, 2009. Jean-Philippe
More informationPromoting Ranking Diversity for Biomedical Information Retrieval based on LDA
Promoting Ranking Diversity for Biomedical Information Retrieval based on LDA Yan Chen, Xiaoshi Yin, Zhoujun Li, Xiaohua Hu and Jimmy Huang State Key Laboratory of Software Development Environment, Beihang
More informationAUTOMATIC LABELING OF RSS ARTICLES USING ONLINE LATENT DIRICHLET ALLOCATION
Michigan Technological University Digital Commons @ Michigan Tech Dissertations, Master's Theses and Master's Reports - Open Dissertations, Master's Theses and Master's Reports 2014 AUTOMATIC LABELING
More informationUnusual Event Detection using Sparse Spatio- Temporal Features and Bag of Words Model
Proceedings of the 03 IEEE Second International Conference on Image Information Processing (ICIIP-03) Unusual Event Detection using Sparse Spatio- Temporal Features and Bag of Words Model Balarishna Mandadi
More informationCOMS 4771 Clustering. Nakul Verma
COMS 4771 Clustering Nakul Verma Supervised Learning Data: Supervised learning Assumption: there is a (relatively simple) function such that for most i Learning task: given n examples from the data, find
More informationModeling Criminal Careers as Departures From a Unimodal Population Age-Crime Curve: The Case of Marijuana Use
Modeling Criminal Careers as Departures From a Unimodal Population Curve: The Case of Marijuana Use Donatello Telesca, Elena A. Erosheva, Derek A. Kreader, & Ross Matsueda April 15, 2014 extends Telesca
More informationDeep Generative Models Variational Autoencoders
Deep Generative Models Variational Autoencoders Sudeshna Sarkar 5 April 2017 Generative Nets Generative models that represent probability distributions over multiple variables in some way. Directed Generative
More informationUsing Semantic Similarity in Crawling-based Web Application Testing. (National Taiwan Univ.)
Using Semantic Similarity in Crawling-based Web Application Testing Jun-Wei Lin Farn Wang Paul Chu (UC-Irvine) (National Taiwan Univ.) (QNAP, Inc) Crawling-based Web App Testing the web app under test
More informationText Document Clustering Using DPM with Concept and Feature Analysis
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 10, October 2013,
More informationClustering K-means. Machine Learning CSEP546 Carlos Guestrin University of Washington February 18, Carlos Guestrin
Clustering K-means Machine Learning CSEP546 Carlos Guestrin University of Washington February 18, 2014 Carlos Guestrin 2005-2014 1 Clustering images Set of Images [Goldberger et al.] Carlos Guestrin 2005-2014
More informationLTI Thesis Defense: Riemannian Geometry and Statistical Machine Learning
Outline LTI Thesis Defense: Riemannian Geometry and Statistical Machine Learning Guy Lebanon Motivation Introduction Previous Work Committee:John Lafferty Geoff Gordon, Michael I. Jordan, Larry Wasserman
More informationVisoLink: A User-Centric Social Relationship Mining
VisoLink: A User-Centric Social Relationship Mining Lisa Fan and Botang Li Department of Computer Science, University of Regina Regina, Saskatchewan S4S 0A2 Canada {fan, li269}@cs.uregina.ca Abstract.
More informationLudwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer
Ludwig Fahrmeir Gerhard Tute Statistical odelling Based on Generalized Linear Model íecond Edition. Springer Preface to the Second Edition Preface to the First Edition List of Examples List of Figures
More informationA Shape Aware Model for semi-supervised Learning of Objects and its Context
A Shape Aware Model for semi-supervised Learning of Objects and its Context Abhinav Gupta 1, Jianbo Shi 2 and Larry S. Davis 1 1 Dept. of Computer Science, Univ. of Maryland, College Park 2 Dept. of Computer
More informationDocument Clustering and Visualization with Latent Dirichlet Allocation and Self-Organizing Maps
Proceedings of the Twenty-Second International FLAIRS Conference (2009) Document Clustering and Visualization with Latent Dirichlet Allocation and Self-Organizing Maps Jeremy R. Millar and Gilbert L. Peterson
More informationWebSci and Learning to Rank for IR
WebSci and Learning to Rank for IR Ernesto Diaz-Aviles L3S Research Center. Hannover, Germany diaz@l3s.de Ernesto Diaz-Aviles www.l3s.de 1/16 Motivation: Information Explosion Ernesto Diaz-Aviles
More informationClustering Lecture 5: Mixture Model
Clustering Lecture 5: Mixture Model Jing Gao SUNY Buffalo 1 Outline Basics Motivation, definition, evaluation Methods Partitional Hierarchical Density-based Mixture model Spectral methods Advanced topics
More information#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks
1 #mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks Bang Hui Lim, Dongyuan Lu Tao Chen, Min-yen Kan Web Information Retrieval / Natural Language Processing Group 1. http://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users/
More informationBilevel Sparse Coding
Adobe Research 345 Park Ave, San Jose, CA Mar 15, 2013 Outline 1 2 The learning model The learning algorithm 3 4 Sparse Modeling Many types of sensory data, e.g., images and audio, are in high-dimensional
More informationClassification. 1 o Semestre 2007/2008
Classification Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Slides baseados nos slides oficiais do livro Mining the Web c Soumen Chakrabarti. Outline 1 2 3 Single-Class
More informationStatistical Models for Exploring Individual Communication Behavior
JMLR: Workshop and Conference Proceedings 25:1 16, 2012 Asian Conference on Machine Learning Statistical Models for Exploring Individual Email Communication Behavior Nicholas Navaroli Department of Computer
More informationA Text Clustering Algorithm Using an Online Clustering Scheme for Initialization
A Text Clustering Algorithm Using an Online Clustering Scheme for Initialization Jianhua Yin, Jianyong Wang Tsinghua National Laboratory for Information Science and Technology (TNList) Department of Computer
More informationGradient of the lower bound
Weakly Supervised with Latent PhD advisor: Dr. Ambedkar Dukkipati Department of Computer Science and Automation gaurav.pandey@csa.iisc.ernet.in Objective Given a training set that comprises image and image-level
More informationLecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)
School of Computer Science Probabilistic Graphical Models Structured Sparse Additive Models Junming Yin and Eric Xing Lecture 7, April 4, 013 Reading: See class website 1 Outline Nonparametric regression
More informationSpatial Outlier Detection
Spatial Outlier Detection Chang-Tien Lu Department of Computer Science Northern Virginia Center Virginia Tech Joint work with Dechang Chen, Yufeng Kou, Jiang Zhao 1 Spatial Outlier A spatial data point
More informationDynamic Models with R
Dynamic Models with R Book Summary State space models serve as a good entry point in to modern Bayesian inference for time series. The authors in the preface mention that State space models provide a very
More informationPerson Re-identification for Improved Multi-person Multi-camera Tracking by Continuous Entity Association
Person Re-identification for Improved Multi-person Multi-camera Tracking by Continuous Entity Association Neeti Narayan, Nishant Sankaran, Devansh Arpit, Karthik Dantu, Srirangaraj Setlur, Venu Govindaraju
More information