Analyzing Time-Series Data. Presentation by Colin Shea-Blymyer

Size: px
Start display at page:

Download "Analyzing Time-Series Data. Presentation by Colin Shea-Blymyer"

Transcription

1 Analyzing Time-Series Data Presentation by Colin Shea-Blymyer

2 Outline 1. Time Series Chains a. Motivation b. Problem c. Concepts d. Approach e. Results f. Conclusion 2. Analyzing Epidemics - FUNNEL a. Motivation b. Problem c. Models d. Algorithms e. Results f. Conclusion

3 Matrix Profile VII: Time Series Chains: A New Primitive for Time Series Data Mining Yan Zhu, Makoto Imamura, Daniel Nikovski, Eamonn Keogh Some content adapted from a presentation given at ICDM 2018

4 Motivation A major retailer wants to know how consumers shopping habits have changed. Given the number of times the retailer s name has been queried on Google, develop a model for the trend of the data. Data is obviously periodic, but this periodic behavior evolves. Model can be used to understand, and to forecast. Google query volume for Kohl s

5 Problem Given one example time series, No domain knowledge, Find a representation of the system s evolution Using a scalable algorithm.

6 Concepts - Matrix Profile (MP) Data structure that stores the distance from each subsequence of length m in a time series to its nearest neighbor.

7 Concepts - Motifs If a matrix profile s window size is 2... These clusters are motifs. Motif Region in time series with low matrix profile values.

8 Concepts - Chains Related to motifs - Chains integrate directionality Like a motif, but evolves over time Directionality is related to time

9 Approach - Overview 1. Compute left and right nearest neighbor a. Based off STOMP algorithm b. O(n 2 ) time, O(n) space 2. Find cains a. Find Anchored chains - O(n) time b. Find Unanchored chain - O(n) time

10 Approach - Example Matrix Profile window of m=1 Chain requires consecutive links to be connected by a loop. The all chain set is the set of chains that are not included in larger chains. All numbers are included exactly once.

11 Results 4 empirical evaluations - Blood flow, penguin dives, walking, and Blood Flow Penguin Dives Walking Web Queries 1 Robustness test Embed chains in synthetic data Increase noise and see if patterns are recovered

12 Results - Kohl s Understanding - links on chain show growing importance of Cyber Monday Forecasting - link analysis on these chains (RMSE = 0.17)

13 Conclusion Summary Independent of domain knowledge Requires only one example of the data Very fast One parameter - MP window size Robust to noise Future Work You reasoned it out beautifully, it is so long a chain, and yet every link rings true. -Sir Arthur Conan Doyle: Adventures of Sherlock Holmes More applications Multidimensional time series Different distance measure Something other than time series?

14 FUNNEL: Automatic Mining of Spatially Coevolving Epidemics Yasuko Matsubara, Yasushi Sakurai, Willem G. van Panhuis, Christos Faloutsos Some content adapted from a presentation given at SIGKDD 2014

15 Motivation Given a large data set of many diseases, in many locations, over a long period of time Discover 5 key properties of epidemics using an algorithm that is Fully automatic, Efficient, And generalizable across many types of epidemic.

16 Problem Project Tycho 50 locations (states) 56 diseases Weekly number of infected from 1888 to 2013 Forms a 3 rd order tensor Develop a model that summarizes each disease by: P1 - Periodicity P2 - Disease reduction effects P3 - Locality effects P4 - External shock effects P5 - Mistakes and incorrect values

17 Models - Single Epidemic FUNNEL-BASE b={n,β 0,δ,γ,Pa,Ps} FUNNEL-R FUNNEL-RE (t) - Strength of infection N: Population δ: Healing rate γ: Forgetting rate P a : Amplitude of fluctuation (peak season value) P s : Phase shift of seasonal cycle

18 Models - Multi-Evolving Epidemics

19 Algorithms - FUNNELFIT 1) Find external shocks and mistakes through Model Description Cost 2) Fit model parameters through Multi-Layer Optimization Global Fit Local Fit Levenberg-Marquardt (LM), aka the damped least-squares method - a least-squares curve fitting algorithm that uses both the Gauss-Newton method, and gradient descent - is used to minimize the cost function.

20 Results

21 Conclusion Finds the 5 properties of epidemics Execution time scales linearly to data size Can be applied to computer viruses as well Model is often 5x more accurate than competing algorithms

Extension of time-series: tensor analysis

Extension of time-series: tensor analysis Part 3 Extension of time-series: tensor analysis Yasushi Sakurai (Kumamoto University) Yasuko Matsubara (Kumamoto University) Christos Faloutsos (Carnegie Mellon University) ~yasuko/talks/15- SIGMOD- tut/

More information

Matrix Profile VII: Time Series Chains: A New Primitive for Time Series Data Mining

Matrix Profile VII: Time Series Chains: A New Primitive for Time Series Data Mining Matrix Profile VII: Time Series Chains: A New Primitive for Time Series Data Mining Yan Zhu 1, Makoto Imamura 2, Daniel Nikovski 3, Eamonn Keogh 1 1 University of California, Riverside, yzhu015@ucr.edu,

More information

Data mining --- mining graphs

Data mining --- mining graphs Data mining --- mining graphs University of South Florida Xiaoning Qian Today s Lecture 1. Complex networks 2. Graph representation for networks 3. Markov chain 4. Viral propagation 5. Google s PageRank

More information

Experimental Data and Training

Experimental Data and Training Modeling and Control of Dynamic Systems Experimental Data and Training Mihkel Pajusalu Alo Peets Tartu, 2008 1 Overview Experimental data Designing input signal Preparing data for modeling Training Criterion

More information

DYNAMMO: MINING AND SUMMARIZATION OF COEVOLVING SEQUENCES WITH MISSING VALUES

DYNAMMO: MINING AND SUMMARIZATION OF COEVOLVING SEQUENCES WITH MISSING VALUES DYNAMMO: MINING AND SUMMARIZATION OF COEVOLVING SEQUENCES WITH MISSING VALUES Christos Faloutsos joint work with Lei Li, James McCann, Nancy Pollard June 29, 2009 CHALLENGE Multidimensional coevolving

More information

Multiresolution Motif Discovery in Time Series

Multiresolution Motif Discovery in Time Series Tenth SIAM International Conference on Data Mining Columbus, Ohio, USA Multiresolution Motif Discovery in Time Series NUNO CASTRO PAULO AZEVEDO Department of Informatics University of Minho Portugal April

More information

User Activity Recognition Based on Kalman Filtering Approach

User Activity Recognition Based on Kalman Filtering Approach User Activity Recognition Based on Kalman Filtering Approach EEC 592, Prosthesis Design and Control Final Project Report Gholamreza Khademi khademi.gh@gmail.com December 2, 214 Abstract Different control

More information

CS 395T Lecture 12: Feature Matching and Bundle Adjustment. Qixing Huang October 10 st 2018

CS 395T Lecture 12: Feature Matching and Bundle Adjustment. Qixing Huang October 10 st 2018 CS 395T Lecture 12: Feature Matching and Bundle Adjustment Qixing Huang October 10 st 2018 Lecture Overview Dense Feature Correspondences Bundle Adjustment in Structure-from-Motion Image Matching Algorithm

More information

Non-linear Time-series Analysis of Social Influence

Non-linear Time-series Analysis of Social Influence WebDB Forum 215 Non-linear Time-series Analysis of Social Influence THINH MINH DO 1,a) YASUKO MATSUBARA 1 YASUSHI SAKURAI 1 Abstract: Given a large collection of time-evolving online user activities, such

More information

CS490D: Introduction to Data Mining Prof. Chris Clifton

CS490D: Introduction to Data Mining Prof. Chris Clifton CS490D: Introduction to Data Mining Prof. Chris Clifton April 5, 2004 Mining of Time Series Data Time-series database Mining Time-Series and Sequence Data Consists of sequences of values or events changing

More information

Centroid Decomposition Based Recovery for Segmented Time Series

Centroid Decomposition Based Recovery for Segmented Time Series Department of Informatics, University of Zürich Facharbeit Centroid Decomposition Based Recovery for Segmented Time Series Jonathan Nagel Matrikelnummer: 08-737-421 Bülach, Zürich, CH Email: jonathan.nagel@uzh.ch

More information

Clustering part II 1

Clustering part II 1 Clustering part II 1 Clustering What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods Hierarchical Methods 2 Partitioning Algorithms:

More information

Data mining overview. Data Mining. Data mining overview. Data mining overview. Data mining overview. Data mining overview 3/24/2014

Data mining overview. Data Mining. Data mining overview. Data mining overview. Data mining overview. Data mining overview 3/24/2014 Data Mining Data mining processes What technological infrastructure is required? Data mining is a system of searching through large amounts of data for patterns. It is a relatively new concept which is

More information

COMPUTER AND ROBOT VISION

COMPUTER AND ROBOT VISION VOLUME COMPUTER AND ROBOT VISION Robert M. Haralick University of Washington Linda G. Shapiro University of Washington A^ ADDISON-WESLEY PUBLISHING COMPANY Reading, Massachusetts Menlo Park, California

More information

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Exploratory data analysis tasks Examine the data, in search of structures

More information

Mining Web Data. Lijun Zhang

Mining Web Data. Lijun Zhang Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems

More information

A generative model for feedback networks

A generative model for feedback networks A generative model for feedback networks D.R. White 1 N. Kejžar 2 C. Tsallis 3 D. Farmer 4 S. White 1 1 University of California Irvine, USA 2 University of Ljubljana, Slovenia 3 Centro Brasileiro de Pesquisas

More information

Cluster Analysis (b) Lijun Zhang

Cluster Analysis (b) Lijun Zhang Cluster Analysis (b) Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Grid-Based and Density-Based Algorithms Graph-Based Algorithms Non-negative Matrix Factorization Cluster Validation Summary

More information

PARALLEL AND DISTRIBUTED PLATFORM FOR PLUG-AND-PLAY AGENT-BASED SIMULATIONS. Wentong CAI

PARALLEL AND DISTRIBUTED PLATFORM FOR PLUG-AND-PLAY AGENT-BASED SIMULATIONS. Wentong CAI PARALLEL AND DISTRIBUTED PLATFORM FOR PLUG-AND-PLAY AGENT-BASED SIMULATIONS Wentong CAI Parallel & Distributed Computing Centre School of Computer Engineering Nanyang Technological University Singapore

More information

ICRA 2016 Tutorial on SLAM. Graph-Based SLAM and Sparsity. Cyrill Stachniss

ICRA 2016 Tutorial on SLAM. Graph-Based SLAM and Sparsity. Cyrill Stachniss ICRA 2016 Tutorial on SLAM Graph-Based SLAM and Sparsity Cyrill Stachniss 1 Graph-Based SLAM?? 2 Graph-Based SLAM?? SLAM = simultaneous localization and mapping 3 Graph-Based SLAM?? SLAM = simultaneous

More information

Multiview Stereo COSC450. Lecture 8

Multiview Stereo COSC450. Lecture 8 Multiview Stereo COSC450 Lecture 8 Stereo Vision So Far Stereo and epipolar geometry Fundamental matrix captures geometry 8-point algorithm Essential matrix with calibrated cameras 5-point algorithm Intersect

More information

HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery

HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery Ninh D. Pham, Quang Loc Le, Tran Khanh Dang Faculty of Computer Science and Engineering, HCM University of Technology,

More information

Ripplet: a New Transform for Feature Extraction and Image Representation

Ripplet: a New Transform for Feature Extraction and Image Representation Ripplet: a New Transform for Feature Extraction and Image Representation Dr. Dapeng Oliver Wu Joint work with Jun Xu Department of Electrical and Computer Engineering University of Florida Outline Motivation

More information

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu FMA901F: Machine Learning Lecture 3: Linear Models for Regression Cristian Sminchisescu Machine Learning: Frequentist vs. Bayesian In the frequentist setting, we seek a fixed parameter (vector), with value(s)

More information

Object Recognition with Invariant Features

Object Recognition with Invariant Features Object Recognition with Invariant Features Definition: Identify objects or scenes and determine their pose and model parameters Applications Industrial automation and inspection Mobile robots, toys, user

More information

Clustering. Bruno Martins. 1 st Semester 2012/2013

Clustering. Bruno Martins. 1 st Semester 2012/2013 Departamento de Engenharia Informática Instituto Superior Técnico 1 st Semester 2012/2013 Slides baseados nos slides oficiais do livro Mining the Web c Soumen Chakrabarti. Outline 1 Motivation Basic Concepts

More information

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 1. Introduction Reddit is one of the most popular online social news websites with millions

More information

Robot Mapping. Graph-Based SLAM with Landmarks. Cyrill Stachniss

Robot Mapping. Graph-Based SLAM with Landmarks. Cyrill Stachniss Robot Mapping Graph-Based SLAM with Landmarks Cyrill Stachniss 1 Graph-Based SLAM (Chap. 15) Use a graph to represent the problem Every node in the graph corresponds to a pose of the robot during mapping

More information

Introduction to Optimization

Introduction to Optimization Introduction to Optimization Second Order Optimization Methods Marc Toussaint U Stuttgart Planned Outline Gradient-based optimization (1st order methods) plain grad., steepest descent, conjugate grad.,

More information

Mining Web Data. Lijun Zhang

Mining Web Data. Lijun Zhang Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems

More information

Distributed model calibration using Levenberg-Marquardt algorithm

Distributed model calibration using Levenberg-Marquardt algorithm Distributed model calibration using Levenberg-Marquardt algorithm Mark Lu a, Liang Zhu a, Li Ling b, Gary Zhang b, Walter Chan c, Xin Zhou *c a Grace Semiconductor Manufacturing Corp, 818 GuoShouJing Rd,

More information

Recommender Systems New Approaches with Netflix Dataset

Recommender Systems New Approaches with Netflix Dataset Recommender Systems New Approaches with Netflix Dataset Robert Bell Yehuda Koren AT&T Labs ICDM 2007 Presented by Matt Rodriguez Outline Overview of Recommender System Approaches which are Content based

More information

Chapter 1, Introduction

Chapter 1, Introduction CSI 4352, Introduction to Data Mining Chapter 1, Introduction Young-Rae Cho Associate Professor Department of Computer Science Baylor University What is Data Mining? Definition Knowledge Discovery from

More information

Data Mining 4. Cluster Analysis

Data Mining 4. Cluster Analysis Data Mining 4. Cluster Analysis 4.5 Spring 2010 Instructor: Dr. Masoud Yaghini Introduction DBSCAN Algorithm OPTICS Algorithm DENCLUE Algorithm References Outline Introduction Introduction Density-based

More information

Package rucrdtw. October 13, 2017

Package rucrdtw. October 13, 2017 Package rucrdtw Type Package Title R Bindings for the UCR Suite Version 0.1.3 Date 2017-10-12 October 13, 2017 BugReports https://github.com/pboesu/rucrdtw/issues URL https://github.com/pboesu/rucrdtw

More information

Learning Network Graph of SIR Epidemic Cascades Using Minimal Hitting Set based Approach

Learning Network Graph of SIR Epidemic Cascades Using Minimal Hitting Set based Approach Learning Network Graph of SIR Epidemic Cascades Using Minimal Hitting Set based Approach Zhuozhao Li and Haiying Shen Dept. of Electrical and Computer Engineering Clemson University, SC, USA Kang Chen

More information

Tracking Groups in Mobile Network Traces

Tracking Groups in Mobile Network Traces Tracking Groups in Mobile Network Traces Kun Tu*, Bruno Ribeiro**, Ananthram Swami***, Don Towsley* *University of Massachusetts, Amherst **Purdue University ***Army Research Lab Presented by Gayane Vardoyan

More information

Hartley - Zisserman reading club. Part I: Hartley and Zisserman Appendix 6: Part II: Zhengyou Zhang: Presented by Daniel Fontijne

Hartley - Zisserman reading club. Part I: Hartley and Zisserman Appendix 6: Part II: Zhengyou Zhang: Presented by Daniel Fontijne Hartley - Zisserman reading club Part I: Hartley and Zisserman Appendix 6: Iterative estimation methods Part II: Zhengyou Zhang: A Flexible New Technique for Camera Calibration Presented by Daniel Fontijne

More information

Revisit Behavior in Social Media: The Phoenix-R Model and Discoveries

Revisit Behavior in Social Media: The Phoenix-R Model and Discoveries Revisit Behavior in Social Media: The Phoenix-R Model and Discoveries Flavio Figueiredo 1, Jussara M. Almeida 1, Yasuko Matsubara 2,3, Bruno Ribeiro 3, and Christos Faloutsos 3 1 Department of Computer

More information

Reviewer Profiling Using Sparse Matrix Regression

Reviewer Profiling Using Sparse Matrix Regression Reviewer Profiling Using Sparse Matrix Regression Evangelos E. Papalexakis, Nicholas D. Sidiropoulos, Minos N. Garofalakis Technical University of Crete, ECE department 14 December 2010, OEDM 2010, Sydney,

More information

Supervised Learning with Neural Networks. We now look at how an agent might learn to solve a general problem by seeing examples.

Supervised Learning with Neural Networks. We now look at how an agent might learn to solve a general problem by seeing examples. Supervised Learning with Neural Networks We now look at how an agent might learn to solve a general problem by seeing examples. Aims: to present an outline of supervised learning as part of AI; to introduce

More information

arxiv: v1 [cs.si] 12 Jan 2019

arxiv: v1 [cs.si] 12 Jan 2019 Predicting Diffusion Reach Probabilities via Representation Learning on Social Networks Furkan Gursoy furkan.gursoy@boun.edu.tr Ahmet Onur Durahim onur.durahim@boun.edu.tr arxiv:1901.03829v1 [cs.si] 12

More information

Spatial Outlier Detection

Spatial Outlier Detection Spatial Outlier Detection Chang-Tien Lu Department of Computer Science Northern Virginia Center Virginia Tech Joint work with Dechang Chen, Yufeng Kou, Jiang Zhao 1 Spatial Outlier A spatial data point

More information

The Return Of Sherlock Holmes: Classic Detective Stories (Top 100 Detective Novels) By Arthur Conan Doyle

The Return Of Sherlock Holmes: Classic Detective Stories (Top 100 Detective Novels) By Arthur Conan Doyle The Return Of Sherlock Holmes: Classic Detective Stories (Top 100 Detective Novels) By Arthur Conan Doyle If you are searching for a book by Arthur Conan Doyle The Return of Sherlock Holmes: Classic Detective

More information

Sherlock Holmes: Complete Long Stories By Sir Arthur Conan Doyle

Sherlock Holmes: Complete Long Stories By Sir Arthur Conan Doyle Sherlock Holmes: Complete Long Stories By Sir Arthur Conan Doyle If you are searched for the ebook Sherlock Holmes: Complete Long Stories by Sir Arthur Conan Doyle in pdf format, then you've come to loyal

More information

CS Introduction to Data Mining Instructor: Abdullah Mueen

CS Introduction to Data Mining Instructor: Abdullah Mueen CS 591.03 Introduction to Data Mining Instructor: Abdullah Mueen LECTURE 8: ADVANCED CLUSTERING (FUZZY AND CO -CLUSTERING) Review: Basic Cluster Analysis Methods (Chap. 10) Cluster Analysis: Basic Concepts

More information

Geometric Registration for Deformable Shapes 2.2 Deformable Registration

Geometric Registration for Deformable Shapes 2.2 Deformable Registration Geometric Registration or Deormable Shapes 2.2 Deormable Registration Variational Model Deormable ICP Variational Model What is deormable shape matching? Example? What are the Correspondences? Eurographics

More information

On the use of Fuzzy Logic Controllers to Comply with Virtualized Application Demands in the Cloud

On the use of Fuzzy Logic Controllers to Comply with Virtualized Application Demands in the Cloud On the use of Fuzzy Logic Controllers to Comply with Virtualized Application Demands in the Cloud Kyriakos M. Deliparaschos Cyprus University of Technology k.deliparaschos@cut.ac.cy Themistoklis Charalambous

More information

The Memoirs Of Sherlock Holmes By Arthur Conan Doyle READ ONLINE

The Memoirs Of Sherlock Holmes By Arthur Conan Doyle READ ONLINE The Memoirs Of Sherlock Holmes By Arthur Conan Doyle READ ONLINE If you are looking for the book The Memoirs of Sherlock Holmes by Arthur Conan Doyle in pdf form, then you have come on to correct website.

More information

10701 Machine Learning. Clustering

10701 Machine Learning. Clustering 171 Machine Learning Clustering What is Clustering? Organizing data into clusters such that there is high intra-cluster similarity low inter-cluster similarity Informally, finding natural groupings among

More information

Automatic Wrapper Adaptation by Tree Edit Distance Matching

Automatic Wrapper Adaptation by Tree Edit Distance Matching Automatic Wrapper Adaptation by Tree Edit Distance Matching E. Ferrara 1 R. Baumgartner 2 1 Department of Mathematics University of Messina, Italy 2 Lixto Software GmbH Vienna, Austria 2nd International

More information

OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT

OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT Joost Broekens Tim Cocx Walter A. Kosters Leiden Institute of Advanced Computer Science Leiden University, The Netherlands Email: {broekens,

More information

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.11, November 2013 1 Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial

More information

The Boundary Graph Supervised Learning Algorithm for Regression and Classification

The Boundary Graph Supervised Learning Algorithm for Regression and Classification The Boundary Graph Supervised Learning Algorithm for Regression and Classification! Jonathan Yedidia! Disney Research!! Outline Motivation Illustration using a toy classification problem Some simple refinements

More information

The Complete Sherlock Holmes, Volume II (Barnes & Noble Classics Series) By Kyle Freeman, Sir Arthur Conan Doyle

The Complete Sherlock Holmes, Volume II (Barnes & Noble Classics Series) By Kyle Freeman, Sir Arthur Conan Doyle The Complete Sherlock Holmes, Volume II (Barnes & Noble Classics Series) By Kyle Freeman, Sir Arthur Conan Doyle The Complete Sherlock Holmes Volume I Barnes Noble Classics Series The complete sherlock

More information

Variational Methods II

Variational Methods II Mathematical Foundations of Computer Graphics and Vision Variational Methods II Luca Ballan Institute of Visual Computing Last Lecture If we have a topological vector space with an inner product and functionals

More information

Application of Clustering Techniques to Energy Data to Enhance Analysts Productivity

Application of Clustering Techniques to Energy Data to Enhance Analysts Productivity Application of Clustering Techniques to Energy Data to Enhance Analysts Productivity Wendy Foslien, Honeywell Labs Valerie Guralnik, Honeywell Labs Steve Harp, Honeywell Labs William Koran, Honeywell Atrium

More information

Automatic Learning of Predictive CEP Rules Bridging the Gap between Data Mining and Complex Event Processing

Automatic Learning of Predictive CEP Rules Bridging the Gap between Data Mining and Complex Event Processing Automatic Learning of Predictive CEP Rules Bridging the Gap between Data Mining and Complex Event Processing Raef Mousheimish, Yehia Taher and Karine Zeitouni DAIVD Laboratory, University of Versailles,

More information

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1 Preface to the Second Edition Preface to the First Edition vii xi 1 Introduction 1 2 Overview of Supervised Learning 9 2.1 Introduction... 9 2.2 Variable Types and Terminology... 9 2.3 Two Simple Approaches

More information

The Adventures And Memoirs Of Sherlock Holmes By Sir Arthur Conan Doyle READ ONLINE

The Adventures And Memoirs Of Sherlock Holmes By Sir Arthur Conan Doyle READ ONLINE The Adventures And Memoirs Of Sherlock Holmes By Sir Arthur Conan Doyle READ ONLINE If you are searching for the book The Adventures and Memoirs of Sherlock Holmes by Sir Arthur Conan Doyle in pdf format,

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London Spring 2010 1. Motivation Large Networks model many real-world phenomena technological: www, internet,

More information

Finding and Resolving Performance Problems in a Multi-Application Deployment. Sasha Kraljevic Sr. Systems Engineer October 8 th 2013

Finding and Resolving Performance Problems in a Multi-Application Deployment. Sasha Kraljevic Sr. Systems Engineer October 8 th 2013 Finding and Resolving Performance Problems in a Multi-Application Deployment Sasha Kraljevic Sr. Systems Engineer October 8 th 2013 It is a capital mistake to theorize before one has data. Insensibly

More information

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

COMP 551 Applied Machine Learning Lecture 16: Deep Learning COMP 551 Applied Machine Learning Lecture 16: Deep Learning Instructor: Ryan Lowe (ryan.lowe@cs.mcgill.ca) Slides mostly by: Class web page: www.cs.mcgill.ca/~hvanho2/comp551 Unless otherwise noted, all

More information

Effective Latent Space Graph-based Re-ranking Model with Global Consistency

Effective Latent Space Graph-based Re-ranking Model with Global Consistency Effective Latent Space Graph-based Re-ranking Model with Global Consistency Feb. 12, 2009 1 Outline Introduction Related work Methodology Graph-based re-ranking model Learning a latent space graph A case

More information

Data Mining Concepts & Tasks

Data Mining Concepts & Tasks Data Mining Concepts & Tasks Duen Horng (Polo) Chau Georgia Tech CSE6242 / CX4242 Sept 9, 2014 Partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko, Christos Faloutsos Last Time

More information

CPSC 340: Machine Learning and Data Mining. Logistic Regression Fall 2016

CPSC 340: Machine Learning and Data Mining. Logistic Regression Fall 2016 CPSC 340: Machine Learning and Data Mining Logistic Regression Fall 2016 Admin Assignment 1: Marks visible on UBC Connect. Assignment 2: Solution posted after class. Assignment 3: Due Wednesday (at any

More information

Event Detection using Archived Smart House Sensor Data obtained using Symbolic Aggregate Approximation

Event Detection using Archived Smart House Sensor Data obtained using Symbolic Aggregate Approximation Event Detection using Archived Smart House Sensor Data obtained using Symbolic Aggregate Approximation Ayaka ONISHI 1, and Chiemi WATANABE 2 1,2 Graduate School of Humanities and Sciences, Ochanomizu University,

More information

Sensor Tasking and Control

Sensor Tasking and Control Sensor Tasking and Control Outline Task-Driven Sensing Roles of Sensor Nodes and Utilities Information-Based Sensor Tasking Joint Routing and Information Aggregation Summary Introduction To efficiently

More information

Outline 7/2/201011/6/

Outline 7/2/201011/6/ Outline Pattern recognition in computer vision Background on the development of SIFT SIFT algorithm and some of its variations Computational considerations (SURF) Potential improvement Summary 01 2 Pattern

More information

An RSSI Gradient-based AP Localization Algorithm

An RSSI Gradient-based AP Localization Algorithm An RSSI Gradient-based AP Localization Algorithm Presenter: Cheng-Xuan, Wu Year: 2014, Volume: 11, Issue: 2 Pages: 100-108, DOI: 10.1109/CC.2014.6821742 Cited by: Papers (2) IEEE Journals & Magazines Outline

More information

Neural Network Optimization and Tuning / Spring 2018 / Recitation 3

Neural Network Optimization and Tuning / Spring 2018 / Recitation 3 Neural Network Optimization and Tuning 11-785 / Spring 2018 / Recitation 3 1 Logistics You will work through a Jupyter notebook that contains sample and starter code with explanations and comments throughout.

More information

Fast Nearest Neighbor Search on Large Time-Evolving Graphs

Fast Nearest Neighbor Search on Large Time-Evolving Graphs Fast Nearest Neighbor Search on Large Time-Evolving Graphs Leman Akoglu Srinivasan Parthasarathy Rohit Khandekar Vibhore Kumar Deepak Rajan Kun-Lung Wu Graphs are everywhere Leman Akoglu Fast Nearest Neighbor

More information

Data Mining. Jeff M. Phillips. January 7, 2019 CS 5140 / CS 6140

Data Mining. Jeff M. Phillips. January 7, 2019 CS 5140 / CS 6140 Data Mining CS 5140 / CS 6140 Jeff M. Phillips January 7, 2019 What is Data Mining? What is Data Mining? Finding structure in data? Machine learning on large data? Unsupervised learning? Large scale computational

More information

A Weighted Majority Voting based on Normalized Mutual Information for Cluster Analysis

A Weighted Majority Voting based on Normalized Mutual Information for Cluster Analysis A Weighted Majority Voting based on Normalized Mutual Information for Cluster Analysis Meshal Shutaywi and Nezamoddin N. Kachouie Department of Mathematical Sciences, Florida Institute of Technology Abstract

More information

Sherlock Holmes The Sign Of The Four

Sherlock Holmes The Sign Of The Four We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with sherlock holmes the

More information

DOWNLOAD OR READ : SHERLOCK HOLMES THE COMPLETE COLLECTION PDF EBOOK EPUB MOBI

DOWNLOAD OR READ : SHERLOCK HOLMES THE COMPLETE COLLECTION PDF EBOOK EPUB MOBI DOWNLOAD OR READ : SHERLOCK HOLMES THE COMPLETE COLLECTION PDF EBOOK EPUB MOBI Page 1 Page 2 sherlock holmes the complete collection sherlock holmes the complete pdf sherlock holmes the complete collection

More information

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation

More information

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining. About the Tutorial Data Mining is defined as the procedure of extracting information from huge sets of data. In other words, we can say that data mining is mining knowledge from data. The tutorial starts

More information

SCHEME OF COURSE WORK. Data Warehousing and Data mining

SCHEME OF COURSE WORK. Data Warehousing and Data mining SCHEME OF COURSE WORK Course Details: Course Title Course Code Program: Specialization: Semester Prerequisites Department of Information Technology Data Warehousing and Data mining : 15CT1132 : B.TECH

More information

Vivaldi: A Decentralized Network Coordinate System. Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT. Published at SIGCOMM 04

Vivaldi: A Decentralized Network Coordinate System. Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT. Published at SIGCOMM 04 Vivaldi: A Decentralized Network Coordinate System Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT Published at SIGCOMM 04 Presented by: Emmanouel Kyriakakis Key tool: Synthetic Coordinates

More information

Theoretical Concepts of Machine Learning

Theoretical Concepts of Machine Learning Theoretical Concepts of Machine Learning Part 2 Institute of Bioinformatics Johannes Kepler University, Linz, Austria Outline 1 Introduction 2 Generalization Error 3 Maximum Likelihood 4 Noise Models 5

More information

node2vec: Scalable Feature Learning for Networks

node2vec: Scalable Feature Learning for Networks node2vec: Scalable Feature Learning for Networks A paper by Aditya Grover and Jure Leskovec, presented at Knowledge Discovery and Data Mining 16. 11/27/2018 Presented by: Dharvi Verma CS 848: Graph Database

More information

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask

Machine Learning and Data Mining. Clustering (1): Basics. Kalev Kask Machine Learning and Data Mining Clustering (1): Basics Kalev Kask Unsupervised learning Supervised learning Predict target value ( y ) given features ( x ) Unsupervised learning Understand patterns of

More information

Mining for Patterns and Anomalies in Data Streams. Sampath Kannan University of Pennsylvania

Mining for Patterns and Anomalies in Data Streams. Sampath Kannan University of Pennsylvania Mining for Patterns and Anomalies in Data Streams Sampath Kannan University of Pennsylvania The Problem Data sizes too large to fit in primary memory Devices with small memory Access times to secondary

More information

Lecture 12 Recognition

Lecture 12 Recognition Institute of Informatics Institute of Neuroinformatics Lecture 12 Recognition Davide Scaramuzza 1 Lab exercise today replaced by Deep Learning Tutorial Room ETH HG E 1.1 from 13:15 to 15:00 Optional lab

More information

Effects of Constant Optimization by Nonlinear Least Squares Minimization in Symbolic Regression

Effects of Constant Optimization by Nonlinear Least Squares Minimization in Symbolic Regression Effects of Constant Optimization by Nonlinear Least Squares Minimization in Symbolic Regression Michael Kommenda, Gabriel Kronberger, Stephan Winkler, Michael Affenzeller, and Stefan Wagner Contact: Michael

More information

Clustering Part 4 DBSCAN

Clustering Part 4 DBSCAN Clustering Part 4 Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville DBSCAN DBSCAN is a density based clustering algorithm Density = number of

More information

Challenges in Ubiquitous Data Mining

Challenges in Ubiquitous Data Mining LIAAD-INESC Porto, University of Porto, Portugal jgama@fep.up.pt 1 2 Very-short-term Forecasting in Photovoltaic Systems 3 4 Problem Formulation: Network Data Model Querying Model Query = Q( n i=0 S i)

More information

How Infections Spread on Networks CS 523: Complex Adaptive Systems Assignment 4: Due Dec. 2, :00 pm

How Infections Spread on Networks CS 523: Complex Adaptive Systems Assignment 4: Due Dec. 2, :00 pm 1 Introduction How Infections Spread on Networks CS 523: Complex Adaptive Systems Assignment 4: Due Dec. 2, 2015 5:00 pm In this assignment you will investigate the spread of disease on networks. In class

More information

LAB 2: DATA FILTERING AND NOISE REDUCTION

LAB 2: DATA FILTERING AND NOISE REDUCTION NAME: LAB TIME: LAB 2: DATA FILTERING AND NOISE REDUCTION In this exercise, you will use Microsoft Excel to generate several synthetic data sets based on a simplified model of daily high temperatures in

More information

The Curse of Dimensionality. Panagiotis Parchas Advanced Data Management Spring 2012 CSE HKUST

The Curse of Dimensionality. Panagiotis Parchas Advanced Data Management Spring 2012 CSE HKUST The Curse of Dimensionality Panagiotis Parchas Advanced Data Management Spring 2012 CSE HKUST Multiple Dimensions As we discussed in the lectures, many times it is convenient to transform a signal(time

More information

DATA MINING AND WAREHOUSING

DATA MINING AND WAREHOUSING DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making

More information

Towards Energy Proportionality for Large-Scale Latency-Critical Workloads

Towards Energy Proportionality for Large-Scale Latency-Critical Workloads Towards Energy Proportionality for Large-Scale Latency-Critical Workloads David Lo *, Liqun Cheng *, Rama Govindaraju *, Luiz André Barroso *, Christos Kozyrakis Stanford University * Google Inc. 2012

More information

Temporal Networks. Hiroki Sayama

Temporal Networks. Hiroki Sayama Temporal Networks Hiroki Sayama sayama@binghamton.edu Temporal networks Networks whose topologies and activities change over time Heavily data-driven research E.g. human contacts (email, social media,

More information

Lecture 12 Recognition. Davide Scaramuzza

Lecture 12 Recognition. Davide Scaramuzza Lecture 12 Recognition Davide Scaramuzza Oral exam dates UZH January 19-20 ETH 30.01 to 9.02 2017 (schedule handled by ETH) Exam location Davide Scaramuzza s office: Andreasstrasse 15, 2.10, 8050 Zurich

More information

Parsimonious Active Monitoring and Overlay Routing

Parsimonious Active Monitoring and Overlay Routing Séminaire STORE Parsimonious Active Monitoring and Overlay Routing Olivier Brun LAAS CNRS, Toulouse, France February 23 rd, 2017 Olivier Brun STORE Seminar, February 20th, 2017 1 / 34 Outline Introduction

More information

Supplementary material to Epidemic spreading on complex networks with community structures

Supplementary material to Epidemic spreading on complex networks with community structures Supplementary material to Epidemic spreading on complex networks with community structures Clara Stegehuis, Remco van der Hofstad, Johan S. H. van Leeuwaarden Supplementary otes Supplementary ote etwork

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu [Kumar et al. 99] 2/13/2013 Jure Leskovec, Stanford CS246: Mining Massive Datasets, http://cs246.stanford.edu

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

Using Subspace Constraints to Improve Feature Tracking Presented by Bryan Poling. Based on work by Bryan Poling, Gilad Lerman, and Arthur Szlam

Using Subspace Constraints to Improve Feature Tracking Presented by Bryan Poling. Based on work by Bryan Poling, Gilad Lerman, and Arthur Szlam Presented by Based on work by, Gilad Lerman, and Arthur Szlam What is Tracking? Broad Definition Tracking, or Object tracking, is a general term for following some thing through multiple frames of a video

More information