Interactive Visualization of the Stock Market Graph

Similar documents
Data Mining and. in Dynamic Networks

DualNet: A Coordinated View Approach to Network Visualization

Geometric Techniques. Part 1. Example: Scatter Plot. Basic Idea: Scatterplots. Basic Idea. House data: Price and Number of bedrooms

Large Scale Information Visualization. Jing Yang Fall Tree and Graph Visualization (2)

Visualization Systems & Toolkits

A Source Code History Navigator

Scalable Pixel-based Visual Interfaces: Challenges and Solutions

Large Scale Information Visualization. Jing Yang Fall Graph Visualization

Courtesy of Prof. Shixia University

Visualization and text mining of patent and non-patent data

Computing Largest Correcting Codes and Their Estimates Using Optimization on Specially Constructed Graphs p.1/30

Potential issues: * Difficulty in coding and/or finding the. * Should I provide more source code? * Others

Information Visualization. Jing Yang Spring Graph Visualization

Week 4: Facet. Tamara Munzner Department of Computer Science University of British Columbia

Multidimensional Interactive Visualization

A Search Set Model of Path Tracing in Graphs: Supplementary Materials

Visualizing LiveJournal Social Networks Through. Clustering

Narrowing It Down: Information Retrieval, Supporting Effective Visual Browsing, Semantic Networks

GLO-STIX! Graph-Level Operations for Specifying Techniques and Interactive exploration

A Constrained Spreading Activation Approach to Collaborative Filtering

Finding Anomalies in Time-Series using Visual Correlation for Interactive Root Cause Analysis

Trees & Graphs. Nathalie Henry Riche, Microsoft Research

Data Visualization Principles: Interaction, Filtering, Aggregation CSC444

Overview. Social Networks Vis. Measures in SNA. What are social networks?

Learning video saliency from human gaze using candidate selection

Information Visualization & Visual Analytics

Overlapping Clustered Graphs: Co-authorship Networks Visualization

Differential Privacy. Seminar: Robust Data Mining Techniques. Thomas Edlich. July 16, 2017

Operation Manual for Real Time Quotes Service

Progressive Parallel Coordinates

Tuple Information Visualization Publications Browser

Learning a Distance Metric for Structured Network Prediction. Stuart Andrews and Tony Jebara Columbia University

Statistical analysis of nancial networks

Modeling Dynamic Behavior in Large Evolving Graphs

A Web Application to Visualize Trends in Diabetes across the United States

A Dual-View Approach to Interactive Network Visualization

Lecture 13: Graphs and Trees

Visual Analysis of Classification Scheme

CIS 467/602-01: Data Visualization

Graphs and Networks 2

Glyphs. Presentation Overview. What is a Glyph!? Cont. What is a Glyph!? Glyph Fundamentals. Goal of Paper. Presented by Bertrand Low

A Constrained Spreading Activation Approach to Collaborative Filtering

New Bee. Samyuktha Sridharan Xuanyi Qi Hanshu Lin

Multi-Dimensional Vis

OpenMP for next generation heterogeneous clusters

Theme Identification in RDF Graphs

OBJECT-CENTERED INTERACTIVE MULTI-DIMENSIONAL SCALING: ASK THE EXPERT

Overview of Clustering

Frequent Pattern Mining in Data Streams. Raymond Martin

OVERVIEW AND DETAIL FOCUS+CONTEXT. Information Visualization Fall 2009 Jinwook Seo SNU CSE

Event Detection through Differential Pattern Mining in Internet of Things

Fennel: Streaming Graph Partitioning for Massive Scale Graphs

Data Clustering Hierarchical Clustering, Density based clustering Grid based clustering

Social-Network Graphs

Lecture 13: Graphs/Trees

2015 Entrinsik, Inc.

Review: Identification of cell types from single-cell transcriptom. method

A Parallel Workflow for Online Correlation and Clique-finding

Comp/Phys/Mtsc 715. Preview Videos 4/5/2012. Information Display and Spatial Embeddings. Information Visualization and Tufte. Vis 2004: robbins.

Data Visualization. Fall 2016

An Intelligent Clustering Algorithm for High Dimensional and Highly Overlapped Photo-Thermal Infrared Imaging Data

Matrix Representation of Graphs

Topological map merging

An Empirical Analysis of Communities in Real-World Networks

Construction Change Order analysis CPSC 533C Analysis Project

Visualization Tools. Interaction. How do people create visualizations? Jeffrey Heer Stanford University

Detecting Controversial Articles in Wikipedia

i Maximum Number of Windows Open: 4 (incl. HK-Share & A-Share)

Independent Study Report IS2970, Fall How Your Users Help You: A Survey of Feedback Mechanism in Social Network Sites.

MODEL VERSIONING IN CONTEXT OF LIVING MODELS

Exploratory Analysis: Clustering

PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets

Multidimensional Visualization and Clustering

Week 6: Networks, Stories, Vis in the Newsroom

Visual Analysis of Set Relations in a Graph

InfoVis Systems & Toolkits

Web Structure Mining Community Detection and Evaluation

Clustering in Networks

RINGS : A Technique for Visualizing Large Hierarchies

Network visualization techniques and evaluation

Document Visualization Overview and Detail by Selectively Varying Density

Unsupervised Data Mining: Clustering. Izabela Moise, Evangelos Pournaras, Dirk Helbing

CS Information Visualization Sep. 2, 2015 John Stasko

INFO 424, UW ischool 11/1/2007

A Top-Down Visual Approach to GUI development

Visualization of EU Funding Programmes

Fast Efficient Clustering Algorithm for Balanced Data

Incremental Real-time Bundle Adjustment for Multi-camera Systems with Points at Infinity

Understanding Clustering Supervising the unsupervised

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

Further Thoughts on Precision

Clustering. Chapter 10 in Introduction to statistical learning

Effects of Semantic Cache on Mobile Database System

Recommendation System for Location-based Social Network CS224W Project Report

Predictive Analysis: Evaluation and Experimentation. Heejun Kim

Network Based Models For Analysis of SNPs Yalta Opt

Graph/Network Visualization

Panning and Zooming. CS 4460/ Information Visualization April 8, 2010 John Stasko

Interaction. What is Interaction? From Google: Reciprocal action between a human and a computer One of the two main components in infovis

A P2P-based Incremental Web Ranking Algorithm

Transcription:

Interactive Visualization of the Stock Market Graph Presented by Camilo Rostoker rostokec@cs.ubc.ca Department of Computer Science University of British Columbia

Overview 1. Introduction 2. The Market Graph 3. Motivation 4. Visualization Goals 5. Solutions & Methods 6. Future Work 7. Conclusion 8. Demo

Stock Market Data Huge amounts of accessible data on a daily basis Consists of a variety of fields such as price, volume, change Stock price interactions form a complex system Want to understand these interactions of the subsystems

Constructing the Market Graph 1. From a dataset, compute the correlation matrix 2. Convert correlation matrix to a graph, where Vertices represent stocks Edges represents a relationships between two stocks correlation(stock1,stock2) > THRESHOLD

What Are We Visualizing? Find clusters/groups of stocks that exhibit certain trading patterns Maximum Cliques Highly positively/negatively correlated subsets of stocks Independent Sets Completely diversified stocks Quasi-Cliques/Independent Sets Generalizations allow for near matches Clusters? Cliques/IS interchangeably

Existing Approaches to Visualizing Graph Structures 1. Determine target structures (i.e. clusters) a priori and use a standard layout algorithm to show the results 2. Use a layout algorithm optimized to visually differentiate target structures 3. Our approach: combine the two Find target structures first, but include additional nodes and edges for context Then use force-directed layout algorithm to effectively visualize the results

Example: Vizster

Motivation: Usage Scenarios Portfolio management (static) Real-time market analysis (dynamic) Exploratory analysis of trading data to gain new insights, spot patterns/trends, etc (static)

Motivation: Visualizing Results from a Real-time Data-mining Pipeline FILE WEB DB Data Collector Data Filter Compute Distance Matrix Graph Clustering Graph Update Server Viz Client Viz Client

Visualization Goals 1. Visualize different graph structures representing various patterns and trends (quasi-)cliques and (quasi-)independent sets positively and negatively correlations 2. Represent inter-cluster relationships 3. Dynamic graph capabilities 4. Interaction for efficient data exploration 5. Information integration

Force-Directed Graph Layout Model Create summaries of the graph using the clusters and their induced subgraphs Force model: spring-embedded layout Spring lengths and tensions parameterized to optimize layout Highly related clusters should be close Independent clusters and minimally related clusters should be further apart

F.D. Model Parameterizations Edge Length Cluster-Cluster edges (CC) # intra-cluster edges (shows connectedness of clusters) Cluster-Member edges (CM) Quasi-cliques # intra-cluster edges ( clique contribution ) Cliques: cluster sizes (more space to larger clusters) Tension CM edges use constant tight tension CC edge tension proportional to # of inter-cluster links

Differentiating cluster types Correlation Metrics: positive, negative, independent Color encoded Cluster types: (quasi-) Cliques and (quasi-) Independent sets Transparency-encoding for cluster summary Individual members edge length encodes clique contribution

Interaction & Information Integration Interaction Features Geometric pan/zoom Display/hide cluster outlines Symbol search for quick navigation Overview display for global context Node context menus provide stock quotes and news: Stock news from various sources integrated via RSS feeds Online quote details and Google search for provided by opening an external web browser

Dynamic graph capabilities Receive remote graph updates via socket connection to a graph update server Nodes/edges can be added, removed or replaced Event-based architecture allows for automatic processing of new updates Force-model allows for efficient incremental layouts when new nodes/edges placed intelligently

Future Work & Improvements Handle overlapping clusters Encode other variables i.e. node size could encode trade volume Ability to view underlying edge weights Ability to optionally view complete underlying graph especially the intra-cluster edges

Future Work & Improvements (2) Interactively adding/removing nodes and edges Semantic zoom Focus+Context Other clustering methods besides partitioning via (quasi-)cliques and independent sets

Conclusion Implemented basic Visualization tool for exploring the market graph Visualizes different cluster types and their attributes User interaction for pan/zoom, on-demand details (quotes, news, web search) Dynamic graph capability to support a real-time data processing pipeline

References 1. Jeffrey Heer and Danah Boyd. Vizster: Visualizing online social networks. InfoVis 2005 IEEE Symposium on Information Visualization, 2005. 2. Jeffrey Heer, Stuart K. Card, and James A. Landay. prefuse: a toolkit for interactive information visualization. In CHI 05: Proceedings of the SIGCHI conference on Human factors in computing systems, pages 421 430, New York, NY, USA, 2005. ACM Press. 3. Frank van Ham and Jarke J. van Wijk. Interactive visualization of small world graphs. In Proceedings of the IEEE Symposium on Information Visualization, pages 199 206, Washington, DC, USA, 2004. IEEE Computer Society. 4. Vladimir Boginski, Sergiy Butenko, and Panos M. Pardalos. Mining market data: A network approach.

DEMO

THE END!

Construct a Similarity Matrix Currently, our similarity measure is where: