cs6964 February TABULAR DATA Miriah Meyer University of Utah

Similar documents
cs6964 March TREES & GRAPHS Miriah Meyer University of Utah

Lecture 6: Statistical Graphics

CIS 467/602-01: Data Visualization

Parallel Coordinates ++

Few s Design Guidance

Navigating Hierarchies with Structure-Based Brushes

Parallel Coordinates CS 6630 Scientific Visualization

CIE L*a*b* color model

DSC 201: Data Analysis & Visualization

Lecture 12: Graphs/Trees

MoVis Movie Recommendation and Visualization

Background. Parallel Coordinates. Basics. Good Example

Data Visualization (CIS/DSC 468)

Week 6: Networks, Stories, Vis in the Newsroom

Computational Foundations of Cognitive Science

DSC 201: Data Analysis & Visualization

MoVis. Movie Recommendation and Visualization. CPSC 547 Infomation Visualization Project Presentation. Ye Chen & Yujie Yang 2015/12/15

CS Information Visualization Sep. 2, 2015 John Stasko

Separable Kernels and Edge Detection

Domain, task, and dataset

Multivariate Data & Tables and Graphs

Representation: Design Idioms 1

Multivariate Data & Tables and Graphs. Agenda. Data and its characteristics Tables and graphs Design principles

Multiple Dimensional Visualization

Multidimensional Visualization and Clustering

CS-5630 / CS-6630 Visualization for Data Science The Visualization Alphabet: Marks and Channels

Multivariate Data & Tables and Graphs. Agenda. Data and its characteristics Tables and graphs Design principles

Glyphs. Presentation Overview. What is a Glyph!? Cont. What is a Glyph!? Glyph Fundamentals. Goal of Paper. Presented by Bertrand Low

CP SC 8810 Data Visualization. Joshua Levine

Lecture 13: Graphs and Trees

Statistical graphics in analysis Multivariable data in PCP & scatter plot matrix. Paula Ahonen-Rainio Maa Visual Analysis in GIS

Part I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26

Texture. Texture. 2) Synthesis. Objectives: 1) Discrimination/Analysis

RT_Image v0.2β User s Guide

MGMT 3125 Introduction to Data Visualization

This research aims to present a new way of visualizing multi-dimensional data using generalized scatterplots by sensitivity coefficients to highlight

Visual Encoding Design

Information Visualization

Chapter 2: From Graphics to Visualization

3. Multidimensional Information Visualization II Concepts for visualizing univariate to hypervariate data

Structure-Based Brushes: A Mechanism for Navigating Hierarchically Organized Data and Information Spaces

Information Visualization. Jing Yang Spring Multi-dimensional Visualization (1)

Grade 5 Unit 5 Addition and Multiplication with Volume and Area (5 Weeks)

Gene Clustering & Classification

Digital Image Processing. Image Enhancement - Filtering

8. Visual Analytics. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai

Vector Visualization

Evgeny Maksakov Advantages and disadvantages: Advantages and disadvantages: Advantages and disadvantages: Advantages and disadvantages:

We will start at 2:05 pm! Thanks for coming early!

CS Information Visualization Sep. 19, 2016 John Stasko

Lecture 4: Spatial Domain Transformations

CSE 5243 INTRO. TO DATA MINING

Data Visualization (DSC 530/CIS )

Digital Image Processing (CS/ECE 545) Lecture 5: Edge Detection (Part 2) & Corner Detection

Marks. Marks can be classified according to the number of dimensions required for their representation: Zero: points. One: lines.

Lecture 11: High Dimensionality

cs6964 March TOOLKITS Miriah Meyer University of Utah

3D Models and Matching

Projekt 1 Ausarbeitung

Facet: Multiple View Methods

Visual Encoding Design

Introduction to Geospatial Analysis

Data Mining Cluster Analysis: Basic Concepts and Algorithms. Slides From Lecture Notes for Chapter 8. Introduction to Data Mining

GRID BASED CLUSTERING

Excel 2010 Worksheet 3. Table of Contents

Data Visualization. Fall 2016

Lecture 13: Graphs/Trees

EE368/CS232 Digital Image Processing Winter

Lecture Topic Projects

DASHBOARDPRO & DASHBOARD

COMP 465: Data Mining Still More on Clustering

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked

BBS654 Data Mining. Pinar Duygulu. Slides are adapted from Nazli Ikizler

[Slides Extracted From] Visualization Analysis & Design Full-Day Tutorial Session 4

Interactive Visual Exploration

CS535 Fall Department of Computer Science Purdue University

Data Visualization (CIS/DSC 468)

CPSC 425: Computer Vision

Components for Xcelsius. Micro Components

Visualization with Data Clustering DIVA Seminar Winter 2006 University of Fribourg

Analytics and Visualization of Big Data

Week 4: Facet. Tamara Munzner Department of Computer Science University of British Columbia

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska

Panning and Zooming. CS 4460/ Information Visualization April 8, 2010 John Stasko

Graph Clarity, Simplification, & Interaction

1 (5 max) 2 (10 max) 3 (20 max) 4 (30 max) 5 (10 max) 6 (15 extra max) total (75 max + 15 extra)

DATA ABSTRACTION & INTRO TO TABLEAU

10-701/15-781, Fall 2006, Final

Atypical Behavior Identification in Large Scale Network Traffic

> objective(s): Students will create a text-only design in either Adobe Illustrator or Photoshop

Image Rights Options: Using Banding, Branding, and

Graphs and Networks 2

SHOW ME THE NUMBERS: DESIGNING YOUR OWN DATA VISUALIZATIONS PEPFAR Applied Learning Summit September 2017 A. Chafetz

DATA MINING LECTURE 7. Hierarchical Clustering, DBSCAN The EM Algorithm

Clustering CS 550: Machine Learning

/5 Stacks. Displays the slice that follows the currently displayed slice. As a shortcut, press the > key.

CPSC 340: Machine Learning and Data Mining. Outlier Detection Fall 2018

Final Exam Study Guide

Network Traffic Measurements and Analysis

Transcription:

cs6964 February 23 2012 TABULAR DATA Miriah Meyer University of Utah

cs6964 February 23 2012 TABULAR DATA Miriah Meyer University of Utah slide acknowledgements: John Stasko, Georgia Tech Tamara Munzner, UBC

administrivia 3

- for final projects that have been approved, email me: - working title - group member names - two or three sentence description 4

LAST TIME 5

6

- curse of dimensionality - linear methods - multidimensional scale - manifold learning 7

WHERE ARE WE? - covered so far - abstractions - methods - visual representations - interactions - next stage: use these ideas for analysis and design - analyze previously proposed techniques and systems - design new techniques and systems - me: next couple of lectures as examples - you: project proposal and topic presentations 8

- multiscale scatterplots - hierarchical parallel coordinates - streamgraph 9

10

MULTISCALE SCATTERPLOTS - blur shows structure at multiple scales - convolve with Gaussian - slider to control scale parameter interactively - easily selectable regions in quantized image 11 Chiricota 2004

MULTISCALE SCATTERPLOTS - problem characterization: - generic network exploration - minimal problem context - paper is technique-driven not problem-driven - abstraction - task - selecting and filtering at different scales (within scatterplots) 12

DATA ABSTRACTION - original data - relational network - such as links between Java classes - derived attributes - two structure metrics for network - edge width: cluster cohesiveness - edge color: logical dependencies between classes - thus, table of numbers! 13 Chiricota 2004

DESIGN - basic solution - visual representation: scatterplots - mark type: points - channels: horizontal and vertical position - interaction technique: range sliders - filter max / min - challenge - interesting areas might not be easy to select as rectangular bounding box 14 Chiricota 2004

MULTISCALE SCATTERPLOT SELECTION TECHNIQUE - new representation - derived space created from original scatterplot image - greyscale patches forming complex shapes - enclosure of darker patches within lighter patches - new interaction - simple - sliders for filtering size of patch and number of levels - complex - single click to select all items at and below the specified level 15 Chiricota 2004

ALGORITHM - creating derived space - greyscale intensity is combination of: - blurred proximity relationships from original scatterplot image: convolve with Gaussian filter - point density in original scatterplot image - similar to splatting techniques - quantize image into k levels 16 Chiricota 2004

METHOD: LINKED VIEWS - linked scatterplot and node-link network view - linked highlighting - linked filtering 17 Chiricota 2004

RESULTS: IMDB - original data: IMDB graph of actors - metrics: network centrality, node degree - three hubs selected in network view 18 Chiricota 2004

RESULTS: IMDB - single click in blurred scatterplot view selects entire clique 19 Chiricota 2004

CRITIQUE: what do you think? 20

CRITIQUE - strengths - successful construction and use of derived space - appropriate validation - qualitative discussion of result images to show new technique capabilities - synergy between encoding and interaction choices - weaknesses - tricky to follow thread of argument - intro/framing focuses on network exploration - but, fundamental technique contribution more about scatterplot encoding and interaction 21

22

HIERARCHICAL PARALLEL COORDINATES - technique-driven paper - no problem characterization - goal: scale up parallel coordinates to large datasets - challenge: overplotting/occlusion 23 Fua 1999

PARALLEL COORDINATES - scatterplot limitation: visual representation with orthogonal axes - can show only two attributes with spatial position channel - alternative: line up axes in parallel to show many attributes with position - item encoded with a line with n segments - n is the number of attributes shown 24

EXAMPLE 25

EXAMPLE V1 V2 V3 V4 V5 D1

EXAMPLE V1 V2 V3 V4 V5 D2

EXAMPLE V1 V2 V3 V4 V5 D3

PARALLEL COORDINATES TASK - show correlation - positive correlation: straight lines - negative correlation: all lines cross at a single pt 29 Wegman 1990

PARALLEL COORDINATES TASK do you see any correlations? 30 Fua 1999

PARALLEL COORDINATES TASK - visible patterns only between neighboring axis pairs - how to pick axis order? - usual solution: reorderable axes, interactive exploration - same weakness as many other techniques - downside: human-powered search - not directly addressed in HPC paper 31 Fua 1999

HIERARCHICAL PARALLEL COORDINATES - data abstraction - original data - table of numbers - derived data - hierarchical clustering of items in table - clustering stats: # of points, mean, min, max, size, depth - cluster density: points/size - cluster proximity: linear ordering from tree traversal - task abstraction - find correlations - find trends and outliers at multiple scales 32

HPC: ENCODING DERIVED DATA - visual representation: variable-width opacity bands - show whole cluster, not just single item - min / max: spatial position - cluster density: transparency - mean: opaque 33 Fua 1999

HPC: INTERACTING WITH DERIVED DATA - interactively change level of detail to navigate cluster hierarchy 34 Fua 1999

HPC: ENCODING DERIVED DATA - visual encoding: color based on cluster proximity derived attribute - resolves ambiguity from crossings, clarifies structure 35 Fua 1999

HPC: MAGNIFICATION INTERACTION - dimensional zooming: use all available space - methods - linked vies to show true extent - overview + detail to maintain context 36 Fua 1999

CRITIQUE: what do you think? 37

CRITIQUE - parallel coordinates - strengths - can be a useful additional view - (rare to use completely stand-alone) - now popular, many follow-on techniques - weakness - major learning curve, difficult for novices - hierarchical parallel coordinates - strengths - success with major scalability improvement - careful construction and use of derived space - appropriate validation (result image discussion) - weakness - interface complexity (structure-based brushing) 38

39

STREAMGRAPH - problem-driven paper - development of new technique to solve a specific problem - challenge - convey a large amount of data in a way that engages mass audiences 40 Byron 2008

STREAMGRAPH - problem: show personal last.fm history - want to visually embody personal connection that listeners have with their music - design considerations - use stacked graph - focus on legibility and aesthetics - abstraction - task: engage audience 41

DATA ABSTRACTION - original data - set of time series - derived data - layer silhouette - consider baseline - consider deviation - consider wiggliness 42 Byron 2008

DESIGN - visual representation: stacked graph - new technique for minimizing wiggle of layers - color: 2D colormap - hue: time of onset - saturation: popularity - labels - placed where embedded labels can be largest 43 Byron 2008

DESIGN - layer ordering - inside-out ordering - avoid diagonal striping effect - burst are on outside which minimizes effect on other layers - prevents drift away from x- axis 44 Byron 2008

EVALUATION - case study of NYTimes graphic - gathered many comments from social media sites - categorized comments - legibility issues - engagement - aethestics 45

COMMENTS 46

47

CRITIQUE: what do you think? 48

CRITIQUE - strengths - clear target problem - thorough evaluation of aesthetic and legibility issues - reached and engaged a large audience - weaknesses - stacked graphs make between comparisons difficult - both between time points and time series 49

THE SMARTPHONE CHALLENGE 50

part 6 - get back into large groups - share sketches - turn in abstraction and individual sketches 51

L14: Graphs and Trees REQUIRED READING 52

53

54

55