CIS 467/602-01: Data Visualization

Similar documents
Data Visualization (CIS/DSC 468)

Data Visualization (DSC 530/CIS )

DSC 201: Data Analysis & Visualization

Telecommunications and Internet Access By Schools & School Districts

CostQuest Associates, Inc.

DEPARTMENT OF HOUSING AND URBAN DEVELOPMENT. [Docket No. FR-6090-N-01]

Ocean Express Procedure: Quote and Bind Renewal Cargo

Accommodating Broadband Infrastructure on Highway Rights-of-Way. Broadband Technology Opportunities Program (BTOP)

2018 NSP Student Leader Contact Form

The Lincoln National Life Insurance Company Universal Life Portfolio

Panelists. Patrick Michael. Darryl M. Bloodworth. Michael J. Zylstra. James C. Green

Figure 1 Map of US Coast Guard Districts... 2 Figure 2 CGD Zip File Size... 3 Figure 3 NOAA Zip File Size By State...

A New Method of Using Polytomous Independent Variables with Many Levels for the Binary Outcome of Big Data Analysis

Fall 2007, Final Exam, Data Structures and Algorithms

Distracted Driving- A Review of Relevant Research and Latest Findings

MAKING MONEY FROM YOUR UN-USED CALLS. Connecting People Already on the Phone with Political Polls and Research Surveys. Scott Richards CEO

THE LINEAR PROBABILITY MODEL: USING LEAST SQUARES TO ESTIMATE A REGRESSION EQUATION WITH A DICHOTOMOUS DEPENDENT VARIABLE

2018 Supply Cheat Sheet MA/PDP/MAPD

NSA s Centers of Academic Excellence in Cyber Security

State IT in Tough Times: Strategies and Trends for Cost Control and Efficiency

Presented on July 24, 2018

Silicosis Prevalence Among Medicare Beneficiaries,

B.2 Measures of Central Tendency and Dispersion

Name: Business Name: Business Address: Street Address. Business Address: City ST Zip Code. Home Address: Street Address

IT Modernization in State Government Drivers, Challenges and Successes. Bo Reese State Chief Information Officer, Oklahoma NASCIO President

MERGING DATAFRAMES WITH PANDAS. Appending & concatenating Series

Team Members. When viewing this job aid electronically, click within the Contents to advance to desired page. Introduction... 2

Global Forum 2007 Venice

Department of Business and Information Technology College of Applied Science and Technology The University of Akron

Tina Ladabouche. GenCyber Program Manager

DTFH61-13-C Addressing Challenges for Automation in Highway Construction

Post Graduation Survey Results 2015 College of Engineering Information Networking Institute INFORMATION NETWORKING Master of Science

Charter EZPort User Guide

CSE 781 Data Base Management Systems, Summer 09 ORACLE PROJECT

Best Practices in Rapid Deployment of PI Infrastructure and Integration with OEM Supplied SCADA Systems

Moonv6 Update NANOG 34

cs6964 February TABULAR DATA Miriah Meyer University of Utah

AASHTO s National Transportation Product Evaluation Program

State HIE Strategic and Operational Plan Emerging Models. February 16, 2011

2015 DISTRACTED DRIVING ENFORCEMENT APRIL 10-15, 2015

Sideseadmed (IRT0040) loeng 4/2012. Avo

PulseNet Updates: Transitioning to WGS for Reference Testing and Surveillance

Amy Schick NHTSA, Occupant Protection Division April 7, 2011

Touch Input. CSE 510 Christian Holz Microsoft Research February 11, 2016

Homework Assignment #5

Representation: Design Idioms 1

Prizm. manufactured by. White s Electronics, Inc Pleasant Valley Road Sweet Home, OR USA. Visit our site on the World Wide Web

Presentation Outline. Effective Survey Sampling of Rare Subgroups Probability-Based Sampling Using Split-Frames with Listed Households

A Capabilities Presentation

ACCESS PROCESS FOR CENTRAL OFFICE ACCESS

The Outlook for U.S. Manufacturing

DSC 201: Data Analysis & Visualization

Contact Center Compliance Webinar Bringing you the ANSWERS you need about compliance in your call center.

Steve Stark Sales Executive Newcastle

IAT 355 Intro to Visual Analytics Graphs, trees and networks 2. Lyn Bartram

Geographic Accuracy of Cell Phone RDD Sample Selected by Area Code versus Wire Center

Expanding Transmission Capacity: Options and Implications. What role does renewable energy play in driving transmission expansion?

Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records

CMPE 180A Data Structures and Algorithms in C++ Spring 2018

Be sure to fax AGA a copy of every application you submitted direct along with the fax confirmation page to ensure you receive your commission!

Strengthening connections today, while building for tomorrow. Wireless broadband, small cells and 5G

What Did You Learn? Key Terms. Key Concepts. 68 Chapter P Prerequisites

Visualization Analysis & Design Full-Day Tutorial Session 2

Presentation to NANC. January 22, 2003

Data Visualization (CIS/DSC 468)

We will start at 2:05 pm! Thanks for coming early!

Selling Compellent Hardware: Controllers, Drives, Switches and HBAs Chad Thibodeau

Scalable Data Analysis (CIS )

Development and Maintenance of the Electronic Reference Library

CIS 467/602-01: Data Visualization

Elevation Data Acquisition Update

Medium voltage Marketing contacts

Week 4: Facet. Tamara Munzner Department of Computer Science University of British Columbia

Configuring Oracle GoldenGate OGG 11gR2 local integrated capture and using OGG for mapping and transformations

Distracted Driving U.S. Federal Initiatives. Amy Schick, MS, CHES Office of Impaired Driving and Occupant Protection

Combo Charts. Chapter 145. Introduction. Data Structure. Procedure Options

Jurisdictional Guidelines for Accepting a UCC Record Presented for Filing 2010 Amendments & the 2011 IACA Forms

HYPERVARIATE DATA VISUALIZATION

OPERATOR CERTIFICATION

National Continuity Programs

Visualization Analysis & Design

Visual Encoding Design

Multiple Dimensional Visualization

8. Visual Analytics. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai

2013 Product Catalog. Quality, affordable tax preparation solutions for professionals Preparer s 1040 Bundle... $579

Pro look sports football guide

On All Forms. Financing Statement (Form UCC1) Statutory, MARS or Other Regulatory Authority to Deviate

How to Make an Impressive Map of the United States with SAS/Graph for Beginners Sharon Avrunin-Becker, Westat, Rockville, MD

Multivariate Data & Tables and Graphs

Data Visualization (CIS/DSC 468)

Lecture 6: Statistical Graphics

Week 6: Networks, Stories, Vis in the Newsroom

MOVR. Mobile Overview Report January March The first step in a great mobile experience

Data Visualization (CIS 468)

The evolution of US state government home pages from 1997 to 2002

Today s Lecture. Factors & Sampling. Quick Review of Last Week s Computational Concepts. Numbers we Understand. 1. A little bit about Factors

cs6964 March TREES & GRAPHS Miriah Meyer University of Utah

CIE L*a*b* color model

STATE DATA BREACH NOTIFICATION LAWS OVERVIEW OF REQUIREMENTS FOR RESPONDING TO A DATA BREACH UPDATED JUNE 2017

Information Visualization. Jing Yang Spring Multi-dimensional Visualization (1)

Transcription:

CIS 467/602-01: Data Visualization Tables Dr. David Koop

Assignment 2 http://www.cis.umassd.edu/ ~dkoop/cis467/assignment2.html Plagiarism on Assignment 1 Any questions? 2

Recap (Interaction) Important in visualization; it enables many tasks Types of interactions: - Feedback/Animation - Storytelling - Selection - Details/Highlighting - Navigation Concerns: - Change Blindness - Latency - Human Effort 3

Recap (Tables) Arrange Tables Express Values Separate, Order, Align Regions Separate Order Align 1 Key 2 Keys 3 Keys Many Keys List Matrix Volume Recursive Subdivision Axis Orientation Rectilinear Parallel Radial Layout Density Dense Space-Filling [Munzner (ill. Maguire), 2014] 4

Recap (Scatterplots) Prices in 1980 450 400 350 300 250 200 150 100 50 0 Fish Prices over the Years 0 50 100 150 200 250 300 350 400 450 Prices in 1970 Data: two quantitative values Task: find trends, clusters, outliers How: marks at spatial position in horizontal and vertical directions Scalability: hundreds of points Correlation: dependence between two attributes - Positive and negative correlation - Indicated by lines Coordinate system (axes) and labels are important! 5

Separate, Order, and Align: Categorical Regions Categorical: =,!= Spatial position can be used for categorical attributes Use regions, distinct contiguous bounded areas, to encode categorical attributes Three operations on the regions: - Separate (use categorical attribute) - Align - Order (use some other ordered attribute) Alignment and order can use same or different attribute 6

List Alignment: Bar Charts Data: one quantitative attribute, one categorical attribute Task: lookup & compare values How: line marks, vertical position (quantitative), horizontal position (categorical) What about length? Ordering criteria: alphabetical or using quantitative attribute Scalability: distinguishability - bars at least one pixel wide - hundreds 100 75 50 25 0 100 75 50 25 0 Animal Type Animal Type [Munzner (ill. Maguire), 2014] 7

Stacked Bar Charts 35M 30M 25M Population 65 Years and Over 45 to 64 Years 25 to 44 Years 18 to 24 Years 14 to 17 Years 5 to 13 Years Under 5 Years 20M 15M 10M 5.0M 0.0 CA TX NY FL IL PA OH MI GA NC NJ VA WA AZ MA IN TN MOMD WI MN CO AL SC LA KY OR OK CT IA MS AR KS UT NV NMWV NE ID ME NH HI RI MT DE SD AK ND VT DC WY [Bostock, 2012] 8

Stacked Bar Charts Data: multidimensional table: one quantitative, two categorical Task: lookup values, part-to-whole relationship, trends How: line marks: position (both horizontal & vertical), subcomponent line marks: length, color Scalability: main axis (hundreds like bar chart), bar classes (<12) Orientation: vertical or horizontal (swap how horizontal and vertical position are used. 9

Streamgraphs Include a time attribute Data: multidimensional table, one quantitative attribute (count), one ordered key attribute (time), one categorical key attribute + derived attribute: layer ordering (quantitative) Task: analyze trends in time, find (maxmial) outliers How: derived position+geometry, length, color Scalability: more categories than stacked bar charts fig 2 films from the summer of 2007 [Byron and Wattenberg, 2012] 10

NYTimes Ebb and Flow of Movies" [http://www.nytimes.com/interactive/2008/02/23/movies/20080223_revenue_graphic.html] 11

Dot and Line Charts 20 15 10 5 0 20 15 10 5 0 Year Data: one quantitative attribute, one ordered attribute Task: lookup values, find outliers and trends How: point mark and positions Line Charts: add connection mark (line) Similar to scatterplots but allow ordered attribute Year [Munzner (ill. Maguire), 2014] 12

Proper Use of Line and Bar Charts 60 60 50 50 40 40 30 30 20 20 10 10 0 10-year-olds 12-year-olds 0 10-year-olds 12-year-olds [Zacks and Tversky, 1999, Munzner (ill. Maguire), 2014] 13

Proper Use of Line and Bar Charts 60 60 50 50 40 40 30 30 20 20 10 10 0 Female Male 0 Female Male [Zacks and Tversky, 1999, Munzner (ill. Maguire), 2014] 14

Aspect Ratio Trends in line charts are more apparent because we are using angle as a channel Perception of angle (and the relative difference between angles) is important Initial experiments found people judge differences in slope when angles are around 45 degrees (Cleveland et al., 1988, 1993) 15

Multiscale Banking 706 Use frequency analysis to determine ratios ON VISUALIZATION A IEEEaspect TRANSACTIONS Add trend curves using a low-pass filter Sunspot Cycles Aspect Ratio = 3.96 Aspect Ratio = 22.35 Power Spectrum [Heer and Agrawala, 2006] 16

Multiscale Banking ON AND COMPUTER GRAPHICS, VOL. 12, NO. 5, SEPTEMBER/OCTOBER 2006 PRMTX Mutual Fund Aspect Ratio = 4.23 Aspect Ratio = 14.55 Power Spectrum [Heer and Agrawala, 2006] 17

Expanding the Study Cleveland et al. did not study the entire space of slope comparisons and 45 degrees was at the low end of their study (blue marks on right) Talbot et al. compared more slopes and found that people do better with smaller slopes Baselines may aid with this Slope ratio (pij) 87% 65% 48% 35% 25% 17% 11% 9 18 30 45 60 72 81 Mid angle (θ m ) [Talbot et al., 2013] 18

Heatmaps Data: Two keys, one quantitative attribute Task: Find clusters, outliers, summarize How: area marks in grid, color encoding of quantitative attribute Scalability: number of pixels for area marks (millions), <12 colors Red-green color scales often used - Be aware of colorblindness! Fast-Pitch Softball Slugging Percentage [fastpitchanalytics.com] 19

Bertin Matrices Must we only use color? - What other marks might be appropriate? [C.Perrin et al., 2014] 20

Bertin Matrices Must we only use color? - What other marks might be appropriate? [C.Perrin et al., 2014] 20

Bertin s Encodings Text 0 1 2 3 4 5 6 7 8 9 10 11 text Grayscale QUANTITY OF INK ENCODINGS POSITIONAL ENCODING MEAN-BASED ENCODINGS Circle Dual bar chart Bar chart Line Black and white bar chart Average bar chart [C.Perrin et al., 2014] 21

Matrix Reordering [Bertin Exhibit (INRIA, Vis 2014), Photo by Robert Kosara] 22

Cluster Heatmap [File System Similarity, R. Musăloiu-E., 2009] 23

Cluster Heatmap Data & Task: Same as Heatmap How: Area marks but matrix is ordered by cluster hierarchies Scalability: limited by the cluster dendrogram Dendrogram: a visual encoding of tree data with leaves aligned 24

Sepal.Length 2.0 3.0 4.0 0.5 1.5 2.5 4.5 5.5 6.5 7.5 2.0 3.0 4.0 Sepal.Width Petal.Length 1 2 3 4 5 6 7 4.5 5.5 6.5 7.5 0.5 1.5 2.5 1 2 3 4 5 6 7 Petal.Width Iris Data (red=setosa,green=versicolor,blue=virginica) Scatterplot Matrix (SPLOM) Data: Many quantitative attributes Derived Data: names of attributes Task: Find correlations, trends, outliers How: Scatterplots in matrix alignment Scale: attributes: ~12, items: hundreds? Visualizations in a visualization: at high level, marks are themselves visualizations 25 [R command from Wikipedia]

Spatial Axis Orientation So far, we have seen the vertical and horizontal axes (a rectilinear layout) used to encode almost everything What other possibilities are there for axes? [Munzner (ill. Maguire), 2014] 26

Spatial Axis Orientation So far, we have seen the vertical and horizontal axes (a rectilinear layout) used to encode almost everything What other possibilities are there for axes? - Parallel axes Math Physics Dance Drama 100 90 80 70 60 50 40 30 20 10 0 Parallel Coordinates [Munzner (ill. Maguire), 2014] 26

Spatial Axis Orientation So far, we have seen the vertical and horizontal axes (a rectilinear layout) used to encode almost everything What other possibilities are there for axes? - Parallel axes - Radial axes Math Physics Dance Drama 100 90 80 70 60 50 40 30 20 10 0 Parallel Coordinates [Munzner (ill. Maguire), 2014] 26

Parallel Coordinates Data: many quantitative attributes Task: Find trends, extremes, correlation How: vertical spatial position for each attribute, connection marks for identity, axes horizontally spaced Scalability: <40 attributes, hundreds of values Connection marks help visualize trends between particular values Ordering the horizontal axes is important Not as well-known, often requires learning 27

Comparing SPLOMs and Parallel Coordinates Math Physics Dance Drama Scatterplot Matrix Math Physics Dance Drama Math Physics Dance Drama 100 90 80 70 60 50 40 30 20 10 0 Parallel Coordinates [Munzner (ill. Maguire), 2014] 28

Correlation in Parallel Coordinates 1.0 0.8 0.2 0-0.2-0.8-1 [Wegman, 1990] 29

Overdraw in Parallel Coordinates [Fua et al., 1999] 30

Hierarchical Parallel Coordinates [Fua et al., 1999] Figure 4: This image sequence shows a Fatal Accident data set of 230,000 data elements at different level of details. The first image shows a cut across the root node. The last image shows the cut chaining all the leaf nodes. The rest of the images show intermediate cuts at varying 31 CIS 467, Spring 2015 levels-of-detail. (See Color Plates).