CS839: Epistemic Net Visualization Challenge

Similar documents
Using Perspective Rays and Symmetry to Model Duality

Week 7 Picturing Network. Vahe and Bethany

Assessment Schedule 2012 Mathematics and Statistics: Investigate relationships between tables, equations and graphs (91028)

Year 9: Long term plan

Prentice Hall Mathematics: Course Correlated to: Colorado Model Content Standards and Grade Level Expectations (Grade 6)

8. Visual Analytics. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai

acute angle An angle with a measure less than that of a right angle. Houghton Mifflin Co. 2 Grade 5 Unit 6

Shading Techniques Denbigh Starkey

What is it that we want our students to know, understand, do and communicate KUDCO?

Knowledge Discovery and Data Mining 1 (VO) ( )

Creating a Box-and-Whisker Graph in Excel: Step One: Step Two:

SHAPE AND STRUCTURE. Shape and Structure. An explanation of Mathematical terminology

CIS 4930/ SCIENTIFICVISUALIZATION

Number/Computation. addend Any number being added. digit Any one of the ten symbols: 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9

Midterm Exam CS 184: Foundations of Computer Graphics page 1 of 11

Polygons and Angles: Student Guide

INFORMATION FOR PARENTS AND CARERS TARGETS IN MATHEMATICS

Grade 4 CCSS Pacing Guide: Math Expressions

KS3 Curriculum Plan Maths - Core Year 7

6th Grade ~ Conceptual Foundations for Unit of Study 8 Geometry DRAFT 6/30/11 Geometry (Spatial Sense & Reasoning) 1. Shapes, Solids and Properties

Scalar Visualization

2nd Grade. 2D Shapes - Part 1 Angles and Sides. Slide 1 / 117 Slide 2 / 117. Slide 3 / 117. Slide 4 / 117. Slide 6 / 117.

Grade 6 Math Circles. Spatial and Visual Thinking

Vector Visualization

Big Mathematical Ideas and Understandings

TIMSS 2011 Fourth Grade Mathematics Item Descriptions developed during the TIMSS 2011 Benchmarking

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26

CS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp

A triangle that has three acute angles Example:

Whole Numbers and Integers. Angles and Bearings

Geometric Entities for Pilot3D. Copyright 2001 by New Wave Systems, Inc. All Rights Reserved

Advanced data visualization (charts, graphs, dashboards, fever charts, heat maps, etc.)

Distances, Clustering! Rafael Irizarry!

Barrhead High School Mathematics Department. National 4 Mathematics. Learning Intentions & Success Criteria: Assessing My Progress

Presentation of GcoPS Icy plugin

Mathematics Curriculum

Just change the sign of the -coordinate. Let s look at the triangle from our previous example and reflect

Scalar Field Visualization I

Scott Foresman Investigations in Number, Data, and Space Content Scope & Sequence Correlated to Academic Language Notebooks The Language of Math

Chapter 3 Image Registration. Chapter 3 Image Registration

TIPS4Math Grades 4 to 6 Overview Grade 4 Grade 5 Grade 6 Collect, Organize, and Display Primary Data (4+ days)

Vectors and the Geometry of Space

GTPS Curriculum Mathematics Grade 8

Visual Analytics. Visualizing multivariate data:

Assessment for learning activity. Stage 4 or 5 (linked to the Mathematics and steel rolling unit) Activity name: Mathematics and steel rolling

Let s use Technology Use Data from Cycle 14 of the General Social Survey with Fathom for a data analysis project

Key Learning for Grade 3

4. Basic Mapping Techniques

Curriculum Catalog

Fast, Easy, and Publication-Quality Ecological Analyses with PC-ORD

Appendix 14C: TIMSS 2015 Eighth Grade Mathematics Item Descriptions Developed During the TIMSS 2015 Benchmarking

Chapter 4. Clustering Core Atoms by Location

Introduction : Applying Lines of Symmetry

Section 9.3 Graphing Quadratic Functions

This strand involves properties of the physical world that can be measured, the units used to measure them and the process of measurement.

Summary Of Topics covered in Year 7. Topic All pupils should Most pupils should Some pupils should Learn formal methods for

POSITION, DIRECTION AND MOVEMENT Year 1 Year 2 Year 3 Year 4 Year 5 Year 6 Use mathematical

Transforming Objects and Components

S8.6 Volume. Section 1. Surface area of cuboids: Q1. Work out the surface area of each cuboid shown below:

Analysis and visualization with v isone

Lesson 1. Unit 2 Practice Problems. Problem 2. Problem 1. Solution 1, 4, 5. Solution. Problem 3

February 07, Dimensional Geometry Notebook.notebook. Glossary & Standards. Prisms and Cylinders. Return to Table of Contents

Class Lesson Materials Time P 1 Numbers Counting 50

Transformation, tessellation and symmetry line symmetry

Regular polygons and tessellations Criterion D: Reflection

MicroStrategy Desktop

Information Technology and Media Services. Office Excel. Charts

Mathematics Grade 4 Operations and Algebraic Thinking Number and Operations in Base Ten Number and Operations- Fractions Measurement and Data

Sections Graphical Displays and Measures of Center. Brian Habing Department of Statistics University of South Carolina.

3 Dimensional Solids. Table of Contents. 3 Dimensional Solids Nets Volume Prisms and Cylinders Pyramids, Cones & Spheres

Escher s Circle Limit Anneke Bart Saint Louis University Introduction

School District of Marshfield Mathematics Standards

Unit 1, Lesson 1: Moving in the Plane

Topics and things to know about them:

HOUR 12. Adding a Chart

Matrix Transformations The position of the corners of this triangle are described by the vectors: 0 1 ] 0 1 ] Transformation:

MERBEIN P-10 COLLEGE MATHS SCOPE & SEQUENCE

Scalar Visualization

5. Compare the volume of a three dimensional figure to surface area.

This image cannot currently be displayed. Course Catalog. Pre-algebra Glynlyon, Inc.

Integers & Absolute Value Properties of Addition Add Integers Subtract Integers. Add & Subtract Like Fractions Add & Subtract Unlike Fractions

BD Lyoplate Human Screen Analysis Instructions For analysis using FCS Express or FlowJo and heatmap representation in Excel 2007

Year Nine Scheme of Work. Overview

Evolution of Impossible Objects

Oklahoma C 3 Mathematics Standards

The exam begins at 2:40pm and ends at 4:00pm. You must turn your exam in when time is announced or risk not having it accepted.

Design and Analysis of an Euler Transformation Algorithm Applied to Full-Polarimetric ISAR Imagery

Houghton Mifflin MATHEMATICS Level 5 correlated to NCTM Standard

Lecture 3 Sections 2.2, 4.4. Mon, Aug 31, 2009

Integrated Algebra/Geometry 1

9/17/2009. Wenyan Li (Emily Li) Sep. 15, Introduction to Clustering Analysis

Chapter 1: Symmetry and Surface Area

Madison County Schools Suggested 4 th Grade Math Pacing Guide

Which is better? Sentential. Diagrammatic Indexed by location in a plane

Websites of different companies

Reflection and Refraction

Symmetry Students complete symmetric figures that are partially given and create their own symmetric figures.

Trigonometry Review Version 0.1 (September 6, 2004)

For Exercises 1 4, follow these directions. Use the given side lengths.

Office Excel. Charts

Transcription:

CS839: Epistemic Net Visualization Challenge Chaman Singh Verma, Kim Nak Ho and Emma Turetsky March 17, 2010 Abstract Epistemic networks consist of sets of correlated concepts. The data is represented in matrices where entry M(i, j) reveals the strength of association between concept i and concept j. Designing visualizations of these networks to make comparisons between different datasets poses many challenges, especially when the networks become large and relational ties are prevalent. In this project, we present two ways to represent these network aiming at large networks. 1 Introduction Visualization of social networks is challenging because of the visual pollution generated by large number of colors, overlapping connections and space crunch. Especially in networks where the nodes are densely interconnected with diverse node strength, the prevalent visualization methods of social network diagrams are less than effective to read the data and derive meaningful patterns. We propose two visualiztion methodologies which can significantly remove the visual pollution and compare two or more networks efficiently. 2 Data and Task Abstraction To develop solutions, our team chose collaborative brainstroming over aggregating individual member assignments. Ideas were discussed in each stage extensively through face-to-face meetings. The three principles we set were originality, clutter removal and pattern recognition effectiveness. To generate ideas, we loosely implemented the 4-steps of Munzer s nested model(munzner, 2009). The first step, problem identification, was to understand what merits the visualization should give to the domain science research problem. We identified the task as twofold: one is to browse the network ties of a specific data node, the other is to look at the shape of the general network to derive a pattern. In both tasks, differences between various skillsets should be able to be compared easily. We concluded that combining the two tasks into csverma@cs.wisc.edu nkim3@cs.wisc.edu turetsky@cs.wisc.edu 1

a single visualization is hard to achieve and inefficient because the data gets easily cluttered. As a result, we decided to tackle the two taks separately into a egocentric view and a general view. The second step, abstraction, started with critiques on existing models which were shown as part of the initial challenge explanation. The common problem was that egocentric views worked while general views did not, largely due to data cluttering. Moreover, we decided not to merely upgrade existing approaches but to focus on out-of-the-box thinking, given the purpose of the challenge. In the initial discussion, ideas to utilize social network diagrams and adding visual elements such as bar graphs to the data matrix was discarded. As a result, the cylinder view was introduced which focused intensively on data clutter removal by limiting the dataload on one viewpoint and utilizing parallel coordinates. To deal with problems not addressed by the egocentric cylinder view, we expanded our ideas in two ways. One was to build a 2D representation suitable for print, and the other was to build a general view. For the noninteractive 2D version, various dataset comparisons were attempted including a symmetrical distance method. For the general view, the initial idea was to build a 3D surface graph on top of the data matrix. However, it was not effective to represent patterns and not easily comparable with other datasets. What eventually stayed was the top-view of the graph, which was essentially a wafer plot. To further remove data clutter we decided to drop one of the two symmetrical triangular data cell areas inherent in network data. Also we integrated the symmetrical comparison from the 2D approach by putting a different dataset in its place. It was based on the idea that if the human eye can detect symmetry easily, so it can detect non-symmetry where ther should be a symmetry. The third step, encoding, focused on position and color which are suitable to represent metric and categorical data. In the cylinder we used superimposition to compare datasets, while using juxtaposition for the symmetry view. In the fourth step, implementation, we used simple drawing programs to test 2D layouts, programmed a cylinder prototype in OpenGL, and used existing statistics software to draw wafer plots which was reedited in to symmetrical comparison mockups in graphic editing software. In testing those methods, we used the actual dataset instead of generated sample data to represent the real research results as closely as possible. 2

3 Cylindrical View In this view, each concept is represented as a spherical ball ( colored arbitray but distinct for each concept). The center line which lies on the cylinderical surface and drawn with the white line represents the reference line. Association values monotonically decrease away from the reference line and becomes zero at diametrically opposite side. All the association values are normalized in the range (0.0, 1.0) where one represents very high association and zero represents no association and with this range the surface of the cylinder is also colormapped. User can rotate the cylinder at any given angle and with the associate color it is easy to find the association value approximately. With this view, if an user is interested in knowing high correlated concepts, then he/she rotates the cylinder so that the reference line is in the front ( Red Color ) and if he/she wants to understand low correlation data, the cylinder is rotated so that blue color is in front. 3.1 Design Principles It is a long-standing theory that the one of the purposes of visualization is to separate signal from noise in an effective and relevant manner (see Tufte, 1997). To discard a large number of nodes from the first view, a wellknown heuristic that people are interested in knowing either high or low correlation data but not both at the same time can be used. Removing unwanted regions is the prime design goal of the cylindrical view. 3.2 Advantages of the Cylindrical View The cylindrical view has advantages than many other views. Sparsity The data have been distributed along the circumference and if there are two or more concepts having the same association values, than they can be placed along the height of the cylinder and therefore no overlapping will occur. Clustering Data are naturally clustered on the surface and therefore, it is easy to find low/high correlated data immediately. No Visual Pollution Unlike graph representations, there is no visual pollution, no connection line and no cluterring, which results in a clean visualization. 3.3 Disadvantages of the Cylindrical View However, there are also weaknesses in the cylindrical design. Symmetric Placement Seen from the reference line, the concepts can be replaced either left or right side (because they have the same value). Therefore, an user have to rotate cylinder in front and back to see similar concepts. This could be cumbersome and inconvenient after some time. 3

Three Dimensional Also this representation is not printer-friendly because of the 3D view and therefore could be used only on the computers. 3.4 Implementation Details This module was implemented with libqglviewer which is one of the higher level library on top of OpenGL. The entire user defined code consists of less than 100 lines of code. The input to the program is any csv format datafile. 4 Heat-Map Visual Representation The overall visualization is a heat map view of the matrix of data. Since the matrices are symmetrical, one triangle of one matrix of data was removed and replaced with another. Each triangle half was normalized from 0-1 and each point, a vertex of data points on the graph, is mapped to a color based on that number, from a gradient of green to red. The graph is then rotated 45 degrees so that the two triangles would be a mirror images if the data was the same. This was implemented using Statistica, which takes data in an excel like format. 4.1 Design Principles This view mainly focused on the idea of symmetry, and how humans are very quick to focus and identify it. It sets up a view that easily allows for these comparisons. It also was not trying to be too complex. It was meant to be a simple picture that could be interpreted quickly and simply to derive patterns in relatively low-level feature perception(see Ware, 2008). 4.2 Advantages of the Symmetrical View The strength of the symmetraical view is that it is very simple and easy to understand. It uses the redundancy of the data to allow for one image to compare with rather than two. Our current prototype lacks interactivity, but future versions can include the ability to move one column or row and have it mirrored on the other side to allow for easier visualization of groupings. We might also experiment with ways of displaying the triangles with the same orientation, as that would be easier to compare than mirrored images. 4.3 Disadvantages of the Symmetrical View Unfortunately, the method implies a connection between the data sets as the map is interpolated between the known points. Also, there could be observed similarity where there actually isn t any. For example, in one image, there is a similar curve of red in both triangles. Their difference in the direction indicates that they do not share same characteristics, but the shape similarity can still mislead attention. Also the symmetry view does not allow to compare more than two data sets in one visualization. We discussed the possibility of implementing more symmetrical wedges to address this problem, but concluded it will result in data cluttering. 4

4.4 Implementation Details The heatmap was implemented using STATISTICA, which is a data analysis and statistical software tool. It s interface is very much like excel, so the data was modified in the excel spreadsheet form and then transferred into STATISTICA, where the wafer plot was used. The picture was rotated afterwards. 5 Referemce Munzner, T. (2009), A Nested Model for Visualization Design and Validation, IEEE Tufte, E. (1997), Visual Explanations: Images and Quantities, Evidence and Narrative, Graphics Press Ware, C. (2008), Visual Thinking for Design, Morgan Kaufmann 5