DSC 201: Data Analysis & Visualization

Size: px
Start display at page:

Download "DSC 201: Data Analysis & Visualization"

Transcription

1 DSC 201: Data Analysis & Visualization Visualization Tools Dr. David Koop

2 Visualization for Exploration 2

3 MTA Fare Data Exploration 3

4 MTA Fare Data Exploration 4

5 MTA Fare Data Exploration 5

6 MTA Fare Data Exploration 5

7 MTA Fare Data Exploration 6

8 MTA Fare Data Exploration 7

9 MTA Fare Data Exploration A UGUST S U N M O N T U E W E D T H U F R I S A T :10 1 1: :10 3 8:40 YES YES YES 1 T OR 4 SD 11 DET 18 BOS 25 TB DET 5 CHW 12 LAA 19 TOR 26 TOR DET 6 CHW 13 LAA 20 TOR 27 TOR HOU 7 CHW 14 LAA 21 TOR 28 TOR DET 8 COL 15 LAA 22 TOR 29 TOR SD 9 DET 16 BOS 23 TB 30 BAL SD 10 DET 17 BOS 24 TB 31 BAL 4:10 8:10 8:10 8:10 1:10 7:05 1:05 YES YES MY9 YES YES YES YES TBA 7:05 7:05 7:05 1:05 7:10 4:05 TBA YES YES YES YES MY9 FOX TBA 1:10 7:05 7:05 1:05 7:10 7:10 TBA YES MY9 YES YES MY9 YES 1:40 7:07 7:07 7:07 7:07 7:05 1:05 YES YES YES YES YES YES YES S EPT EMBER S U N M O N T U E W E D T H U F R I S A T 1:05 2 1:05 3 7:05 4 7:05 5 7:05 6 7:05 7 1:05 YES YES YES YES YES YES FOX 1 BAL 8 BOS 15 BOS 22 SF 29 HOU CHW CHW CHW BOS BOS 9 BAL 7:05 10 BAL 11 BAL 12 BAL 13 BOS 16 1: TOR TOR TOR TOR SF 23 1:10 YES TB TB TB TB HOU 30 1:10 YES ALL GAMES ARE EASTERN TIME. HOU HOU CHW CHW T OR BOS 14 BOS 21 SF 28 HOU 30 T OR TBA 7:05 7:05 7:05 7:10 1:05 TBA YES MY9 YES YES MY9 FOX TBA 7:07 2:10 1:10 7:07 7:07 7:05 TBA TBA YES MY9 YES YES YES TBA 1:05 7:05 7:05 7:05 8:10 TBA YES YES MY9 YES YES YES TBA R EGULAR SE ASON SCHED ULE 7

10 Visualization for Explanation 8

11 Design Iteration 9

12 Definition Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively. T. Munzner 10

13 Definition Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively. 11

14 Definition NYC Subway Fare Data Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively. 11

15 Definition NYC Subway Fare Data Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively. Find Interesting NYC Subway Ridership Patterns 11

16 Definition Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively. 12

17 Computers Helping People [Cerebral, Barsky et al., 2007] 13

18 Definition Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively 14

19 Why Visual? y y x 1 x y y x x 4 [F. J. Anscombe] 15

20 Definition Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively 16

21 Design Iteration 17

22 Definition Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively 18

23 Effectiveness [S. Hayward, 2015] 19

24 Visualization Tools Analysis Apps: Tableau, Excel, SAS Illustration Apps: Illustrator, Inkscape R Libraries: base, ggplot Python Modules: matplotlib, seaborn, bokeh, altair Lower-level Frameworks: D3, Processing Many, many more: Google "data visualization tools" 20

25 This Class Tableau Python modules: matplotlib, bokeh 21

26 Data Types Items - An item is an individual discrete entity - e.g. row in a table, node in a network Attributes - An attribute is some specific property that can be measured, observed, or logged - a.k.a. variable, (data) dimension, column in a table 22

27 Items & Attributes attribute Field item 22 23

28 Dataset Types Dataset Types Tables Networks Fields (Continuous) Geometry (Spatial) Attributes (columns) Grid of positions Items (rows) Cell containing value Link Node (item) Cell Attributes (columns) Position Multidimensional Table Trees Value in cell Value in cell [Munzner (ill. Maguire), 2014] 24

29 Attribute Types Attribute Types Categorical Ordered Ordinal Quantitative Ordering Direction Sequential Diverging Cyclic [Munzner (ill. Maguire), 2014] 25

30 Categorial, Ordinal, and Quantitative 1 = Quantitative 23 2 ordinal = Nominal 3 = Ordinal quantitative categorical 26

31 Categorial, Ordinal, and Quantitative 1 = Quantitative 24 2 ordinal = Nominal 3 = Ordinal quantitative categorical 27

32 Tableau Overview Grew out of research at Stanford University on how to explore multidimensional datasets & relational databases Tableau Desktop: standalone (free trial, student license) Tableau Public: cloud-based system (free) Tableau Vizable: mobile app Tableau's Introduction Videos 28

33 Tableau High-level GUI that connects to data, helps organize it, and provides intuitive routines for visualizing it plus customization Lots of possibilities Great for exploration 29

34 Tableau Example 30

35 Data In Tableau Categorical data = Dimension Quantitative data = Measures 31

36 matplotlib The workhorse of python visualization seaborn builds on top of matplotlib Many new kids on the block: bokeh, altair 32

37 matplotlib %matplotlib inline (show plots in the notebook!) Always create a figure first, then draw plots Lots of high-level plotting types (line plots, scatterplots, histograms) Lots of customizability 33

38 Shortcuts in pandas Connect directly to pandas data frames - df.plot pandas.dataframe.plot.html 34

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Visualization Design Dr. David Koop Definition Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks

More information

Scalable Data Analysis (CIS )

Scalable Data Analysis (CIS ) Scalable Data Analysis (CIS 602-01) Introduction Dr. David Koop NYC Taxi Data [Analyzing 1.1 Billion NYC Taxi and Uber Trips, with a Vengeance, T. W. Schneider] 2 What are your questions about this data?

More information

Data Visualization (CIS/DSC 468)

Data Visualization (CIS/DSC 468) Data Visualization (CIS/DSC 468) Data & Tasks Dr. David Koop Programmatic SVG Example Draw a horizontal bar chart - var a = [6, 2, 6, 10, 7, 18, 0, 17, 20, 6]; Steps: - Programmatically create SVG - Create

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Exploratory Data Analysis Dr. David Koop What is Exploratory Data Analysis? "Detective work" to summarize and explore datasets Includes: - Data acquisition and input

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Python and Notebooks Dr. David Koop Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively.

More information

Data Visualization (DSC 530/CIS )

Data Visualization (DSC 530/CIS ) Data Visualization (DSC 530/CIS 602-01) HTML, CSS, & SVG Dr. David Koop Data Visualization What is it? How does it differ from computer graphics? What types of data can we visualize? What tasks can we

More information

Data Visualization (CIS 468)

Data Visualization (CIS 468) Data Visualization (CIS 468) Web Programming Dr. David Koop What is Data Visualization? 2 Exploration Communication Spectrum Consecutive Starts by a Quarterback for a Single Team Exploration Confirmation

More information

Data Visualization (CIS/DSC 468)

Data Visualization (CIS/DSC 468) Data Visualization (CIS/DSC 468) Web Programming Dr. David Koop Definition of Visualization Computer-based visualization systems provide visual representations of datasets designed to help people carry

More information

THINKING VISUALLY: AN INTRODUCTION TO DATA & INFORMATION VISUALIZATION

THINKING VISUALLY: AN INTRODUCTION TO DATA & INFORMATION VISUALIZATION THINKING VISUALLY: AN INTRODUCTION TO DATA & INFORMATION VISUALIZATION Learning Event for CES Ontario Li Ka Shing Knowledge Institute June 22, 2016 Jesse Carliner ACTING COMMUNICATIONS & REFERENCE LIBRARIAN

More information

3.Data Abstraction. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai 1 / 26

3.Data Abstraction. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai   1 / 26 3.Data Abstraction Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai www.learnersdesk.weebly.com 1 / 26 Outline What can be visualized? Why Do Data Semantics and Types Matter? Data Types Items, Attributes,

More information

Matplotlib Python Plotting

Matplotlib Python Plotting Matplotlib Python Plotting 1 / 6 2 / 6 3 / 6 Matplotlib Python Plotting Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive

More information

Data Visualization (DSC 530/CIS )

Data Visualization (DSC 530/CIS ) Data Visualization (DSC 530/CIS 602-01) Data Dr. David Koop HTML and CSS HTML: Tags define the boundaries of the structures of the content this is cool. What about this?

More information

Visualization Analysis & Design Full-Day Tutorial Session 1

Visualization Analysis & Design Full-Day Tutorial Session 1 Visualization Analysis & Design Full-Day Tutorial Session 1 Tamara Munzner Department of Computer Science University of British Columbia Sanger Institute / European Bioinformatics Institute June 2014,

More information

Python for Data Analysis. Prof.Sushila Aghav-Palwe Assistant Professor MIT

Python for Data Analysis. Prof.Sushila Aghav-Palwe Assistant Professor MIT Python for Data Analysis Prof.Sushila Aghav-Palwe Assistant Professor MIT Four steps to apply data analytics: 1. Define your Objective What are you trying to achieve? What could the result look like? 2.

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Exploratory Data Analysis Dr. David Koop http://www.cis.umassd.edu/~dkoop/dsc201 2 Chicago Food Inspections Exploration Based on David Beazley's PyData Chicago talk

More information

Pandas plotting capabilities

Pandas plotting capabilities Pandas plotting capabilities Pandas built-in capabilities for data visualization it's built-off of matplotlib, but it's baked into pandas for easier usage. It provides the basic statistic plot types. Let's

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Exploratory Data Analysis Dr. David Koop Python Support for Time The datetime package - Has date, time, and datetime classes -.now() method: the current datetime

More information

Data Visualization (CIS 468)

Data Visualization (CIS 468) Data Visualization (CIS 468) D3 + Marks & Channels Dr. David Koop Tasks Actions Targets Analyze All Data Consume Trends Outliers Features Discover Present Enjoy Produce Annotate Record Derive tag Attributes

More information

STA 490H1S Initial Examination of Data

STA 490H1S Initial Examination of Data Initial Examination of Data Alison L. Department of Statistics University of Toronto Winter 2011 Course mantra It s OK not to know. Expressing ignorance is encouraged. It s not OK to not have a willingness

More information

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python: Code Mania 2019 Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python: 1. Introduction to Artificial Intelligence 2. Introduction to python programming and Environment

More information

Business Analytics Nanodegree Syllabus

Business Analytics Nanodegree Syllabus Business Analytics Nanodegree Syllabus Master data fundamentals applicable to any industry Before You Start There are no prerequisites for this program, aside from basic computer skills. You should be

More information

Automation.

Automation. Automation www.austech.edu.au WHAT IS AUTOMATION? Automation testing is a technique uses an application to implement entire life cycle of the software in less time and provides efficiency and effectiveness

More information

Certified Data Science with Python Professional VS-1442

Certified Data Science with Python Professional VS-1442 Certified Data Science with Python Professional VS-1442 Certified Data Science with Python Professional Certified Data Science with Python Professional Certification Code VS-1442 Data science has become

More information

CSC 1315! Data Science

CSC 1315! Data Science CSC 1315! Data Science Data Visualization Based on: Python for Data Analysis: http://hamelg.blogspot.com/2015/ Learning IPython for Interactive Computation and Visualization by C. Rossant Plotting with

More information

CP SC 8810 Data Visualization. Joshua Levine

CP SC 8810 Data Visualization. Joshua Levine CP SC 8810 Data Visualization Joshua Levine levinej@clemson.edu Lecture 15 Text and Sets Oct. 14, 2014 Agenda Lab 02 Grades! Lab 03 due in 1 week Lab 2 Summary Preferences on x-axis label separation 10

More information

DATA 301 Introduction to Data Analytics Visualization. Dr. Ramon Lawrence University of British Columbia Okanagan

DATA 301 Introduction to Data Analytics Visualization. Dr. Ramon Lawrence University of British Columbia Okanagan DATA 301 Introduction to Data Analytics Visualization Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca DATA 301: Data Analytics (2) Why learn Visualization? Visualization

More information

Subject. Creating a diagram. Dataset. Importing the data file. Descriptive statistics with TANAGRA.

Subject. Creating a diagram. Dataset. Importing the data file. Descriptive statistics with TANAGRA. Subject Descriptive statistics with TANAGRA. The aim of descriptive statistics is to describe the main features of a collection of data in quantitative terms 1. The visualization of the whole data table

More information

What are we working with? Data Abstractions. Week 4 Lecture A IAT 814 Lyn Bartram

What are we working with? Data Abstractions. Week 4 Lecture A IAT 814 Lyn Bartram What are we working with? Data Abstractions Week 4 Lecture A IAT 814 Lyn Bartram Munzner s What-Why-How What are we working with? DATA abstractions, statistical methods Why are we doing it? Task abstractions

More information

CTSI Module 8 Workshop Introduction to Biomedical Informatics, Part V

CTSI Module 8 Workshop Introduction to Biomedical Informatics, Part V CTSI Module 8 Workshop Introduction to Biomedical Informatics, Part V Practical Tools: Data Processing & Analysis William Hsu, PhD Assistant Professor Medical Imaging Informatics Group Dept of Radiological

More information

Data Visualization (DSC 530/CIS )

Data Visualization (DSC 530/CIS ) Data Visualization (DSC 530/CIS 60201) CSS, SVG, and JavaScript Dr. David Koop Definition of Visualization Computerbased visualization systems provide visual representations of datasets designed to help

More information

SPSS TRAINING SPSS VIEWS

SPSS TRAINING SPSS VIEWS SPSS TRAINING SPSS VIEWS Dataset Data file Data View o Full data set, structured same as excel (variable = column name, row = record) Variable View o Provides details for each variable (column in Data

More information

Intermediate/Advanced Python. Michael Weinstein (Day 2)

Intermediate/Advanced Python. Michael Weinstein (Day 2) Intermediate/Advanced Python Michael Weinstein (Day 2) Topics Review of basic data structures Accessing and working with objects in python Numpy How python actually stores data in memory Why numpy can

More information

Python Quant Platform

Python Quant Platform Python Quant Platform Web-based Financial Analytics and Rapid Financial Engineering with Python Yves Hilpisch The Python Quant Platform offers Web-based, scalable, collaborative financial analytics and

More information

INTRODUCTION TO DATA. Welcome to the course!

INTRODUCTION TO DATA. Welcome to the course! INTRODUCTION TO DATA Welcome to the course! High School and Beyond id gender race socst 70 male white 57 121 female white 61 86 male white 31 137 female white 61 Loading data > # Load package > library(openintro)

More information

OREKIT IN PYTHON ACCESS THE PYTHON SCIENTIFIC ECOSYSTEM. Petrus Hyvönen

OREKIT IN PYTHON ACCESS THE PYTHON SCIENTIFIC ECOSYSTEM. Petrus Hyvönen OREKIT IN PYTHON ACCESS THE PYTHON SCIENTIFIC ECOSYSTEM Petrus Hyvönen 2017-11-27 SSC ACTIVITIES Public Science Services Satellite Management Services Engineering Services 2 INITIAL REASON OF PYTHON WRAPPED

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Aggregation Dr. David Koop Data Analysis Scenarios Often want to analyze data by some grouping: - Dogs vs. cats - Millennials vs. Gen-X vs. Baby Boomers - Physics

More information

2017 NYSDOH Tableau Sepsis Audit Reports User Guide 1

2017 NYSDOH Tableau Sepsis Audit Reports User Guide 1 2017 NYSDOH Tableau Sepsis Audit Reports User Guide 1 1 Aligns with the June 20, 2018 live WebEx: Recording of the event is available at https://ny.sepsis.ipro.org/ Contents TABLEAU...3 WHY TABLEAU?...3

More information

Multidimensional (Multivariate)

Multidimensional (Multivariate) Multidimensional (Multivariate) Data Visualization IV Course Spring 14 Graduate Course of UCAS May 9th, 2014 1 Data by Dimensionality 1-D (Linear, Set and Sequences) SeeSoft, Info Mural 2-D (Map) GIS,

More information

AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE

AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE A.P. Ruhil and Tara Chand National Dairy Research Institute, Karnal-132001 JMP commonly pronounced as Jump is a statistical software

More information

Introduction to Geospatial Analysis

Introduction to Geospatial Analysis Introduction to Geospatial Analysis Introduction to Geospatial Analysis 1 Descriptive Statistics Descriptive statistics. 2 What and Why? Descriptive Statistics Quantitative description of data Why? Allow

More information

Perception Maneesh Agrawala CS : Visualization Fall 2013 Multidimensional Visualization

Perception Maneesh Agrawala CS : Visualization Fall 2013 Multidimensional Visualization Perception Maneesh Agrawala CS 294-10: Visualization Fall 2013 Multidimensional Visualization 1 Visual Encoding Variables Position Length Area Volume Value Texture Color Orientation Shape ~8 dimensions?

More information

Data Analytics Training Program

Data Analytics Training Program Data Analytics Training Program In exclusive association with 1200+ Trainings 20,000+ Participants 10,000+ Brands 45+ Countries [Since 2009] Training partner for Who Is This Course For? Programers Willing

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

DATA ABSTRACTION & INTRO TO TABLEAU

DATA ABSTRACTION & INTRO TO TABLEAU cs6630 September 4 2014 DATA ABSTRACTION & INTRO TO TABLEAU Miriah Meyer University of Utah 1 administrivia... 2 - design critiques due tonight - first assignment out today - there *might* be 3 seats available

More information

Properties of Data. Digging into Data: Jordan Boyd-Graber. University of Maryland. February 11, 2013

Properties of Data. Digging into Data: Jordan Boyd-Graber. University of Maryland. February 11, 2013 Properties of Data Digging into Data: Jordan Boyd-Graber University of Maryland February 11, 2013 Digging into Data: Jordan Boyd-Graber (UMD) Properties of Data February 11, 2013 1 / 43 Roadmap Munging

More information

Data Visualization (DSC 530/CIS )

Data Visualization (DSC 530/CIS ) Data Visualization (DSC 530/CIS 60-0) Isosurfaces & Volume Rendering Dr. David Koop Fields & Grids Fields: - Values come from a continuous domain, infinitely many values - Sampled at certain positions

More information

Data Science. Data Analyst. Data Scientist. Data Architect

Data Science. Data Analyst. Data Scientist. Data Architect Data Science Data Analyst Data Analysis in Excel Programming in R Introduction to Python/SQL/Tableau Data Visualization in R / Tableau Exploratory Data Analysis Data Scientist Inferential Statistics &

More information

2. Getting started with MLwiN

2. Getting started with MLwiN 2. Getting started with MLwiN Introduction This chapter aims to provide you with some practice with MLwiN commands before you begin to fit multilevel models. It is may be helpful if you have already read

More information

CIS 467/602-01: Data Visualization

CIS 467/602-01: Data Visualization CIS 467/602-01: Data Visualization Tables Dr. David Koop Assignment 2 http://www.cis.umassd.edu/ ~dkoop/cis467/assignment2.html Plagiarism on Assignment 1 Any questions? 2 Recap (Interaction) Important

More information

Making Science Graphs and Interpreting Data

Making Science Graphs and Interpreting Data Making Science Graphs and Interpreting Data Eye Opener: 5 mins What do you see? What do you think? Look up terms you don t know What do Graphs Tell You? A graph is a way of expressing a relationship between

More information

Data Science Bootcamp Curriculum. NYC Data Science Academy

Data Science Bootcamp Curriculum. NYC Data Science Academy Data Science Bootcamp Curriculum NYC Data Science Academy 100+ hours free, self-paced online course. Access to part-time in-person courses hosted at NYC campus Machine Learning with R and Python Foundations

More information

Introduction to Data Analytics. David Walling

Introduction to Data Analytics. David Walling Introduction to Data Analytics David Walling walling@tacc.utexas.edu Source: http://research.microsoft.com/en-us/collaboration/fourthparadigm/default.aspx Computational Simulation Model first, given initial

More information

Data Visualization (CIS/DSC 468)

Data Visualization (CIS/DSC 468) Data Visualization (CIS/DSC 468) Marks & Channels Dr. David Koop D3 Pattern Select visual elements (d3.select, d3.selectall) Join them with data items (.data(mydata, key_function)) Using enter, update,

More information

Data Visualization. Fall 2017

Data Visualization. Fall 2017 Data Visualization Fall 2017 Course Targets and Goals Getting acquainted with advanced techniques of visualization of scientific and technical data (spatial and non-spatial data) Application of selected

More information

pandas: Rich Data Analysis Tools for Quant Finance

pandas: Rich Data Analysis Tools for Quant Finance pandas: Rich Data Analysis Tools for Quant Finance Wes McKinney April 24, 2012, QWAFAFEW Boston about me MIT 07 AQR Capital: 2007-2010 Global Macro and Credit Research WES MCKINNEY pandas: 2008 - Present

More information

R for IR. Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University

R for IR. Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University R for IR Created by Narren Brown, Grinnell College, and Diane Saphire, Trinity University For presentation at the June 2013 Meeting of the Higher Education Data Sharing Consortium Table of Contents I.

More information

Intro to Stata for Political Scientists

Intro to Stata for Political Scientists Intro to Stata for Political Scientists Andrew S. Rosenberg Junior PRISM Fellow Department of Political Science Workshop Description This is an Introduction to Stata I will assume little/no prior knowledge

More information

CIS : Scalable Data Analysis

CIS : Scalable Data Analysis CIS 602-01: Scalable Data Analysis Visualization Dr. David Koop Growth of Data 2 Usefulness of Data 3 Analyzed Data 4 Example Data Sources Radio Telescopes Twitter Wind Turbine Sensors Surveillance Cameras

More information

Lecture 3: Data Principles

Lecture 3: Data Principles Lecture 3: Data Principles Information Visualization CPSC 533C, Fall 2011 Tamara Munzner UBC Computer Science Mon, 19 September 2011 1 / 33 Papers Covered Chapter 2: Data Principles Polaris: A System for

More information

Error-Bar Charts from Summary Data

Error-Bar Charts from Summary Data Chapter 156 Error-Bar Charts from Summary Data Introduction Error-Bar Charts graphically display tables of means (or medians) and variability. Following are examples of the types of charts produced by

More information

GRAPHING BAYOUSIDE CLASSROOM DATA

GRAPHING BAYOUSIDE CLASSROOM DATA LUMCON S BAYOUSIDE CLASSROOM GRAPHING BAYOUSIDE CLASSROOM DATA Focus/Overview This activity allows students to answer questions about their environment using data collected during water sampling. Learning

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

Information Visualization. SWE 432, Fall 2016 Design and Implementation of Software for the Web

Information Visualization. SWE 432, Fall 2016 Design and Implementation of Software for the Web Information Visualization SWE 432, Fall 2016 Design and Implementation of Software for the Web Today What types of information visualization are there? Which one should you choose? What does usability

More information

JUPYTER (IPYTHON) NOTEBOOK CHEATSHEET

JUPYTER (IPYTHON) NOTEBOOK CHEATSHEET JUPYTER (IPYTHON) NOTEBOOK CHEATSHEET About Jupyter Notebooks The Jupyter Notebook is a web application that allows you to create and share documents that contain executable code, equations, visualizations

More information

Visualization Analysis & Design

Visualization Analysis & Design Visualization Analysis & Design Tamara Munzner Department of Computer Science University of British Columbia InformationPlus 2016 Keynote June 16 2016, Vancouver BC http://www.cs.ubc.ca/~tmm/talks.html#vad16infoplus

More information

Tutorial on Machine Learning Tools

Tutorial on Machine Learning Tools Tutorial on Machine Learning Tools Yanbing Xue Milos Hauskrecht Why do we need these tools? Widely deployed classical models No need to code from scratch Easy-to-use GUI Outline Matlab Apps Weka 3 UI TensorFlow

More information

Scientific Visualization

Scientific Visualization Scientific Visualization Topics Motivation Color InfoVis vs. SciVis VisTrails Core Techniques Advanced Techniques 1 Check Assumptions: Why Visualize? Problem: How do you apprehend 100k tuples? when your

More information

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data. Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting

More information

Data analysis using Microsoft Excel

Data analysis using Microsoft Excel Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data

More information

What is OneNote? The first time you start OneNote, it asks you to sign in. Sign in with your personal Microsoft account.

What is OneNote? The first time you start OneNote, it asks you to sign in. Sign in with your personal Microsoft account. OneNote What is OneNote? OneNote is a digital notebook. In OneNote, you can: Type notes or record audio at your laptop. Sketch or write ideas on your tablet. Add picture from your phone. Find notes instantly.

More information

Geometry. Dilations. Talk About It. More Ideas. Formative Assessment

Geometry. Dilations. Talk About It. More Ideas. Formative Assessment 5 Objective Common Core State Standards Dilations A dilation is a transformation that produces an image that is the same shape as the original but of a different size. Dilations with a greater than 1 produce

More information

Introduction (SPSS) Opening SPSS Start All Programs SPSS Inc SPSS 21. SPSS Menus

Introduction (SPSS) Opening SPSS Start All Programs SPSS Inc SPSS 21. SPSS Menus Introduction (SPSS) SPSS is the acronym of Statistical Package for the Social Sciences. SPSS is one of the most popular statistical packages which can perform highly complex data manipulation and analysis

More information

Tableau Advanced Training. Student Guide April x. For Evaluation Only

Tableau Advanced Training. Student Guide April x. For Evaluation Only Tableau Advanced Training Student Guide www.datarevelations.com 914.945.0567 April 2017 10.x Contents A. Warm Up 1 Bar Chart Colored by Profit 1 Salary Curve 2 2015 v s. 2014 Sales 3 VII. Programmatic

More information

Desktop Layout Portfolio

Desktop Layout Portfolio Rensselaer Polytechnic Institute Desktop Layout Portfolio Osvaldo Rosado & Adam Ryason Interactive Visualization Spring 2016 INTRODUCTION Computers are generally competent at doing what we tell them to

More information

David J. Pine. Introduction to Python for Science & Engineering

David J. Pine. Introduction to Python for Science & Engineering David J. Pine Introduction to Python for Science & Engineering To Alex Pine who introduced me to Python Contents Preface About the Author xi xv 1 Introduction 1 1.1 Introduction to Python for Science and

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

Session 1: Do's and Don'ts of Data Visualization. Presenter: Randy Olson, Ph.D.

Session 1: Do's and Don'ts of Data Visualization. Presenter: Randy Olson, Ph.D. Session 1: Do's and Don'ts of Data Visualization Presenter: Randy Olson, Ph.D. DO S AND DON TS OF DATA VISUALIZATION Dr. Randy Olson rso@randalolson.com @randal_olson randalolson.com ABOUT ME AI researcher

More information

pandas & ggplot quick analysis with python and friends Vincent Warmerdam Data

pandas & ggplot quick analysis with python and friends Vincent Warmerdam Data pandas & ggplot quick analysis with python and friends Vincent Warmerdam Data Scientist @fishnets88 vincentwarmerdam@godatadriven.com GoDataDriven PROUDLY PART OF THE XEBIA GROUP Who is this guy? - Data

More information

Data Visualization. Fall 2016

Data Visualization. Fall 2016 Data Visualization Fall 2016 Information Visualization Upon now, we dealt with scientific visualization (scivis) Scivisincludes visualization of physical simulations, engineering, medical imaging, Earth

More information

SPATIAL DATA MODELS Introduction to GIS Winter 2015

SPATIAL DATA MODELS Introduction to GIS Winter 2015 SPATIAL DATA MODELS Introduction to GIS Winter 2015 GIS Data Organization The basics Data can be organized in a variety of ways Spatial location, content (attributes), frequency of use Come up with a system

More information

PYTHON DATA VISUALIZATIONS

PYTHON DATA VISUALIZATIONS PYTHON DATA VISUALIZATIONS from Learning Python for Data Analysis and Visualization by Jose Portilla https://www.udemy.com/learning-python-for-data-analysis-and-visualization/ Notes by Michael Brothers

More information

An introduction to ggplot: An implementation of the grammar of graphics in R

An introduction to ggplot: An implementation of the grammar of graphics in R An introduction to ggplot: An implementation of the grammar of graphics in R Hadley Wickham 00-0-7 1 Introduction Currently, R has two major systems for plotting data, base graphics and lattice graphics

More information

Statistics 251: Statistical Methods

Statistics 251: Statistical Methods Statistics 251: Statistical Methods Summaries and Graphs in R Module R1 2018 file:///u:/documents/classes/lectures/251301/renae/markdown/master%20versions/summary_graphs.html#1 1/14 Summary Statistics

More information

The Python interpreter

The Python interpreter The Python interpreter Daniel Winklehner, Remi Lehe US Particle Accelerator School (USPAS) Summer Session Self-Consistent Simulations of Beam and Plasma Systems S. M. Lund, J.-L. Vay, D. Bruhwiler, R.

More information

CIS 467/602-01: Data Visualization

CIS 467/602-01: Data Visualization CIS 467/60-01: Data Visualization Isosurfacing and Volume Rendering Dr. David Koop Fields and Grids Fields: values come from a continuous domain, infinitely many values - Sampled at certain positions to

More information

MATH 117 Statistical Methods for Management I Chapter Two

MATH 117 Statistical Methods for Management I Chapter Two Jubail University College MATH 117 Statistical Methods for Management I Chapter Two There are a wide variety of ways to summarize, organize, and present data: I. Tables 1. Distribution Table (Categorical

More information

Facets and Continuous graphs

Facets and Continuous graphs Facets and Continuous graphs One way to add additional variables is with aesthetics. Another way, particularly useful for categorical variables, is to split your plot into facets, subplots that each display

More information

Manual Data Validation Excel 2010 List From Another Workbook

Manual Data Validation Excel 2010 List From Another Workbook Manual Data Validation Excel 2010 List From Another Workbook It is quite easy to create a data validation drop down list among worksheets you will learn how to create a drop fown list from another workbook

More information

Data Foundations. Topic Objectives. and list subcategories of each. its properties. before producing a visualization. subsetting

Data Foundations. Topic Objectives. and list subcategories of each. its properties. before producing a visualization. subsetting CS 725/825 Information Visualization Fall 2013 Data Foundations Dr. Michele C. Weigle http://www.cs.odu.edu/~mweigle/cs725-f13/ Topic Objectives! Distinguish between ordinal and nominal values and list

More information

CS-5630 / CS-6630 Visualization for Data Science The Visualization Alphabet: Marks and Channels

CS-5630 / CS-6630 Visualization for Data Science The Visualization Alphabet: Marks and Channels CS-5630 / CS-6630 Visualization for Data Science The Visualization Alphabet: Marks and Channels Alexander Lex alex@sci.utah.edu [xkcd] How can I visually represent two numbers, e.g., 4 and 8 Marks & Channels

More information

VISUALIZATION OF MULTIVARIATE DATA

VISUALIZATION OF MULTIVARIATE DATA VISUALIZATION OF MULTIVARIATE DATA Prof. Rahul C. Basole CS/MGT 8803-DV > January 18, 2017 True/False? Explain Why. InfoVis SciVis Name this Visual Representation Name this Interaction Technique Show of

More information

HYPERVARIATE DATA VISUALIZATION

HYPERVARIATE DATA VISUALIZATION HYPERVARIATE DATA VISUALIZATION Prof. Rahul C. Basole CS/MGT 8803-DV > January 25, 2017 Agenda Hypervariate Data Project Elevator Pitch Hypervariate Data (n > 3) Many well-known visualization techniques

More information

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide 2011 Version 6A Do these tutorials first This series of tutorials provides a quick start to using WINKS. Feel free

More information

Multi-Dimensional Vis

Multi-Dimensional Vis CSE512 :: 21 Jan 2014 Multi-Dimensional Vis Jeffrey Heer University of Washington 1 Last Time: Exploratory Data Analysis 2 Exposure, the effective laying open of the data to display the unanticipated,

More information

Revision Topic 11: Straight Line Graphs

Revision Topic 11: Straight Line Graphs Revision Topic : Straight Line Graphs The simplest way to draw a straight line graph is to produce a table of values. Example: Draw the lines y = x and y = 6 x. Table of values for y = x x y - - - - =

More information

Data Visualization (CIS/DSC 468)

Data Visualization (CIS/DSC 468) Data Visualization (CIS/DSC 468) Tabular Data Dr. David Koop Channel Considerations Discriminability Separability Visual Popout Weber's Law Luminance Perception 2 Separability Cannot treat all channels

More information

Connecting ArcGIS with R and Conda. Shaun Walbridge

Connecting ArcGIS with R and Conda. Shaun Walbridge Connecting ArcGIS with R and Conda Shaun Walbridge https://github.com/sc w/nyc-r-ws High Quality PDF ArcGIS Today: R and Conda Conda Introduction Optional demo R and the R-ArcGIS Bridge Introduction Demo

More information

CSE Data Visualization. Multidimensional Vis. Jeffrey Heer University of Washington

CSE Data Visualization. Multidimensional Vis. Jeffrey Heer University of Washington CSE 512 - Data Visualization Multidimensional Vis Jeffrey Heer University of Washington Last Time: Exploratory Data Analysis Exposure, the effective laying open of the data to display the unanticipated,

More information

Brief Guide on Using SPSS 10.0

Brief Guide on Using SPSS 10.0 Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new

More information