data visualization Show the Data Snow Month skimming deep waters

Similar documents
An Introduction to R Graphics

Facets and Continuous graphs

Module 10. Data Visualization. Andrew Jaffe Instructor

Data Visualization. Andrew Jaffe Instructor

INTRODUCTION TO R. Basic Graphics

Intro to R for Epidemiologists

Rstudio GGPLOT2. Preparations. The first plot: Hello world! W2018 RENR690 Zihaohan Sang

Intro to R Graphics Center for Social Science Computation and Research, 2010 Stephanie Lee, Dept of Sociology, University of Washington

Importing and visualizing data in R. Day 3

Data visualization with ggplot2

LondonR: Introduction to ggplot2. Nick Howlett Data Scientist

Using Built-in Plotting Functions

AA BB CC DD EE. Introduction to Graphics in R

Plotting with ggplot2: Part 2. Biostatistics

Large data. Hadley Wickham. Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University.

Introduction to R for Beginners, Level II. Jeon Lee Bio-Informatics Core Facility (BICF), UTSW

DATA VISUALIZATION WITH GGPLOT2. Coordinates

Chapter 3. Determining Effective Data Display with Charts

Graphics in R Ira Sharenow January 2, 2019

Introduction to R for Epidemiologists

Introduction to Data Visualization

Stat405. Displaying distributions. Hadley Wickham. Thursday, August 23, 12

Information Visualization. SWE 432, Fall 2016 Design and Implementation of Software for the Web

Configuring Figure Regions with prepplot Ulrike Grömping 03 April 2018

Lecture 4: Data Visualization I

ggplot2 basics Hadley Wickham Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University September 2011

Graphical critique & theory. Hadley Wickham

Data Visualization Using R & ggplot2. Karthik Ram October 6, 2013

Statistical transformations

Package harrypotter. September 3, 2018

limma: A brief introduction to R

Lab 1 Introduction to R

DATA VISUALIZATION WITH GGPLOT2. Grid Graphics

R Workshop 1: Introduction to R

ggplot in 3 easy steps (maybe 2 easy steps)

The following presentation is based on the ggplot2 tutotial written by Prof. Jennifer Bryan.

Package panelview. April 24, 2018

Stat 849: Plotting responses and covariates

Econ 2148, spring 2019 Data visualization

Package MSG. R topics documented: August 29, Type Package

Introduction to ggvis. Aimee Gott R Consultant

Package sciplot. February 15, 2013

DSCI 325: Handout 18 Introduction to Graphics in R

Information Visualization in Data Mining. S.T. Balke Department of Chemical Engineering and Applied Chemistry University of Toronto

Creating elegant graphics in R with ggplot2

STAT 1291: Data Science

Teaching univariate measures of location-using loss functions

Plotting: An Iterative Process

Getting started with ggplot2

Graphics - Part III: Basic Graphics Continued

visualizing q uantitative quantitative information information

Introduction to R. Biostatistics 615/815 Lecture 23

Package hbm. February 20, 2015

The diamonds dataset Visualizing data in R with ggplot2

Statistical Programming with R

Plotting Complex Figures Using R. Simon Andrews v

You submitted this quiz on Sat 17 May :19 AM CEST. You got a score of out of

Package beanplot. R topics documented: February 19, Type Package

Statistical Programming Camp: An Introduction to R

1 The ggplot2 workflow

Introduction to R: Day 2 September 20, 2017

Package lvplot. August 29, 2016

Types of Plotting Functions. Managing graphics devices. Further High-level Plotting Functions. The plot() Function

Exploratory Projection Pursuit

Az R adatelemzési nyelv

Session 5 Nick Hathaway;

Stat 849: Plotting responses and covariates

Basic Statistical Graphics in R. Stem and leaf plots 100,100,100,99,98,97,96,94,94,87,83,82,77,75,75,73,71,66,63,55,55,55,51,19

Basics of Plotting Data

Module 6: Advanced Plotting in R

Data Visualization in R

CS Introduction to Computational and Data Science. Instructor: Renzhi Cao Computer Science Department Pacific Lutheran University Spring 2017

Shrinkage of logarithmic fold changes

IST 3108 Data Analysis and Graphics Using R Week 9

Introduction to R and the tidyverse. Paolo Crosetto

Advanced Graphics with R

Graphics in R STAT 133. Gaston Sanchez. Department of Statistics, UC Berkeley

Preliminary Figures for Renormalizing Illumina SNP Cell Line Data

Short tutorial on studying module preservation: Preservation of female mouse liver modules in male data

Easy interactive ggplots

Package ggextra. April 4, 2018

Package nonmem2r. April 5, 2018

The first one will centre the data and ensure unit variance (i.e. sphere the data):

Data Visualization in R

ggplot2 and maps Marcin Kierczak 11/10/2016

Introduction to Graphics with ggplot2

03 - Intro to graphics (with ggplot2)

plot(seq(0,10,1), seq(0,10,1), main = "the Title", xlim=c(1,20), ylim=c(1,20), col="darkblue");

A set of rules describing how to compose a 'vocabulary' into permissible 'sentences'

An introduction to R Graphics 4. ggplot2

ggplot2 for beginners Maria Novosolov 1 December, 2014

The Average and SD in R

Solution Set 8. Andrew Goldstone March 26, 2015

CQN (Conditional Quantile Normalization)

Maps & layers. Hadley Wickham. Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University.

Data Visualization. Module 7

Practical 2: Plotting

Package ggdark. R topics documented: January 11, Type Package Title Dark Mode for 'ggplot2' Themes Version Author Neal Grantham

Mixed models in R using the lme4 package Part 2: Lattice graphics

Package MFDFA. April 18, 2018

Transcription:

data visualization skimming deep waters Show the Data Snow 2 4 6 8 12

Minimize Distraction Minimize Distraction Snow 2 4 6 8 12 2 4 6 8 12 Make Big Data Coherent Reveal Several Levels of Detail 1974 1975 1976 1977 1978 1979 198 1981 1982 1983 1984 1985 1986 1987 1988 1989 199 1992 1993 1994 1995 1996 1997 3 6 9 12 3 6 9 12 3 6 9 12 Mean Snow by Across s 2 3 4 5 6 7 8 9 11 12 198 199 198 199 198 199

Be Closely Integrated with Statistics The Data:Ink Ratio Mean Snow by Across s 2 3 4 5 6 7 8 9 11 12 1. Above all else show data. 2. Maximize the data-ink ratio. 3. Erase non-data-ink. 4. Erase redundant data-ink. 5. Revise and edit 198 199 198 199 198 199 Minimizing Ink Minimizing Ink Mean Snow by Across s Mean Snow by Across s 2 3 4 5 2 3 4 5 6 7 8 9 11 12 6 7 8 9 11 12 198 199 198 199 198 199 198 199 198 199 198 199

Extra Ink in Boxes & Lines A Cleaner Boxplot 2.5 5. 7.5. 12.5 2.5 5. 7.5. 12.5 Pure Tufte Boxplots Basic Plotting in R diatom 2.5 5. 7.5. 12.5

cyanobacteria.tota diatom dinoflagellate green Snow Visualizing a Lot of the Data A Basic Bivariate Plot 4 pairs(plankton[, 14:18]) 4 15 4 diatom, data = plankton) 5 15 25 5 15 15 4 diatom A Basic Bivariate Plot Adding Axis Labels plot(plankton$diatom, plankton$) plankton$ 5 15 25 diatom, data = plankton, xlab = "", ylab = "", xlim = c(, )) 5 15 25 5 15 plankton$diatom 5 15

Adding Axis Limits More Point Shapes diatom, data = plankton, xlab = "", ylab = "", xlim = c(, )) 5 15 25 diatom, data = plankton, xlab = "", ylab = "", xlim = c(, ), pch = 19) 5 15 25 5 15 5 15 More Point Shapes cex for Size plot symbols : points (... pch = *, cex = 3 ) 1 2 3 4 5 6 7 8 9 11 12 13 14 15 16 17 18 19 21 22 23 24 25 *. o * o OO + + %% # # diatom, data = plankton, xlab = "", ylab = "", xlim = c(, ), pch = 19, cex = 4) See also cex.axis, cex.lab, and more. 5 15 5 15 25

Add a Little Color Panels with Par and Mfrow diatom, data = plankton, xlab = "", ylab = "", xlim = c(, ), pch = 19, col = ) 5 15 25 par(mfrow = c(1, 2)) 5 15 25 Daphnia Abundnace 4 6 8 1 5 15 5 5 Lots of Other Functions that For Plots?matplot?lines?axis?title?legend?points?segments ggplot2 or how I learned to stop worring and love http://had.co.nz/ggplot2 & http://stackoverflow.com/ So...Explore! Plot with the data, try different par settings, or use some of these functions!

Start with nothing... p <- ggplot(data = plankton, mapping = aes(x =, y = )) p ## Error: No layers in plot There is no layout specified here for the data. Add a Layer Format with Theme p <- p + geom_point() p p <- p + ylab("total Copepod Abundance") + theme_bw() p Total Copepod Abundance 2.5 5. 7.5. 12.5 2.5 5. 7.5. 12.5

Map a Variable to Color Set Your Own Scale p2 <- ggplot(data = plankton, aes(x =, y =, color = )) + geom_point() + theme_bw() p2 p2 <- p2 + scale_color_gradient(low = "blue", high = "red") p2 1995 199 1985 198 1975 1995 199 1985 198 1975 2.5 5. 7.5. 12.5 2.5 5. 7.5. 12.5 And Maybe Add Another Layer Facet for Easier Visualization p2 <- p2 + geom_line(aes(group = )) p2 p2 <- p2 + facet_wrap( ) + scale_x_continuous(breaks = c(3, 6, 9, 12)) p2 1995 199 1985 198 1975 2.5 5. 7.5. 12.5 1974 1975 1976 1977 1978 1979 198 1981 1982 1983 1984 1985 1986 1987 1988 1989 199 1992 1993 1994 1995 1996 1997 3 6 9 12 3 6 9 12 3 6 9 12 1995 199 1985 198 1975

This All Can Lead to Interesting Visualizations qplot(factor(), diatom, geom = "bar", fill = factor(), data = plankton) + theme_bw() + xlab("") + ylab("\n") + scale_fill_discrete(name = "") + scale_x_discrete(breaks = seq(1974, 1997, 5)) 2 3 4 5 6 7 8 9 11 12 Lots of Layers to Add to ggplot2 Objects?theme?labs?xlim?facet_grid?scale_x_log?geom_histogram?geom_ribbon?geom_linerange?geom_freqpoly So...Explore! Also, see http://had.co.nz/ggplot2 for some examples 1974 1979 1984 1989 1994