Introduction to R and the tidyverse. Paolo Crosetto
|
|
- Sandra Bates
- 6 years ago
- Views:
Transcription
1 Introduction to R and the tidyverse Paolo Crosetto
2 Lecture 1: plotting
3 Before we start: Rstudio Interactive console Object explorer Script window Plot window
4 Before we start: R concatenate: c() assign: <- vector, matrices: rbind(), cbind() matrix extraction: [ ] variable extraction: $ data frames: mpg
5 Why do we plot Why do we want to plot data? we are human beings we are pattern recognizers we can see things we are not able to grasp from data good to explore a dataset and look for regularities good to convey a clear message to have fun (to show your colleagues how nice your plot is)
6 What do you see? Figure 1: Plots allow to convey a lot of information in a compact way
7 Good plots, bad plots It is important to make good plots i.e., plots that look good and are honest to the data it is very easy to hide the message rather than highlighting it it is very easy to mislead with a plot so let s start with a gallery of bad plots. Can you guess why they are bad?
8 Bad plotting 1 Figure 2:
9 Bad plotting 2 Figure 3:
10 Bad plotting 3 Figure 4:
11 Bad plotting 4 Figure 5:
12 Bad plotting 5 Figure 6:
13 Bad plotting 5 (really, you don t need 3D plots) Figure 7:
14 The road to good plotting know your data think before you hit the enter button sketch on paper first be honest draw your axis first choose your visualization wisely
15 Some data We will start by using the built-in dataset mpg mpg ## #... with 224 more rows, and 2 more variables: fl <chr> ## # A tibble: 234 x 11 ## manufacturer model displ year cyl trans ## <chr> <chr> <dbl> <int> <int> <chr> ## 1 audi a auto(l5) ## 2 audi a manual(m5) ## 3 audi a manual(m6) ## 4 audi a auto(av) ## 5 audi a auto(l5) ## 6 audi a manual(m5) ## 7 audi a auto(av) ## 8 audi a4 quattro manual(m5) ## 9 audi a4 quattro auto(l5) ## 10 audi a4 quattro manual(m6)
16 A look at the data model : model name displ : engine displacement, in litres year : year of manufacture cyl : number of cylinders trans : type of transmission drv : f = front-wheel drive, r = rear wheel drive, 4 = 4wd cty : city miles per gallon hwy : highway miles per gallon fl : fuel type class : type of car
17 We will be using ggplot2. Why? Advantages of ggplot2 consistent underlying grammar of graphics (Wilkinson, 2005) plot specification at a high level of abstraction very flexible theme system for polishing plot appearance mature and complete graphics system many users, active mailing list
18 What is a grammar of graphics? The basic idea: independently specify plot building blocks and combine them to create just about any kind of graphical display you want. Building blocks of a graph include: data aesthetic mapping geometric object statistical transformations scales coordinate system position adjustments faceting
19 Starting from the basics As in a grammar the minimal sentence is a subject in a plot the minimal object is data p <- ggplot(mpg) In a grammar, you need a verb. In plots, this is axis p <- ggplot(mpg, aes(x = displ, y = hwy)) Still no plot generated!
20 Generating a plot But you also need an object. In ggplot, this is geoms p + geom_point() 40 hwy
21 Generating a plot, 2 But you also need an object. In ggplot, this is geoms p + geom_smooth() ## `geom_smooth()` using method = 'loess' hwy 25 20
22 Generating a plot, 3 But you also need an object. In ggplot, this is geoms p + geom_smooth()+geom_point() ## `geom_smooth()` using method = 'loess' 40 hwy 30 20
23 The beauty of a grammar metaphor once you get the main idea, adding things is easy a plot is a sentence made with data you add layers with + as you would add words to a sentence as in grammar you use adjectives to give more nuanced meaning, in plots you could use + to add color, fill, size, shape, etc...
24 Adding meaning: color p + geom_point(aes(color=class)) hwy class 2seater compact midsize minivan pickup subcompact suv displ
25 Adding meaning: size p + geom_point(aes(size=cyl)) hwy cyl displ
26 Adding meaning: color AND size p + geom_point(aes(size = cyl, color=class)) compact hwy midsize minivan pickup subcompact suv displ cyl
27 Adding meaning: shape p + geom_point(aes(shape=fl)) hwy fl c d e p r displ
28 Adding meaning: all together p + geom_point(aes(color=manufacturer, shape =fl, size = cy mercury hwy nissan pontiac subaru toyota volkswagen displ fl c d e p
29 Facets sometimes sentences become a bit too long it is useful to split them up in shorter sentences for instance, you could first talk about a car, then another one in plots, you can split up the plot along a variable so that one plot is drawn for each level of a given variable, say type of fuel
30 Facets p + geom_point(aes(color=manufacturer, size = cyl))+facet_g hwy c d e p r hyundai jeep land rover lincoln mercury nissan pontiac subaru toyota volkswagen displ cyl
31 More details on the grammar A ggplot is made up of data (subject) axis (verb) geoms (object) aesthetic layers (size, fill color, shape, label,... ) facets (splitting sentences) And then you can change how things look and behave: - coordinate functions (changing the axis appearance and type) - scale functions (changing the appearance of the geoms) - theme functions (changing the appearance of the plot itself)
32 Exploring data with plots: one variable Plot types depend on the variable type one-variable plots, discrete variable: barplot one-variable plots, continuous variable: distribution, density
33 Barplots let s look at the drive type of the cars: front, rear, or 4wd p <- ggplot(mpg, aes(drv)) p + geom_bar() count
34 Barplots not so fancy. should we add color? p <- ggplot(mpg, aes(drv)) p + geom_bar(aes(color=drv)) 100 count drv 4 f r 0
35 Barplots ups. Maybe we meant fill? p <- ggplot(mpg, aes(drv)) p + geom_bar(aes(fill=drv)) 100 count drv 4 f r 0
36 Barplots nice. doesn t add much information, though. what if we cross it with car class? p <- ggplot(mpg, aes(drv)) p + geom_bar(aes(fill=class)) count class 2seater compact midsize minivan pickup subcompact suv
37 Barplots By default stacked. How to unstack? p <- ggplot(mpg, aes(drv)) p + geom_bar(aes(fill=class), position = position_dodge()) count class 2seater compact midsize minivan pickup subcompact suv 0
38 Barplots By default stacked. How to show relative weight? p <- ggplot(mpg, aes(drv)) p + geom_bar(aes(fill=class), position = position_fill()) 1.00 count class 2seater compact midsize minivan pickup subcompact suv 0.00
39 10 One variable, continuous: mpg on highway When the variable is continuous, it makes more sense to show distributions p <- ggplot(mpg, aes(hwy)) p + geom_histogram() ## `stat_bin()` using `bins = 30`. Pick better value with ` count 20
40 Histograms: binwidth p + geom_histogram(bins = 10) count hwy
41 Histograms: binwidth p + geom_histogram(bins = 100) count hwy
42 An alternative do histogram: dotplot p + geom_dotplot(binwidth = 0.5) count hwy
43 Continuous distribution: Kernel Density Estimation p + geom_density() 0.06 density hwy
44 Continuous distribution: Kernel Density Estimation p + geom_density(adjust = 3) density hwy
45 Continuous distribution: Kernel Density Estimation p + geom_density(adjust = 0.5) density hwy
46 Additional resources try to look for PlotCon2016 videos, and especially this one...
Lecture 4: Data Visualization I
Lecture 4: Data Visualization I Data Science for Business Analytics Thibault Vatter Department of Statistics, Columbia University and HEC Lausanne, UNIL 11.03.2018 Outline 1 Overview
More informationFacets and Continuous graphs
Facets and Continuous graphs One way to add additional variables is with aesthetics. Another way, particularly useful for categorical variables, is to split your plot into facets, subplots that each display
More informationIntoduction to data analysis with R
1/66 Intoduction to data analysis with R Mark Johnson Macquarie University Sydney, Australia September 17, 2014 2/66 Outline Goals for today: calculate summary statistics for data construct several kinds
More informationThe diamonds dataset Visualizing data in R with ggplot2
Lecture 2 STATS/CME 195 Matteo Sesia Stanford University Spring 2018 Contents The diamonds dataset Visualizing data in R with ggplot2 The diamonds dataset The tibble package The tibble package is part
More informationggplot2 basics Hadley Wickham Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University September 2011
ggplot2 basics Hadley Wickham Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University September 2011 1. Diving in: scatterplots & aesthetics 2. Facetting 3. Geoms
More information03 - Intro to graphics (with ggplot2)
3 - Intro to graphics (with ggplot2) ST 597 Spring 217 University of Alabama 3-dataviz.pdf Contents 1 Intro to R Graphics 2 1.1 Graphics Packages................................ 2 1.2 Base Graphics...................................
More informationGraphical critique & theory. Hadley Wickham
Graphical critique & theory Hadley Wickham Exploratory graphics Are for you (not others). Need to be able to create rapidly because your first attempt will never be the most revealing. Iteration is crucial
More informationGgplot2 QMMA. Emanuele Taufer. 2/19/2018 Ggplot2 (1)
Ggplot2 QMMA Emanuele Taufer file:///c:/users/emanuele.taufer/google%20drive/2%20corsi/5%20qmma%20-%20mim/0%20classes/1-4_ggplot2.html#(1) 1/27 Ggplot2 ggplot2 is a plotting system for R, based on the
More informationGetting started with ggplot2
Getting started with ggplot2 STAT 133 Gaston Sanchez Department of Statistics, UC Berkeley gastonsanchez.com github.com/gastonstat/stat133 Course web: gastonsanchez.com/stat133 ggplot2 2 Resources for
More informationA set of rules describing how to compose a 'vocabulary' into permissible 'sentences'
Lecture 8: The grammar of graphics STAT598z: Intro. to computing for statistics Vinayak Rao Department of Statistics, Purdue University Grammar? A set of rules describing how to compose a 'vocabulary'
More informationVisualizing Data: Customization with ggplot2
Visualizing Data: Customization with ggplot2 Data Science 1 Stanford University, Department of Statistics ggplot2: Customizing graphics in R ggplot2 by RStudio s Hadley Wickham and Winston Chang offers
More informationStatistical transformations
Statistical transformations Next, let s take a look at a bar chart. Bar charts seem simple, but they are interesting because they reveal something subtle about plots. Consider a basic bar chart, as drawn
More informationAdvanced Plotting with ggplot2. Algorithm Design & Software Engineering November 13, 2016 Stefan Feuerriegel
Advanced Plotting with ggplot2 Algorithm Design & Software Engineering November 13, 2016 Stefan Feuerriegel Today s Lecture Objectives 1 Distinguishing different types of plots and their purpose 2 Learning
More informationCreating elegant graphics in R with ggplot2
Creating elegant graphics in R with ggplot2 Lauren Steely Bren School of Environmental Science and Management University of California, Santa Barbara What is ggplot2, and why is it so great? ggplot2 is
More informationPlotting with Rcell (Version 1.2-5)
Plotting with Rcell (Version 1.2-) Alan Bush October 7, 13 1 Introduction Rcell uses the functions of the ggplots2 package to create the plots. This package created by Wickham implements the ideas of Wilkinson
More informationHadley Wickham. ggplot2. Elegant Graphics for Data Analysis. July 26, Springer
Hadley Wickham ggplot2 Elegant Graphics for Data Analysis July 26, 2016 Springer To my parents, Alison & Brian Wickham. Without them, and their unconditional love and support, none of this would have
More informationRstudio GGPLOT2. Preparations. The first plot: Hello world! W2018 RENR690 Zihaohan Sang
Rstudio GGPLOT2 Preparations There are several different systems for creating data visualizations in R. We will introduce ggplot2, which is based on Leland Wilkinson s Grammar of Graphics. The learning
More informationLecture 09. Graphics::ggplot I R Teaching Team. October 1, 2018
Lecture 09 Graphics::ggplot I 2018 R Teaching Team October 1, 2018 Acknowledgements 1. Mike Fliss & Sara Levintow! 2. stackoverflow (particularly user David for lecture styling - link) 3. R Markdown: The
More informationSession 3 Nick Hathaway;
Session 3 Nick Hathaway; nicholas.hathaway@umassmed.edu Contents Manipulating Data frames and matrices 1 Converting to long vs wide formats.................................... 2 Manipulating data in table........................................
More information# Call plot plot(gg)
Most of the requirements related to look and feel can be achieved using the theme() function. It accepts a large number of arguments. Type?theme in the R console and see for yourself. # Setup options(scipen=999)
More informationggplot2 for beginners Maria Novosolov 1 December, 2014
ggplot2 for beginners Maria Novosolov 1 December, 214 For this tutorial we will use the data of reproductive traits in lizards on different islands (found in the website) First thing is to set the working
More informationLab5A - Intro to GGPLOT2 Z.Sang Sept 24, 2018
LabA - Intro to GGPLOT2 Z.Sang Sept 24, 218 In this lab you will learn to visualize raw data by plotting exploratory graphics with ggplot2 package. Unlike final graphs for publication or thesis, exploratory
More informationggplot in 3 easy steps (maybe 2 easy steps)
1 ggplot in 3 easy steps (maybe 2 easy steps) 1.1 aesthetic: what you want to graph (e.g. x, y, z). 1.2 geom: how you want to graph it. 1.3 options: optional titles, themes, etc. 2 Background R has a number
More informationBIOSTATS 640 Spring 2018 Introduction to R Data Description. 1. Start of Session. a. Preliminaries... b. Install Packages c. Attach Packages...
BIOSTATS 640 Spring 2018 Introduction to R and R-Studio Data Description Page 1. Start of Session. a. Preliminaries... b. Install Packages c. Attach Packages... 2. Load R Data.. a. Load R data frames...
More informationIntroduction to Graphics with ggplot2
Introduction to Graphics with ggplot2 Reaction 2017 Flavio Santi Sept. 6, 2017 Flavio Santi Introduction to Graphics with ggplot2 Sept. 6, 2017 1 / 28 Graphics with ggplot2 ggplot2 [... ] allows you to
More informationInstall RStudio from - use the standard installation.
Session 1: Reading in Data Before you begin: Install RStudio from http://www.rstudio.com/ide/download/ - use the standard installation. Go to the course website; http://faculty.washington.edu/kenrice/rintro/
More informationEXPLORATORY DATA ANALYSIS. Introducing the data
EXPLORATORY DATA ANALYSIS Introducing the data Email data set > email # A tibble: 3,921 21 spam to_multiple from cc sent_email time image 1 not-spam 0 1 0 0
More informationImporting and visualizing data in R. Day 3
Importing and visualizing data in R Day 3 R data.frames Like pandas in python, R uses data frame (data.frame) object to support tabular data. These provide: Data input Row- and column-wise manipulation
More informationPackage ggextra. April 4, 2018
Package ggextra April 4, 2018 Title Add Marginal Histograms to 'ggplot2', and More 'ggplot2' Enhancements Version 0.8 Collection of functions and layers to enhance 'ggplot2'. The flagship function is 'ggmarginal()',
More informationStatistics Lecture 6. Looking at data one variable
Statistics 111 - Lecture 6 Looking at data one variable Chapter 1.1 Moore, McCabe and Craig Probability vs. Statistics Probability 1. We know the distribution of the random variable (Normal, Binomial)
More informationLesson 16: More on Modeling Relationships with a Line
Student Outcomes Students use the least squares line to predict values for a given data set. Students use residuals to evaluate the accuracy of predictions based on the least squares line. Lesson Notes
More informationIntroduction to R for Beginners, Level II. Jeon Lee Bio-Informatics Core Facility (BICF), UTSW
Introduction to R for Beginners, Level II Jeon Lee Bio-Informatics Core Facility (BICF), UTSW Basics of R Powerful programming language and environment for statistical computing Useful for very basic analysis
More informationData Visualization in R
Data Visualization in R L. Torgo ltorgo@fc.up.pt Faculdade de Ciências / LIAAD-INESC TEC, LA Universidade do Porto Oct, 216 Introduction Motivation for Data Visualization Humans are outstanding at detecting
More information1 The ggplot2 workflow
ggplot2 @ statistics.com Week 2 Dope Sheet Page 1 dope, n. information especially from a reliable source [the inside dope]; v. figure out usually used with out; adj. excellent 1 This week s dope This week
More informationData Visualization. Module 7
Data Visualization http://datascience.tntlab.org Module 7 Today s Agenda A Brief Reminder to Update your Software A walkthrough of ggplot2 Big picture New cheatsheet, with some familiar caveats Geometric
More informationSTAT 1291: Data Science
STAT 1291: Data Science Lecture 20 - Summary Sungkyu Jung Semester recap data visualization data wrangling professional ethics statistical foundation Statistical modeling: Regression Cause and effect:
More informationVisual Analytics. Visualizing multivariate data:
Visual Analytics 1 Visualizing multivariate data: High density time-series plots Scatterplot matrices Parallel coordinate plots Temporal and spectral correlation plots Box plots Wavelets Radar and /or
More informationEcon 2148, spring 2019 Data visualization
Econ 2148, spring 2019 Maximilian Kasy Department of Economics, Harvard University 1 / 43 Agenda One way to think about statistics: Mapping data-sets into numerical summaries that are interpretable by
More informationSpring 2017 CS130 - Intro to R 1 R VISUALIZING DATA. Spring 2017 CS130 - Intro to R 2
Spring 2017 CS130 - Intro to R 1 R VISUALIZING DATA Spring 2017 Spring 2017 CS130 - Intro to R 2 Goals for this lecture: Review constructing Data Frame, Categorizing variables Construct basic graph, learn
More informationFigure 1: The PMG GUI on startup
Statistics involves a fair number of computations that can be made much more convenient using either a calculator or a computer. Although the basic TI-83 or 84 series of calculators can do most of the
More informationIntroduction to Data Visualization
Introduction to Data Visualization Author: Nicholas G Reich This material is part of the statsteachr project Made available under the Creative Commons Attribution-ShareAlike 3.0 Unported License: http://creativecommons.org/licenses/by-sa/3.0/deed.en
More informationAn introduction to ggplot: An implementation of the grammar of graphics in R
An introduction to ggplot: An implementation of the grammar of graphics in R Hadley Wickham 00-0-7 1 Introduction Currently, R has two major systems for plotting data, base graphics and lattice graphics
More informationR Visualizing Data. Fall Fall 2016 CS130 - Intro to R 1
R Visualizing Data Fall 2016 Fall 2016 CS130 - Intro to R 1 mtcars Data Frame R has a built-in data frame called mtcars Useful R functions length(object) # number of variables str(object) # structure of
More informationData Visualization in R
Data Visualization in R L. Torgo ltorgo@fc.up.pt Faculdade de Ciências / LIAAD-INESC TEC, LA Universidade do Porto Aug, 2017 Introduction Motivation for Data Visualization Humans are outstanding at detecting
More informationYou submitted this quiz on Sat 17 May :19 AM CEST. You got a score of out of
uiz Feedback Coursera 1 of 7 01/06/2014 20:02 Feedback Week 2 Quiz Help You submitted this quiz on Sat 17 May 2014 11:19 AM CEST. You got a score of 10.00 out of 10.00. Question 1 Under the lattice graphics
More informationLondonR: Introduction to ggplot2. Nick Howlett Data Scientist
LondonR: Introduction to ggplot2 Nick Howlett Data Scientist Email: nhowlett@mango-solutions.com Agenda Catie Gamble, M&S - Using R to Understand Revenue Opportunities for your Online Business Andrie de
More informationIntroduction to ggvis. Aimee Gott R Consultant
Introduction to ggvis Overview Recap of the basics of ggplot2 Getting started with ggvis The %>% operator Changing aesthetics Layers Interactivity Resources for the Workshop R (version 3.1.2) RStudio ggvis
More informationData Visualization. Andrew Jaffe Instructor
Module 9 Data Visualization Andrew Jaffe Instructor Basic Plots We covered some basic plots previously, but we are going to expand the ability to customize these basic graphics first. 2/45 Read in Data
More informationThe following presentation is based on the ggplot2 tutotial written by Prof. Jennifer Bryan.
Graphics Agenda Grammer of Graphics Using ggplot2 The following presentation is based on the ggplot2 tutotial written by Prof. Jennifer Bryan. ggplot2 (wiki) ggplot2 is a data visualization package Created
More informationStat405. Displaying distributions. Hadley Wickham. Thursday, August 23, 12
Stat405 Displaying distributions Hadley Wickham 1. The diamonds data 2. Histograms and bar charts 3. Homework Diamonds Diamonds data ~54,000 round diamonds from http://www.diamondse.info/ Carat, colour,
More informationR Workshop 1: Introduction to R
R Workshop 1: Introduction to R Gavin Simpson Environmental Change Research Centre, Department of Geography UCL April 30, 2013 Gavin Simpson (ECRC, UCL) Introduction to R April 30, 2013 1 / 43 Outline
More informationData Handling: Import, Cleaning and Visualisation
Data Handling: Import, Cleaning and Visualisation 1 Data Display Lecture 11: Visualisation and Dynamic Documents Prof. Dr. Ulrich Matter (University of St. Gallen) 13/12/18 In the last part of a data pipeline
More informationLarge data. Hadley Wickham. Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University.
Large data Hadley Wickham Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University November 2010 1. The diamonds data 2. Histograms and bar charts 3. Frequency polygons
More informationChapter 7. The Data Frame
Chapter 7. The Data Frame The R equivalent of the spreadsheet. I. Introduction Most analytical work involves importing data from outside of R and carrying out various manipulations, tests, and visualizations.
More informationChapter 2 - Graphical Summaries of Data
Chapter 2 - Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense
More informationIntroduction to R Software
1. Introduction R is a free software environment for statistical computing and graphics. It is almost perfectly compatible with S-plus. The only thing you need to do is download the software from the internet
More informationData Visualization Using R & ggplot2. Karthik Ram October 6, 2013
Data Visualization Using R & ggplot2 Karthik Ram October 6, 2013 Some housekeeping Install some packages install.packages("ggplot2", dependencies = TRUE) install.packages("plyr") install.packages("ggthemes")
More informationIntro to R for Epidemiologists
Lab 9 (3/19/15) Intro to R for Epidemiologists Part 1. MPG vs. Weight in mtcars dataset The mtcars dataset in the datasets package contains fuel consumption and 10 aspects of automobile design and performance
More informationData visualization with ggplot2
Data visualization with ggplot2 Visualizing data in R with the ggplot2 package Authors: Mateusz Kuzak, Diana Marek, Hedi Peterson, Dmytro Fishman Disclaimer We will be using the functions in the ggplot2
More information<style> pre { overflow-x: auto; } pre code { word-wrap: normal; white-space: pre; } </style>
--- title: "Visualization for Data Management Modules Wheat CAP 2018" author: name: "Jean-Luc Jannink" affiliation: "USDA-ARS" date: "June 7, 2018" output: html_document: fig_height: 6 fig_width: 12 highlight:
More information3. elucidating mathematical or algorithmic concepts.
Chapter 10 Data Visualization Visualizing data is key in e ective data analysis. It is useful for the following purposes: 1. initially investigating datasets, 2. confirming or refuting data models, and
More informationIntroduction to R and R-Studio Toy Program #2 Excel to R & Basic Descriptives
Introduction to R and R-Studio 2018-19 Toy Program #2 Basic Descriptives Summary The goal of this toy program is to give you a boiler for working with your own excel data. So, I m hoping you ll try!. In
More informationPackage autocogs. September 22, Title Automatic Cognostic Summaries Version 0.1.1
Title Automatic Cognostic Summaries Version 0.1.1 Package autocogs September 22, 2018 Automatically calculates cognostic groups for plot objects and list column plot objects. Results are returned in a
More informationSession 5 Nick Hathaway;
Session 5 Nick Hathaway; nicholas.hathaway@umassmed.edu Contents Adding Text To Plots 1 Line graph................................................. 1 Bar graph..................................................
More informationMaps & layers. Hadley Wickham. Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University.
Maps & layers Hadley Wickham Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University July 2010 1. Introduction to map data 2. Map projections 3. Loading & converting
More informationEXST 7014, Lab 1: Review of R Programming Basics and Simple Linear Regression
EXST 7014, Lab 1: Review of R Programming Basics and Simple Linear Regression OBJECTIVES 1. Prepare a scatter plot of the dependent variable on the independent variable 2. Do a simple linear regression
More informationMaking Better Features: Principal Components Analysis and Other Data Transformations
Making Better Features: Principal Components Analysis and Other Data Transformations 36-350: Data Mining September 27, 2006 Reading: Sections 2.4, 3.4, 3.5 and 3.6 in the textbook, especially Section 3.6
More informationThe Tidyverse BIOF 339 9/25/2018
The Tidyverse BIOF 339 9/25/2018 What is the Tidyverse? The tidyverse is an opinionated collection of R packages designed for data science. All packages share an underlying design philosophy, grammar,
More informationdope, n. information especially from a reliable source [the inside dope]; v. figure out usually used with out; adj. excellent 1
ggplot2 @ statistics.com Week 1 Dope Sheet Page 1 dope, n. information especially from a reliable source [the inside dope]; v. figure out usually used with out; adj. excellent 1 This week s dope This we
More informationIntroduction to R: Day 2 September 20, 2017
Introduction to R: Day 2 September 20, 2017 Outline RStudio projects Base R graphics plotting one or two continuous variables customizable elements of plots saving plots to a file Create a new project
More informationChapter 2: Graphical Summaries of Data 2.1 Graphical Summaries for Qualitative Data. Frequency: Frequency distribution:
Chapter 2: Graphical Summaries of Data 2.1 Graphical Summaries for Qualitative Data Frequency: Frequency distribution: Example 2.1 The following are survey results from Fall 2014 Statistics class regarding
More informationParallel Coordinates CS 6630 Scientific Visualization
Parallel Coordinates CS 6630 Scientific Visualization Yang Shen u0879466 Jason Davies Example Jason s parallel coordinates is fairly effective when it comes to interaction. I like the way creating filter
More informationUnit I Supplement OpenIntro Statistics 3rd ed., Ch. 1
Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1 KEY SKILLS: Organize a data set into a frequency distribution. Construct a histogram to summarize a data set. Compute the percentile for a particular
More information3. Visual Analytics (Part 1: Visual Encoding) Jacobs University Visualization and Computer Graphics Lab
3. Visual Analytics (Part 1: Visual Encoding) 3.1 Introduction Motivation Big Data cannot be analyzed anymore without the help of computers. Computers are good in quickly processing large amounts of data.
More informationIntroduction to R Forecasting Techniques
Introduction to R zabbeta@fsu.gr katerina@fsu.gr Starting out in R Working with data Plotting & Forecasting 1. Starting Out In R R & RStudio Variables & Basics Data Types Functions R + RStudio Programming
More informationThe Average and SD in R
The Average and SD in R The Basics: mean() and sd() Calculating an average and standard deviation in R is straightforward. The mean() function calculates the average and the sd() function calculates the
More informationAn Introduction to R Graphics
An Introduction to R Graphics PnP Group Seminar 25 th April 2012 Why use R for graphics? Fast data exploration Easy automation and reproducibility Create publication quality figures Customisation of almost
More informationLoading Data into R. Loading Data Sets
Loading Data into R Loading Data Sets Rather than manually entering data using c() or something else, we ll want to load data in stored in a data file. For this class, these will usually be one of three
More informationUser manual forggsubplot
User manual forggsubplot Garrett Grolemund September 3, 2012 1 Introduction ggsubplot expands the ggplot2 package to help users create multi-level plots, or embedded plots." Embedded plots embed subplots
More informationAn Introduction to R. Ed D. J. Berry 9th January 2017
An Introduction to R Ed D. J. Berry 9th January 2017 Overview Why now? Why R? General tips Recommended packages Recommended resources 2/48 Why now? Efficiency Pointandclick software just isn't time efficient
More informationMaking R Graphs, For People Who Don t Want To Learn R
Making R Graphs, For People Who Don t Want To Learn R Richard Blissett I get it. You already know one statistical programming language, and the idea of having to learn another one just to make pretty pictures
More informationData Import and Formatting
Data Import and Formatting http://datascience.tntlab.org Module 4 Today s Agenda Importing text data Basic data visualization tidyverse vs data.table Data reshaping and type conversion Basic Text Data
More informationLAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT
NAVAL POSTGRADUATE SCHOOL LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT Statistics (OA3102) Lab #2: Sampling, Sampling Distributions, and the Central Limit Theorem Goal: Use R to demonstrate sampling
More informationToday. Parity. General Polygons? Non-Zero Winding Rule. Winding Numbers. CS559 Lecture 11 Polygons, Transformations
CS559 Lecture Polygons, Transformations These are course notes (not used as slides) Written by Mike Gleicher, Oct. 005 With some slides adapted from the notes of Stephen Chenney Final version (after class)
More informationSection 1.2. Displaying Quantitative Data with Graphs. Mrs. Daniel AP Stats 8/22/2013. Dotplots. How to Make a Dotplot. Mrs. Daniel AP Statistics
Section. Displaying Quantitative Data with Graphs Mrs. Daniel AP Statistics Section. Displaying Quantitative Data with Graphs After this section, you should be able to CONSTRUCT and INTERPRET dotplots,
More informationA Quick Introduction to R
Math 4501 Fall 2012 A Quick Introduction to R The point of these few pages is to give you a quick introduction to the possible uses of the free software R in statistical analysis. I will only expect you
More informationCreating a Box-and-Whisker Graph in Excel: Step One: Step Two:
Creating a Box-and-Whisker Graph in Excel: It s not as simple as selecting Box and Whisker from the Chart Wizard. But if you ve made a few graphs in Excel before, it s not that complicated to convince
More informationCS4445 Data Mining and Knowledge Discovery in Databases. A Term 2008 Exam 2 October 14, 2008
CS4445 Data Mining and Knowledge Discovery in Databases. A Term 2008 Exam 2 October 14, 2008 Prof. Carolina Ruiz Department of Computer Science Worcester Polytechnic Institute NAME: Prof. Ruiz Problem
More information(Refer Slide Time: 00:01:27 min)
Computer Aided Design Prof. Dr. Anoop Chawla Department of Mechanical engineering Indian Institute of Technology, Delhi Lecture No. # 01 An Introduction to CAD Today we are basically going to introduce
More informationGraphics in R. There are three plotting systems in R. base Convenient, but hard to adjust after the plot is created
Graphics in R There are three plotting systems in R base Convenient, but hard to adjust after the plot is created lattice Good for creating conditioning plot ggplot2 Powerful and flexible, many tunable
More informationA Quick and focused overview of R data types and ggplot2 syntax MAHENDRA MARIADASSOU, MARIA BERNARD, GERALDINE PASCAL, LAURENT CAUQUIL
A Quick and focused overview of R data types and ggplot2 syntax MAHENDRA MARIADASSOU, MARIA BERNARD, GERALDINE PASCAL, LAURENT CAUQUIL 1 R and RStudio OVERVIEW 2 R and RStudio R is a free and open environment
More informationHW 10 STAT 472, Spring 2018
HW 10 STAT 472, Spring 2018 1) (0 points) Do parts (a), (b), (c), and (e) of Exercise 2 on p. 298 of ISL. 2) (0 points) Do Exercise 3 on p. 298 of ISL. 3) For this problem, you can merely submit the things
More informationLecture 1: Getting Started and Data Basics
Lecture 1: Getting Started and Data Basics The first lecture is intended to provide you the basics for running R. Outline: 1. An Introductory R Session 2. R as a Calculator 3. Import, export and manipulate
More informationTracking Computer Vision Spring 2018, Lecture 24
Tracking http://www.cs.cmu.edu/~16385/ 16-385 Computer Vision Spring 2018, Lecture 24 Course announcements Homework 6 has been posted and is due on April 20 th. - Any questions about the homework? - How
More informationPackage ggsubplot. February 15, 2013
Package ggsubplot February 15, 2013 Maintainer Garrett Grolemund License GPL Title Explore complex data by embedding subplots within plots. LazyData true Type Package Author Garrett
More informationVisualization for Data Management Modules Wheat CAP 2018
Visualization for Data Management Modules Wheat CAP 2018 Jean-Luc Jannink USDA-ARS June 7, 2018 Preliminaries Learning objectives 1. Get into the mind of ggplot i. plots are objects with layers. There
More informationDATA VISUALIZATION WITH GGPLOT2. Coordinates
DATA VISUALIZATION WITH GGPLOT2 Coordinates Coordinates Layer Controls plot dimensions coord_ coord_cartesian() Zooming in scale_x_continuous(limits =...) xlim() coord_cartesian(xlim =...) Original Plot
More informationMA30SA Applied Math Unit D - Linear Programming Revd:
1 Introduction to Linear Programming MA30SA Applied Math Unit D - Linear Programming Revd: 120051212 1. Linear programming is a very important skill. It is a brilliant method for establishing optimum solutions
More informationIntroduction to R Programming
Course Overview Over the past few years, R has been steadily gaining popularity with business analysts, statisticians and data scientists as a tool of choice for conducting statistical analysis of data
More informationdata visualization Show the Data Snow Month skimming deep waters
data visualization skimming deep waters Show the Data Snow 2 4 6 8 12 Minimize Distraction Minimize Distraction Snow 2 4 6 8 12 2 4 6 8 12 Make Big Data Coherent Reveal Several Levels of Detail 1974 1975
More information