IST 3108 Data Analysis and Graphics Using R. Summarizing Data Data Import-Export

Size: px
Start display at page:

Download "IST 3108 Data Analysis and Graphics Using R. Summarizing Data Data Import-Export"

Transcription

1 IST 3108 Data Analysis and Graphics Using R Summarizing Data Data Import-Export Engin YILDIZTEPE, PhD Working with Vectors and Logical Subscripts >x<-1:20 >sum(x) how many of the values were less than 10: >sum(x<10) sum of the values of x that are less than 10: >sum(x[x<10]) 2 1

2 Working with Vectors and Logical Subscripts sort, rev >x<-c(3,5,12,6,7,10,7,2,4,9,3,2) >rev(x) [1] >sort(x) [1] How can you sort the values in decreasing order? > sort(x,decreasing=true) [1] > rev(sort(x)) [1] Working with Vectors and Logical Subscripts sort, rev >x<-c(3,5,12,6,7,10,7,2,4,9,3,2) What is the sum of the three largest values in x? > sum(sort(x,true)[1:3]) [1]

3 order() function order() returns a permutation which rearranges its first argument into ascending or descending order. (you need to sort a series of variables according to the values of some other variables) Example: Sort mtcars values sorted by mpg > order(mtcars$mpg) [1] > mtcars[order(mtcars$mpg),] > mtcars[rev(order(mtcars$mpg)),] > mtcars[order(mtcars$mpg,mtcars$qsec),] 5 Working with Vectors and Logical Subscripts To extract every 25th value in a 1000-long vector of normal random numbers with mean value 0 and std. dev. 1 >x<-rnorm(1000,0,1) >x[seq(25,length(x),25)] 6 3

4 Summarizing Data Name mean() median() summary() min(), max() quantile() var(), sd() cov(), cor() Operation arithmetic mean sample median generic summary function for data smallest/largest values calculate sample quantiles (percentiles) sample variance, sample std. dev. sample covariance/correlation 7 Summarizing Data summary() data(mtcars) # load in dataset attach(mtcars) # add mtcars to search path?mtcars mtcars >summary(mpg) Min. 1st Qu. Median Mean 3rd Qu. Max

5 Summarizing Data quantile() mean(hp) [1] var(mpg) [1] #qsec :1/4 mile (402 m) time in seconds quantile(qsec) 0% 25% 50% 75% 100% Summarizing Data quantile() >quantile(qsec, probs = c(0.20, 0.80)) # 20th and 80th percentiles 20% 80% How can we get this result? 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% > per<-seq(0,1,0.1) > per [1]

6 Summarizing Data quantile() > quantile(qsec,probs=per) 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% > quantile(qsec,probs=per,type=1) 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% > length(qsec) [1] 32 > sort(qsec) Summarizing Data quantile() Can we compute the median with quantile()? > median(qsec) [1] > quantile(qsec, probs= 0.5) 50% > quantile(qsec, probs= 0.5,type=1) 50%

7 > cor(wt,mpg) [1] Summarizing Data Example > cor(hp,disp) [1] Summarizing Data table() For the discrete variables, we can get summary counts; > table(mtcars$cyl) > table(mtcars$cyl,mtcars$gear)

8 Data Export write.table(x, file = "", append = FALSE, quote = TRUE, sep= " ", eol = "\n", na = "NA", dec = ".", row.names=true, col.names = TRUE, qmethod=c("escape", "double"), fileencoding = "") write.table prints its required argument x (after converting it to a data frame if it is not one nor a matrix) to a file 15 Data Export - Example >x<-rnorm(20) >write.table(x, file="s:\\x1.txt") or >write.table(x, file="s:/x1.txt") 16 8

9 Data Export - Examples >m <- matrix(1:20,ncol=5) >m >write.table(m,file= s:\\m1.txt") >write.table(m,file= s:\\m2.txt", sep = "\t") >write.table(m,file= s:\\m3.txt", sep = "\n") >write.table(m,file= s:\\m4.txt", sep = ";", row.names=f, col.names = F) 17 Data Export - Examples >x<-rnorm(20) >y<-rnorm(20) >df<-data.frame(x,y) >write.table(df,file=" s:\\df1.txt", sep = "\t", row.names=f, col.names = F) >write.table(df,file=" s:\\df2.txt", sep = "\t",dec=',, row.names=f, col.names = F) 18 9

10 Data Export - Examples Export mtcars data.frame to a tab delimated text file. >write.table(mtcars,file="s:\\carsdata.txt", sep="\t", dec=',', row.names=t, col.names = T) 19 Data Export - Examples Export mtcars data.frame to a tab delimated text file. >write.table(mtcars,file=file.choose(), sep="\t", dec=',', row.names=t, col.names = T) 20 10

11 Data Import read.table(file, header = FALSE, sep = "", quote = "\"'", dec = ".", row.names, col.names, as.is =!stringsasfactors, na.strings = "NA", colclasses = NA, nrows = -1, Reads a file in table format and creates a data frame from it, with cases corresponding to lines and variables to fields in the file. 21 Data Import - Examples new.df1<-read.table(file="s:/df1.txt", header=f, sep="\t", dec=".") new.df2<-read.table(file="s:/df2.txt", header=f, sep="\t", dec=",") 22 11

12 Data Import - Examples >cars<-read.table(file=file.choose(),header=t, sep="\t", dec=',') >cars 23 Data Import - Examples >m <- matrix(1:20,ncol=5) >write.table(m,file="m5.txt", sep = " ",row.names=f, col.names = T) >new.m<-read.table(file= m5.txt", header=t, sep=" ", dec=".") >new.m >class(new.m) 24 12

13 Variants of read.table Data Import read.csv(file, header=t, sep = ",", dec=".",...) read.csv2(file, header=t, sep=";", dec=",",...) The former assumes that fields are seperated by comma, and the latter assumes that they are seperated by semicolons but use a comma as the decimal point (the format in Turkey and European) 25 Data Import Further variants of read.table for tab delimited files read.delim (file, header=t, sep = "\t", dec=".",...) read.delim2(file, header=t, sep = "\t", dec=",",...) The former assumes that decimal point is period, and the latter assumes that the decimal point is comma. Example: air<-read.delim2(file="data1.txt") 26 13

Input/Output Data Frames

Input/Output Data Frames Input/Output Data Frames Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Input/Output Importing text files Rectangular (n rows, c columns) Usually you want to use read.table read.table(file,

More information

Basics of R. > x=2 (or x<-2) > y=x+3 (or y<-x+3)

Basics of R. > x=2 (or x<-2) > y=x+3 (or y<-x+3) Basics of R 1. Arithmetic Operators > 2+2 > sqrt(2) # (2) >2^2 > sin(pi) # sin(π) >(1-2)*3 > exp(1) # e 1 >1-2*3 > log(10) # This is a short form of the full command, log(10, base=e). (Note) For log 10

More information

Reading and wri+ng data

Reading and wri+ng data An introduc+on to Reading and wri+ng data Noémie Becker & Benedikt Holtmann Winter Semester 16/17 Course outline Day 4 Course outline Review Data types and structures Reading data How should data look

More information

Reading in data. Programming in R for Data Science Anders Stockmarr, Kasper Kristensen, Anders Nielsen

Reading in data. Programming in R for Data Science Anders Stockmarr, Kasper Kristensen, Anders Nielsen Reading in data Programming in R for Data Science Anders Stockmarr, Kasper Kristensen, Anders Nielsen Data Import R can import data many ways. Packages exists that handles import from software systems

More information

Data Input/Output. Andrew Jaffe. January 4, 2016

Data Input/Output. Andrew Jaffe. January 4, 2016 Data Input/Output Andrew Jaffe January 4, 2016 Before we get Started: Working Directories R looks for files on your computer relative to the working directory It s always safer to set the working directory

More information

What R is. STAT:5400 (22S:166) Computing in Statistics

What R is. STAT:5400 (22S:166) Computing in Statistics STAT:5400 (22S:166) Computing in Statistics Introduction to R Lecture 5 September 9, 2015 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 What R is an integrated suite of software facilities for data

More information

Reading and writing data

Reading and writing data 25/10/2017 Reading data Reading data is one of the most consuming and most cumbersome aspects of bioinformatics... R provides a number of ways to read and write data stored on different media (file, database,

More information

Topics for today Input / Output Using data frames Mathematics with vectors and matrices Summary statistics Basic graphics

Topics for today Input / Output Using data frames Mathematics with vectors and matrices Summary statistics Basic graphics Topics for today Input / Output Using data frames Mathematics with vectors and matrices Summary statistics Basic graphics Introduction to S-Plus 1 Input: Data files For rectangular data files (n rows,

More information

STAT 540 Computing in Statistics

STAT 540 Computing in Statistics STAT 540 Computing in Statistics Introduces programming skills in two important statistical computer languages/packages. 30-40% R and 60-70% SAS Examples of Programming Skills: 1. Importing Data from External

More information

IMPORTING DATA IN R. Introduction read.csv

IMPORTING DATA IN R. Introduction read.csv IMPORTING DATA IN R Introduction read.csv Importing data in R? 5 types Flat files Data from Excel Databases Web Statistical software Flat Files states.csv Comma Separated Values state,capital,pop_mill,area_sqm

More information

IMPORTING DATA INTO R. Introduction Flat Files

IMPORTING DATA INTO R. Introduction Flat Files IMPORTING DATA INTO R Introduction Flat Files Importing data into R? 5 Types Flat Files Excel Files Statistical Software Databases Data from the Web Flat Files states.csv Comma Separated Values state,capital,pop_mill,area_sqm

More information

"no.loss"), FALSE) na.strings=c("na","#div/0!"), 72 # Ενσωματωμένες συναρτήσεις (build-in functions) του R

no.loss), FALSE) na.strings=c(na,#div/0!), 72 # Ενσωματωμένες συναρτήσεις (build-in functions) του R 71 72 # Ενσωματωμένες συναρτήσεις (build-in functions) του R ----------------------------------------- 73 read.table(file, header = FALSE, sep = "", quote = "\"'", 74 dec = ".", numerals = c("allow.loss",

More information

R package

R package R package www.r-project.org Download choose the R version for your OS install R for the first time Download R 3 run R MAGDA MIELCZAREK 2 help help( nameofthefunction )? nameofthefunction args(nameofthefunction)

More information

Mails : ; Document version: 14/09/12

Mails : ; Document version: 14/09/12 Mails : leslie.regad@univ-paris-diderot.fr ; gaelle.lelandais@univ-paris-diderot.fr Document version: 14/09/12 A freely available language and environment Statistical computing Graphics Supplementary

More information

Basic R QMMA. Emanuele Taufer. 2/19/2018 Basic R (1)

Basic R QMMA. Emanuele Taufer. 2/19/2018 Basic R (1) Basic R QMMA Emanuele Taufer file:///c:/users/emanuele.taufer/google%20drive/2%20corsi/5%20qmma%20-%20mim/0%20classes/1-3_basic_r.html#(1) 1/21 Preliminary R is case sensitive: a is not the same as A.

More information

Chapter 7. The Data Frame

Chapter 7. The Data Frame Chapter 7. The Data Frame The R equivalent of the spreadsheet. I. Introduction Most analytical work involves importing data from outside of R and carrying out various manipulations, tests, and visualizations.

More information

Package swat. June 5, 2018

Package swat. June 5, 2018 Type Package Package swat June 5, 2018 Title SAS Scripting Wrapper for Analytics Transfer (SWAT) Version 1.2.1 Date 11OCT2017 Author Jared Dean [aut, cre], Tom Weber [aut, cre], Kevin Smith [aut] SWAT

More information

Package swat. March 5, 2018

Package swat. March 5, 2018 Type Package Package swat March 5, 2018 Title SAS Scripting Wrapper for Analytics Transfer (SWAT) Version 1.2.0.9000 Date 11OCT2017 Author Jared Dean [aut, cre], Tom Weber [aut, cre], Kevin Smith [aut]

More information

Intermediate Programming in R Session 1: Data. Olivia Lau, PhD

Intermediate Programming in R Session 1: Data. Olivia Lau, PhD Intermediate Programming in R Session 1: Data Olivia Lau, PhD Outline About Me About You Course Overview and Logistics R Data Types R Data Structures Importing Data Recoding Data 2 About Me Using and programming

More information

Package swat. April 27, 2017

Package swat. April 27, 2017 Type Package Package swat April 27, 2017 Title SAS Scripting Wrapper for Analytics Transfer (SWAT) Version 1.0.0 Date 28APR2017 Author Jared Dean [aut, cre], Tom Weber [aut, cre], Kevin Smith [aut] SWAT

More information

Getting Started. Slides R-Intro: R-Analytics: R-HPC:

Getting Started. Slides R-Intro:   R-Analytics:   R-HPC: Getting Started Download and install R + Rstudio http://www.r-project.org/ https://www.rstudio.com/products/rstudio/download2/ TACC ssh username@wrangler.tacc.utexas.edu % module load Rstats %R Slides

More information

Description/History Objects/Language Description Commonly Used Basic Functions. More Specific Functionality Further Resources

Description/History Objects/Language Description Commonly Used Basic Functions. More Specific Functionality Further Resources R Outline Description/History Objects/Language Description Commonly Used Basic Functions Basic Stats and distributions I/O Plotting Programming More Specific Functionality Further Resources www.r-project.org

More information

BIO5312: R Session 1 An Introduction to R and Descriptive Statistics

BIO5312: R Session 1 An Introduction to R and Descriptive Statistics BIO5312: R Session 1 An Introduction to R and Descriptive Statistics Yujin Chung August 30th, 2016 Fall, 2016 Yujin Chung R Session 1 Fall, 2016 1/24 Introduction to R R software R is both open source

More information

The statistical software R

The statistical software R The statistical software R Luca Frigau University of Cagliari Ph.D. course Quantitative Methods A.A. 2017/2018 1 / 75 Outline 1 R and its history 2 Logic and objects Data acquisition Object structure and

More information

MBV4410/9410 Fall Bioinformatics for Molecular Biology. Introduction to R

MBV4410/9410 Fall Bioinformatics for Molecular Biology. Introduction to R MBV4410/9410 Fall 2018 Bioinformatics for Molecular Biology Introduction to R Outline Introduce R Basic operations RStudio Bioconductor? Goal of the lecture Introduce you to R Show how to run R, basic

More information

No Name What it does? 1 attach Attach your data frame to your working environment. 2 boxplot Creates a boxplot.

No Name What it does? 1 attach Attach your data frame to your working environment. 2 boxplot Creates a boxplot. No Name What it does? 1 attach Attach your data frame to your working environment. 2 boxplot Creates a boxplot. 3 confint A metafor package function that gives you the confidence intervals of effect sizes.

More information

Module 4. Data Input. Andrew Jaffe Instructor

Module 4. Data Input. Andrew Jaffe Instructor Module 4 Data Input Andrew Jaffe Instructor Data Input We used several pre-installed sample datasets during previous modules (CO2, iris) However, 'reading in' data is the first step of any real project/analysis

More information

Reading and writing data

Reading and writing data An introduction to WS 2017/2018 Reading and writing data Dr. Noémie Becker Dr. Sonja Grath Special thanks to: Prof. Dr. Martin Hutzenthaler and Dr. Benedikt Holtmann for significant contributions to course

More information

Package zebu. R topics documented: October 24, 2017

Package zebu. R topics documented: October 24, 2017 Type Package Title Local Association Measures Version 0.1.2 Date 2017-10-21 Author Olivier M. F. Martin [aut, cre], Michel Ducher [aut] Package zebu October 24, 2017 Maintainer Olivier M. F. Martin

More information

The Beginning g of an Introduction to R Dan Nettleton

The Beginning g of an Introduction to R Dan Nettleton The Beginning g of an Introduction to R for New Users 2010 Dan Nettleton 1 Preliminaries Throughout these slides, red text indicates text that is typed at the R prompt or text that is to be cut from a

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Bradley Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org

More information

Importing data sets in R

Importing data sets in R Importing data sets in R R can import and export different types of data sets including csv files text files excel files access database STATA data SPSS data shape files audio files image files and many

More information

Importing and visualizing data in R. Day 3

Importing and visualizing data in R. Day 3 Importing and visualizing data in R Day 3 R data.frames Like pandas in python, R uses data frame (data.frame) object to support tabular data. These provide: Data input Row- and column-wise manipulation

More information

Introduction to R. UCLA Statistical Consulting Center R Bootcamp. Irina Kukuyeva September 20, 2010

Introduction to R. UCLA Statistical Consulting Center R Bootcamp. Irina Kukuyeva September 20, 2010 UCLA Statistical Consulting Center R Bootcamp Irina Kukuyeva ikukuyeva@stat.ucla.edu September 20, 2010 Outline 1 Introduction 2 Preliminaries 3 Working with Vectors and Matrices 4 Data Sets in R 5 Overview

More information

R: BASICS. Andrea Passarella. (plus some additions by Salvatore Ruggieri)

R: BASICS. Andrea Passarella. (plus some additions by Salvatore Ruggieri) R: BASICS Andrea Passarella (plus some additions by Salvatore Ruggieri) BASIC CONCEPTS R is an interpreted scripting language Types of interactions Console based Input commands into the console Examine

More information

Introduction to R. Nishant Gopalakrishnan, Martin Morgan January, Fred Hutchinson Cancer Research Center

Introduction to R. Nishant Gopalakrishnan, Martin Morgan January, Fred Hutchinson Cancer Research Center Introduction to R Nishant Gopalakrishnan, Martin Morgan Fred Hutchinson Cancer Research Center 19-21 January, 2011 Getting Started Atomic Data structures Creating vectors Subsetting vectors Factors Matrices

More information

Introduction to R, Github and Gitlab

Introduction to R, Github and Gitlab Introduction to R, Github and Gitlab 27/11/2018 Pierpaolo Maisano Delser mail: maisanop@tcd.ie ; pm604@cam.ac.uk Outline: Why R? What can R do? Basic commands and operations Data analysis in R Github and

More information

LaF benchmarks. D.J. van der Laan

LaF benchmarks. D.J. van der Laan LaF benchmarks D.J. van der Laan 2011-11-06 1 Introduction LaF is a package for R for working with large ASCII files in R. The manual vignette contains an discription of the functionality provided. In

More information

An introduction to R WS 2013/2014

An introduction to R WS 2013/2014 An introduction to R WS 2013/2014 Dr. Noémie Becker (AG Metzler) Dr. Sonja Grath (AG Parsch) Special thanks to: Dr. Martin Hutzenthaler (previously AG Metzler, now University of Frankfurt) course development,

More information

A Short Guide to R with RStudio

A Short Guide to R with RStudio Short Guides to Microeconometrics Fall 2013 Prof. Dr. Kurt Schmidheiny Universität Basel A Short Guide to R with RStudio 2 1 Introduction A Short Guide to R with RStudio 1 Introduction 3 2 Installing R

More information

Brief cheat sheet of major functions covered here. shoe<-c(8,7,8.5,6,10.5,11,7,6,12,10)

Brief cheat sheet of major functions covered here. shoe<-c(8,7,8.5,6,10.5,11,7,6,12,10) 1 Class 2. Handling data in R Creating, editing, reading, & exporting data frames; sorting, subsetting, combining Goals: (1) Creating matrices and dataframes: cbind and as.data.frame (2) Editing data:

More information

int64 : 64 bits integer vectors

int64 : 64 bits integer vectors int64 : 64 bits integer vectors Romain François - romain@r-enthusiasts.com int64 version 1.1.2 Abstract The int64 package adds 64 bit integer vectors to R. The package provides the int64 and uint64 classes

More information

An Introduction to Statistical Computing in R

An Introduction to Statistical Computing in R An Introduction to Statistical Computing in R K2I Data Science Boot Camp - Day 1 AM Session May 15, 2017 Statistical Computing in R May 15, 2017 1 / 55 AM Session Outline Intro to R Basics Plotting In

More information

II.Matrix. Creates matrix, takes a vector argument and turns it into a matrix matrix(data, nrow, ncol, byrow = F)

II.Matrix. Creates matrix, takes a vector argument and turns it into a matrix matrix(data, nrow, ncol, byrow = F) II.Matrix A matrix is a two dimensional array, it consists of elements of the same type and displayed in rectangular form. The first index denotes the row; the second index denotes the column of the specified

More information

A Brief Introduction to R

A Brief Introduction to R A Brief Introduction to R Babak Shahbaba Department of Statistics, University of California, Irvine, USA Chapter 1 Introduction to R 1.1 Installing R To install R, follow these steps: 1. Go to http://www.r-project.org/.

More information

Rearranging and manipula.ng data

Rearranging and manipula.ng data An introduc+on to Rearranging and manipula.ng data Noémie Becker & Benedikt Holtmann Winter Semester 16/17 Course outline Day 7 Course outline Review Checking and cleaning data Rearranging and manipula+ng

More information

Reading Data in zoo. Gabor Grothendieck GKX Associates Inc. Achim Zeileis Universität Innsbruck

Reading Data in zoo. Gabor Grothendieck GKX Associates Inc. Achim Zeileis Universität Innsbruck Reading Data in zoo Gabor Grothendieck GKX Associates Inc. Achim Zeileis Universität Innsbruck Abstract This vignette gives examples of how to read data in various formats in the zoo package using the

More information

R basics workshop Sohee Kang

R basics workshop Sohee Kang R basics workshop Sohee Kang Math and Stats Learning Centre Department of Computer and Mathematical Sciences Objective To teach the basic knowledge necessary to use R independently, thus helping participants

More information

Introduction to R Software

Introduction to R Software 1. Introduction R is a free software environment for statistical computing and graphics. It is almost perfectly compatible with S-plus. The only thing you need to do is download the software from the internet

More information

Introduction to R (BaRC Hot Topics)

Introduction to R (BaRC Hot Topics) Introduction to R (BaRC Hot Topics) George Bell September 30, 2011 This document accompanies the slides from BaRC s Introduction to R and shows the use of some simple commands. See the accompanying slides

More information

Step-by-step user instructions to the hamlet-package

Step-by-step user instructions to the hamlet-package Step-by-step user instructions to the hamlet-package Teemu Daniel Laajala May 26, 2018 Contents 1 Analysis workflow 2 2 Loading data into R 2 2.1 Excel format data.......................... 4 2.2 CSV-files...............................

More information

Introduction to R Benedikt Brors Dept. Intelligent Bioinformatics Systems German Cancer Research Center

Introduction to R Benedikt Brors Dept. Intelligent Bioinformatics Systems German Cancer Research Center Introduction to R Benedikt Brors Dept. Intelligent Bioinformatics Systems German Cancer Research Center What is R? R is a statistical computing environment with graphics capabilites It is fully scriptable

More information

The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up.

The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up. Fall 2010 Handout on Using R Page: 1 The goal of this handout is to allow you to install R on a Windows-based PC and to deal with some of the issues that can (will) come up. 1. Installing R First off,

More information

EXST 7014, Lab 1: Review of R Programming Basics and Simple Linear Regression

EXST 7014, Lab 1: Review of R Programming Basics and Simple Linear Regression EXST 7014, Lab 1: Review of R Programming Basics and Simple Linear Regression OBJECTIVES 1. Prepare a scatter plot of the dependent variable on the independent variable 2. Do a simple linear regression

More information

S CHAPTER return.data S CHAPTER.Data S CHAPTER

S CHAPTER return.data S CHAPTER.Data S CHAPTER 1 S CHAPTER return.data S CHAPTER.Data MySwork S CHAPTER.Data 2 S e > return ; return + # 3 setenv S_CLEDITOR emacs 4 > 4 + 5 / 3 ## addition & divison [1] 5.666667 > (4 + 5) / 3 ## using parentheses [1]

More information

POL 345: Quantitative Analysis and Politics

POL 345: Quantitative Analysis and Politics POL 345: Quantitative Analysis and Politics Precept Handout 1 Week 2 (Verzani Chapter 1: Sections 1.2.4 1.4.31) Remember to complete the entire handout and submit the precept questions to the Blackboard

More information

Lecture 1: Getting Started and Data Basics

Lecture 1: Getting Started and Data Basics Lecture 1: Getting Started and Data Basics The first lecture is intended to provide you the basics for running R. Outline: 1. An Introductory R Session 2. R as a Calculator 3. Import, export and manipulate

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Reading Data Dr. David Koop Data Frame A dictionary of Series (labels for each series) A spreadsheet with column headers Has an index shared with each series Allows

More information

Unit 3 Fill Series, Functions, Sorting

Unit 3 Fill Series, Functions, Sorting Unit 3 Fill Series, Functions, Sorting Fill enter repetitive values or formulas in an indicated direction Using the Fill command is much faster than using copy and paste you can do entire operation in

More information

Unit 3 Functions Review, Fill Series, Sorting, Merge & Center

Unit 3 Functions Review, Fill Series, Sorting, Merge & Center Unit 3 Functions Review, Fill Series, Sorting, Merge & Center Function built-in formula that performs simple or complex calculations automatically names a function instead of using operators (+, -, *,

More information

Intro to R h)p://jacobfenton.s3.amazonaws.com/r- handson.pdf. Jacob Fenton CAR Director InvesBgaBve ReporBng Workshop, American University

Intro to R h)p://jacobfenton.s3.amazonaws.com/r- handson.pdf. Jacob Fenton CAR Director InvesBgaBve ReporBng Workshop, American University Intro to R h)p://jacobfenton.s3.amazonaws.com/r- handson.pdf Jacob Fenton CAR Director InvesBgaBve ReporBng Workshop, American University Overview Import data Move around the file system, save an image

More information

Package iotools. R topics documented: January 25, Version Title I/O Tools for Streaming

Package iotools. R topics documented: January 25, Version Title I/O Tools for Streaming Version 0.2-5 Title I/O Tools for Streaming Package iotools January 25, 2018 Author Simon Urbanek , Taylor Arnold Maintainer Simon Urbanek

More information

Why use R? Getting started. Why not use R? Introduction to R: Log into tak. Start R R or. It s hard to use at first

Why use R? Getting started. Why not use R? Introduction to R: Log into tak. Start R R or. It s hard to use at first Why use R? Introduction to R: Using R for statistics ti ti and data analysis BaRC Hot Topics October 2011 George Bell, Ph.D. http://iona.wi.mit.edu/bio/education/r2011/ To perform inferential statistics

More information

R tutorial. Fundamentals of the R language Lorenza Bordoli. R commands are in written in red

R tutorial. Fundamentals of the R language Lorenza Bordoli. R commands are in written in red R tutorial 13.02.07 R commands are in written in red Fundamentals of the R language R as a calculator Calculator +, -, /, *, ^, log, exp, : > (17*0.35)^(1/3) > log(10) > exp(1) > 3^-1 Assigning Values

More information

getting started in R

getting started in R Garrick Aden-Buie // Friday, March 25, 2016 getting started in R 1 / 70 getting started in R Garrick Aden-Buie // Friday, March 25, 2016 INFORMS Code & Data Boot Camp Today we ll talk about Garrick Aden-Buie

More information

Statistics 251: Statistical Methods

Statistics 251: Statistical Methods Statistics 251: Statistical Methods Summaries and Graphs in R Module R1 2018 file:///u:/documents/classes/lectures/251301/renae/markdown/master%20versions/summary_graphs.html#1 1/14 Summary Statistics

More information

Programming for Chemical and Life Science Informatics

Programming for Chemical and Life Science Informatics Programming for Chemical and Life Science Informatics I573 - Week 7 (Statistical Programming with R) Rajarshi Guha 24 th February, 2009 Resources Download binaries If you re working on Unix it s a good

More information

Exploring and Understanding Data Using R.

Exploring and Understanding Data Using R. Exploring and Understanding Data Using R. Loading the data into an R data frame: variable

More information

Package csvread. August 29, 2016

Package csvread. August 29, 2016 Title Fast Specialized CSV File Loader Version 1.2 Author Sergei Izrailev Package csvread August 29, 2016 Maintainer Sergei Izrailev Description Functions for loading large

More information

R commander an introduction

R commander an introduction R commander an introduction free, user-friendly, and powerful software Ho Kim SCHOOL OF PUBLIC HEALTH, SNU Useful sites R is a free software with powerful tools The Comprehensive R Archives Network http://cran.r-project.org/

More information

R for Libraries. Session 2: Data Exploration. Clarke Iakovakis Scholarly Communications Librarian University of Houston-Clear Lake

R for Libraries. Session 2: Data Exploration. Clarke Iakovakis Scholarly Communications Librarian University of Houston-Clear Lake R for Libraries Session 2: Data Exploration Clarke Iakovakis Scholarly Communications Librarian University of Houston-Clear Lake This work is licensed under a Creative Commons Attribution 4.0 International

More information

PARTE XI: Introduzione all ambiente R - Panoramica

PARTE XI: Introduzione all ambiente R - Panoramica PARTE XI: Introduzione all ambiente R - Panoramica 1 Getting help Most R functions have online documentation. help(topic) documentation on topic?topic id. help.search("topic") search the help system apropos("topic")

More information

Statistical Software Camp: Introduction to R

Statistical Software Camp: Introduction to R Statistical Software Camp: Introduction to R Day 1 August 24, 2009 1 Introduction 1.1 Why Use R? ˆ Widely-used (ever-increasingly so in political science) ˆ Free ˆ Power and flexibility ˆ Graphical capabilities

More information

Regression Models Course Project Vincent MARIN 28 juillet 2016

Regression Models Course Project Vincent MARIN 28 juillet 2016 Regression Models Course Project Vincent MARIN 28 juillet 2016 Executive Summary "Is an automatic or manual transmission better for MPG" "Quantify the MPG difference between automatic and manual transmissions"

More information

University of Torino, Italy Earth Science Department. R Summer School

University of Torino, Italy Earth Science Department. R Summer School University of Torino, Italy Earth Science Department R Summer School COST Action ES0601 HOME: Advances in HOmogenisation MEthods for climate series: an integrated approach September 7 th -10 th, 2009 Physics

More information

Using R for statistics and data analysis

Using R for statistics and data analysis Introduction ti to R: Using R for statistics and data analysis BaRC Hot Topics October 2011 George Bell, Ph.D. http://iona.wi.mit.edu/bio/education/r2011/ Why use R? To perform inferential statistics (e.g.,

More information

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015 STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, tsvv@steno.dk, Steno Diabetes Center June 11, 2015 Contents 1 Introduction 1 2 Recap: Variables 2 3 Data Containers 2 3.1 Vectors................................................

More information

Why use R? Getting started. Why not use R? Introduction to R: It s hard to use at first. To perform inferential statistics (e.g., use a statistical

Why use R? Getting started. Why not use R? Introduction to R: It s hard to use at first. To perform inferential statistics (e.g., use a statistical Why use R? Introduction to R: Using R for statistics ti ti and data analysis BaRC Hot Topics November 2013 George W. Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ To perform inferential

More information

#a- a vector of 100 random number from a normal distribution a<-rnorm(100, mean= 32, sd=6)

#a- a vector of 100 random number from a normal distribution a<-rnorm(100, mean= 32, sd=6) 1 Transition to R Class 3: Basic functions for descriptive statistics and summarizing data Use simple functions and Xapply functions for summarizing and describing data Goals: (1) Summarizing continuous

More information

Introduction to R Statistical Package. Eng. Mohammad Khalaf Dep. of Statistics

Introduction to R Statistical Package. Eng. Mohammad Khalaf Dep. of Statistics Introduction to R Statistical Package Eng. Mohammad Khalaf Dep. of Statistics Introduction R is an integrated suite of software facilities for data manipulation, calculation and graphical display. Among

More information

Stat405. More about data. Hadley Wickham. Tuesday, September 11, 12

Stat405. More about data. Hadley Wickham. Tuesday, September 11, 12 Stat405 More about data Hadley Wickham 1. (Data update + announcement) 2. Motivating problem 3. External data 4. Strings and factors 5. Saving data Slot machines they be sure casinos are honest? CC by-nc-nd:

More information

Package assertr. R topics documented: February 23, Type Package

Package assertr. R topics documented: February 23, Type Package Type Package Package assertr February 23, 2018 Title Assertive Programming for R Analysis Pipelines Version 2.5 Provides functionality to assert conditions that have to be met so that errors in data used

More information

Lab #7 - More on Regression in R Econ 224 September 18th, 2018

Lab #7 - More on Regression in R Econ 224 September 18th, 2018 Lab #7 - More on Regression in R Econ 224 September 18th, 2018 Robust Standard Errors Your reading assignment from Chapter 3 of ISL briefly discussed two ways that the standard regression inference formulas

More information

Data types and structures

Data types and structures An introduc+on to Data types and structures Noémie Becker & Benedikt Holtmann Winter Semester 16/17 Course outline Day 3 Review GeFng started with R Crea+ng Objects Data types in R Data structures in R

More information

Package HomoVert. November 10, 2010

Package HomoVert. November 10, 2010 Package HomoVert November 10, 2010 Version 0.4.1 Date 2010-10-27 Title HomoVert: Functions to convert Gene IDs between species Author Matthew Fero Maintainer Matthew Fero

More information

Loops. An R programmer can determine the order of processing of commands, via use of the control statements; repeat{}, while(), for(), break, and next

Loops. An R programmer can determine the order of processing of commands, via use of the control statements; repeat{}, while(), for(), break, and next Source: https://www.r-exercises.com/2016/06/01/scripting-loops-in-r/ Loops An R programmer can determine the order of processing of commands, via use of the control statements; repeat{, while(), for(),

More information

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113 Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame

More information

Bar Charts and Frequency Distributions

Bar Charts and Frequency Distributions Bar Charts and Frequency Distributions Use to display the distribution of categorical (nominal or ordinal) variables. For the continuous (numeric) variables, see the page Histograms, Descriptive Stats

More information

Data Input/Output. Introduction to R for Public Health Researchers

Data Input/Output. Introduction to R for Public Health Researchers Data Input/Output Introduction to R for Public Health Researchers Common new user mistakes we have seen 1. Working directory problems: trying to read files that R "can't find" RStudio can help, and so

More information

36-402/608 HW #1 Solutions 1/21/2010

36-402/608 HW #1 Solutions 1/21/2010 36-402/608 HW #1 Solutions 1/21/2010 1. t-test (20 points) Use fullbumpus.r to set up the data from fullbumpus.txt (both at Blackboard/Assignments). For this problem, analyze the full dataset together

More information

Exercise 1: Introduction to Stata

Exercise 1: Introduction to Stata Exercise 1: Introduction to Stata New Stata Commands use describe summarize stem graph box histogram log on, off exit New Stata Commands Downloading Data from the Web I recommend that you use Internet

More information

Data input & output. Hadley Wickham. Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University.

Data input & output. Hadley Wickham. Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University. Data input & output Hadley Wickham Assistant Professor / Dobelman Family Junior Chair Department of Statistics / Rice University June 2012 1. Working directories 2. Loading data 3. Strings and factors

More information

An introduction to WS 2015/2016

An introduction to WS 2015/2016 An introduction to WS 2015/2016 Dr. Noémie Becker (AG Metzler) Dr. Sonja Grath (AG Parsch) Special thanks to: Prof. Dr. Martin Hutzenthaler (previously AG Metzler, now University of Duisburg-Essen) course

More information

Getting Started in R

Getting Started in R Getting Started in R Giles Hooker May 28, 2007 1 Overview R is a free alternative to Splus: a nice environment for data analysis and graphical exploration. It uses the objectoriented paradigm to implement

More information

MULTIVARIATE ANALYSIS USING R

MULTIVARIATE ANALYSIS USING R MULTIVARIATE ANALYSIS USING R B N Mandal I.A.S.R.I., Library Avenue, New Delhi 110 012 bnmandal @iasri.res.in 1. Introduction This article gives an exposition of how to use the R statistical software for

More information

Eric Pitman Summer Workshop in Computational Science

Eric Pitman Summer Workshop in Computational Science Eric Pitman Summer Workshop in Computational Science 2. Data Structures: Vectors and Data Frames Jeanette Sperhac Data Objects in R These objects, composed of multiple atomic data elements, are the bread

More information

Cluster Randomization Create Cluster Means Dataset

Cluster Randomization Create Cluster Means Dataset Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters

More information

Basic R Part 1. Boyce Thompson Institute for Plant Research Tower Road Ithaca, New York U.S.A. by Aureliano Bombarely Gomez

Basic R Part 1. Boyce Thompson Institute for Plant Research Tower Road Ithaca, New York U.S.A. by Aureliano Bombarely Gomez Basic R Part 1 Boyce Thompson Institute for Plant Research Tower Road Ithaca, New York 14853-1801 U.S.A. by Aureliano Bombarely Gomez A Brief Introduction to R: 1. What is R? 2. Software and documentation.

More information

Getting Started in R

Getting Started in R Getting Started in R Phil Beineke, Balasubramanian Narasimhan, Victoria Stodden modified for Rby Giles Hooker January 25, 2004 1 Overview R is a free alternative to Splus: a nice environment for data analysis

More information

R Programming Basics - Useful Builtin Functions for Statistics

R Programming Basics - Useful Builtin Functions for Statistics R Programming Basics - Useful Builtin Functions for Statistics Vectorized Arithmetic - most arthimetic operations in R work on vectors. Here are a few commonly used summary statistics. testvect = c(1,3,5,2,9,10,7,8,6)

More information