command.name(measurement, grouping, argument1=true, argument2=3, argument3= word, argument4=c( A, B, C ))

Size: px
Start display at page:

Download "command.name(measurement, grouping, argument1=true, argument2=3, argument3= word, argument4=c( A, B, C ))"

Transcription

1 Tutorial 3: Data Manipulation Anatomy of an R Command Every command has a unique name. These names are specific to the program and case-sensitive. In the example below, command.name is the name of the command. Command names are always followed by a set of parentheses within which the names of data are given, or supplied, and various parameters for running the command are set. In general, the first entry in the parentheses requires that you specify (supply) the dataset. It may be a data object (vector, data frame, matrix) or part of a data object (like a single row or column of numerical values or measurements we will see how to do this later in this tutorial) that you want to analyze. Before running any command for the first time, it is a good idea to look at the help topics to see what type(s) of data needs to be supplied. command.name(vector) command.name(dataframe) command.name(matrix) A command may require multiple measurements or vectors. command.name(measurement1, measurement2) command.name(vector1, vector2) In some commands, the next entry (or next few entries) require that you specify some additional settings with regard to how your data will be handled by the command. In this example, grouping is a categorical variable (perhaps representing experimental treatments) by which to group the measurement values. command.name(measurement, grouping) Most commands in R have multiple parameters that can be used to customize your analyses. These parameters are called arguments and have default settings that are shown in the help topics. There are multiple types of arguments and names of arguments are given in the help topics. Most help topics should provide a list of available options. Some arguments are like on/off switches and only need to be set equal to TRUE or FALSE ( T or F work too). See argument 1 in the example below. Some arguments need to be set equal to a numeric value that corresponds to a certain condition. See argument 2 below. Some arguments need to be set equal to a character string (one or more letters/words in quotation marks). See argument 3 below. If multiple character strings or numbers are required, they must be concatenated with c(), as in argument 4 below. command.name(measurement, grouping, argument1=true, argument2=3, argument3= word, argument4=c( A, B, C )) 1

2 Quickly Review Column Headings To quickly remind yourself what the column names are in your dataset, you can use the head() command to see just the first several rows of your data. head(dataset) Recalling Columns from a Data Frame or Matrix The most reliable way to recall and display columns from a data frame or matrix is to specify both the name of the dataset and the name of the column, separated by a $ symbol. The example code below tells R to display all records (rows) of a particular variable (one column) from a particular dataset. dataset$variable1 dataset$variable2 The utility of this is that it also allows you to specify a specific column from a dataset that you want to analyze with some command. Also, any column can be used to create an independent vector with a new name. command.name(dataset$variable3) new.name=dataset$variable3 command.name(new.name) Attach: Another way to recall columns, but not recommended There is shorter way to recall columns by attaching names of the variables to the data. Variable names will be the column headings (first row of cells, header row) from the dataset. This allows you to recall individual columns of data without having to specify the dataset. To see the names of the variables (columns of a data frame), use the names() command. attach(dataset) names(dataset) Recall and display variables in the dataset by typing the column names. variable1 CAUTION: The attach() method can cause problems if you are working with multiple datasets (read in from separate.csv files) that happen to have some or all of the same column headings. Only the last attached dataset will be recognized, so it s up to you to organize your R code carefully or to give unique names to all columns across all datasets. This is generally not a problem if you are only working with one dataset during an R session. The dataset$variable convention is recommended for avoiding confusion and mistakes in analyses. 2

3 Selecting Values from a Vector (Indexing a Vector) Subsets of data from a vector can generated by indexing the desired positions within the vector. Each value contained in a vector is located in a particular position. The number of positions depends on the length of the vector, or how many values it contains. A set of square brackets directly after the vector name are used to indicate the position(s) of the values you want to select. vector[position] For example, suppose we generate a vector containing the numbers 11 through 20. vector=c(11:20) vector [1] We can see how long the vector is by using the length() command. It tells us how many values are in the vector, and thus how many positions there are. This example vector has 10 values, so it is comprised of 10 positions. length(vector) [1] 10 Now say we want to extract the value in the 4 th position. Use square brackets to specify position 4 of the example vector. It will return the number 14, which is the 4 th value in the vector. vector[4] [1] 14 We can also extract values from multiple positions. Consecutive positions can be specified as a range using a colon. Commas separate non-consecutive positions. Note that we have to group our selected positions together using the c() command. So here we have selected values from the 4 th through 6 th positions, 8 th position, and 10 th position. vector[c(4:6, 8, 10)] [1]

4 Selecting Rows and Columns (Indexing a Data Frame or Matrix) Data frames and matrices can be indexed in a similar way to vectors, but the basic anatomy of the selection code is a little different. The name of the dataset is followed by brackets containing the positions within rows and columns, in that order, and separated by a comma. dataset[row.position, column.position] For clarity, consider the matrix of values below. Note how at the left of each row there is a set of brackets containing a number followed by a comma. These are the positions of each row (similar to how rows are numbered in an Excel spreadsheet). Note how the top of each column has a set of brackets containing a comma followed by a number. These are the positions of each column (similar to how columns are labeled with letters in an Excel spreadsheet). [,1] [,2] [,3] [,4] [1,] [2,] [3,] [4,] To find the dimensions (number of rows and columns) a data frame or matrix has, use the dim() command. The first value returned tells how many rows there are, and the second value returned tells how many columns. So if this matrix was named dataset, we would find its dimensions: dim(dataset) [1] 4 4 The simplest selection would be to extract a single value from matrix. To select the position located in the first row and first column (the value 11), we would use: dataset[1,1] We could extract multiple values from matrix. To select the positions located in the first two rows and first column (the values 11 and 21), we would use: dataset[1:2, 1] To select the positions located in the first two rows and first three columns (the values 11, 12, 13, 21, 22, and 23), we would use: dataset[1:2, 1:3] 4

5 Any selection should be assigned to a new name. You can see your new subset dataset by running the assigned name ( subset in this example). subset=dataset[row.position, column.position] subset It is not necessary to provide selection criteria for both rows and columns if you are only choosing based on one, the other can be left blank, but don t forget the comma! The next two sections will show examples of how indexing can be used in this way. dataset[, column.position] dataset[row.position,] Selecting Dataset Rows (Records) to Create a Subset Sometimes you want to choose a particular subset of rows (records) to analyze separately from the rest of your data. Three common ways to accomplish this are (1) selecting rows based on position, (2) selecting rows based on a numerical variable, and (3) selecting rows based on a categorical (text or character string) variable. 1. Selecting rows (records) based on positions in a data frame or matrix. As we saw in the previous section, a set of consecutive rows can be selected by specifying a range (use a colon). For example, to select the first 30 rows in a dataset: dataset[1:30,] We can also select non-consecutive rows by separating the positions with a comma and using the c() command to group the selected positions together. The third example will select first, fifth, seventh, and ninth rows. dataset[c(1, 5, 7, 9),] We can select groups of consecutive rows. This example will select the first 30 rows in the dataset and the 50 th 60 th rows. dataset[c(1:30, 50:60),] We can select non-consecutive rows and groups of consecutive rows at the same time. This example will select the first, fifth, and tenth rows and the 50 th 60 th rows. dataset[c(1, 5, 10, 50:60),] Note that the order of the positions matters. The selected rows will be placed in the subset in the order specified. 5

6 2. To select certain rows based on a numerical variable contained in one of the columns of the dataset. Selections can be made to extract rows that contain values less than, equal to, or greater than a designated threshold within a particular column. To select rows from the dataset that have a value less than 1 in the column named variable. dataset[dataset$variable<1,] To select rows from the dataset that have a value equal to 1 in the column named variable. Note the use of double equals signs (==) to represent the mathematical operator equal to. dataset[dataset$variable==1,] To select rows from the dataset that have a value greater than 1 in the column named variable. dataset[dataset$variable>1,] To select rows from the dataset that have a value less than or equal to 1 in the column named variable. dataset[dataset$variable<=1,] To select rows from the dataset that have a value greater than or equal to 1 in the column named variable. dataset[dataset$variable>=1,] 3. To select certain rows by a text criterion. If the dataset has a column of categorical variables (text), you can choose for rows that belong to a particular category. Rows that contain the desired text in the column named variable will be selected. The ignore.case argument is used to designate whether the selection is case-sensitive. dataset[grep( text, dataset$variable, ignore.case=true),] Selecting Dataset Columns to Create a Subset Columns can be selected based either on (1) the column positions or (2) by the names of the columns. You can also (3) remove particular columns from a dataset. 1. Selecting based on column position. Note the use of c() to group the selected positions together. Consecutive columns are selected by specifying a range. dataset[, c(1:3)] 6

7 Non-consecutive column selection: dataset[, c(1, 3, 5)] A combination of consecutive and non-consecutive column selections: dataset[, c(1:3, 6, 8)] 2. Selecting based on column names. Place the names of the columns in quotation marks. Use c() to group multiple column name selections together. Spelling and case must exactly match the names in the dataset. dataset[, column1 ] dataset[, c( column1, column2 )] 3. Remove a column from a dataset by preceding the positions with a dash (negative sign). Multiple rows can be deleted by using c(). Note that these removals do not affect the original.csv file saved on your computer. dataset[, -4] dataset[, c(-4, -6)] Selecting Data Based on Rows and Columns Both row and column criteria can be used together to select a subset of data. Any of the selection methods can be combined to make even more specific data selections. Select data that are in rows 1 through 30 and from columns named column1 and column2. dataset[1:30, c( column1, column2 )] Select data from the column named column1, but only for rows 1, 3, 6, and 8 through 12. dataset[c(1, 3, 6, 8:12), column1 ] Select data from columns 1 and 5, but only for records (rows) with a value greater than one in the column named variable. dataset[dataset$variable>1, c(1, 5)] 7

8 Renaming Columns Sometimes it is useful to rename columns in a newly created output dataset or an existing dataset without having to go back to your original data files. The colnames() command requires the name of the dataset in the parentheses, followed by brackets with the column positions of the columns you wish to rename. In the example below, the first column in the dataset will be changed to New.Name. colnames(dataset)[1]= New.Name If you wish to change the names of multiple, consecutive columns, you can specify the range of columns (1:3 in the example below, corresponding to the first three columns of the dataset) and supply multiple new column names using the c(). In this example, the first three columns of the dataset will be renamed to Name1, Name2, and Name3, respectively. colnames(dataset)[1:3]=c( Name1, Name2, Name3 ) Note: This method of renaming columns is only temporary these new names will exist only for the duration of your R session. The original.csv file is unchanged. Merging Two Data Frames The merge() command can be used to merge two data frames based on a common column (commonly by species or plot). If the column by which data frames are to be merged have the same name in both data frames, no other columns have the same name, and the data frames are the same length (same number of rows), then the simplest use of merge() may work. This example takes two data frames (dataset.x and dataset.y) and searches each for a column name common to both, then merges the datasets together row-by-row by matching up the values in the column the datasets have in common. merge(dataset.x, dataset.y) If the columns by which datasets are to be merged have different names, then these names must be specified. This example will merge together two datasets (dataset.x, dataset.y) based on two columns that have different names. merge(dataset.x, dataset.y, by.x= column.x, by.y= column.y, all=true, sort=false) If the datasets are not of the same length (they do not have the same number of rows, or are otherwise not a perfect match), unmatched records will be omitted from the output unless all rows are kept via the all argument. If all=true, unmatched rows will be kept, inserting na values for missing data. 8

9 If sort=false, the output will be organized so that successfully merged records appear first, followed by records from dataset.x that did not match any records in dataset.y, followed by records from dataset.y that did not match any records in dataset.x. Note: Input dataset columns that have the same name prior to the merge will be appended with.x or.y depending on whether they come from dataset.x or dataset.y. Exporting Data from R Data frames, matrices, and other outputs generated in R can be exported to a.csv file to save and use later. If you make modifications to a dataset and want to save your changes to use later, you can generate a.csv file directly from R using the write.csv() command. First, the name of the dataset you want to save is supplied, followed by a file pathname for where you want the file to be saved. Note that the file pathname includes the name of the file that will be created (new.dataset.csv in this example). write.csv(new.dataset, file= /Users/username/Desktop/ folder/new.dataset.csv ) write.csv(new.dataset, file="c:/documents and Settings/ Owner/Desktop/new.dataset.csv") If you have set a working directory, you do not need provide the full pathname, only the name of the file you want to create is required. The resulting file will be saved in the folder you set as your working directory. setwd("/users/johndoe/desktop/") setwd("c:/documents and Settings/Owner/Desktop/") write.csv(dataset, dataset.csv") 9

10 Tutorial Code #read in a.csv file using the full pathname; assigns dataframe to the name data data=read.csv("/users/johndoe/desktop/r_example_dataframe.csv") data #view the dataset head(data) #view headers of the data dataset #change the name of the first column of the data dataset to Sample colnames(data)[1]="sample" head(data) #change the name of the second column of the data dataset to Comm_Type colnames(data)[2]="comm_type" head(data) #remove the first column of the data dataset and assign to a new name, remove1 remove1=data[, -1] remove1 #remove the first three columns of the data dataset and assign to a new name, remove1 remove2=data[, c(-1, -2, -3)] remove2 setwd("/users/johndoe/desktop/") #set working directory example=read.csv("r_example_dataframe.csv") #read in.csv file example #view dataset #take values in the Diversity column of the example dataset and use them to generate a vector named div div=example$diversity div #view the vector 10

11 #take values in the Richness column of the example dataset and generate a vector named rich rich=example[, "Richness"] rich #take the Richness and Plot columns of the example dataset and generate a new dataframe called rich2 rich2=example[, c("richness", "Plot")] rich2 #take values in the first column of the example dataset and use them to generate a vector named plot plot=example[, 1] plot #create a subset of the example dataset called subset1 that contains rows 1 through 16 subset1=example[1:16,] subset1 #create a subset of the example dataset that only contains rows where Diversity values are greater than 4 subset2=example[example$diversity >4,] subset2 #create a subset of the example dataset that only contains rows where Age values are Young subset3=example[grep("y", example$age),] subset3 #create a subset of the example dataset that rows 1 through 16 and rows 25 through 32 subset4=example[c(1:16, 25:32),] subset4 #create a subset of the example dataset that rows 1, 5, 7, and 9 through 14 subset5=example[c(1,5,7,9:14),] subset5 11

12 #create a subset of the example dataset that only contains rows where Richness values are equal to 4 subset6=example[example$richness == 8,] subset6 #create a subset of the example dataset that only contains plots of the Oak community type oakplots=example[grep("oak", example$community, ignore.case=t),] oakplots #create a subset of example that contains only the Richness, Plot, and Age data from rows 1 through 16 crazy=example[1:16, c("richness", "Plot", "Age")] crazy #create a subset of example that contains only the data from rows 1 through 16 and columns 1 through 5 crazy2=example[c(1:16, 25:32), c(1,5)] crazy2 #create a subset of example that contains only the Maple community type data from columns 1 through 5 crazy3=example[grep("maple", example$community), c(1:5)] crazy3 #create a subset of example that contains only the Young plot data from columns 1 through 5 crazy4=example[grep("young", example$age), c(1:5)] crazy4 #Merge two datasets data.x=example[,c(1,5,6)] head(data.x) data.y=example[,c(1,2,3)] head(data.y) merge(data.x, data.y) 12

Assumption 1: Groups of data represent random samples from their respective populations.

Assumption 1: Groups of data represent random samples from their respective populations. Tutorial 6: Comparing Two Groups Assumptions The following methods for comparing two groups are based on several assumptions. The type of test you use will vary based on whether these assumptions are met

More information

R Basics / Course Business

R Basics / Course Business R Basics / Course Business We ll be using a sample dataset in class today: CourseWeb: Course Documents " Sample Data " Week 2 Can download to your computer before class CourseWeb survey on research/stats

More information

3. Data Tables & Data Management

3. Data Tables & Data Management 3. Data Tables & Data Management In this lab, we will learn how to create and manage data tables for analysis. We work with a very simple example, so it is easy to see what the code does. In your own projects

More information

Formulas and Functions

Formulas and Functions Conventions used in this document: Keyboard keys that must be pressed will be shown as Enter or Ctrl. Controls to be activated with the mouse will be shown as Start button > Settings > System > About.

More information

Matlab notes Matlab is a matrix-based, high-performance language for technical computing It integrates computation, visualisation and programming usin

Matlab notes Matlab is a matrix-based, high-performance language for technical computing It integrates computation, visualisation and programming usin Matlab notes Matlab is a matrix-based, high-performance language for technical computing It integrates computation, visualisation and programming using familiar mathematical notation The name Matlab stands

More information

Module 1: Introduction RStudio

Module 1: Introduction RStudio Module 1: Introduction RStudio Contents Page(s) Installing R and RStudio Software for Social Network Analysis 1-2 Introduction to R Language/ Syntax 3 Welcome to RStudio 4-14 A. The 4 Panes 5 B. Calculator

More information

Stat 579: Objects in R Vectors

Stat 579: Objects in R Vectors Stat 579: Objects in R Vectors Ranjan Maitra 2220 Snedecor Hall Department of Statistics Iowa State University. Phone: 515-294-7757 maitra@iastate.edu, 1/23 Logical Vectors I R allows manipulation of logical

More information

University of Alberta

University of Alberta A Brief Introduction to MATLAB University of Alberta M.G. Lipsett 2008 MATLAB is an interactive program for numerical computation and data visualization, used extensively by engineers for analysis of systems.

More information

Reference Guide. Adding a Generic File Store - Importing From a Local or Network ShipWorks Page 1 of 21

Reference Guide. Adding a Generic File Store - Importing From a Local or Network ShipWorks Page 1 of 21 Reference Guide Adding a Generic File Store - Importing From a Local or Network Folder Page 1 of 21 Adding a Generic File Store TABLE OF CONTENTS Background First Things First The Process Creating the

More information

Access Intermediate

Access Intermediate Access 2013 - Intermediate 103-134 Advanced Queries Quick Links Overview Pages AC124 AC125 Selecting Fields Pages AC125 AC128 AC129 AC131 AC238 Sorting Results Pages AC131 AC136 Specifying Criteria Pages

More information

Practical Quantitative Analysis

Practical Quantitative Analysis Manchester University Dealing with Data in R www.research-training.net/2018manchesterma Graeme.Hutcheson@manchester.ac.uk University of Manchester Your MA dissertation data If you are going to be using

More information

SUM - This says to add together cells F28 through F35. Notice that it will show your result is

SUM - This says to add together cells F28 through F35. Notice that it will show your result is COUNTA - The COUNTA function will examine a set of cells and tell you how many cells are not empty. In this example, Excel analyzed 19 cells and found that only 18 were not empty. COUNTBLANK - The COUNTBLANK

More information

Python allows variables to hold string values, just like any other type (Boolean, int, float). So, the following assignment statements are valid:

Python allows variables to hold string values, just like any other type (Boolean, int, float). So, the following assignment statements are valid: 1 STRINGS Objectives: How text data is internally represented as a string Accessing individual characters by a positive or negative index String slices Operations on strings: concatenation, comparison,

More information

BeSt 1.0 Tutorial. by Cuaucihuatl Vital

BeSt 1.0 Tutorial. by Cuaucihuatl Vital BeSt 1.0 Tutorial by Cuaucihuatl Vital (cvital@indiana.edu) Before you start Download version j2sdk1.4.2 of java or later http://java.sun.com/j2se/1.4.2/download.html Follow the instructions for a straight-forward

More information

Logical operators: R provides an extensive list of logical operators. These include

Logical operators: R provides an extensive list of logical operators. These include meat.r: Explanation of code Goals of code: Analyzing a subset of data Creating data frames with specified X values Calculating confidence and prediction intervals Lists and matrices Only printing a few

More information

Using Basic Formulas 4

Using Basic Formulas 4 Using Basic Formulas 4 LESSON SKILL MATRIX Skills Exam Objective Objective Number Understanding and Displaying Formulas Display formulas. 1.4.8 Using Cell References in Formulas Insert references. 4.1.1

More information

Introductory Excel. Spring CS130 - Introductory Excel 1

Introductory Excel. Spring CS130 - Introductory Excel 1 Introductory Excel Spring 2012 CS130 - Introductory Excel 1 Introduction to Excel What is Microsoft Excel? What can we do with Excel? We will do all of these things through the four major parts of the

More information

Computer lab 2 Course: Introduction to R for Biologists

Computer lab 2 Course: Introduction to R for Biologists Computer lab 2 Course: Introduction to R for Biologists April 23, 2012 1 Scripting As you have seen, you often want to run a sequence of commands several times, perhaps with small changes. An efficient

More information

Introduction to MATLAB

Introduction to MATLAB Chapter 1 Introduction to MATLAB 1.1 Software Philosophy Matrix-based numeric computation MATrix LABoratory built-in support for standard matrix and vector operations High-level programming language Programming

More information

An Introduction to Stata Exercise 1

An Introduction to Stata Exercise 1 An Introduction to Stata Exercise 1 Anna Folke Larsen, September 2016 1 Table of Contents 1 Introduction... 1 2 Initial options... 3 3 Reading a data set from a spreadsheet... 5 4 Descriptive statistics...

More information

You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables

You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables Jennie Murack You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables How to conduct basic descriptive statistics

More information

Access Intermediate

Access Intermediate Access 2010 - Intermediate 103-134 Advanced Queries Quick Links Overview Pages AC116 AC117 Selecting Fields Pages AC118 AC119 AC122 Sorting Results Pages AC125 AC126 Specifying Criteria Pages AC132 AC134

More information

Microsoft Access XP Queries. Student Manual

Microsoft Access XP Queries. Student Manual Microsoft Access XP Queries Student Manual Duplication is prohibited without the written consent of The Abreon Group. Foster Plaza 10 680 Andersen Drive Suite 500 Pittsburgh, PA 15220 412.539.1800 800.338.5185

More information

Access 2007: Advanced Instructor s Edition

Access 2007: Advanced Instructor s Edition Access 2007: Advanced Instructor s Edition ILT Series COPYRIGHT Axzo Press. All rights reserved. No part of this work may be reproduced, transcribed, or used in any form or by any means graphic, electronic,

More information

EXCEL 2003 DISCLAIMER:

EXCEL 2003 DISCLAIMER: EXCEL 2003 DISCLAIMER: This reference guide is meant for experienced Microsoft Excel users. It provides a list of quick tips and shortcuts for familiar features. This guide does NOT replace training or

More information

A Tutorial for Excel 2002 for Windows

A Tutorial for Excel 2002 for Windows INFORMATION SYSTEMS SERVICES Writing Formulae with Microsoft Excel 2002 A Tutorial for Excel 2002 for Windows AUTHOR: Information Systems Services DATE: August 2004 EDITION: 2.0 TUT 47 UNIVERSITY OF LEEDS

More information

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview Chapter 888 Introduction This procedure generates D-optimal designs for multi-factor experiments with both quantitative and qualitative factors. The factors can have a mixed number of levels. For example,

More information

Opening a Data File in SPSS. Defining Variables in SPSS

Opening a Data File in SPSS. Defining Variables in SPSS Opening a Data File in SPSS To open an existing SPSS file: 1. Click File Open Data. Go to the appropriate directory and find the name of the appropriate file. SPSS defaults to opening SPSS data files with

More information

CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010

CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010 CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010 Quick Summary A workbook an Excel document that stores data contains one or more pages called a worksheet. A worksheet or spreadsheet is stored in a workbook, and

More information

Introduction to Stata

Introduction to Stata Introduction to Stata Introduction In introductory biostatistics courses, you will use the Stata software to apply statistical concepts and practice analyses. Most of the commands you will need are available

More information

2. INTRODUCTORY EXCEL

2. INTRODUCTORY EXCEL CS130 - Introductory Excel 1 2. INTRODUCTORY EXCEL Fall 2017 CS130 - Introductory Excel 2 Introduction to Excel What is Microsoft Excel? What can we do with Excel? CS130 - Introductory Excel 3 Launch Excel

More information

Getting started with Ms Access Getting Started. Primary Key Composite Key Foreign Key

Getting started with Ms Access Getting Started. Primary Key Composite Key Foreign Key Getting started with Ms Access 2007 Getting Started Customize Microsoft Office Toolbar The Ribbon Quick Access Toolbar Navigation Tabbed Document Window Viewing Primary Key Composite Key Foreign Key Table

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Methods@Manchester Summer School Manchester University July 2 6, 2018 Software and Data www.research-training.net/manchester2018 Graeme.Hutcheson@manchester.ac.uk University of

More information

In this section you will learn some simple data entry, editing, formatting techniques and some simple formulae. Contents

In this section you will learn some simple data entry, editing, formatting techniques and some simple formulae. Contents In this section you will learn some simple data entry, editing, formatting techniques and some simple formulae. Contents Section Topic Sub-topic Pages Section 2 Spreadsheets Layout and Design S2: 2 3 Formulae

More information

Excel Intermediate

Excel Intermediate Excel 2013 - Intermediate (103-124) Multiple Worksheets Quick Links Manipulating Sheets Pages EX16 EX17 Copying Worksheets Page EX337 Grouping Worksheets Pages EX330 EX332 Multi-Sheet Cell References Page

More information

Bulk Provisioning Overview

Bulk Provisioning Overview CHAPTER 8 Bulk provisioning functionality in the Cisco Voice Provisioning Tool allows you to add or modify a large number of users or a large number of phones/device profiles (and the corresponding configuration

More information

Introductory Exercises in Microsoft Access XP

Introductory Exercises in Microsoft Access XP INFORMATION SYSTEMS SERVICES Introductory Exercises in Microsoft Access XP This document contains a series of exercises which give an introduction to the Access relational database program. AUTHOR: Information

More information

Mails : ; Document version: 14/09/12

Mails : ; Document version: 14/09/12 Mails : leslie.regad@univ-paris-diderot.fr ; gaelle.lelandais@univ-paris-diderot.fr Document version: 14/09/12 A freely available language and environment Statistical computing Graphics Supplementary

More information

Fitting a Polynomial to Heat Capacity as a Function of Temperature for Ag. by

Fitting a Polynomial to Heat Capacity as a Function of Temperature for Ag. by Fitting a Polynomial to Heat Capacity as a Function of Temperature for Ag. by Theresa Julia Zielinski Department of Chemistry, Medical Technology, and Physics Monmouth University West Long Branch, J 00764-1898

More information

Introduction to Excel 2013

Introduction to Excel 2013 Introduction to Excel 2013 Copyright 2014, Software Application Training, West Chester University. A member of the Pennsylvania State Systems of Higher Education. No portion of this document may be reproduced

More information

Now I ll turn this over to our presenter, Ellie.

Now I ll turn this over to our presenter, Ellie. Welcome to the webcast about TRAX, the Tool for RSR and ADR XML Generation. My name is Michael Costa, and I am part of the DART Team, one of the Technical Assistance providers responsible for the RSR and

More information

Designing Reports. eivf Designing Reports Note Types 1

Designing Reports. eivf Designing Reports Note Types 1 Designing Reports Designing Reports...1 Note Types...3 Notes...3 Shorthands...3 Quick Note...3 Click N Build...3 Reports (Data Plates )...3 Most commonly use of the Note Types...4 Notes...5 To create a

More information

Tutorial 1: Getting Started with Excel

Tutorial 1: Getting Started with Excel Tutorial 1: Getting Started with Excel Microsoft Excel 2010 Objectives Understand the use of spreadsheets and Excel Learn the parts of the Excel window Scroll through a worksheet and navigate between worksheets

More information

Spreadsheet definition: Starting a New Excel Worksheet: Navigating Through an Excel Worksheet

Spreadsheet definition: Starting a New Excel Worksheet: Navigating Through an Excel Worksheet Copyright 1 99 Spreadsheet definition: A spreadsheet stores and manipulates data that lends itself to being stored in a table type format (e.g. Accounts, Science Experiments, Mathematical Trends, Statistics,

More information

KEYWORDS DDE GETOBJECT PATHNAME CLASS VB EDITOR WITHEVENTS HMI 1.0 TYPE LIBRARY HMI.TAG

KEYWORDS DDE GETOBJECT PATHNAME CLASS VB EDITOR WITHEVENTS HMI 1.0 TYPE LIBRARY HMI.TAG Document Number: IX_APP00113 File Name: SpreadsheetLinking.doc Date: January 22, 2003 Product: InteractX Designer Application Note Associated Project: GetObjectDemo KEYWORDS DDE GETOBJECT PATHNAME CLASS

More information

ECON 502 INTRODUCTION TO MATLAB Nov 9, 2007 TA: Murat Koyuncu

ECON 502 INTRODUCTION TO MATLAB Nov 9, 2007 TA: Murat Koyuncu ECON 502 INTRODUCTION TO MATLAB Nov 9, 2007 TA: Murat Koyuncu 0. What is MATLAB? 1 MATLAB stands for matrix laboratory and is one of the most popular software for numerical computation. MATLAB s basic

More information

download instant at

download instant at CHAPTER 1 - LAB SESSION INTRODUCTION TO EXCEL INTRODUCTION: This lab session is designed to introduce you to the statistical aspects of Microsoft Excel. During this session you will learn how to enter

More information

Introduction to MATLAB

Introduction to MATLAB Introduction to MATLAB Introduction: MATLAB is a powerful high level scripting language that is optimized for mathematical analysis, simulation, and visualization. You can interactively solve problems

More information

R in Linguistic Analysis. Week 2 Wassink Autumn 2012

R in Linguistic Analysis. Week 2 Wassink Autumn 2012 R in Linguistic Analysis Week 2 Wassink Autumn 2012 Today R fundamentals The anatomy of an R help file but first... How did you go about learning the R functions in the reading? More help learning functions

More information

24 - TEAMWORK... 1 HOW DOES MAXQDA SUPPORT TEAMWORK?... 1 TRANSFER A MAXQDA PROJECT TO OTHER TEAM MEMBERS... 2

24 - TEAMWORK... 1 HOW DOES MAXQDA SUPPORT TEAMWORK?... 1 TRANSFER A MAXQDA PROJECT TO OTHER TEAM MEMBERS... 2 24 - Teamwork Contents 24 - TEAMWORK... 1 HOW DOES MAXQDA SUPPORT TEAMWORK?... 1 TRANSFER A MAXQDA PROJECT TO OTHER TEAM MEMBERS... 2 Sharing projects that include external files... 3 TRANSFER CODED SEGMENTS,

More information

A Short Introduction to STATA

A Short Introduction to STATA A Short Introduction to STATA 1) Introduction: This session serves to link everyone from theoretical equations to tangible results under the amazing promise of Stata! Stata is a statistical package that

More information

BaSICS OF excel By: Steven 10.1

BaSICS OF excel By: Steven 10.1 BaSICS OF excel By: Steven 10.1 Workbook 1 workbook is made out of spreadsheet files. You can add it by going to (File > New Workbook). Cell Each & every rectangular box in a spreadsheet is referred as

More information

Introduction to scientific programming in R

Introduction to scientific programming in R Introduction to scientific programming in R John M. Drake & Pejman Rohani 1 Introduction This course will use the R language programming environment for computer modeling. The purpose of this exercise

More information

Order Preserving Triclustering Algorithm. (Version1.0)

Order Preserving Triclustering Algorithm. (Version1.0) Order Preserving Triclustering Algorithm User Manual (Version1.0) Alain B. Tchagang alain.tchagang@nrc-cnrc.gc.ca Ziying Liu ziying.liu@nrc-cnrc.gc.ca Sieu Phan sieu.phan@nrc-cnrc.gc.ca Fazel Famili fazel.famili@nrc-cnrc.gc.ca

More information

Objective 1: Familiarize yourself with basic database terms and definitions. Objective 2: Familiarize yourself with the Access environment.

Objective 1: Familiarize yourself with basic database terms and definitions. Objective 2: Familiarize yourself with the Access environment. Beginning Access 2007 Objective 1: Familiarize yourself with basic database terms and definitions. What is a Database? A Database is simply defined as a collection of related groups of information. Things

More information

Lua - Instruction Manual

Lua - Instruction Manual Table of Contents: General information...... 2 Embedding in "X Train"...... 2 Syntax of Lua... 4 What commands are available...... 5 General Lua commands... 5 "Train X" specific commands...... 6 Functions...

More information

MATLAB Project: Getting Started with MATLAB

MATLAB Project: Getting Started with MATLAB Name Purpose: To learn to create matrices and use various MATLAB commands for reference later MATLAB built-in functions used: [ ] : ; + - * ^, size, help, format, eye, zeros, ones, diag, rand, round, cos,

More information

Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex

Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex Basic Topics: Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex Review ribbon terminology such as tabs, groups and commands Navigate a worksheet, workbook, and multiple workbooks Prepare

More information

ADVANCED INQUIRIES IN ALBEDO: PART 2 EXCEL DATA PROCESSING INSTRUCTIONS

ADVANCED INQUIRIES IN ALBEDO: PART 2 EXCEL DATA PROCESSING INSTRUCTIONS ADVANCED INQUIRIES IN ALBEDO: PART 2 EXCEL DATA PROCESSING INSTRUCTIONS Once you have downloaded a MODIS subset, there are a few steps you must take before you begin analyzing the data. Directions for

More information

Appendix A. Introduction to MATLAB. A.1 What Is MATLAB?

Appendix A. Introduction to MATLAB. A.1 What Is MATLAB? Appendix A Introduction to MATLAB A.1 What Is MATLAB? MATLAB is a technical computing environment developed by The Math- Works, Inc. for computation and data visualization. It is both an interactive system

More information

ICT IGCSE Practical Revision Presentation Web Authoring

ICT IGCSE Practical Revision Presentation Web Authoring 21.1 Web Development Layers 21.2 Create a Web Page Chapter 21: 21.3 Use Stylesheets 21.4 Test and Publish a Website Web Development Layers Presentation Layer Content layer: Behaviour layer Chapter 21:

More information

System Administrator s Handbook

System Administrator s Handbook System Administrator s Handbook www.lamplightdb.co.uk Contents The role of system administrators p.4 Database operators adding, setting permissions and deleting p.5 Lockouts and factor authentication

More information

Business Data Analysis MA0123. Dr Gavin Shaddick Department of Mathematical Sciences 4W 5.7

Business Data Analysis MA0123. Dr Gavin Shaddick Department of Mathematical Sciences 4W 5.7 Business Data Analysis MA0123 Dr Gavin Shaddick Department of Mathematical Sciences g.shaddick@bath.ac.uk 4W 5.7 Lectures and computer labs Two lectures a week (Monday and Friday). One computing lab (time

More information

An Introductory Tutorial: Learning R for Quantitative Thinking in the Life Sciences. Scott C Merrill. September 5 th, 2012

An Introductory Tutorial: Learning R for Quantitative Thinking in the Life Sciences. Scott C Merrill. September 5 th, 2012 An Introductory Tutorial: Learning R for Quantitative Thinking in the Life Sciences Scott C Merrill September 5 th, 2012 Chapter 2 Additional help tools Last week you asked about getting help on packages.

More information

.txt - Exporting and Importing. Table of Contents

.txt - Exporting and Importing. Table of Contents .txt - Exporting and Importing Table of Contents Export... 2 Using Add Skip... 3 Delimiter... 3 Other Options... 4 Saving Templates of Options Chosen... 4 Editing Information in the lower Grid... 5 Import...

More information

Introduction to the workbook and spreadsheet

Introduction to the workbook and spreadsheet Excel Tutorial To make the most of this tutorial I suggest you follow through it while sitting in front of a computer with Microsoft Excel running. This will allow you to try things out as you follow along.

More information

STAT 20060: Statistics for Engineers. Statistical Programming with R

STAT 20060: Statistics for Engineers. Statistical Programming with R STAT 20060: Statistics for Engineers Statistical Programming with R Why R? Because it s free to download for everyone! Most statistical software is very, very expensive, so this is a big advantage. Statisticians

More information

pairwise.t.test(dataset$measurement, dataset$group, p.adj = bonferroni ) TukeyHSD(aov(dataset$measurement~dataset$group))

pairwise.t.test(dataset$measurement, dataset$group, p.adj = bonferroni ) TukeyHSD(aov(dataset$measurement~dataset$group)) Tutorial 9: Comparing Three or More Groups One-way (single-factor) ANOVA (analysis of variance) Used to compare means of 3 or more groups based on a single explanatory (independent) variable, or factor.

More information

2 Spreadsheet Considerations 3 Zip Code and... Tax ID Issues 4 Using The Format... Cells Dialog 5 Creating The Source... File

2 Spreadsheet Considerations 3 Zip Code and... Tax ID Issues 4 Using The Format... Cells Dialog 5 Creating The Source... File Contents I Table of Contents Part 1 Introduction 1 Part 2 Importing from Microsoft Excel 1 1 Overview... 1 2 Spreadsheet Considerations... 1 3 Zip Code and... Tax ID Issues 2 4 Using The Format... Cells

More information

An Introduction to Matlab5

An Introduction to Matlab5 An Introduction to Matlab5 Phil Spector Statistical Computing Facility University of California, Berkeley August 21, 2006 1 Background Matlab was originally developed as a simple interface to the LINPACK

More information

Microsoft Access XP (2002) - Advanced Queries

Microsoft Access XP (2002) - Advanced Queries Microsoft Access XP (2002) - Advanced Queries Group/Summary Operations Change Join Properties Not Equal Query Parameter Queries Working with Text IIF Queries Expression Builder Backing up Tables Action

More information

Activity: page 1/10 Introduction to Excel. Getting Started

Activity: page 1/10 Introduction to Excel. Getting Started Activity: page 1/10 Introduction to Excel Excel is a computer spreadsheet program. Spreadsheets are convenient to use for entering and analyzing data. Although Excel has many capabilities for analyzing

More information

EGR 111 Introduction to MATLAB

EGR 111 Introduction to MATLAB EGR 111 Introduction to MATLAB This lab introduces the MATLAB help facility, shows how MATLAB TM, which stands for MATrix LABoratory, can be used as an advanced calculator. This lab also introduces assignment

More information

Applied Calculus. Lab 1: An Introduction to R

Applied Calculus. Lab 1: An Introduction to R 1 Math 131/135/194, Fall 2004 Applied Calculus Profs. Kaplan & Flath Macalester College Lab 1: An Introduction to R Goal of this lab To begin to see how to use R. What is R? R is a computer package for

More information

Excel Shortcuts Increasing YOUR Productivity

Excel Shortcuts Increasing YOUR Productivity Excel Shortcuts Increasing YOUR Productivity CompuHELP Division of Tommy Harrington Enterprises, Inc. tommy@tommyharrington.com https://www.facebook.com/tommyharringtonextremeexcel Excel Shortcuts Increasing

More information

Getting started with Minitab 14 for Windows

Getting started with Minitab 14 for Windows INFORMATION SYSTEMS SERVICES Getting started with Minitab 14 for Windows This document provides an introduction to the Minitab (Version 14) statistical package. AUTHOR: Information Systems Services, University

More information

Creating a data file and entering data

Creating a data file and entering data 4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that

More information

What is MATLAB and howtostart it up?

What is MATLAB and howtostart it up? MAT rix LABoratory What is MATLAB and howtostart it up? Object-oriented high-level interactive software package for scientific and engineering numerical computations Enables easy manipulation of matrix

More information

Welcome to the webcast about the Tool for RSR and ADR XML Generation or TRAX. My name is Michael Costa, and I am part of the DART Team, one of the

Welcome to the webcast about the Tool for RSR and ADR XML Generation or TRAX. My name is Michael Costa, and I am part of the DART Team, one of the Welcome to the webcast about the Tool for RSR and ADR XML Generation or TRAX. My name is Michael Costa, and I am part of the DART Team, one of the technical assistance providers responsible for the RSR

More information

Computer Vision. Matlab

Computer Vision. Matlab Computer Vision Matlab A good choice for vision program development because Easy to do very rapid prototyping Quick to learn, and good documentation A good library of image processing functions Excellent

More information

Microsoft Office Excel Use Excel s functions. Tutorial 2 Working With Formulas and Functions

Microsoft Office Excel Use Excel s functions. Tutorial 2 Working With Formulas and Functions Microsoft Office Excel 2003 Tutorial 2 Working With Formulas and Functions 1 Use Excel s functions You can easily calculate the sum of a large number of cells by using a function. A function is a predefined,

More information

6 Subscripting. 6.1 Basics of Subscripting. 6.2 Numeric Subscripts. 6.3 Character Subscripts

6 Subscripting. 6.1 Basics of Subscripting. 6.2 Numeric Subscripts. 6.3 Character Subscripts 6 Subscripting 6.1 Basics of Subscripting For objects that contain more than one element (vectors, matrices, arrays, data frames, and lists), subscripting is used to access some or all of those elements.

More information

Open Office Calc (Spreadsheet) Tutorial

Open Office Calc (Spreadsheet) Tutorial Open Office Calc (Spreadsheet) Tutorial Table of Contents Introduction...3 What is a Spreadsheet?...3 Starting OpenOffice Calc...3 OpenOffice Calc (Spreadsheet) Basics...4 Creating a New Document...5 Entering

More information

CSV Import Guide. Public FINAL V

CSV Import Guide. Public FINAL V CSV Import Guide FINAL V1.1 2018-03-01 This short guide demonstrates how to prepare and open a CSV data file using a spreadsheet application such as Excel. It does not cover all possible ways to open files.

More information

addition + =5+C2 adds 5 to the value in cell C2 multiplication * =F6*0.12 multiplies the value in cell F6 by 0.12

addition + =5+C2 adds 5 to the value in cell C2 multiplication * =F6*0.12 multiplies the value in cell F6 by 0.12 BIOL 001 Excel Quick Reference Guide (Office 2010) For your lab report and some of your assignments, you will need to use Excel to analyze your data and/or generate graphs. This guide highlights specific

More information

OUTLINES. Variable names in MATLAB. Matrices, Vectors and Scalar. Entering a vector Colon operator ( : ) Mathematical operations on vectors.

OUTLINES. Variable names in MATLAB. Matrices, Vectors and Scalar. Entering a vector Colon operator ( : ) Mathematical operations on vectors. 1 LECTURE 3 OUTLINES Variable names in MATLAB Examples Matrices, Vectors and Scalar Scalar Vectors Entering a vector Colon operator ( : ) Mathematical operations on vectors examples 2 VARIABLE NAMES IN

More information

chapter 2 G ETTING I NFORMATION FROM A TABLE

chapter 2 G ETTING I NFORMATION FROM A TABLE chapter 2 Chapter G ETTING I NFORMATION FROM A TABLE This chapter explains the basic technique for getting the information you want from a table when you do not want to make any changes to the data and

More information

Running Minitab for the first time on your PC

Running Minitab for the first time on your PC Running Minitab for the first time on your PC Screen Appearance When you select the MINITAB option from the MINITAB 14 program group, or click on MINITAB 14 under RAS you will see the following screen.

More information

Input/Output of data and file manipulation in SMath Studio Prepared by Gilberto E. Urroz, September 2009

Input/Output of data and file manipulation in SMath Studio Prepared by Gilberto E. Urroz, September 2009 Input/Output of data and file manipulation in SMath Studio Prepared by Gilberto E. Urroz, September 2009 SMath Studio provides functions wfile, rfile, and dfile for output into a file, input from a file,

More information

INTRODUCTION... 1 UNDERSTANDING CELLS... 2 CELL CONTENT... 4

INTRODUCTION... 1 UNDERSTANDING CELLS... 2 CELL CONTENT... 4 Introduction to Microsoft Excel 2016 INTRODUCTION... 1 The Excel 2016 Environment... 1 Worksheet Views... 2 UNDERSTANDING CELLS... 2 Select a Cell Range... 3 CELL CONTENT... 4 Enter and Edit Data... 4

More information

SPARK-PL: Introduction

SPARK-PL: Introduction Alexey Solovyev Abstract All basic elements of SPARK-PL are introduced. Table of Contents 1. Introduction to SPARK-PL... 1 2. Alphabet of SPARK-PL... 3 3. Types and variables... 3 4. SPARK-PL basic commands...

More information

Chapter 5. Repetition. Contents. Introduction. Three Types of Program Control. Two Types of Repetition. Three Syntax Structures for Looping in C++

Chapter 5. Repetition. Contents. Introduction. Three Types of Program Control. Two Types of Repetition. Three Syntax Structures for Looping in C++ Repetition Contents 1 Repetition 1.1 Introduction 1.2 Three Types of Program Control Chapter 5 Introduction 1.3 Two Types of Repetition 1.4 Three Structures for Looping in C++ 1.5 The while Control Structure

More information

EXCEL 98 TUTORIAL Chemistry C2407 fall 1998 Andy Eng, Columbia University 1998

EXCEL 98 TUTORIAL Chemistry C2407 fall 1998 Andy Eng, Columbia University 1998 Created on 09/02/98 11:58 PM 1 EXCEL 98 TUTORIAL Chemistry C2407 fall 1998 Andy Eng, Columbia University 1998 Note for Excel 97 users: All features of Excel 98 for Macintosh are available in Excel 97 for

More information

Intermediate Algebra. Gregg Waterman Oregon Institute of Technology

Intermediate Algebra. Gregg Waterman Oregon Institute of Technology Intermediate Algebra Gregg Waterman Oregon Institute of Technology c 2017 Gregg Waterman This work is licensed under the Creative Commons Attribution 4.0 International license. The essence of the license

More information

Functions in Excel. Structure of a function: Basic Mathematical Functions. Arithmetic operators: Comparison Operators:

Functions in Excel. Structure of a function: Basic Mathematical Functions. Arithmetic operators: Comparison Operators: Page1 Functions in Excel Formulas (functions) are equations that perform calculations on values in your spreadsheet. A formula always starts with an equal sign (=). Example: =5+2*7 This formula multiples

More information

Draft Proof - do not copy, post, or distribute DATA MUNGING LEARNING OBJECTIVES

Draft Proof - do not copy, post, or distribute DATA MUNGING LEARNING OBJECTIVES 6 DATA MUNGING LEARNING OBJECTIVES Describe what data munging is. Demonstrate how to read a CSV data file. Explain how to select, remove, and rename rows and columns. Assess why data scientists need to

More information

Mail Merge Quick Reference Guide

Mail Merge Quick Reference Guide Mail Merge Letters To mail merge letters two documents are needed: 1. The letter, including all text that does not change. 2. Recipient names and addresses (a) The document containing recipient names and

More information

STAT 113: R/RStudio Intro

STAT 113: R/RStudio Intro STAT 113: R/RStudio Intro Colin Reimer Dawson Last Revised September 1, 2017 1 Starting R/RStudio There are two ways you can run the software we will be using for labs, R and RStudio. Option 1 is to log

More information

An Introduction to R- Programming

An Introduction to R- Programming An Introduction to R- Programming Hadeel Alkofide, Msc, PhD NOT a biostatistician or R expert just simply an R user Some slides were adapted from lectures by Angie Mae Rodday MSc, PhD at Tufts University

More information

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction In this exercise, we will learn how to reorganize and reformat a data

More information