2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;

Size: px
Start display at page:

Download "2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments;"

Transcription

1 A. Goals of Exercise Biology 164 Laboratory Using Comparative Statistics in Biology "Statistics" is a mathematical tool for analyzing and making generalizations about a population from a number of individual observations. Statistics provides a shorthand way of describing and comparing all sorts of scientific data, so that we can confidently draw conclusions based on quantitative information. In using comparative statistics we are 1) trying to evaluate goodness of fit between observed and expected numbers (e.g., Chi-square test), or 2) trying to determine if there is a statistically significant difference between experimental groups (e.g., t-test, Wilcoxon ranked-sum test, etc.). The purpose of this exercise is to 1) introduce you to some of the basic concepts of probability, statistics, and hypothesis testing; 2) familiarize you with a variety of comparative statistics biologists use to evaluate results of experiments; 3) provide an opportunity for you to use the statistical software package, Stata. 4) use your statistical knowledge to evaluate the results of an experiment to determine the effects of light intensity on three growth characteristics of Brassica rapa. B. Introduction to Comparative Statistics Samples and Populations A sample consists of a set of observations drawn from a larger set, the population, about which information is desired. For example, a campus survey might not include every student at Colby, but to be more manageable would most likely involve a sample of the total student body chosen at random, say, a few hundred students, depending upon resources available for conducting the survey. The object of statistics is to make inferences about a population based on information gathered from a sample. Variability and Experimental Design The design of an experiment is essentially based upon a sequence of steps written prior to the start of the experiment that helps ensure that adequate and appropriate data will be gathered to provide an objective and repeatable analysis of the question at hand. Several experimental designs may be possible, and the best design will result in the greatest amount of useful information for the amount of time, money, and other limitations involved. With careful use of controls, a good experimental design allows variation resulting from a biological effect to be distinguished from background natural chance variation. Statistics is the mathematical tool used to make the distinction between chance variation and the variation for which we have designed the study to investigate. Replication and Randomness One of the things to consider when designing an experiment is the variation associated with the observations. Each observation within a sample will be different from every other observation to some degree. Therefore, it is logical to have replicates of each observation, and of the experimental treatments that will be applied. In other words, samples must be large enough to adequately account for the variability among observations within each sample, and there should be more than one sample to which a treatment is applied to account for the natural variability among samples from the same population. These concepts are generally addressed by including replication in the experimental design. Another issue that needs to be addressed when designing an experiment is that of randomness. Human nature will lend some degree of bias to the selection of observations that are included in a sample unless steps are taken to ensure that observational units are selected at random from the original Comparative Statistics in Biology Page 1

2 population. The idea is to have as broad a scope of inference as possible by selecting a sample that most accurately reflects the amount of variability found in the original population. Obviously, sample size is also important, we wouldn t want to base conclusion on only a handful of observations, but one must be careful to be unbiased in sampling, i.e., not sampling from only the most conspicuous plants or the most out-going individuals. Hypothesis Testing A hypothesis is a statement of belief, which may or may not be true, about a population. The test of a hypothesis is a comparison using objectively collected facts. If these facts are in agreement with the stated belief, the hypothesis is accepted. If these facts do not agree with the statement of belief, the hypothesis is rejected. The hypothesis that is tested in a statistical analysis is called the null (no difference) hypothesis, and its opposite is called the alternative hypothesis. These two hypotheses cover all the possible outcomes of an experiment, and since they are opposites of one another, only one of them will be supported by the statistical test performed. All statistical analyses are designed to calculate a probability of the null hypothesis being true. If the probability calculated is greater than.05 (5%), we conclude that the variation seen in the study can be attributed to chance natural variability. By convention the null hypothesis is only rejected as false if the probability of it being true is equal to or less than.05. When the null hypothesis has been rejected, the alternative hypothesis is accepted as being true, meaning that there is a statistically significant difference in the values being compared. An important point to note about using statistics for hypothesis testing is that a statistical analysis is never able to prove or disprove a hypothesis in the mathematical sense. The reason for this is that all statistical analyses are merely based on a portion (e.g., a sample) of the entire population, not the entire population itself. It is important to remember that statistics are only used to support or refute hypotheses, not prove them. For example, if one wanted to prove or disprove that brown was the most common color of M&M s, it would not be sufficient to statistically analyze a sample represented by a number of packages purchased at your local store. What would be required to prove such a fact? Ask your instructor if the answer is not obvious. Thinking that statistics actually prove something is one of the greatest misconceptions that students have about using statistics!! Types of Variables In experimental design, the experimenter manipulates one of the variables of a study to determine whether the variable affects the outcome of the study. The variable so designated is known as the independent variable. The other measured variables in the study are considered to be the dependent variables. For example, in an experiment to determine how differing brands of fertilizer affect the yield of corn, the brand of fertilizer represents the independent variable while the corn yield that is measured in the study represents the dependent variable. In statistics variables may be either qualitative (also called discreet or grouping), or quantitative (also called continuous). When variables are classified according to some attribute or character they possess, rather than by a numerical measurement, they are considered to be qualitative because their values fall into one or more discreet categories. Some authors refer to qualitative variables as nominal variables, since they often represent names of things. Qualitative variables correspond to independent variable of an experiment. In contrast,a continuous variable is one that can, in theory, take on any value that can be expressed numerically. Quantitative variables correspond to the dependent variable. Frequency Distributions For many biological populations a typical shape for a frequency distribution of the values of a continuous variable is that of a bell-shaped curve, or normal distribution. In fact, a great many continuous variables are normally distributed, and if the shape of the frequency distribution is unknown, it is often assumed to be "normal", especially if we have only a small sample from which to estimate the true population frequency distribution. The characteristic features of a normal distribution are a central peak in the frequency distribution and symmetric variation on either side of the peak. When a population has a normal distribution, inferences about the population can be made using certain parameters of the sample as a mathematical guide. Comparative Statistics in Biology Page 2

3 Parametric and Non-Parametric Statistics Two statistical parameters that are commonly calculated for a sample are 1) the mean ( x), a measure of central tendency, and 2) the standard deviation (SD), a measure of the dispersion of the sample on either side of the mean. When the mean and standard deviation have been calculated for a sample, they provide an estimate of the shape of the frequency distribution for the population. By definition ( x) ± 1 SD includes approximately 90% of the population, ( x) ± 2 SD includes approximately 95% of the population, and ( x) ± 3 SD includes approximately 99% of the population. Another dispersion parameter commonly measured for a normal distribution is the standard error of the mean, often simply called the standard error (SE). The standard error is calculated by dividing the sample standard deviation by the square root of the sample size, N. The standard error is useful for predicting the value of the population mean in relation to the sample mean. By definition there is approximately a 90% probability that the population mean lies within 1 SE of the sample mean, a 95% probability that the population mean lies within 2 SE of the sample mean, and a 99% probability that the population mean lies within 3 SE of the sample mean. All parametric statistical tests require that the population have a normal distribution, and that the distribution is reasonably symmetrical. If it is suspected that a population does not meet the criteria of a normal distribution, sample parameters like mean and standard deviation are useless for making inferences about the population. A different approach, known as non-parametric statistics, is used that does not assume a normal distribution for the population. Non-parametrics employ alternative approaches such as comparing sequentially ranked observations made from different groups. In general, for each parametric statistical test there is a corresponding non-parametric equivalent that can be used when the distribution of the population is unknown or is expected to have a distribution other than a normal distribution. Very commonly count data or proportion data (e.g., decimal values falling between 0 and 1) do not follow a normal distribution, and hence should be analyzed using non-parametrics. C. Guide to Using Stata Moving Datasets from Excel to Stata Stata is a powerful software package, but its user interface can be a bit intimidating to first-time users, especially regarding data entry. Fortunately, data files created in Excel can be easily copied and pasted into the Stata data editor. Thus, in Biology lab settings, we generally create and manage data files using Excel, and then transfer the data to Stata when we need to perform statistical analyses. Sorting Data in Excel It is occasionally useful to sort data. Sorting is particularly useful when the data file contains measurements from two or more experimental treatments and you would like to generate summary descriptive statistics from each of the experimental treatments separately. To sort data locate the Sort command under the Data menu. Excel allows you to sort using any of the variables as sorting keys. For example if you choose your experimental treatment variable as the sorting key, the data will be sorted such that all the measurements of a given experimental treatment will be grouped together. Nested sorting can be accomplished by selecting more that one sorting key. Nested sorting is useful if you want to sort data by experimental treatment, having the measurements within a treatment ranked from lowest to highest values. To perform this type of nested sort, you would select the experimental treatment variable as the first key, and the measured variable as the second key. Preparing Data for Use in Stata Stata is particular in regards to how your data is organized. If you remember the earlier section on Types of Variables, you should realize that each experimental observation consists of a measured variable (the dependent variable) and a corresponding independent variable (the variable manipulated by the experimenter). For instance, if you have records of fish body length size for two different ponds, one column would be designated for the body length measurements (the dependent variable), while the other column would be designated for ponds from which the fish were obtained, Johnson and Messalonskee (the independent variable). Comparative Statistics in Biology Page 3

4 It is generally easier to get the data into the correct format in Excel before moving it to Stata. Using the above fish example, a researcher may have found it easiest to record the fish length measurements using one column for Johnson Pond and one column for Messalonskee Pond. Before moving to Stata however, it is best to rearrange the data so that there is a single column for all the fish lengths, with a second corresponding column that indicates from which pond each measurement came. Stata Windows When you launch Stata you should see something similar to the following. Note that Stata has four basic windows: 1. The Stata Results window will display all the commands issued to Stata, as well as all the output resulting from these commands (with the exception of graphs). 2. The Stata Command window is used to type commands directly into Stata. Most of what you will be doing will be done through the drop down menus. One key command drop _all can be used to clear the data editor spreadsheet. 3. The Review window keeps a running history of all the commands that you have issued Stata during your session. To recall a command, simply click on it in this window. 4. The Variables window shows you all the variables that you currently have in your dataset. If you should ever lose a window, click on Window and choose the window name that you want to appear. There are other windows in Stata, but the only other one you re likely to encounter is the Graph window. This window pops up anytime you generate a graph. You can right-click on a graph for printing and copying options. Graphs can be copied in Stata and pasted into any Microsoft Office application (such as Word). Oftentimes you want to keep a running history of all your commands and the output from those commands. You can do this by using a log file. To begin a log file click on the button and choose where you would like to save the file (use the.log extension rather than the.scml extension). This file is a text file and can be opened in Notepad or any word processing package. Comparative Statistics in Biology Page 4

5 Copying an Excel Spreadsheet into Stata Reading data from an Excel spreadsheet is straightforward. Make sure that the variables are in the columns with each variable name at the top of each column. As mentioned above, make sure that for each dependent variable column there is a corresponding independent variable column. Otherwise, Stata will not be able to use your data!! Highlight the data including the first row of variable names and copy to the clipboard. Open the Data Editor in Stata by clicking on the button. You will see an empty spreadsheet (if not then you already have a Stata file open). Click on the upper left cell and select paste. You can click on the x in the upper-left corner to exit. The data will be preserved in the temporary memory, but will not be saved until you choose File > Save As from the menu and enter the name under which you wish to save it. You cannot run any statistical tests unless you close the Data Editor window first. Entering Data in Stata by Hand (SKIP TO PAGE 6 IF YOU RE USING EXCEL DATAFILES) Open the data editor in Stata by clicking on the button. Click on the upper-left cell of the empty spreadsheet and begin entering the data by putting each variable in its own column (each row is a separate observation). Stata names the variables var1, var2, var3, etc. by default. To give the variables more meaningful names, double-click on the column you want to change and Variable Properties box will appear. You can also enter a longer, more descriptive, label for the variable in the box. Note that Stata also decides what format the variable should be in. This can take on many different forms and it is usually best to let it default. Changing Data Values Once They are Entered If you want to change any numbers that are in your data set, open the editor and click on the cell you want to change. Simply enter the new number and press enter. When you close the editor window, Stata will ask if you want to preserve the changes. If you click accept changes your change(s) will be applied. At this point, only the data in the local memory have been changed. If you want to save the data set to the disk then you must select File > Save As. If you click discard changes no changes will be made. Labeling Categorical Variables If you have one or more categorical variables (variables that take only a finite number of distinct categories) you may want to label the values in them. For example, if your dataset includes a variable called treetype that can take on two values (1 for deciduous and 2 for coniferous), then you will see a series of 1s and 2s in the data set. To give these numbers meaningful labels you can select Data > Labels > Label values > Define or modify label values. Click on Define to define a new label and give the label a name. A new box will pop up asking for you to give the numerical value of one of the categories. It will also ask you to give the label you wish to assign to that numerical value. Keep entering labels until you have exhausted all the categories, then click cancel. To actually assign the labels you just created to a variable, select Data > Labels > Label values > Assign value labels to variable. Select the label name you just created and the variable you wish to apply them to and hit enter. Comparative Statistics in Biology Page 5

6 The Example Dataset For this exercise you will use the Brassica Light Study.xls dataset. Open the data file and examine the data. These data were generated by Intro Bio students who investigated the effects of growing plants for three weeks at two different light levels. After the growth period, the students measured plant height, plant weight, and the number of petiole trichomes produced on the first true leaf of each plant. Note that the first column corresponds to the initials of students entering the data, the second column corresponds to the two light levels used in the study, third column = plant height, fourth column = plant weight, and fifth column = # of petiole trichomes. Exercise Question #1: Which are the independent and dependent variables for this study? Examining the Frequency Distribution of the Data In order to decide whether parametric or non-parametric statistics are appropriate for a comparative statistical analysis, the frequency distribution of the sample must be examined. Copy the data from Brassica Light Study.xls into Stata s Data Editor, preserve it, then close the Data Editor. From the Graphics menu select Histogram. The following dialog box will appear: Note that the histogram dialog window consists of an upper tab bar, corresponding to a number of subwindows. Note how the Main sub-window is highlighted. We ll start with the Main window and move to other sub-windows as necessary. In the Variable box, select the particular variable for which you want to generate a frequency distribution. The variable chosen must be a dependent variable, and you can only choose one variable at a time. From the Y axis box, select the Frequency button. Take a look at the Bins box for a moment. Bins refer to the size of each interval of values on the X axis of the frequency distribution. Stata uses a default of 10 bins, and determines the size of each interval (bin) by dividing the range of values in the distribution by 10. We will go with 10 bins for now, but note that if you ever want to change the size and number of bins, this is where to do it. If we generated the frequency histogram without setting further criteria, the distribution would include all of the measurements of the study for the particular variable chosen. However, we would like to see separate frequency distributions for each of the conditions of the independent variable. Thus we need to set further criteria, and for that we need to use the By sub-window. Comparative Statistics in Biology Page 6

7 Select the By sub-window, and the following dialog box will appear: Click the Draw subgraphs for unique values of variables box, and for Variables select the data column that contains the independent variable. We will not make any changes to other sub-windows, but take a look at the Y axis and X axis subwindows. These are the sub-windows in which you can control the scale and other properties of the axes. When finished click the OK button, and a histogram of the frequency distribution for the dependent variable chosen will be displayed. If you would like to save the histogram for inserting into a Word document, perform a Save As.. and use the.png format. The saved histogram can then be inserted as a picture from file into your Word document. Do either of the frequency distributions shown in the histogram resemble a normal distribution? The distribution does not have to be a perfect bell shape curve, but there should be some suggestion of central tendency, and some suggestion of symmetry of the tails. If one can make a reasonable case upon examination that the population is normally distributed, parametric tests should be used because of the greater statistical robustness they offer. Otherwise non-parametrics must be used. Go back to the Graphics menu and create histograms for the other two dependent variables in the study. The only changes you need to make are in the Variables box of the Main sub-window, since all previous selections are retained. Exercise Question #2: For which of the three growth characters measured can you make a reasonable case for a normal distribution? Do any of the growth characters exhibit an obvious non-normal distribution? Exercise Question #3: Can the same comparative statistical test be used to analyze all three of the growth characters? Explain your reasoning. Comparative Statistics in Biology Page 7

8 Parametric Tests t-test: Comparing Two Sample Groups The unpaired t-test is a test of the null hypothesis (i.e. no difference between means) used to compare one sample mean to another sample mean. The "t" refers to the "t-distribution," which is a specific normal-type distribution. We allow for three possibilities with an unpaired t-test. The test is two-tailed if one sample mean could theoretically be smaller or larger than the other sample mean to which it is being compared. In a two-tailed t-test we test for no difference between sample means (the null hypothesis). We use a one-tailed test if we are only concerned about the sample mean deviating in one direction. Thus, in a one-tailed t-test two possibilities are tested for: sample mean A is larger than sample mean B, and sample mean A is smaller than sample mean B. To begin a t-test go to the Statistics menu, and select Summaries, tables & tests, then select Classical tests of hypotheses, and then Two group mean comparison test. You should see the following dialog box: Select a dependent variable for the Variable Name box, and the independent variable (sometimes referred to as the grouping variable ) for the Group variable name box. To get more accurate results check the boxes for Unequal variances and Welch s approximation, then click OK. A table similar to the one below will appear in the Results window. Note that the first line (in white text) indicates the names of the dependent and independent variables selected for the t-test, while the table contains values for the statistical parameters of central tendency and dispersion. The middle p-value at the bottom of the window is for a two-tailed t-test, while the other p-values are for the corresponding one-tailed t-tests. To interpret the meaning of the central p-value, refer to the section on Hypothesis Testing on page 2. Comparative Statistics in Biology Page 8

9 Nonparametric Tests Nonparametrics test hypotheses about data for which the underlying distribution cannot be assumed to be normal. Keep in mind when summarizing such data that you must steer clear of statistical parameters such as mean and standard error since those statistics are meaningless for a population lacking a normal distribution. For such data it is customary to report the Minimum and Maximum values observed along with the Median, the value that lies midway in the distribution (see below for details on how to obtain these descriptive statistics in Stata). The use of nonparametric tests in Stata is very straightforward and similar to performing parametric tests. However, one important difference with nonparametric tests is that they only provide the p-value of the analysis and do not provide any summary descriptive statistics (as the parametric tests do). The Wilcoxon ranked-sum test (also known as the Mann Whitney test) is the nonparametric equivalent of the unpaired t-test. To begin this nonparametric test go to the Statistics menu, and select Summaries, tables & tests, then select Nonparametric tests of hypotheses, and then Wilcoxon ranked-sum test. You should see the following dialog box: Select a dependent variable for the Variable box, and an independent variable for the Grouping variable box, then click OK. In the table that appears in the Results window, Prob is the p-value of the null hypothesis. To interpret the meaning of the p-value, refer to Hypothesis Testing on page 2. Obtaining a Summary Table of Descriptive Statistics To get a quick, convenient summary of descriptive stats for your dataset in tabular form, go to the Statistics menu, and select Summaries, tables & tests, then select Tables, and then Table of summary statistics (tabstat). You should get a dialog box that looks like the following: For the Variables box you can select as many dependent variables as you like. Check the Group statistics by variable box, then select the independent variable in the pull-down menu. In the Statistics to display portion of the window you can choose up to eight different statistics to display in the table. For this example please choose the following four (you need to first click each box in order to activate each pull-down menu): Count, Minimum, Maximum, and Median. Click OK to complete your selection. Comparative Statistics in Biology Page 9

10 You should get a table that looks like the following: Note that the header line of the Results window indicates the choice of summary statistics you have made and the names of the variables used in the table. These are also indicated in the table itself, though the variable names have unfortunately been severely truncated. Nevertheless, such a table can be very handy, and remember you can include up to eight different summary statistics in the table! You now have the tools to answer the final question of this exercise: Exercise Question #4: How did light intensity affect the three growth characters measured? Were statistically significant differences seen between different light intensities for all the growth characters? Explain briefly. D. Homework Assignment: Summarizing the Results of the Brassica Light Study Write a Results section that summarizes the findings of this study. Use the questions listed above as a guide, and support your answers using the results from the appropriate statistical test. The Results section should include a textual summary along with properly labeled summary tables or figures where appropriate. The reporting of statistical results and organization of tables should adhere to the guidelines described in the Biology 164 Style Guide for Papers and Lab Reports. You may work on the assignment individually or with your lab partner. If working with a lab partner, submit one assignment with both of your names on it. The completed assignment is due at the beginning of lab next week. Comparative Statistics in Biology Page 10

Descriptive Statistics, Standard Deviation and Standard Error

Descriptive Statistics, Standard Deviation and Standard Error AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.

More information

Statistics with a Hemacytometer

Statistics with a Hemacytometer Statistics with a Hemacytometer Overview This exercise incorporates several different statistical analyses. Data gathered from cell counts with a hemacytometer is used to explore frequency distributions

More information

Data Analysis Guidelines

Data Analysis Guidelines Data Analysis Guidelines DESCRIPTIVE STATISTICS Standard Deviation Standard deviation is a calculated value that describes the variation (or spread) of values in a data set. It is calculated using a formula

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

Frequency Distributions

Frequency Distributions Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. + What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and

More information

An introduction to plotting data

An introduction to plotting data An introduction to plotting data Eric D. Black California Institute of Technology February 25, 2014 1 Introduction Plotting data is one of the essential skills every scientist must have. We use it on a

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data Using Excel for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters. Graphs are

More information

Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9

Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9 Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9 Contents 1 Introduction to Using Excel Spreadsheets 2 1.1 A Serious Note About Data Security.................................... 2 1.2

More information

CHAPTER 6. The Normal Probability Distribution

CHAPTER 6. The Normal Probability Distribution The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit

More information

Excel Tips and FAQs - MS 2010

Excel Tips and FAQs - MS 2010 BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my

More information

1 Introduction to Using Excel Spreadsheets

1 Introduction to Using Excel Spreadsheets Survey of Math: Excel Spreadsheet Guide (for Excel 2007) Page 1 of 6 1 Introduction to Using Excel Spreadsheets This section of the guide is based on the file (a faux grade sheet created for messing with)

More information

Univariate Statistics Summary

Univariate Statistics Summary Further Maths Univariate Statistics Summary Types of Data Data can be classified as categorical or numerical. Categorical data are observations or records that are arranged according to category. For example:

More information

Minitab 17 commands Prepared by Jeffrey S. Simonoff

Minitab 17 commands Prepared by Jeffrey S. Simonoff Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save

More information

Section 4 General Factorial Tutorials

Section 4 General Factorial Tutorials Section 4 General Factorial Tutorials General Factorial Part One: Categorical Introduction Design-Ease software version 6 offers a General Factorial option on the Factorial tab. If you completed the One

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

Chapter 12: Statistics

Chapter 12: Statistics Chapter 12: Statistics Once you have imported your data or created a geospatial model, you may wish to calculate some simple statistics, run some simple tests, or see some traditional plots. On the main

More information

Introduction to Minitab 1

Introduction to Minitab 1 Introduction to Minitab 1 We begin by first starting Minitab. You may choose to either 1. click on the Minitab icon in the corner of your screen 2. go to the lower left and hit Start, then from All Programs,

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data EXERCISE Using Excel for Graphical Analysis of Data Introduction In several upcoming experiments, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Lab 5 - Risk Analysis, Robustness, and Power

Lab 5 - Risk Analysis, Robustness, and Power Type equation here.biology 458 Biometry Lab 5 - Risk Analysis, Robustness, and Power I. Risk Analysis The process of statistical hypothesis testing involves estimating the probability of making errors

More information

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. 1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed

More information

Basic Statistical Terms and Definitions

Basic Statistical Terms and Definitions I. Basics Basic Statistical Terms and Definitions Statistics is a collection of methods for planning experiments, and obtaining data. The data is then organized and summarized so that professionals can

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

Minitab Notes for Activity 1

Minitab Notes for Activity 1 Minitab Notes for Activity 1 Creating the Worksheet 1. Label the columns as team, heat, and time. 2. Have Minitab automatically enter the team data for you. a. Choose Calc / Make Patterned Data / Simple

More information

Data analysis using Microsoft Excel

Data analysis using Microsoft Excel Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data

More information

BIOL 417: Biostatistics Laboratory #3 Tuesday, February 8, 2011 (snow day February 1) INTRODUCTION TO MYSTAT

BIOL 417: Biostatistics Laboratory #3 Tuesday, February 8, 2011 (snow day February 1) INTRODUCTION TO MYSTAT BIOL 417: Biostatistics Laboratory #3 Tuesday, February 8, 2011 (snow day February 1) INTRODUCTION TO MYSTAT Go to the course Blackboard site and download Laboratory 3 MYSTAT Intro.xls open this file in

More information

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 INTRODUCTION Graphs are one of the most important aspects of data analysis and presentation of your of data. They are visual representations

More information

STA 570 Spring Lecture 5 Tuesday, Feb 1

STA 570 Spring Lecture 5 Tuesday, Feb 1 STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row

More information

STAT STATISTICAL METHODS. Statistics: The science of using data to make decisions and draw conclusions

STAT STATISTICAL METHODS. Statistics: The science of using data to make decisions and draw conclusions STAT 515 --- STATISTICAL METHODS Statistics: The science of using data to make decisions and draw conclusions Two branches: Descriptive Statistics: The collection and presentation (through graphical and

More information

Chapter 1. Looking at Data-Distribution

Chapter 1. Looking at Data-Distribution Chapter 1. Looking at Data-Distribution Statistics is the scientific discipline that provides methods to draw right conclusions: 1)Collecting the data 2)Describing the data 3)Drawing the conclusions Raw

More information

MKTG 460 Winter 2019 Solutions #1

MKTG 460 Winter 2019 Solutions #1 MKTG 460 Winter 2019 Solutions #1 Short Answer: Data Analysis 1. What is a data table and how are the data values organized? A data table stores the data values for variables across different observations,

More information

Using Large Data Sets Workbook Version A (MEI)

Using Large Data Sets Workbook Version A (MEI) Using Large Data Sets Workbook Version A (MEI) 1 Index Key Skills Page 3 Becoming familiar with the dataset Page 3 Sorting and filtering the dataset Page 4 Producing a table of summary statistics with

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

The Power and Sample Size Application

The Power and Sample Size Application Chapter 72 The Power and Sample Size Application Contents Overview: PSS Application.................................. 6148 SAS Power and Sample Size............................... 6148 Getting Started:

More information

Descriptive Statistics and Graphing

Descriptive Statistics and Graphing Anatomy and Physiology Page 1 of 9 Measures of Central Tendency Descriptive Statistics and Graphing Measures of central tendency are used to find typical numbers in a data set. There are different ways

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

Brief Guide on Using SPSS 10.0

Brief Guide on Using SPSS 10.0 Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new

More information

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &

More information

UAccess ANALYTICS Next Steps: Working with Bins, Groups, and Calculated Items: Combining Data Your Way

UAccess ANALYTICS Next Steps: Working with Bins, Groups, and Calculated Items: Combining Data Your Way UAccess ANALYTICS Next Steps: Working with Bins, Groups, and Calculated Items: Arizona Board of Regents, 2014 THE UNIVERSITY OF ARIZONA created 02.07.2014 v.1.00 For information and permission to use our

More information

Chapter 6: DESCRIPTIVE STATISTICS

Chapter 6: DESCRIPTIVE STATISTICS Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling

More information

Unit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys

Unit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys Unit 7 Statistics AFM Mrs. Valentine 7.1 Samples and Surveys v Obj.: I will understand the different methods of sampling and studying data. I will be able to determine the type used in an example, and

More information

4. Descriptive Statistics: Measures of Variability and Central Tendency

4. Descriptive Statistics: Measures of Variability and Central Tendency 4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and

More information

Exploring and Understanding Data Using R.

Exploring and Understanding Data Using R. Exploring and Understanding Data Using R. Loading the data into an R data frame: variable

More information

Basic concepts and terms

Basic concepts and terms CHAPTER ONE Basic concepts and terms I. Key concepts Test usefulness Reliability Construct validity Authenticity Interactiveness Impact Practicality Assessment Measurement Test Evaluation Grading/marking

More information

DOWNLOAD PDF BIG IDEAS MATH VERTICAL SHRINK OF A PARABOLA

DOWNLOAD PDF BIG IDEAS MATH VERTICAL SHRINK OF A PARABOLA Chapter 1 : BioMath: Transformation of Graphs Use the results in part (a) to identify the vertex of the parabola. c. Find a vertical line on your graph paper so that when you fold the paper, the left portion

More information

BIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA

BIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA BIOSTATISTICS LABORATORY PART 1: INTRODUCTION TO DATA ANALYIS WITH STATA: EXPLORING AND SUMMARIZING DATA Learning objectives: Getting data ready for analysis: 1) Learn several methods of exploring the

More information

3 Graphical Displays of Data

3 Graphical Displays of Data 3 Graphical Displays of Data Reading: SW Chapter 2, Sections 1-6 Summarizing and Displaying Qualitative Data The data below are from a study of thyroid cancer, using NMTR data. The investigators looked

More information

CREATING THE DISTRIBUTION ANALYSIS

CREATING THE DISTRIBUTION ANALYSIS Chapter 12 Examining Distributions Chapter Table of Contents CREATING THE DISTRIBUTION ANALYSIS...176 BoxPlot...178 Histogram...180 Moments and Quantiles Tables...... 183 ADDING DENSITY ESTIMATES...184

More information

Chapter 3 Analyzing Normal Quantitative Data

Chapter 3 Analyzing Normal Quantitative Data Chapter 3 Analyzing Normal Quantitative Data Introduction: In chapters 1 and 2, we focused on analyzing categorical data and exploring relationships between categorical data sets. We will now be doing

More information

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or  me, I will answer promptly. Statistical Methods Instructor: Lingsong Zhang 1 Issues before Class Statistical Methods Lingsong Zhang Office: Math 544 Email: lingsong@purdue.edu Phone: 765-494-7913 Office Hour: Monday 1:00 pm - 2:00

More information

Minitab on the Math OWL Computers (Windows NT)

Minitab on the Math OWL Computers (Windows NT) STAT 100, Spring 2001 Minitab on the Math OWL Computers (Windows NT) (This is an incomplete revision by Mike Boyle of the Spring 1999 Brief Introduction of Benjamin Kedem) Department of Mathematics, UMCP

More information

SPSS. (Statistical Packages for the Social Sciences)

SPSS. (Statistical Packages for the Social Sciences) Inger Persson SPSS (Statistical Packages for the Social Sciences) SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform basic statistical calculations in SPSS.

More information

INSTRUCTIONS FOR USING MICROSOFT EXCEL PERFORMING DESCRIPTIVE AND INFERENTIAL STATISTICS AND GRAPHING

INSTRUCTIONS FOR USING MICROSOFT EXCEL PERFORMING DESCRIPTIVE AND INFERENTIAL STATISTICS AND GRAPHING APPENDIX INSTRUCTIONS FOR USING MICROSOFT EXCEL PERFORMING DESCRIPTIVE AND INFERENTIAL STATISTICS AND GRAPHING (Developed by Dr. Dale Vogelien, Kennesaw State University) ** For a good review of basic

More information

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement

More information

TYPES OF VARIABLES, STRUCTURE OF DATASETS, AND BASIC STATA LAYOUT

TYPES OF VARIABLES, STRUCTURE OF DATASETS, AND BASIC STATA LAYOUT PRIMER FOR ACS OUTCOMES RESEARCH COURSE: TYPES OF VARIABLES, STRUCTURE OF DATASETS, AND BASIC STATA LAYOUT STEP 1: Install STATA statistical software. STEP 2: Read through this primer and complete the

More information

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order. Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

Week 7: The normal distribution and sample means

Week 7: The normal distribution and sample means Week 7: The normal distribution and sample means Goals Visualize properties of the normal distribution. Learning the Tools Understand the Central Limit Theorem. Calculate sampling properties of sample

More information

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data Chapter 2 Descriptive Statistics: Organizing, Displaying and Summarizing Data Objectives Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically

More information

Chapter Two: Descriptive Methods 1/50

Chapter Two: Descriptive Methods 1/50 Chapter Two: Descriptive Methods 1/50 2.1 Introduction 2/50 2.1 Introduction We previously said that descriptive statistics is made up of various techniques used to summarize the information contained

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

Chapters 5-6: Statistical Inference Methods

Chapters 5-6: Statistical Inference Methods Chapters 5-6: Statistical Inference Methods Chapter 5: Estimation (of population parameters) Ex. Based on GSS data, we re 95% confident that the population mean of the variable LONELY (no. of days in past

More information

Downloaded from

Downloaded from UNIT 2 WHAT IS STATISTICS? Researchers deal with a large amount of data and have to draw dependable conclusions on the basis of data collected for the purpose. Statistics help the researchers in making

More information

DOING MORE WITH EXCEL: MICROSOFT OFFICE 2013

DOING MORE WITH EXCEL: MICROSOFT OFFICE 2013 DOING MORE WITH EXCEL: MICROSOFT OFFICE 2013 GETTING STARTED PAGE 02 Prerequisites What You Will Learn MORE TASKS IN MICROSOFT EXCEL PAGE 03 Cutting, Copying, and Pasting Data Basic Formulas Filling Data

More information

EXCEL BASICS: MICROSOFT OFFICE 2007

EXCEL BASICS: MICROSOFT OFFICE 2007 EXCEL BASICS: MICROSOFT OFFICE 2007 GETTING STARTED PAGE 02 Prerequisites What You Will Learn USING MICROSOFT EXCEL PAGE 03 Opening Microsoft Excel Microsoft Excel Features Keyboard Review Pointer Shapes

More information

To make sense of data, you can start by answering the following questions:

To make sense of data, you can start by answering the following questions: Taken from the Introductory Biology 1, 181 lab manual, Biological Sciences, Copyright NCSU (with appreciation to Dr. Miriam Ferzli--author of this appendix of the lab manual). Appendix : Understanding

More information

Depending on the computer you find yourself in front of, here s what you ll need to do to open SPSS.

Depending on the computer you find yourself in front of, here s what you ll need to do to open SPSS. 1 SPSS 11.5 for Windows Introductory Assignment Material covered: Opening an existing SPSS data file, creating new data files, generating frequency distributions and descriptive statistics, obtaining printouts

More information

Page 1. Graphical and Numerical Statistics

Page 1. Graphical and Numerical Statistics TOPIC: Description Statistics In this tutorial, we show how to use MINITAB to produce descriptive statistics, both graphical and numerical, for an existing MINITAB dataset. The example data come from Exercise

More information

Excel Basics Rice Digital Media Commons Guide Written for Microsoft Excel 2010 Windows Edition by Eric Miller

Excel Basics Rice Digital Media Commons Guide Written for Microsoft Excel 2010 Windows Edition by Eric Miller Excel Basics Rice Digital Media Commons Guide Written for Microsoft Excel 2010 Windows Edition by Eric Miller Table of Contents Introduction!... 1 Part 1: Entering Data!... 2 1.a: Typing!... 2 1.b: Editing

More information

Descriptive Statistics Descriptive statistics & pictorial representations of experimental data.

Descriptive Statistics Descriptive statistics & pictorial representations of experimental data. Psychology 312: Lecture 7 Descriptive Statistics Slide #1 Descriptive Statistics Descriptive statistics & pictorial representations of experimental data. In this lecture we will discuss descriptive statistics.

More information

Chapter 2 Describing, Exploring, and Comparing Data

Chapter 2 Describing, Exploring, and Comparing Data Slide 1 Chapter 2 Describing, Exploring, and Comparing Data Slide 2 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data 2-4 Measures of Center 2-5 Measures of Variation 2-6 Measures of Relative

More information

Homework 1 Excel Basics

Homework 1 Excel Basics Homework 1 Excel Basics Excel is a software program that is used to organize information, perform calculations, and create visual displays of the information. When you start up Excel, you will see the

More information

Opening a Data File in SPSS. Defining Variables in SPSS

Opening a Data File in SPSS. Defining Variables in SPSS Opening a Data File in SPSS To open an existing SPSS file: 1. Click File Open Data. Go to the appropriate directory and find the name of the appropriate file. SPSS defaults to opening SPSS data files with

More information

C ontent descriptions

C ontent descriptions C ontent descriptions http://topdrawer.aamt.edu.au/reasoning/big-ideas/same-and-different Year Number and Algebra F 2 Establish understanding of the language and processes of counting by naming numbers

More information

MAT 110 WORKSHOP. Updated Fall 2018

MAT 110 WORKSHOP. Updated Fall 2018 MAT 110 WORKSHOP Updated Fall 2018 UNIT 3: STATISTICS Introduction Choosing a Sample Simple Random Sample: a set of individuals from the population chosen in a way that every individual has an equal chance

More information

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application SAS/STAT 13.1 User s Guide The Power and Sample Size Application This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as

More information

CHAPTER 1 COPYRIGHTED MATERIAL. Finding Your Way in the Inventor Interface

CHAPTER 1 COPYRIGHTED MATERIAL. Finding Your Way in the Inventor Interface CHAPTER 1 Finding Your Way in the Inventor Interface COPYRIGHTED MATERIAL Understanding Inventor s interface behavior Opening existing files Creating new files Modifying the look and feel of Inventor Managing

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

Product Catalog. AcaStat. Software

Product Catalog. AcaStat. Software Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,

More information

Error Analysis, Statistics and Graphing

Error Analysis, Statistics and Graphing Error Analysis, Statistics and Graphing This semester, most of labs we require us to calculate a numerical answer based on the data we obtain. A hard question to answer in most cases is how good is your

More information

3 Graphical Displays of Data

3 Graphical Displays of Data 3 Graphical Displays of Data Reading: SW Chapter 2, Sections 1-6 Summarizing and Displaying Qualitative Data The data below are from a study of thyroid cancer, using NMTR data. The investigators looked

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

Lab #1: Introduction to Basic SAS Operations

Lab #1: Introduction to Basic SAS Operations Lab #1: Introduction to Basic SAS Operations Getting Started: OVERVIEW OF SAS (access lab pages at http://www.stat.lsu.edu/exstlab/) There are several ways to open the SAS program. You may have a SAS icon

More information

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide

WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide WINKS SDA Windows KwikStat Statistical Data Analysis and Graphs Getting Started Guide 2011 Version 6A Do these tutorials first This series of tutorials provides a quick start to using WINKS. Feel free

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,

More information

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview Chapter 888 Introduction This procedure generates D-optimal designs for multi-factor experiments with both quantitative and qualitative factors. The factors can have a mixed number of levels. For example,

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

Creating a data file and entering data

Creating a data file and entering data 4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that

More information

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types

More information

Chapter 2: Frequency Distributions

Chapter 2: Frequency Distributions Chapter 2: Frequency Distributions Chapter Outline 2.1 Introduction to Frequency Distributions 2.2 Frequency Distribution Tables Obtaining ΣX from a Frequency Distribution Table Proportions and Percentages

More information

Advanced Excel Skills

Advanced Excel Skills Advanced Excel Skills Note : This tutorial is based upon MSExcel 2000. If you are using MSExcel 2002, there may be some operations which look slightly different (e.g. pivot tables), but the same principles

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Learning Objectives Identify Different Types of Variables Appropriately Naming Variables Constructing

More information

Meet MINITAB. Student Release 14. for Windows

Meet MINITAB. Student Release 14. for Windows Meet MINITAB Student Release 14 for Windows 2003, 2004 by Minitab Inc. All rights reserved. MINITAB and the MINITAB logo are registered trademarks of Minitab Inc. All other marks referenced remain the

More information

Math 227 EXCEL / MEGASTAT Guide

Math 227 EXCEL / MEGASTAT Guide Math 227 EXCEL / MEGASTAT Guide Introduction Introduction: Ch2: Frequency Distributions and Graphs Construct Frequency Distributions and various types of graphs: Histograms, Polygons, Pie Charts, Stem-and-Leaf

More information

Exploring Data. This guide describes the facilities in SPM to gain initial insights about a dataset by viewing and generating descriptive statistics.

Exploring Data. This guide describes the facilities in SPM to gain initial insights about a dataset by viewing and generating descriptive statistics. This guide describes the facilities in SPM to gain initial insights about a dataset by viewing and generating descriptive statistics. 2018 by Minitab Inc. All rights reserved. Minitab, SPM, SPM Salford

More information

Box-Cox Transformation for Simple Linear Regression

Box-Cox Transformation for Simple Linear Regression Chapter 192 Box-Cox Transformation for Simple Linear Regression Introduction This procedure finds the appropriate Box-Cox power transformation (1964) for a dataset containing a pair of variables that are

More information