Statistics I 2011/2012 Notes about the third Computer Class: Simulation of samples and goodness of fit; Central Limit Theorem; Confidence intervals.

Size: px
Start display at page:

Download "Statistics I 2011/2012 Notes about the third Computer Class: Simulation of samples and goodness of fit; Central Limit Theorem; Confidence intervals."

Transcription

1 Statistics I 2011/2012 Notes about the third Computer Class: Simulation of samples and goodness of fit; Central Limit Theorem; Confidence intervals. In this Computer Class we are going to use Statgraphics to simulate random samples and to evaluate their goodness of fit with respect to some random variable with known probability law. Moreover, we are going to study an application of the Central Limit Theorem and an introduction to the confidence intervals. At the end of each Section, we must save the simulated random samples, close Statgraphics and open it again (alternatively, we could save the simulated random samples, clean the DataBook and close all the active windows). This operation is fundamental to answer to the questions of Section 4. To save the simulated random samples we can make the following: Save as Save Data File as 1. Simulation of samples from random variables; Goodness of fit. Statgraphics allows generating samples from random variables, i.e. samples based on probability laws. For example, in this section we are going to simulate two samples with size n=100: RAND1: sample of a N(0,1). RAND2: sample of a Student s t with 5 degrees of freedom. Thanks to the above simulations, we are also going to recall the main differences between the Student s t and the Normal laws. Let s start generating the random sample RAND1. To do this, select Describe Distribution Fitting Probability Distributions In the emerging window, select the Probability Distribution in which we are interested, that is the Normal. In the window Normal Options insert the parameters, that are Mean = 0 and Std. Dev. = 1. In the window Tables and Graphs select Analysis Summary, Density/Mass Function and Random

2 Numbers. With the option Density/Mass Function we are going to compare the density functions of the N(0,1) and of the Student s t with 5 degrees of freedom, whereas with the option Random Numbers we are going to generate the random samples. To obtain the sample RAND1, first we press the button: Successively, we are repeat the following instructions in the window Save Results Options : In this way, we obtain the sample RAND1, and you can find it in the DataBook of Statgraphics:

3 Next, we generate the sample RAND2. To do this, select Describe Distribution Fitting Probability Distributions In the emerging window, select the Probability Distribution in which we are interested, that is the Student s t. In the window Student s t Options insert the parameter, that is D. F. = 5 (we are going to insert it in the second box; in this way Statgraphics recognizes that the Student s t is the second distribution with which we want to work). In the window Tables and Graphs select Analysis Summary, Density/Mass Function and Random Numbers. Thanks to the option Density/Mass Function we can now compare the two density functions. What are the main differences that you can appreciate? Let s come back to the main goal of this part of the Notes, that is the generation of RAND2. In this case, we have to repeat the following instructions in the window Save Results Options : Once we obtain RAND2, we can study its goodness of fit. To do this with Statgraphics, we can draw the histogram of RAND2 and we can compare it with the density function of a Normal random variable having parameters equal to the mean and standard deviation sample estimates. With the same estimates, we can obtain also the corresponding QQ-plot. To obtain these two graphs, select: Describe Distribution Fitting Fitting Uncensored Data

4 We are interested in knowing if the sample RAND2 is coherent with a Normal law, that is we want to examine the goodness of fit of RAND2 with respect to a Normal law. To do this, select RAND2 and, in the emerging window, select the Normal option. In the window Tables and Graphs select Analysis Summary, Frequency Histogram and Quantile-Quantile Plot. The following figure shows an example of the result that you should obtain: Clearly, each of you will obtain different results because of the random nature of the examples above. Exercise: Examine the goodness of fit of RAND1 and analyze if there are differences in the results. Write your comments in last sheet of these Notes. BEFORE TO PASS TO THE NEXT SECTION, SAVE THE DATA, CLOSE AND OPEN AGAIN STATGRAPHICS.

5 2. An application of the Central Limit Theorem. In this Section we are going to study the probability law of the Sample Mean estimator. To do this, we are going to generate M samples with size n from a random variable with known probability law. Next, for each sample we are going to compute its sample mean, obtaining at the end M different values. Finally, we are going to study the probability law of this sample with size M of sample means, using the fact that according to the Central Limit Theorem the probability law should be Normal. Let s start considering a random variable X with Uniform law with parameters a=1 and b=10, where a and b are the lower and the upper limits of the random variables X respectively. Using this probability law, let s generate M=30 sample with size n=30. To do this, select the following path: Describe Distribution Fitting Probability Distributions In the emerging window, select the Probability Distribution in which we are interested, that is Uniform and in the window Uniform Options insert the same values for the parameters in all the 5 rows: Lower Limit = 1 y Upper Limit = 10. In the window Tables and Graphs select Analysis Summary and Random Numbers. To change the size of each sample, position the mouse on the second window ( Random Numbers ), press the right button, select Pane Options and change the Size from 100 to 30. On the other hand, thanks to Section 1, we know how to generate the first 5 samples, and in particular for this case we are going to repeat the following instructions in the window Save Results Options :

6 Thanks to the previous steps we have generated the first 5 random samples. Now, to generate the remaining 25 samples, we have to repeat 5 times the previous operations, but using different names for the Target Variables. At the end of these operations, our DataBook will contain M=30 columns and each of them will be a sample with size n=30 of a random variable X with Uniform law with parameters a=1 and b=10. Performing a Multiple-Variable Analysis of these data, we can obtain the M=30 sample means: Describe Numeric Data Multiple-Variable Analysis

7 Using the Pane Options of the Multiple-Variable Analysis we can reduce the number of statistics that we want to observe. Indeed, we are going to select exclusively Average. Now, we are going to copy the 30 averages in the first column of the sheet B of the DataBook (ADVICE: write them on a paper and then copy them in Statgraphics. There is no other way to do that!!! Use two decimals). Change also the name of the column (with a double click; for example it could be Sample means ):

8 Let s recall that our main goal consists in studying the probability law of the Sample Mean estimator, and for that we want to use the Central Limit Theorem. This theorem says that, whatever is the probability law of X (i.e. the random variable used to generate the samples with the same size, n), if n is sufficiently big, the estimator Sample Mean will have a Normal probability law: E[X], Var(X) n In our case, given that X follows a Uniform law with parameters a=1 and b=10, its expected value and its variance are given by: a + b E[X] = = = (b a) Var(X) = = = = 6.75 It follows that if we apply the Central Limit Theorem, we obtain that the estimator Sample Mean should have a law such as X ~ N(5.5, ). As in the previous section, we can examine the goodness of fit of the random sample Sample Means (i.e. the column we have built) with respect to a Normal law: Describe Distribution Fitting Fitting Uncensored Data In the emerging window, select the Normal option and in the window Tables and Graphs select Analysis Summary, Histogram and Quantile-Quantile Plot. The following figure shows an example of the result that you should obtain:

9 Clearly, each of you will obtain different results because of the random nature of the examples above. In this example and according to the previous figure, the estimates for the mean and for the standard deviation are: Mean = (in place of 5.5) Standard Deviation = 0, Important: you should remember that Statgraphics calls variance (standard deviation) the following: 2 s n 1 = 1 n (x n 1 i=1 i x ) 2 The expression ns 2 2 n = (n 1)s n 1 explains the relation between one and the other. Finally, are the estimates of the mean and of the standard deviation good? What are the absolute errors in your case? Exercise: examine the goodness of fit of the sample Sample Means for the cases n=15 and n=60. Analyze how changes the distribution of the Sample Mean, and if the approximation improves or not calculating the absolute errors. Write your comments in the last page of these Notes BEFORE TO PASS TO THE NEXT SECTION, SAVE THE DATA, CLOSE AND OPEN AGAIN STATGRAPHICS.

10 3. Introduction to the construction of confidence intervals. In this section we are going to define two confidence intervals: The first is related with a sample from a N(0,1). The second is related with a sample from a Student s t with 5 degrees of freedom. For what concerns to the confidence intervals, Statgraphics always assumes that the population from which the data come follows a Normal law. For this reason, it is advisable to use the following instructions exclusively when the sample size is big (n 30), and this is because in these cases the Central Limit Theorem assures that the distribution of the Sample Mean estimator follows approximately a Normal law, whatever is the distribution of the initial data. In Section 1 we have seen in details how to generate samples. Let s start generating two samples with size n=100, and in particular: SAMPLE1: sample with size 100 from a N(0,1). SAMPLE2: sample with size 100 from a Student s t with 5 degrees of freedom. Once obtained SAMPLE1 and SAMPLE2, select the following path: Describe Numeric Data One-Variable Analysis Select for example SAMPLE1, and in the windows Tables and Graphs select Analysis Summary and Confidence Intervals. The following table shows the confidence interval for the mean obtained with SAMPLE1: 95% Confidence interval for the mean: 0, /- 0, [-0,105738; 0,343347] Clearly, each of you will obtain different results because of the random nature of the examples above. Exercise: compute the confidence interval associated to SAMPLE2. Write it in the last page of these Notes. BEFORE TO PASS TO THE NEXT SECTION, SAVE THE DATA, CLOSE AND OPEN AGAIN STATGRAPHICS.

11 4. Exercises (you should hand them at the end of this class, using the last sheet of these Notes) Each section concludes with a bold text: develop the instructions and report the comments and/or the answers in the last sheet of these Notes.

12 Answers for Section 4. Name and Surname(s): NIU: Degree: Group Section 1: Comments about the goodness of fit of RAND1 and comparison with the results obtained for RAND2. Section 2: Comments about the goodness of fit of Sample Means for the cases n=15 and n=60. Compute the absolute errors for the estimation of the mean and of the standard deviation. Compare with the results obtained for n=30. Section 3: 95% Confidence interval for the mean obtained using SAMPLE2.

Statistics I Practice 2 Notes Probability and probabilistic models; Introduction of the statistical inference

Statistics I Practice 2 Notes Probability and probabilistic models; Introduction of the statistical inference Statistics I Practice 2 Notes Probability and probabilistic models; Introduction of the statistical inference 1. Simulation of random variables In Excel we can simulate values from random variables (discrete

More information

Page 1. Graphical and Numerical Statistics

Page 1. Graphical and Numerical Statistics TOPIC: Description Statistics In this tutorial, we show how to use MINITAB to produce descriptive statistics, both graphical and numerical, for an existing MINITAB dataset. The example data come from Exercise

More information

Multivariate Normal Random Numbers

Multivariate Normal Random Numbers Multivariate Normal Random Numbers Revised: 10/11/2017 Summary... 1 Data Input... 3 Analysis Options... 4 Analysis Summary... 5 Matrix Plot... 6 Save Results... 8 Calculations... 9 Summary This procedure

More information

Topic 5 - Joint distributions and the CLT

Topic 5 - Joint distributions and the CLT Topic 5 - Joint distributions and the CLT Joint distributions Calculation of probabilities, mean and variance Expectations of functions based on joint distributions Central Limit Theorem Sampling distributions

More information

Week 7: The normal distribution and sample means

Week 7: The normal distribution and sample means Week 7: The normal distribution and sample means Goals Visualize properties of the normal distribution. Learning the Tools Understand the Central Limit Theorem. Calculate sampling properties of sample

More information

CHAPTER 6. The Normal Probability Distribution

CHAPTER 6. The Normal Probability Distribution The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit

More information

We have seen that as n increases, the length of our confidence interval decreases, the confidence interval will be more narrow.

We have seen that as n increases, the length of our confidence interval decreases, the confidence interval will be more narrow. {Confidence Intervals for Population Means} Now we will discuss a few loose ends. Before moving into our final discussion of confidence intervals for one population mean, let s review a few important results

More information

Using Large Data Sets Workbook Version A (MEI)

Using Large Data Sets Workbook Version A (MEI) Using Large Data Sets Workbook Version A (MEI) 1 Index Key Skills Page 3 Becoming familiar with the dataset Page 3 Sorting and filtering the dataset Page 4 Producing a table of summary statistics with

More information

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 8: Interval Estimation Population Mean: Known Population Mean: Unknown Margin of Error and the Interval

More information

Unit 5: Estimating with Confidence

Unit 5: Estimating with Confidence Unit 5: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Unit 5 Estimating with Confidence 8.1 8.2 8.3 Confidence Intervals: The Basics Estimating

More information

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA This lab will assist you in learning how to summarize and display categorical and quantitative data in StatCrunch. In particular, you will learn how to

More information

Central Limit Theorem Sample Means

Central Limit Theorem Sample Means Date Central Limit Theorem Sample Means Group Member Names: Part One Review of Types of Distributions Consider the three graphs below. Match the histograms with the distribution description. Write the

More information

Use of Extreme Value Statistics in Modeling Biometric Systems

Use of Extreme Value Statistics in Modeling Biometric Systems Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision

More information

Probability Models.S4 Simulating Random Variables

Probability Models.S4 Simulating Random Variables Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Probability Models.S4 Simulating Random Variables In the fashion of the last several sections, we will often create probability

More information

Brief Guide on Using SPSS 10.0

Brief Guide on Using SPSS 10.0 Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new

More information

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis. 1.3 Density curves p50 Some times the overall pattern of a large number of observations is so regular that we can describe it by a smooth curve. It is easier to work with a smooth curve, because the histogram

More information

So..to be able to make comparisons possible, we need to compare them with their respective distributions.

So..to be able to make comparisons possible, we need to compare them with their respective distributions. Unit 3 ~ Modeling Distributions of Data 1 ***Section 2.1*** Measures of Relative Standing and Density Curves (ex) Suppose that a professional soccer team has the money to sign one additional player and

More information

Problem Set #8. Econ 103

Problem Set #8. Econ 103 Problem Set #8 Econ 103 Part I Problems from the Textbook No problems from the textbook on this assignment. Part II Additional Problems 1. For this question assume that we have a random sample from a normal

More information

One Factor Experiments

One Factor Experiments One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal

More information

Chapter 6: Simulation Using Spread-Sheets (Excel)

Chapter 6: Simulation Using Spread-Sheets (Excel) Chapter 6: Simulation Using Spread-Sheets (Excel) Refer to Reading Assignments 1 Simulation Using Spread-Sheets (Excel) OBJECTIVES To be able to Generate random numbers within a spreadsheet environment.

More information

Basics: How to Calculate Standard Deviation in Excel

Basics: How to Calculate Standard Deviation in Excel Basics: How to Calculate Standard Deviation in Excel In this guide, we are going to look at the basics of calculating the standard deviation of a data set. The calculations will be done step by step, without

More information

Lesson 19: The Graph of a Linear Equation in Two Variables Is a Line

Lesson 19: The Graph of a Linear Equation in Two Variables Is a Line The Graph of a Linear Equation in Two Variables Is a Line Classwork Exercises THEOREM: The graph of a linear equation yy = mmmm + bb is a non-vertical line with slope mm and passing through (0, bb), where

More information

CHAPTER 2: Describing Location in a Distribution

CHAPTER 2: Describing Location in a Distribution CHAPTER 2: Describing Location in a Distribution 2.1 Goals: 1. Compute and use z-scores given the mean and sd 2. Compute and use the p th percentile of an observation 3. Intro to density curves 4. More

More information

Chapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea

Chapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea Chapter 3 Bootstrap 3.1 Introduction The estimation of parameters in probability distributions is a basic problem in statistics that one tends to encounter already during the very first course on the subject.

More information

R Programming Basics - Useful Builtin Functions for Statistics

R Programming Basics - Useful Builtin Functions for Statistics R Programming Basics - Useful Builtin Functions for Statistics Vectorized Arithmetic - most arthimetic operations in R work on vectors. Here are a few commonly used summary statistics. testvect = c(1,3,5,2,9,10,7,8,6)

More information

Multivariate Capability Analysis

Multivariate Capability Analysis Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8

More information

2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data.

2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data. Code No: M0502/R05 Set No. 1 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Briefly discuss

More information

Lecture 6: Chapter 6 Summary

Lecture 6: Chapter 6 Summary 1 Lecture 6: Chapter 6 Summary Z-score: Is the distance of each data value from the mean in standard deviation Standardizes data values Standardization changes the mean and the standard deviation: o Z

More information

CREATING THE DISTRIBUTION ANALYSIS

CREATING THE DISTRIBUTION ANALYSIS Chapter 12 Examining Distributions Chapter Table of Contents CREATING THE DISTRIBUTION ANALYSIS...176 BoxPlot...178 Histogram...180 Moments and Quantiles Tables...... 183 ADDING DENSITY ESTIMATES...184

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT

LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT NAVAL POSTGRADUATE SCHOOL LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT Statistics (OA3102) Lab #2: Sampling, Sampling Distributions, and the Central Limit Theorem Goal: Use R to demonstrate sampling

More information

Descriptive Statistics, Standard Deviation and Standard Error

Descriptive Statistics, Standard Deviation and Standard Error AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.

More information

Exam 2 is Tue Nov 21. Bring a pencil and a calculator. Discuss similarity to exam1. HW3 is due Tue Dec 5.

Exam 2 is Tue Nov 21. Bring a pencil and a calculator. Discuss similarity to exam1. HW3 is due Tue Dec 5. Stat 100a: Introduction to Probability. Outline for the day 1. Bivariate and marginal density. 2. CLT. 3. CIs. 4. Sample size calculations. 5. Review for exam 2. Exam 2 is Tue Nov 21. Bring a pencil and

More information

Screening Design Selection

Screening Design Selection Screening Design Selection Summary... 1 Data Input... 2 Analysis Summary... 5 Power Curve... 7 Calculations... 7 Summary The STATGRAPHICS experimental design section can create a wide variety of designs

More information

Table 1 below illustrates the construction for the case of 11 integers selected from 20.

Table 1 below illustrates the construction for the case of 11 integers selected from 20. Q: a) From the first 200 natural numbers 101 of them are arbitrarily chosen. Prove that among the numbers chosen there exists a pair such that one divides the other. b) Prove that if 100 numbers are chosen

More information

Lesson 19: The Graph of a Linear Equation in Two Variables is a Line

Lesson 19: The Graph of a Linear Equation in Two Variables is a Line Lesson 19: The Graph of a Linear Equation in Two Variables is a Line Classwork Exercises Theorem: The graph of a linear equation y = mx + b is a non-vertical line with slope m and passing through (0, b),

More information

Probability and Statistics for Final Year Engineering Students

Probability and Statistics for Final Year Engineering Students Probability and Statistics for Final Year Engineering Students By Yoni Nazarathy, Last Updated: April 11, 2011. Lecture 1: Introduction and Basic Terms Welcome to the course, time table, assessment, etc..

More information

LASER s Level 2 Maths Course - Summary

LASER s Level 2 Maths Course - Summary LASER s Level 2 Maths Course - Summary Unit Code Unit Title Credits Level Status SER945 Shape, Space and Measurement 3 2 Mandatory SER946 Collecting, Recording and Analysing Data 3 2 Mandatory SER947 Development

More information

Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS

Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS 3- Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS A) Frequency Distributions For Samples Defn: A FREQUENCY DISTRIBUTION is a tabular or graphical display

More information

Robust Linear Regression (Passing- Bablok Median-Slope)

Robust Linear Regression (Passing- Bablok Median-Slope) Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

Chapter 5: The standard deviation as a ruler and the normal model p131

Chapter 5: The standard deviation as a ruler and the normal model p131 Chapter 5: The standard deviation as a ruler and the normal model p131 Which is the better exam score? 67 on an exam with mean 50 and SD 10 62 on an exam with mean 40 and SD 12? Is it fair to say: 67 is

More information

If the active datasheet is empty when the StatWizard appears, a dialog box is displayed to assist in entering data.

If the active datasheet is empty when the StatWizard appears, a dialog box is displayed to assist in entering data. StatWizard Summary The StatWizard is designed to serve several functions: 1. It assists new users in entering data to be analyzed. 2. It provides a search facility to help locate desired statistical procedures.

More information

Quantitative - One Population

Quantitative - One Population Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,

More information

To complete the computer assignments, you ll use the EViews software installed on the lab PCs in WMC 2502 and WMC 2506.

To complete the computer assignments, you ll use the EViews software installed on the lab PCs in WMC 2502 and WMC 2506. An Introduction to EViews The purpose of the computer assignments in BUEC 333 is to give you some experience using econometric software to analyse real-world data. Along the way, you ll become acquainted

More information

Using Arithmetic of Real Numbers to Explore Limits and Continuity

Using Arithmetic of Real Numbers to Explore Limits and Continuity Using Arithmetic of Real Numbers to Explore Limits and Continuity by Maria Terrell Cornell University Problem Let a =.898989... and b =.000000... (a) Find a + b. (b) Use your ideas about how to add a and

More information

Error Analysis, Statistics and Graphing

Error Analysis, Statistics and Graphing Error Analysis, Statistics and Graphing This semester, most of labs we require us to calculate a numerical answer based on the data we obtain. A hard question to answer in most cases is how good is your

More information

Experiment 1 Yahtzee or Validating the t-table

Experiment 1 Yahtzee or Validating the t-table Experiment 1 Yahtzee or Validating the t-table Learning Goals 1. Receive training and practice with many useful spreadsheet functions in Excel. 2. Review and experience the concepts of probability and

More information

A Constant Rate of Change Name Part 1

A Constant Rate of Change Name Part 1 A Constant Rate of Change Name Part 1 Consider the function table below. Complete this page by solving the problems at the bottom. Use a separate sheet of paper for your descriptions and explanations.

More information

Assignment 4/5 Statistics Due: Nov. 29

Assignment 4/5 Statistics Due: Nov. 29 Assignment 4/5 Statistics 5.301 Due: Nov. 29 1. Two decision rules are given here. Assume they apply to a normally distributed quality characteristic, the control chart has three-sigma control limits,

More information

Measures of Dispersion

Measures of Dispersion Measures of Dispersion 6-3 I Will... Find measures of dispersion of sets of data. Find standard deviation and analyze normal distribution. Day 1: Dispersion Vocabulary Measures of Variation (Dispersion

More information

MHPE 494: Data Analysis. Welcome! The Analytic Process

MHPE 494: Data Analysis. Welcome! The Analytic Process MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your

More information

Computational Mathematics/Information Technology. Worksheet 2 Iteration and Excel

Computational Mathematics/Information Technology. Worksheet 2 Iteration and Excel Computational Mathematics/Information Technology Worksheet 2 Iteration and Excel This sheet uses Excel and the method of iteration to solve the problem f(x) = 0. It introduces user functions and self referencing

More information

Exercise 2.23 Villanova MAT 8406 September 7, 2015

Exercise 2.23 Villanova MAT 8406 September 7, 2015 Exercise 2.23 Villanova MAT 8406 September 7, 2015 Step 1: Understand the Question Consider the simple linear regression model y = 50 + 10x + ε where ε is NID(0, 16). Suppose that n = 20 pairs of observations

More information

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement

More information

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &

More information

STAT 113: Lab 9. Colin Reimer Dawson. Last revised November 10, 2015

STAT 113: Lab 9. Colin Reimer Dawson. Last revised November 10, 2015 STAT 113: Lab 9 Colin Reimer Dawson Last revised November 10, 2015 We will do some of the following together. The exercises with a (*) should be done and turned in as part of HW9. Before we start, let

More information

STAT 135 Lab 1 Solutions

STAT 135 Lab 1 Solutions STAT 135 Lab 1 Solutions January 26, 2015 Introduction To complete this lab, you will need to have access to R and RStudio. If you have not already done so, you can download R from http://cran.cnr.berkeley.edu/,

More information

Today. Lecture 4: Last time. The EM algorithm. We examine clustering in a little more detail; we went over it a somewhat quickly last time

Today. Lecture 4: Last time. The EM algorithm. We examine clustering in a little more detail; we went over it a somewhat quickly last time Today Lecture 4: We examine clustering in a little more detail; we went over it a somewhat quickly last time The CAD data will return and give us an opportunity to work with curves (!) We then examine

More information

Spreadsheet and Graphing Exercise Biology 210 Introduction to Research

Spreadsheet and Graphing Exercise Biology 210 Introduction to Research 1 Spreadsheet and Graphing Exercise Biology 210 Introduction to Research There are many good spreadsheet programs for analyzing data. In this class we will use MS Excel. Below are a series of examples

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

CHAPTER 2 Modeling Distributions of Data

CHAPTER 2 Modeling Distributions of Data CHAPTER 2 Modeling Distributions of Data 2.2 Density Curves and Normal Distributions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Density Curves

More information

Regression Analysis and Linear Regression Models

Regression Analysis and Linear Regression Models Regression Analysis and Linear Regression Models University of Trento - FBK 2 March, 2015 (UNITN-FBK) Regression Analysis and Linear Regression Models 2 March, 2015 1 / 33 Relationship between numerical

More information

Fractions and decimals have opposites, just as integers do. For example, 5 8 and 2 5

Fractions and decimals have opposites, just as integers do. For example, 5 8 and 2 5 Domain 1 Lesson Rational Numbers Common Core Standards: 6.NS., 6.NS.6.c, 6.NS.7.c Getting the Idea A rational number is a number that can be expressed as the ratio of two integers in the form a, where

More information

= 3 + (5*4) + (1/2)*(4/2)^2.

= 3 + (5*4) + (1/2)*(4/2)^2. Physics 100 Lab 1: Use of a Spreadsheet to Analyze Data by Kenneth Hahn and Michael Goggin In this lab you will learn how to enter data into a spreadsheet and to manipulate the data in meaningful ways.

More information

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015 MAT 142 College Mathematics Statistics Module ST Terri Miller revised July 14, 2015 2 Statistics Data Organization and Visualization Basic Terms. A population is the set of all objects under study, a sample

More information

Behavior of the sample mean. varx i = σ 2

Behavior of the sample mean. varx i = σ 2 Behavior of the sample mean We observe n independent and identically distributed (iid) draws from a random variable X. Denote the observed values by X 1, X 2,..., X n. Assume the X i come from a population

More information

4. Descriptive Statistics: Measures of Variability and Central Tendency

4. Descriptive Statistics: Measures of Variability and Central Tendency 4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

Using Excel This is only a brief overview that highlights some of the useful points in a spreadsheet program.

Using Excel This is only a brief overview that highlights some of the useful points in a spreadsheet program. Using Excel 2007 This is only a brief overview that highlights some of the useful points in a spreadsheet program. 1. Input of data - Generally you should attempt to put the independent variable on the

More information

Lab 7 Statistics I LAB 7 QUICK VIEW

Lab 7 Statistics I LAB 7 QUICK VIEW Lab 7 Statistics I This lab will cover how to do statistical calculations in excel using formulas. (Note that your version of excel may have additional formulas to calculate statistics, but these formulas

More information

E-Campus Inferential Statistics - Part 2

E-Campus Inferential Statistics - Part 2 E-Campus Inferential Statistics - Part 2 Group Members: James Jones Question 4-Isthere a significant difference in the mean prices of the stores? New Textbook Prices New Price Descriptives 95% Confidence

More information

Winstats Instruction Sheet

Winstats Instruction Sheet Winstats Instruction Sheet I. Installing Winstats on your Computer A. Go to the Peanut Software homepage. Either go directly to http://math.exeter.edu/rparris/default.html or Google Peanut Software. B.

More information

Instructions for Using ABCalc James Alan Fox Northeastern University Updated: August 2009

Instructions for Using ABCalc James Alan Fox Northeastern University Updated: August 2009 Instructions for Using ABCalc James Alan Fox Northeastern University Updated: August 2009 Thank you for using ABCalc, a statistical calculator to accompany several introductory statistics texts published

More information

VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES

VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES Libraries Conference on Applied Statistics in Agriculture 1996-8th Annual Conference Proceedings VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES Stephen N. Sly Jeffrey S.

More information

Critical Numbers, Maximums, & Minimum

Critical Numbers, Maximums, & Minimum Critical Numbers, Maximums, & Minimum Caleb Adams October 23, 2014 Critical Numbers of Rational Functions Recall, to find a critical number, we first found the derivative of a function, then examined where

More information

CHAPTER 2 Modeling Distributions of Data

CHAPTER 2 Modeling Distributions of Data CHAPTER 2 Modeling Distributions of Data 2.2 Density Curves and Normal Distributions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers HW 34. Sketch

More information

STAT 503 Fall Introduction to SAS

STAT 503 Fall Introduction to SAS Getting Started Introduction to SAS 1) Download all of the files, sas programs (.sas) and data files (.dat) into one of your directories. I would suggest using your H: drive if you are using a computer

More information

4b: Making an auxiliary table for calculating the standard deviation

4b: Making an auxiliary table for calculating the standard deviation In the book we discussed the use of an auxiliary table to calculate variance and standard deviation (Table 4.3). Such a table gives much more insight in the underlying calculations than the simple number

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

Lecture 3 - Object-oriented programming and statistical programming examples

Lecture 3 - Object-oriented programming and statistical programming examples Lecture 3 - Object-oriented programming and statistical programming examples Björn Andersson (w/ Ronnie Pingel) Department of Statistics, Uppsala University February 1, 2013 Table of Contents 1 Some notes

More information

Specific Objectives Students will understand that that the family of equation corresponds with the shape of the graph. Students will be able to create a graph of an equation by plotting points. In lesson

More information

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked Plotting Menu: QCExpert Plotting Module graphs offers various tools for visualization of uni- and multivariate data. Settings and options in different types of graphs allow for modifications and customizations

More information

Week 4: Describing data and estimation

Week 4: Describing data and estimation Week 4: Describing data and estimation Goals Investigate sampling error; see that larger samples have less sampling error. Visualize confidence intervals. Calculate basic summary statistics using R. Calculate

More information

Chapter 5snow year.notebook March 15, 2018

Chapter 5snow year.notebook March 15, 2018 Chapter 5: Statistical Reasoning Section 5.1 Exploring Data Measures of central tendency (Mean, Median and Mode) attempt to describe a set of data by identifying the central position within a set of data

More information

[Note: each line drawn must be a single line segment satisfying x = 3] (b) y = x drawn 1 B1 for y = x drawn

[Note: each line drawn must be a single line segment satisfying x = 3] (b) y = x drawn 1 B1 for y = x drawn 1. (a) x = 3 drawn 1 B1 for x = 3 drawn (b) y = x drawn 1 B1 for y = x drawn [Note: each line drawn must be a single line segment satisfying x = 3] [Note: each line drawn must be a single line segment

More information

Visual-XSel Introduction & SixSigma Selected statistical methods, examples and SixSigma with Visual-XSel Copyright CRGRAPH

Visual-XSel Introduction & SixSigma Selected statistical methods, examples and SixSigma with Visual-XSel Copyright CRGRAPH Selected statistical methods, examples and SixSigma with Visual-XSel 14.0 Main Guide The Main Guide shows three important statistical methods: DoE, Reliability and Capability analyses. A pre investigation

More information

Chapter 6: DESCRIPTIVE STATISTICS

Chapter 6: DESCRIPTIVE STATISTICS Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling

More information

Using R. Liang Peng Georgia Institute of Technology January 2005

Using R. Liang Peng Georgia Institute of Technology January 2005 Using R Liang Peng Georgia Institute of Technology January 2005 1. Introduction Quote from http://www.r-project.org/about.html: R is a language and environment for statistical computing and graphics. It

More information

Assignment 5.5. Nothing here to hand in

Assignment 5.5. Nothing here to hand in Assignment 5.5 Nothing here to hand in Load the tidyverse before we start: library(tidyverse) ## Loading tidyverse: ggplot2 ## Loading tidyverse: tibble ## Loading tidyverse: tidyr ## Loading tidyverse:

More information

Tutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures

Tutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures : RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and February 24, 2014 Sample to Insight : RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and : RNA-Seq Analysis

More information

Testing Random- Number Generators

Testing Random- Number Generators Testing Random- Number Generators Raj Jain Washington University Saint Louis, MO 63131 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse574-06/ 27-1 Overview

More information

Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt )

Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) JMP Output from Chapter 9 Factorial Analysis through JMP Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) Fitting the Model and checking conditions Analyze > Fit Model

More information

Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots

Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots STAT 350 (Spring 2015) Lab 3: SAS Solutions 1 Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots Note: The data sets are not included in the solutions;

More information

Chapter 3: Data Description Calculate Mean, Median, Mode, Range, Variation, Standard Deviation, Quartiles, standard scores; construct Boxplots.

Chapter 3: Data Description Calculate Mean, Median, Mode, Range, Variation, Standard Deviation, Quartiles, standard scores; construct Boxplots. MINITAB Guide PREFACE Preface This guide is used as part of the Elementary Statistics class (Course Number 227) offered at Los Angeles Mission College. It is structured to follow the contents of the textbook

More information

Optimization and Simulation

Optimization and Simulation Optimization and Simulation Statistical analysis and bootstrapping Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale

More information

What s Normal Anyway?

What s Normal Anyway? Name Class Problem 1 A Binomial Experiment 1. When rolling a die, what is the theoretical probability of rolling a 3? 2. When a die is rolled 100 times, how many times do you expect that a 3 will be rolled?

More information

Chapters 5-6: Statistical Inference Methods

Chapters 5-6: Statistical Inference Methods Chapters 5-6: Statistical Inference Methods Chapter 5: Estimation (of population parameters) Ex. Based on GSS data, we re 95% confident that the population mean of the variable LONELY (no. of days in past

More information