Statistics I 2011/2012 Notes about the third Computer Class: Simulation of samples and goodness of fit; Central Limit Theorem; Confidence intervals.
|
|
- Charlene Mitchell
- 5 years ago
- Views:
Transcription
1 Statistics I 2011/2012 Notes about the third Computer Class: Simulation of samples and goodness of fit; Central Limit Theorem; Confidence intervals. In this Computer Class we are going to use Statgraphics to simulate random samples and to evaluate their goodness of fit with respect to some random variable with known probability law. Moreover, we are going to study an application of the Central Limit Theorem and an introduction to the confidence intervals. At the end of each Section, we must save the simulated random samples, close Statgraphics and open it again (alternatively, we could save the simulated random samples, clean the DataBook and close all the active windows). This operation is fundamental to answer to the questions of Section 4. To save the simulated random samples we can make the following: Save as Save Data File as 1. Simulation of samples from random variables; Goodness of fit. Statgraphics allows generating samples from random variables, i.e. samples based on probability laws. For example, in this section we are going to simulate two samples with size n=100: RAND1: sample of a N(0,1). RAND2: sample of a Student s t with 5 degrees of freedom. Thanks to the above simulations, we are also going to recall the main differences between the Student s t and the Normal laws. Let s start generating the random sample RAND1. To do this, select Describe Distribution Fitting Probability Distributions In the emerging window, select the Probability Distribution in which we are interested, that is the Normal. In the window Normal Options insert the parameters, that are Mean = 0 and Std. Dev. = 1. In the window Tables and Graphs select Analysis Summary, Density/Mass Function and Random
2 Numbers. With the option Density/Mass Function we are going to compare the density functions of the N(0,1) and of the Student s t with 5 degrees of freedom, whereas with the option Random Numbers we are going to generate the random samples. To obtain the sample RAND1, first we press the button: Successively, we are repeat the following instructions in the window Save Results Options : In this way, we obtain the sample RAND1, and you can find it in the DataBook of Statgraphics:
3 Next, we generate the sample RAND2. To do this, select Describe Distribution Fitting Probability Distributions In the emerging window, select the Probability Distribution in which we are interested, that is the Student s t. In the window Student s t Options insert the parameter, that is D. F. = 5 (we are going to insert it in the second box; in this way Statgraphics recognizes that the Student s t is the second distribution with which we want to work). In the window Tables and Graphs select Analysis Summary, Density/Mass Function and Random Numbers. Thanks to the option Density/Mass Function we can now compare the two density functions. What are the main differences that you can appreciate? Let s come back to the main goal of this part of the Notes, that is the generation of RAND2. In this case, we have to repeat the following instructions in the window Save Results Options : Once we obtain RAND2, we can study its goodness of fit. To do this with Statgraphics, we can draw the histogram of RAND2 and we can compare it with the density function of a Normal random variable having parameters equal to the mean and standard deviation sample estimates. With the same estimates, we can obtain also the corresponding QQ-plot. To obtain these two graphs, select: Describe Distribution Fitting Fitting Uncensored Data
4 We are interested in knowing if the sample RAND2 is coherent with a Normal law, that is we want to examine the goodness of fit of RAND2 with respect to a Normal law. To do this, select RAND2 and, in the emerging window, select the Normal option. In the window Tables and Graphs select Analysis Summary, Frequency Histogram and Quantile-Quantile Plot. The following figure shows an example of the result that you should obtain: Clearly, each of you will obtain different results because of the random nature of the examples above. Exercise: Examine the goodness of fit of RAND1 and analyze if there are differences in the results. Write your comments in last sheet of these Notes. BEFORE TO PASS TO THE NEXT SECTION, SAVE THE DATA, CLOSE AND OPEN AGAIN STATGRAPHICS.
5 2. An application of the Central Limit Theorem. In this Section we are going to study the probability law of the Sample Mean estimator. To do this, we are going to generate M samples with size n from a random variable with known probability law. Next, for each sample we are going to compute its sample mean, obtaining at the end M different values. Finally, we are going to study the probability law of this sample with size M of sample means, using the fact that according to the Central Limit Theorem the probability law should be Normal. Let s start considering a random variable X with Uniform law with parameters a=1 and b=10, where a and b are the lower and the upper limits of the random variables X respectively. Using this probability law, let s generate M=30 sample with size n=30. To do this, select the following path: Describe Distribution Fitting Probability Distributions In the emerging window, select the Probability Distribution in which we are interested, that is Uniform and in the window Uniform Options insert the same values for the parameters in all the 5 rows: Lower Limit = 1 y Upper Limit = 10. In the window Tables and Graphs select Analysis Summary and Random Numbers. To change the size of each sample, position the mouse on the second window ( Random Numbers ), press the right button, select Pane Options and change the Size from 100 to 30. On the other hand, thanks to Section 1, we know how to generate the first 5 samples, and in particular for this case we are going to repeat the following instructions in the window Save Results Options :
6 Thanks to the previous steps we have generated the first 5 random samples. Now, to generate the remaining 25 samples, we have to repeat 5 times the previous operations, but using different names for the Target Variables. At the end of these operations, our DataBook will contain M=30 columns and each of them will be a sample with size n=30 of a random variable X with Uniform law with parameters a=1 and b=10. Performing a Multiple-Variable Analysis of these data, we can obtain the M=30 sample means: Describe Numeric Data Multiple-Variable Analysis
7 Using the Pane Options of the Multiple-Variable Analysis we can reduce the number of statistics that we want to observe. Indeed, we are going to select exclusively Average. Now, we are going to copy the 30 averages in the first column of the sheet B of the DataBook (ADVICE: write them on a paper and then copy them in Statgraphics. There is no other way to do that!!! Use two decimals). Change also the name of the column (with a double click; for example it could be Sample means ):
8 Let s recall that our main goal consists in studying the probability law of the Sample Mean estimator, and for that we want to use the Central Limit Theorem. This theorem says that, whatever is the probability law of X (i.e. the random variable used to generate the samples with the same size, n), if n is sufficiently big, the estimator Sample Mean will have a Normal probability law: E[X], Var(X) n In our case, given that X follows a Uniform law with parameters a=1 and b=10, its expected value and its variance are given by: a + b E[X] = = = (b a) Var(X) = = = = 6.75 It follows that if we apply the Central Limit Theorem, we obtain that the estimator Sample Mean should have a law such as X ~ N(5.5, ). As in the previous section, we can examine the goodness of fit of the random sample Sample Means (i.e. the column we have built) with respect to a Normal law: Describe Distribution Fitting Fitting Uncensored Data In the emerging window, select the Normal option and in the window Tables and Graphs select Analysis Summary, Histogram and Quantile-Quantile Plot. The following figure shows an example of the result that you should obtain:
9 Clearly, each of you will obtain different results because of the random nature of the examples above. In this example and according to the previous figure, the estimates for the mean and for the standard deviation are: Mean = (in place of 5.5) Standard Deviation = 0, Important: you should remember that Statgraphics calls variance (standard deviation) the following: 2 s n 1 = 1 n (x n 1 i=1 i x ) 2 The expression ns 2 2 n = (n 1)s n 1 explains the relation between one and the other. Finally, are the estimates of the mean and of the standard deviation good? What are the absolute errors in your case? Exercise: examine the goodness of fit of the sample Sample Means for the cases n=15 and n=60. Analyze how changes the distribution of the Sample Mean, and if the approximation improves or not calculating the absolute errors. Write your comments in the last page of these Notes BEFORE TO PASS TO THE NEXT SECTION, SAVE THE DATA, CLOSE AND OPEN AGAIN STATGRAPHICS.
10 3. Introduction to the construction of confidence intervals. In this section we are going to define two confidence intervals: The first is related with a sample from a N(0,1). The second is related with a sample from a Student s t with 5 degrees of freedom. For what concerns to the confidence intervals, Statgraphics always assumes that the population from which the data come follows a Normal law. For this reason, it is advisable to use the following instructions exclusively when the sample size is big (n 30), and this is because in these cases the Central Limit Theorem assures that the distribution of the Sample Mean estimator follows approximately a Normal law, whatever is the distribution of the initial data. In Section 1 we have seen in details how to generate samples. Let s start generating two samples with size n=100, and in particular: SAMPLE1: sample with size 100 from a N(0,1). SAMPLE2: sample with size 100 from a Student s t with 5 degrees of freedom. Once obtained SAMPLE1 and SAMPLE2, select the following path: Describe Numeric Data One-Variable Analysis Select for example SAMPLE1, and in the windows Tables and Graphs select Analysis Summary and Confidence Intervals. The following table shows the confidence interval for the mean obtained with SAMPLE1: 95% Confidence interval for the mean: 0, /- 0, [-0,105738; 0,343347] Clearly, each of you will obtain different results because of the random nature of the examples above. Exercise: compute the confidence interval associated to SAMPLE2. Write it in the last page of these Notes. BEFORE TO PASS TO THE NEXT SECTION, SAVE THE DATA, CLOSE AND OPEN AGAIN STATGRAPHICS.
11 4. Exercises (you should hand them at the end of this class, using the last sheet of these Notes) Each section concludes with a bold text: develop the instructions and report the comments and/or the answers in the last sheet of these Notes.
12 Answers for Section 4. Name and Surname(s): NIU: Degree: Group Section 1: Comments about the goodness of fit of RAND1 and comparison with the results obtained for RAND2. Section 2: Comments about the goodness of fit of Sample Means for the cases n=15 and n=60. Compute the absolute errors for the estimation of the mean and of the standard deviation. Compare with the results obtained for n=30. Section 3: 95% Confidence interval for the mean obtained using SAMPLE2.
Statistics I Practice 2 Notes Probability and probabilistic models; Introduction of the statistical inference
Statistics I Practice 2 Notes Probability and probabilistic models; Introduction of the statistical inference 1. Simulation of random variables In Excel we can simulate values from random variables (discrete
More informationPage 1. Graphical and Numerical Statistics
TOPIC: Description Statistics In this tutorial, we show how to use MINITAB to produce descriptive statistics, both graphical and numerical, for an existing MINITAB dataset. The example data come from Exercise
More informationMultivariate Normal Random Numbers
Multivariate Normal Random Numbers Revised: 10/11/2017 Summary... 1 Data Input... 3 Analysis Options... 4 Analysis Summary... 5 Matrix Plot... 6 Save Results... 8 Calculations... 9 Summary This procedure
More informationTopic 5 - Joint distributions and the CLT
Topic 5 - Joint distributions and the CLT Joint distributions Calculation of probabilities, mean and variance Expectations of functions based on joint distributions Central Limit Theorem Sampling distributions
More informationWeek 7: The normal distribution and sample means
Week 7: The normal distribution and sample means Goals Visualize properties of the normal distribution. Learning the Tools Understand the Central Limit Theorem. Calculate sampling properties of sample
More informationCHAPTER 6. The Normal Probability Distribution
The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit
More informationWe have seen that as n increases, the length of our confidence interval decreases, the confidence interval will be more narrow.
{Confidence Intervals for Population Means} Now we will discuss a few loose ends. Before moving into our final discussion of confidence intervals for one population mean, let s review a few important results
More informationUsing Large Data Sets Workbook Version A (MEI)
Using Large Data Sets Workbook Version A (MEI) 1 Index Key Skills Page 3 Becoming familiar with the dataset Page 3 Sorting and filtering the dataset Page 4 Producing a table of summary statistics with
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 8: Interval Estimation Population Mean: Known Population Mean: Unknown Margin of Error and the Interval
More informationUnit 5: Estimating with Confidence
Unit 5: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Unit 5 Estimating with Confidence 8.1 8.2 8.3 Confidence Intervals: The Basics Estimating
More informationLAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA
LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA This lab will assist you in learning how to summarize and display categorical and quantitative data in StatCrunch. In particular, you will learn how to
More informationCentral Limit Theorem Sample Means
Date Central Limit Theorem Sample Means Group Member Names: Part One Review of Types of Distributions Consider the three graphs below. Match the histograms with the distribution description. Write the
More informationUse of Extreme Value Statistics in Modeling Biometric Systems
Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision
More informationProbability Models.S4 Simulating Random Variables
Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Probability Models.S4 Simulating Random Variables In the fashion of the last several sections, we will often create probability
More informationBrief Guide on Using SPSS 10.0
Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new
More informationDensity Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.
1.3 Density curves p50 Some times the overall pattern of a large number of observations is so regular that we can describe it by a smooth curve. It is easier to work with a smooth curve, because the histogram
More informationSo..to be able to make comparisons possible, we need to compare them with their respective distributions.
Unit 3 ~ Modeling Distributions of Data 1 ***Section 2.1*** Measures of Relative Standing and Density Curves (ex) Suppose that a professional soccer team has the money to sign one additional player and
More informationProblem Set #8. Econ 103
Problem Set #8 Econ 103 Part I Problems from the Textbook No problems from the textbook on this assignment. Part II Additional Problems 1. For this question assume that we have a random sample from a normal
More informationOne Factor Experiments
One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal
More informationChapter 6: Simulation Using Spread-Sheets (Excel)
Chapter 6: Simulation Using Spread-Sheets (Excel) Refer to Reading Assignments 1 Simulation Using Spread-Sheets (Excel) OBJECTIVES To be able to Generate random numbers within a spreadsheet environment.
More informationBasics: How to Calculate Standard Deviation in Excel
Basics: How to Calculate Standard Deviation in Excel In this guide, we are going to look at the basics of calculating the standard deviation of a data set. The calculations will be done step by step, without
More informationLesson 19: The Graph of a Linear Equation in Two Variables Is a Line
The Graph of a Linear Equation in Two Variables Is a Line Classwork Exercises THEOREM: The graph of a linear equation yy = mmmm + bb is a non-vertical line with slope mm and passing through (0, bb), where
More informationCHAPTER 2: Describing Location in a Distribution
CHAPTER 2: Describing Location in a Distribution 2.1 Goals: 1. Compute and use z-scores given the mean and sd 2. Compute and use the p th percentile of an observation 3. Intro to density curves 4. More
More informationChapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea
Chapter 3 Bootstrap 3.1 Introduction The estimation of parameters in probability distributions is a basic problem in statistics that one tends to encounter already during the very first course on the subject.
More informationR Programming Basics - Useful Builtin Functions for Statistics
R Programming Basics - Useful Builtin Functions for Statistics Vectorized Arithmetic - most arthimetic operations in R work on vectors. Here are a few commonly used summary statistics. testvect = c(1,3,5,2,9,10,7,8,6)
More informationMultivariate Capability Analysis
Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8
More information2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data.
Code No: M0502/R05 Set No. 1 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Briefly discuss
More informationLecture 6: Chapter 6 Summary
1 Lecture 6: Chapter 6 Summary Z-score: Is the distance of each data value from the mean in standard deviation Standardizes data values Standardization changes the mean and the standard deviation: o Z
More informationCREATING THE DISTRIBUTION ANALYSIS
Chapter 12 Examining Distributions Chapter Table of Contents CREATING THE DISTRIBUTION ANALYSIS...176 BoxPlot...178 Histogram...180 Moments and Quantiles Tables...... 183 ADDING DENSITY ESTIMATES...184
More informationFathom Dynamic Data TM Version 2 Specifications
Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other
More informationLAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT
NAVAL POSTGRADUATE SCHOOL LAB #2: SAMPLING, SAMPLING DISTRIBUTIONS, AND THE CLT Statistics (OA3102) Lab #2: Sampling, Sampling Distributions, and the Central Limit Theorem Goal: Use R to demonstrate sampling
More informationDescriptive Statistics, Standard Deviation and Standard Error
AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.
More informationExam 2 is Tue Nov 21. Bring a pencil and a calculator. Discuss similarity to exam1. HW3 is due Tue Dec 5.
Stat 100a: Introduction to Probability. Outline for the day 1. Bivariate and marginal density. 2. CLT. 3. CIs. 4. Sample size calculations. 5. Review for exam 2. Exam 2 is Tue Nov 21. Bring a pencil and
More informationScreening Design Selection
Screening Design Selection Summary... 1 Data Input... 2 Analysis Summary... 5 Power Curve... 7 Calculations... 7 Summary The STATGRAPHICS experimental design section can create a wide variety of designs
More informationTable 1 below illustrates the construction for the case of 11 integers selected from 20.
Q: a) From the first 200 natural numbers 101 of them are arbitrarily chosen. Prove that among the numbers chosen there exists a pair such that one divides the other. b) Prove that if 100 numbers are chosen
More informationLesson 19: The Graph of a Linear Equation in Two Variables is a Line
Lesson 19: The Graph of a Linear Equation in Two Variables is a Line Classwork Exercises Theorem: The graph of a linear equation y = mx + b is a non-vertical line with slope m and passing through (0, b),
More informationProbability and Statistics for Final Year Engineering Students
Probability and Statistics for Final Year Engineering Students By Yoni Nazarathy, Last Updated: April 11, 2011. Lecture 1: Introduction and Basic Terms Welcome to the course, time table, assessment, etc..
More informationLASER s Level 2 Maths Course - Summary
LASER s Level 2 Maths Course - Summary Unit Code Unit Title Credits Level Status SER945 Shape, Space and Measurement 3 2 Mandatory SER946 Collecting, Recording and Analysing Data 3 2 Mandatory SER947 Development
More informationTopic (3) SUMMARIZING DATA - TABLES AND GRAPHICS
Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS 3- Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS A) Frequency Distributions For Samples Defn: A FREQUENCY DISTRIBUTION is a tabular or graphical display
More informationRobust Linear Regression (Passing- Bablok Median-Slope)
Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming
More informationChapter 5: The standard deviation as a ruler and the normal model p131
Chapter 5: The standard deviation as a ruler and the normal model p131 Which is the better exam score? 67 on an exam with mean 50 and SD 10 62 on an exam with mean 40 and SD 12? Is it fair to say: 67 is
More informationIf the active datasheet is empty when the StatWizard appears, a dialog box is displayed to assist in entering data.
StatWizard Summary The StatWizard is designed to serve several functions: 1. It assists new users in entering data to be analyzed. 2. It provides a search facility to help locate desired statistical procedures.
More informationQuantitative - One Population
Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,
More informationTo complete the computer assignments, you ll use the EViews software installed on the lab PCs in WMC 2502 and WMC 2506.
An Introduction to EViews The purpose of the computer assignments in BUEC 333 is to give you some experience using econometric software to analyse real-world data. Along the way, you ll become acquainted
More informationUsing Arithmetic of Real Numbers to Explore Limits and Continuity
Using Arithmetic of Real Numbers to Explore Limits and Continuity by Maria Terrell Cornell University Problem Let a =.898989... and b =.000000... (a) Find a + b. (b) Use your ideas about how to add a and
More informationError Analysis, Statistics and Graphing
Error Analysis, Statistics and Graphing This semester, most of labs we require us to calculate a numerical answer based on the data we obtain. A hard question to answer in most cases is how good is your
More informationExperiment 1 Yahtzee or Validating the t-table
Experiment 1 Yahtzee or Validating the t-table Learning Goals 1. Receive training and practice with many useful spreadsheet functions in Excel. 2. Review and experience the concepts of probability and
More informationA Constant Rate of Change Name Part 1
A Constant Rate of Change Name Part 1 Consider the function table below. Complete this page by solving the problems at the bottom. Use a separate sheet of paper for your descriptions and explanations.
More informationAssignment 4/5 Statistics Due: Nov. 29
Assignment 4/5 Statistics 5.301 Due: Nov. 29 1. Two decision rules are given here. Assume they apply to a normally distributed quality characteristic, the control chart has three-sigma control limits,
More informationMeasures of Dispersion
Measures of Dispersion 6-3 I Will... Find measures of dispersion of sets of data. Find standard deviation and analyze normal distribution. Day 1: Dispersion Vocabulary Measures of Variation (Dispersion
More informationMHPE 494: Data Analysis. Welcome! The Analytic Process
MHPE 494: Data Analysis Alan Schwartz, PhD Department of Medical Education Memoona Hasnain,, MD, PhD, MHPE Department of Family Medicine College of Medicine University of Illinois at Chicago Welcome! Your
More informationComputational Mathematics/Information Technology. Worksheet 2 Iteration and Excel
Computational Mathematics/Information Technology Worksheet 2 Iteration and Excel This sheet uses Excel and the method of iteration to solve the problem f(x) = 0. It introduces user functions and self referencing
More informationExercise 2.23 Villanova MAT 8406 September 7, 2015
Exercise 2.23 Villanova MAT 8406 September 7, 2015 Step 1: Understand the Question Consider the simple linear regression model y = 50 + 10x + ε where ε is NID(0, 16). Suppose that n = 20 pairs of observations
More informationResearch Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel
Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement
More informationLearner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display
CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &
More informationSTAT 113: Lab 9. Colin Reimer Dawson. Last revised November 10, 2015
STAT 113: Lab 9 Colin Reimer Dawson Last revised November 10, 2015 We will do some of the following together. The exercises with a (*) should be done and turned in as part of HW9. Before we start, let
More informationSTAT 135 Lab 1 Solutions
STAT 135 Lab 1 Solutions January 26, 2015 Introduction To complete this lab, you will need to have access to R and RStudio. If you have not already done so, you can download R from http://cran.cnr.berkeley.edu/,
More informationToday. Lecture 4: Last time. The EM algorithm. We examine clustering in a little more detail; we went over it a somewhat quickly last time
Today Lecture 4: We examine clustering in a little more detail; we went over it a somewhat quickly last time The CAD data will return and give us an opportunity to work with curves (!) We then examine
More informationSpreadsheet and Graphing Exercise Biology 210 Introduction to Research
1 Spreadsheet and Graphing Exercise Biology 210 Introduction to Research There are many good spreadsheet programs for analyzing data. In this class we will use MS Excel. Below are a series of examples
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use
More informationCHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data 2.2 Density Curves and Normal Distributions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Density Curves
More informationRegression Analysis and Linear Regression Models
Regression Analysis and Linear Regression Models University of Trento - FBK 2 March, 2015 (UNITN-FBK) Regression Analysis and Linear Regression Models 2 March, 2015 1 / 33 Relationship between numerical
More informationFractions and decimals have opposites, just as integers do. For example, 5 8 and 2 5
Domain 1 Lesson Rational Numbers Common Core Standards: 6.NS., 6.NS.6.c, 6.NS.7.c Getting the Idea A rational number is a number that can be expressed as the ratio of two integers in the form a, where
More information= 3 + (5*4) + (1/2)*(4/2)^2.
Physics 100 Lab 1: Use of a Spreadsheet to Analyze Data by Kenneth Hahn and Michael Goggin In this lab you will learn how to enter data into a spreadsheet and to manipulate the data in meaningful ways.
More informationMAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015
MAT 142 College Mathematics Statistics Module ST Terri Miller revised July 14, 2015 2 Statistics Data Organization and Visualization Basic Terms. A population is the set of all objects under study, a sample
More informationBehavior of the sample mean. varx i = σ 2
Behavior of the sample mean We observe n independent and identically distributed (iid) draws from a random variable X. Denote the observed values by X 1, X 2,..., X n. Assume the X i come from a population
More information4. Descriptive Statistics: Measures of Variability and Central Tendency
4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and
More informationSTATS PAD USER MANUAL
STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11
More informationUsing Excel This is only a brief overview that highlights some of the useful points in a spreadsheet program.
Using Excel 2007 This is only a brief overview that highlights some of the useful points in a spreadsheet program. 1. Input of data - Generally you should attempt to put the independent variable on the
More informationLab 7 Statistics I LAB 7 QUICK VIEW
Lab 7 Statistics I This lab will cover how to do statistical calculations in excel using formulas. (Note that your version of excel may have additional formulas to calculate statistics, but these formulas
More informationE-Campus Inferential Statistics - Part 2
E-Campus Inferential Statistics - Part 2 Group Members: James Jones Question 4-Isthere a significant difference in the mean prices of the stores? New Textbook Prices New Price Descriptives 95% Confidence
More informationWinstats Instruction Sheet
Winstats Instruction Sheet I. Installing Winstats on your Computer A. Go to the Peanut Software homepage. Either go directly to http://math.exeter.edu/rparris/default.html or Google Peanut Software. B.
More informationInstructions for Using ABCalc James Alan Fox Northeastern University Updated: August 2009
Instructions for Using ABCalc James Alan Fox Northeastern University Updated: August 2009 Thank you for using ABCalc, a statistical calculator to accompany several introductory statistics texts published
More informationVALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES
Libraries Conference on Applied Statistics in Agriculture 1996-8th Annual Conference Proceedings VALIDITY OF 95% t-confidence INTERVALS UNDER SOME TRANSECT SAMPLING STRATEGIES Stephen N. Sly Jeffrey S.
More informationCritical Numbers, Maximums, & Minimum
Critical Numbers, Maximums, & Minimum Caleb Adams October 23, 2014 Critical Numbers of Rational Functions Recall, to find a critical number, we first found the derivative of a function, then examined where
More informationCHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data 2.2 Density Curves and Normal Distributions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers HW 34. Sketch
More informationSTAT 503 Fall Introduction to SAS
Getting Started Introduction to SAS 1) Download all of the files, sas programs (.sas) and data files (.dat) into one of your directories. I would suggest using your H: drive if you are using a computer
More information4b: Making an auxiliary table for calculating the standard deviation
In the book we discussed the use of an auxiliary table to calculate variance and standard deviation (Table 4.3). Such a table gives much more insight in the underlying calculations than the simple number
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationLecture 3 - Object-oriented programming and statistical programming examples
Lecture 3 - Object-oriented programming and statistical programming examples Björn Andersson (w/ Ronnie Pingel) Department of Statistics, Uppsala University February 1, 2013 Table of Contents 1 Some notes
More informationSpecific Objectives Students will understand that that the family of equation corresponds with the shape of the graph. Students will be able to create a graph of an equation by plotting points. In lesson
More informationPoints Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked
Plotting Menu: QCExpert Plotting Module graphs offers various tools for visualization of uni- and multivariate data. Settings and options in different types of graphs allow for modifications and customizations
More informationWeek 4: Describing data and estimation
Week 4: Describing data and estimation Goals Investigate sampling error; see that larger samples have less sampling error. Visualize confidence intervals. Calculate basic summary statistics using R. Calculate
More informationChapter 5snow year.notebook March 15, 2018
Chapter 5: Statistical Reasoning Section 5.1 Exploring Data Measures of central tendency (Mean, Median and Mode) attempt to describe a set of data by identifying the central position within a set of data
More information[Note: each line drawn must be a single line segment satisfying x = 3] (b) y = x drawn 1 B1 for y = x drawn
1. (a) x = 3 drawn 1 B1 for x = 3 drawn (b) y = x drawn 1 B1 for y = x drawn [Note: each line drawn must be a single line segment satisfying x = 3] [Note: each line drawn must be a single line segment
More informationVisual-XSel Introduction & SixSigma Selected statistical methods, examples and SixSigma with Visual-XSel Copyright CRGRAPH
Selected statistical methods, examples and SixSigma with Visual-XSel 14.0 Main Guide The Main Guide shows three important statistical methods: DoE, Reliability and Capability analyses. A pre investigation
More informationChapter 6: DESCRIPTIVE STATISTICS
Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling
More informationUsing R. Liang Peng Georgia Institute of Technology January 2005
Using R Liang Peng Georgia Institute of Technology January 2005 1. Introduction Quote from http://www.r-project.org/about.html: R is a language and environment for statistical computing and graphics. It
More informationAssignment 5.5. Nothing here to hand in
Assignment 5.5 Nothing here to hand in Load the tidyverse before we start: library(tidyverse) ## Loading tidyverse: ggplot2 ## Loading tidyverse: tibble ## Loading tidyverse: tidyr ## Loading tidyverse:
More informationTutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures
: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and February 24, 2014 Sample to Insight : RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and : RNA-Seq Analysis
More informationTesting Random- Number Generators
Testing Random- Number Generators Raj Jain Washington University Saint Louis, MO 63131 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse574-06/ 27-1 Overview
More informationChemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt )
JMP Output from Chapter 9 Factorial Analysis through JMP Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) Fitting the Model and checking conditions Analyze > Fit Model
More informationLab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots
STAT 350 (Spring 2015) Lab 3: SAS Solutions 1 Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots Note: The data sets are not included in the solutions;
More informationChapter 3: Data Description Calculate Mean, Median, Mode, Range, Variation, Standard Deviation, Quartiles, standard scores; construct Boxplots.
MINITAB Guide PREFACE Preface This guide is used as part of the Elementary Statistics class (Course Number 227) offered at Los Angeles Mission College. It is structured to follow the contents of the textbook
More informationOptimization and Simulation
Optimization and Simulation Statistical analysis and bootstrapping Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale
More informationWhat s Normal Anyway?
Name Class Problem 1 A Binomial Experiment 1. When rolling a die, what is the theoretical probability of rolling a 3? 2. When a die is rolled 100 times, how many times do you expect that a 3 will be rolled?
More informationChapters 5-6: Statistical Inference Methods
Chapters 5-6: Statistical Inference Methods Chapter 5: Estimation (of population parameters) Ex. Based on GSS data, we re 95% confident that the population mean of the variable LONELY (no. of days in past
More information