Ch6: The Normal Distribution

Similar documents
courtesy 1

Chapter 6. The Normal Distribution. McGraw-Hill, Bluman, 7 th ed., Chapter 6 1

Math 14 Lecture Notes Ch. 6.1

appstats6.notebook September 27, 2016

Section 10.4 Normal Distributions

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

The Normal Distribution & z-scores

IT 403 Practice Problems (1-2) Answers

Measures of Position

BIOL Gradation of a histogram (a) into the normal curve (b)

CHAPTER 2 Modeling Distributions of Data

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

Distributions of random variables

The Normal Distribution

The Normal Distribution & z-scores

6-1 THE STANDARD NORMAL DISTRIBUTION

STA Module 4 The Normal Distribution

STA /25/12. Module 4 The Normal Distribution. Learning Objectives. Let s Look at Some Examples of Normal Curves

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

The Normal Distribution & z-scores

CHAPTER 6. The Normal Probability Distribution

Chapter 6 Normal Probability Distributions

Today s Topics. Percentile ranks and percentiles. Standardized scores. Using standardized scores to estimate percentiles

Chapter 3 Analyzing Normal Quantitative Data

Chapter 6. THE NORMAL DISTRIBUTION

Normal Distribution. 6.4 Applications of Normal Distribution

CHAPTER 3: Data Description

Chapter 6. THE NORMAL DISTRIBUTION

Female Brown Bear Weights

Chapter 5: The normal model

Learning Objectives. Continuous Random Variables & The Normal Probability Distribution. Continuous Random Variable

4.3 The Normal Distribution

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 6.2-1

MAT 102 Introduction to Statistics Chapter 6. Chapter 6 Continuous Probability Distributions and the Normal Distribution

8: Statistics. Populations and Samples. Histograms and Frequency Polygons. Page 1 of 10

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.

AP Statistics. Study Guide

Distributions of Continuous Data

CHAPTER 2 Modeling Distributions of Data

Chapter 6: Continuous Random Variables & the Normal Distribution. 6.1 Continuous Probability Distribution

UNIT 1A EXPLORING UNIVARIATE DATA

Probability and Statistics. Copyright Cengage Learning. All rights reserved.

Univariate Statistics Summary

Lecture 6: Chapter 6 Summary

The main issue is that the mean and standard deviations are not accurate and should not be used in the analysis. Then what statistics should we use?

Chapter 2 Modeling Distributions of Data

How individual data points are positioned within a data set.

No. of blue jelly beans No. of bags

Lecture 3 Questions that we should be able to answer by the end of this lecture:

23.2 Normal Distributions

Chapter 5: The standard deviation as a ruler and the normal model p131

10.4 Measures of Central Tendency and Variation

10.4 Measures of Central Tendency and Variation

Sec 6.3. Bluman, Chapter 6 1

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015

MAT 110 WORKSHOP. Updated Fall 2018

Chapter 6 The Standard Deviation as Ruler and the Normal Model

CHAPTER 2 DESCRIPTIVE STATISTICS

Averages and Variation

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures

Lecture 3 Questions that we should be able to answer by the end of this lecture:

Chapter 7 Assignment SOLUTIONS WOMEN: Plus, add your pulse rate to the histogram.

Frequency Distributions

Chapter 6: DESCRIPTIVE STATISTICS

CHAPTER 2 Modeling Distributions of Data

Chapters 5-6: Statistical Inference Methods

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Central Limit Theorem Sample Means

3. Data Analysis and Statistics

Lecture Notes 3: Data summarization

Measures of Central Tendency

Section 7.2: Applications of the Normal Distribution

MAT 155. Z score. August 31, S3.4o3 Measures of Relative Standing and Boxplots

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016)

CHAPTER 2: Describing Location in a Distribution

Homework Packet Week #3

Key: 5 9 represents a team with 59 wins. (c) The Kansas City Royals and Cleveland Indians, who both won 65 games.

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Continuous Improvement Toolkit. Normal Distribution. Continuous Improvement Toolkit.

Chapter 7 Assignment SOLUTIONS WOMEN:

Name Date Types of Graphs and Creating Graphs Notes

Chapter 1. Looking at Data-Distribution

Chapter 11. Worked-Out Solutions Explorations (p. 585) Chapter 11 Maintaining Mathematical Proficiency (p. 583)

Chapter 2: The Normal Distributions

Check Homework. More with normal distribution

Chapter 2: Modeling Distributions of Data

STA 570 Spring Lecture 5 Tuesday, Feb 1

Probability & Statistics Chapter 6. Normal Distribution

Section 2.2 Normal Distributions

Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1

Chapter 2: The Normal Distribution

Basic Statistical Terms and Definitions

Data organization. So what kind of data did we collect?

IQR = number. summary: largest. = 2. Upper half: Q3 =

Section 6.3: Measures of Position

1. To condense data in a single value. 2. To facilitate comparisons between data.

Unit 8: Normal Calculations

AP Statistics Summer Assignment:

a. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.

The Normal Distribution

Transcription:

Ch6: The Normal Distribution Introduction Review: A continuous random variable can assume any value between two endpoints. Many continuous random variables have an approximately normal distribution, which means we can use the distribution of a normal random variable to make statistical inference about the variable. CH6: The Normal Distribution Santorico - Page 175

Example: Heights of randomly selected women CH6: The Normal Distribution Santorico - Page 176

Section 6-1: Properties of a Normal Distribution A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable. The theoretical shape of a normal distribution is given by the mathematical formula (x) 2 y e 2, where and are the mean and standard deviations of the probability distribution, respectively. Review: The and are parameters and hence describe the population. 2 2 CH6: The Normal Distribution Santorico - Page 177

The shape of a normal distribution is fully characterized by its mean and standard deviation. specifies the location of the distribution. specifies the spread/shape of the distribution. CH6: The Normal Distribution Santorico - Page 178

CH6: The Normal Distribution Santorico - Page 179

Properties of the Theoretical Normal Distribution A normal distribution curve is bell-shaped. The mean, median, and mode are equal and are located at the center of the distribution. A normal distribution curve is unimodal. The curve is symmetric about the mean. The curve is continuous (no gaps or holes). The curve never touches the x-axis (just approaches it). The total area under a normal distribution curve is equal to 1.0 or 100%. Review: The Empirical Rule applies (68%-95%-99.7%) CH6: The Normal Distribution Santorico - Page 180

CH6: The Normal Distribution Santorico - Page 181

The Standard Normal Distribution Since each normally distributed variable has its own mean and standard deviation, the shape and location of these curves will vary. To simplify making statistical inference using the normal distribution, we use the standard normal distribution. Standard normal distribution - a normal distribution with mean 0 and a standard deviation of 1. CH6: The Normal Distribution Santorico - Page 182

All normally distributed variables can be transformed into a standard normally distributed variable using the formula for the standard score (z-score): z value mean standard deviation or z X. The z-score for an observation is the number of standard deviations the observation lies from the mean. The letter Z is used to denote a standard normal random variable. CH6: The Normal Distribution Santorico - Page 183

Example: The mean resting pulse rates for men are normally distributed with a mean of 70 and a standard deviation of 8. A man has a pulse of 80. Find the corresponding z-score. We know: =70, =8, X=80 z X 80 70 10 1.25 8 8 The man s pulse is standard deviations above the mean because is positive. CH6: The Normal Distribution Santorico - Page 184

Example: The average speed of drivers on I-25 is 63 mph with a standard deviation of 5 mph. Assuming the data are normally distributed, find the z-score for the following speeds clocked by police officers: 74 mph z= 51 mph z= 82 mph z= CH6: The Normal Distribution Santorico - Page 185

Finding Areas/Probabilities for the Standard Normal Distribution Table E in Appendix C gives the area/probability under the standard normal distribution curve to the left of z-values between -3.49 and 3.49 in increments of.01. To find P(Z z) (the probability that a standard normal random variable Z is less than or equal to the value z), we match our value of z with the left column and top row of Table E (by adding the numbers together). The row and column where these numbers intersect gives us the probability or area under the standard normal curve to the left of z. Note: Use.0001 when z 3.50 and.9999 when z 3.50. CH6: The Normal Distribution Santorico - Page 186

Examples: P(Z 3.49).0002 P(Z 1.31).0951 P(Z 0).5000 P(Z 1.96).9750 P(Z 3.49).9998 P(Z 1.43) P(Z 1.82) CH6: The Normal Distribution Santorico - Page 187

Finding the Area Under the Standard Normal Distribution Curve 1. To the left of any z-value: look up the z value in the table and use the area given. CH6: The Normal Distribution Santorico - Page 188

Examples: Find the area to the left of z 0.84, i.e., find P(Z 0.84). Find the area to the left of z = -1.32, i.e., find P(Z 1.32). CH6: The Normal Distribution Santorico - Page 189

2. To the right of any z-value: look up the z value and subtract the area from 1. CH6: The Normal Distribution Santorico - Page 190

Examples: Find the area to the right of z 0.21, i.e., find P(Z 0.21). P(Z > -0.21) = 1 P(Z -0.21) = 1 0.4168 = 0.5832 Find the area to the right of z 1.52, i.e., find P(Z 1.52). CH6: The Normal Distribution Santorico - Page 191

3. Between any two z-values: look up both z values and subtract the smaller area from the larger area. CH6: The Normal Distribution Santorico - Page 192

Examples: Find the area between z 1.71 and z 0.45, i.e., find P( 1.71 Z 0.45) P( Z 0.45) P( Z 1.71) 0.6736-0.0436 0.63 Find the area between z=1.43 and z=3.01, i.e., find P(1.43 Z 3.01). CH6: The Normal Distribution Santorico - Page 193

Note: For a continuous random variable X (not a discrete random variable), the probability that X equals a specific number is 0. Let c be some number (like 1 or 7). Then P(X c) 0. Intuitive explanation: If we had ten numbers, 1, 2,, 10, and X is the number we see, P(X 2) 1/10. If we had one thousand numbers, 1, 2,, 1000, and X is the number we see, P(X 2) 1/1000. For a continuous random variable, there are an infinite number of possibilities, so P(X c) 1/ 0. (This is bad mathematics, but it gets the point across). CH6: The Normal Distribution Santorico - Page 194

Thus, for a continuous random variable: P(Z z) P(Z z) and P(Z z) P(Z z) and P(z 1 Z z 2 ) P(z 1 Z z 2 ) P(z 1 Z z 2 ) P(z 1 Z z 2 ) CH6: The Normal Distribution Santorico - Page 195

Section 6-2: Applications of the Normal Distribution Finding Probabilities for Normal Distributions To find probabilities for any normal distribution with mean and standard deviation, we simply transform the values on the original scale (call these x, x 1, or x 2 ) using the formula z value mean standard deviation or z x, and then use the techniques from the previous section. CH6: The Normal Distribution Santorico - Page 196

Example: An American household generates an average of 28 pounds per month of newspaper for garbage, with a standard deviation of 2 pounds. Assume the amount of newspaper garbage is normally distributed. If a household is selected at random, find the probability of generating between 27 and 31 pounds of newspaper garbage per month. X = garbage. We want P(27 < X < 31). Turn into a Z score so we can use the standard normal distribution. Z X and 27 X 31 27 28 31 28 P P Z 2 2 P( 0.5 Z 1.5) P( Z 1.5) P( Z 0.5) 0.9332 0.3085 0.6247 CH6: The Normal Distribution Santorico - Page 197

Example: College women s heights are normally distributed with 65 inches and 2.7 inches. What is the probability that a randomly selected college woman is 62 inches or shorter? X=height and X X 65 Z 2.7. We want X P( X 62) P 62 65 P Z 2.7 PZ ( -1.11) 0.1335 62 CH6: The Normal Distribution Santorico - Page 198

Example: What is the probability that a randomly selected college woman is between 63 and 69 inches tall? CH6: The Normal Distribution Santorico - Page 199

Sometimes we need to find the value X that corresponds to a certain percentile. (e.g., what value corresponds to the 98 th percentile?) To solve this problem: 1. Find the z that corresponds to this percentile for the standard normal distribution. a. The area to the left of this z-score has the area closest to this percentile. 2. X z. CH6: The Normal Distribution Santorico - Page 200

Example: Mensa is a society of high-iq people whose members have a score on an IQ test at the 98 th percentile or higher. The mean IQ score is 100 with a standard deviation of 16. What IQ score do you need to get into Mensa? 1. First determine what z-score is related to the 98 th percentile (to be in the 98 th percentile means that 98% of IQ test takers scored at or below your score) (i.e. 0.980). Z = 2.06 2. What is the IQ for the 98 th percentile using the formula for X given above? Z = +Z = 100 + 2.06(16) = 132.96 CH6: The Normal Distribution Santorico - Page 201

Example: College women s heights are normally distributed with a 65 and 2.7. What is the 54 th percentile of this population? 1. Find Z-score for 54 th percentile: 2. Find corresponding value for X: CH6: The Normal Distribution Santorico - Page 202

Example: An American household generates an average of 28 pounds per month of newspaper for garbage with a standard deviation of 2 pounds. Assume the amount of newspaper garbage is normally distributed. What is the 34 th percentile for the amount of newspaper garbage American households produce per month? CH6: The Normal Distribution Santorico - Page 203

Determining Normality The easiest way to determine if a distribution is bell-shaped is to draw a histogram for the data and check its shape. If the histogram is NOT approximately bell-shaped then the distribution is NOT normally distributed. The skewness of a data set can be checked by using the Pearson s index, PI, of skewness. PI 3(X median) s If the index is greater than or equal to +1 or less than or equal to -1, it can be concluded that the data are significantly skewed. CH6: The Normal Distribution Santorico - Page 204

If the index is smaller than -1, then the distribution is negatively skewed. If the index is greater than +1, then the distribution is positively skewed. The data should also be checked for outliers since they can have a big effect on normality. If the histogram is bell-shaped, the PI is between -1 and +1, and there are no outliers then it can be concluded that the distribution is normally distributed. CH6: The Normal Distribution Santorico - Page 205

Example: A survey of 18 high-technology firms showed the number of days inventory they had on hand. Determine if the data are normally distributed. CH6: The Normal Distribution Santorico - Page 206

For these data, the sample mean is 79.5, the median is 77.5, and the sample standard deviation is 40.5. Check for skewness using Pearson s skewness index. Based on the PI, which is 0.148, the distribution for the data is not significantly skewed. PI 3( X median) s 3(79.5 77.5) 40.5 0.148 CH6: The Normal Distribution Santorico - Page 207

Example: Consider a data set with a mean of 126.5, a median of 144 and a standard deviation of 38.65, find the PI and interpret it. PI= CH6: The Normal Distribution Santorico - Page 208

Section 6-3: The Central Limit Theorem If we took numerous random samples and calculated the sample means, what distribution would the sample means have? This distribution is known as the sampling distribution of the sample means. Sampling distribution of sample means the distribution of the sample means calculated from all possible random samples of size n from a population. CH6: The Normal Distribution Santorico - Page 209

If the samples are randomly selected, the sample means will be somewhat different from the population mean. These differences are due to sampling error. Sampling error - the difference between the sample measure and the corresponding population measure due to the fact that the sample is not a perfect representation of the population. Let s look at a simulation through the Sampling Distribution (CLT) Experiment available at http://www.socr.ucla.edu/htmls/socr_experiments.html. What do you notice about the sampling distribution for the mean? CH6: The Normal Distribution Santorico - Page 210

Properties of the Distribution of Sample Means When all possible samples of a specific size are selected with replacement from a population, the distribution of the sample means for a variable has three important properties: 1. The mean of the sample means will be the same as the population mean and is given by X. 2. The standard deviation of the sample means (known as the standard error of the mean) is given by X n. Example: See pg 331 and 332 for a well-worked out example. CH6: The Normal Distribution Santorico - Page 211

The Central Limit Theorem Reader s Digest Version: The distribution of the sample mean gets closer and closer to a normal distribution as the sample size increases. This distribution has a mean deviation X n. X and a standard This result applies no matter what the shape of the probability distribution from which the samples are taken. CH6: The Normal Distribution Santorico - Page 212

Some Important Points to Remember About the Central Limit Theorem: 1. If the distribution of the population is normal, then the distribution of the sample means will ALWAYS be normal. 2. If the sample size is large enough (typically n 30), then the distribution of the sample means will be approximately normal. The larger the sample, the better the approximation will be. CH6: The Normal Distribution Santorico - Page 213

Why is the Central Limit Theorem so useful? Even when we don t know the distribution of the population, if the sample size is sufficiently large, then we can use the properties of the normal distribution to make statistical inference about the population mean! So essentially, if the sample size is large enough we can make statistical inference about the population mean even if we don t know anything else about the population! If the population size is really large, then the results about the sampling distribution of the mean are approximately correct even if sampling without replacement. CH6: The Normal Distribution Santorico - Page 214

When the sample size is sufficiently large (30+), the central limit theorem can be used to answer questions about sample means in the same way that a normal distribution can be used to answer questions about individual data. However, our z- score formula changes slightly to z x / n since the standard deviation of x is / n. CH6: The Normal Distribution Santorico - Page 215

Example: The average yearly cost per household of owning a dog is $186.80 with a standard deviation of $32.00. Suppose that we randomly select 50 households that own a dog. What is the probability that the sample mean for these 50 households is less than $175.00? We want to know Px ( 175) and have =186.80, =32, n=50. To use our standard normal distribution, we need to convert to a Z- score: x 175 175 186.80 P( x 175) P P Z / n / n 32 / 50 PZ ( -2.61) = 0.0045 CH6: The Normal Distribution Santorico - Page 216

What is the probability that the sample mean for these 50 households is between $195 and $200? What is the probability that the sample mean for these 50 households is between $180 and $190? CH6: The Normal Distribution Santorico - Page 217

Tips: 1. Look for keywords about whether the data are normally distributed or the sample size is relatively large. 2. Note whether you are looking for a probability or a percentile. 3. The formula z (x )/ is used to gain information about an individual data value when the variable is normally distributed. 4. The formula z x is used to gain information when / n applying the central limit theorem about a sample mean when the population is normally distributed or when the sample size is 30 or more.. CH6: The Normal Distribution Santorico - Page 218