Chapter 6. THE NORMAL DISTRIBUTION

Similar documents
Chapter 6. THE NORMAL DISTRIBUTION

STA Module 4 The Normal Distribution

STA /25/12. Module 4 The Normal Distribution. Learning Objectives. Let s Look at Some Examples of Normal Curves

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.

Chapter 2 Modeling Distributions of Data

Chapter 5: The standard deviation as a ruler and the normal model p131

Data Analysis & Probability

CHAPTER 2 Modeling Distributions of Data

The Normal Distribution

Lecture 3 Questions that we should be able to answer by the end of this lecture:

appstats6.notebook September 27, 2016

Lecture 3 Questions that we should be able to answer by the end of this lecture:

Normal Distribution. 6.4 Applications of Normal Distribution

CHAPTER 2 Modeling Distributions of Data

Learning Objectives. Continuous Random Variables & The Normal Probability Distribution. Continuous Random Variable

CHAPTER 2 Modeling Distributions of Data

Section 10.4 Normal Distributions

Chapter 2: Modeling Distributions of Data

Section 2.2 Normal Distributions. Normal Distributions

Key: 5 9 represents a team with 59 wins. (c) The Kansas City Royals and Cleveland Indians, who both won 65 games.

MAT 102 Introduction to Statistics Chapter 6. Chapter 6 Continuous Probability Distributions and the Normal Distribution

6-1 THE STANDARD NORMAL DISTRIBUTION

BIOL Gradation of a histogram (a) into the normal curve (b)

IT 403 Practice Problems (1-2) Answers

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Lecture 21 Section Fri, Oct 3, 2008

CHAPTER 2: Describing Location in a Distribution

Chapter 2: The Normal Distributions

Chapter 6 Normal Probability Distributions

Sections 4.3 and 4.4

Section 2.2 Normal Distributions

The Normal Distribution

Unit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys

Averages and Variation

STP 226 ELEMENTARY STATISTICS NOTES PART 2 - DESCRIPTIVE STATISTICS CHAPTER 3 DESCRIPTIVE MEASURES

10.4 Measures of Central Tendency and Variation

10.4 Measures of Central Tendency and Variation

Distributions of random variables

Chapter 5. Normal. Normal Curve. the Normal. Curve Examples. Standard Units Standard Units Examples. for Data

Chapter 1. Looking at Data-Distribution

Lecture 6: Chapter 6 Summary

4.3 The Normal Distribution

Normal Data ID1050 Quantitative & Qualitative Reasoning

Ms Nurazrin Jupri. Frequency Distributions

Chapter 3 - Displaying and Summarizing Quantitative Data

Chapter 6: DESCRIPTIVE STATISTICS

Name: Date: Period: Chapter 2. Section 1: Describing Location in a Distribution

1. The Normal Distribution, continued

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

How individual data points are positioned within a data set.

Ch6: The Normal Distribution

Chapter 6 The Standard Deviation as Ruler and the Normal Model

Distributions of Continuous Data

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

The standard deviation 1 n

Chapter 6: Continuous Random Variables & the Normal Distribution. 6.1 Continuous Probability Distribution

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures

No. of blue jelly beans No. of bags

CHAPTER 3: Data Description

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 6.2-1

Measures of Position

Math 14 Lecture Notes Ch. 6.1

MAT 110 WORKSHOP. Updated Fall 2018

Chapter 2: Statistical Models for Distributions

23.2 Normal Distributions

Stat 528 (Autumn 2008) Density Curves and the Normal Distribution. Measures of center and spread. Features of the normal distribution

Chapter 2: The Normal Distribution

Chapter2 Description of samples and populations. 2.1 Introduction.

UNIT 1A EXPLORING UNIVARIATE DATA

CHAPTER 2 DESCRIPTIVE STATISTICS

The first few questions on this worksheet will deal with measures of central tendency. These data types tell us where the center of the data set lies.

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

AP Statistics. Study Guide

2.1: Frequency Distributions and Their Graphs

So..to be able to make comparisons possible, we need to compare them with their respective distributions.

Probability & Statistics Chapter 6. Normal Distribution

Chapter 2 Describing, Exploring, and Comparing Data

Chapter 3 Analyzing Normal Quantitative Data

3.5 Applying the Normal Distribution: Z-Scores

L E A R N I N G O B JE C T I V E S

Probability and Statistics. Copyright Cengage Learning. All rights reserved.

Section 1.2. Displaying Quantitative Data with Graphs. Mrs. Daniel AP Stats 8/22/2013. Dotplots. How to Make a Dotplot. Mrs. Daniel AP Statistics

7.2. The Standard Normal Distribution

Ch3 E lement Elemen ar tary Descriptive Descriptiv Statistics

a. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.

AND NUMERICAL SUMMARIES. Chapter 2

MATH NATION SECTION 9 H.M.H. RESOURCES

The Normal Distribution

Chapter 6. The Normal Distribution. McGraw-Hill, Bluman, 7 th ed., Chapter 6 1

Student Learning Objectives

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

1.3 Graphical Summaries of Data

Acquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.

height VUD x = x 1 + x x N N 2 + (x 2 x) 2 + (x N x) 2. N

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016)

CHAPTER 2: SAMPLING AND DATA

Measures of Central Tendency

MEASURES OF CENTRAL TENDENCY

Transcription:

Chapter 6. THE NORMAL DISTRIBUTION Introducing Normally Distributed Variables The distributions of some variables like thickness of the eggshell, serum cholesterol concentration in blood, white blood cells count in a specimen of blood have roughly the shape of a normal curve (bell shaped curve) Normally Distributed Variable A variable is said to be normally distributed or to have a normal distribution if its distribution has the shape of a normal curve. Normal distribution (curve) completely determined by mean ( ) and standard deviation ( ). Parameters of Normal distribution = (, ) Characteristics of Normal distribution Bell-shaped Symmetric around the mean Close to the horizontal axis outside the range from -3 to +3 Spread depends on the standard deviation. Area under the curve is 1 for any (, ). Notation: Y~N(12, 7) indicates that Y has normal distribution with mean 12 and standard deviation 7 1

Normally distributed variables and normal-curve areas For a normally distributed variable, the percentage of all possible observations that lie within any specified range equals the corresponding area under its associated normal curve expressed as a percentage. This holds true approximately for a variable that is approximately normally distributed. Visualizing a Normal Distribution (Empirical Rule) 1. 68.26% of all possible observations lie within one standard deviation to either side of the mean, i.e., between - and +. 2. 95.44% of all possible observations lie within two standard deviations to either side of the mean, i.e., between - 2 and + 2. 3. 99.74% of all possible observations lie within three standard deviations to either side of the mean, i.e., between - 3 and + 3. 2

Special Case: Standard Normal Distribution; Standard Normal Curve A normally distributed variable having mean 0 and standard deviation 1 is said to have the standard normal distribution. Its associated normal curve is called the standard normal curve. Basic Properties of the Standard Normal Curve 1. The total area under the standard normal curve is equal to 1. 2. The standard normal curve extends indefinitely in both directions, approaching, but never touching, the horizontal axis as it does so. 3. The standard normal curve is symmetric about 0; i.e., the part of the curve to the left of 0 is the mirror image of the part of the curve to the right of 0. 4. Most of the area under the standard normal curve lies between 3 and 3. Standardized Normally Distributed Variable The standardized version of a normally distributed variable Y, Z= Y μ σ standard normal distribution. has the Example (Heights of Female of College Students): Suppose that large college has female students population with height that follows normal distribution, the mean height of these students is 64.4 inches and the standard deviation is 2.4 inches. Q1: Suppose you want to find out the percentage of students whose heights are between 66 and 68 inches, you have to evaluate the area under the normal curve from 66 to 68. Q2: Suppose you want to determine the probability that randomly selected student will have height between 66 and 68. Both questions are answered by computing the area below: 3

68 66 ( x 64.4)2 1 2 π(2.4) e 2(2. 4) 2 dx 2 Area = 0.1846 (by TABLE) Facts: 1) Once we know the mean and standard deviation of a normally distributed variable, we know its distribution and associated normal curve 2) Percentages for a normally distributed variable are equal to areas under its associated normal curve. 3) Probability that randomly selected element from normally distributed population is in a particular range equals to the area under that normal curve over the given range. Example: Suppose quiz scores X in stp226 class nave normal distribution with mean 9 and sd 2, X~N(9,2) Q1.What % of quiz scores are between 5 and 11 points? Q2. For randomly selected student what is the probability that he scored between 5 and 11 points Both questions can be answered by computing area shown above. Using tables that area is equal to 0.8418-0.0228=0.819 (81.9%) 4

How to find areas under the Standard Normal Curve Using the Standard-Normal Table There are infinitely many normally distributed variables, however, if these variables can be standardized, then the standard normal tables can be used to find the areas under the curve. * Table set up to accumulate the area under the curve from - to and specified value. * The table starts at 3.9 and goes to 3.9 since outside this range of values the area is negligible. * The table can be used to find a z value given and area, or and area given a z value. Working with Normally Distributed Variables To Determine a Percentage or Probability for a normally Distributed Variable 1. Sketch the normal curve associated with the variable. 2. Shade the region of interest and mark the delimiting x-values. 3. Compute the z-scores for the delimiting x-values found in step 2. 4. Use Table II to obtain the area under the standard normal curve delimited by the z-scores found in step 3. Example (contd.) Height of Female students: Normal distribution with = 64.4, = 2.4. We want to determine the probability that randomly selected student will have height between 66 and 68. or the percentage of students whose heights are between 66 and 68 inches. z-score for x = 66: z = (66-64.4)/2.4 = 0.67, x=68: z = (68-64.4)/2.4 = 1.5 area under standard normal curve: z= 1.5 -> 0.9332, z = 0.67 -> 0.7486 resulting probability: 0.9332 0.7486 = 0.1846 5

In conclusion: For normally distributed variables Y : 1)What percentage of values of Y are in the range a to b 2)For randomly selected Y what is the probability P(a< Y < b) Both above questions:can be answered by computing area under the normal curve between a and b. To Determine the Observations Corresponding to a specified Percentage or Probability for a Normally Distributed Variable. 1. Sketch the normal curve associated with the variable. 2. Shade the region of interest. 3. Use Table II to obtain the z-scores delimiting the region in step 2. 4. Obtain the x-values having the z-scores found in step 3: x= + z( ) Example (contd.) a. Obtain the Q 3 (75 th percentile) of the height of female students. The z-score corresponding to Q 3 is the one having an area of 0.75 to its left under the standard normal curve. From Table II, that z-score is 0.67=z 0.25, approximately. So the x-value (height) corresponding to that z-score is 64.4 + (0.67)*2.4 = 66 inches. b. Obtain the 10 th percentile. z-score corresponding to P 10 is the one having an area of 0.1 to its left under the standard normal curve. From Table II, that z-score is 1.28= - z 0.10, approximately. So the x-value (height) corresponding to that z-score is 64.4 + (-1.28)*2.4 = 61.32. 6

The z Notation The symbol z is used to denote the z- score having area (alpha) to its right under the standard normal curve. z - z sub alpha or simply z. Assessing Normality. Normal Probability Plots. 1. Many statistical procedures are based on the assumption that data analyzed is coming from normally distributed populations. One way to assess the normality of your data is through the use of Empirical Rule. We can compute percentages within 1, 2 and 3 SD-s from the mean of the data and check is the percentages are close to expected 68-95-99.7. Visual check of the histogram is also helpful, if we have unimodal, nearly symmetric graph with no long or very short tails, we can be pretty sure that normality assumption can be made. 2. With small data sets in particular Empirical Rule or a visual check of the histogram is not as useful. A special statistical graph: Normal Probability Plot is often used. The plot is a scatterplot that compares observed data values to the values we would expect to have if the population were normal. If the data came from normal population, points would follow a straight line; Following example illustrates the procedure. Ex. Y= age of onset of diabetes, sample of size 5: 7, 48, 43, 51, 49. Order your data. Compute mean and standard deviation: y=39.6, s=18.46 Y=observed height 7 43 48 49 51 (i-.5)/n= adjusted percentile.10.30.50.7.9 z -1.28-0.52 0 0.52 1.28 Y=theoretical height= 16.0 30.0 39.6 49.2 63.2 7

y+ z s Graphing theoretical height (x-axis) vs observed height (y-axis) we can see that points do not follow a line, first value is much smaller than expected theoretical value indicating left skewness. Following two pictures illustrate Normal Quantile plots for other data sets. normal Ex1 Rainwater ph value Good fit, distribution is close to normal Ex2 Survival times in some population Curved pattern, distribution is not normal but right skewed, few individuals have particularly long survival times. 8

Interpreting Normal plots: Distribution close to normal will be indicated by a straight line (more or less) Left skewed distribution will be indicated by the line curving down from the left (lower observations smaller than expected for normal distr.). Right skewed distribution will be indicated by the line curving up from the right (upper observations larger than expected for normal distr.) Outliers will appear as points far away from the overall pattern of the plot. Granularity: sometimes plotted points appear to form a horizontal segments indicating repeated identical observations. This should not prevent us from adopting a normal distribution for the data. We can avoid this problem often by taking more precise measurements (not rounding to much). Instructions for : TI83, 83-Plus, 84-Plus Computing areas under normal curves: use 2 nd VARS to get to the DISTR menu: option 2 normalcdf(lower limit, upper limit, mean, standard deviation) will give are between lower and upper limits (mean=0 and SD=1 are default values) Ex1 To find area between 1 and 1.7 under N(0,1) use normalcdf(1,1.7,0,1)=.1141 Ex2 To find area onder N(0,1) left of 2.3 use normalcdf(-1000000, 2.3,0,1)=.9893 (use any large negative number as lower limit) Ex3 To find area right of 2.11 under N(0,1) use normalcdf(2.11,1000000,0,1)=.0174 (use any large positive number as upper limit) Ex4 To find area under curve N(12,3) between 10 and 16 use normalcdf(10, 16, 12,3)=.6563 9

Finding points from under the normal curves when area is given. use 2 nd VARS to get to the DISTR menu: option 3 invnorm(area to the left, mean, standard deviation) (mean=0 and SD=1 are default values) Ex5 To find third decile of N(0,1) use invnorm(.3,0,1)=-.52 Ex6 To find 95 th percentile of N(0,1) (or to find Z. 05 ) use invnorm(.95,0,1)=1.645 Ex7 To find third quartile on N(12,3) use invnorm(.75,12,3)=14.02 10