appstats6.notebook September 27, 2016

Similar documents
STA Module 4 The Normal Distribution

STA /25/12. Module 4 The Normal Distribution. Learning Objectives. Let s Look at Some Examples of Normal Curves

Chapter 6 The Standard Deviation as Ruler and the Normal Model

Chapter 2 Modeling Distributions of Data

Chapter 5: The standard deviation as a ruler and the normal model p131

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.

UNIT 1A EXPLORING UNIVARIATE DATA

Name: Date: Period: Chapter 2. Section 1: Describing Location in a Distribution

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Chapter 2: The Normal Distributions

Chapter 6: DESCRIPTIVE STATISTICS

STA Module 2B Organizing Data and Comparing Distributions (Part II)

STA Learning Objectives. Learning Objectives (cont.) Module 2B Organizing Data and Comparing Distributions (Part II)

Chapter 5: The normal model

Chapter 6. THE NORMAL DISTRIBUTION

Chapter 6. THE NORMAL DISTRIBUTION

Measures of Position

Averages and Variation

Lecture 6: Chapter 6 Summary

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Ch6: The Normal Distribution

Measures of Dispersion

CHAPTER 2 Modeling Distributions of Data

AP Statistics. Study Guide

Distributions of random variables

Chapter 3: Data Description - Part 3. Homework: Exercises 1-21 odd, odd, odd, 107, 109, 118, 119, 120, odd

Chapter 3 Analyzing Normal Quantitative Data

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

10.4 Measures of Central Tendency and Variation

10.4 Measures of Central Tendency and Variation

Section 6.3: Measures of Position

a. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.

Lecture 3 Questions that we should be able to answer by the end of this lecture:

Lecture 3 Questions that we should be able to answer by the end of this lecture:

Chapter 3 - Displaying and Summarizing Quantitative Data

AP Statistics Prerequisite Packet

IT 403 Practice Problems (1-2) Answers

Chapter 2: The Normal Distribution

CHAPTER 2 DESCRIPTIVE STATISTICS

No. of blue jelly beans No. of bags

How individual data points are positioned within a data set.

Chapter 6 Normal Probability Distributions

The first few questions on this worksheet will deal with measures of central tendency. These data types tell us where the center of the data set lies.

Lecture Notes 3: Data summarization

CHAPTER 2 Modeling Distributions of Data

Section 1.2. Displaying Quantitative Data with Graphs. Mrs. Daniel AP Stats 8/22/2013. Dotplots. How to Make a Dotplot. Mrs. Daniel AP Statistics

Normal Distribution. 6.4 Applications of Normal Distribution

Section 10.4 Normal Distributions

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016)

MAT 110 WORKSHOP. Updated Fall 2018

CHAPTER 3: Data Description

MATH 112 Section 7.2: Measuring Distribution, Center, and Spread

Chapter 1. Looking at Data-Distribution

15 Wyner Statistics Fall 2013

AP Statistics Summer Assignment:

Understanding and Comparing Distributions. Chapter 4

STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I. 4 th Nine Weeks,

Ch3 E lement Elemen ar tary Descriptive Descriptiv Statistics

+ Statistical Methods in

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

L E A R N I N G O B JE C T I V E S

Chapter 2 Describing, Exploring, and Comparing Data

The main issue is that the mean and standard deviations are not accurate and should not be used in the analysis. Then what statistics should we use?

1. The Normal Distribution, continued

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015

MATH NATION SECTION 9 H.M.H. RESOURCES

Chapter 2: Modeling Distributions of Data

Descriptive Statistics

Chapter 5. Understanding and Comparing Distributions. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Measures of Central Tendency

Key: 5 9 represents a team with 59 wins. (c) The Kansas City Royals and Cleveland Indians, who both won 65 games.

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Female Brown Bear Weights

CHAPTER 2: Describing Location in a Distribution

The Normal Curve. June 20, Bryan T. Karazsia, M.A.

STANDARDS OF LEARNING CONTENT REVIEW NOTES. ALGEBRA I Part II. 3 rd Nine Weeks,

STP 226 ELEMENTARY STATISTICS NOTES PART 2 - DESCRIPTIVE STATISTICS CHAPTER 3 DESCRIPTIVE MEASURES

Today s Topics. Percentile ranks and percentiles. Standardized scores. Using standardized scores to estimate percentiles

Section 2.2 Normal Distributions. Normal Distributions

Chapter2 Description of samples and populations. 2.1 Introduction.

Univariate Statistics Summary

2.1: Frequency Distributions and Their Graphs

Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1

Stat 528 (Autumn 2008) Density Curves and the Normal Distribution. Measures of center and spread. Features of the normal distribution

Name Date Types of Graphs and Creating Graphs Notes

Table of Contents (As covered from textbook)

Measures of Dispersion

Homework Packet Week #3

MATH 1070 Introductory Statistics Lecture notes Descriptive Statistics and Graphical Representation

Learning Log Title: CHAPTER 7: PROPORTIONS AND PERCENTS. Date: Lesson: Chapter 7: Proportions and Percents

Chapter 5snow year.notebook March 15, 2018

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

The standard deviation 1 n

Chpt 3. Data Description. 3-2 Measures of Central Tendency /40

MATH& 146 Lesson 8. Section 1.6 Averages and Variation

To calculate the arithmetic mean, sum all the values and divide by n (equivalently, multiple 1/n): 1 n. = 29 years.

Chapter 3: Describing, Exploring & Comparing Data

Transcription:

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Objectives: 1.Students will calculate and interpret z scores. 2.Students will compare/contrast values from different distributions using their z scores.

The Standard Deviation as a Ruler pg 102 Standard deviation is a measure of spread, or variability. The smaller the standard deviation, the less variability is present in the data. The larger the standard deviation, the more variability is present in the data. Standard deviation can be used as a ruler for measuring how an individual compares to a group. Over and Over in this course, we will ask the question " How far is this value from the mean? Standardizing with z scores We compare individual data values to their mean, relative to their standard deviation using the following formula: To measure how far above or below the mean any given data value is, we find its standardized value, or z score. To standardize a value, subtract the mean and divide by the standard deviation. pg 103

Standardizing with z scores pg 103 z scores measure the distance of each data value from the mean in standard deviations. The standardized values have no units and are called z scores. Z scores represent how far the value is above the mean (if positive) or below the mean (if negative). Ex: z = 1 means the value is one standard deviation above the mean Ex: z = 0.5 means the value is one half of a standard deviation below the mean Standardized values have no units. Examples: height for women is 64.5 inches with a standard deviation of 2.5 inches and for men is 69 inches with a standard deviation of 2.5 inches. Are you tall? height z = Suppose the average woman s shoe size is 8.25 with a standard deviation 1.15 and the average male shoe size is 10 with a standard deviation of 1.5. Do you have big feet? shoe z = Suppose Sharon wears a size 9 shoe and Andrew wears a size 9. Does Sharon have big feet? Does Andrew? Sharon z = Andrew z =

pg 103 In order to compare values that are measured using different scales, you must first standardize the values. The larger the z score, the more unusual it is. Benefits of Standardizing Standardized values have been converted from their original units to the standard statistical unit of standard deviations from the mean. pg 104 Standardized values, because they have no units, are therefore useful when comparing values that are measured on different scales, with different units, or from different populations.

Shifting Data pg 104 When adding (or subtracting) a constant amount to each value, all measures of position(median,mean, percentiles, min, max) will increase (or decrease) by the same the same constant. Adding a constant to all of the values in a set of data adds the same constant to the measures of center and percentiles. It does not, however, affect the spread. Example: Add 5 to each value in the given set of data (on the left) to form a new set of data (on the right). Then find the indicated measures of center and spread. {5, 5, 10, 35, 45}. {10, 10, 15, 40, 50}. Center: Center: x = x = M = M = Mode = Mode = Spread: Spread: Range = Range = IQR = IQR = SD = SD =

The following histograms show a shift from men s actual weights to kilograms above recommended weight of 74 kg: 74 kg was subtracted to each of the men's weight to compare their weights to the recommended height. Pair share A)The distribution has a shape,center and spread. B)The new distribution has a shape,center and spread. Rescaling Data pg 105 When we divide or multiply all the data values by any constant value, both measures of location (e.g., mean and median) and measures of spread (e.g., range, IQR, standard deviation) are divided and multiplied by the same value. Multiplying a constant to all of the values in a set of data multiplies the same constant to the measures of center and spread.

Example: Multiply each value in the given set of data (on the left) by 2 to form a new set of data(on the right). Then find the indicated measures of center and spread. {5, 5, 10, 35, 45}. {10, 10, 20, 70, 90}. Center: Center: x = x = M = M = Mode = Mode = Spread: Range = Spread: Range = IQR = IQR = SD = SD = The men s weight data set measured weights in kilograms. If we want to think about these weights in pounds, we would rescale the data: pg 106 Each weight was multiplied by 2.2 because there is 2.2 pounds in every kilogram. Pair share A)The distribution has a shape,center and spread. B)The new distribution has a shape,center and spread

Example#1 Example#1

Back to z scores pg 106 see step by step Standardizing data into z scores shifts the data by subtracting the mean and rescales the values by dividing by their standard deviation. Standardizing into z scores does not change the shape of the distribution. Standardizing into z scores changes the center by making the mean 0. Standardizing into z scores changes the spread by making the standard deviation 1. When Is a z score Big? pg 107 A z score gives us an indication of how unusual a value is because it tells us how far it is from the mean. Remember that a negative z score tells us that the data value is below the mean, while a positive z score tells us that the data value is above the mean. The larger a z score is (negative or positive), the more unusual it is.

When Is a z score Big? (cont.) There is no universal standard for z scores, but there is a model that shows up over and over in Statistics. This model is called the Normal model (You may have heard of bell shaped curves. or emperical rule). Normal models are appropriate for distributions whose shapes are unimodal and roughly symmetric. These distributions provide a measure of how extreme a z score is. pg 108 The 68 95 99.7 Rule pg 109 Normal models give us an idea of how extreme a value is by telling us how likely it is to find one that far from the mean. We can find these numbers precisely, but until then we will use a simple rule that tells us a lot about the Normal model

The Normal model: is symmetric and bell shaped. follows the 68 95 99.7 Rule About 68% of the values fall within one standard deviation of the mean. About 95% of the values fall within two standard deviations of the mean. About 99.7% (almost all) of the values fall within three standard deviations of the mean. pg 108 There is a Normal model for every possible combination of mean and standard deviation. The Normal model is determined by sigma and mu. We use the Greek letters sigma and mu because this is a model; it does not come from actual data. Sigma and mu are the parameters that specify the model. (average) We write N(μ,σ) to represent a Normal model with a mean of μ and a standard deviation of σ

Add to your notes Statistics Parameters The larger sigma, the more spread out the normal model appears. The inflection points occur a distance of sigma on either side of mu.

write in your notes pg 108 Summaries of data, like the sample mean and standard deviation, are written with Latin letters. Such summaries of data are called statistics. When we standardize Normal data, we still call the standardized value a z score. To standardize Normal data, subtract the mean (mu) and divide by the standard deviation (sigma). Once we have standardized, we need only one model: pg 108 The N(0,1) model is called the standard Normal model (or the standard Normal distribution). Be careful don t use a Normal model for just any data set, since standardizing does not change the shape of the distribution.

When we use the Normal model, we are assuming the distribution is Normal. pg 108 We cannot check this assumption in practice, so we check the following condition: Nearly Normal Condition: The shape of the data s distribution is unimodal and symmetric. This condition can be checked with a histogram or a Normal probability plot (to be explained later). Never use the Normal Model without checking whether the condition is satisfied. To assess normality: Examine the shape of the histogram or stem and leaf plot. A normal model is symmetric about the mean and bell shaped. Compare the mean and median. In a Normal model, the mean and median are equal. Verify that the 68 95 99.7 Rule holds. Construct a normal probability plot. If the graph is linear, the model is Normal TI 84 Tips Normal Probability Plot pg 119

Are You Normal? How Can You Tell? pg 118 Nearly Normal data have a histogram and a Normal probability plot that look somewhat like this example: If the distribution of the data is roughly Normal, the Normal probability plot approximates a diagonal straight line. Deviations from a straight line indicate that the distribution is not Normal. Are You Normal? How Can You Tell? pg 118 A skewed distribution might have a histogram and Normal probability plot like this:

Don t use a Normal model when the distribution is not unimodal and symmetric. pg 119 Example #2

Solution Example #3

solution pg 110 The First Three Rules for Working with Normal Models see step by step Make a picture. Make a picture. Make a picture. When we have data, make a histogram to check the Nearly Normal Condition to make sure we can use the Normal model to model the distribution.

pg 110 Finding Normal Percentiles by Hand pg 111 When a data value doesn t fall exactly 1, 2, or 3 standard deviations from the mean, we can look it up in a table of Normal percentiles. Table Z in Appendix E provides us with normal percentiles, but many calculators and statistics computer packages provide these as well.

Finding Normal Percentiles by Hand (cont.) pg 111 Table Z is the standard Normal table. We have to convert our data to z scores before using the table. Figure 6.5 shows us how to find the area to the left when we have a z score of 1.80: Finding Normal Percentiles Using Technology pg 112 Many calculators and statistics programs have the ability to find normal percentiles for us. The ActivStats Multimedia Assistant offers two methods for finding normal percentiles: The Normal Model Tool makes it easy to see how areas under parts of the Normal model correspond to particular cut points. There is also a Normal table in which the picture of the normal model is interactive. See TI Tips pg 112 113

Finding Normal Percentiles Using Technology The following was produced with the Normal Model Tool in ActivStats: pg 112 See Step by step pgs 113 117 Example #4 Sketch Normal Model

Example #4 Sketch Normal Model From Percentiles to Scores: z in Reverse pg 117 Step by Step Sometimes we start with areas and need to find the corresponding z score or even the original data value. Example: What z score represents the first quartile in a Normal model?

From Percentiles to Scores: z in Reverse pg 118 Look in Table Z for an area of 0.2500. The exact area is not there, but 0.2514 is pretty close. This figure is associated with z = 0.67, so the first quartile is 0.67 standard deviations below the mean.

Example#5

What Can Go Wrong? (cont.) pg 120 Don t use the mean and standard deviation when outliers are present the mean and standard deviation can both be distorted by outliers. Don t round off too soon. Don t round your results in the middle of a calculation. Don t worry about minor differences in results.

What have we learned? pg 120 The story data can tell may be easier to understand after shifting or rescaling the data. Shifting data by adding or subtracting the same amount from each value affects measures of center and position but not measures of spread. Rescaling data by multiplying or dividing every value by a constant changes all the summary statistics center, position, and spread. What have we learned? (cont.) pg 121 We ve learned the power of standardizing data. Standardizing uses the SD as a ruler to measure distance from the mean (z scores). With z scores, we can compare values from different distributions or values based on different units. z scores can identify unusual or surprising values among data.

What have we learned? (cont.) pg 121 A Normal model sometimes provides a useful way to understand data. We need to Think about whether a method will work check the Nearly Normal Condition with a histogram or Normal probability plot. Normal models follow the 68 95 99.7 Rule, and we can use technology or tables for a more detailed analysis.