Center, Shape, & Spread Center, shape, and spread are all words that describe what a particular graph looks like.

Similar documents
MATH& 146 Lesson 10. Section 1.6 Graphing Numerical Data

Chapter 3 - Displaying and Summarizing Quantitative Data

Measures of Dispersion

Let s take a closer look at the standard deviation.

Chapter 3. Descriptive Measures. Slide 3-2. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

1.3 Graphical Summaries of Data

STA Module 2B Organizing Data and Comparing Distributions (Part II)

STA Learning Objectives. Learning Objectives (cont.) Module 2B Organizing Data and Comparing Distributions (Part II)

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Math 167 Pre-Statistics. Chapter 4 Summarizing Data Numerically Section 3 Boxplots

Mean,Median, Mode Teacher Twins 2015

Chpt 3. Data Description. 3-2 Measures of Central Tendency /40

Measures of Central Tendency

Univariate Statistics Summary

M7D1.a: Formulate questions and collect data from a census of at least 30 objects and from samples of varying sizes.

Section 3.1 Shapes of Distributions MDM4U Jensen

Pre-Calculus Multiple Choice Questions - Chapter S2

Averages and Variation

Chapter 6: DESCRIPTIVE STATISTICS

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

10.4 Measures of Central Tendency and Variation

10.4 Measures of Central Tendency and Variation

Table of Contents (As covered from textbook)

CHAPTER 2 DESCRIPTIVE STATISTICS

DAY 52 BOX-AND-WHISKER

Name Date Types of Graphs and Creating Graphs Notes

UNIT 1A EXPLORING UNIVARIATE DATA

Day 4 Percentiles and Box and Whisker.notebook. April 20, 2018

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

STP 226 ELEMENTARY STATISTICS NOTES PART 2 - DESCRIPTIVE STATISTICS CHAPTER 3 DESCRIPTIVE MEASURES

Chapter 2 Describing, Exploring, and Comparing Data

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

Chapter 1. Looking at Data-Distribution

AP Statistics Prerequisite Packet

Learning Log Title: CHAPTER 7: PROPORTIONS AND PERCENTS. Date: Lesson: Chapter 7: Proportions and Percents

Homework Packet Week #3

No. of blue jelly beans No. of bags

CHAPTER 2: SAMPLING AND DATA

Section 5.2: BUY OR SELL A CAR OBJECTIVES

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Week 4: Describing data and estimation

Box Plots. OpenStax College

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

AP Statistics Summer Assignment:

CHAPTER 3: Data Description

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016)

MATH& 146 Lesson 8. Section 1.6 Averages and Variation

MATH NATION SECTION 9 H.M.H. RESOURCES

Probability and Statistics. Copyright Cengage Learning. All rights reserved.

Vocabulary: Data Distributions

STA Module 4 The Normal Distribution

STA /25/12. Module 4 The Normal Distribution. Learning Objectives. Let s Look at Some Examples of Normal Curves

a. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.

15 Wyner Statistics Fall 2013

Understanding Statistical Questions

MATH 112 Section 7.2: Measuring Distribution, Center, and Spread

+ Statistical Methods in

Week 2: Frequency distributions

Measures of Position

Section 6.3: Measures of Position

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Frequency Distributions

Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1

The first few questions on this worksheet will deal with measures of central tendency. These data types tell us where the center of the data set lies.

Descriptive Statistics Descriptive statistics & pictorial representations of experimental data.

TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS

Chapter 11. Worked-Out Solutions Explorations (p. 585) Chapter 11 Maintaining Mathematical Proficiency (p. 583)

Lecture 3 Questions that we should be able to answer by the end of this lecture:

Ex.1 constructing tables. a) find the joint relative frequency of males who have a bachelors degree.

Learning Log Title: CHAPTER 8: STATISTICS AND MULTIPLICATION EQUATIONS. Date: Lesson: Chapter 8: Statistics and Multiplication Equations

Lecture 3 Questions that we should be able to answer by the end of this lecture:

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

Lecture 6: Chapter 6 Summary

Chapter 5: The standard deviation as a ruler and the normal model p131

Descriptive Statistics

Section 3.2 Measures of Central Tendency MDM4U Jensen

AND NUMERICAL SUMMARIES. Chapter 2

Name Geometry Intro to Stats. Find the mean, median, and mode of the data set. 1. 1,6,3,9,6,8,4,4,4. Mean = Median = Mode = 2.

Organizing and Summarizing Data

2.1: Frequency Distributions and Their Graphs

Chapter 3 Understanding and Comparing Distributions

Section 9: One Variable Statistics

Lecture 3: Chapter 3

Chapter 2 Modeling Distributions of Data

Chapter2 Description of samples and populations. 2.1 Introduction.

To calculate the arithmetic mean, sum all the values and divide by n (equivalently, multiple 1/n): 1 n. = 29 years.

The main issue is that the mean and standard deviations are not accurate and should not be used in the analysis. Then what statistics should we use?

Lecture Notes 3: Data summarization

Data Preprocessing. S1 Teknik Informatika Fakultas Teknologi Informasi Universitas Kristen Maranatha

September 11, Unit 2 Day 1 Notes Measures of Central Tendency.notebook

CHAPTER-13. Mining Class Comparisons: Discrimination between DifferentClasses: 13.4 Class Description: Presentation of Both Characterization and

Create a bar graph that displays the data from the frequency table in Example 1. See the examples on p Does our graph look different?

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015

Use of GeoGebra in teaching about central tendency and spread variability

3 Graphical Displays of Data

Descriptive Statistics

Actual Major League Baseball Salaries ( )

Transcription:

Center, Shape, & Spread Center, shape, and spread are all words that describe what a particular graph looks like. Center When we talk about center, shape, or spread, we are talking about the distribution of the data, or how the data is spread across the graph. The center of a distribution gives you exactly what it sounds like. It tells you the center or median of the data. When you look at a graph, it will be the value where approximately half of your data is on one side and the rest of your data is on the other side.

The median point of your data set is the middle number if you were to put your data in ascending order. Let's say we are taking surveys of different groups of people and their donut eating habits. For the first group of people, we have this graph. We see that our center is 5 because half of the people are to the left and the other half are to the right. Another way to describe the center is to take the mean or average of all your data. When you describe your center in terms of mean and median, you might find that they are slightly different. Your mean might be more or less than your median. We will discuss what skewed means in just a little bit, but as far as the center is concerned, if your graph is skewed, then you will want to use the median as your center.

Shape Depending on the group of people we survey about their donut eating habits, we will get different sets of data. When graphed, we can get different looking graphs. We use shape to describe the different types of graphs we will see. There are four different ways in which we can describe a graph's shape. 1. We can say a graph is symmetric if the left and right sides of the graph are mirror images of each other. This graph, for example, is symmetric because the left side is a mirror image of the right side. We see that at either end of the distribution, only 1 person chooses to eat 3 donuts and 7 donuts. Going closer to the center, we see that 2 persons choose to each 4 donuts and 6 donuts. They are mirror images of each other.

2. Sometimes, our graph will look like a rollercoaster and will have a number of peaks, or areas where the graph is higher than the surrounding areas. If there is only one peak, then we call it unimodal distribution. If this one peak occurs at the center of the graph, it is also called bell-shaped. Doesn't this look like a bell? If it has two peaks, then we will call it a bimodal distribution. (Note: The term bimodal alone literally means two modes, and the term unimodal alone literally means one mode. When describing the shape, you must use the word distribution after bimodal/unimodal every time.)

3. If our graph has more data on one side rather than the other, we call it skewed. If there are more to the right, we call it skewed left. For our donuts eaten survey, this would mean that more people choose to eat more donuts and fewer people choose to eat just a few. If our graph has more data to the left, then we would say that our graph is skewed right. For our donuts survey, it would mean that more people prefer to eat fewer donuts. A good way to remember this is to view the graph as a slide. If you slide down to the right, then it is skewed right and if you slide down to the left, then it is skewed left.

Boxplots often provide information about the shape of a data set. The examples below show some common patterns. Each of the above boxplots illustrates a different skewness pattern. If most of the observations are concentrated on the low end of the scale, the distribution is skewed right; and vice versa. If a distribution is symmetric, the observations will be evenly split at the median, as show above in the middle figure. 4. If our survey of people's donut eating habits showed that for each amount of donuts eaten, the same number of people would choose that amount, then our graph will look flat all across the top, then we call it uniform. A uniform shape has no peaks nor is it skewed. Spread A measure of spread, sometimes also called a measure of dispersion, is used to describe the variability in a sample or population. It is usually used in conjunction with a measure of central tendency, such as the mean or median, to provide an overall description of a set of data. There are many reasons why the measure of the spread of data values is important, but one of the main reasons regards its relationship with measures of central tendency. A measure of spread gives us an idea of how well the mean, for example, represents the data. If the spread of values in the data set is large, the mean is not as representative of the data as if the spread of data is small. This is because a large spread indicates that there are probably large differences between individual scores. Additionally, in research, it is often seen as positive if there is little variation in each data group. We will be looking at two measures of spread: range and quartiles. The range is the difference between the highest and lowest scores in a data set and is the simplest measure of spread. So we calculate range as: Range = maximum value - minimum value Whilst using the range as a measure of spread is limited, it does set the boundaries of the scores. This can be useful if you are measuring a variable that has either a critical low or high threshold (or both) that should not be crossed. The range will instantly inform you whether at least one value broke these critical thresholds. In addition, the range can be used to detect any errors when entering data. For example, if you have recorded the age of school children in your study and your range is 7 to 123 years old you know you have made a mistake! Quartiles tell us about the spread of a data set by breaking the data set into quarters, just like the median breaks it in half. Quartiles are a useful measure of spread because they are much less affected by outliers or a skewed data set than mean. For this reason, quartiles are often reported along with the median as the best choice of measure of spread and central tendency, respectively, when dealing with skewed and/or data with outliers. A common way of expressing quartiles is as an interquartile range. The interquartile range describes the difference between the third quartile (Q3) and the first quartile (Q1), telling us about the range of the middle half of the scores in the distribution. Interquartile range = Q3 - Q1