Univariate Statistics Summary

Size: px
Start display at page:

Download "Univariate Statistics Summary"

Transcription

1 Further Maths Univariate Statistics Summary Types of Data Data can be classified as categorical or numerical. Categorical data are observations or records that are arranged according to category. For example: the favourite colour of a class of students; the mode of transport that each student uses to get to school; the rating of a TV program, either a great program, average program or poor program. Postal codes such as 3011, 3015 etc. Numerical data are observations based on counting or measurement. Calculations can be performed on numerical data. There are two main types of numerical data Discrete data, which takes only fixed values, usually whole numbers. Discrete data often arises as the result of counting items. For example: the number of siblings each student has, the number of pets a set of randomly chosen people have or the number of passengers in cars that pass an intersection. Continuous data can take any value in a given range. It is usually a measurement. For example: the weights of students in a class. The weight of each student could be measured to the nearest tenth of a kg. Weights of 84.8kg and 67.5kg would be recorded. Other examples of continuous data include the time taken to complete a task or the heights of a group of people. Exercise 1 Decide whether the following data is categorical or numerical. If numerical decide if the data is discrete or continuous Page 1 of 21

2 3. 4. Solutions 1a Numerical-discrete b. Categorical c. Categorical d. Numerical-Continuous 2a. Numerical b. Categorical c. Numerical d. Categorical e. Categorical 3a Continuous b. Discrete c. Continuous d. Continuous e. Discrete 4. D Representing Data Data is often represented in frequency charts, column charts, histograms and dot plots. Remember: the frequency of an observation is the number of times that observation occurs. Page 2 of 21

3 Frequency Univariate Statistics Summary Example 1: The following frequency distribution table gives the number of days of each weather type for the month of January. Represent the information using a column chart. Weather Type Hot Warm Mild Cool Weather Example 2 Represent the data in a. a histogram. b. a frequency polygon a. b. Notice that for a histogram there is no gap between the bars and the number of visits are positioned at the centre of each bar. Page 3 of 21

4 Example 3 Represent the data in a. a histogram b. a frequency polygon a. b. Notice that the numbers are placed at the edges of the bars along the x-axis for grouped data. Types of Average There are three types of average which can represent a set of data. An average is a measure of central tendency. The Mean The most common average is the mean. x is used to denote the mean. x = sum of all scores number of scores Example 1: The following data gives the number of pets kept in each of 10 different households. 3, 5, 4, 4, 2, 3, 0, 1, 4, 5 The mean number of pets is given by: Page 4 of 21 = 3.1 The mean is sometimes not the best average to use as it is affected by extreme scores or outliers.

5 The Median The median of a set of scores is the middle score when the data are arranged in order of size. The median s position is given by n+1 2 th score, where n is the number of scores. In example 1 the median s position is given by the 10+1 th score. This is the 5.5 th score or halfway 2 between the 5 th and 6 th score, after the scores have been arranged in order of size. Arranging the data in order of size: 0, 1, 2, 3, 3, 4, 4, 4, 5, 5 Median number of pets is: The median is not affected by extreme values or outliers. The Mode = 3.5 (as there are two middle scores we take their mean. The mode of a group of scores is the score that occurs most often. That is the score with the highest frequency. In example 1 the modal number of pets is 4. More than one mode is possible. Frequency Tables Example 4 The table indicates that 6 students made 0 cinema visits, 7 students made 1 cinema visit, 4 students made 2 cinema visits etc. The mean number of visits can be found by adding an extra column to the table and multiplying the number of visits by the frequency. Number of visits (x) Frequency (f) f x Total a. mean number of visits = total of f x total of f = = 1.25 Page 5 of 21

6 b. the median number of visits can be found by finding the position of the median as the number of visits are in order of size in the table. The median s position is the n th score = th = 10.5th position. Halfway between the 10 th 2 2 and 11 th scores. The median s position falls within the second row and is therefore 1. c. The mode is the score with the highest frequency. The mode is 1. Alternatively, the mean and median can be found using Lists and Spreadsheet in the calculator. 1: Enter the data into Lists and Spreadsheet view 2: Hit Menu, Statistics, Stat Calculations, One Variable Statistics 3. Click OK when number of lists appears. 4. In the pop up, click in the X1 List box and select visits from the drop down list. Hit the Tab key to move to the next box and select freq from the drop down list in the Frequency List box 6. The statistical data appears. The mean is given by x = 1.25 The median is 1. n is useful as it gives the frequency total. There is no need to enter data into the other boxes. 5. Click OK. Page 6 of 21

7 Grouped Data When data is presented in a frequency table within class intervals, and we do not know the actual values within each class interval, we assume that all values are equal to the midpoint of the class interval in order to find the mean. Example 5: The ages of a group of 30 people attending a superannuation seminar are recorded in the frequency table below, calculate the mean age. Age (Class Intervals) Frequency Total 30 To find the mean age, assume all people in the class interval are 24.5 years of age (This value is obtained by finding the midpoint of 20-29), all people in the class interval are 34.5 years of age and so on. The mean age can be found from the table below: Age (Class Intervals) Frequency f Midpoint of f m Class Interval m Total: The mean age x = total of f m total of f = = 46.8 years (correct to 1 decimal place) The above can be more easily done using Lists and Spreadsheet on the calculator. Page 7 of 21

8 1: In Lists and Spreadsheets view enter the data for the midpoints and the frequency into the first two columns. Label the columns as shown. 2. Press Menu, Statistics, Stat Calculations, One Variable Statistics. 3. Leave the Number of Lists as 1 and select OK. 4. In the pop up box, click in the X1 List box and select midpoint from the drop down list. Press the Tab key to move to the Frequency List box and select freq from the drop down list. 5. Press the TAB key to move to the OK button. The mean x = The mean age = 46.8 years Page 8 of 21

9 Measures of Variability or Spread It is useful to be able to measure the spread or variability of the data. How dispersed is the data? The Range The simplest measure of spread is the range. The range is the difference between the smallest score and the largest. Example 1 The set of data 3, 5, 4, 4, 2, 3, 0, 1, 4, 5 (which gave the number of pets in each of 10 households) has a range of 5 0 = 5 The Interquartile Range (IQR) The lower quartile, Q 1 is 1 of the way through the set of data. 4 The upper quartile, Q 3 is 3 of the way through the set of data. 4 The IQR = Q 3 Q 1 Page 9 of 21

10 Example 6 The Standard Deviation The standard deviation gives a measure of the spread of the data about the mean. The formula to find the standard deviation is complex and we usually find it directly from the calculator. On the calculator it is denoted by the symbol sx. The bigger the standard deviation, the greater the spread of data. Example 7 Find the standard deviation of the set of data: 12, 9, 4, 6, 5, 8, 9, 4, 10, 2. This is the same data as in Example In Lists and Spreadsheet view, enter the data. 2. Press Menu, Statistics, Stat Calculations, One Variable Statistics. 3. Leave the Number of Lists as 1 and select OK. 4. Enter the data into the pop up box as shown. 5. Click OK. Page 10 of 21

11 The statistical data appears. The standard deviation is given by sx = Notice that the lower quartile is given by Q 1 X = 4 and the upper quartile is given by Q 3 X = 9 This agrees with the solutions to Example 2. The IQR = 9 4 = 5 Stem and Leaf Plots Example 8 The data below shows the weights in kg of 20 possums arranged in order of size: Page 11 of 21

12 We can represent this data in a stem and leaf plot as shown below: Key: 0 7 = 0.7kg Stem Leaf In a stem and leaf plot the numbers are arranged in order of size. The key is given as 0 7 kg means stem 0 and leaf 7 which represents 0.7 kg. You should always include a key in the stem and leaf plot. When preparing a stem and leaf plot keep the number s in neat vertical columns because a neat plot will show the distribution of the scores. It is like a sideways bar chart or histogram. The interquartile range can be found from the stem and leaf plot. ( 20 1) 1. Find the median weight. The median weight Q2 is the th score. ie the 10.5 th score. 2 The median lies between the 10 th and 11 th scores. Count through the data to find the position of the median. It can be seen from the plot that the median lies between 1.8 and ( ) 1.9. The median weight is = 1.85 kg. 2 ( 10 1) 2. The lower quartile Q1 will be the th score in the lower half. ie the 5.5 th score in the 2 lower half. Count through the data to find the position of the lower quartile. Q1 = ( ) = 1.55 kg The upper quartile Q3 will be the 5.5 th score in the upper half of the plot. Count through the ( ) data to find the position of the upper quartile. Q3 = = 2.25 kg 2 4. The interquartile range = Q3 Q1 = = 0.7 kg 5. See diagram below: Q1 median Q2 Key: 0 7 = 0.7kg Stem Leaf Page 12 of 21 Q3

13 Example 9 Find the interquartile range of the data presented in the following stem and leaf plot. Key: 15 4 = 154 Q1 median Q2 Stem Leaf Q3 The median is the median is 179. ( 30 1) th score. ie the 15.5 th score which lies between 179 and 179. So the 2 The lower quartile Q1 will be the Q1 = 168. ( 15 1) th score in the lower half. ie the 8 th score in the lower half. 2 The upper quartile Q2 will be the 8 th score in the upper half of the data. ie 188. The interquartile range = Q3 Q1 = = 20. See the diagram above. Using CAS. You could check your answers by entering the data into your CAS calculator to determine the median, lower and upper quartiles. Boxplots Five-number summary A five number summary is a list consisting of the lowest score (Xmin), lower quartile (Q1), median (Q2), upper quartile (Q3) and the greatest score (Xmax) of a set of data. A five number summary gives information about the spread or variability of a set of data. Box Plots A box plot is a graph of the 5-number summary. It is a powerful way of showing the spread of data. A box plot consists of a central divided box with attached whiskers. The box spans the interquartile range. The median is marked by a vertical line inside the box. The whiskers indicate the range of scores. Box plots are always drawn to scale and a scale is often attached. Page 13 of 21

14 Interpreting a Boxplot A boxplot divides the data into four sections. 25% of the scores lie between the lowest score and the lower quartile, 25% between the lower quartile and the median, 25% between the median and the upper quartile and 25% between the upper quartile and the greatest score. Extreme Values or Outliers Extreme values often make the whiskers appear longer than they should and hence give the appearance that the data is spread over a much greater range than they really are. If an extreme value occurs in a set of data it can be denoted by a small cross on the boxplot. The whisker is then shortened to the next largest or smallest score. When one observation lies well away from other observations in a set, we call it an outlier. For example the histogram shows the weights of a group of 5-year old boys. Clearly the weight of 33kg is an outlier. Page 14 of 21

15 Determining whether an Observation is an Outlier To identify possible outliers we can use the following rule: An outlier is a score, x, which is either less than Q IQR or greater than Q IQR Where Q 1 is the lower quartile and Q 3 is the upper quartile and IQR is the interquartile range In summary An outlier is a score, x, which lies outside the interval Example 10 Q IQR x Q IQR Page 15 of 21

16 Distribution of Data Page 16 of 21

17 Comparing Two Sets of Data Back to Back Stem and Leaf Plots Univariate Statistics Summary Two sets of data can be compared using back to back stem and leaf plots. The data below shows the life time of a sample of 40 batteries in hours of each of two brands when fitted into a child s toy. Some of the toys are fitted with an ordinary battery and some with Brand X. Which battery is best? Key: 6 9 = 69 hours Ordinary Brand Leaf Stem Brand X Leaf The spread of each set of data can be seen graphically from the stem and leaf plot. It can be seen that although brand X showed a little more variability than the ordinary brand the batteries generally lasted longer. Parallel Box Plots The above data can also be compared by using parallel boxplots. The boxplots share a common scale. Quantitative comparisons can be made between the sets of data. The 5-Number Summaries of both types of batteries are given below. You can work them out from the stem plots or by using your calculator. Brand X Xmin Lower Quartile Q1 Median Q2 Upper Quartile Q3 Xmax Ordinary Brand Xmin Lower Quartile Q1 Median Q2 Upper Quartile Q3 Xmax Page 17 of 21

18 The following parallel boxplots can be drawn to compare the data. Time in Hours Brand X Ordinary Brand From the box plots it can be seen that: 1. Brand X showed more variability in its performance than the ordinary brand. Brand X range = 77, ordinary brand range = 54. Brand X interquartile range = 27.5 and ordinary brand interquartile range = The longest lifetime recorded was that of a Brand X battery of 146 hours 3. The shortest lifetime recorded was that of an ordinary battery of 60 hours. 4. Brand X battery median lifetime (109.5 hours) was better than that of an ordinary battery (87.5 hours) 5. Over one quarter of Brand X batteries were better performers than the best ordinary brand battery (that is, had longer lifetimes than the longest of the ordinary brand batteries lifetimes) The Mean, Standard Deviation and Normal Distribution The Mean The mean of a set of data, x, is given by: x = x, where, x represents the sum of all the n observations in the data set and n represents the number of observations in the data set. For example the mean of 4, 7, 9, 12 and 18 is given by: x = = 10 The mean is often used as the everyday average and gives a measure of the centre of a distribution. The mean is sometimes affected by outliers (extreme values) and the median is often a better average to use as it is not affected by extreme values. The Standard Deviation The standard deviation gives a measure of how the data is spread around the mean. The formula for the standard deviation is given below: (x x )2 s = n 1 Fortunately, the calculator can be used to find the standard deviation. On the calculator it is denoted by sx. Page 18 of 21

19 The 68%-95%-99.7% Rule applied to the Normal Distribution Many sets of data are approximately symmetric forming a "bell-shaped" curve. We refer to this type of data as a normal distribution. Examples include birth weights, people's heights etc. Data that is normally distributed have their symmetrical bell-shaped distribution centred on the mean value, x. The above data of the heights of people form a bell-shape and approximates a normal distribution. Page 19 of 21

20 Using the symmetry of the bell-shaped curve and the above rules various percentages can be calculated. Z-Scores To compare scores in different distributions we can make use of z-scores. The z-score, also known as the standardised score, indicates the position of a score in relation to the mean. It gives the number of standard deviations that the score is from the mean. A z score can be both positive and negative. A z-score of 0 indicates that the score is equal to the mean, a negative z-score indicates that the score is below the mean and a positive z-score indicates that the score is above the mean. A score that is exactly 1 standard deviation above the mean has a z-score of 1. A score that is exactly 2 standard deviations below the mean has a z-score of 2. Not all z-scores will be whole numbers; in fact most will not be. A whole number indicates only that the score is an exact number of standard deviations above or below the mean. For example, if the mean value of the IQ s of a group of students is 100 and the standard deviation is 15, an IQ of 88 would be represented by a z score of 0.8, as shown below. z = x x s = = 0.8 The negative value indicates that the IQ of 88 is below the mean but by less than one standard deviation. Page 20 of 21

21 Using Z-Scores to Compare Data Page 21 of 21

Chapter 3 - Displaying and Summarizing Quantitative Data

Chapter 3 - Displaying and Summarizing Quantitative Data Chapter 3 - Displaying and Summarizing Quantitative Data 3.1 Graphs for Quantitative Data (LABEL GRAPHS) August 25, 2014 Histogram (p. 44) - Graph that uses bars to represent different frequencies or relative

More information

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order. Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,

More information

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set. Measures of Central Tendency A measure of central tendency is a value used to represent the typical or average value in a data set. The Mean the sum of all data values divided by the number of values in

More information

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES Chapter 2 2.1 Objectives 2.1 What Are the Types of Data? www.managementscientist.org 1. Know the definitions of a. Variable b. Categorical versus quantitative

More information

MATH NATION SECTION 9 H.M.H. RESOURCES

MATH NATION SECTION 9 H.M.H. RESOURCES MATH NATION SECTION 9 H.M.H. RESOURCES SPECIAL NOTE: These resources were assembled to assist in student readiness for their upcoming Algebra 1 EOC. Although these resources have been compiled for your

More information

AND NUMERICAL SUMMARIES. Chapter 2

AND NUMERICAL SUMMARIES. Chapter 2 EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES Chapter 2 2.1 What Are the Types of Data? 2.1 Objectives www.managementscientist.org 1. Know the definitions of a. Variable b. Categorical versus quantitative

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

Chapter 2 Describing, Exploring, and Comparing Data

Chapter 2 Describing, Exploring, and Comparing Data Slide 1 Chapter 2 Describing, Exploring, and Comparing Data Slide 2 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data 2-4 Measures of Center 2-5 Measures of Variation 2-6 Measures of Relative

More information

Measures of Central Tendency

Measures of Central Tendency Page of 6 Measures of Central Tendency A measure of central tendency is a value used to represent the typical or average value in a data set. The Mean The sum of all data values divided by the number of

More information

STP 226 ELEMENTARY STATISTICS NOTES PART 2 - DESCRIPTIVE STATISTICS CHAPTER 3 DESCRIPTIVE MEASURES

STP 226 ELEMENTARY STATISTICS NOTES PART 2 - DESCRIPTIVE STATISTICS CHAPTER 3 DESCRIPTIVE MEASURES STP 6 ELEMENTARY STATISTICS NOTES PART - DESCRIPTIVE STATISTICS CHAPTER 3 DESCRIPTIVE MEASURES Chapter covered organizing data into tables, and summarizing data with graphical displays. We will now use

More information

AP Statistics Prerequisite Packet

AP Statistics Prerequisite Packet Types of Data Quantitative (or measurement) Data These are data that take on numerical values that actually represent a measurement such as size, weight, how many, how long, score on a test, etc. For these

More information

STA 570 Spring Lecture 5 Tuesday, Feb 1

STA 570 Spring Lecture 5 Tuesday, Feb 1 STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row

More information

10.4 Measures of Central Tendency and Variation

10.4 Measures of Central Tendency and Variation 10.4 Measures of Central Tendency and Variation Mode-->The number that occurs most frequently; there can be more than one mode ; if each number appears equally often, then there is no mode at all. (mode

More information

10.4 Measures of Central Tendency and Variation

10.4 Measures of Central Tendency and Variation 10.4 Measures of Central Tendency and Variation Mode-->The number that occurs most frequently; there can be more than one mode ; if each number appears equally often, then there is no mode at all. (mode

More information

STA Module 2B Organizing Data and Comparing Distributions (Part II)

STA Module 2B Organizing Data and Comparing Distributions (Part II) STA 2023 Module 2B Organizing Data and Comparing Distributions (Part II) Learning Objectives Upon completing this module, you should be able to 1 Explain the purpose of a measure of center 2 Obtain and

More information

STA Learning Objectives. Learning Objectives (cont.) Module 2B Organizing Data and Comparing Distributions (Part II)

STA Learning Objectives. Learning Objectives (cont.) Module 2B Organizing Data and Comparing Distributions (Part II) STA 2023 Module 2B Organizing Data and Comparing Distributions (Part II) Learning Objectives Upon completing this module, you should be able to 1 Explain the purpose of a measure of center 2 Obtain and

More information

Name Date Types of Graphs and Creating Graphs Notes

Name Date Types of Graphs and Creating Graphs Notes Name Date Types of Graphs and Creating Graphs Notes Graphs are helpful visual representations of data. Different graphs display data in different ways. Some graphs show individual data, but many do not.

More information

CHAPTER 2: SAMPLING AND DATA

CHAPTER 2: SAMPLING AND DATA CHAPTER 2: SAMPLING AND DATA This presentation is based on material and graphs from Open Stax and is copyrighted by Open Stax and Georgia Highlands College. OUTLINE 2.1 Stem-and-Leaf Graphs (Stemplots),

More information

CHAPTER 2 DESCRIPTIVE STATISTICS

CHAPTER 2 DESCRIPTIVE STATISTICS CHAPTER 2 DESCRIPTIVE STATISTICS 1. Stem-and-Leaf Graphs, Line Graphs, and Bar Graphs The distribution of data is how the data is spread or distributed over the range of the data values. This is one of

More information

1.3 Graphical Summaries of Data

1.3 Graphical Summaries of Data Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan 1.3 Graphical Summaries of Data In the previous section we discussed numerical summaries of either a sample or a data. In this

More information

DAY 52 BOX-AND-WHISKER

DAY 52 BOX-AND-WHISKER DAY 52 BOX-AND-WHISKER VOCABULARY The Median is the middle number of a set of data when the numbers are arranged in numerical order. The Range of a set of data is the difference between the highest and

More information

Averages and Variation

Averages and Variation Averages and Variation 3 Copyright Cengage Learning. All rights reserved. 3.1-1 Section 3.1 Measures of Central Tendency: Mode, Median, and Mean Copyright Cengage Learning. All rights reserved. 3.1-2 Focus

More information

Box Plots. OpenStax College

Box Plots. OpenStax College Connexions module: m46920 1 Box Plots OpenStax College This work is produced by The Connexions Project and licensed under the Creative Commons Attribution License 3.0 Box plots (also called box-and-whisker

More information

Mean,Median, Mode Teacher Twins 2015

Mean,Median, Mode Teacher Twins 2015 Mean,Median, Mode Teacher Twins 2015 Warm Up How can you change the non-statistical question below to make it a statistical question? How many pets do you have? Possible answer: What is your favorite type

More information

AP Statistics Summer Assignment:

AP Statistics Summer Assignment: AP Statistics Summer Assignment: Read the following and use the information to help answer your summer assignment questions. You will be responsible for knowing all of the information contained in this

More information

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures STA 2023 Module 3 Descriptive Measures Learning Objectives Upon completing this module, you should be able to: 1. Explain the purpose of a measure of center. 2. Obtain and interpret the mean, median, and

More information

UNIT 1A EXPLORING UNIVARIATE DATA

UNIT 1A EXPLORING UNIVARIATE DATA A.P. STATISTICS E. Villarreal Lincoln HS Math Department UNIT 1A EXPLORING UNIVARIATE DATA LESSON 1: TYPES OF DATA Here is a list of important terms that we must understand as we begin our study of statistics

More information

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data Chapter 2 Descriptive Statistics: Organizing, Displaying and Summarizing Data Objectives Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically

More information

Understanding Statistical Questions

Understanding Statistical Questions Unit 6: Statistics Standards, Checklist and Concept Map Common Core Georgia Performance Standards (CCGPS): MCC6.SP.1: Recognize a statistical question as one that anticipates variability in the data related

More information

Section 9: One Variable Statistics

Section 9: One Variable Statistics The following Mathematics Florida Standards will be covered in this section: MAFS.912.S-ID.1.1 MAFS.912.S-ID.1.2 MAFS.912.S-ID.1.3 Represent data with plots on the real number line (dot plots, histograms,

More information

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable. 5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table

More information

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. 1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed

More information

M7D1.a: Formulate questions and collect data from a census of at least 30 objects and from samples of varying sizes.

M7D1.a: Formulate questions and collect data from a census of at least 30 objects and from samples of varying sizes. M7D1.a: Formulate questions and collect data from a census of at least 30 objects and from samples of varying sizes. Population: Census: Biased: Sample: The entire group of objects or individuals considered

More information

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. + What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and

More information

No. of blue jelly beans No. of bags

No. of blue jelly beans No. of bags Math 167 Ch5 Review 1 (c) Janice Epstein CHAPTER 5 EXPLORING DATA DISTRIBUTIONS A sample of jelly bean bags is chosen and the number of blue jelly beans in each bag is counted. The results are shown in

More information

Middle Years Data Analysis Display Methods

Middle Years Data Analysis Display Methods Middle Years Data Analysis Display Methods Double Bar Graph A double bar graph is an extension of a single bar graph. Any bar graph involves categories and counts of the number of people or things (frequency)

More information

2.1: Frequency Distributions and Their Graphs

2.1: Frequency Distributions and Their Graphs 2.1: Frequency Distributions and Their Graphs Frequency Distribution - way to display data that has many entries - table that shows classes or intervals of data entries and the number of entries in each

More information

Descriptive Statistics: Box Plot

Descriptive Statistics: Box Plot Connexions module: m16296 1 Descriptive Statistics: Box Plot Susan Dean Barbara Illowsky, Ph.D. This work is produced by The Connexions Project and licensed under the Creative Commons Attribution License

More information

Chapter 1. Looking at Data-Distribution

Chapter 1. Looking at Data-Distribution Chapter 1. Looking at Data-Distribution Statistics is the scientific discipline that provides methods to draw right conclusions: 1)Collecting the data 2)Describing the data 3)Drawing the conclusions Raw

More information

Using a percent or a letter grade allows us a very easy way to analyze our performance. Not a big deal, just something we do regularly.

Using a percent or a letter grade allows us a very easy way to analyze our performance. Not a big deal, just something we do regularly. GRAPHING We have used statistics all our lives, what we intend to do now is formalize that knowledge. Statistics can best be defined as a collection and analysis of numerical information. Often times we

More information

How individual data points are positioned within a data set.

How individual data points are positioned within a data set. Section 3.4 Measures of Position Percentiles How individual data points are positioned within a data set. P k is the value such that k% of a data set is less than or equal to P k. For example if we said

More information

Processing, representing and interpreting data

Processing, representing and interpreting data Processing, representing and interpreting data 21 CHAPTER 2.1 A head CHAPTER 17 21.1 polygons A diagram can be drawn from grouped discrete data. A diagram looks the same as a bar chart except that the

More information

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015 MAT 142 College Mathematics Statistics Module ST Terri Miller revised July 14, 2015 2 Statistics Data Organization and Visualization Basic Terms. A population is the set of all objects under study, a sample

More information

MAT 110 WORKSHOP. Updated Fall 2018

MAT 110 WORKSHOP. Updated Fall 2018 MAT 110 WORKSHOP Updated Fall 2018 UNIT 3: STATISTICS Introduction Choosing a Sample Simple Random Sample: a set of individuals from the population chosen in a way that every individual has an equal chance

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA

NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA Statistics is concerned with scientific methods of collecting, recording, organising, summarising, presenting and analysing data from which future

More information

Frequency Distributions

Frequency Distributions Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,

More information

Chapter 3. Descriptive Measures. Slide 3-2. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 3. Descriptive Measures. Slide 3-2. Copyright 2012, 2008, 2005 Pearson Education, Inc. Chapter 3 Descriptive Measures Slide 3-2 Section 3.1 Measures of Center Slide 3-3 Definition 3.1 Mean of a Data Set The mean of a data set is the sum of the observations divided by the number of observations.

More information

Downloaded from

Downloaded from UNIT 2 WHAT IS STATISTICS? Researchers deal with a large amount of data and have to draw dependable conclusions on the basis of data collected for the purpose. Statistics help the researchers in making

More information

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016)

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016) CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1 Daphne Skipper, Augusta University (2016) 1. Stem-and-Leaf Graphs, Line Graphs, and Bar Graphs The distribution of data is

More information

TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS

TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS To Describe Data, consider: Symmetry Skewness TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS Unimodal or bimodal or uniform Extreme values Range of Values and mid-range Most frequently occurring values In

More information

Chapter 6: DESCRIPTIVE STATISTICS

Chapter 6: DESCRIPTIVE STATISTICS Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling

More information

Homework Packet Week #3

Homework Packet Week #3 Lesson 8.1 Choose the term that best completes statements # 1-12. 10. A data distribution is if the peak of the data is in the middle of the graph. The left and right sides of the graph are nearly mirror

More information

1.2. Pictorial and Tabular Methods in Descriptive Statistics

1.2. Pictorial and Tabular Methods in Descriptive Statistics 1.2. Pictorial and Tabular Methods in Descriptive Statistics Section Objectives. 1. Stem-and-Leaf displays. 2. Dotplots. 3. Histogram. Types of histogram shapes. Common notation. Sample size n : the number

More information

Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1

Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1 Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1 KEY SKILLS: Organize a data set into a frequency distribution. Construct a histogram to summarize a data set. Compute the percentile for a particular

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

Numerical Descriptive Measures

Numerical Descriptive Measures Chapter 3 Numerical Descriptive Measures 1 Numerical Descriptive Measures Chapter 3 Measures of Central Tendency and Measures of Dispersion A sample of 40 students at a university was randomly selected,

More information

Measures of Position

Measures of Position Measures of Position In this section, we will learn to use fractiles. Fractiles are numbers that partition, or divide, an ordered data set into equal parts (each part has the same number of data entries).

More information

15 Wyner Statistics Fall 2013

15 Wyner Statistics Fall 2013 15 Wyner Statistics Fall 2013 CHAPTER THREE: CENTRAL TENDENCY AND VARIATION Summary, Terms, and Objectives The two most important aspects of a numerical data set are its central tendencies and its variation.

More information

Day 4 Percentiles and Box and Whisker.notebook. April 20, 2018

Day 4 Percentiles and Box and Whisker.notebook. April 20, 2018 Day 4 Box & Whisker Plots and Percentiles In a previous lesson, we learned that the median divides a set a data into 2 equal parts. Sometimes it is necessary to divide the data into smaller more precise

More information

Learning Log Title: CHAPTER 7: PROPORTIONS AND PERCENTS. Date: Lesson: Chapter 7: Proportions and Percents

Learning Log Title: CHAPTER 7: PROPORTIONS AND PERCENTS. Date: Lesson: Chapter 7: Proportions and Percents Chapter 7: Proportions and Percents CHAPTER 7: PROPORTIONS AND PERCENTS Date: Lesson: Learning Log Title: Date: Lesson: Learning Log Title: Chapter 7: Proportions and Percents Date: Lesson: Learning Log

More information

Measures of Dispersion

Measures of Dispersion Lesson 7.6 Objectives Find the variance of a set of data. Calculate standard deviation for a set of data. Read data from a normal curve. Estimate the area under a curve. Variance Measures of Dispersion

More information

Unit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys

Unit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys Unit 7 Statistics AFM Mrs. Valentine 7.1 Samples and Surveys v Obj.: I will understand the different methods of sampling and studying data. I will be able to determine the type used in an example, and

More information

Chpt 3. Data Description. 3-2 Measures of Central Tendency /40

Chpt 3. Data Description. 3-2 Measures of Central Tendency /40 Chpt 3 Data Description 3-2 Measures of Central Tendency 1 /40 Chpt 3 Homework 3-2 Read pages 96-109 p109 Applying the Concepts p110 1, 8, 11, 15, 27, 33 2 /40 Chpt 3 3.2 Objectives l Summarize data using

More information

Date Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6.

Date Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6. UNIT 6 ONE VARIABLE STATISTICS Date Lesson TOPIC HOMEWORK 6.1 3.3 6.2 3.4 Displaying Data WS 6.1 Measures of Central Tendency WS 6.2 6.3 6.4 3.5 6.5 3.5 Grouped Data Central Tendency Measures of Spread

More information

Exploratory Data Analysis

Exploratory Data Analysis Chapter 10 Exploratory Data Analysis Definition of Exploratory Data Analysis (page 410) Definition 12.1. Exploratory data analysis (EDA) is a subfield of applied statistics that is concerned with the investigation

More information

The Normal Distribution

The Normal Distribution 14-4 OBJECTIVES Use the normal distribution curve. The Normal Distribution TESTING The class of 1996 was the first class to take the adjusted Scholastic Assessment Test. The test was adjusted so that the

More information

Basic Statistical Terms and Definitions

Basic Statistical Terms and Definitions I. Basics Basic Statistical Terms and Definitions Statistics is a collection of methods for planning experiments, and obtaining data. The data is then organized and summarized so that professionals can

More information

Lecture Notes 3: Data summarization

Lecture Notes 3: Data summarization Lecture Notes 3: Data summarization Highlights: Average Median Quartiles 5-number summary (and relation to boxplots) Outliers Range & IQR Variance and standard deviation Determining shape using mean &

More information

6th Grade Vocabulary Mathematics Unit 2

6th Grade Vocabulary Mathematics Unit 2 6 th GRADE UNIT 2 6th Grade Vocabulary Mathematics Unit 2 VOCABULARY area triangle right triangle equilateral triangle isosceles triangle scalene triangle quadrilaterals polygons irregular polygons rectangles

More information

WHOLE NUMBER AND DECIMAL OPERATIONS

WHOLE NUMBER AND DECIMAL OPERATIONS WHOLE NUMBER AND DECIMAL OPERATIONS Whole Number Place Value : 5,854,902 = Ten thousands thousands millions Hundred thousands Ten thousands Adding & Subtracting Decimals : Line up the decimals vertically.

More information

Round each observation to the nearest tenth of a cent and draw a stem and leaf plot.

Round each observation to the nearest tenth of a cent and draw a stem and leaf plot. Warm Up Round each observation to the nearest tenth of a cent and draw a stem and leaf plot. 1. Constructing Frequency Polygons 2. Create Cumulative Frequency and Cumulative Relative Frequency Tables 3.

More information

Measures of Dispersion

Measures of Dispersion Measures of Dispersion 6-3 I Will... Find measures of dispersion of sets of data. Find standard deviation and analyze normal distribution. Day 1: Dispersion Vocabulary Measures of Variation (Dispersion

More information

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student Organizing data Learning Outcome 1. make an array 2. divide the array into class intervals 3. describe the characteristics of a table 4. construct a frequency distribution table 5. constructing a composite

More information

Chapter 5snow year.notebook March 15, 2018

Chapter 5snow year.notebook March 15, 2018 Chapter 5: Statistical Reasoning Section 5.1 Exploring Data Measures of central tendency (Mean, Median and Mode) attempt to describe a set of data by identifying the central position within a set of data

More information

Section 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc

Section 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc Section 2-2 Frequency Distributions Copyright 2010, 2007, 2004 Pearson Education, Inc. 2.1-1 Frequency Distribution Frequency Distribution (or Frequency Table) It shows how a data set is partitioned among

More information

The main issue is that the mean and standard deviations are not accurate and should not be used in the analysis. Then what statistics should we use?

The main issue is that the mean and standard deviations are not accurate and should not be used in the analysis. Then what statistics should we use? Chapter 4 Analyzing Skewed Quantitative Data Introduction: In chapter 3, we focused on analyzing bell shaped (normal) data, but many data sets are not bell shaped. How do we analyze quantitative data when

More information

Section 6.3: Measures of Position

Section 6.3: Measures of Position Section 6.3: Measures of Position Measures of position are numbers showing the location of data values relative to the other values within a data set. They can be used to compare values from different

More information

CHAPTER 3: Data Description

CHAPTER 3: Data Description CHAPTER 3: Data Description You ve tabulated and made pretty pictures. Now what numbers do you use to summarize your data? Ch3: Data Description Santorico Page 68 You ll find a link on our website to a

More information

Chapter 2 Organizing and Graphing Data. 2.1 Organizing and Graphing Qualitative Data

Chapter 2 Organizing and Graphing Data. 2.1 Organizing and Graphing Qualitative Data Chapter 2 Organizing and Graphing Data 2.1 Organizing and Graphing Qualitative Data 2.2 Organizing and Graphing Quantitative Data 2.3 Stem-and-leaf Displays 2.4 Dotplots 2.1 Organizing and Graphing Qualitative

More information

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or  me, I will answer promptly. Statistical Methods Instructor: Lingsong Zhang 1 Issues before Class Statistical Methods Lingsong Zhang Office: Math 544 Email: lingsong@purdue.edu Phone: 765-494-7913 Office Hour: Monday 1:00 pm - 2:00

More information

CHAPTER-13. Mining Class Comparisons: Discrimination between DifferentClasses: 13.4 Class Description: Presentation of Both Characterization and

CHAPTER-13. Mining Class Comparisons: Discrimination between DifferentClasses: 13.4 Class Description: Presentation of Both Characterization and CHAPTER-13 Mining Class Comparisons: Discrimination between DifferentClasses: 13.1 Introduction 13.2 Class Comparison Methods and Implementation 13.3 Presentation of Class Comparison Descriptions 13.4

More information

MATH& 146 Lesson 10. Section 1.6 Graphing Numerical Data

MATH& 146 Lesson 10. Section 1.6 Graphing Numerical Data MATH& 146 Lesson 10 Section 1.6 Graphing Numerical Data 1 Graphs of Numerical Data One major reason for constructing a graph of numerical data is to display its distribution, or the pattern of variability

More information

8 Organizing and Displaying

8 Organizing and Displaying CHAPTER 8 Organizing and Displaying Data for Comparison Chapter Outline 8.1 BASIC GRAPH TYPES 8.2 DOUBLE LINE GRAPHS 8.3 TWO-SIDED STEM-AND-LEAF PLOTS 8.4 DOUBLE BAR GRAPHS 8.5 DOUBLE BOX-AND-WHISKER PLOTS

More information

Section 3.2 Measures of Central Tendency MDM4U Jensen

Section 3.2 Measures of Central Tendency MDM4U Jensen Section 3.2 Measures of Central Tendency MDM4U Jensen Part 1: Video This video will review shape of distributions and introduce measures of central tendency. Answer the following questions while watching.

More information

Chapter 2 - Graphical Summaries of Data

Chapter 2 - Graphical Summaries of Data Chapter 2 - Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense

More information

LESSON 3: CENTRAL TENDENCY

LESSON 3: CENTRAL TENDENCY LESSON 3: CENTRAL TENDENCY Outline Arithmetic mean, median and mode Ungrouped data Grouped data Percentiles, fractiles, and quartiles Ungrouped data Grouped data 1 MEAN Mean is defined as follows: Sum

More information

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form. CHAPTER 2 Frequency Distributions and Graphs Objectives Organize data using frequency distributions. Represent data in frequency distributions graphically using histograms, frequency polygons, and ogives.

More information

Numerical Summaries of Data Section 14.3

Numerical Summaries of Data Section 14.3 MATH 11008: Numerical Summaries of Data Section 14.3 MEAN mean: The mean (or average) of a set of numbers is computed by determining the sum of all the numbers and dividing by the total number of observations.

More information

STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I. 4 th Nine Weeks,

STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I. 4 th Nine Weeks, STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I 4 th Nine Weeks, 2016-2017 1 OVERVIEW Algebra I Content Review Notes are designed by the High School Mathematics Steering Committee as a resource for

More information

Descriptive Statistics

Descriptive Statistics Chapter 2 Descriptive Statistics 2.1 Descriptive Statistics 1 2.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Display data graphically and interpret graphs:

More information

Measures of Central Tendency

Measures of Central Tendency Measures of Central Tendency MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Introduction Measures of central tendency are designed to provide one number which

More information

Create a bar graph that displays the data from the frequency table in Example 1. See the examples on p Does our graph look different?

Create a bar graph that displays the data from the frequency table in Example 1. See the examples on p Does our graph look different? A frequency table is a table with two columns, one for the categories and another for the number of times each category occurs. See Example 1 on p. 247. Create a bar graph that displays the data from the

More information

Bar Graphs and Dot Plots

Bar Graphs and Dot Plots CONDENSED LESSON 1.1 Bar Graphs and Dot Plots In this lesson you will interpret and create a variety of graphs find some summary values for a data set draw conclusions about a data set based on graphs

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Library, Teaching & Learning 014 Summary of Basic data Analysis DATA Qualitative Quantitative Counted Measured Discrete Continuous 3 Main Measures of Interest Central Tendency Dispersion

More information

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis. 1.3 Density curves p50 Some times the overall pattern of a large number of observations is so regular that we can describe it by a smooth curve. It is easier to work with a smooth curve, because the histogram

More information

Chapter 3 Analyzing Normal Quantitative Data

Chapter 3 Analyzing Normal Quantitative Data Chapter 3 Analyzing Normal Quantitative Data Introduction: In chapters 1 and 2, we focused on analyzing categorical data and exploring relationships between categorical data sets. We will now be doing

More information

MATH 112 Section 7.2: Measuring Distribution, Center, and Spread

MATH 112 Section 7.2: Measuring Distribution, Center, and Spread MATH 112 Section 7.2: Measuring Distribution, Center, and Spread Prof. Jonathan Duncan Walla Walla College Fall Quarter, 2006 Outline 1 Measures of Center The Arithmetic Mean The Geometric Mean The Median

More information

The first few questions on this worksheet will deal with measures of central tendency. These data types tell us where the center of the data set lies.

The first few questions on this worksheet will deal with measures of central tendency. These data types tell us where the center of the data set lies. Instructions: You are given the following data below these instructions. Your client (Courtney) wants you to statistically analyze the data to help her reach conclusions about how well she is teaching.

More information

Name Geometry Intro to Stats. Find the mean, median, and mode of the data set. 1. 1,6,3,9,6,8,4,4,4. Mean = Median = Mode = 2.

Name Geometry Intro to Stats. Find the mean, median, and mode of the data set. 1. 1,6,3,9,6,8,4,4,4. Mean = Median = Mode = 2. Name Geometry Intro to Stats Statistics are numerical values used to summarize and compare sets of data. Two important types of statistics are measures of central tendency and measures of dispersion. A

More information