CHAPTER 2. Objectives. Frequency Distributions and Graphs. Basic Vocabulary. Introduction. Organise data using frequency distributions.

Similar documents
This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

Spell out your full name (first, middle and last)

Chapter 2. Frequency Distributions and Graphs. Bluman, Chapter 2

Frequency Distributions and Graphs

Courtesy :

Organizing Data. Class limits (in miles) Tally Frequency Total 50

2.1: Frequency Distributions

Frequency distribution

MATH 117 Statistical Methods for Management I Chapter Two

Chapter 2 - Frequency Distributions and Graphs

Section 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc

BUSINESS DECISION MAKING. Topic 1 Introduction to Statistical Thinking and Business Decision Making Process; Data Collection and Presentation

Chapter 2 Organizing and Graphing Data. 2.1 Organizing and Graphing Qualitative Data

Overview. Frequency Distributions. Chapter 2 Summarizing & Graphing Data. Descriptive Statistics. Inferential Statistics. Frequency Distribution

MATH1635, Statistics (2)

Graphical Presentation for Statistical Data (Relevant to AAT Examination Paper 4: Business Economics and Financial Mathematics) Introduction

Downloaded from

Raw Data is data before it has been arranged in a useful manner or analyzed using statistical techniques.

Elementary Statistics

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Test Bank for Privitera, Statistics for the Behavioral Sciences

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

B. Graphing Representation of Data

Chapter 2 Describing, Exploring, and Comparing Data

Chapter 2 - Graphical Summaries of Data

Business Statistics 10th Edition Groebner SOLUTIONS MANUAL

2.3 Organizing Quantitative Data

2.1: Frequency Distributions and Their Graphs

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Section 2-2. Histograms, frequency polygons and ogives. Friday, January 25, 13

- 1 - Class Intervals

Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation

Round each observation to the nearest tenth of a cent and draw a stem and leaf plot.

Elementary Statistics. Organizing Raw Data

Prob and Stats, Sep 4

Raw Data. Statistics 1/8/2016. Relative Frequency Distribution. Frequency Distributions for Qualitative Data

Math Tech IIII, Sep 14

JUST THE MATHS UNIT NUMBER STATISTICS 1 (The presentation of data) A.J.Hobson

+ Statistical Methods in

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Teaching Tips for Each Chapter

Math 227 EXCEL / MEGASTAT Guide

Chapter 2: Graphical Summaries of Data 2.1 Graphical Summaries for Qualitative Data. Frequency: Frequency distribution:

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

2. The histogram. class limits class boundaries frequency cumulative frequency

MATH 1070 Introductory Statistics Lecture notes Descriptive Statistics and Graphical Representation

1.2. Pictorial and Tabular Methods in Descriptive Statistics

Organizing and Summarizing Data

Making Science Graphs and Interpreting Data

Chpt 2. Frequency Distributions and Graphs. 2-4 Pareto chart, time series graph, Pie chart / 35

Using a percent or a letter grade allows us a very easy way to analyze our performance. Not a big deal, just something we do regularly.

Chapter Two: Descriptive Methods 1/50

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

LESSON 3: CENTRAL TENDENCY

Name Date Types of Graphs and Creating Graphs Notes

STP 226 ELEMENTARY STATISTICS NOTES

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

Chapter 2: Understanding Data Distributions with Tables and Graphs

NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA

Averages and Variation

TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS

AND NUMERICAL SUMMARIES. Chapter 2

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Unit: 1: Number version 1 Key Stage 4 Foundation

aqa GCSE MATHS EXAMINATIoN BoARd CoVERAGE

Summary Of Topics covered in Year 7. Topic All pupils should Most pupils should Some pupils should Learn formal methods for

Ch. 1.4 Histograms & Stem-&-Leaf Plots

Measures of Central Tendency

28 CHAPTER 2 Summarizing and Graphing Data

Statistics can best be defined as a collection and analysis of numerical information.

Parents Names Mom Cell/Work # Dad Cell/Work # Parent List the Math Courses you have taken and the grade you received 1 st 2 nd 3 rd 4th

Mathematics Year 9-11 Skills and Knowledge Checklist. Name: Class: Set : 4 Date Year 9 MEG :

CHAPTER 2 DESCRIPTIVE STATISTICS

Decimals should be spoken digit by digit eg 0.34 is Zero (or nought) point three four (NOT thirty four).

Things you ll know (or know better to watch out for!) when you leave in December: 1. What you can and cannot infer from graphs.

Table of Contents (As covered from textbook)

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

CHAPTER 2: SAMPLING AND DATA

download instant at Summarizing Data: Listing and Grouping

MATH NATION SECTION 9 H.M.H. RESOURCES

NUMBER 1 ALGEBRA 1 AUTUMN TERM YEAR 7

Frequency Distributions

The variable: scores on a 60 question exam for 20 students 50, 46, 58, 49, 50, 57, 49, 48, 53, 45, 50, 55, 43, 49, 46, 48, 44, 56, 57, 44

Maths Class 9 Notes for Statistics

a. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Curriculum Area: Mathematics Year: 10 Foundation. Aspire Learn Achieve. 1 Number All students have access to:

2.4-Statistical Graphs

KS3 MATHEMATICS THRESHOLD DESCRIPTORS NUMBER (Incl. RATIO & PROPORTION)

ORGANIZING THE DATA IN A FREQUENCY TABLE

Lesson 18-1 Lesson Lesson 18-1 Lesson Lesson 18-2 Lesson 18-2

Part I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures

Using the Best of Both!

DIOCESAN COLLEGE PREPARATORY SCHOOL Mathematics Syllabus: Grade 7

TYPES OF NUMBER P1 P2 P3 Learning Objective Understand place value in large numbers Add and subtract large numbers (up to 3 digits) Multiply and

Organisation and Presentation of Data in Medical Research Dr K Saji.MD(Hom)

DAY 52 BOX-AND-WHISKER

Maths Revision Worksheet: Algebra I Week 1 Revision 5 Problems per night

I can solve simultaneous equations algebraically and graphically. I can solve inequalities algebraically and graphically.

Transcription:

CHAPTER 2 Objectives Organise data using frequency distributions. Distributions and Graphs Represent data in frequency distributions graphically using histograms, frequency polygons, and ogives. Represent data using Pareto charts, time series graphs, and pie graphs. Draw and interpret a stem and leaf plot. 2-1 2-2 Introduction Basic Vocabulary This chapter will show how to organise data by constructing frequency distributions and then construct appropriate graphs to represent the data in a concise, easy-tounderstand form. When data are collected in original form, they are called raw data. A frequency distribution is the organisation of raw data in table form, using classes and frequencies. The two most common distributions are categorical frequency distribution and the grouped frequency distribution. 2-3 2-4

An Example An Example (cont d) Suppose a researcher wished to do a study on the number of miles that the employees of a large department store travelled to work each day. The researcher first would have to collect the data by asking each employee the approximate distance the store is from his or her home. The data are: 1 2 6 7 12 13 2 6 9 5 18 7 3 15 15 4 17 1 14 5 4 16 4 5 8 6 5 18 5 2 9 11 12 1 9 2 10 11 4 10 9 18 8 8 4 14 7 3 2 6 It is very hard for us to tell the information from such a table full of raw data. Therefore, the researcher organises the data by constructing a frequency distribution. The frequency is the number of values in a specific class of the distribution. Now some general observations can be obtained from looking at the data in the form of a frequency distribution. For example, the majority of employees live within 9 miles of the store. Reminder: A frequency distribution is the organisation of raw data in table form, using classes and frequencies. 2-5 2-6 Categorical Distributions An example The categorical frequency distribution is used for data that can be placed in specific categories, such as nominal- or ordinal- data. For example, data such as political affiliation, religious affiliation, or studying field would use categorical frequency distributions. Twenty-five army officers were given a blood test to determine their blood type. The data set is: A B B AB O O O B AB B B B O A O A O O O AB AB A O B A Its frequency distribution is: Class Percent A 5 20 B 7 28 O 9 36 AB 4 16 Total 25 100 2-7 2-8

Grouped Distributions Grouped Distributions (cont d.) When the range of the data is large, the data must be grouped into classes that are more than one unit in width. Class limits (in miles) Class boundaries Cumulative 24-30 23.5-30.5 3 3 31-37 30.5-37.5 1 4 38-44 37.5-44.5 5 9 45-51 44.5-51.5 9 18 52-58 51.5-58.5 6 24 59-65 58.5-65.5 1 25 25 The lower class limit represents the smallest data value that can be included in the class. The upper class limit represents the largest value that can be included in the class. In the distribution shown in the last slide, taking the first class as an example, the values 24 and 30 are the class limits. The lower class limit is 24. The upper class limit is 30. 2-9 2-10 Grouped Distributions (cont d.) Class Boundaries Significant Figures The class boundaries are used to separate the classes so that there are no gaps in the frequency distribution. Rule of Thumb: Class limits should have the same decimal place value as the data, but the class boundaries have one additional place value and end in a 5. The numbers in the second column are the class boundaries in the previous example. As one can see, for example, there is a gap between 30 and 31. The number 30.6 is grouped into the class 31-37. 2-11 2-12

Finding Class Boundaries Class Rules The class width for a class in a frequency distribution is found by subtracting the lower (or upper) class limit of one class from the the lower (or upper) class limit of the next class. (Do NOT subtract the limits of a single class!) The class midpoint is found by adding the lower and upper boundaries (or limits) and dividing by 2. There should be between 5 and 20 classes. The class width should be an odd number. The classes must be mutually exclusive. The classes must be continuous. The classes must be exhaustive. The classes must be equal width. 2-13 2-14 An Example An Example (cont d) The data shown here represent the number of miles per gallon that 30 selected four-wheel-drive sports utility vehicles obtained in city driving. Construct a frequency distribution. 12 17 12 14 16 18 16 18 12 16 17 15 15 16 12 15 16 16 12 14 15 12 15 15 19 13 16 18 16 14 Solution: Class limits (in miles) Class boundaries Frequenc y Cumulative 12 11.5-12.5 6 6 13 12.5-13.5 1 7 14 13.5-14.5 3 10 15 14.5-15.5 6 16 16 15.5-16.5 8 24 17 16.5-17.5 2 26 18 17.5-18.5 3 29 19 18.5-19.5 1 30 2-15 2-16

An Example (cont d) Types of Distributions Procedure Table Constructing a Grouped Distribution Step 1 Determine the classes. Find the highest and lowest value. Find the range. Select the number of classes desired. Find the width by dividing the range by the number of classes and rounding up. Select a starting point (usually the lowest value or any convenient number less than the lowest value); add the width to get the lower limits. Find the upper class limits. Find the boundaries. Step 2 Find the numerical frequencies. A categorical frequency distribution is used when the data is nominal. A grouped frequency distribution is used when the range is large and classes of several units in width are needed. An ungrouped frequency distribution is used for numerical data and when the range of data is small. Step 3 Find the cumulative frequencies. 2-17 2-18 Why Construct Distributions? The Role of Graphs To organise the data in a meaningful, intelligible way. To enable the reader to make comparisons among different data sets. To facilitate computational procedures for measures of average and spread. The purpose of graphs in statistics is to convey the data to the viewer in pictorial form. Graphs are useful in getting the audience s attention in a publication or a presentation. To enable the reader to determine the nature or shape of the distribution. To enable the researcher to draw charts and graphs for the presentation of data. 2-19 2-20

Three Most Common Graphs Three Most Common Graphs (cont d.) The histogram displays the data by using vertical bars of various heights to represent the frequencies. The frequency polygon displays the data by using lines that connect points plotted for the frequencies at the midpoints of the classes. 8 6 4 2 0 0 10.5 20.5 30.5 40.5 50.5 60.5 70.5 80.5 Class Boundaries 8 7 6 5 4 3 2 1 0 0.5 10.5 20.5 30.5 40.5 50.5 60.5 70.5 80.5 Class Midpoints 2-21 2-22 Three Most Common Graphs (cont d.) Relative Graphs The cumulative frequency or ogive represents the cumulative frequencies for the classes in a frequency distribution. Cumulative 30 20 10 0 0 10.5 21 31.5 42 52.5 63 73.5 84 94.5 Class Boundaries A relative frequency graph is a graph that uses proportions instead of frequencies. Relative frequencies are used when the proportion of data values that fall into a given class is more important than the frequency. 2-23 2-24

Other Types of Graphs Other Types of Graphs (cont d.) A Pareto chart is used A time series graph represents data that occur to represent a frequency distribution for categorical variable, and the frequencies are displayed by the heights of vertical 30 20 10 0 How People Get to Work Auto Bus Trolley Train Walk over a specific period of time. Temp. Temperature Over a 5-hour Period 55 50 45 40 35 bars, which are arranged in order from 12 1 2 3 4 5 Time highest to lowest. 2-25 2-26 Other Types of Graphs (cont d.) Stem-and-Leaf Plots A pie graph is a circle that is divided into sections or wedges according to the percentage of frequencies in each category of the distribution. Favorite American Snacks snack nuts 8% popcorn 13% pretzels 14% tortilla chips 27% potato chips 38% A stem-and-leaf plot is a data plot that uses part of a data value as the stem and part of the data value as the leaf to form groups or classes. It has the advantage over grouped frequency distribution of retaining the actual data while showing them in graphic form. 2-27 2-28

Stem-and-Leaf Plots An Example Stem-and-Leaf diagrams were originally developed by John Tukey of Princeton University. They are extremely useful in summarising reasonably sized data (under 150 values as a general rule) and, unlike histograms, result in no loss of information. By this we mean it is possible to retrieve the original data set from a stem-and-leaf diagram, which is not the case when using a histogram some of the information in the original data is lost when a histogram is constructed, although histograms provide the best alternative when attempting to summarise a large set of sample data. Suppose a study reports the after-tax profits of 12 selected companies. The profits (recorded as cents per dollar of revenue) are as follows 3.4 4.5 2.3 2.7 3.8 5.9 3.4 4.7 2.4 4.1 3.6 5.1 The stem-and-leaf diagram for these data is shown below: Stem Leaf (unit =.1) 2 3 4 7 3 4 4 6 8 4 1 5 7 5 1 9 2-29 2-30 An Example (cont d) An Example (cont d) Each observation is represented by a stem to the left of the vertical line and a leaf to the right of the vertical line. For example the stems and leaves for the first and last observation would be be represented as 5 62 or as 56 2 in a stem-and-leaf diagram, depending on the range of the sample data. If the diagram is rotated counterclockwise, it has the appearance of a histogram and clearly describes the shape of the sample data. Stem Leaf (unit =.1) Stem Leaf (unit =.1) 3 4 5 1 In a stem-and-leaf diagram, the stem are put in order to the left of the vertical line. The leaf for each observation is generally the last digit (or possibly the last two digits) of the data value, with the stem consisting of the remaining first digits. The leaf values are generally arranged in ascending order. The value 562 could 2-31 2-32

Summary of Graphs and Uses Histograms, frequency polygons, and ogives are used when the data are contained in a grouped frequency distribution. Pareto charts are used to show frequencies for nominal variables. Summary of Graphs and Uses (cont d.) Time series graphs are used to show a pattern or trend that occurs over time. Pie graphs are used to show the relationship between the parts and the whole. 2-33 2-34 Conclusions Data can be organized in some meaningful way using frequency distributions. Once the frequency distribution is constructed, the representation of the data by graphs is a simple task. 2-35