STP 226 ELEMENTARY STATISTICS NOTES

Similar documents
Section 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc

Section 1.2. Displaying Quantitative Data with Graphs. Mrs. Daniel AP Stats 8/22/2013. Dotplots. How to Make a Dotplot. Mrs. Daniel AP Statistics

Chapter 2 - Graphical Summaries of Data

Overview. Frequency Distributions. Chapter 2 Summarizing & Graphing Data. Descriptive Statistics. Inferential Statistics. Frequency Distribution

Organizing and Summarizing Data

AND NUMERICAL SUMMARIES. Chapter 2

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

2.3 Organizing Quantitative Data

Chapter 2: Graphical Summaries of Data 2.1 Graphical Summaries for Qualitative Data. Frequency: Frequency distribution:

Name Date Types of Graphs and Creating Graphs Notes

TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS

Chapter 3 - Displaying and Summarizing Quantitative Data

12. A(n) is the number of times an item or number occurs in a data set.

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Frequency Distributions

2.1: Frequency Distributions

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Chapter 2 Describing, Exploring, and Comparing Data

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

Raw Data is data before it has been arranged in a useful manner or analyzed using statistical techniques.

No. of blue jelly beans No. of bags

STAT STATISTICAL METHODS. Statistics: The science of using data to make decisions and draw conclusions

+ Statistical Methods in

AP Statistics Summer Assignment:

Chapter2 Description of samples and populations. 2.1 Introduction.

Chapter 1. Looking at Data-Distribution

Chapter 6: DESCRIPTIVE STATISTICS

2.1: Frequency Distributions and Their Graphs

Chapter 2 Organizing and Graphing Data. 2.1 Organizing and Graphing Qualitative Data

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.

Table of Contents (As covered from textbook)

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

MTH 3210: PROBABILITY AND STATISTICS DESCRIPTIVE STATISTICS WORKSHEET

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

Data organization. So what kind of data did we collect?

Displaying Distributions - Quantitative Variables

Test Bank for Privitera, Statistics for the Behavioral Sciences

MATH 117 Statistical Methods for Management I Chapter Two

UNIT 1A EXPLORING UNIVARIATE DATA

Basic Statistical Terms and Definitions

MATH& 146 Lesson 10. Section 1.6 Graphing Numerical Data

Lesson 18-1 Lesson Lesson 18-1 Lesson Lesson 18-2 Lesson 18-2

The variable: scores on a 60 question exam for 20 students 50, 46, 58, 49, 50, 57, 49, 48, 53, 45, 50, 55, 43, 49, 46, 48, 44, 56, 57, 44

Elementary Statistics

CHAPTER 2: SAMPLING AND DATA

Section 3.1 Shapes of Distributions MDM4U Jensen

Applied Statistics for the Behavioral Sciences

Univariate Statistics Summary

B. Graphing Representation of Data

Chapter Two: Descriptive Methods 1/50

Parents Names Mom Cell/Work # Dad Cell/Work # Parent List the Math Courses you have taken and the grade you received 1 st 2 nd 3 rd 4th

Summarising Data. Mark Lunt 09/10/2018. Arthritis Research UK Epidemiology Unit University of Manchester

Courtesy :

MATH1635, Statistics (2)

NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA

Vocabulary: Data Distributions

Chapter 2: Frequency Distributions

1.3 Graphical Summaries of Data

Visualizing Data: Freq. Tables, Histograms

Making Science Graphs and Interpreting Data

Chapter 2 - Descriptive Statistics

appstats6.notebook September 27, 2016

Density Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.

Chapter 2: Understanding Data Distributions with Tables and Graphs

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Chapter 2 Descriptive Statistics. Tabular and Graphical Presentations

Type of graph: Explain why you picked this type of graph. Temperature (C) of product formed per minute)

Chapter 3 Analyzing Normal Quantitative Data

Chapter 2: The Normal Distributions

MATH 1070 Introductory Statistics Lecture notes Descriptive Statistics and Graphical Representation

Exploratory Data Analysis

Understanding and Comparing Distributions. Chapter 4

3 Graphical Displays of Data

BUSINESS DECISION MAKING. Topic 1 Introduction to Statistical Thinking and Business Decision Making Process; Data Collection and Presentation

CHAPTER 2. Objectives. Frequency Distributions and Graphs. Basic Vocabulary. Introduction. Organise data using frequency distributions.

28 CHAPTER 2 Summarizing and Graphing Data

Unit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys

8 Organizing and Displaying

1.2. Pictorial and Tabular Methods in Descriptive Statistics

Sections Graphical Displays and Measures of Center. Brian Habing Department of Statistics University of South Carolina.

Week 2: Frequency distributions

Data Statistics Population. Census Sample Correlation... Statistical & Practical Significance. Qualitative Data Discrete Data Continuous Data

Date Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6.

Chapter 2: Descriptive Statistics

Slides Prepared by JOHN S. LOUCKS St. Edward s s University Thomson/South-Western. Slide

Chapter 2. Frequency distribution. Summarizing and Graphing Data

STA 218: Statistics for Management

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables

Chapter 2. Frequency Distributions and Graphs. Bluman, Chapter 2

Spell out your full name (first, middle and last)

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Measures of Central Tendency

MTH 1080, SPRING 2018 DESCRIPTIVE STATISTICS WORKSHEET

SOST 201 September 20, Stem-and-leaf display 2. Miscellaneous issues class limits, rounding, and interval width.

CHAPTER 3: Data Description

MATH11400 Statistics Homepage

Chapter 2 Descriptive Statistics I: Tabular and Graphical Presentations. Learning objectives

Transcription:

ELEMENTARY STATISTICS NOTES PART 2 - DESCRIPTIVE STATISTICS CHAPTER 2 ORGANIZING DATA Descriptive Statistics - include methods for organizing and summarizing information clearly and effectively. - classify data by type - organize into tables - summarize with graphs 2.1 Variables and Data Variable - a characteristic that varies from one person or thing to another Example - height, weight, # of siblings, sex, marital status, eye color Qualitative variable - a non-numerically valued variable Example - sex, marital status, eye color Quantitative variable - a numerically valued variable Example - height, weight, # of siblings Discrete variable - a quantitative variable whose possible values form a finite (or countably infinite) set of numbers. - counting variable Example - # of siblings Continuous variable - a quantitative variable whose possible values form some interval of numbers - measuring variable Example - height, weight 1

Variable Qualitative Quantitative Discrete Continuous Data - information obtained by observing values of a variable. Qualitative data - data obtained by observing values of a qualitative variable. Quantitative data - data obtained by observing values of a quantative variable Discrete data - data obtained by observing values of a discrete variable Continuous data - data obtained by observing values of a continuous variable Observation - individual piece of data, eg. 5 Data set - collection of all observations Example 2.1 pg.61 includes various examples of variables and data. Sex = qualitative variable with values male and female For women Names Rank Fatuma 1 st Renata 2 nd. discrete quantitative data Cindy 14 th Yumiko 15 th Names Time Moses 2:07:34 continuous quantitative data Fatuma 2:23:21 Other examples Blood types A, B, AB, O are qualitative data The number of people in a household is discrete quantitative data 2

The heights of waterfalls are continuous quantitative data It is important to classify data correctly since descriptive and inferential statistics uses only certain types of data. Even statisticians sometimes disagree on type of data. 2.2 Grouping Data As the quantity of data becomes large, grouping into categories/classes becomes necessary to make data more readable and understandable. Guidelines 1. # of classes small enough to provide effective summary but large enough to display relevant characteristics of the data (anywhere from 7-20 classes seem reasonable) 2. each observation must belong to one and only one, class 3. wherever feasible, all classes should have the same width Use of the Frequency Distribution (table with columns for classes and frequencies) to display the information Use of the Relative Frequency Distribution (table with columns for classes and relative frequencies) to display the information Classes - categories for grouping data Frequency - # of observations that fall in a class Frequency distribution - listing of all classes along with their frequencies Relative Frequency - ratio of the frequency of a class to the total # of observations - may involve round-off errors Relative Frequency distribution - listing of all classes along with their relative frequencies Lower cut point - smallest value that can go into a class Upper cut point - smallest value that can go into the next higher class UCP of any class = LCP of the next higher class Midpoint - middle of a class = average of the cut points = (LCP + UCP)/2 Width - UCP - LCP 3

Grouped-data table - table that includes the columns; classes, frequencies, relative frequencies, and midpoints. Single value grouping - one number per class Midpoint = value of the class Frequency/relative frequency distribution for qualitative data - information can still be put into classes - can compute frequencies and relative frequencies 2.3 Graphs and Charts Frequency histogram - frequency of each class is the height of the bar which is equal the frequency - displays the classes on the horizontal axis and the frequencies on the vertical axis. Relative frequency histogram - relative frequency of each class is the height of the bar which is equal the relative frequency - displays the classes on the horizontal axis and the relative frequencies on the vertical axis. Histograms for single value grouping - the single values are placed below the middle of the bars Dot Plots - dots are used to represent the data Graphical displays for qualitative data - Pie charts - Bar charts - similar to histograms, except that the bars do not touch each other since the categories are not on some numeric scale. They represent distinct categories. 2.4 Stem-and-leaf Diagrams (Stemplot) - Professor John Tukey (1960) Princeton University - Stem - leading digit(s) - leaves - other digit Stemplot gives quick picture of a distribution while including the actual numerical values in the graph 4

1. Separate each observation into a stem (has all but the last digit, can be 1, 2, or more digits) consisting of all but the final (rightmost) digit and a leaf (has only one digit), the final digit. 2. Write the stems in a vertical column with the smallest at the top, and draw a vertical line at the right of this column. 3. Write each leaf in the row to the right of its stem, in increasing order out from the stem. Back-to-back stemplot uses one stem and two sets of leaves, one on either side of the stem helps to make comparison between two data sets. The number of stems can be doubled by splitting the stem in two; one with leaves from 0 to 4 and the other with leaves 5 to 9. Good idea to round off numbers to only a few digits before trying to make a stemplot (lose some accuracy in measurements) 2.5 Distribution: Shapes, Symmetry, and Skewness. - a distribution of a data set is a table, graph, or formula that tells us the values of the observation and how often they occur Distribution Shapes Bell-shaped J shaped Reverse J-shaped Right skewed Multimodal Left skewed Triangular Uniform Bimodal Modality Unimodal - has one peak eg. Bell-shaped, Triangular, Reverse J-shaped, J-shaped, Right skewed, Left skewed Bimodal - has two peaks Multimodal - has 3 or more peaks Symmetry and Skewness technically, all peaks should Be same height, not so in practice Symmetry - property of a distribution to be divided into 2 parts that are mirror images of each other. Do not have to be exact in identifying symmetry. Eg. bell-shaped, triangular, uniform. 5

Non-symmetric Distribution - Reverse J-shaped, J-shaped, Right skewed, Left skewed The distribution of population data is called population distribution, or the distribution of the variable. The distribution of sample data is a sample distribution. The distribution of a random sample from a population approximates the population distribution, hence, larger samples are better. 2.6 Misleading Graphs Scales on the vertical axis should begin at 0 to compare the heights of the bars correctly. When the vertical scale does not begin at 0, it may be easier to see an up/down trend as you move from left to right, but one may easily make the error of comparing the heights of the bars. - not recommended even though they still appear in some reputable publications. Need to be careful to use and observe proper scaling so that one can get precise and consistent interpretation. If a truncated graph is to be used, use the symbol // to show that a portion of the graph is truncated. This will alert the audience to be careful in interpreting from the graph. 6