Statistical Graphics
|
|
- Susanna O’Brien’
- 6 years ago
- Views:
Transcription
1 Idea: Instant impression Statistical Graphics Bad graphics abound: From newspapers, magazines, Excel defaults, other software. 1 Color helpful: if used effectively. Avoid "chartjunk." Keep level/interests of audience in mind. Data types matter. Here are some common choices Nominal: Pie chart, sometimes bar chart Ordinal: Bar chart, sometimes pie (with extra cues) Discrete: Bar chart, sometimes histogram; For small sample only, maybe or dotplot (stripchart) Continuous: Histogram, boxplot Bivariate data: C = Categorical (Nominal or Ordinal), N = Numerical Data Types: Types of plots (Inferential methods later in course) C C: Table, multi-level barchart (Are categorical variables independent) C N: Multiple dotplots or boxplots (2-sample t-test, or one-way ANOVA) N N: Scatterplot or time-series plot (Correlation, regression, time series)
2 Simulated (fake) data to illustrate histogram and stripchart. Small normal sample. R software. 2 x = round(rnorm(25, 100, 15), 1) # normal data rounded to 1 place hist(x, ylim=c(-.5,10), main="test Scores") stripchart(x, at=-.5, add=t, pch=" ")
3 > sort(x) [1] [11] [21] > summary(x) Min. 1st Qu. Median Mean 3rd Qu. Max > boxplot(x) 3
4 Small normal sample. Minitab software. 4 MTB > rand 25 c1; SUBC> norm MTB > let c1 = round(c1 * 10)/10 MTB > name c1 'x' MTB > hist 'x' 6 Histogram of x 5 Frequency x
5 MTB > dotp 'x' 5 Dotplot of x x MTB > gstd # Starts 'standard' (character) graphics mode MTB > dotp 'x' Dotplot: x... ::..... :... :. : x MTB > gpro # Toggles back to 'professional' graphics mode
6 6 MTB > sort c1 c2 MTB > print c2 Data Display C MTB > desc c1 Descriptive Statistics: x Variable N N* Mean SE Mean StDev Minimum Q1 Median Q3 x Variable Maximum x MTB > gstd MTB > boxp c1 Boxplot I + I x
7 MTB > gpro MTB > boxp c1 7 Boxplot of x x
8 Large normal sample. R. 8 y = rnorm(1000, 100, 15) hist(y, prob=t) yy = seq(50, 150, by=.01) lines(yy, dnorm(yy, 100, 15), col="blue")
9 summary(y) Min. 1st Qu. Median Mean 3rd Qu. Max boxplot(y) 9
10 Large normal sample. Minitab. 10 MTB > rand 1000 c3; SUBC> norm MTB > name c3 'y' MTB > hist 'y' # Default algorithm--too many bins 80 Histogram of y Frequency y
11 Easy to restructure bins (boundaries called breakpoints or cutpoints) in Minitab. I have no idea how to overlay a theoretical normal curve on a Minitab histogram (but see below). 11 Histogram of y Frequency y
12 MTB > desc c3 12 Descriptive Statistics: y Variable N N* Mean SE Mean StDev Minimum Q1 Median Q3 y Variable Maximum y MTB > gstd MTB > boxplot 'y' Boxplot ** I + I * y
13 Caution: In Minitab character and professional graphics boxplots use different definitions of "quartile," so the "fences" that define outliers may also differ. (Here the two types of graph happen to agree on three outliers.) Also, the exact definition of quartile can differ from one statistical package to another. MTB > gpro Boxplot of y 125 y
14 About Outliers 14 "Detached" observations. In some sense, "far from other observations." Possible causes: Process error, data entry error Inherent feature of population 'Signal' that occasionally rises above 'noise' Difficulties: May indicate nonnormal distribution; thus call for alternative method of analysis. As above, even data known (simulated) to be exactly normal can show "outliers." Temptation: Throw them out. (Not always a good idea.) Only with good reason, and then report the deletion Caution: sometimes "outliers" may be main message of the data! More on outliers later.
15 MTB > rand 1000 c1; SUBC> norm MTB > GSummary c1. 15 Summary for C1 A nderson-darling N ormality Test A-S quared 0.20 P-V alue Mean StDev V ariance S kew ness Kurtosis N 1000 M inimum st Q uartile M edian rd Q uartile M aximum % Confidence Interval for Mean % Confidence Interval for Median % Confidence Intervals 95% Confidence Interval for StDev Mean Median The normal curve through the data is based on sample mean and standard deviation.
16 Standard Deviation 16 > x = c(1, 2, 6, 3) > mean(x) [1] 3 > d = x - 3 > d.sq = d^2 > x.sq = x^2 > sum(x) [1] 12 > sum(x.sq) [1] 50 > sum(d.sq) [1] 14 > cbind(x, d, d.sq, x.sq) x d d.sq x.sq [1,] [2,] [3,] [4,] > mean(x) [1] 3 > var(x) [1] > sd(x) [1] Illustrating computation of S: Two methods. i x i (xi - 3) (xi - 3) 2 x i Tot Definition: S 2 = 14/3 = 4.67 S = 2.16 "Computational Formula": S 2 = ( /4)/3 = (50-36)/3 = 14/3 = 4.67 Prove: For nonnegative data, the sample mean exceeds its standard error (SE = S/ n). Hints: Use the computational formula for S 2. What relationship between Σ i X i 2 and (Σ i X i ) 2?
17 Interpretations of Standard Deviation 17 Empirical Rule Within 1 standard deviation of mean: "About" 68% of observations Within 2 standard deviations of mean: "About" 95% of observations Within 3 standard deviations of mean: "All or almost all" observations Not a theorem. When does it work? For "normal" population: would be exactly true. (But we have a sample.) For large samples from a normal or "nearly normal" population (several hundred at least): often useful approximation. For small samples, even from normal: extremely rough guide. More on Empirical Rule Later Chebyshev's Inequality. Theorem (true for all samples and populations) Too general to be of much use except in proving limit theorems in probability. Within 2 standard deviations of mean: 1 1/2 2 = 75% Within 3 standard deviations of mean: 1 1/3 2 = 88.9% "ACB Rule." (Almost Completely Bogus Simply not true unless n 25 or n 100) "To estimate σ or approximate S divide sample range by 4 (O/L p91) or by 5 (other books)." DON'T USE THIS!
18 Simulation to disprove the "ACB Rule": m = n = c(5:10, 20, 25, 50, 100, 250, 500) k = numeric(length(n)) for (i in 1:length(n)) { DTA = matrix(rnorm(m*n[i]), nrow=m) x.sd = apply(dta, 1, sd) x.mx = apply(dta, 1, max) x.mn = apply(dta, 1, min) x.rg = x.mx - x.mn k[i] = mean(x.rg)/mean(x.sd) } plot(n, k) cbind(n, round(k,2)) > cbind(n, round(k,2)) n [1,] [2,] [3,] [4,] [5,] [6,] [7,] [8,] [9,] [10,] [11,] [12,] Notes on simulation: Each value k below (each dot on graph) is based on m = 10,000 simulated samples of the specified size. Specified sizes are n = 5, 6,..., 10, 20, 25, 50, 100, 250, 500. Go through a loop, using a different value of n on each pass through the loop. Within the loop, the DTA matrix has m = 10,000 rows and n columns. Each row of the matrix is one sample of the specified size. The apply function is used to find the standard deviation, maximum, and minimum for each sample (row). The correct divisor k is found for each n. After the loop the results are presented in a table and (next page) a graph. This demonstration is only for normal data. 18
19 Maybe it's reasonable to divide by 3 for n 8 or 9, by 4 for n 25, and by 5 for n 100. Unlike many results in stat/prob, approx. of S using range gets worse as n increases. 19 n = 5 (divide by 2.5). Range/2.5 gives pretty good approximation of S. (Find upper and lower linear boundaries for n = 5. What exact relationship for n = 2?)
20 20
21 21 # Boundary lines for n = 5 m = DTA = matrix(rnorm(m*5), nrow=m) x.sd = apply(dta, 1, sd) x.mx = apply(dta, 1, max) x.mn = apply(dta, 1, min) x.rg = x.mx - x.mn k = mean(x.rg)/mean(x.sd) plot(x.rg, x.sd, pch=".") abline(a=0, b=.3536, col="darkred") abline(a=0, b=.5477, col="darkred") # Exactly linear for n = 2 m = DTA = matrix(rnorm(m*2), nrow=m) x.sd = apply(dta, 1, sd) x.mx = apply(dta, 1, max) x.mn = apply(dta, 1, min) x.rg = x.mx - x.mn k = mean(x.rg)/mean(x.sd) plot(x.rg, x.sd, pch=".") > k [1]
22 n = 25 (divide by 4). Plot is not very nearly linear. Mediocre estimation of S Range/4 22
23 n = 500 (divide by 6) Plot is not at all linear: Very poor estimation of S Range/6. 23 Note: The required divisor goes to with increasing n.
24 Simulation in Minitab for n = 5. (Relatively rare case where simulation is easy in Minitab.) Try it for yourself. Look at first few rows and verify values of SD and range by hand. Note: When finished, delete HUGE worksheet before continuing to use Minitab for other purposes. MTB > random c1-c5; SUBC> norm 0 1. MTB > name c6 'sd' c7 'mx' c8 'mn' c9 'rng' c10 'k' MTB > rstd c1-c5 c6 MTB > rmax c1-c5 c7 MTB > rmin c1-c5 c8 MTB > let c9 = c7 - c8 MTB > let c10 = c9/c6 MTB > desc c10 Descriptive Statistics: k Variable N N* Mean SE Mean StDev Minimum Q1 Median Q3 k Variable Maximum k # Slightly different procedure: divided before averaging. # Result nearly the same as in R for n = 5: k approx. 2.5 MTB > plot c6 * c9 # click on large red dots, change to small red dots. # result is similar to plot done above in R. # Note y-axis mentioned first in Minitab. 24
25 In-Lab Problems: 25 Dataset: Gender Height Weight (inches) (pounds) College M Bsns F Sci F Bsns M Sci M Bsns F Eng M Eng M Sci 1. Input the data into Minitab system. 2. Summarizing Gender: find its empirical distribution and graph it. 3. Summarizing Height: find its mean and median. Graph its distribution; which measure best summarizes the center of the distribution? 4. Make a table to show the association between Gender and College. 5. Make a plot to illustrate the association between Height and Weight. 6. Make a plot to illustrate the association between Height and Gender.
26 Brief Instructions Basic descriptive statistics: Stat> Basic Statistics> Display Descriptive Statistics, Variables, Graphs 2. Drawing graphs: Graph> the wanted graph Use one of the two datasets presented on page 125 of the text (in Exercise 3.30), to do the following: (a) Make a histogram by hand. Explain your choice of intervals. Why might Minitab make a histogram that looks different? (b) Find the sample mean. On the horizontal axis (bottom scale) of your histogram, mark the position of the sample mean. Does the histogram seem to "balance" there? (c) Find the interval that extends two sample standard deviations on either side of the sample mean. Sketch this interval on the bottom scale of your histogram. What proportion of all observations actually falls in this interval? (You may find the sample mean and standard deviation either with software or with a calculator.)
8. MINITAB COMMANDS WEEK-BY-WEEK
8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are
More informationMinitab Notes for Activity 1
Minitab Notes for Activity 1 Creating the Worksheet 1. Label the columns as team, heat, and time. 2. Have Minitab automatically enter the team data for you. a. Choose Calc / Make Patterned Data / Simple
More informationStatistics 251: Statistical Methods
Statistics 251: Statistical Methods Summaries and Graphs in R Module R1 2018 file:///u:/documents/classes/lectures/251301/renae/markdown/master%20versions/summary_graphs.html#1 1/14 Summary Statistics
More informationPart I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures
Part I, Chapters 4 & 5 Data Tables and Data Analysis Statistics and Figures Descriptive Statistics 1 Are data points clumped? (order variable / exp. variable) Concentrated around one value? Concentrated
More informationIT 403 Practice Problems (1-2) Answers
IT 403 Practice Problems (1-2) Answers #1. Using Tukey's Hinges method ('Inclusionary'), what is Q3 for this dataset? 2 3 5 7 11 13 17 a. 7 b. 11 c. 12 d. 15 c (12) #2. How do quartiles and percentiles
More informationChapter 3 - Displaying and Summarizing Quantitative Data
Chapter 3 - Displaying and Summarizing Quantitative Data 3.1 Graphs for Quantitative Data (LABEL GRAPHS) August 25, 2014 Histogram (p. 44) - Graph that uses bars to represent different frequencies or relative
More informationPrepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.
Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good
More informationUnit 1: The One-Factor ANOVA as a Generalization of the Two-Sample t Test
Minitab Notes for STAT 6305: Analysis of Variance Models Department of Statistics and Biostatistics CSU East Bay Unit 1: The One-Factor ANOVA as a Generalization of the Two-Sample t Test 1.1. Data and
More informationExcel 2010 with XLSTAT
Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with
More informationName Date Types of Graphs and Creating Graphs Notes
Name Date Types of Graphs and Creating Graphs Notes Graphs are helpful visual representations of data. Different graphs display data in different ways. Some graphs show individual data, but many do not.
More informationMinitab 17 commands Prepared by Jeffrey S. Simonoff
Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save
More informationMinitab Guide for MA330
Minitab Guide for MA330 The purpose of this guide is to show you how to use the Minitab statistical software to carry out the statistical procedures discussed in your textbook. The examples usually are
More informationUnit I Supplement OpenIntro Statistics 3rd ed., Ch. 1
Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1 KEY SKILLS: Organize a data set into a frequency distribution. Construct a histogram to summarize a data set. Compute the percentile for a particular
More informationChapter 2 Describing, Exploring, and Comparing Data
Slide 1 Chapter 2 Describing, Exploring, and Comparing Data Slide 2 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data 2-4 Measures of Center 2-5 Measures of Variation 2-6 Measures of Relative
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,
More informationBIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26
Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 INTRODUCTION Graphs are one of the most important aspects of data analysis and presentation of your of data. They are visual representations
More informationAn Introduction to Minitab Statistics 529
An Introduction to Minitab Statistics 529 1 Introduction MINITAB is a computing package for performing simple statistical analyses. The current version on the PC is 15. MINITAB is no longer made for the
More informationRegression III: Advanced Methods
Lecture 3: Distributions Regression III: Advanced Methods William G. Jacoby Michigan State University Goals of the lecture Examine data in graphical form Graphs for looking at univariate distributions
More informationBluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition
Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created
More informationSTA Module 4 The Normal Distribution
STA 2023 Module 4 The Normal Distribution Learning Objectives Upon completing this module, you should be able to 1. Explain what it means for a variable to be normally distributed or approximately normally
More informationSTA /25/12. Module 4 The Normal Distribution. Learning Objectives. Let s Look at Some Examples of Normal Curves
STA 2023 Module 4 The Normal Distribution Learning Objectives Upon completing this module, you should be able to 1. Explain what it means for a variable to be normally distributed or approximately normally
More informationUNIT 1A EXPLORING UNIVARIATE DATA
A.P. STATISTICS E. Villarreal Lincoln HS Math Department UNIT 1A EXPLORING UNIVARIATE DATA LESSON 1: TYPES OF DATA Here is a list of important terms that we must understand as we begin our study of statistics
More information1.2. Pictorial and Tabular Methods in Descriptive Statistics
1.2. Pictorial and Tabular Methods in Descriptive Statistics Section Objectives. 1. Stem-and-Leaf displays. 2. Dotplots. 3. Histogram. Types of histogram shapes. Common notation. Sample size n : the number
More informationSo..to be able to make comparisons possible, we need to compare them with their respective distributions.
Unit 3 ~ Modeling Distributions of Data 1 ***Section 2.1*** Measures of Relative Standing and Density Curves (ex) Suppose that a professional soccer team has the money to sign one additional player and
More informationChapter 3: Data Description Calculate Mean, Median, Mode, Range, Variation, Standard Deviation, Quartiles, standard scores; construct Boxplots.
MINITAB Guide PREFACE Preface This guide is used as part of the Elementary Statistics class (Course Number 227) offered at Los Angeles Mission College. It is structured to follow the contents of the textbook
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More information3 Graphical Displays of Data
3 Graphical Displays of Data Reading: SW Chapter 2, Sections 1-6 Summarizing and Displaying Qualitative Data The data below are from a study of thyroid cancer, using NMTR data. The investigators looked
More informationStatistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.
Statistical Methods Instructor: Lingsong Zhang 1 Issues before Class Statistical Methods Lingsong Zhang Office: Math 544 Email: lingsong@purdue.edu Phone: 765-494-7913 Office Hour: Monday 1:00 pm - 2:00
More informationCopyright 2015 by Sean Connolly
1 Copyright 2015 by Sean Connolly All rights reserved. No part of this publication may be reproduced, distributed, or transmitted in any form or by any means, including photocopying, recording, or other
More informationLearner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display
CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &
More informationChapter 6: DESCRIPTIVE STATISTICS
Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling
More informationTable of Contents (As covered from textbook)
Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression
More informationLAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA
LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA This lab will assist you in learning how to summarize and display categorical and quantitative data in StatCrunch. In particular, you will learn how to
More informationYour Name: Section: 2. To develop an understanding of the standard deviation as a measure of spread.
Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab #3 Interpreting the Standard Deviation and Exploring Transformations Objectives: 1. To review stem-and-leaf plots and their
More informationSolution to Tumor growth in mice
Solution to Tumor growth in mice Exercise 1 1. Import the data to R Data is in the file tumorvols.csv which can be read with the read.csv2 function. For a succesful import you need to tell R where exactly
More informationa. divided by the. 1) Always round!! a) Even if class width comes out to a, go up one.
Probability and Statistics Chapter 2 Notes I Section 2-1 A Steps to Constructing Frequency Distributions 1 Determine number of (may be given to you) a Should be between and classes 2 Find the Range a The
More informationData can be in the form of numbers, words, measurements, observations or even just descriptions of things.
+ What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and
More informationNumerical Descriptive Measures
Chapter 3 Numerical Descriptive Measures 1 Numerical Descriptive Measures Chapter 3 Measures of Central Tendency and Measures of Dispersion A sample of 40 students at a university was randomly selected,
More informationThings you ll know (or know better to watch out for!) when you leave in December: 1. What you can and cannot infer from graphs.
1 2 Things you ll know (or know better to watch out for!) when you leave in December: 1. What you can and cannot infer from graphs. 2. How to construct (in your head!) and interpret confidence intervals.
More informationplots Chris Parrish August 20, 2015
plots Chris Parrish August 20, 2015 plots We construct some of the most commonly used types of plots for numerical data. dotplot A stripchart is most suitable for displaying small data sets. data
More informationCHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.
1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed
More informationSummarising Data. Mark Lunt 09/10/2018. Arthritis Research UK Epidemiology Unit University of Manchester
Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 09/10/2018 Summarising Data Today we will consider Different types of data Appropriate ways to summarise these
More informationContinuous Improvement Toolkit. Normal Distribution. Continuous Improvement Toolkit.
Continuous Improvement Toolkit Normal Distribution The Continuous Improvement Map Managing Risk FMEA Understanding Performance** Check Sheets Data Collection PDPC RAID Log* Risk Analysis* Benchmarking***
More informationYour Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression
Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression Objectives: 1. To learn how to interpret scatterplots. Specifically you will investigate, using
More informationUnivariate Statistics Summary
Further Maths Univariate Statistics Summary Types of Data Data can be classified as categorical or numerical. Categorical data are observations or records that are arranged according to category. For example:
More informationIntroduction to Minitab 1
Introduction to Minitab 1 We begin by first starting Minitab. You may choose to either 1. click on the Minitab icon in the corner of your screen 2. go to the lower left and hit Start, then from All Programs,
More information1.3 Graphical Summaries of Data
Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan 1.3 Graphical Summaries of Data In the previous section we discussed numerical summaries of either a sample or a data. In this
More informationComputing With R Handout 1
Computing With R Handout 1 Getting Into R To access the R language (free software), go to a computing lab that has R installed, or a computer on which you have downloaded R from one of the distribution
More informationGraphical Analysis of Data using Microsoft Excel [2016 Version]
Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.
More informationChapter 5: The standard deviation as a ruler and the normal model p131
Chapter 5: The standard deviation as a ruler and the normal model p131 Which is the better exam score? 67 on an exam with mean 50 and SD 10 62 on an exam with mean 40 and SD 12? Is it fair to say: 67 is
More informationMinitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.
Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see
More informationTMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS
To Describe Data, consider: Symmetry Skewness TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS Unimodal or bimodal or uniform Extreme values Range of Values and mid-range Most frequently occurring values In
More information2.1: Frequency Distributions and Their Graphs
2.1: Frequency Distributions and Their Graphs Frequency Distribution - way to display data that has many entries - table that shows classes or intervals of data entries and the number of entries in each
More informationThe basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student
Organizing data Learning Outcome 1. make an array 2. divide the array into class intervals 3. describe the characteristics of a table 4. construct a frequency distribution table 5. constructing a composite
More informationMATH& 146 Lesson 10. Section 1.6 Graphing Numerical Data
MATH& 146 Lesson 10 Section 1.6 Graphing Numerical Data 1 Graphs of Numerical Data One major reason for constructing a graph of numerical data is to display its distribution, or the pattern of variability
More informationVocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.
5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table
More informationChapter 2: Descriptive Statistics
Chapter 2: Descriptive Statistics Student Learning Outcomes By the end of this chapter, you should be able to: Display data graphically and interpret graphs: stemplots, histograms and boxplots. Recognize,
More informationGetting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018
Getting started with simulating data in R: some helpful functions and how to use them Ariel Muldoon August 28, 2018 Contents Overview 2 Generating random numbers 2 rnorm() to generate random numbers from
More informationIQR = number. summary: largest. = 2. Upper half: Q3 =
Step by step box plot Height in centimeters of players on the 003 Women s Worldd Cup soccer team. 157 1611 163 163 164 165 165 165 168 168 168 170 170 170 171 173 173 175 180 180 Determine the 5 number
More informationCHAPTER 2: Describing Location in a Distribution
CHAPTER 2: Describing Location in a Distribution 2.1 Goals: 1. Compute and use z-scores given the mean and sd 2. Compute and use the p th percentile of an observation 3. Intro to density curves 4. More
More informationIn Minitab interface has two windows named Session window and Worksheet window.
Minitab Minitab is a statistics package. It was developed at the Pennsylvania State University by researchers Barbara F. Ryan, Thomas A. Ryan, Jr., and Brian L. Joiner in 1972. Minitab began as a light
More informationChapter 3 Analyzing Normal Quantitative Data
Chapter 3 Analyzing Normal Quantitative Data Introduction: In chapters 1 and 2, we focused on analyzing categorical data and exploring relationships between categorical data sets. We will now be doing
More informationChapter 3: Data Description - Part 3. Homework: Exercises 1-21 odd, odd, odd, 107, 109, 118, 119, 120, odd
Chapter 3: Data Description - Part 3 Read: Sections 1 through 5 pp 92-149 Work the following text examples: Section 3.2, 3-1 through 3-17 Section 3.3, 3-22 through 3.28, 3-42 through 3.82 Section 3.4,
More informationappstats6.notebook September 27, 2016
Chapter 6 The Standard Deviation as a Ruler and the Normal Model Objectives: 1.Students will calculate and interpret z scores. 2.Students will compare/contrast values from different distributions using
More informationAP Statistics Summer Assignment:
AP Statistics Summer Assignment: Read the following and use the information to help answer your summer assignment questions. You will be responsible for knowing all of the information contained in this
More informationDescriptive Statistics
Chapter 2 Descriptive Statistics 2.1 Descriptive Statistics 1 2.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Display data graphically and interpret graphs:
More informationChapter 6. THE NORMAL DISTRIBUTION
Chapter 6. THE NORMAL DISTRIBUTION Introducing Normally Distributed Variables The distributions of some variables like thickness of the eggshell, serum cholesterol concentration in blood, white blood cells
More informationSections 4.3 and 4.4
Sections 4.3 and 4.4 Timothy Hanson Department of Statistics, University of South Carolina Stat 205: Elementary Statistics for the Biological and Life Sciences 1 / 32 4.3 Areas under normal densities Every
More informationHomework 1 Excel Basics
Homework 1 Excel Basics Excel is a software program that is used to organize information, perform calculations, and create visual displays of the information. When you start up Excel, you will see the
More informationStatistics Lecture 6. Looking at data one variable
Statistics 111 - Lecture 6 Looking at data one variable Chapter 1.1 Moore, McCabe and Craig Probability vs. Statistics Probability 1. We know the distribution of the random variable (Normal, Binomial)
More informationSTA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures
STA 2023 Module 3 Descriptive Measures Learning Objectives Upon completing this module, you should be able to: 1. Explain the purpose of a measure of center. 2. Obtain and interpret the mean, median, and
More informationChapter 2 Modeling Distributions of Data
Chapter 2 Modeling Distributions of Data Section 2.1 Describing Location in a Distribution Describing Location in a Distribution Learning Objectives After this section, you should be able to: FIND and
More informationCHAPTER 2 DESCRIPTIVE STATISTICS
CHAPTER 2 DESCRIPTIVE STATISTICS 1. Stem-and-Leaf Graphs, Line Graphs, and Bar Graphs The distribution of data is how the data is spread or distributed over the range of the data values. This is one of
More information3 Graphical Displays of Data
3 Graphical Displays of Data Reading: SW Chapter 2, Sections 1-6 Summarizing and Displaying Qualitative Data The data below are from a study of thyroid cancer, using NMTR data. The investigators looked
More informationChapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data
Chapter 2 Descriptive Statistics: Organizing, Displaying and Summarizing Data Objectives Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically
More informationData Management Project Using Software to Carry Out Data Analysis Tasks
Data Management Project Using Software to Carry Out Data Analysis Tasks This activity involves two parts: Part A deals with finding values for: Mean, Median, Mode, Range, Standard Deviation, Max and Min
More informationHow individual data points are positioned within a data set.
Section 3.4 Measures of Position Percentiles How individual data points are positioned within a data set. P k is the value such that k% of a data set is less than or equal to P k. For example if we said
More informationCHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data 2.2 Density Curves and Normal Distributions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers HW 34. Sketch
More informationChapter2 Description of samples and populations. 2.1 Introduction.
Chapter2 Description of samples and populations. 2.1 Introduction. Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that
More informationMeasures of Position
Measures of Position In this section, we will learn to use fractiles. Fractiles are numbers that partition, or divide, an ordered data set into equal parts (each part has the same number of data entries).
More informationWeek 7: The normal distribution and sample means
Week 7: The normal distribution and sample means Goals Visualize properties of the normal distribution. Learning the Tools Understand the Central Limit Theorem. Calculate sampling properties of sample
More informationUnivariate Data - 2. Numeric Summaries
Univariate Data - 2. Numeric Summaries Young W. Lim 2018-08-01 Mon Young W. Lim Univariate Data - 2. Numeric Summaries 2018-08-01 Mon 1 / 36 Outline 1 Univariate Data Based on Numerical Summaries R Numeric
More informationBar Charts and Frequency Distributions
Bar Charts and Frequency Distributions Use to display the distribution of categorical (nominal or ordinal) variables. For the continuous (numeric) variables, see the page Histograms, Descriptive Stats
More informationCHAPTER 3: Data Description
CHAPTER 3: Data Description You ve tabulated and made pretty pictures. Now what numbers do you use to summarize your data? Ch3: Data Description Santorico Page 68 You ll find a link on our website to a
More informationStatistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975.
Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975. SPSS Statistics were designed INTRODUCTION TO SPSS Objective About the
More informationChapter 6. THE NORMAL DISTRIBUTION
Chapter 6. THE NORMAL DISTRIBUTION Introducing Normally Distributed Variables The distributions of some variables like thickness of the eggshell, serum cholesterol concentration in blood, white blood cells
More informationCh6: The Normal Distribution
Ch6: The Normal Distribution Introduction Review: A continuous random variable can assume any value between two endpoints. Many continuous random variables have an approximately normal distribution, which
More informationUNIT 15 GRAPHICAL PRESENTATION OF DATA-I
UNIT 15 GRAPHICAL PRESENTATION OF DATA-I Graphical Presentation of Data-I Structure 15.1 Introduction Objectives 15.2 Graphical Presentation 15.3 Types of Graphs Histogram Frequency Polygon Frequency Curve
More informationGETTING STARTED WITH MINITAB INTRODUCTION TO MINITAB STATISTICAL SOFTWARE
Six Sigma Quality Concepts & Cases Volume I STATISTICAL TOOLS IN SIX SIGMA DMAIC PROCESS WITH MINITAB APPLICATIONS CHAPTER 2 GETTING STARTED WITH MINITAB INTRODUCTION TO MINITAB STATISTICAL SOFTWARE Amar
More informationKey: 5 9 represents a team with 59 wins. (c) The Kansas City Royals and Cleveland Indians, who both won 65 games.
AP statistics Chapter 2 Notes Name Modeling Distributions of Data Per Date 2.1A Distribution of a variable is the a variable takes and it takes that value. When working with quantitative data we can calculate
More informationSTAT STATISTICAL METHODS. Statistics: The science of using data to make decisions and draw conclusions
STAT 515 --- STATISTICAL METHODS Statistics: The science of using data to make decisions and draw conclusions Two branches: Descriptive Statistics: The collection and presentation (through graphical and
More informationThis lesson is designed to improve students
NATIONAL MATH + SCIENCE INITIATIVE Mathematics g x 8 6 4 2 0 8 6 4 2 y h x k x f x r x 8 6 4 2 0 8 6 4 2 2 2 4 6 8 0 2 4 6 8 4 6 8 0 2 4 6 8 LEVEL Algebra or Math in a unit on function transformations
More informationTable Of Contents. Table Of Contents
Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store
More informationMaking Science Graphs and Interpreting Data
Making Science Graphs and Interpreting Data Eye Opener: 5 mins What do you see? What do you think? Look up terms you don t know What do Graphs Tell You? A graph is a way of expressing a relationship between
More informationCHAPTER 6. The Normal Probability Distribution
The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit
More informationDensity Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.
1.3 Density curves p50 Some times the overall pattern of a large number of observations is so regular that we can describe it by a smooth curve. It is easier to work with a smooth curve, because the histogram
More informationChapter 2 - Graphical Summaries of Data
Chapter 2 - Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense
More informationAcquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.
Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting
More informationName: Date: Period: Chapter 2. Section 1: Describing Location in a Distribution
Name: Date: Period: Chapter 2 Section 1: Describing Location in a Distribution Suppose you earned an 86 on a statistics quiz. The question is: should you be satisfied with this score? What if it is the
More informationChapter 1. Looking at Data-Distribution
Chapter 1. Looking at Data-Distribution Statistics is the scientific discipline that provides methods to draw right conclusions: 1)Collecting the data 2)Describing the data 3)Drawing the conclusions Raw
More information