Univariate Data - 2. Numeric Summaries
|
|
- Marylou Richard
- 5 years ago
- Views:
Transcription
1 Univariate Data - 2. Numeric Summaries Young W. Lim Mon Young W. Lim Univariate Data - 2. Numeric Summaries Mon 1 / 31
2 Outline 1 Univariate Data Based on Numerical Summaries Young W. Lim Univariate Data - 2. Numeric Summaries Mon 2 / 31
3 Based on "Using R for Introductory Statistics" John Verzani I, the copyright holder of this work, hereby publish it under the following licenses: GNU head Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled GNU Free Documentation License. CC BY SA This file is licensed under the Creative Commons Attribution ShareAlike 3.0 Unported License. In short: you are free to share and make derivative works of the file under the conditions that you appropriately attribute it, and that you distribute it only under a license compatible with this one. Young W. Lim Univariate Data - 2. Numeric Summaries Mon 3 / 31
4 Functions function(x) { sum(x) / length(x) } functions(x) { total <- sum(x) n <- length(x) total / n } my_mean <- function(x) { sum(x) / length(x) } my_mean( c(1,2,3,4) ) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 4 / 31
5 The sample mean x <- c(51, 52, 48, 83, 92) sort(x) mean(x) devs <- x - mean(x) mean(devs) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 5 / 31
6 The trimmed mean mean(x, trim=0.10) # trim 10% of both ends mean(exec.pay) mean(exec.pay, trim=0.10) the Macdonell (HistData) data set w <- Macdonell$frequency / sum(macdonell$frequency) y <- Macdonell$height sum(w*y) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 6 / 31
7 The sample median median(x) n <- length(exec.pay); trim=0.10 lo <- 1 + floor(n * trim) hi <- n lo median(sort(exec.pay)[lo:hi]) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 7 / 31
8 Measures of position x <- 0:5 length(x) mean(sort(x)[3:4]) median(x) quantile(x) quantile(x, 0.25) # 0, 25, 50, pecentiles quantile(x, seq(0, 1, by=0.2)) fivenum(x) y = c("a"=95, "B"=82, "C"=74, "D"=61)y Young W. Lim Univariate Data - 2. Numeric Summaries Mon 8 / 31
9 Other measures of center table(x) table(x) == max(table(x)) which(table(x) == max(table(x))) as.numeric(names( which(table(x) == max(table(x))))) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 9 / 31
10 Spread range(x) diff(range(x)) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 10 / 31
11 Variance and standard deviation var(x) sum( (x - mean(x))^2 ) / (length(x) -1) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 11 / 31
12 Sample standard deviation x <- c(100, 300, 900, NA, 200, 500, 700) range(x, na.rm=true) sd(x, na.rm=true) z_score <- function(x) (x - mean(x)) / sd(x) z_score(x) scale(x)[,1] # to extract the 1st column z <- (x -mean(x)) / sd(x) x[ z >= 1.28] mean(x) * sd(x) z <- (exec.pay- mean(exec.pay)) / sd(exec.pay) out <- abs(z) > 3 sum(out) / length(z) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 12 / 31
13 IQR (InterWuartile Range) median(rivers) IQR(rivers) IQR(rivers) / sd(rivers) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 13 / 31
14 mad (Median Absolute Deviation) mad(rivers) / sd(rivers) ht <- kid.weights$height mad(ht) / sd(ht) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 14 / 31
15 Shape Symmetry and skew Tails Modes Young W. Lim Univariate Data - 2. Numeric Summaries Mon 15 / 31
16 Symmetry and skew skew <- function(x) { n <- length(x) z <- (x - mean(x)) / sd(x) sum(z^3) / n } skew(exec.pay) four_year_hts <- \ kid.weights[kid.weights$age %in% 48:59, "height"] skew(four_year_hts) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 16 / 31
17 Tail kurtosis <- function(x) { n <- length(x) z <- (x -mean(x)) / sd(x) sum(z^4)/n - 3 } kurtosis(galton$parent) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 17 / 31
18 Modes unimodal bimodal multimodal Young W. Lim Univariate Data - 2. Numeric Summaries Mon 18 / 31
19 Viewing the shape of a data set Dot plots Stem and leaf plots Histograms Density plots Box plots Quantile graphs Young W. Lim Univariate Data - 2. Numeric Summaries Mon 19 / 31
20 Dot plots jitter stripchart Young W. Lim Univariate Data - 2. Numeric Summaries Mon 20 / 31
21 Stem-and-leaf plots stem stem(bumpers) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 21 / 31
22 Histogram hist(faithful$waiting) bins <- seq(40, 100, by=5) x <- faithful$waiting out <- cut(x, breaks=bins) head(out) table(out) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 22 / 31
23 Density plots plot( density(bumpers) ) b_hist <- hist(bumpers, plot=false) b_dens <- density(bumpers) hist(bumpers, probability=true, xlim=range(c(b_hist$breaks, b_dens$x)), ylim=range(c(b_hist$density, b_dens$y)) ) lines(b_dens, lwd=2) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 23 / 31
24 Standard plotting arguments xlim ylim xlab ylab main pch cex col lwd lty set x coordinate range set y coordinate range set label for x axis set label for y axis set the main title adjust plot symbols (?pch) adjust size of text and symbols adjust color of objects drawn (?colors) adjust width of lines drawn adjust how line is drawn "blank", "solid", "dashed", dotdash" bty adjust bos type ("o", "l", "7", "c", "u", "]" Young W. Lim Univariate Data - 2. Numeric Summaries Mon 24 / 31
25 Functions to add layers points lines abkube text mtext add points to a graphic add points connected by lines to a graphic add a line of the form a + bx, y=h, or x=v add text to a graphic add text to margins of a graphic Young W. Lim Univariate Data - 2. Numeric Summaries Mon 25 / 31
26 Various plots hist(bumpers, probability=true, xlim = range( c(b_hist$breaks, b_dens$x)), ylim = range( c(b_hist$density, b_dens$y))) lines(b_dens, lwd=2) boxplot(bumpers, horizontal=true, main="bumpers") x <- rep(macdonell$finger, Macdonell$frequency) qqnorm(x) x <- jitter(histdata::galton$child, factor=5) qqnorm(x) Young W. Lim Univariate Data - 2. Numeric Summaries Mon 26 / 31
27 OOO Young W. Lim Univariate Data - 2. Numeric Summaries Mon 27 / 31
28 OOO Young W. Lim Univariate Data - 2. Numeric Summaries Mon 28 / 31
29 OOO Young W. Lim Univariate Data - 2. Numeric Summaries Mon 29 / 31
30 OOO Young W. Lim Univariate Data - 2. Numeric Summaries Mon 30 / 31
31 OOO Young W. Lim Univariate Data - 2. Numeric Summaries Mon 31 / 31
Univariate Data - 2. Numeric Summaries
Univariate Data - 2. Numeric Summaries Young W. Lim 2018-08-01 Mon Young W. Lim Univariate Data - 2. Numeric Summaries 2018-08-01 Mon 1 / 36 Outline 1 Univariate Data Based on Numerical Summaries R Numeric
More informationHyperGeometric Distribution
HyperGeometric Distribution Young W. Lim 2018-02-22 Thr Young W. Lim HyperGeometric Distribution 2018-02-22 Thr 1 / 15 Outline 1 HyperGeometric Distribution Based on HyperGeometric Random Variables Cumulative
More informationDay06 A. Young W. Lim Mon. Young W. Lim Day06 A Mon 1 / 16
Day06 A Young W. Lim 2017-09-18 Mon Young W. Lim Day06 A 2017-09-18 Mon 1 / 16 Outline 1 Based on 2 Introduction C Program Control Young W. Lim Day06 A 2017-09-18 Mon 2 / 16 Based on "C How to Program",
More informationStatistical Programming Camp: An Introduction to R
Statistical Programming Camp: An Introduction to R Handout 3: Data Manipulation and Summarizing Univariate Data Fox Chapters 1-3, 7-8 In this handout, we cover the following new materials: ˆ Using logical
More informationSTAT 3743 PROBABILITY & STATISTICS FALL 2010 KERNS. Exam I
STAT 3743 PROBABILITY & STATISTICS FALL 2010 KERNS Exam I Name: ANSWER KEY Note: the questions are randomly generated so these may (not) exactly match those on your paper. The answers below are for these
More informationStatistics 251: Statistical Methods
Statistics 251: Statistical Methods Summaries and Graphs in R Module R1 2018 file:///u:/documents/classes/lectures/251301/renae/markdown/master%20versions/summary_graphs.html#1 1/14 Summary Statistics
More informationChapter 1. Looking at Data-Distribution
Chapter 1. Looking at Data-Distribution Statistics is the scientific discipline that provides methods to draw right conclusions: 1)Collecting the data 2)Describing the data 3)Drawing the conclusions Raw
More informationChapter 3 - Displaying and Summarizing Quantitative Data
Chapter 3 - Displaying and Summarizing Quantitative Data 3.1 Graphs for Quantitative Data (LABEL GRAPHS) August 25, 2014 Histogram (p. 44) - Graph that uses bars to represent different frequencies or relative
More informationGAS Tutorial - 6. Expression
GAS Tutorial - 6. Expression Young W. Lim 2016-03-03 Wed Young W. Lim GAS Tutorial - 6. Expression 2016-03-03 Wed 1 / 15 Outline 1 Expressions Young W. Lim GAS Tutorial - 6. Expression 2016-03-03 Wed 2
More informationSection 1.2. Displaying Quantitative Data with Graphs. Mrs. Daniel AP Stats 8/22/2013. Dotplots. How to Make a Dotplot. Mrs. Daniel AP Statistics
Section. Displaying Quantitative Data with Graphs Mrs. Daniel AP Statistics Section. Displaying Quantitative Data with Graphs After this section, you should be able to CONSTRUCT and INTERPRET dotplots,
More informationTMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS
To Describe Data, consider: Symmetry Skewness TMTH 3360 NOTES ON COMMON GRAPHS AND CHARTS Unimodal or bimodal or uniform Extreme values Range of Values and mid-range Most frequently occurring values In
More informationThe nor1mix Package. August 3, 2006
The nor1mix Package August 3, 2006 Title Normal (1-d) Mixture Models (S3 Classes and Methods) Version 1.0-6 Date 2006-08-02 Author: Martin Mächler Maintainer Martin Maechler
More informationThe nor1mix Package. June 12, 2007
The nor1mix Package June 12, 2007 Title Normal (1-d) Mixture Models (S3 Classes and Methods) Version 1.0-7 Date 2007-03-15 Author Martin Mächler Maintainer Martin Maechler
More informationPre-Calculus Multiple Choice Questions - Chapter S2
1 Which of the following is NOT part of a univariate EDA? a Shape b Center c Dispersion d Distribution Pre-Calculus Multiple Choice Questions - Chapter S2 2 Which of the following is NOT an acceptable
More informationTopic (3) SUMMARIZING DATA - TABLES AND GRAPHICS
Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS 3- Topic (3) SUMMARIZING DATA - TABLES AND GRAPHICS A) Frequency Distributions For Samples Defn: A FREQUENCY DISTRIBUTION is a tabular or graphical display
More informationDay08 A. Young W. Lim Mon. Young W. Lim Day08 A Mon 1 / 27
Day08 A Young W. Lim 2017-10-16 Mon Young W. Lim Day08 A 2017-10-16 Mon 1 / 27 Outline 1 Based on 2 C Functions (2) Storage Class and Scope Storage Class Specifiers A. Storage Duration B. Scope C. Linkage
More informationDay02 A. Young W. Lim Sat. Young W. Lim Day02 A Sat 1 / 12
Day02 A Young W. Lim 2017-10-07 Sat Young W. Lim Day02 A 2017-10-07 Sat 1 / 12 Outline 1 Based on 2 Introduction (2) - Basic Elements Basic Elements in C Programming Young W. Lim Day02 A 2017-10-07 Sat
More informationAcquisition Description Exploration Examination Understanding what data is collected. Characterizing properties of data.
Summary Statistics Acquisition Description Exploration Examination what data is collected Characterizing properties of data. Exploring the data distribution(s). Identifying data quality problems. Selecting
More informationDay06 A. Young W. Lim Wed. Young W. Lim Day06 A Wed 1 / 26
Day06 A Young W. Lim 2017-09-20 Wed Young W. Lim Day06 A 2017-09-20 Wed 1 / 26 Outline 1 Based on 2 C Program Control Overview for, while, do... while break and continue Relational and Logical Operators
More information> glucose = c(81, 85, 93, 93, 99, 76, 75, 84, 78, 84, 81, 82, 89, + 81, 96, 82, 74, 70, 84, 86, 80, 70, 131, 75, 88, 102, 115, + 89, 82, 79, 106)
This document describes how to use a number of R commands for plotting one variable and for calculating one variable summary statistics Specifically, it describes how to use R to create dotplots, histograms,
More informationBINF702 SPRING CHAPTER 2 Descriptive Statistics. BINF702 - SPRING2014- SOLKA - CHAPTER 2 Descriptive Statistics
BINF702 SPRING 2014 CHAPTER 2 Section 2.1 - Introduction If our set of observations is small we may study them via enumeration. This is usually not possible though. Ex. 2.1 Some investigators have proposed
More informationIntroduction To R Workshop Selected R Commands (See Dalgaard, 2008, Appendix C)
ELEMENTARY Commands List Objects in Workspace Delete object Search Path ls(), or objects() rm() search() Variable Names Combinations of letters, digits and period. Don t start with number or period. Assignments
More informationA Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn
A Handbook of Statistical Analyses Using R Brian S. Everitt and Torsten Hothorn CHAPTER 7 Density Estimation: Erupting Geysers and Star Clusters 7.1 Introduction 7.2 Density Estimation The three kernel
More informationVocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.
5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table
More informationDay05 A. Young W. Lim Sat. Young W. Lim Day05 A Sat 1 / 14
Day05 A Young W. Lim 2017-10-07 Sat Young W. Lim Day05 A 2017-10-07 Sat 1 / 14 Outline 1 Based on 2 Structured Programming (2) Conditions and Loops Conditional Statements Loop Statements Type Cast Young
More informationTeaching univariate measures of location-using loss functions
Original Article Teaching univariate measures of location-using loss functions Jon-Paul Paolino Department of Mathematics and Computer Sciences, Mercy College, Dobbs Ferry, 10522, NY USA e-mail: jpaolino@mercy.edu
More informationCommon Sta 101 Commands for R. 1 One quantitative variable. 2 One categorical variable. 3 Two categorical variables. Summary statistics
Common Sta 101 Commands for R 1 One quantitative variable summary(x) # most summary statitstics at once mean(x) median(x) sd(x) hist(x) boxplot(x) # horizontal = TRUE for horizontal plot qqnorm(x) qqline(x)
More informationPackage sciplot. February 15, 2013
Package sciplot February 15, 2013 Version 1.1-0 Title Scientific Graphing Functions for Factorial Designs Author Manuel Morales , with code developed by the R Development Core Team
More informationNumerical Methods 5633
Numerical Methods 5633 Lecture 1 Marina Krstic Marinkovic marina.marinkovic@cern.ch School of Mathematics Trinity College Dublin Marina Krstic Marinkovic 1 / 15 5633-Numerical Methods R programming https://www.r-project.org/
More informationStatistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.
Statistical Methods Instructor: Lingsong Zhang 1 Issues before Class Statistical Methods Lingsong Zhang Office: Math 544 Email: lingsong@purdue.edu Phone: 765-494-7913 Office Hour: Monday 1:00 pm - 2:00
More informationDensity Curve (p52) Density curve is a curve that - is always on or above the horizontal axis.
1.3 Density curves p50 Some times the overall pattern of a large number of observations is so regular that we can describe it by a smooth curve. It is easier to work with a smooth curve, because the histogram
More informationAA BB CC DD EE. Introduction to Graphics in R
Introduction to Graphics in R Cori Mar 7/10/18 ### Reading in the data dat
More informationTable of Contents (As covered from textbook)
Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression
More informationChapter2 Description of samples and populations. 2.1 Introduction.
Chapter2 Description of samples and populations. 2.1 Introduction. Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that
More informationSTATISTICAL LABORATORY, April 30th, 2010 BIVARIATE PROBABILITY DISTRIBUTIONS
STATISTICAL LABORATORY, April 3th, 21 BIVARIATE PROBABILITY DISTRIBUTIONS Mario Romanazzi 1 MULTINOMIAL DISTRIBUTION Ex1 Three players play 1 independent rounds of a game, and each player has probability
More information2. HW/SW Co-design. Young W. Lim Thr. Young W. Lim 2. HW/SW Co-design Thr 1 / 21
2. HW/SW Co-design Young W. Lim 2016-03-11 Thr Young W. Lim 2. HW/SW Co-design 2016-03-11 Thr 1 / 21 Outline 1 Software Engineering Young W. Lim 2. HW/SW Co-design 2016-03-11 Thr 2 / 21 Based on Software
More informationChapter 5: The standard deviation as a ruler and the normal model p131
Chapter 5: The standard deviation as a ruler and the normal model p131 Which is the better exam score? 67 on an exam with mean 50 and SD 10 62 on an exam with mean 40 and SD 12? Is it fair to say: 67 is
More informationGAS Tutorial - 4. Sections & Relocation
GAS Tutorial - 4. Sections & Relocation Young W. Lim 2016-03-01 Tue Young W. Lim GAS Tutorial - 4. Sections & Relocation 2016-03-01 Tue 1 / 22 Outline 1 Sections and Relocation Young W. Lim GAS Tutorial
More informationCommon R commands used in Data Analysis and Statistical Inference
Common R commands used in Data Analysis and Statistical Inference 1 One numerical variable summary(x) # most summary statitstics at once mean(x) median(x) sd(x) hist(x) boxplot(x) # horizontal = TRUE for
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationProcedure Calls. Young W. Lim Mon. Young W. Lim Procedure Calls Mon 1 / 29
Procedure Calls Young W. Lim 2017-08-21 Mon Young W. Lim Procedure Calls 2017-08-21 Mon 1 / 29 Outline 1 Introduction Based on Stack Background Transferring Control Register Usage Conventions Procedure
More information1.3 Graphical Summaries of Data
Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan 1.3 Graphical Summaries of Data In the previous section we discussed numerical summaries of either a sample or a data. In this
More informationAccess. Young W. Lim Fri. Young W. Lim Access Fri 1 / 18
Access Young W. Lim 2017-01-27 Fri Young W. Lim Access 2017-01-27 Fri 1 / 18 Outline 1 Introduction References IA32 Operand Forms Data Movement Instructions Young W. Lim Access 2017-01-27 Fri 2 / 18 Based
More informationWeek 2 Basic Statistical Concepts, Part II
Week 2 Basic Statistical Concepts, Part II Week 2 Objectives 1 Data presentation through numerical and graphical summaries using R: sample mean, variance and percentiles; the box plot, histogram, stem
More informationFurther Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables
Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables
More informationSTA Module 2B Organizing Data and Comparing Distributions (Part II)
STA 2023 Module 2B Organizing Data and Comparing Distributions (Part II) Learning Objectives Upon completing this module, you should be able to 1 Explain the purpose of a measure of center 2 Obtain and
More informationSTA Learning Objectives. Learning Objectives (cont.) Module 2B Organizing Data and Comparing Distributions (Part II)
STA 2023 Module 2B Organizing Data and Comparing Distributions (Part II) Learning Objectives Upon completing this module, you should be able to 1 Explain the purpose of a measure of center 2 Obtain and
More informationPackage beanplot. R topics documented: February 19, Type Package
Type Package Package beanplot February 19, 2015 Title Visualization via Beanplots (like Boxplot/Stripchart/Violin Plot) Version 1.2 Date 2014-09-15 Author Peter Kampstra Maintainer Peter Kampstra
More informationAccess. Young W. Lim Sat. Young W. Lim Access Sat 1 / 19
Access Young W. Lim 2017-06-10 Sat Young W. Lim Access 2017-06-10 Sat 1 / 19 Outline 1 Introduction References IA32 Operand Forms Data Movement Instructions Data Movement Examples Young W. Lim Access 2017-06-10
More informationLecture 6: Chapter 6 Summary
1 Lecture 6: Chapter 6 Summary Z-score: Is the distance of each data value from the mean in standard deviation Standardizes data values Standardization changes the mean and the standard deviation: o Z
More informationProcedure Calls. Young W. Lim Sat. Young W. Lim Procedure Calls Sat 1 / 27
Procedure Calls Young W. Lim 2016-11-05 Sat Young W. Lim Procedure Calls 2016-11-05 Sat 1 / 27 Outline 1 Introduction References Stack Background Transferring Control Register Usage Conventions Procedure
More informationCHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.
1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed
More informationR Exercise Practical 2. Follow the instructions described in this sheet. Type or paste any answers you are asked for into a Microsoft Word document.
R Exercise Practical 2 Follow the instructions described in this sheet. Type or paste any answers you are asked for into a Microsoft Word document. Note: You should have this sheet on your computer for
More informationThe first one will centre the data and ensure unit variance (i.e. sphere the data):
SECTION 5 Exploratory Projection Pursuit We are now going to look at an exploratory tool called projection pursuit (Jerome Friedman, PROJECTION PURSUIT METHODS FOR DATA ANALYSIS, June. 1980, SLAC PUB-2768)
More informationStack Tutorial. Young W. Lim Sat. Young W. Lim Stack Tutorial Sat 1 / 15
Stack Tutorial Young W. Lim 2016-10-15 Sat Young W. Lim Stack Tutorial 2016-10-15 Sat 1 / 15 Outline 1 Introduction References Stack Background Transferring Control Young W. Lim Stack Tutorial 2016-10-15
More informationDay14 A. Young W. Lim Thr. Young W. Lim Day14 A Thr 1 / 14
Day14 A Young W. Lim 2017-11-02 Thr Young W. Lim Day14 A 2017-11-02 Thr 1 / 14 Outline 1 Based on 2 C Strings (1) Characters and Strings Unformatted IO Young W. Lim Day14 A 2017-11-02 Thr 2 / 14 Based
More informationR Programming: Worksheet 6
R Programming: Worksheet 6 Today we ll study a few useful functions we haven t come across yet: all(), any(), `%in%`, match(), pmax(), pmin(), unique() We ll also apply our knowledge to the bootstrap.
More informationLECTURE NOTES FOR ECO231 COMPUTER APPLICATIONS I. Part Two. Introduction to R Programming. RStudio. November Written by. N.
LECTURE NOTES FOR ECO231 COMPUTER APPLICATIONS I Part Two Introduction to R Programming RStudio November 2016 Written by N.Nilgün Çokça Introduction to R Programming 5 Installing R & RStudio 5 The R Studio
More informationFrequency Distributions
Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,
More informationUnivariate data. 2.1 For example: p <- c(2, 3, 5, 7, 11, 13, 17, 19)
2 Univariate data 2.1 For example: p
More informationArrays. Young W. Lim Mon. Young W. Lim Arrays Mon 1 / 17
Arrays Young W. Lim 2017-02-06 Mon Young W. Lim Arrays 2017-02-06 Mon 1 / 17 Outline 1 Introduction References Array Background Young W. Lim Arrays 2017-02-06 Mon 2 / 17 Based on "Self-service Linux: Mastering
More informationSection 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc
Section 2-2 Frequency Distributions Copyright 2010, 2007, 2004 Pearson Education, Inc. 2.1-1 Frequency Distribution Frequency Distribution (or Frequency Table) It shows how a data set is partitioned among
More informationChapter 6: DESCRIPTIVE STATISTICS
Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling
More informationExploratory Projection Pursuit
Exploratory Projection Pursuit (Jerome Friedman, PROJECTION PURSUIT METHODS FOR DATA ANALYSIS, June. 1980, SLAC PUB-2768) Read in required files drive - D: code.dir - paste(drive, DATA/Data Mining R-Code,
More informationNo. of blue jelly beans No. of bags
Math 167 Ch5 Review 1 (c) Janice Epstein CHAPTER 5 EXPLORING DATA DISTRIBUTIONS A sample of jelly bean bags is chosen and the number of blue jelly beans in each bag is counted. The results are shown in
More informationIntroduction to R. Biostatistics 615/815 Lecture 23
Introduction to R Biostatistics 615/815 Lecture 23 So far We have been working with C Strongly typed language Variable and function types set explicitly Functional language Programs are a collection of
More informationVisualizing Data: Freq. Tables, Histograms
Visualizing Data: Freq. Tables, Histograms Engineering Statistics Section 1.2 Josh Engwer TTU 25 January 2016 Josh Engwer (TTU) Visualizing Data: Freq. Tables, Histograms 25 January 2016 1 / 23 Descriptive
More informationMATH& 146 Lesson 10. Section 1.6 Graphing Numerical Data
MATH& 146 Lesson 10 Section 1.6 Graphing Numerical Data 1 Graphs of Numerical Data One major reason for constructing a graph of numerical data is to display its distribution, or the pattern of variability
More informationChapter 5. Understanding and Comparing Distributions. Copyright 2012, 2008, 2005 Pearson Education, Inc.
Chapter 5 Understanding and Comparing Distributions The Big Picture We can answer much more interesting questions about variables when we compare distributions for different groups. Below is a histogram
More informationIST 3108 Data Analysis and Graphics Using R Week 9
IST 3108 Data Analysis and Graphics Using R Week 9 Engin YILDIZTEPE, Ph.D 2017-Spring Introduction to Graphics >y plot (y) In R, pictures are presented in the active graphical device or window.
More informationDay14 A. Young W. Lim Tue. Young W. Lim Day14 A Tue 1 / 15
Day14 A Young W. Lim 2017-12-26 Tue Young W. Lim Day14 A 2017-12-26 Tue 1 / 15 Outline 1 Based on 2 C Strings (1) Characters and Strings Unformatted IO Young W. Lim Day14 A 2017-12-26 Tue 2 / 15 Based
More informationStat 428 Autumn 2006 Homework 2 Solutions
Section 6.3 (5, 8) 6.3.5 Here is the Minitab output for the service time data set. Descriptive Statistics: Service Times Service Times 0 69.35 1.24 67.88 17.59 28.00 61.00 66.00 Variable Q3 Maximum Service
More informationRegression III: Advanced Methods
Lecture 3: Distributions Regression III: Advanced Methods William G. Jacoby Michigan State University Goals of the lecture Examine data in graphical form Graphs for looking at univariate distributions
More informationUnivariate Statistics Summary
Further Maths Univariate Statistics Summary Types of Data Data can be classified as categorical or numerical. Categorical data are observations or records that are arranged according to category. For example:
More informationPrepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.
Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good
More informationStatistical Programming with R
Statistical Programming with R Lecture 9: Basic graphics in R Part 2 Bisher M. Iqelan biqelan@iugaza.edu.ps Department of Mathematics, Faculty of Science, The Islamic University of Gaza 2017-2018, Semester
More informationUnit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys
Unit 7 Statistics AFM Mrs. Valentine 7.1 Samples and Surveys v Obj.: I will understand the different methods of sampling and studying data. I will be able to determine the type used in an example, and
More informationWeek 4: Describing data and estimation
Week 4: Describing data and estimation Goals Investigate sampling error; see that larger samples have less sampling error. Visualize confidence intervals. Calculate basic summary statistics using R. Calculate
More informationLecture 3 Questions that we should be able to answer by the end of this lecture:
Lecture 3 Questions that we should be able to answer by the end of this lecture: Which is the better exam score? 67 on an exam with mean 50 and SD 10 or 62 on an exam with mean 40 and SD 12 Is it fair
More informationMATH11400 Statistics Homepage
MATH11400 Statistics 1 2010 11 Homepage http://www.stats.bris.ac.uk/%7emapjg/teach/stats1/ 1.1 A Framework for Statistical Problems Many statistical problems can be described by a simple framework in which
More informationLogic Haskell Exercises
Logic Haskell Exercises Young W. Lim 2018-09-15 Sat Young W. Lim Logic Haskell Exercises 2018-09-15 Sat 1 / 36 Outline 1 Based on 2 Logic Using TAMO.hs Young W. Lim Logic Haskell Exercises 2018-09-15 Sat
More informationLecture 3 Questions that we should be able to answer by the end of this lecture:
Lecture 3 Questions that we should be able to answer by the end of this lecture: Which is the better exam score? 67 on an exam with mean 50 and SD 10 or 62 on an exam with mean 40 and SD 12 Is it fair
More informationLab 4: Distributions of random variables
Lab 4: Distributions of random variables In this lab we ll investigate the probability distribution that is most central to statistics: the normal distribution If we are confident that our data are nearly
More informationAccessibility (1A) Young Won Lim 8/22/13
Accessibility (1A) Copyright (c) 2011-2013 Young W. Lim. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any
More informationName Date Types of Graphs and Creating Graphs Notes
Name Date Types of Graphs and Creating Graphs Notes Graphs are helpful visual representations of data. Different graphs display data in different ways. Some graphs show individual data, but many do not.
More informationChapter 5. Understanding and Comparing Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 5 Understanding and Comparing Distributions The Big Picture We can answer much more interesting questions about variables when we compare distributions for different groups. Below is a histogram
More informationUnderstanding and Comparing Distributions. Chapter 4
Understanding and Comparing Distributions Chapter 4 Objectives: Boxplot Calculate Outliers Comparing Distributions Timeplot The Big Picture We can answer much more interesting questions about variables
More information050 0 N 03 BECABCDDDBDBCDBDBCDADDBACACBCCBAACEDEDBACBECCDDCEA
050 0 N 03 BECABCDDDBDBCDBDBCDADDBACACBCCBAACEDEDBACBECCDDCEA 55555555555555555555555555555555555555555555555555 NYYNNYNNNYNYYYYYNNYNNNNNYNYYYYYNYNNNNYNNYNNNYNNNNN 01 CAEADDBEDEDBABBBBCBDDDBAAAECEEDCDCDBACCACEECACCCEA
More informationLearner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display
CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &
More informationSection 3.1 Shapes of Distributions MDM4U Jensen
Section 3.1 Shapes of Distributions MDM4U Jensen Part 1: Histogram Review Example 1: Earthquakes are measured on a scale known as the Richter Scale. There data are a sample of earthquake magnitudes in
More informationUP School of Statistics Student Council Education and Research
w UP School of Statistics Student Council Education and Research erho.weebly.com 0 erhomyhero@gmail.com f /erhoismyhero t @erhomyhero S133_HOA_001 Statistics 133 Bayesian Statistical Inference Use of R
More informationSTP 226 ELEMENTARY STATISTICS NOTES
ELEMENTARY STATISTICS NOTES PART 2 - DESCRIPTIVE STATISTICS CHAPTER 2 ORGANIZING DATA Descriptive Statistics - include methods for organizing and summarizing information clearly and effectively. - classify
More informationGraph tool instructions and R code
Graph tool instructions and R code 1) Prepare data: tab-delimited format Data need to be inputted in a tab-delimited format. This can be easily achieved by preparing the data in a spread sheet program
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,
More informationChapter 2 - Descriptive Statistics
Chapter 2 - Descriptive Statistics Our ultimate aim in this class is to study Inferential statistics but before we can start making decisions about data we need to know how to describe this data in a useful
More informationCh3 E lement Elemen ar tary Descriptive Descriptiv Statistics
Ch3 Elementary Descriptive Statistics Section 3.1: Elementary Graphical Treatment of Data Before doing ANYTHING with data: Understand the question. An approximate answer to the exact question is always
More informationCh. 1.4 Histograms & Stem-&-Leaf Plots
Ch. 1.4 Histograms & Stem-&-Leaf Plots Learning Intentions: Create a histogram & stem-&-leaf plot of a data set. Given a list of data, use a calculator to graph a histogram. Interpret histograms & stem-&-leaf
More informationLesson 18-1 Lesson Lesson 18-1 Lesson Lesson 18-2 Lesson 18-2
Topic 18 Set A Words survey data Topic 18 Set A Words Lesson 18-1 Lesson 18-1 sample line plot Lesson 18-1 Lesson 18-1 frequency table bar graph Lesson 18-2 Lesson 18-2 Instead of making 2-sided copies
More informationSTAT 503 Fall Introduction to SAS
Getting Started Introduction to SAS 1) Download all of the files, sas programs (.sas) and data files (.dat) into one of your directories. I would suggest using your H: drive if you are using a computer
More informationPlotting: An Iterative Process
Plotting: An Iterative Process Plotting is an iterative process. First we find a way to represent the data that focusses on the important aspects of the data. What is considered an important aspect may
More information