Quick introduction to descriptive statistics and graphs in R Commander. Written by: Robin Beaumont
Robin Beaumont (robin@organplayers.co.uk)
Date last updated: Wednesday, 24 April 2013
Version: 2

Contents
Boxplots
Percentages for each category/factor level
Summaries for an interval/ratio variable divided across categories (factor levels)
Histograms
Density plots
Density plots for subgroups defined by factor levels
Graphical summaries of data - aggregation
Aggregating data
Boxplots

From within R you need to load R Commander by typing the following command (R is case sensitive, so the capital R matters):

    library(Rcmdr)

First of all you need some data. For this example I'll use the sample dataset, loading it directly from my website. You can do this by selecting the R Commander menu option:

    Data -> Import data -> from text file, clipboard, or URL

In the dialog box I have given the resulting dataframe the name mydataframe, and indicated that it comes from a URL (i.e. the web) and that the columns are separated by tab characters. Clicking the OK button brings up the internet URL box, into which you type the address of the sample data.

This dataset has 7 variables, of which we are only interested in two here: time (the outcome variable) and dosage, a grouping variable indicating which group each result (time) belongs to.

[Figure: boxplots of time for the High and Low dosage groups]
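The same kind of grouped boxplot can also be sketched directly at the R prompt with base R. This is only an illustration: the data below are simulated stand-ins for the sample dataset (the values are invented), and only the column names time and dosage are taken from the text.

```r
# Hypothetical stand-in for the sample data: 'time' is the outcome,
# 'dosage' is the grouping factor with levels High and Low.
set.seed(1)
mydataframe <- data.frame(
  time   = c(rnorm(20, mean = 30, sd = 5), rnorm(20, mean = 40, sd = 5)),
  dosage = factor(rep(c("High", "Low"), each = 20))
)

# One box per level of dosage; the formula reads "time split by dosage".
bp <- boxplot(time ~ dosage, data = mydataframe,
              xlab = "dosage", ylab = "time")

# The returned object holds the group names, group sizes and the
# five-number summary (one column per group) used to draw the boxes.
bp$names
bp$n
bp$stats
```

Storing the result of boxplot() is optional; it is done here only so that the numbers behind the picture can be inspected.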
Percentages for each category/factor level

Using the dataset from the boxplots example, we can take a single variable and obtain the count and percentage for each category in R Commander. Suppose we want to know the number and percentage of cases in each group, that is, within each category (level) of the dosage variable. The dosage variable is a grouping variable (nominal data), and each of its values is said to represent a factor level.

Summaries for an interval/ratio variable divided across categories (factor levels)

We can obtain simple descriptive statistics using the menu option shown opposite. We can also find these for subgroups by using the Summarize by groups option.
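For readers who prefer typing commands, the counts, percentages and by-group summaries described above can be approximated in base R as follows. The small dataframe is invented for illustration; only the names mydataframe, time and dosage come from the text, and the commands are not necessarily the ones R Commander itself generates.

```r
# Hypothetical miniature version of the sample data.
mydataframe <- data.frame(
  time   = c(12, 15, 11, 14, 20, 22, 19, 25, 21, 23),
  dosage = factor(c("High", "High", "High", "High",
                    "Low", "Low", "Low", "Low", "Low", "Low"))
)

# Number of cases in each factor level, and the same as percentages.
counts   <- table(mydataframe$dosage)
percents <- round(100 * prop.table(counts), 1)
counts
percents

# Mean and standard deviation of time within each level of dosage.
group_means <- tapply(mydataframe$time, mydataframe$dosage, mean)
group_sds   <- tapply(mydataframe$time, mydataframe$dosage, sd)
group_means
group_sds
```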
Histograms

Say we wanted to see the distribution of ages in our dataset. You have three options; usually you would show only one in a report.

Frequency counts: the y axis shows frequency.
Percentages: the y axis shows percent.
Densities: the y axis shows density.

Note the dataframe$column name format, i.e. mydataframe$age, in the description of the x axis.

[Figures: frequency, percentage and density histograms of mydataframe$age]
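The three y axis scales can be reproduced in base R as a rough sketch. The ages below are simulated (an assumption, not the real data), and the percentage version is built by rescaling the counts from hist() rather than by any dedicated R Commander function.

```r
# Simulated ages standing in for mydataframe$age.
set.seed(42)
mydataframe <- data.frame(age = round(rnorm(200, mean = 45, sd = 12)))

# Frequency histogram: bar heights are counts. The result is kept so
# the bin counts can be reused below.
h <- hist(mydataframe$age, main = "Frequency", xlab = "mydataframe$age")

# Density histogram: bar areas sum to 1.
hist(mydataframe$age, freq = FALSE, main = "Density",
     xlab = "mydataframe$age")

# Percentage version: rescale the counts so the bars sum to 100.
pct <- 100 * h$counts / sum(h$counts)
barplot(pct, names.arg = h$mids, space = 0, main = "Percent",
        xlab = "mydataframe$age", ylab = "percent")
```

All three plots have the same shape; only the labelling of the y axis changes, which is why one of them is normally enough in a report.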
Density plots

A density plot is a smoothed version of a histogram, and it is very useful. Unfortunately there is no R Commander menu option to produce one, so you need to type the command:

    plot(density(dataframename$columnname))

So for our dataframe, which we have called mydataframe, and the column called age within it, we type:

    plot(density(mydataframe$age))

[Figure: density plot of mydataframe$age; N = 200, Bandwidth = 3.239]
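The N and Bandwidth values printed under a density plot come from the object that density() returns. The sketch below uses simulated ages (an assumption), so the bandwidth it reports will differ from the 3.239 quoted for the real data.

```r
# Simulated ages standing in for mydataframe$age.
set.seed(42)
mydataframe <- data.frame(age = rnorm(200, mean = 45, sd = 12))

# Keep the density estimate so its components can be inspected.
d <- density(mydataframe$age)

d$n    # number of observations used (the "N" under the plot)
d$bw   # smoothing bandwidth chosen by the default rule

# Plotting the stored object gives the same picture as
# plot(density(mydataframe$age)).
plot(d, main = "Density of age", xlab = "age (years)")
```

A larger bandwidth gives a smoother curve; it can be set by hand with the bw argument of density() if the default looks too bumpy or too flat.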
Density plots for subgroups defined by factor levels

There are many ways to do this, and the easiest is to use the lattice package introduced later in the course. For now, considering just the gender variable, which has only 2 levels, we can do the following.

First copy only the male cases into a dataframe called maledata:

    maledata <- mydataframe[mydataframe$gender == "Male",]

The expression before the comma selects only the rows where gender is male; note the double == meaning "is equal to". Leaving the position after the comma empty selects all the columns in the dataframe, so the comma is important.

Now copy only the female cases into a dataframe called femaledata in the same way:

    femaledata <- mydataframe[mydataframe$gender == "Female",]

Now create our density plot:

    plot(density(maledata$age), ylim = c(0, 0.07), main = "densityplots for males/females[dotted] for age", xlab = "age (years)")

This plots the density of the male ages, sets the y axis limits to 0 to 0.07, sets the main title of the graph, and sets the x axis label to read "age (years)".

Now we need to superimpose the female density line:

    lines(density(femaledata$age), lty = 2)

This sets the line type to 2, a broken (dashed) line, to differentiate it from the default solid line type.
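One possible refinement of the steps above is to work out the axis limits from the two density estimates instead of fixing the y limit at 0.07 by hand. The following is a minimal sketch on simulated data (the ages and group sizes are invented); the names mydataframe, maledata, femaledata, age and gender follow the text.

```r
# Simulated data standing in for the real dataframe.
set.seed(7)
mydataframe <- data.frame(
  age    = c(rnorm(100, mean = 40, sd = 8), rnorm(100, mean = 50, sd = 10)),
  gender = factor(rep(c("Male", "Female"), each = 100))
)

# Rows for each gender, all columns (note the trailing comma).
maledata   <- mydataframe[mydataframe$gender == "Male", ]
femaledata <- mydataframe[mydataframe$gender == "Female", ]

d_male   <- density(maledata$age)
d_female <- density(femaledata$age)

# Limits wide and tall enough to show both curves completely.
x_rng <- range(d_male$x, d_female$x)
y_max <- max(d_male$y, d_female$y)

plot(d_male, xlim = x_rng, ylim = c(0, y_max),
     main = "Density of age for males and females",
     xlab = "age (years)")
lines(d_female, lty = 2)
legend("topright", legend = c("Male", "Female"), lty = c(1, 2))
```

The legend replaces the need to describe the line types in the title.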
Graphical summaries of data - aggregation

Problem: we want to show hourly wage against years working at a health institution, and have the data in the following format.

First obtain either the healthwagedata.sav or the healthwagedata.rda file and store it on your local machine. The top left screenshot shows how to load the rda file.

We see there are many entries for each yrsscale value (time worked with the institution), while hourwage holds the average hourly wage (top right screenshot).

Before we do anything, let's check what the summary values are for each level of employment time, using the menu option Statistics -> Summaries -> Numerical summaries and setting up the dialog box as shown opposite.

Clearly the mean and median hourly rate go up with years of employment, from 18 to 21.63.
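Loading an rda file can also be done from the R prompt with load(). Because the real healthwagedata.rda file is not reproduced here, the sketch below first writes a small invented dataframe to a temporary rda file and then loads it back; the values and the level labels are assumptions made purely for illustration.

```r
# Hypothetical stand-in for the contents of healthwagedata.rda.
mydataset <- data.frame(
  hourwage = c(18.0, 19.5, 21.0),
  yrsscale = factor(c("short", "medium", "long"),
                    levels = c("short", "medium", "long"))
)

# Write the object to a temporary .rda file, then remove it from memory.
rda_file <- tempfile(fileext = ".rda")
save(mydataset, file = rda_file)
rm(mydataset)

# load() restores the object under its original name and returns
# that name as a character vector.
loaded_names <- load(rda_file)
loaded_names

# Factor information, including the level order, survives the round trip.
levels(mydataset$yrsscale)
```

An SPSS .sav file needs a separate reader, for example read.spss() from the foreign package.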
Because of the multiple hourly wage values for each level of employment time, a scatterplot of the raw data is not appropriate, but we have two options: produce a series of boxplots or means for each group, or aggregate the data, for example by finding the mean hourly wage at each employment time and then plotting these values.

We can easily produce a boxplot of the above findings. By selecting the identify outliers option "automatically" we have the case numbers marked.

[Figure: boxplots of hourwage by yrsscale, with outlier case numbers marked]

By deselecting the identify outliers option we now have a clearer, but possibly less useful, graph.

[Figure: boxplots of hourwage by yrsscale, without case numbers]

What do the many outliers suggest? They might be miscoded values, or a particular distinct subset of employees such as consultants. A definitive answer needs detailed knowledge of the environment in which the data were collected.
Ignoring the outliers, and assuming that the data are normally distributed at each level of years of employment, we can produce a graph of the means at each level along with an indication of range:

    Graphs -> Plot of means

Selecting the standard errors option, we can see the estimated accuracy of the mean for each group.

[Figure: Plot of Means; mean of mydataset$hourwage against mydataset$yrsscale]

I feel that presenting the data like this possibly does it a disservice, as it now appears very clean, giving no indication of those very low and high paid workers!

Notice that the x categories are in the correct order. This is not always the case: the rda and sav files contained additional information specifying the factor level order. If we had used a plain text file (i.e. .dat or .txt) we would have needed to reorder the factor levels by using the R Commander menu option:

    Data -> Manage variables in active data set -> Reorder factor levels
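Both points above, the factor level order and the plot of means with standard error bars, can be sketched in base R. The data and the level labels short, medium and long are invented for illustration; only the names mydataset, hourwage and yrsscale come from the text, and this is not necessarily how R Commander draws its own plot.

```r
# Hypothetical data read as plain text, so yrsscale starts as character.
mydataset <- data.frame(
  hourwage = c(17, 18, 19, 19, 20, 21, 21, 22, 23),
  yrsscale = c("short", "short", "short", "medium", "medium", "medium",
               "long", "long", "long"),
  stringsAsFactors = FALSE
)

# Without 'levels', factor() would sort the labels alphabetically
# (long, medium, short). Stating the order explicitly fixes that.
mydataset$yrsscale <- factor(mydataset$yrsscale,
                             levels = c("short", "medium", "long"))

# Mean and standard error of the mean for each level.
means <- tapply(mydataset$hourwage, mydataset$yrsscale, mean)
ses   <- tapply(mydataset$hourwage, mydataset$yrsscale,
                function(x) sd(x) / sqrt(length(x)))

# Plot the means, then add error bars of plus and minus one standard error.
xpos <- seq_along(means)
plot(xpos, means, type = "b", xaxt = "n",
     ylim = range(means - ses, means + ses),
     xlab = "yrsscale", ylab = "mean of hourwage",
     main = "Plot of Means")
axis(1, at = xpos, labels = names(means))
arrows(xpos, means - ses, xpos, means + ses,
       angle = 90, code = 3, length = 0.05)
```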
The alternative strategy is to produce a new dataframe which consists only of the summary values. To do this we first need to remove all those rows which have empty values for either the hourwage or yrsscale variables:

    Data -> Active data set -> Remove cases with missing data

See opposite. I have called the new dataframe cleandataframe. Notice that the new dataframe is automatically loaded. The new dataframe has 89 fewer records.
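Removing the incomplete rows can be sketched in base R with complete.cases(). The dataframe below is a made-up miniature, so it loses 3 rows rather than the 89 reported for the real data; the names mydataset, cleandataframe, hourwage and yrsscale follow the text.

```r
# Hypothetical data with some missing values (NA) in both variables.
mydataset <- data.frame(
  hourwage = c(18, NA, 20, 21, NA, 22),
  yrsscale = factor(c("a", "b", NA, "b", "c", "c"))
)

# TRUE for rows where neither hourwage nor yrsscale is missing.
keep <- complete.cases(mydataset[, c("hourwage", "yrsscale")])

# Keep only the complete rows, all columns.
cleandataframe <- mydataset[keep, ]

# How many records were dropped.
nrow(mydataset) - nrow(cleandataframe)
```

Restricting complete.cases() to the two columns of interest avoids discarding rows that are only missing values in variables we do not use.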
Aggregating data

Aggregating data and creating new datasets from the aggregated values is a common task with large datasets, and this scenario provides a good example. Having removed all the cases with missing data, we can now create a new dataframe containing just the aggregated data (i.e. the means) by selecting the appropriate menu option and setting up the dialog box as shown opposite.

Notice that the new dataframe is automatically loaded. The new dataframe has 6 records.

Clicking on the Edit data set button, we can edit the new dataframe. When you have finished, make sure you close it by clicking on the X button at the top right-hand side of the window.
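The aggregation step corresponds roughly to base R's aggregate() function. The sketch below assumes a small invented cleandataframe with three hypothetical levels, so the result has 3 rows rather than the 6 of the real data; the name aggdata is also just illustrative.

```r
# Hypothetical cleaned data: several wages per level of yrsscale.
cleandataframe <- data.frame(
  hourwage = c(17, 19, 20, 22, 21, 25),
  yrsscale = factor(c("short", "short", "medium", "medium", "long", "long"),
                    levels = c("short", "medium", "long"))
)

# One row per level of yrsscale, holding the mean hourwage for that level.
aggdata <- aggregate(hourwage ~ yrsscale, data = cleandataframe, FUN = mean)
aggdata
```

The formula hourwage ~ yrsscale reads "hourwage summarised within each level of yrsscale"; swapping FUN = mean for FUN = median would give the medians instead.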
The next stage is to produce a scatterplot of the means against years. However, we can only do this when we have at least two interval/ratio variables in the dataframe; otherwise the R Commander scatterplot menu option is grayed out, which it would be if you tried with the current dataframe.

This is easily fixed by changing the yrsscale variable from a factor to a numeric variable. Once again click on the Edit data set button, this time selecting the top of the yrsscale column, and change the variable to numeric. When you have finished, make sure you close both the variable editor and the data editor windows with the X button.

Now we can produce the scatterplot. Set up the dialog box as shown opposite.

[Figure: scatterplot of mean hourwage against yrsscale]

I feel this is far less informative than the boxplots we created earlier.
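The factor-to-numeric conversion and the scatterplot can also be sketched at the R prompt. The aggregated values and level labels below are invented, and the new column name yrsnum is hypothetical; it is kept separate from yrsscale so that the original labels are not lost.

```r
# Hypothetical aggregated data: one mean wage per level of yrsscale.
aggdata <- data.frame(
  yrsscale = factor(c("short", "medium", "long"),
                    levels = c("short", "medium", "long")),
  hourwage = c(18, 21, 23)
)

# as.numeric() on a factor returns the level codes 1, 2, 3, ...
# in level order, not the text of the labels.
aggdata$yrsnum <- as.numeric(aggdata$yrsscale)

# Scatterplot of the means, with the original labels on the x axis.
plot(hourwage ~ yrsnum, data = aggdata, xaxt = "n",
     xlab = "yrsscale", ylab = "mean hourwage")
axis(1, at = aggdata$yrsnum, labels = levels(aggdata$yrsscale))
```

The codes are only evenly spaced positions, so the plot treats the gaps between categories as equal. If the labels themselves were numbers stored as text, as.numeric(as.character(x)) would be needed to recover the values rather than the codes.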
More informationExcel Tips and FAQs - MS 2010
BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my
More informationStata version 12. Lab Session 1 February Preliminary: How to Screen Capture.. 2. Preliminary: How to Keep a Log of Your Stata Session..
Stata version 12 Lab Session 1 February 2013 1. Preliminary: How to Screen Capture.. 2. Preliminary: How to Keep a Log of Your Stata Session.. 3. Preliminary: How to Save a Stata Graph... 4. Enter Data:
More informationECLT 5810 Data Preprocessing. Prof. Wai Lam
ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate
More informationExcel 2010 with XLSTAT
Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with
More informationStatistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.
Statistical Methods Instructor: Lingsong Zhang 1 Issues before Class Statistical Methods Lingsong Zhang Office: Math 544 Email: lingsong@purdue.edu Phone: 765-494-7913 Office Hour: Monday 1:00 pm - 2:00
More informationExploring cdna Data. Achim Tresch, Andreas Buness, Wolfgang Huber, Tim Beißbarth
Exploring cdna Data Achim Tresch, Andreas Buness, Wolfgang Huber, Tim Beißbarth Practical DNA Microarray Analysis http://compdiag.molgen.mpg.de/ngfn/pma0nov.shtml The following exercise will guide you
More informationExcel 2007 for Teaching Statistics
ICT & MATHS Modules 4 & 5 Excel 2007 for Teaching Statistics 1 Published by The National Centre for Technology in Education in association with the Project Maths Development Team. Permission granted to
More informationICSSR Data Service. Stata: User Guide. Indian Council of Social Science Research. Indian Social Science Data Repository
http://www.icssrdataservice.in/ ICSSR Data Service Indian Social Science Data Repository Stata: User Guide Indian Council of Social Science Research ICSSR Data Service Contents: 1. Introduction 1 2. Opening
More informationSpreadsheet Applications Test
Spreadsheet Applications Test 1. The expression returns the maximum value in the range A1:A100 and then divides the value by 100. a. =MAX(A1:A100/100) b. =MAXIMUM(A1:A100)/100 c. =MAX(A1:A100)/100 d. =MAX(100)/(A1:A100)
More information