STAT10010 Introductory Statistics Lab 2


1. Aims of Lab 2

By the end of this lab you will be able to:

i. Recognize the type of recorded data.
ii. Construct summaries of recorded variables.
iii. Calculate and interpret the margin of error in a survey.
iv. Draw a stratified random sample of data.
v. Calculate some descriptive statistics for a data set.

2. Survey data

In this lab we will work with some survey data collected in a political study in the USA. The researcher wanted to assess whether there was an association between age or gender and candidate preference (Democrat, Republican, or other) in a presidential election. The researcher randomly selected 400 individuals and asked them the following 3 questions:

1) What gender are you?
2) What age are you in years?
3) Is the candidate you will back in the upcoming presidential election a: (a) Democrat, (b) Republican, or (c) other?

Q1: Is each question asked by the researcher an open question or a closed question?

From Blackboard, download the Minitab worksheet file called PoliticalPoll.mtw to your computer and open it in Minitab. (Recall from lab 1 how to open a Minitab worksheet.) Your worksheet should then look like the screen below:

[Screenshot: the PoliticalPoll.mtw worksheet open in Minitab]

The first column contains the gender of each person in the survey, the second column contains their political preference, and the third contains their age. Scroll down to double-check that there are 400 observations (people) in your data set, i.e. there should be 400 rows of data in your worksheet.

Q2: The data recorded for the gender variable are categorical data; are the data ordinal or nominal?

Q3: What type of data is recorded for the preference variable?

Q4: What type of data is recorded for the age variable?

Note that the Gender column and the Preference column are both in text format (again, recall lab 1). To analyse the data it will often be easier to work with the data in numerical format. To change the format of the data, in the menu bar go to Data, then Code, then Text to Numeric. Code the gender variable as 0 for female and 1 for male, and save the new data in a free column of your worksheet (C4, say). Give your new column of data a label. Re-code the preference data column in the same way.
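For readers who want to see the recoding step outside Minitab, here is a minimal Python sketch of the same idea. The text labels "Female" and "Male" and the five toy rows are assumptions made for illustration; the real worksheet has 400 rows and its labels may be spelled differently.

```python
from collections import Counter

# Hypothetical toy column; the real Gender column has 400 entries.
gender = ["Female", "Male", "Female", "Female", "Male"]

# Coding scheme from the lab: 0 for female, 1 for male.
# The label spellings are an assumption about the worksheet.
GENDER_CODES = {"Female": 0, "Male": 1}

def recode(values, codes):
    """Map each text label to its numeric code (a KeyError flags an unexpected label)."""
    return [codes[value] for value in values]

gender_num = recode(gender, GENDER_CODES)

# Counting each code is a quick check that no rows were lost in the recoding.
counts = Counter(gender_num)
print(gender_num)
print(dict(counts))
```

The preference column could be recoded the same way with a three-entry dictionary, one code per category.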

Let's look at some tables which summarise the information in our data set. In the menu bar go to Stat, then Tables, then Tally Individual Variables. Choose your new numerically coded gender data and your new numerically coded preference data.

Q5: How many females were in the sample of 400 people?

Q6: How many people in the sample supported neither the Democrats nor the Republicans?

Q7: What proportion support the Democrats?

3. The margin of error

One way of assessing the uncertainty in our estimate of the proportion which supports the Democrats is through the margin of error. Recall from lectures that the margin of error in a survey with a sample of size n is 1 divided by the square root of n, i.e. $\mathrm{MoE} = 1/\sqrt{n}$.

Let's calculate the margin of error for the political poll data set. In the menu bar, go to Calc, then Calculator. Store your result in the next free column (probably column C6). In the Expression box, enter 1/SQRT(400). You should be able to find the SQRT function in the list of arithmetic functions.

Q8: What is the margin of error (in %) of the study?

Q9: What is the interval in which the true proportion which supports the Democrats lies?

4. Stratified random sampling

Let's now draw a stratified random sample from the political poll data. Recall from lectures what is meant by a stratified random sample. Let's treat the two gender categories as our two strata. Say we wish to draw a random sample of size 10 from each stratum.

Let's first organise our data so that all the female data are grouped together, followed by all the male data. Go to Data, then Sort. You want to sort all the columns of data, and you want to sort them by gender. Check the original columns option.
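As a cross-check on the margin of error calculation in Section 3, the same formula can be sketched in Python. The function names and the sample proportion 0.40 are hypothetical, chosen only for illustration; your own proportion comes from the tally in Q7.

```python
import math

def margin_of_error(n):
    """Margin of error for a sample of size n, using the lecture formula 1/sqrt(n)."""
    return 1 / math.sqrt(n)

def proportion_interval(p_hat, n):
    """Interval p_hat +/- margin of error, clipped to the valid range [0, 1]."""
    moe = margin_of_error(n)
    return max(0.0, p_hat - moe), min(1.0, p_hat + moe)

moe = margin_of_error(400)
print(f"Margin of error: {moe:.1%}")  # prints "Margin of error: 5.0%"

# 0.40 is a made-up proportion, not a result from the poll data.
low, high = proportion_interval(0.40, 400)
print(f"Interval: {low:.2f} to {high:.2f}")
```

Because the formula depends only on n, quadrupling the sample size halves the margin of error.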

Your worksheet should now be organised such that all the female data are in the first rows, followed by all the male data. The female observations are numbered 1 up to 204; let's choose a random sample of size 10 from this set of observations.

Select Calc, then Random Data, and then Integer. Ask Minitab to generate 10 rows and to store the resulting sample in your next free column. Enter 1 as the minimum value and 204 as the maximum. The numbers generated are a set of random row numbers. Each observation in our original (female) data set whose row number appears in this list will be an observation in our stratified random sample.

To construct a new data set consisting of the randomly sampled female observations, go to Data, then Subset Worksheet. Check the row numbers box, enter the list of randomly generated numbers and press OK. In the next window, tell Minitab to include all the columns containing data. A new worksheet of data should pop up; save this worksheet in your STAT10010 folder on your H drive.

Repeat this to draw a sample of 10 observations from the male stratum and save your worksheet.

Q10: Based on your new stratified random samples, which stratum has the higher proportion of support for Republicans?
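The sampling procedure above can be sketched in Python as follows. This is an illustration under stated assumptions, not the Minitab method itself: it uses `random.sample`, which draws without replacement, whereas independently generated random integers may repeat. The rows hold only row numbers and a gender code (0 for rows 1 to 204, 1 for the rest, matching the sorted worksheet); the function name and the seed are hypothetical.

```python
import random

# Row numbers only, following the sorted worksheet:
# rows 1-204 are female (code 0), the remaining rows up to 400 are male (code 1).
rows = [{"row": i, "gender": 0 if i <= 204 else 1} for i in range(1, 401)]

def stratified_sample(rows, stratum_key, size, seed=None):
    """Draw `size` rows without replacement from each stratum."""
    rng = random.Random(seed)
    strata = {}
    for row in rows:
        # Group the rows by the value of the stratum variable.
        strata.setdefault(row[stratum_key], []).append(row)
    return {label: rng.sample(members, size) for label, members in strata.items()}

sample = stratified_sample(rows, "gender", size=10, seed=1)
for label, members in sample.items():
    print(label, sorted(member["row"] for member in members))
```

Fixing the seed makes the draw reproducible, which helps when comparing answers; leave it as `None` for a fresh sample each run.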

5. Some basic descriptive statistics

To calculate some descriptive statistics for the age variable in each of your stratified samples, use the Stat, Basic Statistics, Display Descriptive Statistics option. Click the Statistics box, and ensure that only the mean, minimum and maximum boxes are checked.

Q11: Which stratum has the largest average age?

Q12: Which stratum has the maximum age?

You have now worked with some categorical and numerical data, drawn a stratified random sample, and drawn some conclusions based on sampled data. Some more steps into the world of a statistician!
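The three descriptive statistics requested in Section 5 can also be sketched in Python. The ages below are made-up values for illustration only and are not taken from the poll data; the `describe` helper is a hypothetical name.

```python
from statistics import mean

def describe(values):
    """Return the mean, minimum and maximum of a list of numbers."""
    return {"mean": mean(values), "min": min(values), "max": max(values)}

# Hypothetical ages for two small strata; replace with your sampled ages.
ages_by_stratum = {
    "female": [23, 35, 41, 58],
    "male": [19, 44, 52, 67],
}

summary = {label: describe(ages) for label, ages in ages_by_stratum.items()}
for label, stats in summary.items():
    print(label, stats)

# The stratum with the largest average age (compare Q11).
oldest_on_average = max(summary, key=lambda label: summary[label]["mean"])
print(oldest_on_average)
```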