MKTG 460 Winter 2019 Solutions #1

Size: px
Start display at page:

Download "MKTG 460 Winter 2019 Solutions #1"

Transcription

1 MKTG 460 Winter 2019 Solutions #1 Short Answer: Data Analysis 1. What is a data table and how are the data values organized? A data table stores the data values for variables across different observations, such as people, companies, etc. The values in a column are for a specific variable and the values in a row are for a specific observation. The first row usually contains the variable names. 2. What is a csv file? What are its properties, its primary advantage and its primary disadvantage? A csv file is a standard text file, that is, just text characters, the letters of the alphabet, digits, punctuation and a few control codes such as line return. It is a specific type of text file in which adjacent values on the same line are separated by comas. 3. What is the distinction between categorical and continuous variables? Provide an example variable of each along with some sample values. Nominal data are unordered categories, such as the numerals of football player jerseys. Ordinal data are ordered categories. Still not numeric, but Severe Injury indicates a greater amount of injury than Minor Injury. 4. Why do bar charts but not histograms have gaps between the bars? Bar charts for categorical variables. The gaps between the bars on a bar chart reflect the non-numeric lack of continuity. Histograms are for continuous variables. The bars on a histogram are joined to indicate the underlying continuity. 5. Identify and describe two properties of histogram bins that lead to arbitrariness in the histogram display. Bin width: vary the width of the bins and the shape of the histogram changes Bin shift: vary the starting point of the first bin and the shape of the histogram changes 6. What is the histogram artifact of undersmoothing and how does it relate to sample size? Describe a histogram that is undersmoothed? How can the problem be fixed? Undersmoothing occurs when the bin width is too narrow for the given sample size. The result is a histogram that has lots of jaggedness with too many ups and downs that reflect sampling error more than the actual shape of the distribution. Fix the problem by increasing 1

2 2 bin width. 7. What is the histogram artifact of oversmoothing and how does it relate to sample size? Describe a histogram that is oversmoothed? How can the problem be fixed? Oversmoothing occurs when the bin width is too wide for the given sample size. The result is a histogram that does not display as much detail as is available. Too much information is discarded. Fix the problem by decreasing bin width. 8. Compare the mean to the median Both values represent the middle of a distribution, but in two different ways. The mean is the balance point of deviations scores. A deviation score is mean subtracted from a data value. The mean is best for more or less symmetric distributions. The median is the middle of the sorted data values. As such it is less affected by outliers or a skewed distribution. If the last value of a distribution of income is $10000 or $ , the mean is greatly increased while the median is the unchanged. 9. What is an outlier and how does it affect data analysis? An outlier is a data value that is far from most of the other data values. This extreme data value may represent a data value from a different distribution, in which case it should not be included in the analysis with the remaining data values. Short Answer: General Marketing Research 1. How does marketing research relate to management decisions? Managers need to understand their environmentcustomers, market, suppliersand marketing research is used to turn gathered data into useful information regarding their environment 2. What is a market segmentation study? A Market Segmentation Study divides customers into groups according to relevant characteristics such as gender, location, and education level. Different groups of potential customers have different wants, needs and income, so marketers should understand the characteristic of the potential customers. 3. What is a market share and market size study? Market share and market size studies determine the products purchased, the volume purchased, and the actual size of sales being realized by each competitor. In other words, who

3 3 else is selling to our target market and how successful are they? 4. Longer question: List and briefly describe the nine steps in a market research study. a. Define the Management Problem Define the problem clearly for all team members to agree as to the nature of the problem b. State Research Purpose Gathering data can be expensive so clarify role of the research c. State the Research Objectives Specify the questions the research should answer d. Develop Research Design Plan how to obtain the answers your team is searching for, how the hypothesis is to be tested e. Select Data-Collection Methodology The methods, i.e., the protocol, used to gather the data f. Determine Measurement and Data-Analysis Methods Methods for analyzing the data g. Design Data-Collection Forms The forms that are to be filled in. The responses are the data values that measure the variables h. Define Sampling Methods Define the population or universe of the study to specify who can be selected for participation in the study according to the sample procedure i. Collect, Analyze and Interpret the Data, and Present the Results Interpret the results such that the analysis helps guide the team or client to make an informed decision

4 4 Worked Problems 1. Data Entry. Consider the data in Figure 1, randomly selected from a data file of the body measurements of thousands of motorcyclists. Figure 1: Gender, Height and Weight of eight motorcyclists. a. List each of the each variable names in Figure 1 and classify each as continuous or categorical. Gender: Categorical, M and F Height: Continuous Weight: Continuous b. Manually enter each data value from Figure 1 into a worksheet (such as Excel). Copy and paste the data table from your R output into your homework document. See Figure 1. c. Every data table, whether in R or Excel, has a name. What is the name of the worksheet (located on a tab toward the bottom left corner)? What is that name? Note the distinction between the name of the data table and the names of the variable. This is the same distinction in Excel or in R. By default, Excel names the first worksheet of an Excel file Sheet1. Though not necessary, for clarity, this default name could be changed to indicate actual name of the data table, such as HtWt or d. d. Read these data directly from the Excel file you created on your computer system into an R data table named d. You can enter the full path name as an argument to the Read function, but easier to browse for the data file, indicated by an empty character string "". Copy to your homework document the listing from the lessr Read function you used to read the data that displays the variable names, variable types, and first and last data values. The data table will also have a name when stored within R. Use the data table name d.

5 5 > d <- Read("") e. Verify that the data values for the variables in the R data table appear in the intended format. From the output of Read what is the type of each of these three variables that R assigned? What type does R assign to a non-numeric, categorical variable? Variable Missing Unique Name Type Values Values Values First and last values Gender character F F M... M F M 2 Height integer Weight integer NA The Read function lists the name of each variable read and the type of variable, here Gender is read as type character and Height and Weight are read as type integer. f. Display the data from within R with the print(d) function call and then copy to your homework document. Note that simply entering the name of the R object, here called d, is an abbreviation to invoke the print function. > d Gender Weight Height 1 F F M M F M F M Or, particularly useful for larger data tables, just to see the variable names and first 6 lines of data, use head(d). > head(d) Gender Weight Height 1 F F M M F M

6 6 g. From the previous answers, compare the data stored in Excel and then compare to the representation of the data stored in R. In particular, what does NA refer to in the R data table? The representations are the same, which is to be expected because the same data table is represented within Excel and within R. Gender is stored as an R character variable, which means as a categorical variable. The primary distinction is that a blank cell in Excel indicates missing data. R, instead, uses the code NA for numerical data and <NA> non-numeric data, each an abbreviation for Not Assigned. 2. Bar Chart (from data). A motorcycle clothing company makes jackets are of three types: Lite, Medium and Thick. The company needs guidance as to how many different jackets of each type to bring to a gathering of motorcyclists of a specific brand, here BMW or Honda motorcycles. The data are recorded from past sales of motorcycle jackets to owners of BMW or Honda motorcyclists. For now, we just analyze the type of jackets sold, and ignore the motorcycle type. Consider the data at a. Verify that there is an actual data file called Jackets.csv at the specified web address (URL) by pointing your browser at that URL. Take a screenshot of the first several lines of the data file and paste into your homework document. b. What is the format of that document? The data table is stored in a file in csv format, comma separated values, a simple text file with adjacent data values in a row of data separated by commas. c. How many variables are in the data file? What are their names? What is the relevant variable for this analysis? There are two variables in this data file, Bike and Jacket. Here we are only concerned with

7 7 Jacket, which indicates the style of Jacket sold. Describe the distribution of Jacket Types with: First read the data in the file BikeJacket.csv directly from the web into the R data table (frame) called d. Then do the requested text and graphic analyses with the lessr BarChart function. > d <- Read(" The output of the Read function summarizes the variables and their data values. Variable Missing Unique Name Type Values Values Values First and last values Bike character BMW Honda Honda... Honda BMW Jacket character Lite Lite Lite... Med Lite d. frequency table > BarChart(Jacket) --- Jacket --- Lite Med Thick Total Frequencies: Proportions: e. bar chart 300 Count of Jacket % 33% 30% Lite Med Thick Jacket f. pie chart

8 8 > PieChart(Jacket) Jacket 36% Lite Med 33% 30% Thick This version of a pie chart is called a doughnut chart or a ring chart. If you want to see the entire pie, set the size of the hole by entering any value from 0 to 1 for the hole parameter. The default value is pc(jacket, hole=0) Jacket 36% Lite Med 33% 30% Thick

9 9 Interpretation: Almost the same amount of Light, Medium and Thick jackets are sold, though there is a slight tendency in the data for the lighter the jacket the more sales. (A future homework will extend this analysis by relating to the second variable in the data set, Motorcycle Type.) 3. Histogram The data for this exercise are included with lessr, obtained when the package was downloaded. To access these built-in data sets, add the format="lessr" option to the Read statement. Here the data set is the Employee data. > d <- Read("Employee", in.lessr=true) a. Obtain the default histogram for Salary. As an option just to illustrate different styles, here change the default color theme to orange and add a black background such as might be appropriate for a slide presentation. > style("orange", sub.theme="black") > Histogram(Salary) 8 6 Count of Salary Salary b. What is the default bin width? Starting point of the first bin? The obtained frequency distribution shows a bin width of $10,000 with the first bin starting at $30,000. Bin Width: Number of Bins: 10 Bin Midpnt Count Prop Cumul.c Cumul.p >

10 > > > > > > > > > c. Interpret the histogram. Most of the salaries occur in the lower range of Salaries, with $40,000 to $60,000 being the most frequently occurring salaries. The distribution is skewed with a tail of less frequently occurring salaries in the upper end. The largest salary is $124, and the smallest is $63, d. Using the bin.width option deliberately over-smooth the histogram (bins too wide). What is wrong with this histogram? As an option only, try the "darkred" style. > style("darkred") > Histogram(Salary, bin.width=40000) Count of Salary Salary The problem is that the oversmoothed histogram wastes information in the sense that the given sample size allows for a smaller bin width and thus more information to be displayed in the histogram. This histogram does not match the data in that it starts high and then just lowers over three bins. In actuality, the distribution begins lower in frequency, achieves a peak around $40,000 to $60,000 and then gradually descends. e. Using the bin.width option deliberately under-smooth the histogram (bins too narrow). What is wrong with this histogram?

11 11 > Histogram(Salary, bin.width=2000) 4 3 Count of Salary Salary The problem with the undersmoothed histogram is that the bin width is too small for the available sample size. The result are sampling fluctuations illustrated by the zigging and zagging of the histogram that only represent random fluctuations, which would not replicate in future samples. 4. Analysis of Survey Data. The downloaded csv file of your responses to the class survey are on the web. The survey itself is also on the web. The survey was constructed and administered with Qualtrics, the survey administration system you will use for your project. If an item has a specific set of possible responses, Qualtrics labels a response to the first alternative as a 1, a 2 for a response to the second alternative, and so forth. a. Point your browser to the data file. Copy the first five or so lines into your homework document. Describe what you see. Learn_1,Learn_2,Learn_3,Learn_4,Feel_1,Feel_2,Feel_3,Feel_4,Past_1,Past_2,Past_3,Past_4,Past2_ 3,5,7,6,6,7,5,7,3,4,3,4,6,2,2,59,78,95,53 4,4,3,6,2,6,6,6,4,4,4,4,6,1,2,30,50,60,50

12 12 3,3,7,3,5,3,4,1,4,2,3,2,7,2,2,,,100, 4,4,5,5,2,6,4,5,2,2,1,2,6,1,2,50,39,70,60 6,6,5,6,3,6,4,3,2,3,4,3,13,2,2,60,100,50,91 b. Read the data into an R data table called d and then display the data from within R. [Just enter the name of the object, such as a data table, to display the contents of that object.] Show both the R instruction to do this, and then the data. Allow enough width to get all the columns of data across the console window by first enlarging the size of the R console window. > d <- Read(" > d Learn_1 Learn_2 Learn_3 Learn_4 Feel_1 Feel_2 Feel_3 Feel_4 Past_1 Past_2 Past_3 Past_4 Past2_1 Gender Class Learn2_1 Learn2_2 Learn2_3 Learn2_ NA NA 100 NA NA NA NA NA c. By direct examination of the data, what specific values are missing? Missing values are wherever an NA is reported. There are several. For example Person #3 has no data value for the last four variables in the data table, beginning with Learn2 1. d. To help locate the missing data, which R represents as NA, turn to the analysis provided by the initial details function. The Observation specifies the name of each row, here just the row number. To protect anonymity, student names were not collected as part of the data table. Missing Data Analysis n.miss Observation

13 Overall, there is not too much missing data in the analysis. The 3rd person did not respond to three of the last 4 items. People # 19, 20, 27 and 29 did not respond to one of the items. From the Read output, the variable with the most missing data is Past2 1, and it is only missing 4 responses. The analysis can proceed without much concern for missing data. e. Focus on the variable Learn2 1, measured on a response scale from 0 to 100. This variable is the first item listed on under the section Learn2 on the survey, which consists of the last four items on the survey. What is the item? I want to learn more of how to apply statistical techniques to marketing research. f. Plot the histogram and gather the summary statistics of Learn2 1. Put the item content for the Learn2 1 on the x-axis of the resulting histogram. [The lessr histogram function Histogram, or hs, provides both the histogram and summary statistics. The xlab option provides the content of the label on the x axis, which by default is the variable name.] But first a little fun. Optionally, change the default lessr color theme. > style("dodgerblue") By default R plots the histogram according to the range of observed values. It does not know the range of possible values according to the possible responses you had on your survey. Usually one wants to plot the histogram across the range of possible values. One way to do that is to specify that the bins start at 0 with the bin.start=0 specification. If needed, specify the ending value with bin.end option. > Histogram(Learn2 1, bin.start=0, xlab="i want to learn more of how to apply statistical techniques to marketing research.") g. Summarize in a few sentences the results of this analysis in terms of student attitudes toward the course regarding this item. The values of Learn2 1 span the complete range of values from 0 to 100. One person clearly has no interest to apply statistics to data analysis, but most of the values are between 40 and 90. The tendency is for the number of responses in a bin to increase as the value becomes larger, up to the value of 80. The bin with the most number of responses is for the largest scores, from 70 to 80.

14 14 12 Count of Learn2_ I want to learn more of how to apply statistical techniques to marketing research. Figure 2: Histogram fro Learn2 1.

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

Frequency Distributions

Frequency Distributions Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,

More information

Statistics with a Hemacytometer

Statistics with a Hemacytometer Statistics with a Hemacytometer Overview This exercise incorporates several different statistical analyses. Data gathered from cell counts with a hemacytometer is used to explore frequency distributions

More information

Homework 1 Excel Basics

Homework 1 Excel Basics Homework 1 Excel Basics Excel is a software program that is used to organize information, perform calculations, and create visual displays of the information. When you start up Excel, you will see the

More information

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student

The basic arrangement of numeric data is called an ARRAY. Array is the derived data from fundamental data Example :- To store marks of 50 student Organizing data Learning Outcome 1. make an array 2. divide the array into class intervals 3. describe the characteristics of a table 4. construct a frequency distribution table 5. constructing a composite

More information

CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010

CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010 CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010 Quick Summary A workbook an Excel document that stores data contains one or more pages called a worksheet. A worksheet or spreadsheet is stored in a workbook, and

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs DATA PRESENTATION At the end of the chapter, you will learn to: Present data in textual form Construct different types of table and graphs Identify the characteristics of a good table and graph Identify

More information

Create a bar graph that displays the data from the frequency table in Example 1. See the examples on p Does our graph look different?

Create a bar graph that displays the data from the frequency table in Example 1. See the examples on p Does our graph look different? A frequency table is a table with two columns, one for the categories and another for the number of times each category occurs. See Example 1 on p. 247. Create a bar graph that displays the data from the

More information

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. + What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and

More information

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.

CHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. 1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed

More information

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values Chapter 500 Introduction This procedure produces tables of frequency counts and percentages for categorical and continuous variables. This procedure serves as a summary reporting tool and is often used

More information

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set. Measures of Central Tendency A measure of central tendency is a value used to represent the typical or average value in a data set. The Mean the sum of all data values divided by the number of values in

More information

Probability and Statistics. Copyright Cengage Learning. All rights reserved.

Probability and Statistics. Copyright Cengage Learning. All rights reserved. Probability and Statistics Copyright Cengage Learning. All rights reserved. 14.6 Descriptive Statistics (Graphical) Copyright Cengage Learning. All rights reserved. Objectives Data in Categories Histograms

More information

Chapter 6: DESCRIPTIVE STATISTICS

Chapter 6: DESCRIPTIVE STATISTICS Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling

More information

AP Statistics Summer Assignment:

AP Statistics Summer Assignment: AP Statistics Summer Assignment: Read the following and use the information to help answer your summer assignment questions. You will be responsible for knowing all of the information contained in this

More information

Measures of Central Tendency

Measures of Central Tendency Page of 6 Measures of Central Tendency A measure of central tendency is a value used to represent the typical or average value in a data set. The Mean The sum of all data values divided by the number of

More information

Chapter 2 Organizing and Graphing Data. 2.1 Organizing and Graphing Qualitative Data

Chapter 2 Organizing and Graphing Data. 2.1 Organizing and Graphing Qualitative Data Chapter 2 Organizing and Graphing Data 2.1 Organizing and Graphing Qualitative Data 2.2 Organizing and Graphing Quantitative Data 2.3 Stem-and-leaf Displays 2.4 Dotplots 2.1 Organizing and Graphing Qualitative

More information

Exploratory Data Analysis

Exploratory Data Analysis Chapter 10 Exploratory Data Analysis Definition of Exploratory Data Analysis (page 410) Definition 12.1. Exploratory data analysis (EDA) is a subfield of applied statistics that is concerned with the investigation

More information

Raw Data is data before it has been arranged in a useful manner or analyzed using statistical techniques.

Raw Data is data before it has been arranged in a useful manner or analyzed using statistical techniques. Section 2.1 - Introduction Graphs are commonly used to organize, summarize, and analyze collections of data. Using a graph to visually present a data set makes it easy to comprehend and to describe the

More information

Statistics Lecture 6. Looking at data one variable

Statistics Lecture 6. Looking at data one variable Statistics 111 - Lecture 6 Looking at data one variable Chapter 1.1 Moore, McCabe and Craig Probability vs. Statistics Probability 1. We know the distribution of the random variable (Normal, Binomial)

More information

Section 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc

Section 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc Section 2-2 Frequency Distributions Copyright 2010, 2007, 2004 Pearson Education, Inc. 2.1-1 Frequency Distribution Frequency Distribution (or Frequency Table) It shows how a data set is partitioned among

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 2 Summarizing and Graphing Data 2-1 Review and Preview 2-2 Frequency Distributions 2-3 Histograms

More information

Organizing and Summarizing Data

Organizing and Summarizing Data 1 Organizing and Summarizing Data Key Definitions Frequency Distribution: This lists each category of data and how often they occur. : The percent of observations within the one of the categories. This

More information

Chapter 3 - Displaying and Summarizing Quantitative Data

Chapter 3 - Displaying and Summarizing Quantitative Data Chapter 3 - Displaying and Summarizing Quantitative Data 3.1 Graphs for Quantitative Data (LABEL GRAPHS) August 25, 2014 Histogram (p. 44) - Graph that uses bars to represent different frequencies or relative

More information

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form. CHAPTER 2 Frequency Distributions and Graphs Objectives Organize data using frequency distributions. Represent data in frequency distributions graphically using histograms, frequency polygons, and ogives.

More information

Chapter 2: Understanding Data Distributions with Tables and Graphs

Chapter 2: Understanding Data Distributions with Tables and Graphs Test Bank Chapter 2: Understanding Data with Tables and Graphs Multiple Choice 1. Which of the following would best depict nominal level data? a. pie chart b. line graph c. histogram d. polygon Ans: A

More information

NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA

NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA NOTES TO CONSIDER BEFORE ATTEMPTING EX 1A TYPES OF DATA Statistics is concerned with scientific methods of collecting, recording, organising, summarising, presenting and analysing data from which future

More information

Training for ICDL Spreadsheets Spreadsheets 261

Training for ICDL Spreadsheets Spreadsheets  261 261 Module Goals 1 Introduction 2 What is a Spreadsheet? 2 Section 1 Using the Application 3 1.1. Working with 3 1.2. Enhancing Productivity 11 Section 2 Cells 15 2.1. Inserting and Selecting Data 15 2.2.

More information

IT 403 Practice Problems (1-2) Answers

IT 403 Practice Problems (1-2) Answers IT 403 Practice Problems (1-2) Answers #1. Using Tukey's Hinges method ('Inclusionary'), what is Q3 for this dataset? 2 3 5 7 11 13 17 a. 7 b. 11 c. 12 d. 15 c (12) #2. How do quartiles and percentiles

More information

Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975.

Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975. Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975. SPSS Statistics were designed INTRODUCTION TO SPSS Objective About the

More information

MATH& 146 Lesson 8. Section 1.6 Averages and Variation

MATH& 146 Lesson 8. Section 1.6 Averages and Variation MATH& 146 Lesson 8 Section 1.6 Averages and Variation 1 Summarizing Data The distribution of a variable is the overall pattern of how often the possible values occur. For numerical variables, three summary

More information

Microsoft Excel 2010 Basics

Microsoft Excel 2010 Basics Microsoft Excel 2010 Basics Starting Word 2010 with XP: Click the Start Button, All Programs, Microsoft Office, Microsoft Excel 2010 Starting Word 2010 with 07: Click the Microsoft Office Button with the

More information

A Simple Guide to Using SPSS (Statistical Package for the. Introduction. Steps for Analyzing Data. Social Sciences) for Windows

A Simple Guide to Using SPSS (Statistical Package for the. Introduction. Steps for Analyzing Data. Social Sciences) for Windows A Simple Guide to Using SPSS (Statistical Package for the Social Sciences) for Windows Introduction ٢ Steps for Analyzing Data Enter the data Select the procedure and options Select the variables Run the

More information

CS1100: Computer Science and Its Applications. Creating Graphs and Charts in Excel

CS1100: Computer Science and Its Applications. Creating Graphs and Charts in Excel CS1100: Computer Science and Its Applications Creating Graphs and Charts in Excel Charts Data is often better explained through visualization as either a graph or a chart. Excel makes creating charts easy:

More information

Introduction to CS databases and statistics in Excel Jacek Wiślicki, Laurent Babout,

Introduction to CS databases and statistics in Excel Jacek Wiślicki, Laurent Babout, One of the applications of MS Excel is data processing and statistical analysis. The following exercises will demonstrate some of these functions. The base files for the exercises is included in http://lbabout.iis.p.lodz.pl/teaching_and_student_projects_files/files/us/lab_04b.zip.

More information

September 11, Unit 2 Day 1 Notes Measures of Central Tendency.notebook

September 11, Unit 2 Day 1 Notes Measures of Central Tendency.notebook Measures of Central Tendency: Mean, Median, Mode and Midrange A Measure of Central Tendency is a value that represents a typical or central entry of a data set. Four most commonly used measures of central

More information

Slide Copyright 2005 Pearson Education, Inc. SEVENTH EDITION and EXPANDED SEVENTH EDITION. Chapter 13. Statistics Sampling Techniques

Slide Copyright 2005 Pearson Education, Inc. SEVENTH EDITION and EXPANDED SEVENTH EDITION. Chapter 13. Statistics Sampling Techniques SEVENTH EDITION and EXPANDED SEVENTH EDITION Slide - Chapter Statistics. Sampling Techniques Statistics Statistics is the art and science of gathering, analyzing, and making inferences from numerical information

More information

Box Plots. OpenStax College

Box Plots. OpenStax College Connexions module: m46920 1 Box Plots OpenStax College This work is produced by The Connexions Project and licensed under the Creative Commons Attribution License 3.0 Box plots (also called box-and-whisker

More information

Microsoft Excel 2010 Handout

Microsoft Excel 2010 Handout Microsoft Excel 2010 Handout Excel is an electronic spreadsheet program you can use to enter and organize data, and perform a wide variety of number crunching tasks. Excel helps you organize and track

More information

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or me, I will answer promptly.

Statistical Methods. Instructor: Lingsong Zhang. Any questions, ask me during the office hour, or  me, I will answer promptly. Statistical Methods Instructor: Lingsong Zhang 1 Issues before Class Statistical Methods Lingsong Zhang Office: Math 544 Email: lingsong@purdue.edu Phone: 765-494-7913 Office Hour: Monday 1:00 pm - 2:00

More information

Automated AFM Image Processing User Manual

Automated AFM Image Processing User Manual Automated AFM Image Processing User Manual Starting The Program Open and run the GUI_run_me.m script in Matlab to start the program. The first thing to do is to select the folder that contains the images

More information

Chapter 2 Modeling Distributions of Data

Chapter 2 Modeling Distributions of Data Chapter 2 Modeling Distributions of Data Section 2.1 Describing Location in a Distribution Describing Location in a Distribution Learning Objectives After this section, you should be able to: FIND and

More information

Chapter 2: The Normal Distribution

Chapter 2: The Normal Distribution Chapter 2: The Normal Distribution 2.1 Density Curves and the Normal Distributions 2.2 Standard Normal Calculations 1 2 Histogram for Strength of Yarn Bobbins 15.60 16.10 16.60 17.10 17.60 18.10 18.60

More information

Part I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures

Part I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures Part I, Chapters 4 & 5 Data Tables and Data Analysis Statistics and Figures Descriptive Statistics 1 Are data points clumped? (order variable / exp. variable) Concentrated around one value? Concentrated

More information

Homework Packet Week #3

Homework Packet Week #3 Lesson 8.1 Choose the term that best completes statements # 1-12. 10. A data distribution is if the peak of the data is in the middle of the graph. The left and right sides of the graph are nearly mirror

More information

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable. 5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table

More information

+ Statistical Methods in

+ Statistical Methods in + Statistical Methods in Practice STA/MTH 3379 + Dr. A. B. W. Manage Associate Professor of Statistics Department of Mathematics & Statistics Sam Houston State University Discovering Statistics 2nd Edition

More information

Chapter 11. Worked-Out Solutions Explorations (p. 585) Chapter 11 Maintaining Mathematical Proficiency (p. 583)

Chapter 11. Worked-Out Solutions Explorations (p. 585) Chapter 11 Maintaining Mathematical Proficiency (p. 583) Maintaining Mathematical Proficiency (p. 3) 1. After School Activities. Pets Frequency 1 1 3 7 Number of activities 3. Students Favorite Subjects Math English Science History Frequency 1 1 1 3 Number of

More information

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data

Chapter 2. Descriptive Statistics: Organizing, Displaying and Summarizing Data Chapter 2 Descriptive Statistics: Organizing, Displaying and Summarizing Data Objectives Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically

More information

SAS Visual Analytics 8.2: Working with Report Content

SAS Visual Analytics 8.2: Working with Report Content SAS Visual Analytics 8.2: Working with Report Content About Objects After selecting your data source and data items, add one or more objects to display the results. SAS Visual Analytics provides objects

More information

Sections Graphical Displays and Measures of Center. Brian Habing Department of Statistics University of South Carolina.

Sections Graphical Displays and Measures of Center. Brian Habing Department of Statistics University of South Carolina. STAT 515 Statistical Methods I Sections 2.1-2.3 Graphical Displays and Measures of Center Brian Habing Department of Statistics University of South Carolina Redistribution of these slides without permission

More information

Bar Charts and Frequency Distributions

Bar Charts and Frequency Distributions Bar Charts and Frequency Distributions Use to display the distribution of categorical (nominal or ordinal) variables. For the continuous (numeric) variables, see the page Histograms, Descriptive Stats

More information

Date Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6.

Date Lesson TOPIC HOMEWORK. Displaying Data WS 6.1. Measures of Central Tendency WS 6.2. Common Distributions WS 6.6. Outliers WS 6. UNIT 6 ONE VARIABLE STATISTICS Date Lesson TOPIC HOMEWORK 6.1 3.3 6.2 3.4 Displaying Data WS 6.1 Measures of Central Tendency WS 6.2 6.3 6.4 3.5 6.5 3.5 Grouped Data Central Tendency Measures of Spread

More information

STATISTICAL TECHNIQUES. Interpreting Basic Statistical Values

STATISTICAL TECHNIQUES. Interpreting Basic Statistical Values STATISTICAL TECHNIQUES Interpreting Basic Statistical Values INTERPRETING BASIC STATISTICAL VALUES Sample representative How would one represent the average or typical piece of information from a given

More information

New Perspectives on Microsoft Excel Module 5: Working with Excel Tables, PivotTables, and PivotCharts

New Perspectives on Microsoft Excel Module 5: Working with Excel Tables, PivotTables, and PivotCharts New Perspectives on Microsoft Excel 2016 Module 5: Working with Excel Tables, PivotTables, and PivotCharts Objectives, Part 1 Explore a structured range of data Freeze rows and columns Plan and create

More information

Univariate Statistics Summary

Univariate Statistics Summary Further Maths Univariate Statistics Summary Types of Data Data can be classified as categorical or numerical. Categorical data are observations or records that are arranged according to category. For example:

More information

Exploring extreme weather with Excel - The basics

Exploring extreme weather with Excel - The basics Exploring extreme weather with Excel - The basics These activities will help to develop your data skills using Excel and explore extreme weather in the UK. This activity introduces the basics of using

More information

Creating a Basic Chart in Excel 2007

Creating a Basic Chart in Excel 2007 Creating a Basic Chart in Excel 2007 A chart is a pictorial representation of the data you enter in a worksheet. Often, a chart can be a more descriptive way of representing your data. As a result, those

More information

Working with Microsoft Excel. Touring Excel. Selecting Data. Presented by: Brian Pearson

Working with Microsoft Excel. Touring Excel. Selecting Data. Presented by: Brian Pearson Working with Microsoft Excel Presented by: Brian Pearson Touring Excel Menu bar Name box Formula bar Ask a Question box Standard and Formatting toolbars sharing one row Work Area Status bar Task Pane 2

More information

Summarising Data. Mark Lunt 09/10/2018. Arthritis Research UK Epidemiology Unit University of Manchester

Summarising Data. Mark Lunt 09/10/2018. Arthritis Research UK Epidemiology Unit University of Manchester Summarising Data Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 09/10/2018 Summarising Data Today we will consider Different types of data Appropriate ways to summarise these

More information

Section 3.2 Measures of Central Tendency MDM4U Jensen

Section 3.2 Measures of Central Tendency MDM4U Jensen Section 3.2 Measures of Central Tendency MDM4U Jensen Part 1: Video This video will review shape of distributions and introduce measures of central tendency. Answer the following questions while watching.

More information

Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts. Microsoft Excel 2013 Enhanced

Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts. Microsoft Excel 2013 Enhanced Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts Microsoft Excel 2013 Enhanced Objectives Explore a structured range of data Freeze rows and columns Plan and create an Excel table Rename

More information

Handling Your Data in SPSS. Columns, and Labels, and Values... Oh My! The Structure of SPSS. You should think about SPSS as having three major parts.

Handling Your Data in SPSS. Columns, and Labels, and Values... Oh My! The Structure of SPSS. You should think about SPSS as having three major parts. Handling Your Data in SPSS Columns, and Labels, and Values... Oh My! You might think that simple intuition will guide you to a useful organization of your data. If you follow that path, you might find

More information

Lab 2: Your first web page

Lab 2: Your first web page Lab 2: Your first web page Due date DUE DATE: 3/3/2017, by 5pm. Demo in lab section/ta office hours. Overview In this lab we will walk through the usage of Microsoft Excel to create bar charts, pie charts,

More information

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables Select Cases GRAPHS The Select Cases command excludes from further analysis all those cases that do not meet specified selection criteria. Select Cases For a subset of the datafile, use Select Cases. In

More information

GO! with Microsoft Excel 2016 Comprehensive

GO! with Microsoft Excel 2016 Comprehensive GO! with Microsoft Excel 2016 Comprehensive First Edition Chapter 2 Using Functions, Creating Tables, and Managing Large Workbooks Use SUM and Statistical Functions The SUM function is a predefined formula

More information

LESSON 3: CENTRAL TENDENCY

LESSON 3: CENTRAL TENDENCY LESSON 3: CENTRAL TENDENCY Outline Arithmetic mean, median and mode Ungrouped data Grouped data Percentiles, fractiles, and quartiles Ungrouped data Grouped data 1 MEAN Mean is defined as follows: Sum

More information

Microsoft Excel 2007

Microsoft Excel 2007 Microsoft Excel 2007 1 Excel is Microsoft s Spreadsheet program. Spreadsheets are often used as a method of displaying and manipulating groups of data in an effective manner. It was originally created

More information

ICT & MATHS. Excel 2003 in Mathematics Teaching

ICT & MATHS. Excel 2003 in Mathematics Teaching ICT & MATHS Excel 2003 in Mathematics Teaching Published by The National Centre for Technology in Education in association with the Project Maths Development Team. Permission granted to reproduce for educational

More information

Chapter 2 Describing, Exploring, and Comparing Data

Chapter 2 Describing, Exploring, and Comparing Data Slide 1 Chapter 2 Describing, Exploring, and Comparing Data Slide 2 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data 2-4 Measures of Center 2-5 Measures of Variation 2-6 Measures of Relative

More information

Chapter 3 Analyzing Normal Quantitative Data

Chapter 3 Analyzing Normal Quantitative Data Chapter 3 Analyzing Normal Quantitative Data Introduction: In chapters 1 and 2, we focused on analyzing categorical data and exploring relationships between categorical data sets. We will now be doing

More information

Elementary Statistics

Elementary Statistics 1 Elementary Statistics Introduction Statistics is the collection of methods for planning experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting, and drawing

More information

Microsoft Excel Chapter 1. Creating a Worksheet and a Chart

Microsoft Excel Chapter 1. Creating a Worksheet and a Chart Microsoft Excel 2013 Chapter 1 Creating a Worksheet and a Chart Objectives Describe the Excel worksheet Enter text and numbers Use the Sum button to sum a range of cells Enter a simple function Copy the

More information

2.1: Frequency Distributions

2.1: Frequency Distributions 2.1: Frequency Distributions Frequency Distribution: organization of data into groups called. A: Categorical Frequency Distribution used for and level qualitative data that can be put into categories.

More information

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA This lab will assist you in learning how to summarize and display categorical and quantitative data in StatCrunch. In particular, you will learn how to

More information

Overview. Frequency Distributions. Chapter 2 Summarizing & Graphing Data. Descriptive Statistics. Inferential Statistics. Frequency Distribution

Overview. Frequency Distributions. Chapter 2 Summarizing & Graphing Data. Descriptive Statistics. Inferential Statistics. Frequency Distribution Chapter 2 Summarizing & Graphing Data Slide 1 Overview Descriptive Statistics Slide 2 A) Overview B) Frequency Distributions C) Visualizing Data summarize or describe the important characteristics of a

More information

MATH 117 Statistical Methods for Management I Chapter Two

MATH 117 Statistical Methods for Management I Chapter Two Jubail University College MATH 117 Statistical Methods for Management I Chapter Two There are a wide variety of ways to summarize, organize, and present data: I. Tables 1. Distribution Table (Categorical

More information

Exploring Microsoft Office Excel 2007

Exploring Microsoft Office Excel 2007 Exploring Microsoft Office Excel 2007 Chapter 5 Data to Information Robert Grauer, Keith Mulbery, Judy Scheeren Committed to Shaping the Next Generation of IT Experts. Copyright 2008 Pearson Prentice Hall.

More information

MATH 1070 Introductory Statistics Lecture notes Descriptive Statistics and Graphical Representation

MATH 1070 Introductory Statistics Lecture notes Descriptive Statistics and Graphical Representation MATH 1070 Introductory Statistics Lecture notes Descriptive Statistics and Graphical Representation Objectives: 1. Learn the meaning of descriptive versus inferential statistics 2. Identify bar graphs,

More information

CHAPTER 6. The Normal Probability Distribution

CHAPTER 6. The Normal Probability Distribution The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit

More information

Chapter Two: Descriptive Methods 1/50

Chapter Two: Descriptive Methods 1/50 Chapter Two: Descriptive Methods 1/50 2.1 Introduction 2/50 2.1 Introduction We previously said that descriptive statistics is made up of various techniques used to summarize the information contained

More information

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 INTRODUCTION Graphs are one of the most important aspects of data analysis and presentation of your of data. They are visual representations

More information

Book 5. Chapter 1: Slides with SmartArt & Pictures... 1 Working with SmartArt Formatting Pictures Adjust Group Buttons Picture Styles Group Buttons

Book 5. Chapter 1: Slides with SmartArt & Pictures... 1 Working with SmartArt Formatting Pictures Adjust Group Buttons Picture Styles Group Buttons Chapter 1: Slides with SmartArt & Pictures... 1 Working with SmartArt Formatting Pictures Adjust Group Buttons Picture Styles Group Buttons Chapter 2: Slides with Charts & Shapes... 12 Working with Charts

More information

Chapter 3: The IF Function and Table Lookup

Chapter 3: The IF Function and Table Lookup Chapter 3: The IF Function and Table Lookup Objectives This chapter focuses on the use of IF and LOOKUP functions, while continuing to introduce other functions as well. Here is a partial list of what

More information

Information Technology and Media Services. Office Excel. Charts

Information Technology and Media Services. Office Excel. Charts Information Technology and Media Services Office 2010 Excel Charts August 2014 Information Technology and Media Services CONTENTS INTRODUCTION... 1 CHART TYPES... 3 CHOOSING A CHART... 4 CREATING A COLUMN

More information

3 Graphical Displays of Data

3 Graphical Displays of Data 3 Graphical Displays of Data Reading: SW Chapter 2, Sections 1-6 Summarizing and Displaying Qualitative Data The data below are from a study of thyroid cancer, using NMTR data. The investigators looked

More information

Tableau Advanced Training. Student Guide April x. For Evaluation Only

Tableau Advanced Training. Student Guide April x. For Evaluation Only Tableau Advanced Training Student Guide www.datarevelations.com 914.945.0567 April 2017 10.x Contents A. Warm Up 1 Bar Chart Colored by Profit 1 Salary Curve 2 2015 v s. 2014 Sales 3 VII. Programmatic

More information

One does not necessarily have special statistical software to perform statistical analyses.

One does not necessarily have special statistical software to perform statistical analyses. Appendix F How to Use a Data Spreadsheet Excel One does not necessarily have special statistical software to perform statistical analyses. Microsoft Office Excel can be used to run statistical procedures.

More information

UNIT 4. Research Methods in Business

UNIT 4. Research Methods in Business UNIT 4 Preparing Data for Analysis:- After data are obtained through questionnaires, interviews, observation or through secondary sources, they need to be edited. The blank responses, if any have to be

More information

Creating a combined line and column chart.

Creating a combined line and column chart. Creating a combined line and column chart. Open a workbook called Line and column chart. Click within the table of data. Click on the Insert tab and within the Charts group click on the Column button.

More information

Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9

Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9 Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9 Contents 1 Introduction to Using Excel Spreadsheets 2 1.1 A Serious Note About Data Security.................................... 2 1.2

More information

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES Chapter 2 2.1 Objectives 2.1 What Are the Types of Data? www.managementscientist.org 1. Know the definitions of a. Variable b. Categorical versus quantitative

More information

Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1

Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1 Unit I Supplement OpenIntro Statistics 3rd ed., Ch. 1 KEY SKILLS: Organize a data set into a frequency distribution. Construct a histogram to summarize a data set. Compute the percentile for a particular

More information

1 Introduction to Using Excel Spreadsheets

1 Introduction to Using Excel Spreadsheets Survey of Math: Excel Spreadsheet Guide (for Excel 2007) Page 1 of 6 1 Introduction to Using Excel Spreadsheets This section of the guide is based on the file (a faux grade sheet created for messing with)

More information

Math 227 EXCEL / MEGASTAT Guide

Math 227 EXCEL / MEGASTAT Guide Math 227 EXCEL / MEGASTAT Guide Introduction Introduction: Ch2: Frequency Distributions and Graphs Construct Frequency Distributions and various types of graphs: Histograms, Polygons, Pie Charts, Stem-and-Leaf

More information

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order. Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good

More information

L E A R N I N G O B JE C T I V E S

L E A R N I N G O B JE C T I V E S 2.2 Measures of Central Location L E A R N I N G O B JE C T I V E S 1. To learn the concept of the center of a data set. 2. To learn the meaning of each of three measures of the center of a data set the

More information

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures STA 2023 Module 3 Descriptive Measures Learning Objectives Upon completing this module, you should be able to: 1. Explain the purpose of a measure of center. 2. Obtain and interpret the mean, median, and

More information

Things you ll know (or know better to watch out for!) when you leave in December: 1. What you can and cannot infer from graphs.

Things you ll know (or know better to watch out for!) when you leave in December: 1. What you can and cannot infer from graphs. 1 2 Things you ll know (or know better to watch out for!) when you leave in December: 1. What you can and cannot infer from graphs. 2. How to construct (in your head!) and interpret confidence intervals.

More information