Reference

Size: px
Start display at page:

Download "Reference"

Transcription

1 Leaning diary: research methodology Name: Juriaan Zandvliet Student number: (1) a short description of each topic of the course, (2) desciption of possible examples or exercises done in relation and (3) your perspective, links, references, ideas and intrerest related to the topic and issues where you would need further training 1. Set up Science produces accurate facts that can be checked, scientific laws and theories. You can say the results of science are repeatable and the theories are falsifiable. The sciencecouncil.org states the following about science: the pursuit and application of knowledge and understanding of the natural and social world following a systematic methodology based on evidence. Science is carried out during a research process. A research process consists out of four stages; problems, theories, criticism and new problems. This research process can be carried out by the following method: 1. Observe some aspect of the universe 2. Invent a tentative description, called hypothesis 3. Use the hypothesis to make predictions 4. Test those predictions by experiments and modify the hypothesis 5. Repeat steps 3 and 4 until there are no discrepancies There are two examples of scientific research processes. The first one is called pure scientific research, this is about explaining the word around us and explaining how the universe operates. Something can be called science when it is falsifiable. For instance the theory of gravity, when I throw a rock into the air and it fall back the theory will stand. If I throw a rock into the air and it will not fall back the theory is refuted. An example of a not falsifiable theory it the existence of a god. Because it is not falsifiable it is not scientific. There is a wide variety of examples of scientific research. Scientific research can be a publication in a scientific paper. Two well-known journals with a relatively broad spectrum of subjects are Nature and Science. Recognised publishers of academic journals are Elsevier, Wiley-Blackwell and Springer. Reference

2 2. Main concepts Statistics is a form of mathematical analysis that uses quantified models, representations and synopses for a given set of experimental data or real-life studies. Statistics studies methodologies to gather, review, analyze and draw conclusions from data. Statistics is a term used to summarize a process that an analyst uses to characterize a data set. If the data set depends on a sample of a larger population, then the analyst can develop interpretations about the population primarily based on the statistical outcomes from the sample. Statistical analysis involves the process of gathering and evaluating data and then summarizing the data into a mathematical form. Mean A mean is the mathematical average of a group of two or more numerals. The mean for a specified set of numbers can be computed in multiple ways, including the arithmetic mean, which shows how well a specific commodity performs over time, and the geometric mean, which shows the performance results of an investor s portfolio invested in that same commodity over the same period. Variance σ2 Variance is a measurement of the span of numbers in a data set. The variance measures the distance each number in the set is from the mean. Variance can help determine the risk an investor might accept when buying an investment. Standard Deviation σ, Variance σ2 Standard deviation is a measure of the dispersion of a set of data from its mean. It is calculated as the square root of variance by determining the variation between each data point relative to the mean. If the data points are further from the mean, there is higher deviation within the data set.

3 Normal distribution A normal distribution is an arrangement of a data set in which most values cluster in the middle of the range and the rest taper off symmetrically toward either extreme. Height is one simple example of something that follows a normal distribution pattern: Most people are of average height, the numbers of people that are taller and shorter than average are fairly equal and a very small (and still roughly equivalent) number of people are either extremely tall or extremely short. Degrees of freedom Df = n-1 Standard Deviation vs. Mean In its simplest form, the mean is simply the average of all the data points in a given set. In investing, for example, you might want to know the mean closing price for the last 20 days. This can be obtained by adding the closing prices for each session and dividing by 20. Because markets are fickle at best, traders and analysts use moving averages that adjust daily to incorporate the most updated data. This

4 means the calculation is always taking into account the most recent sessions' movements, and older sessions drop off as they become less relevant. An exponential moving average (EMA) is calculated by weighting each data point, giving greater significance to more recent data. Standard deviation is calculated based on the mean. The distance of each data point from the mean is squared, summed and averaged to find the variance. Or to put it another way: Variance is derived by taking the mean of the data points, subtracting the mean from each data point individually, squaring each of these results and then taking another mean of these squares. Standard deviation is simply the square root of the variance. Calculating Standard Deviation The formula for standard deviation uses three variables. The first variable is to be the value of each point within the data set, traditionally listed as x, with a sub-number denoting each additional variable (x, x1, x2, x3, etc.). The mean, or average, of the data points is applied to the value of the variable M, and the number of data points involved is assigned to the variable n. To determine the mean value, the values of the data points must be added together, and that total is then divided by the number of data points that were included. For example, if the data points were 5, 7, 3 and 7, the total would be 22. The total of 22 would then be divided by the number of data points, in this case four, resulting in a mean of 5.5. This leads to the following determinations: M=5.5 and n=4. The variance is determined by subtracting the value of the mean from each data point, resulting in - 0.5, 1.5, -2.5 and 1.5. Each of those values are then squared, resulting in 0.25, 2.25, 6.25 and The square values are then added together, resulting in a total of 11, which is then divided by the value of n-1, which is 3 in this instance, resulting in a variance approximately of The square root of the variance is then calculated, resulting in the standard deviation of Range The range is the highest value minus the lowest value in a ascending) series of numbers. The range of for example the number series 11, 12, 13, 13, 16, 18, 19, 20, 22 is 11. This is calculated as = 11. Standard Deviation vs. Variance The variance helps determine the data's spread size when compared to the mean value. As the variance gets bigger, more variation in data values occurs, and there may be a larger gap between one data value and another. If the data values are all close together, the variance will be smaller. This is more difficult to grasp than are standard deviations, however, because variances represent a squared result that may not be meaningfully expressed on the same graph as the original data set.

5 Standard deviations are usually easier to picture and apply. The standard deviation is expressed the same unit of measurement as the data, which isn't necessarily be the case with the variance. Using the standard deviation, statisticians may determine if the data has a normal curve or other mathematical relationship. If the data behaves in a normal curve, then 68% of the data points will fall within one standard deviation of the average, or mean data point. Bigger variances cause more data points to fall outside the standard deviation. Smaller variances result in more data that is close to average. All the definitions are further explained in the next video: Reference Verhoeven, N. (2015). Statistiek in stappen. Amsterdam: Uitgeverij Boom

6 3. Tests Test are used when samples of whole population have to be taken care of and needs to be generalized. There is always insecurity during generalizing, statistics tries to control this by testing assumptions to make decisions. For instance when comparing two populations with each other, statistics is used to make decisions. Comparing two populations is done with the help of the t-test. T-test A t-test is a parametric statistical test that can be used, among other things, to determine whether the population average of a normally distributed quantity deviates from a certain value, or whether there is a difference between the averages of two groups in the population. Using a t-test, you can then determine an exceedance probability or a confidence interval. A t-test can be used if certain conditions are met. For the t test for one sample, the sample concerned must be a random sample from a normal distribution, with possibly unknown variance. In the case of two samples, both samples should come from a normal distribution. The two samples must either be independent of each other or have to be paired. In the case of two independent samples, the two populations should have the same variance when applying the standard t test. If both populations have a different variance, an adapted t-test can be used. The case of paired observations amounts to a t-test for the single sample of the differences. Violations of these assumptions have consequences for the robustness and the distinctive character of the t-test. With the help of an F-test it can be tested whether the variances in both groups differ significantly from each other. The normality of the populations can be tested using the Kolmogorov- Smirnov test. If the conditions of the central limit theorem are met, the t-test can be approximated for large samples. The sample averages required for the calculation of the test quantity are then almost normally distributed. Anova Anova is a testing procedure to determine whether the population averages of more than 2 groups differ from each other. In that sense it is a generalization of the t-test for two samples.

7 Levene test The Levene test is a statistical test to measure if two variations for two different population a different of each other. When the out coming p-value is lower than the alfa there is a significant difference in variation between the two populations. A good example of what statistical test to use is the following youtube video: Reference Verhoeven, N. (2015). Statistiek in stappen. Amsterdam: Uitgeverij Boom

8 4. Regression methods and models Correlation Correlation is the statistical relationship between two variables. These can be two sets of measurements, or possible values of two random variables. The strength of this relationship is expressed in a correlation coefficient. The degree of correlation between two variables is expressed in the correlation coefficient. Their value can vary between -1 and means no linear coherence, +1: perfect positive linear coherence and - 1: perfect negative linear coherence. The further the correlation coefficient is removed from 0, the stronger the correlation rate. R and R2 R² is a measure that provides information about the extent to which a model approaches the actual data. If all predicted values match the actual values, then R² = 1. Note that a (perfect) relationship says nothing about the causality in the data. Regression Analysis Regression analysis determines the extent to which specific factors such as interest rates, the price of a product or service, or particular industries or sectors influence the price fluctuations of an asset. This is depicted in the form of a straight line called linear regression. The line summarizes the relationship between x and y The residuals need to be distributed according to a normal distribution. The error term has a normal distribution with a mean of 0.

9 Models Linear regression technique tries to predict the values of the outcome Y via a linear relation from the values of X. The outcome variable Y is called the dependent variable, and the predictor X the independent variable. Simple regression is when you predict the outcome with one predictor. In practice, this will in many cases not be sufficient and it is desirable to analyse the effects of two or more predictors. Like with models predicting the tree volume, multiple predict. We then speak of multiple (also multiple or multivariable) linear regression. A model tries to predict value y with the help of Y = β 0 + β 1 x. The model line is created where the sum of squares of model residuals is minimized. In the example β 0 is the constant, and is also called the interceptor. β 1 is the variable: income, it is also called the slope. The dependent variable is price. This coefficients table show the correlation between income and price. The value for β1 is.564 and is significant, this means this value cannot be zero which means there would be no correlation. The significance of β0 explains nothing about the correlation between the two variables price and income.

10 Standard Error (SE) The standard error of an estimate shows the precision of that estimate. The standard error is used to indicate how certain you are of an estimated value. Formally, the value represents the spread of the estimate if there were multiple samples, with the estimate being made again for each sample. The formula for standard error is as follows: When the number of samples increases (n) the standard error will decrease because the outcome of dividing true a bigger number will give a smaller number. A good example is given in the following Youtube video about simple linear regression: Reference Verhoeven, N. (2015). Statistiek in stappen. Amsterdam: Uitgeverij Boom

11 5. Other response variables Homoscedasticity is when the residuals show a trend and get wider or smaller. In heteroscedasticity date the variance is constant. There are other data types like: count data, proportional data, binary response data and survival data (age-at-death data). Count data Count data is non-negative integer values (0,1,2,3, ) arising from counting rather than ranking. Example is the number of days a student is absent in one school year. This data can never be less than zero. That is why a normal regression model would not work because the model will eventually turn negative. Proportional data Proportional data gives a proportion of two or more categories, for instance the proportion voting for two different parties or the proportion of dead and alive trees after a storm. Just like count data the data has to be converted before using a linear model because values cannot exceed 100% or can be less than 0%. The variance of the response variable is likely to increase with the mean. Binomial data Binomial data is data consisting out of 0 and 1. Zero stands for normal and one stands for special. For instance healthy people (zero) or sick people (one). Just like count data and proportional data linear model will not acknowledge that the data cannot exceed the values of one and zero. A transformation is therefore required Generalized Linear Models (GLMs) Generalized linear models is count-, proportional or binomial data that has undergone a transformation so a linear model can be utilized. This is done by transforming the predictor to a logistic scale bar or logit. The following youtube video give examples of different statistical data types and how generalized linear models work: Reference Verhoeven, N. (2015). Statistiek in stappen. Amsterdam: Uitgeverij Boom

12 6. Visualizations Visualizations is a big part of academic research. It is about showing the data and communicate information using efficient visuals. Sometimes data can be presented in misleading ways to give the data more impacts. In general complex ideas should be communicated with clarity, precision, and efficiency. There are five design principals for visualising data. 1. Proportions in graphics should accurately represent raw data 2. Labels should be clear, accurate, and descriptive 3. Data should vary, not design 4. There should be no more dimensions shown than dimensions in the data 5. Graphics must not quite data out of context Figures and tables Figure legend is placed under the figure and table heading above table. Every figure and table included in the paper must be referred to for the text. Each column of the table should have a title or a label. Vertical lines should be avoided and do not use a table when you wish to show a trend or pattern. The figure on the right is a bad example of data visualisation. It clearly violates rule 3 listed above: data should vary, not design. Also the proportions of the data do not represent the raw data (rule 1). All the percentages in the figure add up to 243%, 100% would be the right proportion.

13 7. GIS A geographic information system, abbreviated to GIS, is an information system with which spatial data or information about geographical objects, so-called geo-information. This can be stored, managed, processed, analyzed, integrated and presented. There are a lot of spatial analyst tools in GIS to analyse the data and make calculations: clip, intersect, merge, buffer and dissolve. Clip: Extracts input features that overlay the clip features. Intersect: Computes a geometric intersection of the input features. Features or portions of features which overlap in all layers and/or feature classes will be written to the output feature class. Merge: Combines multiple input datasets of the same data type into a single, new output dataset. This tool can combine point, line, or polygon feature classes or tables. Buffer: Creates buffer polygons around input features to a specified distance. Dissolve: Aggregates features based on specified attributes. A possible exam question will be the difference between topology and topography. Topography is the study of the description of characteristics of places and areas. Topography also includes the study of the location and the names of places, waters, mountains, regions, countries and other geographical forms. Topology is a kind of geometry, but topology does not deal with metric properties such as the distance between points. It is the property that describe how the space is composed, such as coherence and orientation. 8. R statistics R is a program that can be used for statistic to make calculations and analyse data. The program uses script language to execute commends. It is a great programme for data analysis and statistical computing with vector and matrix calculations. Frequent codes used in the R programme are: summary(data) sd(data) hist(data) boxplot(data) plot(model) length(data)

Descriptive Statistics, Standard Deviation and Standard Error

Descriptive Statistics, Standard Deviation and Standard Error AP Biology Calculations: Descriptive Statistics, Standard Deviation and Standard Error SBI4UP The Scientific Method & Experimental Design Scientific method is used to explore observations and answer questions.

More information

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. + What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and

More information

Math 214 Introductory Statistics Summer Class Notes Sections 3.2, : 1-21 odd 3.3: 7-13, Measures of Central Tendency

Math 214 Introductory Statistics Summer Class Notes Sections 3.2, : 1-21 odd 3.3: 7-13, Measures of Central Tendency Math 14 Introductory Statistics Summer 008 6-9-08 Class Notes Sections 3, 33 3: 1-1 odd 33: 7-13, 35-39 Measures of Central Tendency odd Notation: Let N be the size of the population, n the size of the

More information

Regression. Dr. G. Bharadwaja Kumar VIT Chennai

Regression. Dr. G. Bharadwaja Kumar VIT Chennai Regression Dr. G. Bharadwaja Kumar VIT Chennai Introduction Statistical models normally specify how one set of variables, called dependent variables, functionally depend on another set of variables, called

More information

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015

MAT 142 College Mathematics. Module ST. Statistics. Terri Miller revised July 14, 2015 MAT 142 College Mathematics Statistics Module ST Terri Miller revised July 14, 2015 2 Statistics Data Organization and Visualization Basic Terms. A population is the set of all objects under study, a sample

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order.

Prepare a stem-and-leaf graph for the following data. In your final display, you should arrange the leaves for each stem in increasing order. Chapter 2 2.1 Descriptive Statistics A stem-and-leaf graph, also called a stemplot, allows for a nice overview of quantitative data without losing information on individual observations. It can be a good

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ when the population standard deviation is known and population distribution is normal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses

More information

Middle School Math Course 2

Middle School Math Course 2 Middle School Math Course 2 Correlation of the ALEKS course Middle School Math Course 2 to the Indiana Academic Standards for Mathematics Grade 7 (2014) 1: NUMBER SENSE = ALEKS course topic that addresses

More information

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes. Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department

More information

Measures of Dispersion

Measures of Dispersion Measures of Dispersion 6-3 I Will... Find measures of dispersion of sets of data. Find standard deviation and analyze normal distribution. Day 1: Dispersion Vocabulary Measures of Variation (Dispersion

More information

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann

THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA. Forrest W. Young & Carla M. Bann Forrest W. Young & Carla M. Bann THE L.L. THURSTONE PSYCHOMETRIC LABORATORY UNIVERSITY OF NORTH CAROLINA CB 3270 DAVIE HALL, CHAPEL HILL N.C., USA 27599-3270 VISUAL STATISTICS PROJECT WWW.VISUALSTATS.ORG

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency lowest value + highest value midrange The word average: is very ambiguous and can actually refer to the mean,

More information

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types

More information

Data Mining. ❷Chapter 2 Basic Statistics. Asso.Prof.Dr. Xiao-dong Zhu. Business School, University of Shanghai for Science & Technology

Data Mining. ❷Chapter 2 Basic Statistics. Asso.Prof.Dr. Xiao-dong Zhu. Business School, University of Shanghai for Science & Technology ❷Chapter 2 Basic Statistics Business School, University of Shanghai for Science & Technology 2016-2017 2nd Semester, Spring2017 Contents of chapter 1 1 recording data using computers 2 3 4 5 6 some famous

More information

For our example, we will look at the following factors and factor levels.

For our example, we will look at the following factors and factor levels. In order to review the calculations that are used to generate the Analysis of Variance, we will use the statapult example. By adjusting various settings on the statapult, you are able to throw the ball

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &

More information

5. Compare the volume of a three dimensional figure to surface area.

5. Compare the volume of a three dimensional figure to surface area. 5. Compare the volume of a three dimensional figure to surface area. 1. What are the inferences that can be drawn from sets of data points having a positive association and a negative association. 2. Why

More information

CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12

CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12 Tool 1: Standards for Mathematical ent: Interpreting Functions CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12 Name of Reviewer School/District Date Name of Curriculum Materials:

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

Announcements. Data Sources a list of data files and their sources, an example of what I am looking for:

Announcements. Data Sources a list of data files and their sources, an example of what I am looking for: Data Announcements Data Sources a list of data files and their sources, an example of what I am looking for: Source Map of Bangor MEGIS NG911 road file for Bangor MEGIS Tax maps for Bangor City Hall, may

More information

IT 403 Practice Problems (1-2) Answers

IT 403 Practice Problems (1-2) Answers IT 403 Practice Problems (1-2) Answers #1. Using Tukey's Hinges method ('Inclusionary'), what is Q3 for this dataset? 2 3 5 7 11 13 17 a. 7 b. 11 c. 12 d. 15 c (12) #2. How do quartiles and percentiles

More information

Slide Copyright 2005 Pearson Education, Inc. SEVENTH EDITION and EXPANDED SEVENTH EDITION. Chapter 13. Statistics Sampling Techniques

Slide Copyright 2005 Pearson Education, Inc. SEVENTH EDITION and EXPANDED SEVENTH EDITION. Chapter 13. Statistics Sampling Techniques SEVENTH EDITION and EXPANDED SEVENTH EDITION Slide - Chapter Statistics. Sampling Techniques Statistics Statistics is the art and science of gathering, analyzing, and making inferences from numerical information

More information

Product Catalog. AcaStat. Software

Product Catalog. AcaStat. Software Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

Big Mathematical Ideas and Understandings

Big Mathematical Ideas and Understandings Big Mathematical Ideas and Understandings A Big Idea is a statement of an idea that is central to the learning of mathematics, one that links numerous mathematical understandings into a coherent whole.

More information

Lecture 1: Statistical Reasoning 2. Lecture 1. Simple Regression, An Overview, and Simple Linear Regression

Lecture 1: Statistical Reasoning 2. Lecture 1. Simple Regression, An Overview, and Simple Linear Regression Lecture Simple Regression, An Overview, and Simple Linear Regression Learning Objectives In this set of lectures we will develop a framework for simple linear, logistic, and Cox Proportional Hazards Regression

More information

3. Data Analysis and Statistics

3. Data Analysis and Statistics 3. Data Analysis and Statistics 3.1 Visual Analysis of Data 3.2.1 Basic Statistics Examples 3.2.2 Basic Statistical Theory 3.3 Normal Distributions 3.4 Bivariate Data 3.1 Visual Analysis of Data Visual

More information

INDEPENDENT SCHOOL DISTRICT 196 Rosemount, Minnesota Educating our students to reach their full potential

INDEPENDENT SCHOOL DISTRICT 196 Rosemount, Minnesota Educating our students to reach their full potential INDEPENDENT SCHOOL DISTRICT 196 Rosemount, Minnesota Educating our students to reach their full potential MINNESOTA MATHEMATICS STANDARDS Grades 9, 10, 11 I. MATHEMATICAL REASONING Apply skills of mathematical

More information

Course of study- Algebra Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by

Course of study- Algebra Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by Course of study- Algebra 1-2 1. Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by students in Grades 9 and 10, but since all students must

More information

Chapter 6: DESCRIPTIVE STATISTICS

Chapter 6: DESCRIPTIVE STATISTICS Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling

More information

DOWNLOAD PDF BIG IDEAS MATH VERTICAL SHRINK OF A PARABOLA

DOWNLOAD PDF BIG IDEAS MATH VERTICAL SHRINK OF A PARABOLA Chapter 1 : BioMath: Transformation of Graphs Use the results in part (a) to identify the vertex of the parabola. c. Find a vertical line on your graph paper so that when you fold the paper, the left portion

More information

Downloaded from

Downloaded from UNIT 2 WHAT IS STATISTICS? Researchers deal with a large amount of data and have to draw dependable conclusions on the basis of data collected for the purpose. Statistics help the researchers in making

More information

Chapter Two: Descriptive Methods 1/50

Chapter Two: Descriptive Methods 1/50 Chapter Two: Descriptive Methods 1/50 2.1 Introduction 2/50 2.1 Introduction We previously said that descriptive statistics is made up of various techniques used to summarize the information contained

More information

Measures of Central Tendency

Measures of Central Tendency Page of 6 Measures of Central Tendency A measure of central tendency is a value used to represent the typical or average value in a data set. The Mean The sum of all data values divided by the number of

More information

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set. Measures of Central Tendency A measure of central tendency is a value used to represent the typical or average value in a data set. The Mean the sum of all data values divided by the number of values in

More information

Frequency Distributions

Frequency Distributions Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,

More information

Building Better Parametric Cost Models

Building Better Parametric Cost Models Building Better Parametric Cost Models Based on the PMI PMBOK Guide Fourth Edition 37 IPDI has been reviewed and approved as a provider of project management training by the Project Management Institute

More information

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability 7 Fractions GRADE 7 FRACTIONS continue to develop proficiency by using fractions in mental strategies and in selecting and justifying use; develop proficiency in adding and subtracting simple fractions;

More information

Part I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures

Part I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures Part I, Chapters 4 & 5 Data Tables and Data Analysis Statistics and Figures Descriptive Statistics 1 Are data points clumped? (order variable / exp. variable) Concentrated around one value? Concentrated

More information

8 th Grade Pre Algebra Pacing Guide 1 st Nine Weeks

8 th Grade Pre Algebra Pacing Guide 1 st Nine Weeks 8 th Grade Pre Algebra Pacing Guide 1 st Nine Weeks MS Objective CCSS Standard I Can Statements Included in MS Framework + Included in Phase 1 infusion Included in Phase 2 infusion 1a. Define, classify,

More information

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,

More information

SLStats.notebook. January 12, Statistics:

SLStats.notebook. January 12, Statistics: Statistics: 1 2 3 Ways to display data: 4 generic arithmetic mean sample 14A: Opener, #3,4 (Vocabulary, histograms, frequency tables, stem and leaf) 14B.1: #3,5,8,9,11,12,14,15,16 (Mean, median, mode,

More information

Data Preprocessing. S1 Teknik Informatika Fakultas Teknologi Informasi Universitas Kristen Maranatha

Data Preprocessing. S1 Teknik Informatika Fakultas Teknologi Informasi Universitas Kristen Maranatha Data Preprocessing S1 Teknik Informatika Fakultas Teknologi Informasi Universitas Kristen Maranatha 1 Why Data Preprocessing? Data in the real world is dirty incomplete: lacking attribute values, lacking

More information

8: Statistics. Populations and Samples. Histograms and Frequency Polygons. Page 1 of 10

8: Statistics. Populations and Samples. Histograms and Frequency Polygons. Page 1 of 10 8: Statistics Statistics: Method of collecting, organizing, analyzing, and interpreting data, as well as drawing conclusions based on the data. Methodology is divided into two main areas. Descriptive Statistics:

More information

Continuous Improvement Toolkit. Normal Distribution. Continuous Improvement Toolkit.

Continuous Improvement Toolkit. Normal Distribution. Continuous Improvement Toolkit. Continuous Improvement Toolkit Normal Distribution The Continuous Improvement Map Managing Risk FMEA Understanding Performance** Check Sheets Data Collection PDPC RAID Log* Risk Analysis* Benchmarking***

More information

Year 8 Review 1, Set 1 Number confidence (Four operations, place value, common indices and estimation)

Year 8 Review 1, Set 1 Number confidence (Four operations, place value, common indices and estimation) Year 8 Review 1, Set 1 Number confidence (Four operations, place value, common indices and estimation) Place value Digit Integer Negative number Difference, Minus, Less Operation Multiply, Multiplication,

More information

Data Preprocessing. Slides by: Shree Jaswal

Data Preprocessing. Slides by: Shree Jaswal Data Preprocessing Slides by: Shree Jaswal Topics to be covered Why Preprocessing? Data Cleaning; Data Integration; Data Reduction: Attribute subset selection, Histograms, Clustering and Sampling; Data

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

Year 9: Long term plan

Year 9: Long term plan Year 9: Long term plan Year 9: Long term plan Unit Hours Powerful procedures 7 Round and round 4 How to become an expert equation solver 6 Why scatter? 6 The construction site 7 Thinking proportionally

More information

demonstrate an understanding of the exponent rules of multiplication and division, and apply them to simplify expressions Number Sense and Algebra

demonstrate an understanding of the exponent rules of multiplication and division, and apply them to simplify expressions Number Sense and Algebra MPM 1D - Grade Nine Academic Mathematics This guide has been organized in alignment with the 2005 Ontario Mathematics Curriculum. Each of the specific curriculum expectations are cross-referenced to the

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Library, Teaching & Learning 014 Summary of Basic data Analysis DATA Qualitative Quantitative Counted Measured Discrete Continuous 3 Main Measures of Interest Central Tendency Dispersion

More information

Integers & Absolute Value Properties of Addition Add Integers Subtract Integers. Add & Subtract Like Fractions Add & Subtract Unlike Fractions

Integers & Absolute Value Properties of Addition Add Integers Subtract Integers. Add & Subtract Like Fractions Add & Subtract Unlike Fractions Unit 1: Rational Numbers & Exponents M07.A-N & M08.A-N, M08.B-E Essential Questions Standards Content Skills Vocabulary What happens when you add, subtract, multiply and divide integers? What happens when

More information

1a. Define, classify, and order rational and irrational numbers and their subsets. (DOK 1)

1a. Define, classify, and order rational and irrational numbers and their subsets. (DOK 1) 1a. Define, classify, and order rational and irrational numbers and their subsets. (DOK 1) 1b. Formulate and solve standard and reallife problems involving addition, subtraction, multiplication, and division

More information

Stage 1 (intervention) Stage 2 Stage 3 Stage 4. Advanced 7-8. Secure 4-6

Stage 1 (intervention) Stage 2 Stage 3 Stage 4. Advanced 7-8. Secure 4-6 Stage 1 (intervention) Stage 2 Stage 3 Stage 4 YEAR 7 LAT Grade Emerging (not secondary read) 1-3 Secure 4-6 Advanced 7-8 Advanced + 9 YEAR 8 1 Emerging 2-3 Secure 4-6 Advanced 7-9 Autumn 1 Place Value

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information

CHAPTER 2 DESCRIPTIVE STATISTICS

CHAPTER 2 DESCRIPTIVE STATISTICS CHAPTER 2 DESCRIPTIVE STATISTICS 1. Stem-and-Leaf Graphs, Line Graphs, and Bar Graphs The distribution of data is how the data is spread or distributed over the range of the data values. This is one of

More information

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL

SPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL SPSS QM II SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform some statistical analyses in SPSS. Details around a certain function/analysis method not covered

More information

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures

STA Rev. F Learning Objectives. Learning Objectives (Cont.) Module 3 Descriptive Measures STA 2023 Module 3 Descriptive Measures Learning Objectives Upon completing this module, you should be able to: 1. Explain the purpose of a measure of center. 2. Obtain and interpret the mean, median, and

More information

Stage 1 Place Value Calculations Geometry Fractions Data. Name and describe (using appropriate vocabulary) common 2d and 3d shapes

Stage 1 Place Value Calculations Geometry Fractions Data. Name and describe (using appropriate vocabulary) common 2d and 3d shapes Stage 1 Place Value Calculations Geometry Fractions Data YEAR 7 Working towards Read and write whole numbers in words and figures Mental methods for addition and subtraction, Name and describe (using appropriate

More information

Middle School Math Course 3

Middle School Math Course 3 Middle School Math Course 3 Correlation of the ALEKS course Middle School Math Course 3 to the Texas Essential Knowledge and Skills (TEKS) for Mathematics Grade 8 (2012) (1) Mathematical process standards.

More information

Route Map (Start September 2012) Year 9

Route Map (Start September 2012) Year 9 Route Map (Start September 2012) Year 9 3 th 7 th Sept 10 th -14 th Sept 17 th 21 st Sept 24 th 28 th Sept 1 st -5 th Oct 8 th -12 th Oct 15 th 19 th Oct 22 nd -26 th Oct 29 th Oct-2 nd Nov 5 th -9 th

More information

The Use of Biplot Analysis and Euclidean Distance with Procrustes Measure for Outliers Detection

The Use of Biplot Analysis and Euclidean Distance with Procrustes Measure for Outliers Detection Volume-8, Issue-1 February 2018 International Journal of Engineering and Management Research Page Number: 194-200 The Use of Biplot Analysis and Euclidean Distance with Procrustes Measure for Outliers

More information

Key Stage 3 Curriculum

Key Stage 3 Curriculum Key Stage 3 Curriculum Learning Area: Maths Learning Area Coordinator: Ms S J Pankhurst What will I study? SUBJECT YEAR 7 Autumn 1 Autumn 2 Spring 1 Spring 2 Summer 1 Summer 2 Focus Counting and comparing

More information

Unit 5: Estimating with Confidence

Unit 5: Estimating with Confidence Unit 5: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Unit 5 Estimating with Confidence 8.1 8.2 8.3 Confidence Intervals: The Basics Estimating

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

Year 8 Key Performance Indicators Maths (Number)

Year 8 Key Performance Indicators Maths (Number) Key Performance Indicators Maths (Number) M8.1 N1: I can solve problems by adding, subtracting, multiplying and dividing decimals. Use correct notation for recurring decimals, know the denominators of

More information

Alaska Mathematics Standards Vocabulary Word List Grade 7

Alaska Mathematics Standards Vocabulary Word List Grade 7 1 estimate proportion proportional relationship rate ratio rational coefficient rational number scale Ratios and Proportional Relationships To find a number close to an exact amount; an estimate tells

More information

Use of GeoGebra in teaching about central tendency and spread variability

Use of GeoGebra in teaching about central tendency and spread variability CREAT. MATH. INFORM. 21 (2012), No. 1, 57-64 Online version at http://creative-mathematics.ubm.ro/ Print Edition: ISSN 1584-286X Online Edition: ISSN 1843-441X Use of GeoGebra in teaching about central

More information

PSS718 - Data Mining

PSS718 - Data Mining Lecture 5 - Hacettepe University October 23, 2016 Data Issues Improving the performance of a model To improve the performance of a model, we mostly improve the data Source additional data Clean up the

More information

Predictive Analysis: Evaluation and Experimentation. Heejun Kim

Predictive Analysis: Evaluation and Experimentation. Heejun Kim Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training

More information

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM * Which directories are used for input files and output files? See menu-item "Options" and page 22 in the manual.

More information

Section E. Measuring the Strength of A Linear Association

Section E. Measuring the Strength of A Linear Association This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs

At the end of the chapter, you will learn to: Present data in textual form. Construct different types of table and graphs DATA PRESENTATION At the end of the chapter, you will learn to: Present data in textual form Construct different types of table and graphs Identify the characteristics of a good table and graph Identify

More information

Scope and Sequence for the New Jersey Core Curriculum Content Standards

Scope and Sequence for the New Jersey Core Curriculum Content Standards Scope and Sequence for the New Jersey Core Curriculum Content Standards The following chart provides an overview of where within Prentice Hall Course 3 Mathematics each of the Cumulative Progress Indicators

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

Carnegie Learning Math Series Course 2, A Florida Standards Program

Carnegie Learning Math Series Course 2, A Florida Standards Program to the students previous understanding of equivalent ratios Introduction to. Ratios and Rates Ratios, Rates,. and Mixture Problems.3.4.5.6 Rates and Tables to Solve Problems to Solve Problems Unit Rates

More information

Minnesota Academic Standards for Mathematics 2007

Minnesota Academic Standards for Mathematics 2007 An Alignment of Minnesota for Mathematics 2007 to the Pearson Integrated High School Mathematics 2014 to Pearson Integrated High School Mathematics Common Core Table of Contents Chapter 1... 1 Chapter

More information

Statistical Analysis of MRI Data

Statistical Analysis of MRI Data Statistical Analysis of MRI Data Shelby Cummings August 1, 2012 Abstract Every day, numerous people around the country go under medical testing with the use of MRI technology. Developed in the late twentieth

More information

Students will understand 1. that numerical expressions can be written and evaluated using whole number exponents

Students will understand 1. that numerical expressions can be written and evaluated using whole number exponents Grade 6 Expressions and Equations Essential Questions: How do you use patterns to understand mathematics and model situations? What is algebra? How are the horizontal and vertical axes related? How do

More information

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable. 5-number summary 68-95-99.7 Rule Area principle Bar chart Bimodal Boxplot Case Categorical data Categorical variable Center Changing center and spread Conditional distribution Context Contingency table

More information

Data analysis using Microsoft Excel

Data analysis using Microsoft Excel Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data

More information

Error Analysis, Statistics and Graphing

Error Analysis, Statistics and Graphing Error Analysis, Statistics and Graphing This semester, most of labs we require us to calculate a numerical answer based on the data we obtain. A hard question to answer in most cases is how good is your

More information

1. Assumptions. 1. Introduction. 2. Terminology

1. Assumptions. 1. Introduction. 2. Terminology 4. Process Modeling 4. Process Modeling The goal for this chapter is to present the background and specific analysis techniques needed to construct a statistical model that describes a particular scientific

More information

Mathematics Appendix 1: Examples of formal written methods for addition, subtraction, multiplication and division

Mathematics Appendix 1: Examples of formal written methods for addition, subtraction, multiplication and division Mathematics Appendix 1: Examples of formal written methods for addition, subtraction, multiplication and division This appendix sets out some examples of formal written methods for all four operations

More information

Chapters 5-6: Statistical Inference Methods

Chapters 5-6: Statistical Inference Methods Chapters 5-6: Statistical Inference Methods Chapter 5: Estimation (of population parameters) Ex. Based on GSS data, we re 95% confident that the population mean of the variable LONELY (no. of days in past

More information

Curriculum Catalog

Curriculum Catalog 2018-2019 Curriculum Catalog Table of Contents MATHEMATICS 800 COURSE OVERVIEW... 1 UNIT 1: THE REAL NUMBER SYSTEM... 1 UNIT 2: MODELING PROBLEMS IN INTEGERS... 3 UNIT 3: MODELING PROBLEMS WITH RATIONAL

More information

Averages and Variation

Averages and Variation Averages and Variation 3 Copyright Cengage Learning. All rights reserved. 3.1-1 Section 3.1 Measures of Central Tendency: Mode, Median, and Mean Copyright Cengage Learning. All rights reserved. 3.1-2 Focus

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Themes in the Texas CCRS - Mathematics

Themes in the Texas CCRS - Mathematics 1. Compare real numbers. a. Classify numbers as natural, whole, integers, rational, irrational, real, imaginary, &/or complex. b. Use and apply the relative magnitude of real numbers by using inequality

More information

Whole Numbers and Integers. Angles and Bearings

Whole Numbers and Integers. Angles and Bearings Whole Numbers and Integers Multiply two 2-digit whole numbers without a calculator Know the meaning of square number Add and subtract two integers without a calculator Multiply an integer by a single digit

More information

Equations and Functions, Variables and Expressions

Equations and Functions, Variables and Expressions Equations and Functions, Variables and Expressions Equations and functions are ubiquitous components of mathematical language. Success in mathematics beyond basic arithmetic depends on having a solid working

More information

Chapter 2 - Graphical Summaries of Data

Chapter 2 - Graphical Summaries of Data Chapter 2 - Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense

More information

Modelling Proportions and Count Data

Modelling Proportions and Count Data Modelling Proportions and Count Data Rick White May 4, 2016 Outline Analysis of Count Data Binary Data Analysis Categorical Data Analysis Generalized Linear Models Questions Types of Data Continuous data:

More information

ANNUAL NATIONAL ASSESSMENT 2014 ASSESSMENT GUIDELINES MATHEMATICS GRADE 8

ANNUAL NATIONAL ASSESSMENT 2014 ASSESSMENT GUIDELINES MATHEMATICS GRADE 8 ANNUAL NATIONAL ASSESSMENT 2014 ASSESSMENT GUIDELINES MATHEMATICS GRADE 8 INTRODUCTION The 2014 cycle of Annual National Assessment (ANA 2014) will be administered in all public and designated 1 independent

More information

Exercise: Graphing and Least Squares Fitting in Quattro Pro

Exercise: Graphing and Least Squares Fitting in Quattro Pro Chapter 5 Exercise: Graphing and Least Squares Fitting in Quattro Pro 5.1 Purpose The purpose of this experiment is to become familiar with using Quattro Pro to produce graphs and analyze graphical data.

More information