Parabolic Path to a Best Best-Fit Line:

Similar documents
EM375 STATISTICS AND MEASUREMENT UNCERTAINTY LEAST SQUARES LINEAR REGRESSION ANALYSIS

Math Section 2.2 Polynomial Functions

The isoperimetric problem on the hypercube

Pattern Recognition Systems Lab 1 Least Mean Squares

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana

Intro to Scientific Computing: Solutions

Bezier curves. Figure 2 shows cubic Bezier curves for various control points. In a Bezier curve, only

Consider the following population data for the state of California. Year Population

Arithmetic Sequences

South Slave Divisional Education Council. Math 10C

Numerical Methods Lecture 6 - Curve Fitting Techniques

The VSS CCD photometry spreadsheet

The number n of subintervals times the length h of subintervals gives length of interval (b-a).

Math 10C Long Range Plans

Section 7.2: Direction Fields and Euler s Methods

EVALUATION OF TRIGONOMETRIC FUNCTIONS

Civil Engineering Computation

Global Support Guide. Verizon WIreless. For the BlackBerry 8830 World Edition Smartphone and the Motorola Z6c

IMP: Superposer Integrated Morphometrics Package Superposition Tool

The Graphs of Polynomial Functions

Examples and Applications of Binary Search

9 x and g(x) = 4. x. Find (x) 3.6. I. Combining Functions. A. From Equations. Example: Let f(x) = and its domain. Example: Let f(x) = and g(x) = x x 4

Polynomial Functions and Models. Learning Objectives. Polynomials. P (x) = a n x n + a n 1 x n a 1 x + a 0, a n 0

A Taste of Maya. Character Setup

A Resource for Free-standing Mathematics Qualifications

Overview Chapter 12 A display model

Our Learning Problem, Again

Alpha Individual Solutions MAΘ National Convention 2013

Sharing Collections. Share a Collection via . Share a Collection via Google Classroom. Quick Reference Guide

It just came to me that I 8.2 GRAPHS AND CONVERGENCE

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence

Learning to Shoot a Goal Lecture 8: Learning Models and Skills

MOTIF XF Extension Owner s Manual

Chapter 10. Defining Classes. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Recursive Procedures. How can you model the relationship between consecutive terms of a sequence?

Python Programming: An Introduction to Computer Science

Princeton Instruments Reference Manual

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

CHAPTER IV: GRAPH THEORY. Section 1: Introduction to Graphs

Lecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Getting Started. Getting Started - 1

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le

. Written in factored form it is easy to see that the roots are 2, 2, i,

Ones Assignment Method for Solving Traveling Salesman Problem

Intermediate Statistics

Weston Anniversary Fund

COMPOSITE TRANSFORMATIONS. DOES ORDER MATTER Use the composite transformation to plot A B C and A B C. 1a)

SD vs. SD + One of the most important uses of sample statistics is to estimate the corresponding population parameters.

1.8 What Comes Next? What Comes Later?

Reading. Parametric curves. Mathematical curve representation. Curves before computers. Required: Angel , , , 11.9.

Descriptive Statistics Summary Lists

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting)

APPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS

Texture Mapping. Jian Huang. This set of slides references the ones used at Ohio State for instruction.

6.854J / J Advanced Algorithms Fall 2008

UNIT 4 Section 8 Estimating Population Parameters using Confidence Intervals

12-5A. Equivalent Fractions and Decimals. 1 Daily Common Core Review. Common Core. Lesson. Lesson Overview. Math Background

Homework 1 Solutions MA 522 Fall 2017

Python Programming: An Introduction to Computer Science

Area As A Limit & Sigma Notation

Using the Keyboard. Using the Wireless Keyboard. > Using the Keyboard

Baan Tools User Management

ECE4050 Data Structures and Algorithms. Lecture 6: Searching

PLEASURE TEST SERIES (XI) - 04 By O.P. Gupta (For stuffs on Math, click at theopgupta.com)

OCR Statistics 1. Working with data. Section 3: Measures of spread

Visualization of Gauss-Bonnet Theorem

Markov Chain Model of HomePlug CSMA MAC for Determining Optimal Fixed Contention Window Size

Assignment 5; Due Friday, February 10

Lecture 18. Optimization in n dimensions

Parametric curves. Reading. Parametric polynomial curves. Mathematical curve representation. Brian Curless CSE 457 Spring 2015

Computational Geometry

( n+1 2 ) , position=(7+1)/2 =4,(median is observation #4) Median=10lb

Math 167 Review for Test 4 Chapters 7, 8 & 9

Chapter 5. Functions for All Subtasks. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Name Date Hr. ALGEBRA 1-2 SPRING FINAL MULTIPLE CHOICE REVIEW #2

1.2 Binomial Coefficients and Subsets

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved.

BAAN IVc/BaanERP. Conversion Guide Oracle7 to Oracle8

condition w i B i S maximum u i

Chapter 9. Pointers and Dynamic Arrays. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Avid Interplay Bundle

Lecture 5. Counting Sort / Radix Sort

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

CS 683: Advanced Design and Analysis of Algorithms

c) Did doing the transformations in a different order matter? Explain why?

ENGR Spring Exam 1

Name Date Hr. ALGEBRA 1-2 SPRING FINAL MULTIPLE CHOICE REVIEW #1

HP Media Center PC Getting Started Guide

2) Give an example of a polynomial function of degree 4 with leading coefficient of -6

K-NET bus. When several turrets are connected to the K-Bus, the structure of the system is as showns

CMPT 125 Assignment 2 Solutions

Copyright Hewlett-Packard Development Company, L.P.

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

Investigation Monitoring Inventory

Improving Template Based Spike Detection

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19

ANN WHICH COVERS MLP AND RBF

Reading. Subdivision curves and surfaces. Subdivision curves. Chaikin s algorithm. Recommended:

Performance Plus Software Parameter Definitions

Transcription:

Studet Activity : Fidig the Least Squares Regressio Lie By Explorig the Relatioship betwee Slope ad Residuals Objective: How does oe determie a best best-fit lie for a set of data? Eyeballig it may be a good place to start, but there is a more exact way. I this activity, you will ot oly fid a best best-fit lie for a data set, you will discover why it must be the best. What to Do: Figure 1: Fathom Case Table 1. Ope the Fathom file Parabolic_Path_LSRL.ftm. 2. I the Case Table (upper left corer of scree), iput the data set as give i Figure 1. 3. Notice the data poits are displayed i a scatter plot (upper right of scree) as i Figure 2. Also o the scatter plot, a blue horizotal lie has appeared that we ca use to fid a best-fit lie to model the data. Figure 2: Fathom Scatter Plot a. The geeral equatio for this lie ca be foud just below the scatter plot. What is it? Write the equatio usig x for mea(x) ad y for mea(y). b. What specific slope does the iitial horizotal (blue) lie have? c. Based o that slope, simplify the geeral liear equatio for the specific lie graphed.

4. Below the Case Table (Figure 1) o the Fathom scree is a slider (Figure 3) that chages the values of parameter b i the liear equatio. Try movig the slider to the left ad right. Figure 3: Slider for Parameter b a. What effect does chagig the value of b have o the blue lie? Why? b. By chagig b, ca the lie be vertically or horizotally traslated? c. By chagig b, what type of trasformatio does occur? 5. I order to fid a equatio for a lie, at least oe poit o the lie must be kow. For a best-fit lie, a reasoable poit to begi with is ( x, y ), also called the ceter of gravity for the data set. a. Why would the ceter of gravity be a good poit to iclude o a best-fit lie? b. Aroud what poit does the lie o the scatter plot appear to rotate whe the slope, b, is chaged with the slider? c. I the Fathom widow, a Summary Table as i Figure 4 displays statistics o the data, icludig the slope (b), x, y, the sum of the residuals, ad the sum of squared residuals. Based o the data s statistics, what are the specific coordiates of the poit of rotatio? Figure 4: Fathom Summary Table d. Lookig at the lie graphed o the scatter plot, do these coordiates appear to be correct? 2003, rev. 2008 J. Reihardt & J. Simos 2

e. Now look closely at the Case Table as i Figure 5 that cotais the data poits. It also icludes other useful iformatio: YFitted values ad Residual values. How are the YFitted values also kow as predicted values determied? Figure 5: Fathom Case Table f. How are the Residuals determied? What is a residual? 6. Kowig oly oe poit (i this case, the ceter of gravity) is ot sufficiet iformatio to determie ay lie, much less a best-fit lie. We also eed to fid the best slope to fit the data. a. Adjust the slope of the lie by usig the slider to chage the value of b. Try to fid a lie that is a good fit to the data. Record the value of the slope (b) for this iitial estimate. b. How did you decide what the slope of the best-fit lie should be? c. Do you thik someoe else would choose the same slope as you? Why or why ot? 7. Is there a more accurate method for determiig the best slope? Go to the Object meu ad select Show Hidde Objects as i Figure 6. A graph of a fuctio appears. a. What is the explaatory (idepedet) variable of this fuctio? [Hit: How are the axes labeled?] Figure 6: Fathom s Object Meu b. What is the respose (depedet) variable? 2003, rev. 2008 J. Reihardt & J. Simos 3

c. What type of fuctio models the relatioship betwee these two variables? d. What is the shape of this fuctio s graph? 8. Try adjustig the slope (b) of the lie oce agai. This time otice how the mysterious poit o the parabola moves i respose. This poit ad the slope of our best-fit lie are coected. a. Adjust the slope to move the mysterious poit closer to the vertex of the parabola. What effect does this have o how well the lie fits the data? b. Use the parabola ad its vertex to determie the best slope for your liear model. Oce you are satisfied with your choice, record the coordiate values of the mysterious poit to the earest 2 decimal places. (Refer to the Summary Table for values.) c. At the vertex poit, otice that the respose variable has a miimum value. Cosiderig the meaig of the respose variable, why is the vertex helpful i determiig the slope of our best-fit lie? 9. Determie a equatio for the best-fit lie for the data set. a. Use the ceter of gravity ad the slope foud with the parabola to write a equatio for the best-fit lie i poit-slope form, that is, y " y 1 = b( x " x 1 ). b. Rewrite your equatio above i slope-itercept form, y = bx + a. c. The type of best-fit lie that foud i this activity is kow as the Least Squares Regressio Lie, or LSRL, for short. Why is it referred to by this ame? 2003, rev. 2008 J. Reihardt & J. Simos 4

10. Now that we have determied the LSRL, we ca check it with the LSRL as computed by Fathom. a. Highlight (click o) the scatter plot. The select Least-Squares Lie from the Graph meu. Fathom graphs the LSRL (gree lie) o the scatter plot with ours (blue lie). Are they close? What is Fathom s LSRL equatio, ad how does it compare to ours? b. Retur to the Graph meu ad select the Show Squares optio. Notice that squares coected to the LSRL appear as i Figure 7. What do these squares represet? Ad what does the total sum of the areas of these squares represet? Figure 7: Fathom Scatter Plot ad LSRL c. Below the equatio for Fathom s LSRL is a value for its Sum of squares. What is this value? Where ca this value be foud o the graph of the parabola? d. Chage the value of the slope (b) of the lie ad otice the effect o the size of the squares. To determie a best best-fit lie for a data set, we foud a special lie through the ceter of gravity that also does what to the sum of the areas of the gree squares? 11. What two primary characteristics must a Least Squares Regressio Lie (a best-fit lie) have to model a set of data, ad why is each characteristic sigificat? 2003, rev. 2008 J. Reihardt & J. Simos 5

Extesios 1. Why is there a quadratic relatioship betwee the slope of a best-fit lie ad the sum of the squared residuals? Ivestigate this relatioship i the followig: a. Use a lie i the form y = b( x! x ) + y that cotais the ceter of gravity, x = 0.8 ad y =1.0. Fid the residual i terms of b for each of the five data poits i the activity. (It may help to orgaize the results i a table, ad you may wat to collaborate with others.) b. Square each residual. The fid the sum of the squared residuals i terms of b (the slope). c. What type of fuctio describes the relatioship betwee the slope ad the sum of squared residuals for a lie that icludes the ceter of gravity? d. Verify that the geeral quadratic fuctio (give i the parabola s widow i Fathom ad also below) works for the data set i the activity.!(residuals) 2 =!(y i " y) 2 " 2b #! y i (x i " x) + b 2 #!(x i " x) 2 e. Show that the above holds true for ay geeral data set. 2. I the activity, we determied a LSRL lie for a data set of five specific poits. What if the data set chaged? a. How does chagig a poit i the data set affect the quadratic relatioship betwee the slope ad the sum of the squared residuals for a possible best-fit lie? Choose ay oe of the five data poits ad drag it aroud to ew locatios. I what ways does this affect the parabola used to determie the LSRL? How ca the effects o the parabola caused by chagig a poit i the data set be explaied i terms of the parabola s equatio? b. How does chagig a poit i the data set affect the Least Squares Regressio Lie? Highlight (click o) the scatter plot. The uder the Graph meu, de-select Show Squares. Choose ay oe of the five data poits ad drag it aroud to ew locatios. I what ways does this affect the LSRL (the gree lie i the Fathom widow)? Specifically, how does chagig a poit i the data set affect each of the two defiig characteristics of the Least Squares Regressio Lie? 2003, rev. 2008 J. Reihardt & J. Simos 6