ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Descriptive Statistics


1. If, in the set of values { 1, 2, 3, 4, 5, 6, 7 }, an error causes the value 5 to be replaced by 50,
(a) what effect will this change have on the median value?
(b) what effect will this change have on the mean value?
(c) what effect will this change have on the mode?
(d) which of mean and median is the better measure of location for this changed data set and why?

2. The total scores obtained on a pair of biased ("loaded") dice when they were thrown 00 times are summarized in the frequency table below:

Score x / Frequency f (two column pairs): 8 7 3 0 9 4 0 0 5 37 6 5 7 5 Total: 00

(a) Display this information on a bar chart.
(b) Identify the mode.
(c) Construct the cumulative frequency table and hence find the median.
(d) Find the arithmetic mean.
(e) Find the sample variance.
(f) Comment on any evidence for skew in these data.
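The calculations asked for in Question 2 (mode, median via cumulative frequencies, mean and sample variance) can be sketched in Python as below. This is only an illustration: the function name `frequency_summary` and the small frequency table are hypothetical and are not the dice data from the question.

```python
def frequency_summary(table):
    """Summarise a frequency table given as {value: frequency}.

    Hypothetical helper for illustration; not part of the problem set.
    """
    n = sum(table.values())

    # Mode: the value with the largest frequency.
    mode = max(table, key=table.get)

    # Cumulative frequency table, in increasing order of value.
    cumulative = []
    running = 0
    for x in sorted(table):
        running += table[x]
        cumulative.append((x, running))

    def value_at(position):
        # First value whose cumulative frequency reaches the position.
        return next(x for x, c in cumulative if c >= position)

    # Median: middle observation, or the average of the two middle ones.
    if n % 2 == 1:
        median = value_at((n + 1) // 2)
    else:
        median = (value_at(n // 2) + value_at(n // 2 + 1)) / 2

    mean = sum(x * f for x, f in table.items()) / n

    # Sample variance uses the divisor n - 1.
    variance = sum(f * (x - mean) ** 2 for x, f in table.items()) / (n - 1)

    return {"n": n, "mode": mode, "median": median,
            "mean": mean, "variance": variance, "cumulative": cumulative}


# Made-up table, purely to show the mechanics.
example = {1: 2, 2: 5, 3: 8, 4: 4, 5: 1}
summary = frequency_summary(example)
print(summary)
```

Comparing the mean with the median in the returned summary gives a first, rough indication of skew.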

3. The grades received by an engineering class in a certain course are as shown in the frequency table below:

Grade   Frequency
A       34
B       47
C       50
D       8
F       6

Display this information graphically in the form of
(a) a bar chart
(b) a pie chart
Show the calculation for the angle of any two segments of the pie chart.

In questions 4 to 7 below, use Minitab (or some other software package) to answer the questions. If you do not use Minitab, then state what software package you have used.

4. For the following data set, (also available as a plain text file here),

.035.545 6.3796 0.6863.498 9.400 8.008 9.3688 7.084.353 7.674.0376.3456.4693.637 3.8840 3.436.4395 9.060 0.385.345 9.0963 9.9664 0.0884 0.689 0.857.53 8.98 8.8498 0.54.3870 7.876 0.64 0.064 7.938 9.403.544 8.3797.705 9.957

(a) create a printout of Minitab's standard Descriptive Statistics output, including the default bar chart with superimposed normal graph and the default boxplot, (as was demonstrated in the Minitab tutorial), (or provide equivalent information from some other software package).
(b) What evidence do you see for skewness in these data?
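For the pie chart in Question 3, each segment's angle is its share of the total frequency multiplied by 360 degrees. A minimal sketch follows; the function name `pie_angles` and the counts are hypothetical, chosen only to keep the arithmetic easy to follow, and are not the class grades from the question.

```python
def pie_angles(frequencies):
    """Return the pie-chart angle in degrees for each category.

    angle = 360 * frequency / total frequency
    """
    total = sum(frequencies.values())
    return {label: 360 * f / total for label, f in frequencies.items()}


# Made-up counts for illustration only.
counts = {"W": 10, "X": 20, "Y": 6, "Z": 4}
angles = pie_angles(counts)
for label, angle in angles.items():
    print(f"{label}: {angle:.1f} degrees")
```

A quick sanity check on any such calculation is that the angles sum to 360 degrees.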

5. For the following data set of 100 values, (also available as a plain text file here),

.8679 3.03009 6.40883 4.33369 0.63779 0.5385 0.4579 3.079.38530 4.67676.7304.7739 0.854.85599.8534.7757.8583 0.65357 0.4.97.47675.7943 0.66736.5375 3.759.8378 0.790.60064.8358.67403.03660 0.50900.0876.59330 0.969 0.760.6550 0.53473.4 0.67745 3.68679 5.63466 4.460 0.63746.00497.4397.05.760.394.5488.758.878.0864.436.549.36957 3.34404 4.357 0.8697.300 0.66336 3.653.769.94.6554.56736 0.84466 0.4495.48484 4.6585 5.37489.8596.67463 0.87603.675.57 0.68.85488 3.8630 0.6538 0.7766 0.970.0063 0.99977.6056.0060.06657.938 0.8605.809.9997.944.58438 0.94377 0.33508.94735.83459.8873.7406.6448

(a) create a printout of Minitab's standard Descriptive Statistics output, (or provide equivalent information from some other software package).
(b) construct a standard boxplot, oriented horizontally, with gridlines at intervals of 0.5 units.
(c) identify any outliers (list their values).
(d) construct a histogram, using as class boundaries the consecutive integers, from 0 to the next integer above the largest observed value.
(e) What evidence do you see for skewness in these data?

6. For the following data set of 30 values, (also available as a plain text file here),

0.957438 0.66777 0.69579 0.53556 0.989805 0.740677 0.837656 0.8593 0.97656 0.789 0.930773 0.945 0.96407 0.99488 0.90530 0.98569 0.658793 0.88450 0.978 0.99899 0.93477 0.905575 0.856455 0.7894 0.836906 0.89483 0.5985 0.848346 0.90458 0.96747

(a) create a printout of Minitab's standard Descriptive Statistics output, (or provide equivalent information from some other software package).
(b) construct a standard boxplot and add a symbol to indicate the location of the arithmetic mean.
(c) identify any outliers (list their values).
(d) construct a histogram, class widths of 0., from 0 to.
(e) What evidence do you see for skewness in these data?
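Where Questions 5 and 6 ask for outliers, a standard boxplot conventionally flags values lying more than 1.5 interquartile ranges beyond the quartiles. The sketch below assumes that convention and uses Python's `statistics.quantiles` with its default "exclusive" method, which places quartiles at positions based on n + 1; whether this matches a given software package's quartile definition exactly is an assumption to check. The function name `iqr_outliers` and the data are hypothetical.

```python
from statistics import quantiles


def iqr_outliers(data):
    """Return values outside the 1.5 * IQR fences (boxplot convention)."""
    # quantiles(..., n=4) returns the three quartile cut points Q1, Q2, Q3.
    q1, _, q3 = quantiles(data, n=4)
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return [x for x in data if x < lower_fence or x > upper_fence]


# Made-up right-skewed sample, for illustration only.
sample = [0.2, 0.5, 0.6, 0.8, 1.1, 1.3, 1.7, 2.4, 6.9]
outliers = iqr_outliers(sample)
print(outliers)
```

Different packages interpolate quartiles slightly differently, so a value very close to a fence may be flagged by one package and not by another.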

7. For the following data set of 60 values, (also available as a plain text file here),

7 6 43 54 54 48 48 59 55 6 50 55 30 66 4 55 48 57 6 48 46 6 30 50 66 73 54 48 66 6 45 57 48 70 68 43 5 50 46 64 46 50 50 50 48 37 45 53 64 50 39 3 66 68 4 70 48 73 39 43

(a) construct a frequency bar chart, with classes of width 5 and centres at { 32, 37, 42, 47, ..., 67, 72 }.
(b) create a printout of Minitab's standard Descriptive Statistics output, but display only the number count, mean, standard deviation, median and quartiles, (or provide equivalent information from some other software package).
(c) identify the modal class and the median class from your bar chart.
(d) use the grouped data (from the bar chart) to calculate the mean, the population standard deviation and the sample standard deviation (you may find this easier to do in a spreadsheet program such as Microsoft Excel).
(e) Why are the mean and standard deviation that you calculated in part (d) different from the Minitab values?

8. Problem Set 1 Bonus Question, Descriptive Statistics

Prove that, for any real constant $a \neq \bar{x}$,

$$\sum_{i=1}^{n} (x_i - \bar{x})^2 < \sum_{i=1}^{n} (x_i - a)^2$$

Hint: Use the identities $\sum_{i=1}^{n} k = nk$ (for any constant $k$) and $\sum_{i=1}^{n} x_i = n\bar{x}$.
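The grouped-data calculation in Question 7(d) treats every observation in a class as if it sat at the class centre. A minimal Python sketch of that calculation is given below as an alternative to a spreadsheet; the function name `grouped_statistics` and the class centres and frequencies are hypothetical, not the bar chart from the question.

```python
from math import sqrt


def grouped_statistics(centres, frequencies):
    """Mean, population SD and sample SD from grouped data.

    Each observation is approximated by the centre of its class.
    """
    n = sum(frequencies)
    mean = sum(c * f for c, f in zip(centres, frequencies)) / n
    # Sum of squared deviations of the class centres, weighted by frequency.
    ss = sum(f * (c - mean) ** 2 for c, f in zip(centres, frequencies))
    population_sd = sqrt(ss / n)        # divisor n
    sample_sd = sqrt(ss / (n - 1))      # divisor n - 1
    return mean, population_sd, sample_sd


# Made-up classes for illustration only.
mean, pop_sd, samp_sd = grouped_statistics([10, 20, 30], [2, 5, 3])
print(mean, pop_sd, samp_sd)
```

Because class centres stand in for the raw values, results computed this way will generally differ a little from statistics computed on the ungrouped data.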

Additional Note for Question 8:

It then follows that, for any random sample of size $n$ drawn from a population of true mean $\mu$,

$$\sum_{i=1}^{n} (x_i - \bar{x})^2 \le \sum_{i=1}^{n} (x_i - \mu)^2$$

(with equality only in the very unlikely event that $\bar{x} = \mu$).

Recall that $\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^2$ (where there are $N$ members in the entire population).

One can then speculate [correctly] that, on average, $\frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^2 < \sigma^2$. The quantity $\frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^2$ is said to be a biased estimate of $\sigma^2$, in that it underestimates the true value of $\sigma^2$ on average. The bias disappears when this variance formula is replaced by the sample variance $s^2$. In the section on estimators we shall see a proof that $s^2$ is an unbiased estimate of $\sigma^2$.
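The bias described in this note can be illustrated (not proved) by simulation. The sketch below makes several assumptions that are not in the problem set: a standard normal population (so $\sigma^2 = 1$), samples of size 5, 20000 repetitions, and a fixed seed. The function name `average_variance_estimates` is hypothetical. It compares the average of the divisor-$n$ formula with the average of the sample variance $s^2$, which uses the divisor $n - 1$.

```python
import random
from statistics import fmean


def average_variance_estimates(n=5, trials=20000, seed=0):
    """Average the divisor-n and divisor-(n-1) variance estimates.

    Samples are drawn from a standard normal population (an assumption
    made for this illustration), whose true variance is 1.
    """
    rng = random.Random(seed)
    biased, unbiased = [], []
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        xbar = fmean(sample)
        ss = sum((x - xbar) ** 2 for x in sample)
        biased.append(ss / n)          # underestimates sigma^2 on average
        unbiased.append(ss / (n - 1))  # sample variance s^2
    return fmean(biased), fmean(unbiased)


avg_biased, avg_unbiased = average_variance_estimates()
print(f"divisor n:     {avg_biased:.3f}")
print(f"divisor n - 1: {avg_unbiased:.3f}")
```

With these settings the divisor-$n$ average should settle near $(n-1)/n = 0.8$ and the $s^2$ average near 1, which is consistent with the claim in the note.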