Statistical Methods for NLP LT 2202
|
|
- Brittney Benson
- 5 years ago
- Views:
Transcription
1 LT 2202 Lecture 5 Statistical inference January 31, 2012
2 Summary of lecture 4 Probabilities and statistics in Python Scipy Matplotlib Descriptive statistics Random sample Sample mean Sample variance and standard deviation Plotting sample distribution: histograms Correlation
3 pmf and cdf in Python Import the uniform distribution (randint): from scipy.stats import randint What is the probability of a die rolling 4? randint.pmf(4, 1, 7) What is the probability of rolling 4 or below? randint.cdf(4, 1, 7) What is the probability of rolling between 2 and 4? randint.cdf(4, 1, 7) - randint.cdf(1, 1, 7)
4 One additional note: quantiles (ppf) Document classifier with error rate of 0.20 applied to 100 documents Import the binomial distribution (binom): from scipy.stats import binom 13 errors or less? binom.cdf(13, 100, 0.20) gives errors or less? binom.cdf(14, 100, 0.20) gives Smallest k so that P(X k) is at least 0.05? binom.ppf(0.05, 100, 0.20) gives 14
5 The pmf
6 Example: human height The average height of a Swedish female (16-84 years) is cm The standard deviation (σ) is 7 cm Assume normal distribution Import the normal distribution (norm): from scipy.stats import norm Probability of being at most 154 cm tall? norm.cdf(154, 165.5, 7) gives 0.05 How short are the shortest 5% of the population? norm.ppf(0.05, 165.5, 7) gives 154
7 Statistical inference: overview Given a random sample, how do we estimate some parameter of the distribution? What is the error rate of my tagger? What is the probability of the word green? determine some interval that is very likely to contain the true value? 95% confidence interval for the error rate test some hypothesis about the parameter? Is the error rate greater than 0.03? Is the error rate of tagger A greater than that of tagger B?
8 Random sample A random sample is a set of values generated by some random variable Typically generated by carrying out some repeated experiment Examples: Running a tagger on a text and counting errors Word and sentence lengths in a corpus
9 Random sample: formally Let s assume we have a random variable X Definition: a sample variable for X is a set of independent variables X 1,..., X n with the same distribution as X Definition: a random sample x 1,..., x n of X is a possible outcome of the sample variable X 1,..., X n
10 Random sample: example Let s assume we have a random variable: X = the roll of a die Sample variable: X 1,..., X 3 = the rolls of 3 dice A possible random sample: 5, 1, 4
11 Point estimates Given a dataset, how do we estimate some parameter of the random variable that generated the data? An estimator is a function that guesses a parameter value given a dataset
12 Maximum likelihood estimates There are many ways to construct estimators Most common: the maximum likelihood method: Select the parameter value that maximizes the probability of the data
13 Maximum likelihood estimates Select the parameter value that maximizes the probability of the data Mathematically, define the likelihood function L(p) like this: L( p) = P( x1,..., xn p) = P( x1 p)... P( xn p) Then find the p* that maximizes L(p) We ll now look at one special case
14 Maximum likelihood estimation of the probability of an event We carry out an experiment n times, and we get a positive outcome x times How do we estimate the probability p of a positive outcome?
15 Estimating the parameter This is a binomial distribution with parameters n and p Maximum likelihood estimation: find the p* that makes x most likely Maximize L( p) n x x n x = P( x p) = p (1 p)
16 Estimating the parameter Maximize L( p) n x x n x = P( x p) = p (1 p) It can be shown that the ML estimation is * p = x n
17 Intuition: move the bubble For instance: 100 documents, 8 errors Move the bubble to maximize the probability of 8 errors
18 ML estimation of word probabilities We observe the words in a corpus of 1,173,766 words: the: 50,975 times big: 559 times dog: 10 times Assuming a unigram model: what is the probability of the? ML estimation: p_the = /
19 The probabilities of rare events Sentence probability in the unigram model: P(the big dog) = P(the)P(big)P(dog)= = * * P(the big donut) = P(the)P(big)P(donut)= = * * 0
20 Laplace s law: add one to all counts Vocabulary size: 49,206 words P(the) = ( )/( ) P(big) = ( )/( ) P(dog) = ( )/( ) P(donut) = (0 + 1)/( ) P(the big donut) = P(the)P(big)P(donut)= = * *
21 Evaluating performance When evaluating NLP systems, several performance measures can be interpreted as probabilities. error rate = 1 - accuracy Precision / recall False positive rate / true positive rate We estimate all these using ML
22 Performance measures as probabilities Error rate = P(error), accuracy = P(correct) Precision = P(positive guess positive) Recall = TPR = P(guess positive positive) FPR = P(guess positive negative)
23 ML estimates of performance measures Error rate = P(error) MLE: #errors / # tests Precision = P(positive guess positive) MLE: #positive and guess positive / #guess positive Recall = TPR = P(guess positive positive) MLE: #positive and guess positive / #positive
24 Interval estimates If our estimator gives us a value of a parameter: how close is it to the true value? Definition: a confidence interval for the parameter θ with significance value α is an interval [θ 1, θ 2 ] so that P θ θ θ ) ( 1 2 α Example: error rate between 0.05 and 0.08 with 95% probability
25 The distribution of our estimator Our estimator applied to randomly selected samples has a distribution Depends on the sample size
26 Estimator distribution / sample size
27 Computing a confidence interval If we have made a point estimate θ*, how can we compute an interval that contains the true θ with 95% probability? Impractical for most distributions We ll give an approximate method for the case of error/success rates
28 Cookbook method for error rate confidence interval Pretend that binomial is normal The true variance is p(1-p)/n; use p* instead Then we can use the following approximate confidence interval: σ * I p = p ± = * z α σ * * p (1 p ) n *
29 Explanation of formula * I p = p ± * z α σ σ = * * p (1 p ) n * z α is the value such that P( z z ) α < X < α = if X is normally distributed α
30 Normal quantile in Python z α is is the value such that P( z z ) α < X < α = α If X is normally distributed In Python: z x : norm.ppf(1-(1-x)/2)) z 0.95 : norm.ppf(0.975) z 0.99 : norm.ppf(0.995) z : norm.ppf(0.9995)
31 Example in Python Assume we test on n=10,000 documents and make n err =745 errors. p_mle = n_err/n sd_est = math.sqrt(p_mle*(1-p_mle)/n) z95 = norm.ppf(0.975) p_upper = p_mle + z95*sd_est p_lower = p_mle - z95*sd_est
32 Comparing performance measurements If we evaluate two NLP tools and get the estimated error rates p* 1 and p* 2 How can we say that a difference is not due to chance? We distinguish two cases: Estimated on different test sets Estimated on the same test set
33 Performance estimated on different test sets Define d* as the difference between the estimated error rates p* 1 and p* 2 Now we give a cookbook method for computing a confidence interval I d for d If I d does not include 0, we can say that the difference is real
34 Confidence interval for the difference σ * * * * * p1 (1 p1 ) p2 (1 p2 ) = + n 1 n 2 * I d = d ± * z α σ
35 Example in Python Assume we test tagger 1 on n 1 =2,000 documents and make x 1 =67 errors, and tagger 2 n 2 =1,500 and make x 2 =68 errors n1 = ; n2 = ; x1 = 67.0; x2 = 68.0 p1_mle = x1/n1; p2_mle = x2/n2; d_mle = p1_mle-p2_mle sd_est = math.sqrt(p1_mle*(1-p1_mle)/n1 + p2_mle*(1- p2_mle)/n2) z95 = norm.ppf(0.975) d_upper = d_mle + z95*sd_est d_lower = d_mle - z95*sd_est
36 Performance estimated on the same test set There are many such tests We ll present one of the simplest: McNemar s test:
37 McNemar s test Make a 2x2 contingency table: System 1 OK System 1 error System2 OK a c System 2 error b d We are interested in the differences: b and c
38 McNemar s test Form the test quantity h: ( b c) b + c If h > threshold, we have a significant difference: threshold In Python (α = significance level): from scipy.stats import chi2 threshold = chi2.ppf(alpha, 1) h = 2 = χ α (1) 2
39 Example System 1 OK System 1 error System 2 OK System 2 error Err rate 1 = 0.10, err rate 2 = 0.13 Significant difference? We form the test quantity: h = ( b c) b + c 2 = (46 29) = 3.85
40 McNemar s test in Python from scipy.stats import chi2 b = 46.0; c = 29.0 alpha = 0.95 threshold = chi2.ppf(alpha, 1) h = (b-c)*(b-c)/(b+c) if h > threshold: print 'Significant at level', alpha Threshold = 3.84, h = 3.85!
41 Summary Point estimates: Given a dataset, how do I estimate my parameters? Interval estimates: Given a dataset, how do I compute an interval likely to contain true value? Comparing performance estimates
Chapter 6 Normal Probability Distributions
Chapter 6 Normal Probability Distributions 6-1 Review and Preview 6-2 The Standard Normal Distribution 6-3 Applications of Normal Distributions 6-4 Sampling Distributions and Estimators 6-5 The Central
More informationMetrics for Performance Evaluation How to evaluate the performance of a model? Methods for Performance Evaluation How to obtain reliable estimates?
Model Evaluation Metrics for Performance Evaluation How to evaluate the performance of a model? Methods for Performance Evaluation How to obtain reliable estimates? Methods for Model Comparison How to
More informationBiostatistics & SAS programming. Kevin Zhang
Biostatistics & SAS programming Kevin Zhang February 27, 2017 Random variables and distributions 1 Data analysis Simulation study Apply existing methodologies to your collected samples, with the hope to
More informationData Mining Classification: Bayesian Decision Theory
Data Mining Classification: Bayesian Decision Theory Lecture Notes for Chapter 2 R. O. Duda, P. E. Hart, and D. G. Stork, Pattern classification, 2nd ed. New York: Wiley, 2001. Lecture Notes for Chapter
More informationHomework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in class hard-copy please)
Virginia Tech. Computer Science CS 5614 (Big) Data Management Systems Fall 2014, Prakash Homework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in
More informationClassification. Instructor: Wei Ding
Classification Part II Instructor: Wei Ding Tan,Steinbach, Kumar Introduction to Data Mining 4/18/004 1 Practical Issues of Classification Underfitting and Overfitting Missing Values Costs of Classification
More informationUse of Extreme Value Statistics in Modeling Biometric Systems
Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision
More informationToday. Lecture 4: Last time. The EM algorithm. We examine clustering in a little more detail; we went over it a somewhat quickly last time
Today Lecture 4: We examine clustering in a little more detail; we went over it a somewhat quickly last time The CAD data will return and give us an opportunity to work with curves (!) We then examine
More informationWill Monroe July 21, with materials by Mehran Sahami and Chris Piech. Joint Distributions
Will Monroe July 1, 017 with materials by Mehran Sahami and Chris Piech Joint Distributions Review: Normal random variable An normal (= Gaussian) random variable is a good approximation to many other distributions.
More informationMetrics Overfitting Model Evaluation Research directions. Classification. Practical Issues. Huiping Cao. lassification-issues, Slide 1/57
lassification-issues, Slide 1/57 Classification Practical Issues Huiping Cao lassification-issues, Slide 2/57 Outline Criteria to evaluate a classifier Underfitting and overfitting Model evaluation lassification-issues,
More informationChapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea
Chapter 3 Bootstrap 3.1 Introduction The estimation of parameters in probability distributions is a basic problem in statistics that one tends to encounter already during the very first course on the subject.
More informationStatistics I 2011/2012 Notes about the third Computer Class: Simulation of samples and goodness of fit; Central Limit Theorem; Confidence intervals.
Statistics I 2011/2012 Notes about the third Computer Class: Simulation of samples and goodness of fit; Central Limit Theorem; Confidence intervals. In this Computer Class we are going to use Statgraphics
More informationRobust Linear Regression (Passing- Bablok Median-Slope)
Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their
More informationDistributions of Continuous Data
C H A P T ER Distributions of Continuous Data New cars and trucks sold in the United States average about 28 highway miles per gallon (mpg) in 2010, up from about 24 mpg in 2004. Some of the improvement
More informationExcel 2010 with XLSTAT
Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with
More informationLearning Objectives. Continuous Random Variables & The Normal Probability Distribution. Continuous Random Variable
Learning Objectives Continuous Random Variables & The Normal Probability Distribution 1. Understand characteristics about continuous random variables and probability distributions 2. Understand the uniform
More informationBluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition
Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created
More informationCS 237 Fall 2018, Homework 08 Solution
CS 237 Fall 2018, Homework 08 Solution Due date: Thursday November 8th at 11:59 pm (10% off if up to 24 hours late) via Gradescope General Instructions Please complete this notebook by filling in solutions
More informationIQR = number. summary: largest. = 2. Upper half: Q3 =
Step by step box plot Height in centimeters of players on the 003 Women s Worldd Cup soccer team. 157 1611 163 163 164 165 165 165 168 168 168 170 170 170 171 173 173 175 180 180 Determine the 5 number
More informationUnit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users
BIOSTATS 640 Spring 2018 Review of Introductory Biostatistics STATA solutions Page 1 of 13 Key Comments begin with an * Commands are in bold black I edited the output so that it appears here in blue Unit
More informationChapter 6 Evaluation Metrics and Evaluation
Chapter 6 Evaluation Metrics and Evaluation The area of evaluation of information retrieval and natural language processing systems is complex. It will only be touched on in this chapter. First the scientific
More informationThe Power and Sample Size Application
Chapter 72 The Power and Sample Size Application Contents Overview: PSS Application.................................. 6148 SAS Power and Sample Size............................... 6148 Getting Started:
More informationR Programming Basics - Useful Builtin Functions for Statistics
R Programming Basics - Useful Builtin Functions for Statistics Vectorized Arithmetic - most arthimetic operations in R work on vectors. Here are a few commonly used summary statistics. testvect = c(1,3,5,2,9,10,7,8,6)
More informationIntroductory Applied Statistics: A Variable Approach TI Manual
Introductory Applied Statistics: A Variable Approach TI Manual John Gabrosek and Paul Stephenson Department of Statistics Grand Valley State University Allendale, MI USA Version 1.1 August 2014 2 Copyright
More informationEvaluating Machine Learning Methods: Part 1
Evaluating Machine Learning Methods: Part 1 CS 760@UW-Madison Goals for the lecture you should understand the following concepts bias of an estimator learning curves stratified sampling cross validation
More information10.4 Linear interpolation method Newton s method
10.4 Linear interpolation method The next best thing one can do is the linear interpolation method, also known as the double false position method. This method works similarly to the bisection method by
More information8. MINITAB COMMANDS WEEK-BY-WEEK
8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are
More informationMEASURING CLASSIFIER PERFORMANCE
MEASURING CLASSIFIER PERFORMANCE ERROR COUNTING Error types in a two-class problem False positives (type I error): True label is -1, predicted label is +1. False negative (type II error): True label is
More informationUnit 7 Statistics. AFM Mrs. Valentine. 7.1 Samples and Surveys
Unit 7 Statistics AFM Mrs. Valentine 7.1 Samples and Surveys v Obj.: I will understand the different methods of sampling and studying data. I will be able to determine the type used in an example, and
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Features and Patterns The Curse of Size and
More informationCross-validation and the Bootstrap
Cross-validation and the Bootstrap In the section we discuss two resampling methods: cross-validation and the bootstrap. These methods refit a model of interest to samples formed from the training set,
More informationInstall RStudio from - use the standard installation.
Session 1: Reading in Data Before you begin: Install RStudio from http://www.rstudio.com/ide/download/ - use the standard installation. Go to the course website; http://faculty.washington.edu/kenrice/rintro/
More informationMiddle School Math Course 2
Middle School Math Course 2 Correlation of the ALEKS course Middle School Math Course 2 to the Indiana Academic Standards for Mathematics Grade 7 (2014) 1: NUMBER SENSE = ALEKS course topic that addresses
More informationDATA MINING AND MACHINE LEARNING. Lecture 6: Data preprocessing and model selection Lecturer: Simone Scardapane
DATA MINING AND MACHINE LEARNING Lecture 6: Data preprocessing and model selection Lecturer: Simone Scardapane Academic Year 2016/2017 Table of contents Data preprocessing Feature normalization Missing
More informationConfidence Intervals: Estimators
Confidence Intervals: Estimators Point Estimate: a specific value at estimates a parameter e.g., best estimator of e population mean ( ) is a sample mean problem is at ere is no way to determine how close
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 14: Introduction to hypothesis testing (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 10 Hypotheses 2 / 10 Quantifying uncertainty Recall the two key goals of inference:
More informationTesting Random- Number Generators
Testing Random- Number Generators Raj Jain Washington University Saint Louis, MO 63131 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse574-06/ 27-1 Overview
More informationLecture Notes 3: Data summarization
Lecture Notes 3: Data summarization Highlights: Average Median Quartiles 5-number summary (and relation to boxplots) Outliers Range & IQR Variance and standard deviation Determining shape using mean &
More information1 Overview of Statistics; Essential Vocabulary
1 Overview of Statistics; Essential Vocabulary Statistics: the science of collecting, organizing, analyzing, and interpreting data in order to make decisions Population and sample Population: the entire
More informationCS839: Probabilistic Graphical Models. Lecture 10: Learning with Partially Observed Data. Theo Rekatsinas
CS839: Probabilistic Graphical Models Lecture 10: Learning with Partially Observed Data Theo Rekatsinas 1 Partially Observed GMs Speech recognition 2 Partially Observed GMs Evolution 3 Partially Observed
More informationSelected Introductory Statistical and Data Manipulation Procedures. Gordon & Johnson 2002 Minitab version 13.
Minitab@Oneonta.Manual: Selected Introductory Statistical and Data Manipulation Procedures Gordon & Johnson 2002 Minitab version 13.0 Minitab@Oneonta.Manual: Selected Introductory Statistical and Data
More informationThe first few questions on this worksheet will deal with measures of central tendency. These data types tell us where the center of the data set lies.
Instructions: You are given the following data below these instructions. Your client (Courtney) wants you to statistically analyze the data to help her reach conclusions about how well she is teaching.
More informationStatsMate. User Guide
StatsMate User Guide Overview StatsMate is an easy-to-use powerful statistical calculator. It has been featured by Apple on Apps For Learning Math in the App Stores around the world. StatsMate comes with
More informationCh6: The Normal Distribution
Ch6: The Normal Distribution Introduction Review: A continuous random variable can assume any value between two endpoints. Many continuous random variables have an approximately normal distribution, which
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationLab 5 - Risk Analysis, Robustness, and Power
Type equation here.biology 458 Biometry Lab 5 - Risk Analysis, Robustness, and Power I. Risk Analysis The process of statistical hypothesis testing involves estimating the probability of making errors
More informationObjective 1: To simulate the rolling of a die 100 times and to build a probability distribution.
Minitab Lab #2 Math 120 Nguyen 1 of 6 Objectives: 1) Simulate games of chance that have equally likely outcomes 2) Construct a binomial probability distribution and sketch a probability histogram 3) Calculate
More informationFall 09, Homework 5
5-38 Fall 09, Homework 5 Due: Wednesday, November 8th, beginning of the class You can work in a group of up to two people. This group does not need to be the same group as for the other homeworks. You
More informationSPSS Basics for Probability Distributions
Built-in Statistical Functions in SPSS Begin by defining some variables in the Variable View of a data file, save this file as Probability_Distributions.sav and save the corresponding output file as Probability_Distributions.spo.
More informationWeek 4: Simple Linear Regression III
Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of
More informationMendel and His Peas Investigating Monhybrid Crosses Using the Graphing Calculator
20 Investigating Monhybrid Crosses Using the Graphing Calculator This activity will use the graphing calculator s random number generator to simulate the production of gametes in a monohybrid cross. The
More informationDiscrete Mathematics Course Review 3
21-228 Discrete Mathematics Course Review 3 This document contains a list of the important definitions and theorems that have been covered thus far in the course. It is not a complete listing of what has
More informationMiddle School Math Course 2. Correlation of the ALEKS course Middle School Math Course 2 to the Common Core State Standards for Grade 7 (2010)
Middle School Math Course 2 Correlation of the ALEKS course Middle School Math Course 2 to the Common Core State Standards for Grade 7 (2010) 7.RP: Ratios & Proportional Relationships 7.RP.A.1: 7.RP.A.2:
More informationCSE 417T: Introduction to Machine Learning. Lecture 6: Bias-Variance Trade-off. Henry Chai 09/13/18
CSE 417T: Introduction to Machine Learning Lecture 6: Bias-Variance Trade-off Henry Chai 09/13/18 Let! ", $ = the maximum number of dichotomies on " points s.t. no subset of $ points is shattered Recall
More informationIntroduction to Geospatial Analysis
Introduction to Geospatial Analysis Introduction to Geospatial Analysis 1 Descriptive Statistics Descriptive statistics. 2 What and Why? Descriptive Statistics Quantitative description of data Why? Allow
More informationCHAPTER 6. The Normal Probability Distribution
The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit
More informationA Brief Look at Optimization
A Brief Look at Optimization CSC 412/2506 Tutorial David Madras January 18, 2018 Slides adapted from last year s version Overview Introduction Classes of optimization problems Linear programming Steepest
More informationBootstrap confidence intervals Class 24, Jeremy Orloff and Jonathan Bloom
1 Learning Goals Bootstrap confidence intervals Class 24, 18.05 Jeremy Orloff and Jonathan Bloom 1. Be able to construct and sample from the empirical distribution of data. 2. Be able to explain the bootstrap
More informationMinitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.
Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see
More informationNetwork Traffic Measurements and Analysis
DEIB - Politecnico di Milano Fall, 2017 Sources Hastie, Tibshirani, Friedman: The Elements of Statistical Learning James, Witten, Hastie, Tibshirani: An Introduction to Statistical Learning Andrew Ng:
More informationDecision Trees Dr. G. Bharadwaja Kumar VIT Chennai
Decision Trees Decision Tree Decision Trees (DTs) are a nonparametric supervised learning method used for classification and regression. The goal is to create a model that predicts the value of a target
More informationLouis Fourrier Fabien Gaie Thomas Rolf
CS 229 Stay Alert! The Ford Challenge Louis Fourrier Fabien Gaie Thomas Rolf Louis Fourrier Fabien Gaie Thomas Rolf 1. Problem description a. Goal Our final project is a recent Kaggle competition submitted
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Features and Patterns The Curse of Size and
More informationA Quick Introduction to R
Math 4501 Fall 2012 A Quick Introduction to R The point of these few pages is to give you a quick introduction to the possible uses of the free software R in statistical analysis. I will only expect you
More information3. Data Analysis and Statistics
3. Data Analysis and Statistics 3.1 Visual Analysis of Data 3.2.1 Basic Statistics Examples 3.2.2 Basic Statistical Theory 3.3 Normal Distributions 3.4 Bivariate Data 3.1 Visual Analysis of Data Visual
More informationChapter 5: Joint Probability Distributions and Random
Chapter 5: Joint Probability Distributions and Random Samples Curtis Miller 2018-06-13 Introduction We may naturally inquire about collections of random variables that are related to each other in some
More informationLearner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display
CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &
More informationEvaluation Metrics. (Classifiers) CS229 Section Anand Avati
Evaluation Metrics (Classifiers) CS Section Anand Avati Topics Why? Binary classifiers Metrics Rank view Thresholding Confusion Matrix Point metrics: Accuracy, Precision, Recall / Sensitivity, Specificity,
More informationNatural Language Processing Basics. Yingyu Liang University of Wisconsin-Madison
Natural Language Processing Basics Yingyu Liang University of Wisconsin-Madison Natural language Processing (NLP) The processing of the human languages by computers One of the oldest AI tasks One of the
More informationSTAT 113: Lab 9. Colin Reimer Dawson. Last revised November 10, 2015
STAT 113: Lab 9 Colin Reimer Dawson Last revised November 10, 2015 We will do some of the following together. The exercises with a (*) should be done and turned in as part of HW9. Before we start, let
More informationStatistics I Practice 2 Notes Probability and probabilistic models; Introduction of the statistical inference
Statistics I Practice 2 Notes Probability and probabilistic models; Introduction of the statistical inference 1. Simulation of random variables In Excel we can simulate values from random variables (discrete
More informationSD 372 Pattern Recognition
SD 372 Pattern Recognition Lab 2: Model Estimation and Discriminant Functions 1 Purpose This lab examines the areas of statistical model estimation and classifier aggregation. Model estimation will be
More informationFathom Dynamic Data TM Version 2 Specifications
Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other
More informationThe Normal Distribution & z-scores
& z-scores Distributions: Who needs them? Why are we interested in distributions? Important link between distributions and probabilities of events If we know the distribution of a set of events, then we
More informationParameter Estimation. Learning From Data: MLE. Parameter Estimation. Likelihood. Maximum Likelihood Parameter Estimation. Likelihood Function 12/1/16
Learning From Data: MLE Maximum Estimators Common approach in statistics: use a parametric model of data: Assume data set: Bin(n, p), Poisson( ), N(µ, exp( ) Uniform(a, b) 2 ) But parameters are unknown!!!
More informationComputational Cognitive Science
Computational Cognitive Science Lecture 5: Maximum Likelihood Estimation; Parameter Uncertainty Chris Lucas (Slides adapted from Frank Keller s) School of Informatics University of Edinburgh clucas2@inf.ed.ac.uk
More informationPart II: A broader view
Part II: A broader view Understanding ML metrics: isometrics, basic types of linear isometric plots linear metrics and equivalences between them skew-sensitivity non-linear metrics Model manipulation:
More informationAn Introduction to the R Commander
An Introduction to the R Commander BIO/MAT 460, Spring 2011 Christopher J. Mecklin Department of Mathematics & Statistics Biomathematics Research Group Murray State University Murray, KY 42071 christopher.mecklin@murraystate.edu
More informationChapter 8. Interval Estimation
Chapter 8 Interval Estimation We know how to get point estimate, so this chapter is really just about how to get the Introduction Move from generating a single point estimate of a parameter to generating
More informationLecture 25: Review I
Lecture 25: Review I Reading: Up to chapter 5 in ISLR. STATS 202: Data mining and analysis Jonathan Taylor 1 / 18 Unsupervised learning In unsupervised learning, all the variables are on equal standing,
More informationIntroduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization. Wolfram Burgard
Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization Wolfram Burgard 1 Motivation Recall: Discrete filter Discretize the continuous state space High memory complexity
More informationThe Bootstrap and Jackknife
The Bootstrap and Jackknife Summer 2017 Summer Institutes 249 Bootstrap & Jackknife Motivation In scientific research Interest often focuses upon the estimation of some unknown parameter, θ. The parameter
More informationThe Multi Stage Gibbs Sampling: Data Augmentation Dutch Example
The Multi Stage Gibbs Sampling: Data Augmentation Dutch Example Rebecca C. Steorts Bayesian Methods and Modern Statistics: STA 360/601 Module 8 1 Example: Data augmentation / Auxiliary variables A commonly-used
More informationSTAT - Edit Scroll up the appropriate list to highlight the list name at the very top Press CLEAR, followed by the down arrow or ENTER
Entering/Editing Data Use arrows to scroll to the appropriate list and position Enter or edit data, pressing ENTER after each (including the last) Deleting Data (One Value at a Time) Use arrows to scroll
More informationIn this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics:
UPPSALA UNIVERSITY Department of Mathematics Måns Thulin, thulin@math.uu.se Analysis of regression and variance Fall 2011 COMPUTER EXERCISE 2: One-way ANOVA In this computer exercise we will work with
More informationHomework 2. Due: March 2, 2018 at 7:00PM. p = 1 m. (x i ). i=1
Homework 2 Due: March 2, 2018 at 7:00PM Written Questions Problem 1: Estimator (5 points) Let x 1, x 2,..., x m be an i.i.d. (independent and identically distributed) sample drawn from distribution B(p)
More informationPart I, Chapters 4 & 5. Data Tables and Data Analysis Statistics and Figures
Part I, Chapters 4 & 5 Data Tables and Data Analysis Statistics and Figures Descriptive Statistics 1 Are data points clumped? (order variable / exp. variable) Concentrated around one value? Concentrated
More informationThe Normal Distribution & z-scores
& z-scores Distributions: Who needs them? Why are we interested in distributions? Important link between distributions and probabilities of events If we know the distribution of a set of events, then we
More informationYEAR 7 SCHEME OF WORK - EXTENSION
YEAR 7 SCHEME OF WORK - EXTENSION Autumn Term 1 Number Skills Spring Term 1 Angles and Shape Summer Term 1 Multiplicative Reasoning Analysing and displaying data Decimals Perimeter, Area and Volume Half
More informationStatistical Pattern Recognition
Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and
More informationBESTFIT, DISTRIBUTION FITTING SOFTWARE BY PALISADE CORPORATION
Proceedings of the 1996 Winter Simulation Conference ed. J. M. Charnes, D. J. Morrice, D. T. Brunner, and J. J. S\vain BESTFIT, DISTRIBUTION FITTING SOFTWARE BY PALISADE CORPORATION Linda lankauskas Sam
More informationRemote Sensing & Photogrammetry W4. Beata Hejmanowska Building C4, room 212, phone:
Remote Sensing & Photogrammetry W4 Beata Hejmanowska Building C4, room 212, phone: +4812 617 22 72 605 061 510 galia@agh.edu.pl 1 General procedures in image classification Conventional multispectral classification
More informationChapter 6: DESCRIPTIVE STATISTICS
Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 12 Combining
More informationLecture 8: Jointly distributed random variables
Lecture : Jointly distributed random variables Random Vectors and Joint Probability Distributions Definition: Random Vector. An n-dimensional random vector, denoted as Z = (Z, Z,, Z n ), is a function
More informationLab 7: Bayesian analysis of a dice toss problem using C++ instead of python
Lab 7: Bayesian analysis of a dice toss problem using C++ instead of python Due date: Monday March 27, 11:59pm Short version of the assignment Take your python file from lab 6 and convert it into lab7
More informationCPSC 340: Machine Learning and Data Mining. Non-Parametric Models Fall 2016
CPSC 340: Machine Learning and Data Mining Non-Parametric Models Fall 2016 Admin Course add/drop deadline tomorrow. Assignment 1 is due Friday. Setup your CS undergrad account ASAP to use Handin: https://www.cs.ubc.ca/getacct
More informationResearch Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel
Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement
More informationData Classification 1
Data Classification 1 Data Classification The idea of classification is to group together items that are alike The objective of classification is to group data in such a manner that not only are the observations
More informationOverview. Monte Carlo Methods. Statistics & Bayesian Inference Lecture 3. Situation At End Of Last Week
Statistics & Bayesian Inference Lecture 3 Joe Zuntz Overview Overview & Motivation Metropolis Hastings Monte Carlo Methods Importance sampling Direct sampling Gibbs sampling Monte-Carlo Markov Chains Emcee
More information