Keywords. net install st0037_2, from(
|
|
- Isaac Perkins
- 5 years ago
- Views:
Transcription
1 Kernel smoothed CDF estimation with akdensity Philippe Van Kerm CEPS/INSTEAD, Luxembourg Abstract This note describes formulas for estimation of kernel smoothed CDF in the Stata user-written package akdensity, with usage example. July 200 Keywords akdensity ; Stata ; smoothed CDF Introduction akdensity is a user-written Stata command for univariate density estimation using adaptive kernel estimation methods (Van Kerm, 2003). The command is freely available online for installation in net-aware Stata. At the Stata command prompt, type or ssc install akdensity net install st0037_2, from( A new functionality has been added in a software update in July 200 (version 4.0). A new option cdf(newvarname) is now available to supplement estimation of the density function with estimation of the kernel smoothed cumulative distribution function of the data. Both the PDF and CDF function estimates produced are based on identical (adaptive) bandwidth and kernel function specifications set by the other command options. This note describes syntax, formulas and usage examples for this new option which was not available at the time of publication of Van Kerm (2003). 2 Methods and formulas The adaptive kernel density estimate as computed by akdensity is given by ˆf(x) = n i= w i n ( ) w i x xi K i= where the x i s are the data points (associated with sample weights w i ), K is a kernel function, and = h λ i where s a global bandwidth parameter and λ i is a bandwidth adaptation factor proportional to the square root of the density of the data at each sample point (Van Kerm, 2003). Centre d Etudes de Populations, de Pauvreté et de Politiques Socio-Economiques/International Networks for Studies in Technology, Environment, Alternatives, Development. B.P. 48, L-450 Differdange, Luxembourg. philippe.vankerm@ceps.lu. The latest version of the akdensity package is 4.0 (of ). Stata 7.0 or later is required.
2 The corresponding kernel smoothed CDF estimate is given by ˆF(x) = x where IK is the integral of the kernel function K: = ˆf(v)dv n i= w i IK(x) = x n ( ) x xi w i IK i= K(v)dv. (See, e.g., Reiss (98), Kulczycki and Dawidowicz (997), Bowman et al. (998) or Li and Racine (2006).) For a Gaussian kernel, K(z) = φ(z) and IK(z) = Φ(z) where φ and Φ are the Gaussian PDF and CDF, respectively. For Epanechnikov kernel functions, we have or K(z) = IK(z) = K(z) = IK(z) = { ( 5 z2) if z < 5 0 otherwise 0 if z < ( z 5 z3) if z < 5 if z > 5 { 3 4 ( z 2 ) if z < 0 otherwise 0 if z < 2 + ( 3 4 z 3 z3) if z < if z > 3 Syntax The syntax for akdensity follows the official kdensity syntax: akdensity varname [ weight ][ if ][ in ][, nograph noadaptive generate(newvar x newvar density) n(#) width(#) [ epan gauss ] cdf(string) normal student(#) at(var x) stdbands(#) symbol(...) connect(...) title(string) graph options ] The syntax for the companion command akdensity0 is: akdensity0 varname [ weight ][ if ][ in ], width(# varname) at(var x) generate(newvar density) [ stdbands(#) lambda(string) [ epan gauss ] double ] cdf(string) requests estimation of the kernel smoothed cumulative distribution function in addition to the density function. Both function estimates are based on identical (adaptive) bandwidth and kernel function specifications. CDF estimates for each point on the grid specified by at or n options are stored in the newly created variable newvarname. With the exception of cdf(newvarname), all options are described in Van Kerm (2003) or in [R] kdensity (for the options in common). 2
3 4 Example The following example illustrates usage of akdensity s cdf option using the coral trout length data distribution used in Van Kerm (2003).. use trocolen.dta, clear First, cdf is used with the default bandwidth selected by akdensity. The first plot (top panel below) illustrates the resulting estimates of both the PDF and CDF estimates along with the classic empirical CDF estimate.. range x (26 missing values generated). label var x "Length". akdensity length, nograph gen(fx) at(x) cdf(fx) Two-stage adaptive kernel density estimation Step : Pilot density and local bandwidth factors estimation Step 2: Adaptive kernel density estimation. cumul length, gen(ecdf). tw (line fx x) (line Fx x, yaxis(2)) (line ecdf length, connect(stairstep) > sort yaxis(2) ) `gropts Note that preference over bandwidth size may differ according to focus on smoothing the PDF or the CDF. The second plot (bottom panel) illustrates PDF and CDF estimates with a smaller bandwidth. For consistency, however, akdensity will estimate both PDF and CDF using identical bandwidths.. akdensity length, nograph gen(fx2) at(x) cdf(fx2) w(0) Two-stage adaptive kernel density estimation Step : Pilot density and local bandwidth factors estimation Step 2: Adaptive kernel density estimation. tw (line fx2 x) (line Fx2 x, yaxis(2)) (line ecdf length, connect(stairstep > ) sort yaxis(2) ) `gropts References Bowman, A., Hall, P. and Prvan, T. (998), Bandwidth selection for the smoothing of distribution functions, Biometrika 85(4), Kulczycki, P. and Dawidowicz, A. L. (997), Kernel estimator of quantile, Universitatis Iagellonicae Acta Mathematica 37, Li, Q. and Racine, J. S. (2006), Nonparametric Econometrics: Theory and Practice, Princeton University Press. Reiss, R.-D. (98), Nonparametric estimation of smooth distribution functions, Scandinavian Journal of Statistics 8(2), 6 9. Van Kerm, P. (2003), Adaptive kernel density estimation, Stata Journal 3(2),
4 (a) Default bandwidth Adaptive kernel PDF estimate Adaptive kernel CDF estimate Coral trout length (in mm.) (b) Bandwidth set to 0 mm. Adaptive kernel PDF estimate Adaptive kernel CDF estimate Coral trout length (in mm.) Figure. Adaptive kernel PDF and CDF estimates and empirical CDF 4
5 Acknowledgements Implementation of the new cdf() option and preparation of this note was financially supported by the World Bank Knowledge for Change Program (KCP II - TF094570). Suggested citation Van Kerm, P. (200), Kernel smoothed CDF estimation with akdensity, CEPS/INSTEAD, Differdange, Luxembourg. 5
Self-consistent density estimation
Self-consistent density estimation Joerg Luedicke Alberto Bernacchia Manuscript currently under review by The Stata Journal 5 April 2013 Contact: joerg.luedicke@ufl.edu The Stata Journal (yyyy) vv, Number
More informationEconometric Tools 1: Non-Parametric Methods
University of California, Santa Cruz Department of Economics ECON 294A (Fall 2014) - Stata Lab Instructor: Manuel Barron 1 Econometric Tools 1: Non-Parametric Methods 1 Introduction This lecture introduces
More informationRegression III: Advanced Methods
Lecture 3: Distributions Regression III: Advanced Methods William G. Jacoby Michigan State University Goals of the lecture Examine data in graphical form Graphs for looking at univariate distributions
More informationNonparametric Estimation of Distribution Function using Bezier Curve
Communications for Statistical Applications and Methods 2014, Vol. 21, No. 1, 105 114 DOI: http://dx.doi.org/10.5351/csam.2014.21.1.105 ISSN 2287-7843 Nonparametric Estimation of Distribution Function
More informationCALCULATION OF OPERATIONAL LOSSES WITH NON- PARAMETRIC APPROACH: DERAILMENT LOSSES
2. Uluslar arası Raylı Sistemler Mühendisliği Sempozyumu (ISERSE 13), 9-11 Ekim 2013, Karabük, Türkiye CALCULATION OF OPERATIONAL LOSSES WITH NON- PARAMETRIC APPROACH: DERAILMENT LOSSES Zübeyde Öztürk
More informationComputing multidimensional poverty indices using fuzzy sets theory. (The MPI Program)
Computing multidimensional poverty indices using fuzzy sets theory (The MPI Program) Maria Noel Pi Alperin CEPS/INSTEAD B.P.48, L-4501 Differdange Luxembourg LAMETA Université Montpellier I Av. de la Mer
More informationCREATING THE DISTRIBUTION ANALYSIS
Chapter 12 Examining Distributions Chapter Table of Contents CREATING THE DISTRIBUTION ANALYSIS...176 BoxPlot...178 Histogram...180 Moments and Quantiles Tables...... 183 ADDING DENSITY ESTIMATES...184
More informationNonparametric Density Estimation
Nonparametric Estimation Data: X 1,..., X n iid P where P is a distribution with density f(x). Aim: Estimation of density f(x) Parametric density estimation: Fit parametric model {f(x θ) θ Θ} to data parameter
More informationIntroduction to STATA
Introduction to STATA Duah Dwomoh, MPhil School of Public Health, University of Ghana, Accra July 2016 International Workshop on Impact Evaluation of Population, Health and Nutrition Programs Learning
More informationNonparametric regression using kernel and spline methods
Nonparametric regression using kernel and spline methods Jean D. Opsomer F. Jay Breidt March 3, 016 1 The statistical model When applying nonparametric regression methods, the researcher is interested
More informationPoints Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked
Plotting Menu: QCExpert Plotting Module graphs offers various tools for visualization of uni- and multivariate data. Settings and options in different types of graphs allow for modifications and customizations
More information1 Overview XploRe is an interactive computational environment for statistics. The aim of XploRe is to provide a full, high-level programming language
Teaching Statistics with XploRe Marlene Muller Institute for Statistics and Econometrics, Humboldt University Berlin Spandauer Str. 1, D{10178 Berlin, Germany marlene@wiwi.hu-berlin.de, http://www.wiwi.hu-berlin.de/marlene
More informationUse of Extreme Value Statistics in Modeling Biometric Systems
Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision
More informationAn Introduction to PDF Estimation and Clustering
Sigmedia, Electronic Engineering Dept., Trinity College, Dublin. 1 An Introduction to PDF Estimation and Clustering David Corrigan corrigad@tcd.ie Electrical and Electronic Engineering Dept., University
More informationThe MKLE Package. July 24, Kdensity... 1 Kernels... 3 MKLE-package... 3 checkparms... 4 Klik... 5 MKLE... 6 mklewarp... 7 state... 8.
The MKLE Package July 24, 2006 Type Package Title Maximum kernel likelihood estimation Version 0.02 Date 2006-07-12 Author Maintainer Package to compute the maximum kernel likelihood
More informationA Quick Guide to Stata 8 for Windows
Université de Lausanne, HEC Applied Econometrics II Kurt Schmidheiny October 22, 2003 A Quick Guide to Stata 8 for Windows 2 1 Introduction A Quick Guide to Stata 8 for Windows This guide introduces the
More informationOn Kernel Density Estimation with Univariate Application. SILOKO, Israel Uzuazor
On Kernel Density Estimation with Univariate Application BY SILOKO, Israel Uzuazor Department of Mathematics/ICT, Edo University Iyamho, Edo State, Nigeria. A Seminar Presented at Faculty of Science, Edo
More informationEditor Executive Editor Associate Editors Copyright Statement:
The Stata Journal Editor H. Joseph Newton Department of Statistics Texas A & M University College Station, Texas 77843 979-845-3142; FAX 979-845-3144 jnewton@stata-journal.com Associate Editors Christopher
More informationPackage kerdiest. February 20, 2015
Type Package Package kerdiest February 20, 2015 Title Nonparametric kernel estimation of the distribution function. Bandwidth selection and estimation of related functions. Version 1.2 Date 2012-08-11
More informationImproving the Post-Smoothing of Test Norms with Kernel Smoothing
Improving the Post-Smoothing of Test Norms with Kernel Smoothing Anli Lin Qing Yi Michael J. Young Pearson Paper presented at the Annual Meeting of National Council on Measurement in Education, May 1-3,
More informationMonte Carlo Simula/on and Copula Func/on. by Gerardo Ferrara
Monte Carlo Simula/on and Copula Func/on by Gerardo Ferrara Introduc)on A Monte Carlo method is a computational algorithm that relies on repeated random sampling to compute its results. In a nutshell,
More informationMultivariate Analysis
Multivariate Analysis Project 1 Jeremy Morris February 20, 2006 1 Generating bivariate normal data Definition 2.2 from our text states that we can transform a sample from a standard normal random variable
More informationRobust Shape Retrieval Using Maximum Likelihood Theory
Robust Shape Retrieval Using Maximum Likelihood Theory Naif Alajlan 1, Paul Fieguth 2, and Mohamed Kamel 1 1 PAMI Lab, E & CE Dept., UW, Waterloo, ON, N2L 3G1, Canada. naif, mkamel@pami.uwaterloo.ca 2
More informationGenerating random samples from user-defined distributions
The Stata Journal (2011) 11, Number 2, pp. 299 304 Generating random samples from user-defined distributions Katarína Lukácsy Central European University Budapest, Hungary lukacsy katarina@phd.ceu.hu Abstract.
More informationISSN Article. New Graphical Methods and Test Statistics for Testing Composite Normality
Econometrics 25, 3, 532-56; doi:.339/econometrics33532 OPEN ACCESS econometrics ISSN 2225-46 www.mdpi.com/journal/econometrics Article New Graphical Methods and Test Statistics for Testing Composite Normality
More informationSimultaneous Interval Regression for K-Nearest Neighbor
Simultaneous Interval Regression for K-Nearest Neighbor Mohammad Ghasemi Hamed 1,2, Mathieu Serrurier 1, and Nicolas Durand 1,2 1 IRIT - Université Paul Sabatier 118 route de Narbonne 31062, Toulouse Cedex
More informationEstimation and Comparison of Receiver Operating Characteristic Curves
UW Biostatistics Working Paper Series 1-22-2008 Estimation and Comparison of Receiver Operating Characteristic Curves Margaret Pepe University of Washington, Fred Hutch Cancer Research Center, mspepe@u.washington.edu
More informationSection 4 Matching Estimator
Section 4 Matching Estimator Matching Estimators Key Idea: The matching method compares the outcomes of program participants with those of matched nonparticipants, where matches are chosen on the basis
More informationModelling Bivariate Distributions Using Kernel Density Estimation
Modelling Bivariate Distributions Using Kernel Density Estimation Alexander Bilock, Carl Jidling and Ylva Rydin Project in Computational Science 6 January 6 Department of information technology Abstract
More informationOptimization and Simulation
Optimization and Simulation Statistical analysis and bootstrapping Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale
More informationRevision of Stata basics in STATA 11:
Revision of Stata basics in STATA 11: April, 2016 Dr. Selim Raihan Executive Director, SANEM Professor, Department of Economics, University of Dhaka Contents a) Resources b) Stata 11 Interface c) Datasets
More informationDescriptive and Graphical Analysis of the Data
Descriptive and Graphical Analysis of the Data Carlo Favero Favero () Descriptive and Graphical Analysis of the Data 1 / 10 The first database Our first database is made of 39 seasons (from 1979-1980 to
More informationIntroduction to Nonparametric/Semiparametric Econometric Analysis: Implementation
to Nonparametric/Semiparametric Econometric Analysis: Implementation Yoichi Arai National Graduate Institute for Policy Studies 2014 JEA Spring Meeting (14 June) 1 / 30 Motivation MSE (MISE): Measures
More informationA quick introduction to STATA
A quick introduction to STATA Data files and other resources for the course book Introduction to Econometrics by Stock and Watson is available on: http://wps.aw.com/aw_stock_ie_3/178/45691/11696965.cw/index.html
More informationPackage tseriesentropy
Package tseriesentropy April 15, 2017 Title Entropy Based Analysis and Tests for Time Series Date 2017-04-15 Version 0.6-0 Author Simone Giannerini Depends R (>= 2.14.0) Imports cubature, methods, parallel,
More informationWill Monroe July 21, with materials by Mehran Sahami and Chris Piech. Joint Distributions
Will Monroe July 1, 017 with materials by Mehran Sahami and Chris Piech Joint Distributions Review: Normal random variable An normal (= Gaussian) random variable is a good approximation to many other distributions.
More informationComputing Optimal Strata Bounds Using Dynamic Programming
Computing Optimal Strata Bounds Using Dynamic Programming Eric Miller Summit Consulting, LLC 7/27/2012 1 / 19 Motivation Sampling can be costly. Sample size is often chosen so that point estimates achieve
More informationPackage sbf. R topics documented: February 20, Type Package Title Smooth Backfitting Version Date Author A. Arcagni, L.
Type Package Title Smooth Backfitting Version 1.1.1 Date 2014-12-19 Author A. Arcagni, L. Bagnato Package sbf February 20, 2015 Maintainer Alberto Arcagni Smooth Backfitting
More informationPerformance Measures
1 Performance Measures Classification F-Measure: (careful: similar but not the same F-measure as the F-measure we saw for clustering!) Tradeoff between classifying correctly all datapoints of the same
More informationProbability Models.S4 Simulating Random Variables
Operations Research Models and Methods Paul A. Jensen and Jonathan F. Bard Probability Models.S4 Simulating Random Variables In the fashion of the last several sections, we will often create probability
More informationStat 302 Statistical Software and Its Applications SAS: Distributions
Stat 302 Statistical Software and Its Applications SAS: Distributions Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 39 Distributions in R and SAS Distribution R SAS Beta
More informationUniversity of Wisconsin-Madison Spring 2018 BMI/CS 776: Advanced Bioinformatics Homework #2
Assignment goals Use mutual information to reconstruct gene expression networks Evaluate classifier predictions Examine Gibbs sampling for a Markov random field Control for multiple hypothesis testing
More informationA Bayesian approach to parameter estimation for kernel density estimation via transformations
A Bayesian approach to parameter estimation for kernel density estimation via transformations Qing Liu,, David Pitt 2, Xibin Zhang 3, Xueyuan Wu Centre for Actuarial Studies, Faculty of Business and Economics,
More informationDynamic Thresholding for Image Analysis
Dynamic Thresholding for Image Analysis Statistical Consulting Report for Edward Chan Clean Energy Research Center University of British Columbia by Libo Lu Department of Statistics University of British
More informationPart I. Graphical exploratory data analysis. Graphical summaries of data. Graphical summaries of data
Week 3 Based in part on slides from textbook, slides of Susan Holmes Part I Graphical exploratory data analysis October 10, 2012 1 / 1 2 / 1 Graphical summaries of data Graphical summaries of data Exploratory
More informationStatistical modelling of the patterns of planar sections of prostatic capillaries on the basis of Strauss hard-core processes
TITLE Statistical modelling of the patterns of planar sections of prostatic capillaries on the basis of Strauss hard-core processes AUTHORS Torsten Mattfeldt 1 Stefanie Eckel 2 Frank Fleischer 3 Volker
More informationCHAPTER 6. The Normal Probability Distribution
The Normal Probability Distribution CHAPTER 6 The normal probability distribution is the most widely used distribution in statistics as many statistical procedures are built around it. The central limit
More informationNonparametric Regression
Nonparametric Regression John Fox Department of Sociology McMaster University 1280 Main Street West Hamilton, Ontario Canada L8S 4M4 jfox@mcmaster.ca February 2004 Abstract Nonparametric regression analysis
More informationGetting started with Stata 2017: Cheat-sheet
Getting started with Stata 2017: Cheat-sheet 4. september 2017 1 Get started Graphical user interface (GUI). Clickable. Simple. Commands. Allows for use of do-le. Easy to keep track. Command window: Write
More informationNonparametric Risk Assessment of Gas Turbine Engines
Nonparametric Risk Assessment of Gas Turbine Engines Michael P. Enright *, R. Craig McClung, and Stephen J. Hudak Southwest Research Institute, San Antonio, TX, 78238, USA The accuracy associated with
More informationKernel Density Estimation (KDE)
Kernel Density Estimation (KDE) Previously, we ve seen how to use the histogram method to infer the probability density function (PDF) of a random variable (population) using a finite data sample. In this
More informationKernel Density Estimation
Kernel Density Estimation An Introduction Justus H. Piater, Université de Liège Overview 1. Densities and their Estimation 2. Basic Estimators for Univariate KDE 3. Remarks 4. Methods for Particular Domains
More informationSPSS Basics for Probability Distributions
Built-in Statistical Functions in SPSS Begin by defining some variables in the Variable View of a data file, save this file as Probability_Distributions.sav and save the corresponding output file as Probability_Distributions.spo.
More informationAn R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation
An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation Xingguo Li Tuo Zhao Xiaoming Yuan Han Liu Abstract This paper describes an R package named flare, which implements
More informationWorkshop for empirical trade analysis. December 2015 Bangkok, Thailand
Workshop for empirical trade analysis December 2015 Bangkok, Thailand Cosimo Beverelli (WTO) Rainer Lanz (WTO) Content a. Resources b. Stata windows c. Organization of the Bangkok_Dec_2015\Stata folder
More informationA Short Guide to Stata 10 for Windows
A Short Guide to Stata 10 for Windows 1. Introduction 2 2. The Stata Environment 2 3. Where to get help 2 4. Opening and Saving Data 3 5. Importing Data 4 6. Data Manipulation 5 7. Descriptive Statistics
More information4.3 The Normal Distribution
4.3 The Normal Distribution Objectives. Definition of normal distribution. Standard normal distribution. Specialties of the graph of the standard normal distribution. Percentiles of the standard normal
More informationWhat is the Optimal Bin Size of a Histogram: An Informal Description
International Mathematical Forum, Vol 12, 2017, no 15, 731-736 HIKARI Ltd, wwwm-hikaricom https://doiorg/1012988/imf20177757 What is the Optimal Bin Size of a Histogram: An Informal Description Afshin
More informationFFT-Based Fast Computation of Multivariate Kernel Density Estimators with Unconstrained Bandwidth Matrices
FFT-Based Fast Computation of Multivariate Kernel Density Estimators with Unconstrained Bandwidth Matrices arxiv:1508.02766v6 [stat.co] 7 Sep 2016 Artur Gramacki Institute of Control and Computation Engineering
More informationProjective Integration Methods for Distributions. Λ C. W. Gear y November 26, 2001 Abstract Projective methods were introduced in an earlier paper. In this paper we consider their application to the output
More informationSimulating multivariate distributions with sparse data: a kernel density smoothing procedure
Simulating multivariate distributions with sparse data: a kernel density smoothing procedure James W. Richardson Department of gricultural Economics, Texas &M University Gudbrand Lien Norwegian gricultural
More informationUniversity of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques
University of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques Mark Gales mjfg@eng.cam.ac.uk Michaelmas 2015 11. Non-Parameteric Techniques
More informationAlgorithms for Manipulation of Level Sets of Nonparametric Density Estimates 1
Algorithms for Manipulation of Level Sets of Nonparametric Density Estimates 1 Jussi Klemelä Department of Statistics, University of Mannheim, L 7 3-5, Verfügungsgebäude, 68131 Mannheim, Germany Summary
More informationComputational Methods for Finding Probabilities of Intervals
Computational Methods for Finding Probabilities of Intervals Often the standard normal density function is denoted ϕ(z) = (2π) 1/2 exp( z 2 /2) and its CDF by Φ(z). Because Φ cannot be expressed in closed
More informationJMP Book Descriptions
JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked
More informationAssessing Power Output Specifications of PV Modules
Assessing Power Output Specifications of PV Modules This user manual describes Version 1.4, build May 19th, 2009. APOS photovoltaic StatLab is based on joint research projects of the Institute of Statistics
More informationEconomics Nonparametric Econometrics
Economics 217 - Nonparametric Econometrics Topics covered in this lecture Introduction to the nonparametric model The role of bandwidth Choice of smoothing function R commands for nonparametric models
More informationFathom Dynamic Data TM Version 2 Specifications
Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other
More informationUniversity of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques.
. Non-Parameteric Techniques University of Cambridge Engineering Part IIB Paper 4F: Statistical Pattern Processing Handout : Non-Parametric Techniques Mark Gales mjfg@eng.cam.ac.uk Michaelmas 23 Introduction
More informationLOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.
LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. http://en.wikipedia.org/wiki/local_regression Local regression
More informationCHAPTER 1. Introduction. Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data.
1 CHAPTER 1 Introduction Statistics: Statistics is the science of collecting, organizing, analyzing, presenting and interpreting data. Variable: Any characteristic of a person or thing that can be expressed
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for accompanying R code on the course web site. Topics Exploratory Data Analysis
More informationUniversity of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques
University of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques Mark Gales mjfg@eng.cam.ac.uk Michaelmas 2011 11. Non-Parameteric Techniques
More informationC N O S N T S RA R INT N - T BA B SE S D E L O L C O A C L S E S A E RC R H
LECTURE 11 & 12 CONSTRAINT-BASED LOCAL SEARCH Constraint-based Local Search Problem given in CSP form : a set of variables V={V1, V2,, Vn} a set of constraints C={C1, C2,, Ck} i.e. arithmetic or symbolic
More informationheight VUD x = x 1 + x x N N 2 + (x 2 x) 2 + (x N x) 2. N
Math 3: CSM Tutorial: Probability, Statistics, and Navels Fall 2 In this worksheet, we look at navel ratios, means, standard deviations, relative frequency density histograms, and probability density functions.
More informationIndex. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113
Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame
More informationFrequency Distributions
Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,
More informationCDA6530: Performance Models of Computers and Networks. Chapter 8: Statistical Simulation --- Discrete-Time Simulation
CDA6530: Performance Models of Computers and Networks Chapter 8: Statistical Simulation --- Discrete-Time Simulation Simulation Studies Models with analytical formulas Calculate the numerical solutions
More informationAMaLGaM IDEAs in Noisy Black-Box Optimization Benchmarking
AMaLGaM IDEAs in Noisy Black-Box Optimization Benchmarking Peter A.N. Bosman Centre for Mathematics and Computer Science P.O. Box GB Amsterdam The Netherlands Peter.Bosman@cwi.nl Jörn Grahl Johannes Gutenberg
More informationPackage mhde. October 23, 2015
Package mhde October 23, 2015 Type Package Title Minimum Hellinger Distance Test for Normality Version 1.0-1 Date 2015-10-21 Author Paul W. Eslinger [aut, cre], Heather Orr [aut, ctb] Maintainer Paul W.
More informationBeyond The Vector Data Model - Part Two
Beyond The Vector Data Model - Part Two Introduction Spatial Analyst Extension (Spatial Analysis) What is your question? Selecting a method of analysis Map Algebra Who is the audience? What is Spatial
More informationPrinciples Of Econometrics Third Edition Solution Manual
Principles Of Econometrics Third Edition Solution Manual If you are searched for a book Principles of econometrics third edition solution manual in pdf form, in that case you come on to faithful site.
More informationMedical Image Analysis
Medical Image Analysis Instructor: Moo K. Chung mchung@stat.wisc.edu Lecture 10. Multiple Comparisons March 06, 2007 This lecture will show you how to construct P-value maps fmri Multiple Comparisons 4-Dimensional
More informationRobert Collins CSE598G. Robert Collins CSE598G
Recall: Kernel Density Estimation Given a set of data samples x i ; i=1...n Convolve with a kernel function H to generate a smooth function f(x) Equivalent to superposition of multiple kernels centered
More informationPrograms for MDE Modeling and Conditional Distribution Calculation
Programs for MDE Modeling and Conditional Distribution Calculation Sahyun Hong and Clayton V. Deutsch Improved numerical reservoir models are constructed when all available diverse data sources are accounted
More informationNonparametric Approaches to Regression
Nonparametric Approaches to Regression In traditional nonparametric regression, we assume very little about the functional form of the mean response function. In particular, we assume the model where m(xi)
More informationLab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots
STAT 350 (Spring 2015) Lab 3: SAS Solutions 1 Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots Note: The data sets are not included in the solutions;
More informationINTRODUCTION TO ECONOMETRICS STOCK WATSON SOLUTIONS CHAPTER 9
page 1 / 7 page 2 / 7 introduction to econometrics stock pdf Documents Similar To 3rd Ed - Intro to Econometrics - Stock-Watson.pdf. StockWatson_3e_EmpiricalExerciseSolutions. Uploaded by. markus. Introduction
More informationLecture 17: Smoothing splines, Local Regression, and GAMs
Lecture 17: Smoothing splines, Local Regression, and GAMs Reading: Sections 7.5-7 STATS 202: Data mining and analysis November 6, 2017 1 / 24 Cubic splines Define a set of knots ξ 1 < ξ 2 < < ξ K. We want
More informationLecture 11: Distributions as Models October 2014
Lecture 11: Distributions as Models 36-350 1 October 2014 Previously R functions for regression models R functions for probability distributions Agenda Distributions from data Review of R for theoretical
More informationIntegration. Volume Estimation
Monte Carlo Integration Lab Objective: Many important integrals cannot be evaluated symbolically because the integrand has no antiderivative. Traditional numerical integration techniques like Newton-Cotes
More informationComparing different interpolation methods on two-dimensional test functions
Comparing different interpolation methods on two-dimensional test functions Thomas Mühlenstädt, Sonja Kuhnt May 28, 2009 Keywords: Interpolation, computer experiment, Kriging, Kernel interpolation, Thin
More informationModel Based Symbolic Description for Big Data Analysis
Model Based Symbolic Description for Big Data Analysis 1 Model Based Symbolic Description for Big Data Analysis *Carlo Drago, **Carlo Lauro and **Germana Scepi *University of Rome Niccolo Cusano, **University
More informationOutlier Detection of Data in Wireless Sensor Networks Using Kernel Density Estimation
International Journal of Computer Applications (975 8887) Volume 5 No.7, August 2 Outlier Detection of Data in Wireless Sensor Networks Using Kernel Density Estimation V. S. Kumar Samparthi Department
More informationTracking Computer Vision Spring 2018, Lecture 24
Tracking http://www.cs.cmu.edu/~16385/ 16-385 Computer Vision Spring 2018, Lecture 24 Course announcements Homework 6 has been posted and is due on April 20 th. - Any questions about the homework? - How
More informationSTATA 13 INTRODUCTION
STATA 13 INTRODUCTION Catherine McGowan & Elaine Williamson LONDON SCHOOL OF HYGIENE & TROPICAL MEDICINE DECEMBER 2013 0 CONTENTS INTRODUCTION... 1 Versions of STATA... 1 OPENING STATA... 1 THE STATA
More informationAdvanced Applied Multivariate Analysis
Advanced Applied Multivariate Analysis STAT, Fall 3 Sungkyu Jung Department of Statistics University of Pittsburgh E-mail: sungkyu@pitt.edu http://www.stat.pitt.edu/sungkyu/ / 3 General Information Course
More informationGive one example where you might wish to use a three dimensional array
CS 110: INTRODUCTION TO COMPUTER SCIENCE SAMPLE TEST 3 TIME ALLOWED: 60 MINUTES Student s Name: MAXIMUM MARK 100 NOTE: Unless otherwise stated, the questions are with reference to the Java Programming
More informationYou will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables
Jennie Murack You will learn: The structure of the Stata interface How to open files in Stata How to modify variable and value labels How to manipulate variables How to conduct basic descriptive statistics
More informationDensity-Based Clustering Based on Probability Distribution for Uncertain Data
International Journal of Engineering and Advanced Technology (IJEAT) Density-Based Clustering Based on Probability Distribution for Uncertain Data Pramod Patil, Ashish Patel, Parag Kulkarni Abstract: Today
More information