Bayesian Time-Stratified-Petersen estimators for abundance. Sampling Protocol. Simon J. Bonner (UBC) Carl James Schwarz (SFU)

Size: px
Start display at page:

Download "Bayesian Time-Stratified-Petersen estimators for abundance. Sampling Protocol. Simon J. Bonner (UBC) Carl James Schwarz (SFU)"

Transcription

1 Bayesian Time-Stratified-Petersen estimators for abundance Sampling Protocol Simon J. Bonner (UBC) Carl James Schwarz (SFU) Simple-Petersen or Stratified-Petersen methods are often used to estimate number of outgoing smolt or returning salmon. These methods are inadequate to deal with heterogeneity in catchability among strata and with missing data from strata caused by crew illness, high water flow, or other causes. We propose a Bayesian spline-based methodology to estimate abundance and run-timing which provides several compelling advantages over the more traditional estimators. The hierarchical model for capture probabilities and the spline model for the general shape of the run curve, allow information to be shared among stratra within a Bayesian framework and allows great flexibility to deal with missing data. It is self-calibrating- for strata with poor data, extensive pooling across strata take place but with strata with rich data, the information for a particular stratum takes precedence. The methodology automatically adjust measures of precision for heterogeneity in catchability among strata (which is ignored in the simple-petersen) and shares information from neighbouring strata (unlike the Stratified-Petersen). Examples from estimating the number of outgoing number of salmon smolt in the Trinity River, CA will be present - rotary screw-traps are used to capture fish - a sample of fish is marked and transported above trap and released - a portion of the marked fish are recaptured to estimate capture efficiency - unmarked fish are captured along with marked fish 1 2

2 Notation Data: - batch marks change weekly - unit of analysis is the (julian) week - n i - number of fish marked and released in week i - m ij - number of marked fish released in week i and recovered in week j. - u i - number of unmarked fish captured in week i (may include the n i ). This can often be (partially) subdivided in hatchery and wild fish. Hatchery Chinook are 25% adfin clipped; Hatchery Steelhead are 100% ad-fin clipped. - Current program is basically diagonal recoveries, i.e. m ij = 0for j>i. (but see Bonner and Schwarz, 2010) Parameters: - p i - recapture rate in week i - U i - total outgoing population in week i. - U =! U i - grand total outgoing population weeks 3 Sample JC 2003 Chinook Data Marked Fish Unmarked Fish AD JW D n i m ii m i,i+1 m i,i+2 AD* NC +NC f ,135 4, , ,452 10, , ,199 2, , ,427 30,542 39, , ,243 13,337 17, , ,646 6,282 7, , ,412 26,706 35, , ,703 26,831 34, , ,651 11,309 14, , ,677 3, , ,343 1, Total 50,489 2, , , ,995 * 25% of hatchery fish have AD-fin clip 4

3 Objectives and Problems Objectives: - estimate total number of outgoing juvenile salmon - separate estimates for hatchery and wild fish - estimate characteristics such as percentiles of run timing Problems: - sparse data (e.g. few marked fish released/recaptured) - missing data (no marking done and/or no unmarked captured) - anomalous data (e.g. fish disappear after release) Population Estimation Key concept (of mark-recapture): - recapture of marked fish provide estimate of screw-trap capture efficiency (e.g. the screw-trap is capturing 5% of the fish that pass the location) - use the estimated recapture rate to expand the number of unmarked fish captured. Methods: Complete Pooling Simple Petersen " # $ Û =! weeks " # $ u i % " & ' # $! weeks! weeks m ii % & ' n i % & '!" Separate weekly estimates (Weekly) Stratified- Petersen Û i = u in i m ii ;Û =! weeks Û i JC 2003 Chinook Estimates 4.2 (SE.081) million 16 (SE 3.7 ) million 5 (SE.21 ) million (ex JW 41) 5 6

4 Population Estimation Example JC 2003 Chinook Julian Week n i m ii * u i Û i ,616?? 10 1, , , , ,557 23, , ?? , , , , , ,580 1,274, , , , , , , ,534 12,414, , , ,734 Pooling 50,489 2, , million * Adjusted for less than 7 days sampling. Complete Pooling Best possible precision BUT Unable to estimate run timing easily Handle missing marking weeks. Unable to deal with weeks missing capture of unmarked fish Implicitly assumes homogeneous capture. Est small bias - whew SE large bias(!) Odd data has little effect Population Estimation!" Separate weekly estimates Comparison Weekly estimates may have poor precision but overall estimate has acceptable precision Estimate run-timing No estimate if don t mark in a week No estimate if don t recapture in a week Allows for heterogeneous capture across weeks, but assumes homogeneity within weeks Odd data could lead to highly biased weekly estimates 7 8

5 Population Estimation alternatives? (Brief Introduction to) Splines Spline flexible curve with no long-term pattern; based on local behavior Try pooling weeks that are similar (partially stratified) - arbitrary - how to estimate se properly? - dealing with missing weeks esp. with no unmarked fish Spline and Hierarchical model a) fit a smooth curve to the Ûi (spline) as the underlying trend but allow variability about trend line b) allow p i to vary around a common mean (hierarchical model) 9 Elements of spline fit - knots where the pieces of the spline meet - degree of spline (linear, quadratic, or cubic) 10

6 Knots - many knots = wiggly fit - fewer knots = smoother fit (Brief Introduction to) Splines Linear spline fit: (Brief Introduction to) Splines - rough rule of thumb, use n/4 knots Degree of spline - linear = joined line segments not differentiable at knots - quadratic/cubic = first/second derivatives are smooth Fitting splines = ordinary least squares (or variants thereof) - e.g. linear spline fit has design (X) matrix of the form: 1, x, (x-knot1) +, (x-knot2) +, where (arg) + is 0 if arg < 0 [1,] [2,] [3,] [4,] [5,] [6,] [7,] [8,]

7 (Brief Introduction to) Splines Quadratic spline fit. Design matrix has: 1, x, x 2, (x-knot1) + 2, (x-knot2) + 2, (Brief Introduction to) Splines Quadratic spline fit with fewer knots (every 8 th point) (less wiggly ) You can also control wiggliness by enforcing penalties on derivatives, or penalties on changes in coefficients

8 (Brief Introduction to) Hierarchical Models Recall that in each of 40 weeks we have a Petersen experiment there is a separate p i for each week In a pooled-petersen experiment, all p i are assumed to be equal (unrealistic) In a stratified-petersen experiment, all p i allowed to unequal and no information from other weeks is used in week of interest (too general) Hierarchical model: p i drawn from a common distribution with mean µ p and sd! p - if! p small, then sharing of information across weeks - if! p large, then each p i is separate from each other Proposed Spline & Hierarchical Method (a) Likelihood standard stratified Petersen estimator (b) Hierarchical model on logit(p i ) [Could use covariates as well including spline on logit(p)] (c) Spline model (with error ) for log(u i ) using B-spline basis functions and knots every 4 th week and smooth changes in coefficients (d) Sensible priors everywhere 15 16

9 Proposed Spline & Hierarchical Method Advantages: - borrows information from other weeks for estimating catchability and weekly run size. - gives weekly (and total) estimates - estimate run timing - if missing marking week, uses range of capture rates seen in other weeks to impute range of possible capture rates for weeks with no marking done - if missing unmarked fish in a week, uses spline to interpolate reasonable value for outgoing total based on variation of other weeks around smooth curve - automatically adjusts for amount of heterogeneity in capture-rates across weeks. If small variation, estimates have precision similar to pooled-petersen. If larger variation, estimates have realistic standard errors - odd data easily handled (simply set to missing) Disadvantage: - not amenable to hand computations - difficulty to fit Bayesian methods useful - computer programs are complex JC 2003 Chinook - Spline Model - Allowed for 2 jumps when hatchery fish arrived. Pooled Petersen: 4.2 (SE.081) million fish. Stratified-Petersen 5 (SE.21 ) million fish (ex j.w. 41) Spline est: 5.3 (SE.18 ) million fish 17 18

10 Capture Efficiency Estimation JC 2003 Chinook - Hierarchical Model Separating Wild and Hatchery JC 2003 CH YOY Fish Above results are a mixture of wild and hatchery fish. 25% of hatchery fish have adipose fin clip; hatchery YoY fish released starting in jw 22. In jw 40, hatchery 1 released; no further wild fish in stream (?) Marked Fish Unmarked Fish AD JW D n i m ii m i,i+1 m i,i+2 AD* NC +NC f ,135 4, , ,452 10, , ,199 2, , ,427 30,542 39, , ,243 13,337 17, , ,646 6,282 7, Notice range of catchability in j.w. 9, 41, etc - Additional structure in p use covariates such as log-flow or spline Need to expand ad-clipped fish by factor of

11 Separating Wild and Hatchery JC 2003 CH YOY Fish Population Estimation Separating Wild and Hatchery YOY CH Fish JC Run Timing W.YOY (millions) H.YOY (millions) Pooled Petersen* 0.74 (SE.02) 1.3 (SE.03) Stratified-Petersen 0.71 (SE.04) 2.3 (SE.17) Spline 0.87 (SD.11) 2.3 (SD.12) *SE Not adjusted for interpolation of ad-clipped. Note that after jw 40, hatchery fish are classified as age 1+ Wild 0% 10% 30% 50% 70% 90% 100% Mean SD Hatchery Mean SD

12 Evaluation of Trinity River Restoration Program Evaluation of Trinity River Restoration Program Estimates of log(abundance) available at JC, WC, and PT (not all sites in all years Estimates of log(abundance) available using flow-based method (i.e. look at fraction of flow that is sampled by rotary screw trap) Need to align series and extrapolate

13 Evaluation of Trinity River Restoration Program Summary - borrows information from other weeks o spline forces estimates to follow smoothish curve o capture rates come from common distribution - estimates available at weekly and total level with realistic se - estimates available for wild vs hatchery groups with realistic se - run timing estimates available - easy to interpolate for weeks with missing/odd data - able to add covariates (e.g. log-flow for p) (not shown) - model fitting complex no hand computations - careful of interpolations before and after last sampling - assumption that patterns visible = patterns hidden - Allows some flexibility in sampling, e.g. every second week - BTSPAS package in R has complete examples 25 - Bonner and Schwarz (2010a; 2010b) for more details - Phase2/TrinityReport-Final%20Phase2.pdf for full details. 26

Bayesian Time-Stratified-Petersen Estimators for Abundance

Bayesian Time-Stratified-Petersen Estimators for Abundance Bayesian Time-Stratified-Petersen Estimators for Abundance Simon Bonner 1 Carl James Schwarz 2 1 Department of Statistics University of Kentucky Lexington, KY 2 Department of Statistics and Actuarial Science

More information

Estimating trap efficiency (common to all diagonal cases) - based on mark-recapture experiments o m ii. , p i. ,! p 2 ) or m ii

Estimating trap efficiency (common to all diagonal cases) - based on mark-recapture experiments o m ii. , p i. ,! p 2 ) or m ii Theoretical Basis for Spline Programs Diagonal Case - Wild + Hatchery combined - Wild vs Hatchery (W.YoY, H.YoY) for Chinook - Wild vs Hatchery (W.YoY, W.1+, H.1+) for Steelhead Non-diagonal case (not

More information

Package BTSPAS. R topics documented: February 19, Version Date Title Bayesian Time-Strat. Population Analysis

Package BTSPAS. R topics documented: February 19, Version Date Title Bayesian Time-Strat. Population Analysis Version 2014.0901 Date 2014-09-01 Title Bayesian Time-Strat. Population Analysis Package BTSPAS February 19, 2015 Author Carl J Schwarz and Simon J Bonner

More information

Trinity River Restoration Program Juvenile Salmonid Outmigrant Monitoring Evaluation, Phase II

Trinity River Restoration Program Juvenile Salmonid Outmigrant Monitoring Evaluation, Phase II , Final Technical Memorandum December 21, 2009 Prepared for: 1313 S. Main Street P.O. Box 1300 Weaverville, California 96093 Prepared by: ESSA Technologies, Ltd. 1765 W 8 th Avenue Vancouver, British Columbia

More information

Smoothing Population Size Estimates for Time-Stratified Mark-Recapture. Experiments using Bayesian P-Splines

Smoothing Population Size Estimates for Time-Stratified Mark-Recapture. Experiments using Bayesian P-Splines Biometrics 000, 000 000 DOI: 000 000 0000 Smoothing Population Size Estimates for Time-Stratified Mark-Recapture Experiments using Bayesian P-Splines Simon J Bonner 1 and Carl J Schwarz 2 1 Department

More information

Nonparametric regression using kernel and spline methods

Nonparametric regression using kernel and spline methods Nonparametric regression using kernel and spline methods Jean D. Opsomer F. Jay Breidt March 3, 016 1 The statistical model When applying nonparametric regression methods, the researcher is interested

More information

Fisheries Research 145 (2013) Contents lists available at SciVerse ScienceDirect. Fisheries Research

Fisheries Research 145 (2013) Contents lists available at SciVerse ScienceDirect. Fisheries Research Fisheries Research 145 (2013) 90 99 Contents lists available at SciVerse ScienceDirect Fisheries Research j our nal ho mep ag e: www.elsevier.com/locate/fishres Bayesian implementation of a time stratified

More information

lecture 10: B-Splines

lecture 10: B-Splines 9 lecture : -Splines -Splines: a basis for splines Throughout our discussion of standard polynomial interpolation, we viewed P n as a linear space of dimension n +, and then expressed the unique interpolating

More information

Chapter 19 Estimating the dose-response function

Chapter 19 Estimating the dose-response function Chapter 19 Estimating the dose-response function Dose-response Students and former students often tell me that they found evidence for dose-response. I know what they mean because that's the phrase I have

More information

Moving Beyond Linearity

Moving Beyond Linearity Moving Beyond Linearity The truth is never linear! 1/23 Moving Beyond Linearity The truth is never linear! r almost never! 1/23 Moving Beyond Linearity The truth is never linear! r almost never! But often

More information

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is...

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1D Regression i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is... 1 Non-linear problems What if the underlying function is

More information

Nonparametric Mixed-Effects Models for Longitudinal Data

Nonparametric Mixed-Effects Models for Longitudinal Data Nonparametric Mixed-Effects Models for Longitudinal Data Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore University of Seoul, South Korea, 7 p.1/26 OUTLINE The Motivating Data

More information

15.10 Curve Interpolation using Uniform Cubic B-Spline Curves. CS Dept, UK

15.10 Curve Interpolation using Uniform Cubic B-Spline Curves. CS Dept, UK 1 An analysis of the problem: To get the curve constructed, how many knots are needed? Consider the following case: So, to interpolate (n +1) data points, one needs (n +7) knots,, for a uniform cubic B-spline

More information

Interactive Graphics. Lecture 9: Introduction to Spline Curves. Interactive Graphics Lecture 9: Slide 1

Interactive Graphics. Lecture 9: Introduction to Spline Curves. Interactive Graphics Lecture 9: Slide 1 Interactive Graphics Lecture 9: Introduction to Spline Curves Interactive Graphics Lecture 9: Slide 1 Interactive Graphics Lecture 13: Slide 2 Splines The word spline comes from the ship building trade

More information

and the crooked shall be made straight, and the rough ways shall be made smooth; Luke 3:5

and the crooked shall be made straight, and the rough ways shall be made smooth; Luke 3:5 ecture 8: Knot Insertion Algorithms for B-Spline Curves and Surfaces and the crooked shall be made straight, and the rough ways shall be made smooth; uke 3:5. Motivation B-spline methods have several advantages

More information

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an Soft Threshold Estimation for Varying{coecient Models Artur Klinger, Universitat Munchen ABSTRACT: An alternative penalized likelihood estimator for varying{coecient regression in generalized linear models

More information

Lecture 9: Introduction to Spline Curves

Lecture 9: Introduction to Spline Curves Lecture 9: Introduction to Spline Curves Splines are used in graphics to represent smooth curves and surfaces. They use a small set of control points (knots) and a function that generates a curve through

More information

Lecture 8. Divided Differences,Least-Squares Approximations. Ceng375 Numerical Computations at December 9, 2010

Lecture 8. Divided Differences,Least-Squares Approximations. Ceng375 Numerical Computations at December 9, 2010 Lecture 8, Ceng375 Numerical Computations at December 9, 2010 Computer Engineering Department Çankaya University 8.1 Contents 1 2 3 8.2 : These provide a more efficient way to construct an interpolating

More information

Computer Graphics Curves and Surfaces. Matthias Teschner

Computer Graphics Curves and Surfaces. Matthias Teschner Computer Graphics Curves and Surfaces Matthias Teschner Outline Introduction Polynomial curves Bézier curves Matrix notation Curve subdivision Differential curve properties Piecewise polynomial curves

More information

Mar. 20 Math 2335 sec 001 Spring 2014

Mar. 20 Math 2335 sec 001 Spring 2014 Mar. 20 Math 2335 sec 001 Spring 2014 Chebyshev Polynomials Definition: For an integer n 0 define the function ( ) T n (x) = cos n cos 1 (x), 1 x 1. It can be shown that T n is a polynomial of degree n.

More information

Parametric Curves. University of Texas at Austin CS384G - Computer Graphics Fall 2010 Don Fussell

Parametric Curves. University of Texas at Austin CS384G - Computer Graphics Fall 2010 Don Fussell Parametric Curves University of Texas at Austin CS384G - Computer Graphics Fall 2010 Don Fussell Parametric Representations 3 basic representation strategies: Explicit: y = mx + b Implicit: ax + by + c

More information

3D Modeling Parametric Curves & Surfaces. Shandong University Spring 2013

3D Modeling Parametric Curves & Surfaces. Shandong University Spring 2013 3D Modeling Parametric Curves & Surfaces Shandong University Spring 2013 3D Object Representations Raw data Point cloud Range image Polygon soup Surfaces Mesh Subdivision Parametric Implicit Solids Voxels

More information

Spline Methods Draft. Tom Lyche and Knut Mørken

Spline Methods Draft. Tom Lyche and Knut Mørken Spline Methods Draft Tom Lyche and Knut Mørken 24th May 2002 2 Contents 1 Splines and B-splines an introduction 3 1.1 Convex combinations and convex hulls..................... 3 1.1.1 Stable computations...........................

More information

End-Term Examination

End-Term Examination Paper Code: MCA-108 Paper ID : 44108 Second Semester [MCA] MAY-JUNE 2006 Q. 1 Describe the following in brief :- (3 x 5 = 15) (a) QUADRATIC SURFACES (b) RGB Color Models. (c) BSP Tree (d) Solid Modeling

More information

Spline Methods Draft. Tom Lyche and Knut Mørken

Spline Methods Draft. Tom Lyche and Knut Mørken Spline Methods Draft Tom Lyche and Knut Mørken January 5, 2005 2 Contents 1 Splines and B-splines an Introduction 3 1.1 Convex combinations and convex hulls.................... 3 1.1.1 Stable computations...........................

More information

Parametric Curves. University of Texas at Austin CS384G - Computer Graphics

Parametric Curves. University of Texas at Austin CS384G - Computer Graphics Parametric Curves University of Texas at Austin CS384G - Computer Graphics Fall 2010 Don Fussell Parametric Representations 3 basic representation strategies: Explicit: y = mx + b Implicit: ax + by + c

More information

Four equations are necessary to evaluate these coefficients. Eqn

Four equations are necessary to evaluate these coefficients. Eqn 1.2 Splines 11 A spline function is a piecewise defined function with certain smoothness conditions [Cheney]. A wide variety of functions is potentially possible; polynomial functions are almost exclusively

More information

Splines and penalized regression

Splines and penalized regression Splines and penalized regression November 23 Introduction We are discussing ways to estimate the regression function f, where E(y x) = f(x) One approach is of course to assume that f has a certain shape,

More information

3D Modeling Parametric Curves & Surfaces

3D Modeling Parametric Curves & Surfaces 3D Modeling Parametric Curves & Surfaces Shandong University Spring 2012 3D Object Representations Raw data Point cloud Range image Polygon soup Solids Voxels BSP tree CSG Sweep Surfaces Mesh Subdivision

More information

Statistical Analysis Using Combined Data Sources: Discussion JPSM Distinguished Lecture University of Maryland

Statistical Analysis Using Combined Data Sources: Discussion JPSM Distinguished Lecture University of Maryland Statistical Analysis Using Combined Data Sources: Discussion 2011 JPSM Distinguished Lecture University of Maryland 1 1 University of Michigan School of Public Health April 2011 Complete (Ideal) vs. Observed

More information

CS-184: Computer Graphics

CS-184: Computer Graphics CS-184: Computer Graphics Lecture #12: Curves and Surfaces Prof. James O Brien University of California, Berkeley V2007-F-12-1.0 Today General curve and surface representations Splines and other polynomial

More information

Curve and Surface Fitting with Splines. PAUL DIERCKX Professor, Computer Science Department, Katholieke Universiteit Leuven, Belgium

Curve and Surface Fitting with Splines. PAUL DIERCKX Professor, Computer Science Department, Katholieke Universiteit Leuven, Belgium Curve and Surface Fitting with Splines PAUL DIERCKX Professor, Computer Science Department, Katholieke Universiteit Leuven, Belgium CLARENDON PRESS OXFORD 1995 - Preface List of Figures List of Tables

More information

B-Spline Polynomials. B-Spline Polynomials. Uniform Cubic B-Spline Curves CS 460. Computer Graphics

B-Spline Polynomials. B-Spline Polynomials. Uniform Cubic B-Spline Curves CS 460. Computer Graphics CS 460 B-Spline Polynomials Computer Graphics Professor Richard Eckert March 24, 2004 B-Spline Polynomials Want local control Smoother curves B-spline curves: Segmented approximating curve 4 control points

More information

Fall CSCI 420: Computer Graphics. 4.2 Splines. Hao Li.

Fall CSCI 420: Computer Graphics. 4.2 Splines. Hao Li. Fall 2014 CSCI 420: Computer Graphics 4.2 Splines Hao Li http://cs420.hao-li.com 1 Roller coaster Next programming assignment involves creating a 3D roller coaster animation We must model the 3D curve

More information

Spline Methods Draft. Tom Lyche and Knut Mørken. Department of Informatics Centre of Mathematics for Applications University of Oslo

Spline Methods Draft. Tom Lyche and Knut Mørken. Department of Informatics Centre of Mathematics for Applications University of Oslo Spline Methods Draft Tom Lyche and Knut Mørken Department of Informatics Centre of Mathematics for Applications University of Oslo January 27, 2006 Contents 1 Splines and B-splines an Introduction 1 1.1

More information

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2)

Goals of the Lecture. SOC6078 Advanced Statistics: 9. Generalized Additive Models. Limitations of the Multiple Nonparametric Models (2) SOC6078 Advanced Statistics: 9. Generalized Additive Models Robert Andersen Department of Sociology University of Toronto Goals of the Lecture Introduce Additive Models Explain how they extend from simple

More information

Last time... Bias-Variance decomposition. This week

Last time... Bias-Variance decomposition. This week Machine learning, pattern recognition and statistical data modelling Lecture 4. Going nonlinear: basis expansions and splines Last time... Coryn Bailer-Jones linear regression methods for high dimensional

More information

8 Piecewise Polynomial Interpolation

8 Piecewise Polynomial Interpolation Applied Math Notes by R. J. LeVeque 8 Piecewise Polynomial Interpolation 8. Pitfalls of high order interpolation Suppose we know the value of a function at several points on an interval and we wish to

More information

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset. Glossary of data mining terms: Accuracy Accuracy is an important factor in assessing the success of data mining. When applied to data, accuracy refers to the rate of correct values in the data. When applied

More information

Generalized Additive Models

Generalized Additive Models :p Texts in Statistical Science Generalized Additive Models An Introduction with R Simon N. Wood Contents Preface XV 1 Linear Models 1 1.1 A simple linear model 2 Simple least squares estimation 3 1.1.1

More information

A popular method for moving beyond linearity. 2. Basis expansion and regularization 1. Examples of transformations. Piecewise-polynomials and splines

A popular method for moving beyond linearity. 2. Basis expansion and regularization 1. Examples of transformations. Piecewise-polynomials and splines A popular method for moving beyond linearity 2. Basis expansion and regularization 1 Idea: Augment the vector inputs x with additional variables which are transformation of x use linear models in this

More information

Missing Data Analysis for the Employee Dataset

Missing Data Analysis for the Employee Dataset Missing Data Analysis for the Employee Dataset 67% of the observations have missing values! Modeling Setup Random Variables: Y i =(Y i1,...,y ip ) 0 =(Y i,obs, Y i,miss ) 0 R i =(R i1,...,r ip ) 0 ( 1

More information

Splines. Parameterization of a Curve. Curve Representations. Roller coaster. What Do We Need From Curves in Computer Graphics? Modeling Complex Shapes

Splines. Parameterization of a Curve. Curve Representations. Roller coaster. What Do We Need From Curves in Computer Graphics? Modeling Complex Shapes CSCI 420 Computer Graphics Lecture 8 Splines Jernej Barbic University of Southern California Hermite Splines Bezier Splines Catmull-Rom Splines Other Cubic Splines [Angel Ch 12.4-12.12] Roller coaster

More information

Generalized Additive Model

Generalized Additive Model Generalized Additive Model by Huimin Liu Department of Mathematics and Statistics University of Minnesota Duluth, Duluth, MN 55812 December 2008 Table of Contents Abstract... 2 Chapter 1 Introduction 1.1

More information

A toolbox of smooths. Simon Wood Mathematical Sciences, University of Bath, U.K.

A toolbox of smooths. Simon Wood Mathematical Sciences, University of Bath, U.K. A toolbo of smooths Simon Wood Mathematical Sciences, University of Bath, U.K. Smooths for semi-parametric GLMs To build adequate semi-parametric GLMs requires that we use functions with appropriate properties.

More information

Natural Quartic Spline

Natural Quartic Spline Natural Quartic Spline Rafael E Banchs INTRODUCTION This report describes the natural quartic spline algorithm developed for the enhanced solution of the Time Harmonic Field Electric Logging problem As

More information

CS6375: Machine Learning Gautam Kunapuli. Mid-Term Review

CS6375: Machine Learning Gautam Kunapuli. Mid-Term Review Gautam Kunapuli Machine Learning Data is identically and independently distributed Goal is to learn a function that maps to Data is generated using an unknown function Learn a hypothesis that minimizes

More information

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric)

Splines. Patrick Breheny. November 20. Introduction Regression splines (parametric) Smoothing splines (nonparametric) Splines Patrick Breheny November 20 Patrick Breheny STA 621: Nonparametric Statistics 1/46 Introduction Introduction Problems with polynomial bases We are discussing ways to estimate the regression function

More information

Computer Graphics I Lecture 11

Computer Graphics I Lecture 11 15-462 Computer Graphics I Lecture 11 Midterm Review Assignment 3 Movie Midterm Review Midterm Preview February 26, 2002 Frank Pfenning Carnegie Mellon University http://www.cs.cmu.edu/~fp/courses/graphics/

More information

Intro to Modeling Modeling in 3D

Intro to Modeling Modeling in 3D Intro to Modeling Modeling in 3D Polygon sets can approximate more complex shapes as discretized surfaces 2 1 2 3 Curve surfaces in 3D Sphere, ellipsoids, etc Curved Surfaces Modeling in 3D ) ( 2 2 2 2

More information

Spline fitting tool for scientometric applications: estimation of citation peaks and publication time-lags

Spline fitting tool for scientometric applications: estimation of citation peaks and publication time-lags Spline fitting tool for scientometric applications: estimation of citation peaks and publication time-lags By: Hicham Sabir Presentation to: 21 STI Conference, September 1th, 21 Description of the problem

More information

Know it. Control points. B Spline surfaces. Implicit surfaces

Know it. Control points. B Spline surfaces. Implicit surfaces Know it 15 B Spline Cur 14 13 12 11 Parametric curves Catmull clark subdivision Parametric surfaces Interpolating curves 10 9 8 7 6 5 4 3 2 Control points B Spline surfaces Implicit surfaces Bezier surfaces

More information

REGULARIZED REGRESSION FOR RESERVING AND MORTALITY MODELS GARY G. VENTER

REGULARIZED REGRESSION FOR RESERVING AND MORTALITY MODELS GARY G. VENTER REGULARIZED REGRESSION FOR RESERVING AND MORTALITY MODELS GARY G. VENTER TODAY Advances in model estimation methodology Application to data that comes in rectangles Examples ESTIMATION Problems with MLE

More information

Automatic basis selection for RBF networks using Stein s unbiased risk estimator

Automatic basis selection for RBF networks using Stein s unbiased risk estimator Automatic basis selection for RBF networks using Stein s unbiased risk estimator Ali Ghodsi School of omputer Science University of Waterloo University Avenue West NL G anada Email: aghodsib@cs.uwaterloo.ca

More information

Nonparametric Approaches to Regression

Nonparametric Approaches to Regression Nonparametric Approaches to Regression In traditional nonparametric regression, we assume very little about the functional form of the mean response function. In particular, we assume the model where m(xi)

More information

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation

More information

Clustering and The Expectation-Maximization Algorithm

Clustering and The Expectation-Maximization Algorithm Clustering and The Expectation-Maximization Algorithm Unsupervised Learning Marek Petrik 3/7 Some of the figures in this presentation are taken from An Introduction to Statistical Learning, with applications

More information

Cost Functions in Machine Learning

Cost Functions in Machine Learning Cost Functions in Machine Learning Kevin Swingler Motivation Given some data that reflects measurements from the environment We want to build a model that reflects certain statistics about that data Something

More information

08 - Designing Approximating Curves

08 - Designing Approximating Curves 08 - Designing Approximating Curves Acknowledgement: Olga Sorkine-Hornung, Alexander Sorkine-Hornung, Ilya Baran Last time Interpolating curves Monomials Lagrange Hermite Different control types Polynomials

More information

STAT 705 Introduction to generalized additive models

STAT 705 Introduction to generalized additive models STAT 705 Introduction to generalized additive models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 22 Generalized additive models Consider a linear

More information

Analyzing Longitudinal Data Using Regression Splines

Analyzing Longitudinal Data Using Regression Splines Analyzing Longitudinal Data Using Regression Splines Zhang Jin-Ting Dept of Stat & Appl Prob National University of Sinagpore August 18, 6 DSAP, NUS p.1/16 OUTLINE Motivating Longitudinal Data Parametric

More information

Unsupervised Learning. Clustering and the EM Algorithm. Unsupervised Learning is Model Learning

Unsupervised Learning. Clustering and the EM Algorithm. Unsupervised Learning is Model Learning Unsupervised Learning Clustering and the EM Algorithm Susanna Ricco Supervised Learning Given data in the form < x, y >, y is the target to learn. Good news: Easy to tell if our algorithm is giving the

More information

Program TribPit. Cohort Analysis of Juvenile Salmonid Movement and Survival in Tributaries

Program TribPit. Cohort Analysis of Juvenile Salmonid Movement and Survival in Tributaries Program TribPit Cohort Analysis of Juvenile Salmonid Movement and Survival in Tributaries Developed by: James Lady, John R. Skalski, and Rebecca Buchanan Columbia Basin Research School of Aquatic & Fishery

More information

Lecture 2.2 Cubic Splines

Lecture 2.2 Cubic Splines Lecture. Cubic Splines Cubic Spline The equation for a single parametric cubic spline segment is given by 4 i t Bit t t t i (..) where t and t are the parameter values at the beginning and end of the segment.

More information

CS 450 Numerical Analysis. Chapter 7: Interpolation

CS 450 Numerical Analysis. Chapter 7: Interpolation Lecture slides based on the textbook Scientific Computing: An Introductory Survey by Michael T. Heath, copyright c 2018 by the Society for Industrial and Applied Mathematics. http://www.siam.org/books/cl80

More information

Smoothing and Forecasting Mortality Rates with P-splines. Iain Currie. Data and problem. Plan of talk

Smoothing and Forecasting Mortality Rates with P-splines. Iain Currie. Data and problem. Plan of talk Smoothing and Forecasting Mortality Rates with P-splines Iain Currie Heriot Watt University Data and problem Data: CMI assured lives : 20 to 90 : 1947 to 2002 Problem: forecast table to 2046 London, June

More information

2D Spline Curves. CS 4620 Lecture 13

2D Spline Curves. CS 4620 Lecture 13 2D Spline Curves CS 4620 Lecture 13 2008 Steve Marschner 1 Motivation: smoothness In many applications we need smooth shapes [Boeing] that is, without discontinuities So far we can make things with corners

More information

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1 Preface to the Second Edition Preface to the First Edition vii xi 1 Introduction 1 2 Overview of Supervised Learning 9 2.1 Introduction... 9 2.2 Variable Types and Terminology... 9 2.3 Two Simple Approaches

More information

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines

Spline Models. Introduction to CS and NCS. Regression splines. Smoothing splines Spline Models Introduction to CS and NCS Regression splines Smoothing splines 3 Cubic Splines a knots: a< 1 < 2 < < m

More information

Curves and Surfaces Computer Graphics I Lecture 10

Curves and Surfaces Computer Graphics I Lecture 10 15-462 Computer Graphics I Lecture 10 Curves and Surfaces Parametric Representations Cubic Polynomial Forms Hermite Curves Bezier Curves and Surfaces [Angel 10.1-10.6] September 30, 2003 Doug James Carnegie

More information

BASIC LOESS, PBSPLINE & SPLINE

BASIC LOESS, PBSPLINE & SPLINE CURVES AND SPLINES DATA INTERPOLATION SGPLOT provides various methods for fitting smooth trends to scatterplot data LOESS An extension of LOWESS (Locally Weighted Scatterplot Smoothing), uses locally weighted

More information

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures:

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures: Homework Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression 3.0-3.2 Pod-cast lecture on-line Next lectures: I posted a rough plan. It is flexible though so please come with suggestions Bayes

More information

CS130 : Computer Graphics Curves (cont.) Tamar Shinar Computer Science & Engineering UC Riverside

CS130 : Computer Graphics Curves (cont.) Tamar Shinar Computer Science & Engineering UC Riverside CS130 : Computer Graphics Curves (cont.) Tamar Shinar Computer Science & Engineering UC Riverside Blending Functions Blending functions are more convenient basis than monomial basis canonical form (monomial

More information

Supervised vs unsupervised clustering

Supervised vs unsupervised clustering Classification Supervised vs unsupervised clustering Cluster analysis: Classes are not known a- priori. Classification: Classes are defined a-priori Sometimes called supervised clustering Extract useful

More information

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked Plotting Menu: QCExpert Plotting Module graphs offers various tools for visualization of uni- and multivariate data. Settings and options in different types of graphs allow for modifications and customizations

More information

EECS 487, Fall 2005 Exam 2

EECS 487, Fall 2005 Exam 2 EECS 487, Fall 2005 Exam 2 December 21, 2005 This is a closed book exam. Notes are not permitted. Basic calculators are permitted, but not needed. Explain or show your work for each question. Name: uniqname:

More information

For each question, indicate whether the statement is true or false by circling T or F, respectively.

For each question, indicate whether the statement is true or false by circling T or F, respectively. True/False For each question, indicate whether the statement is true or false by circling T or F, respectively. 1. (T/F) Rasterization occurs before vertex transformation in the graphics pipeline. 2. (T/F)

More information

GAMs, GAMMs and other penalized GLMs using mgcv in R. Simon Wood Mathematical Sciences, University of Bath, U.K.

GAMs, GAMMs and other penalized GLMs using mgcv in R. Simon Wood Mathematical Sciences, University of Bath, U.K. GAMs, GAMMs and other penalied GLMs using mgcv in R Simon Wood Mathematical Sciences, University of Bath, U.K. Simple eample Consider a very simple dataset relating the timber volume of cherry trees to

More information

Lecture 17: Smoothing splines, Local Regression, and GAMs

Lecture 17: Smoothing splines, Local Regression, and GAMs Lecture 17: Smoothing splines, Local Regression, and GAMs Reading: Sections 7.5-7 STATS 202: Data mining and analysis November 6, 2017 1 / 24 Cubic splines Define a set of knots ξ 1 < ξ 2 < < ξ K. We want

More information

4.12 Generalization. In back-propagation learning, as many training examples as possible are typically used.

4.12 Generalization. In back-propagation learning, as many training examples as possible are typically used. 1 4.12 Generalization In back-propagation learning, as many training examples as possible are typically used. It is hoped that the network so designed generalizes well. A network generalizes well when

More information

Curve fitting using linear models

Curve fitting using linear models Curve fitting using linear models Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark September 28, 2012 1 / 12 Outline for today linear models and basis functions polynomial regression

More information

MA 323 Geometric Modelling Course Notes: Day 10 Higher Order Polynomial Curves

MA 323 Geometric Modelling Course Notes: Day 10 Higher Order Polynomial Curves MA 323 Geometric Modelling Course Notes: Day 10 Higher Order Polynomial Curves David L. Finn December 14th, 2004 Yesterday, we introduced quintic Hermite curves as a higher order variant of cubic Hermite

More information

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique

Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique Linear Penalized Spline Model Estimation Using Ranked Set Sampling Technique Al Kadiri M. A. Abstract Benefits of using Ranked Set Sampling (RSS) rather than Simple Random Sampling (SRS) are indeed significant

More information

Spectral Methods for Network Community Detection and Graph Partitioning

Spectral Methods for Network Community Detection and Graph Partitioning Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman Department of Physics, University of Michigan Presenters: Yunqi Guo Xueyin Yu Yuanqi Li 1 Outline: Community Detection

More information

Knowledge Discovery and Data Mining

Knowledge Discovery and Data Mining Knowledge Discovery and Data Mining Basis Functions Tom Kelsey School of Computer Science University of St Andrews http://www.cs.st-andrews.ac.uk/~tom/ tom@cs.st-andrews.ac.uk Tom Kelsey ID5059-02-BF 2015-02-04

More information

REPLACING MLE WITH BAYESIAN SHRINKAGE CAS ANNUAL MEETING NOVEMBER 2018 GARY G. VENTER

REPLACING MLE WITH BAYESIAN SHRINKAGE CAS ANNUAL MEETING NOVEMBER 2018 GARY G. VENTER REPLACING MLE WITH BAYESIAN SHRINKAGE CAS ANNUAL MEETING NOVEMBER 2018 GARY G. VENTER ESTIMATION Problems with MLE known since Charles Stein 1956 paper He showed that when estimating 3 or more means, shrinking

More information

2014 Stat-Ease, Inc. All Rights Reserved.

2014 Stat-Ease, Inc. All Rights Reserved. What s New in Design-Expert version 9 Factorial split plots (Two-Level, Multilevel, Optimal) Definitive Screening and Single Factor designs Journal Feature Design layout Graph Columns Design Evaluation

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION Introduction CHAPTER 1 INTRODUCTION Mplus is a statistical modeling program that provides researchers with a flexible tool to analyze their data. Mplus offers researchers a wide choice of models, estimators,

More information

Additive hedonic regression models for the Austrian housing market ERES Conference, Edinburgh, June

Additive hedonic regression models for the Austrian housing market ERES Conference, Edinburgh, June for the Austrian housing market, June 14 2012 Ao. Univ. Prof. Dr. Fachbereich Stadt- und Regionalforschung Technische Universität Wien Dr. Strategic Risk Management Bank Austria UniCredit, Wien Inhalt

More information

Lecture VIII. Global Approximation Methods: I

Lecture VIII. Global Approximation Methods: I Lecture VIII Global Approximation Methods: I Gianluca Violante New York University Quantitative Macroeconomics G. Violante, Global Methods p. 1 /29 Global function approximation Global methods: function

More information

Computer Graphics. Curves and Surfaces. Hermite/Bezier Curves, (B-)Splines, and NURBS. By Ulf Assarsson

Computer Graphics. Curves and Surfaces. Hermite/Bezier Curves, (B-)Splines, and NURBS. By Ulf Assarsson Computer Graphics Curves and Surfaces Hermite/Bezier Curves, (B-)Splines, and NURBS By Ulf Assarsson Most of the material is originally made by Edward Angel and is adapted to this course by Ulf Assarsson.

More information

Handling Data with Three Types of Missing Values:

Handling Data with Three Types of Missing Values: Handling Data with Three Types of Missing Values: A Simulation Study Jennifer Boyko Advisor: Ofer Harel Department of Statistics University of Connecticut Storrs, CT May 21, 2013 Jennifer Boyko Handling

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS6: Mining Massive Datasets Jure Leskovec, Stanford University http://cs6.stanford.edu Training data 00 million ratings, 80,000 users, 7,770 movies 6 years of data: 000 00 Test data Last few ratings of

More information

demonstrate an understanding of the exponent rules of multiplication and division, and apply them to simplify expressions Number Sense and Algebra

demonstrate an understanding of the exponent rules of multiplication and division, and apply them to simplify expressions Number Sense and Algebra MPM 1D - Grade Nine Academic Mathematics This guide has been organized in alignment with the 2005 Ontario Mathematics Curriculum. Each of the specific curriculum expectations are cross-referenced to the

More information

A story about Non Uniform Rational B-Splines. E. Shcherbakov

A story about Non Uniform Rational B-Splines. E. Shcherbakov A story about Non Uniform Rational B-Splines E. Shcherbakov Speakers 09-06: B-spline curves (W. Dijkstra) 16-06: NURBS (E. Shcherbakov) 30-06: B-spline surfaces (M. Patricio) Seminar 16-06-2004 2 Outline

More information

Research Article Data Visualization Using Rational Trigonometric Spline

Research Article Data Visualization Using Rational Trigonometric Spline Applied Mathematics Volume Article ID 97 pages http://dx.doi.org/.//97 Research Article Data Visualization Using Rational Trigonometric Spline Uzma Bashir and Jamaludin Md. Ali School of Mathematical Sciences

More information

CS-184: Computer Graphics. Today

CS-184: Computer Graphics. Today CS-84: Computer Graphics Lecture #5: Curves and Surfaces Prof. James O Brien University of California, Berkeley V25F-5-. Today General curve and surface representations Splines and other polynomial bases

More information

Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS

Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS ABSTRACT Paper 1938-2018 Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS Robert M. Lucas, Robert M. Lucas Consulting, Fort Collins, CO, USA There is confusion

More information

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM) School of Computer Science Probabilistic Graphical Models Structured Sparse Additive Models Junming Yin and Eric Xing Lecture 7, April 4, 013 Reading: See class website 1 Outline Nonparametric regression

More information

A MATRIX FORMULATION OF THE CUBIC BÉZIER CURVE

A MATRIX FORMULATION OF THE CUBIC BÉZIER CURVE Geometric Modeling Notes A MATRIX FORMULATION OF THE CUBIC BÉZIER CURVE Kenneth I. Joy Institute for Data Analysis and Visualization Department of Computer Science University of California, Davis Overview

More information