MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data

Size: px
Start display at page:

Download "MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data"

Transcription

1 MINI-PAPER by Rong Pan, Ph.D., Assistant Professor of Industrial Engineering, Arizona State University We, applied statisticians and manufacturing engineers, often need to deal with sequential data, which are collected on a manufacturing process by sensors. The data volume could be very large and it is easy to lose the sight of the purpose of data analysis. In this mini-paper we will discuss some basic purposes for doing such data analysis and the simple tools that can assist in data analysis. We will focus on the analysis of a univariate series. Our purposes include understanding the series characteristics as exhibited in observations, predicting a future observation, and controlling mean value and variance. 1. Independence versus Correlation Sequential data is often referred as time series, particularly for the observations collected over time. However, we recognize that this type of data is not necessary to be indexed by time only. For example, considering the x-ray inspection of highway pavement, these data are spatially indexed. One of the critical data analysis steps is to check if observations are independent to each other. Many statistical data analysis techniques assume that observations are independent, which is unlikely in sequential data where close observations tend to be correlated even when the system under study is stable and undisturbed. This type of correlation is defined as autocorrelation (prefix auto- means self ), it is the correlation of samples in the series to earlier samples in the same series. Autocorrelation is a function of time lag, i.e., the time units separating a later series from an earlier series. Figure 1(a) plots the data from Boyles (2000), which gave a time series (190 observations) for the fill weights observed for a powder food product. Over time, it looks as though observations are wandering around a mean value, but it is still not easy to detect autocorrelation by visual inspection only. The autocorrelation can be revealed by a plot of the sample autocorrelation function, which is shown in Figure 1(b). Clearly, the autocorrelation of lag 1 is significant and positive, which indicates that every next observation tends to be more or less like its previous observation. Unlike independent observations where variation is due to a pure random noise only, variation in an autocorrelated series can be decomposed to random noise and a systematic variation pattern. In this example, the primary pattern is lag one autocorrelation. Knowing this systematic pattern is essential to the time series prediction and control. Statistically significant autocorrelation is strongest at lag=1 Figure 1: (a) time series plot and (b) sample autocorrelation of Boyles data set. 4

2 Continued from page 4 2. Stationary versus Nonstationary Another important characteristic of time series is stationarity. Loosely speaking, if observations are varying around a constant mean and the variance does not explode over time, then the series is stationary. In industry, a nonstationary process is highly undesirable. It indicates that the process needs some corrective actions to bring it back under control. Detecting the nonstationary pattern can give insights into the mechanism that generates the series. For example, del Castillo (2002, p.20) gave 40 measures of a dimension machined on aluminum parts processed on a Fadal computer numerically controlled (CNC) machine tool. The time series plot, as shown in Figure 2(a), clearly demonstrates a pattern of increasing mean over time, possibly because of the tool wear. We are interested in decomposing the total variation of this series to the sum of increasing mean value plus the pure random noise. After removing (filtering) the increasing trend of the mean by using a drifted random walk model (technically an integrated moving average model), Figure 2(b) shows the series under control. After subtracting the predictable drifts, it becomes a random noise without apparent patterns. The process can be made stationary through corrective actions calculated by the filter model. Figure 2: (a) time series plot and (b) residual plot of Fadal dataset. Modern manufacturing processes are usually under some types of automatic process control (APC) to prevent nonstationarity/instability. As systematic reactions to past observations, control actions will introduce autocorrelation into the sequential process observations. These automated reactions to trends make APC different from statistical process control (SPC). SPC is more concerned with detecting a sudden shift in process mean and/or process variance due to assignable causes, but less concerned with mild autocorrelation as long as the process is stationary. Assignable causes may come from outside the process, and require investigation to determine an effective reaction. APC reacts to common causes that are part of the process (e.g. tool wear), and effective reactions are so well known they can be automated. However, APC can interfere with SPC. Systematic reactions deliberately push the next observation away from the previous observation, creating negative autocorrelation that may inflate the estimation of process variance. This leads to inflated control chart limits, and makes the statistical process control chart less sensitive to a shift. An active quality control research area in the past two decades is the integration of APC and SPC. For detailed discussions on this topic, see Box and Luceño (1997), del Castillo (2002) and Pan (2009). 5

3 Continued from page 5 3. Filtering and Smoothing Given a sequential dataset we would like to fit a model which consists of a predictable part, such as a deterministic mathematical function, and a random noise. Smoothing and filtering are two approaches to the same problem, i.e., separating the predictable part (signal) from the noise, but from different perspectives. Filtering comes from signal processing and target tracking, where the task is to filter out the noise and to predict the next move of the target. Therefore, the ability to make on-line predictions in real-time is very important. Smoothing comes from statistics, where the major task includes model fitting and diagnosis. It is typically implemented off-line, so all available data can be utilized to estimate the predictable part of the time series. After removing the noise, the series will become smoother (less variation) than the original one. Oftentimes these methods are known as forward filtering and backward smoothing. For example, Eq. (1) is a linear filter using a moving average, taking the weighted average of past five observations: yt = w1xt + w2xt-1 + w3xt-2 + w4xt-3 + w5xt-4, Eq. (1) 5 where wi s are weights and i - 1 wi = 1. The current time index value is t, one lag in the past is t-1, and so on. For this example, a simple moving average filter will let wi = 1/5. After averaging, the series {y1} is smoother than the original series {x1}, and {yt} can be used to predict the next observation of {xt} series. Depending on applications, equal weights used in the above linear filter may not be the best choice. A sensible modification of Eq. (1) could be assigning larger weights to recent observations and smaller weights to remote observations. In fact, the exponential smoothing algorithm reduces the weight value exponentially to discount the effect of remote observations on prediction. The same algorithm is used in EWMA control charts. Statisticians are interested in the estimation of weights in a linear filter. For some special classes of time series models, optimal weights can be found based on certain optimization criteria. For example, we may find the optimal weight of exponential smoothing by minimizing the mean square prediction error. For the previous Boyles s dataset, the optimal weight of exponential smoothing is found to be As shown in Figure 3, the mean square error is reduced to 216 by exponential smoothing using the optimal weight, comparing to 240 by moving average. However, optimizing the weights may not be practically important to many applications. Exponential smoothing performs well for weights that are rougly near the optimal, which is why it is popular. Figure 3: (a) moving average and (b) exponential smoothing of Boyles dataset. 6

4 Continued from page 6 Eq. (1) can be viewed as a linear regression of y_t on five consecutive observations of x_t. To find the optimal regression coefficients, we can use the least square method. This implies that other regression techniques can be applied for the data smoothing purpose. Locally weighted scatterplot smoothing, or loess, is one of such methods. In loess, each smoothed point in the series is estimated by a low-degree polynomial function of its neighbor points. The polynomial is fitted using weighted least squares, giving more weight to points in the close neighborhood of the point being estimated. Note that we may specify the degree of polynomial model and the weights, but the data fitting is on a local neighborhood level. Typically, we use simple models, such as linear regression models, to fit localized subsets of the data and build smooth curve point by point; thus, there is not a global function to describe the deterministic part of the variation in data, but rather a collection of locally predicted points. Figure 4 (a) and (b) give the loess curves of Boyles dataset with 0.2 and 0.4 degree of smoothing, respectively, which means that for each smoothed point only the nearest 20% or 40% of total points are utilized for localized regression. As this percentage increases, we can see that the curve becomes flatter, so less variability in the observed data is accounted for by this smooth curve. Figure 4: (a) loess smoothing with the smoothing parameter 0.2 and (b) 0.4 Loess is data oriented and computationally intensive. Computer programs are necessary to fit all of the many local regression equations used to smooth one series. However, one of the major advantages of this method is its flexibility in fitting complex, nonlinear patterns exhibited by the data. In fact, analysts have a great deal of control of smoothness of the fitted curve by adjusting the smoothing parameter. 4. Prediction and Control As explained previously, there are many reasons for analyzing sequential data; however, two major general purposes are prediction and control, and often they go hand-in-hand. In production and inventory management, for example, we want to predict the production or inventory level of an item. Then we may apply some management tools and strategies to adjust this level to be closer to a target or to reduce its variation. After decomposing the sequential observations to a predictable function and a random noise, it is straightforward to apply the prediction function. Therefore, to design a good predictor we want it able to account for most of the variability in observations due to systematic patterns while ignoring random noise. An optimal filter is, thus, an optimal predictor. Based on the predicted process value, a process feedback control scheme applies control actions on some adjustable variables, which in turn alters the process value to its desired target. In control theory, design of a controller involves controllable variables and their interactions with system outputs, which is out of the scope of this article. Figure 5 depicts a generic process of feedback control and highlights the role of sequential data analysis in this process. 7

5 Continued from page 7 Figure 5: A generic process feedback control scheme 5. Summary In this mini-paper, we discuss the essence of a sequential data analysis task, which is to decompose the process variation to a systematic pattern and a random noise, and they are in turn to be utilized in system prediction or control. The literature on this topic is vast. We only wish to illustrate some basic concepts, such as autocorrelation and nonstationarity, through examples and to demonstrate the ideas behind some simple filtering/smoothing techniques. References: Boyles, R. A. (2000). Phase I analysis for autocorrelated processes, Journal of Quality Technology, 32(4), pp Del Castillo, E. (2002). Statistical Process Adjustment for Quality Control, Wiley Series in Probability and Statistics, Wiley. Box, G. E. P. and Luceño, A. (1997). Statistical Control by Monitoring and Feedback Adjustment, Wiley Series in Probability and Statistics, Wiley. Pan, R. (2009). Statistical Process Adjustment in Short-Run Manufacturing Process, VDM Verlag Dr. Müller. 8

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.

LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. http://en.wikipedia.org/wiki/local_regression Local regression

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

Monitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression

Monitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression Proceedings of the 10th International Conference on Frontiers of Design and Manufacturing June 10~1, 01 Monitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data Using Excel for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters. Graphs are

More information

Spatial Interpolation & Geostatistics

Spatial Interpolation & Geostatistics (Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 1 Tobler s Law All places are related, but nearby places are related more than distant places Corollary:

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

Spatial Interpolation - Geostatistics 4/3/2018

Spatial Interpolation - Geostatistics 4/3/2018 Spatial Interpolation - Geostatistics 4/3/201 (Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Distance between pairs of points Lag Mean Tobler s Law All places are related, but nearby places

More information

1. Assumptions. 1. Introduction. 2. Terminology

1. Assumptions. 1. Introduction. 2. Terminology 4. Process Modeling 4. Process Modeling The goal for this chapter is to present the background and specific analysis techniques needed to construct a statistical model that describes a particular scientific

More information

Here is Kellogg s custom menu for their core statistics class, which can be loaded by typing the do statement shown in the command window at the very

Here is Kellogg s custom menu for their core statistics class, which can be loaded by typing the do statement shown in the command window at the very Here is Kellogg s custom menu for their core statistics class, which can be loaded by typing the do statement shown in the command window at the very bottom of the screen: 4 The univariate statistics command

More information

Intro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019

Intro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019 Intro to ARMA models FISH 507 Applied Time Series Analysis Mark Scheuerell 15 Jan 2019 Topics for today Review White noise Random walks Autoregressive (AR) models Moving average (MA) models Autoregressive

More information

Generalized Additive Model

Generalized Additive Model Generalized Additive Model by Huimin Liu Department of Mathematics and Statistics University of Minnesota Duluth, Duluth, MN 55812 December 2008 Table of Contents Abstract... 2 Chapter 1 Introduction 1.1

More information

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 23 CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 3.1 DESIGN OF EXPERIMENTS Design of experiments is a systematic approach for investigation of a system or process. A series

More information

Spatial Interpolation & Geostatistics

Spatial Interpolation & Geostatistics (Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 11/3/2016 GEO327G/386G, UT Austin 1 Tobler s Law All places are related, but nearby places are related

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC

The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC INTRODUCTION The Time Series Forecasting System (TSFS) is a component of SAS/ETS that provides a menu-based

More information

Fall 2016 CS130 - Regression Analysis 1 7. REGRESSION. Fall 2016

Fall 2016 CS130 - Regression Analysis 1 7. REGRESSION. Fall 2016 Fall 2016 CS130 - Regression Analysis 1 7. REGRESSION Fall 2016 Fall 2016 CS130 - Regression Analysis 2 Regression Analysis Regression analysis: usually falls under statistics and mathematical modeling

More information

Linear Methods for Regression and Shrinkage Methods

Linear Methods for Regression and Shrinkage Methods Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors

More information

Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex

Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex Keiko I. Powers, Ph.D., J. D. Power and Associates, Westlake Village, CA ABSTRACT Discrete time series

More information

Digital Image Processing. Prof. P. K. Biswas. Department of Electronic & Electrical Communication Engineering

Digital Image Processing. Prof. P. K. Biswas. Department of Electronic & Electrical Communication Engineering Digital Image Processing Prof. P. K. Biswas Department of Electronic & Electrical Communication Engineering Indian Institute of Technology, Kharagpur Lecture - 21 Image Enhancement Frequency Domain Processing

More information

Nonparametric Approaches to Regression

Nonparametric Approaches to Regression Nonparametric Approaches to Regression In traditional nonparametric regression, we assume very little about the functional form of the mean response function. In particular, we assume the model where m(xi)

More information

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu FMA901F: Machine Learning Lecture 3: Linear Models for Regression Cristian Sminchisescu Machine Learning: Frequentist vs. Bayesian In the frequentist setting, we seek a fixed parameter (vector), with value(s)

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

Applications of the k-nearest neighbor method for regression and resampling

Applications of the k-nearest neighbor method for regression and resampling Applications of the k-nearest neighbor method for regression and resampling Objectives Provide a structured approach to exploring a regression data set. Introduce and demonstrate the k-nearest neighbor

More information

How to use FSBforecast Excel add in for regression analysis

How to use FSBforecast Excel add in for regression analysis How to use FSBforecast Excel add in for regression analysis FSBforecast is an Excel add in for data analysis and regression that was developed here at the Fuqua School of Business over the last 3 years

More information

Living with Collinearity in Local Regression Models

Living with Collinearity in Local Regression Models Living with Collinearity in Local Regression Models Chris Brunsdon 1, Martin Charlton 2, Paul Harris 2 1 People Space and Place, Roxby Building, University of Liverpool,L69 7ZT, UK Tel. +44 151 794 2837

More information

Short Note. Non-stationary PEFs and large gaps. William Curry 1 INTRODUCTION

Short Note. Non-stationary PEFs and large gaps. William Curry 1 INTRODUCTION Stanford Exploration Project, Report 120, May 3, 2005, pages 247 257 Short Note Non-stationary PEFs and large gaps William Curry 1 INTRODUCTION Prediction-error filters (PEFs) may be used to interpolate

More information

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26

BIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 INTRODUCTION Graphs are one of the most important aspects of data analysis and presentation of your of data. They are visual representations

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data EXERCISE Using Excel for Graphical Analysis of Data Introduction In several upcoming experiments, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Nonparametric Regression

Nonparametric Regression Nonparametric Regression John Fox Department of Sociology McMaster University 1280 Main Street West Hamilton, Ontario Canada L8S 4M4 jfox@mcmaster.ca February 2004 Abstract Nonparametric regression analysis

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

Your Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression

Your Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression Objectives: 1. To learn how to interpret scatterplots. Specifically you will investigate, using

More information

Economics Nonparametric Econometrics

Economics Nonparametric Econometrics Economics 217 - Nonparametric Econometrics Topics covered in this lecture Introduction to the nonparametric model The role of bandwidth Choice of smoothing function R commands for nonparametric models

More information

Chapter 4: Analyzing Bivariate Data with Fathom

Chapter 4: Analyzing Bivariate Data with Fathom Chapter 4: Analyzing Bivariate Data with Fathom Summary: Building from ideas introduced in Chapter 3, teachers continue to analyze automobile data using Fathom to look for relationships between two quantitative

More information

Tips and Guidance for Analyzing Data. Executive Summary

Tips and Guidance for Analyzing Data. Executive Summary Tips and Guidance for Analyzing Data Executive Summary This document has information and suggestions about three things: 1) how to quickly do a preliminary analysis of time-series data; 2) key things to

More information

Long Term Analysis for the BAM device Donata Bonino and Daniele Gardiol INAF Osservatorio Astronomico di Torino

Long Term Analysis for the BAM device Donata Bonino and Daniele Gardiol INAF Osservatorio Astronomico di Torino Long Term Analysis for the BAM device Donata Bonino and Daniele Gardiol INAF Osservatorio Astronomico di Torino 1 Overview What is BAM Analysis in the time domain Analysis in the frequency domain Example

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

Lesson 5 overview. Concepts. Interpolators. Assessing accuracy Exercise 5

Lesson 5 overview. Concepts. Interpolators. Assessing accuracy Exercise 5 Interpolation Tools Lesson 5 overview Concepts Sampling methods Creating continuous surfaces Interpolation Density surfaces in GIS Interpolators IDW, Spline,Trend, Kriging,Natural neighbors TopoToRaster

More information

Modelling and simulation of seismic reflectivity

Modelling and simulation of seismic reflectivity Modelling reflectivity Modelling and simulation of seismic reflectivity Rita Aggarwala, Michael P. Lamoureux, and Gary F. Margrave ABSTRACT We decompose the reflectivity series obtained from a seismic

More information

3 Nonlinear Regression

3 Nonlinear Regression 3 Linear models are often insufficient to capture the real-world phenomena. That is, the relation between the inputs and the outputs we want to be able to predict are not linear. As a consequence, nonlinear

More information

Estimating Noise and Dimensionality in BCI Data Sets: Towards Illiteracy Comprehension

Estimating Noise and Dimensionality in BCI Data Sets: Towards Illiteracy Comprehension Estimating Noise and Dimensionality in BCI Data Sets: Towards Illiteracy Comprehension Claudia Sannelli, Mikio Braun, Michael Tangermann, Klaus-Robert Müller, Machine Learning Laboratory, Dept. Computer

More information

ChristoHouston Energy Inc. (CHE INC.) Pipeline Anomaly Analysis By Liquid Green Technologies Corporation

ChristoHouston Energy Inc. (CHE INC.) Pipeline Anomaly Analysis By Liquid Green Technologies Corporation ChristoHouston Energy Inc. () Pipeline Anomaly Analysis By Liquid Green Technologies Corporation CHE INC. Overview: Review of Scope of Work Wall thickness analysis - Pipeline and sectional statistics Feature

More information

Machine Learning. Topic 4: Linear Regression Models

Machine Learning. Topic 4: Linear Regression Models Machine Learning Topic 4: Linear Regression Models (contains ideas and a few images from wikipedia and books by Alpaydin, Duda/Hart/ Stork, and Bishop. Updated Fall 205) Regression Learning Task There

More information

Adaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited

Adaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited Adaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited Summary We present a new method for performing full-waveform inversion that appears

More information

Chapter 13 Multivariate Techniques. Chapter Table of Contents

Chapter 13 Multivariate Techniques. Chapter Table of Contents Chapter 13 Multivariate Techniques Chapter Table of Contents Introduction...279 Principal Components Analysis...280 Canonical Correlation...289 References...298 278 Chapter 13. Multivariate Techniques

More information

An Intuitive Explanation of Fourier Theory

An Intuitive Explanation of Fourier Theory An Intuitive Explanation of Fourier Theory Steven Lehar slehar@cns.bu.edu Fourier theory is pretty complicated mathematically. But there are some beautifully simple holistic concepts behind Fourier theory

More information

A time-varying weights tuning method of the double EWMA controller

A time-varying weights tuning method of the double EWMA controller Available online at www.sciencedirect.com Omega 3 (4) 473 48 www.elsevier.com/locate/dsw A time-varying weights tuning method of the double EWMA controller Chao-Ton Su, Chun-Chin Hsu Department of Industrial

More information

STAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression

STAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression STAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression analysis. Analysis of Variance: one way classification,

More information

Geology Geomath Estimating the coefficients of various Mathematical relationships in Geology

Geology Geomath Estimating the coefficients of various Mathematical relationships in Geology Geology 351 - Geomath Estimating the coefficients of various Mathematical relationships in Geology Throughout the semester you ve encountered a variety of mathematical relationships between various geologic

More information

Time Series Analysis by State Space Methods

Time Series Analysis by State Space Methods Time Series Analysis by State Space Methods Second Edition J. Durbin London School of Economics and Political Science and University College London S. J. Koopman Vrije Universiteit Amsterdam OXFORD UNIVERSITY

More information

Release Notes for April StatCrunch Updates

Release Notes for April StatCrunch Updates Release Notes for April 2018 - StatCrunch Updates Major additions Introducing accessibility features that support full keyboard functionality including new keyboard shortcuts. [Go to page 2] Measures to

More information

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value.

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value. Calibration OVERVIEW... 2 INTRODUCTION... 2 CALIBRATION... 3 ANOTHER REASON FOR CALIBRATION... 4 CHECKING THE CALIBRATION OF A REGRESSION... 5 CALIBRATION IN SIMPLE REGRESSION (DISPLAY.JMP)... 5 TESTING

More information

AM205: lecture 2. 1 These have been shifted to MD 323 for the rest of the semester.

AM205: lecture 2. 1 These have been shifted to MD 323 for the rest of the semester. AM205: lecture 2 Luna and Gary will hold a Python tutorial on Wednesday in 60 Oxford Street, Room 330 Assignment 1 will be posted this week Chris will hold office hours on Thursday (1:30pm 3:30pm, Pierce

More information

Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS

Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS HOUSEKEEPING Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS Quizzes Lab 8? WEEK EIGHT Lecture INTERPOLATION & SPATIAL ESTIMATION Joe Wheaton READING FOR TODAY WHAT CAN WE COLLECT AT POINTS?

More information

Course on Microarray Gene Expression Analysis

Course on Microarray Gene Expression Analysis Course on Microarray Gene Expression Analysis ::: Normalization methods and data preprocessing Madrid, April 27th, 2011. Gonzalo Gómez ggomez@cnio.es Bioinformatics Unit CNIO ::: Introduction. The probe-level

More information

2.3. Quality Assurance: The activities that have to do with making sure that the quality of a product is what it should be.

2.3. Quality Assurance: The activities that have to do with making sure that the quality of a product is what it should be. 5.2. QUALITY CONTROL /QUALITY ASSURANCE 5.2.1. STATISTICS 1. ACKNOWLEDGEMENT This paper has been copied directly from the HMA Manual with a few modifications from the original version. The original version

More information

A Comparative Study of LOWESS and RBF Approximations for Visualization

A Comparative Study of LOWESS and RBF Approximations for Visualization A Comparative Study of LOWESS and RBF Approximations for Visualization Michal Smolik, Vaclav Skala and Ondrej Nedved Faculty of Applied Sciences, University of West Bohemia, Univerzitni 8, CZ 364 Plzen,

More information

Analytical Techniques for Anomaly Detection Through Features, Signal-Noise Separation and Partial-Value Association

Analytical Techniques for Anomaly Detection Through Features, Signal-Noise Separation and Partial-Value Association Proceedings of Machine Learning Research 77:20 32, 2017 KDD 2017: Workshop on Anomaly Detection in Finance Analytical Techniques for Anomaly Detection Through Features, Signal-Noise Separation and Partial-Value

More information

Recommender Systems New Approaches with Netflix Dataset

Recommender Systems New Approaches with Netflix Dataset Recommender Systems New Approaches with Netflix Dataset Robert Bell Yehuda Koren AT&T Labs ICDM 2007 Presented by Matt Rodriguez Outline Overview of Recommender System Approaches which are Content based

More information

What can we represent as a Surface?

What can we represent as a Surface? Geography 38/42:376 GIS II Topic 7: Surface Representation and Analysis (Chang: Chapters 13 & 15) DeMers: Chapter 10 What can we represent as a Surface? Surfaces can be used to represent: Continuously

More information

Time Series Analysis DM 2 / A.A

Time Series Analysis DM 2 / A.A DM 2 / A.A. 2010-2011 Time Series Analysis Several slides are borrowed from: Han and Kamber, Data Mining: Concepts and Techniques Mining time-series data Lei Chen, Similarity Search Over Time-Series Data

More information

Contrast Optimization: A faster and better technique for optimizing on MTF ABSTRACT Keywords: INTRODUCTION THEORY

Contrast Optimization: A faster and better technique for optimizing on MTF ABSTRACT Keywords: INTRODUCTION THEORY Contrast Optimization: A faster and better technique for optimizing on MTF Ken Moore, Erin Elliott, Mark Nicholson, Chris Normanshire, Shawn Gay, Jade Aiona Zemax, LLC ABSTRACT Our new Contrast Optimization

More information

Chronology development, statistical analysis

Chronology development, statistical analysis A R S T A N Guide for computer program ARSTAN, by Edward R. Cook and Richard L. Holmes Adapted from Users Manual for Program ARSTAN, in Tree-Ring Chronologies of Western North America: California, eastern

More information

Machine Learning / Jan 27, 2010

Machine Learning / Jan 27, 2010 Revisiting Logistic Regression & Naïve Bayes Aarti Singh Machine Learning 10-701/15-781 Jan 27, 2010 Generative and Discriminative Classifiers Training classifiers involves learning a mapping f: X -> Y,

More information

Computer Experiments: Space Filling Design and Gaussian Process Modeling

Computer Experiments: Space Filling Design and Gaussian Process Modeling Computer Experiments: Space Filling Design and Gaussian Process Modeling Best Practice Authored by: Cory Natoli Sarah Burke, Ph.D. 30 March 2018 The goal of the STAT COE is to assist in developing rigorous,

More information

How to use FSBForecast Excel add-in for regression analysis (July 2012 version)

How to use FSBForecast Excel add-in for regression analysis (July 2012 version) How to use FSBForecast Excel add-in for regression analysis (July 2012 version) FSBForecast is an Excel add-in for data analysis and regression that was developed at the Fuqua School of Business over the

More information

Neuro-fuzzy admission control in mobile communications systems

Neuro-fuzzy admission control in mobile communications systems University of Wollongong Thesis Collections University of Wollongong Thesis Collection University of Wollongong Year 2005 Neuro-fuzzy admission control in mobile communications systems Raad Raad University

More information

Information Criteria Methods in SAS for Multiple Linear Regression Models

Information Criteria Methods in SAS for Multiple Linear Regression Models Paper SA5 Information Criteria Methods in SAS for Multiple Linear Regression Models Dennis J. Beal, Science Applications International Corporation, Oak Ridge, TN ABSTRACT SAS 9.1 calculates Akaike s Information

More information

An Optimal Regression Algorithm for Piecewise Functions Expressed as Object-Oriented Programs

An Optimal Regression Algorithm for Piecewise Functions Expressed as Object-Oriented Programs 2010 Ninth International Conference on Machine Learning and Applications An Optimal Regression Algorithm for Piecewise Functions Expressed as Object-Oriented Programs Juan Luo Department of Computer Science

More information

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data

Introduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data Introduction About this Document This manual was written by members of the Statistical Consulting Program as an introduction to SPSS 12.0. It is designed to assist new users in familiarizing themselves

More information

Project 11 Graphs (Using MS Excel Version )

Project 11 Graphs (Using MS Excel Version ) Project 11 Graphs (Using MS Excel Version 2007-10) Purpose: To review the types of graphs, and use MS Excel 2010 to create them from a dataset. Outline: You will be provided with several datasets and will

More information

CS130 Regression. Winter Winter 2014 CS130 - Regression Analysis 1

CS130 Regression. Winter Winter 2014 CS130 - Regression Analysis 1 CS130 Regression Winter 2014 Winter 2014 CS130 - Regression Analysis 1 Regression Analysis Regression analysis: usually falls under statistics and mathematical modeling is a form of statistical analysis

More information

Parameter Estimation in Differential Equations: A Numerical Study of Shooting Methods

Parameter Estimation in Differential Equations: A Numerical Study of Shooting Methods Parameter Estimation in Differential Equations: A Numerical Study of Shooting Methods Franz Hamilton Faculty Advisor: Dr Timothy Sauer January 5, 2011 Abstract Differential equation modeling is central

More information

Adaptive Filtering using Steepest Descent and LMS Algorithm

Adaptive Filtering using Steepest Descent and LMS Algorithm IJSTE - International Journal of Science Technology & Engineering Volume 2 Issue 4 October 2015 ISSN (online): 2349-784X Adaptive Filtering using Steepest Descent and LMS Algorithm Akash Sawant Mukesh

More information

Image Compression for Mobile Devices using Prediction and Direct Coding Approach

Image Compression for Mobile Devices using Prediction and Direct Coding Approach Image Compression for Mobile Devices using Prediction and Direct Coding Approach Joshua Rajah Devadason M.E. scholar, CIT Coimbatore, India Mr. T. Ramraj Assistant Professor, CIT Coimbatore, India Abstract

More information

Reference

Reference Leaning diary: research methodology 30.11.2017 Name: Juriaan Zandvliet Student number: 291380 (1) a short description of each topic of the course, (2) desciption of possible examples or exercises done

More information

STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I. 4 th Nine Weeks,

STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I. 4 th Nine Weeks, STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I 4 th Nine Weeks, 2016-2017 1 OVERVIEW Algebra I Content Review Notes are designed by the High School Mathematics Steering Committee as a resource for

More information

Topics in Machine Learning

Topics in Machine Learning Topics in Machine Learning Gilad Lerman School of Mathematics University of Minnesota Text/slides stolen from G. James, D. Witten, T. Hastie, R. Tibshirani and A. Ng Machine Learning - Motivation Arthur

More information

3 Nonlinear Regression

3 Nonlinear Regression CSC 4 / CSC D / CSC C 3 Sometimes linear models are not sufficient to capture the real-world phenomena, and thus nonlinear models are necessary. In regression, all such models will have the same basic

More information

UvA-DARE (Digital Academic Repository) Memory-type control charts in statistical process control Abbas, N. Link to publication

UvA-DARE (Digital Academic Repository) Memory-type control charts in statistical process control Abbas, N. Link to publication UvA-DARE (Digital Academic Repository) Memory-type control charts in statistical process control Abbas, N. Link to publication Citation for published version (APA): Abbas, N. (2012). Memory-type control

More information

LC-1: Interference and Diffraction

LC-1: Interference and Diffraction Your TA will use this sheet to score your lab. It is to be turned in at the end of lab. You must use complete sentences and clearly explain your reasoning to receive full credit. The lab setup has been

More information

SYS 6021 Linear Statistical Models

SYS 6021 Linear Statistical Models SYS 6021 Linear Statistical Models Project 2 Spam Filters Jinghe Zhang Summary The spambase data and time indexed counts of spams and hams are studied to develop accurate spam filters. Static models are

More information

AUTOMATED 4 AXIS ADAYfIVE SCANNING WITH THE DIGIBOTICS LASER DIGITIZER

AUTOMATED 4 AXIS ADAYfIVE SCANNING WITH THE DIGIBOTICS LASER DIGITIZER AUTOMATED 4 AXIS ADAYfIVE SCANNING WITH THE DIGIBOTICS LASER DIGITIZER INTRODUCTION The DIGIBOT 3D Laser Digitizer is a high performance 3D input device which combines laser ranging technology, personal

More information

Building Better Parametric Cost Models

Building Better Parametric Cost Models Building Better Parametric Cost Models Based on the PMI PMBOK Guide Fourth Edition 37 IPDI has been reviewed and approved as a provider of project management training by the Project Management Institute

More information

AN APPROXIMATE INVENTORY MODEL BASED ON DIMENSIONAL ANALYSIS. Victoria University, Wellington, New Zealand

AN APPROXIMATE INVENTORY MODEL BASED ON DIMENSIONAL ANALYSIS. Victoria University, Wellington, New Zealand AN APPROXIMATE INVENTORY MODEL BASED ON DIMENSIONAL ANALYSIS by G. A. VIGNAUX and Sudha JAIN Victoria University, Wellington, New Zealand Published in Asia-Pacific Journal of Operational Research, Vol

More information

2014 Stat-Ease, Inc. All Rights Reserved.

2014 Stat-Ease, Inc. All Rights Reserved. What s New in Design-Expert version 9 Factorial split plots (Two-Level, Multilevel, Optimal) Definitive Screening and Single Factor designs Journal Feature Design layout Graph Columns Design Evaluation

More information

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability 7 Fractions GRADE 7 FRACTIONS continue to develop proficiency by using fractions in mental strategies and in selecting and justifying use; develop proficiency in adding and subtracting simple fractions;

More information

Conditional Volatility Estimation by. Conditional Quantile Autoregression

Conditional Volatility Estimation by. Conditional Quantile Autoregression International Journal of Mathematical Analysis Vol. 8, 2014, no. 41, 2033-2046 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijma.2014.47210 Conditional Volatility Estimation by Conditional Quantile

More information

Exploratory Data Analysis EDA

Exploratory Data Analysis EDA Exploratory Data Analysis EDA Luc Anselin http://spatial.uchicago.edu 1 from EDA to ESDA dynamic graphics primer on multivariate EDA interpretation and limitations 2 From EDA to ESDA 3 Exploratory Data

More information

Basis Functions. Volker Tresp Summer 2017

Basis Functions. Volker Tresp Summer 2017 Basis Functions Volker Tresp Summer 2017 1 Nonlinear Mappings and Nonlinear Classifiers Regression: Linearity is often a good assumption when many inputs influence the output Some natural laws are (approximately)

More information

The problem we have now is called variable selection or perhaps model selection. There are several objectives.

The problem we have now is called variable selection or perhaps model selection. There are several objectives. STAT-UB.0103 NOTES for Wednesday 01.APR.04 One of the clues on the library data comes through the VIF values. These VIFs tell you to what extent a predictor is linearly dependent on other predictors. We

More information

Investigation of Shape Parameter for Exponential Weight Function in Moving Least Squares Method

Investigation of Shape Parameter for Exponential Weight Function in Moving Least Squares Method 2 (2017) 17-24 Progress in Energy and Environment Journal homepage: http://www.akademiabaru.com/progee.html ISSN: 2600-7662 Investigation of Shape Parameter for Exponential Weight Function in Moving Least

More information

Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling

Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Abstract Forecasting future events and statistics is problematic because the data set is a stochastic, rather

More information

A Monotonic Sequence and Subsequence Approach in Missing Data Statistical Analysis

A Monotonic Sequence and Subsequence Approach in Missing Data Statistical Analysis Global Journal of Pure and Applied Mathematics. ISSN 0973-1768 Volume 12, Number 1 (2016), pp. 1131-1140 Research India Publications http://www.ripublication.com A Monotonic Sequence and Subsequence Approach

More information

Robotics. Lecture 5: Monte Carlo Localisation. See course website for up to date information.

Robotics. Lecture 5: Monte Carlo Localisation. See course website  for up to date information. Robotics Lecture 5: Monte Carlo Localisation See course website http://www.doc.ic.ac.uk/~ajd/robotics/ for up to date information. Andrew Davison Department of Computing Imperial College London Review:

More information

High Dynamic Range Imaging.

High Dynamic Range Imaging. High Dynamic Range Imaging High Dynamic Range [3] In photography, dynamic range (DR) is measured in exposure value (EV) differences or stops, between the brightest and darkest parts of the image that show

More information

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT 2.1 BRIEF OUTLINE The classification of digital imagery is to extract useful thematic information which is one

More information

Performance Estimation and Regularization. Kasthuri Kannan, PhD. Machine Learning, Spring 2018

Performance Estimation and Regularization. Kasthuri Kannan, PhD. Machine Learning, Spring 2018 Performance Estimation and Regularization Kasthuri Kannan, PhD. Machine Learning, Spring 2018 Bias- Variance Tradeoff Fundamental to machine learning approaches Bias- Variance Tradeoff Error due to Bias:

More information

Non-Linear Regression. Business Analytics Practice Winter Term 2015/16 Stefan Feuerriegel

Non-Linear Regression. Business Analytics Practice Winter Term 2015/16 Stefan Feuerriegel Non-Linear Regression Business Analytics Practice Winter Term 2015/16 Stefan Feuerriegel Today s Lecture Objectives 1 Understanding the need for non-parametric regressions 2 Familiarizing with two common

More information

Emerge Workflow CE8 SAMPLE IMAGE. Simon Voisey July 2008

Emerge Workflow CE8 SAMPLE IMAGE. Simon Voisey July 2008 Emerge Workflow SAMPLE IMAGE CE8 Simon Voisey July 2008 Introduction The document provides a step-by-step guide for producing Emerge predicted petrophysical volumes based on log data of the same type.

More information

Recent advances in Metamodel of Optimal Prognosis. Lectures. Thomas Most & Johannes Will

Recent advances in Metamodel of Optimal Prognosis. Lectures. Thomas Most & Johannes Will Lectures Recent advances in Metamodel of Optimal Prognosis Thomas Most & Johannes Will presented at the Weimar Optimization and Stochastic Days 2010 Source: www.dynardo.de/en/library Recent advances in

More information