MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data
|
|
- Amy Caldwell
- 5 years ago
- Views:
Transcription
1 MINI-PAPER by Rong Pan, Ph.D., Assistant Professor of Industrial Engineering, Arizona State University We, applied statisticians and manufacturing engineers, often need to deal with sequential data, which are collected on a manufacturing process by sensors. The data volume could be very large and it is easy to lose the sight of the purpose of data analysis. In this mini-paper we will discuss some basic purposes for doing such data analysis and the simple tools that can assist in data analysis. We will focus on the analysis of a univariate series. Our purposes include understanding the series characteristics as exhibited in observations, predicting a future observation, and controlling mean value and variance. 1. Independence versus Correlation Sequential data is often referred as time series, particularly for the observations collected over time. However, we recognize that this type of data is not necessary to be indexed by time only. For example, considering the x-ray inspection of highway pavement, these data are spatially indexed. One of the critical data analysis steps is to check if observations are independent to each other. Many statistical data analysis techniques assume that observations are independent, which is unlikely in sequential data where close observations tend to be correlated even when the system under study is stable and undisturbed. This type of correlation is defined as autocorrelation (prefix auto- means self ), it is the correlation of samples in the series to earlier samples in the same series. Autocorrelation is a function of time lag, i.e., the time units separating a later series from an earlier series. Figure 1(a) plots the data from Boyles (2000), which gave a time series (190 observations) for the fill weights observed for a powder food product. Over time, it looks as though observations are wandering around a mean value, but it is still not easy to detect autocorrelation by visual inspection only. The autocorrelation can be revealed by a plot of the sample autocorrelation function, which is shown in Figure 1(b). Clearly, the autocorrelation of lag 1 is significant and positive, which indicates that every next observation tends to be more or less like its previous observation. Unlike independent observations where variation is due to a pure random noise only, variation in an autocorrelated series can be decomposed to random noise and a systematic variation pattern. In this example, the primary pattern is lag one autocorrelation. Knowing this systematic pattern is essential to the time series prediction and control. Statistically significant autocorrelation is strongest at lag=1 Figure 1: (a) time series plot and (b) sample autocorrelation of Boyles data set. 4
2 Continued from page 4 2. Stationary versus Nonstationary Another important characteristic of time series is stationarity. Loosely speaking, if observations are varying around a constant mean and the variance does not explode over time, then the series is stationary. In industry, a nonstationary process is highly undesirable. It indicates that the process needs some corrective actions to bring it back under control. Detecting the nonstationary pattern can give insights into the mechanism that generates the series. For example, del Castillo (2002, p.20) gave 40 measures of a dimension machined on aluminum parts processed on a Fadal computer numerically controlled (CNC) machine tool. The time series plot, as shown in Figure 2(a), clearly demonstrates a pattern of increasing mean over time, possibly because of the tool wear. We are interested in decomposing the total variation of this series to the sum of increasing mean value plus the pure random noise. After removing (filtering) the increasing trend of the mean by using a drifted random walk model (technically an integrated moving average model), Figure 2(b) shows the series under control. After subtracting the predictable drifts, it becomes a random noise without apparent patterns. The process can be made stationary through corrective actions calculated by the filter model. Figure 2: (a) time series plot and (b) residual plot of Fadal dataset. Modern manufacturing processes are usually under some types of automatic process control (APC) to prevent nonstationarity/instability. As systematic reactions to past observations, control actions will introduce autocorrelation into the sequential process observations. These automated reactions to trends make APC different from statistical process control (SPC). SPC is more concerned with detecting a sudden shift in process mean and/or process variance due to assignable causes, but less concerned with mild autocorrelation as long as the process is stationary. Assignable causes may come from outside the process, and require investigation to determine an effective reaction. APC reacts to common causes that are part of the process (e.g. tool wear), and effective reactions are so well known they can be automated. However, APC can interfere with SPC. Systematic reactions deliberately push the next observation away from the previous observation, creating negative autocorrelation that may inflate the estimation of process variance. This leads to inflated control chart limits, and makes the statistical process control chart less sensitive to a shift. An active quality control research area in the past two decades is the integration of APC and SPC. For detailed discussions on this topic, see Box and Luceño (1997), del Castillo (2002) and Pan (2009). 5
3 Continued from page 5 3. Filtering and Smoothing Given a sequential dataset we would like to fit a model which consists of a predictable part, such as a deterministic mathematical function, and a random noise. Smoothing and filtering are two approaches to the same problem, i.e., separating the predictable part (signal) from the noise, but from different perspectives. Filtering comes from signal processing and target tracking, where the task is to filter out the noise and to predict the next move of the target. Therefore, the ability to make on-line predictions in real-time is very important. Smoothing comes from statistics, where the major task includes model fitting and diagnosis. It is typically implemented off-line, so all available data can be utilized to estimate the predictable part of the time series. After removing the noise, the series will become smoother (less variation) than the original one. Oftentimes these methods are known as forward filtering and backward smoothing. For example, Eq. (1) is a linear filter using a moving average, taking the weighted average of past five observations: yt = w1xt + w2xt-1 + w3xt-2 + w4xt-3 + w5xt-4, Eq. (1) 5 where wi s are weights and i - 1 wi = 1. The current time index value is t, one lag in the past is t-1, and so on. For this example, a simple moving average filter will let wi = 1/5. After averaging, the series {y1} is smoother than the original series {x1}, and {yt} can be used to predict the next observation of {xt} series. Depending on applications, equal weights used in the above linear filter may not be the best choice. A sensible modification of Eq. (1) could be assigning larger weights to recent observations and smaller weights to remote observations. In fact, the exponential smoothing algorithm reduces the weight value exponentially to discount the effect of remote observations on prediction. The same algorithm is used in EWMA control charts. Statisticians are interested in the estimation of weights in a linear filter. For some special classes of time series models, optimal weights can be found based on certain optimization criteria. For example, we may find the optimal weight of exponential smoothing by minimizing the mean square prediction error. For the previous Boyles s dataset, the optimal weight of exponential smoothing is found to be As shown in Figure 3, the mean square error is reduced to 216 by exponential smoothing using the optimal weight, comparing to 240 by moving average. However, optimizing the weights may not be practically important to many applications. Exponential smoothing performs well for weights that are rougly near the optimal, which is why it is popular. Figure 3: (a) moving average and (b) exponential smoothing of Boyles dataset. 6
4 Continued from page 6 Eq. (1) can be viewed as a linear regression of y_t on five consecutive observations of x_t. To find the optimal regression coefficients, we can use the least square method. This implies that other regression techniques can be applied for the data smoothing purpose. Locally weighted scatterplot smoothing, or loess, is one of such methods. In loess, each smoothed point in the series is estimated by a low-degree polynomial function of its neighbor points. The polynomial is fitted using weighted least squares, giving more weight to points in the close neighborhood of the point being estimated. Note that we may specify the degree of polynomial model and the weights, but the data fitting is on a local neighborhood level. Typically, we use simple models, such as linear regression models, to fit localized subsets of the data and build smooth curve point by point; thus, there is not a global function to describe the deterministic part of the variation in data, but rather a collection of locally predicted points. Figure 4 (a) and (b) give the loess curves of Boyles dataset with 0.2 and 0.4 degree of smoothing, respectively, which means that for each smoothed point only the nearest 20% or 40% of total points are utilized for localized regression. As this percentage increases, we can see that the curve becomes flatter, so less variability in the observed data is accounted for by this smooth curve. Figure 4: (a) loess smoothing with the smoothing parameter 0.2 and (b) 0.4 Loess is data oriented and computationally intensive. Computer programs are necessary to fit all of the many local regression equations used to smooth one series. However, one of the major advantages of this method is its flexibility in fitting complex, nonlinear patterns exhibited by the data. In fact, analysts have a great deal of control of smoothness of the fitted curve by adjusting the smoothing parameter. 4. Prediction and Control As explained previously, there are many reasons for analyzing sequential data; however, two major general purposes are prediction and control, and often they go hand-in-hand. In production and inventory management, for example, we want to predict the production or inventory level of an item. Then we may apply some management tools and strategies to adjust this level to be closer to a target or to reduce its variation. After decomposing the sequential observations to a predictable function and a random noise, it is straightforward to apply the prediction function. Therefore, to design a good predictor we want it able to account for most of the variability in observations due to systematic patterns while ignoring random noise. An optimal filter is, thus, an optimal predictor. Based on the predicted process value, a process feedback control scheme applies control actions on some adjustable variables, which in turn alters the process value to its desired target. In control theory, design of a controller involves controllable variables and their interactions with system outputs, which is out of the scope of this article. Figure 5 depicts a generic process of feedback control and highlights the role of sequential data analysis in this process. 7
5 Continued from page 7 Figure 5: A generic process feedback control scheme 5. Summary In this mini-paper, we discuss the essence of a sequential data analysis task, which is to decompose the process variation to a systematic pattern and a random noise, and they are in turn to be utilized in system prediction or control. The literature on this topic is vast. We only wish to illustrate some basic concepts, such as autocorrelation and nonstationarity, through examples and to demonstrate the ideas behind some simple filtering/smoothing techniques. References: Boyles, R. A. (2000). Phase I analysis for autocorrelated processes, Journal of Quality Technology, 32(4), pp Del Castillo, E. (2002). Statistical Process Adjustment for Quality Control, Wiley Series in Probability and Statistics, Wiley. Box, G. E. P. and Luceño, A. (1997). Statistical Control by Monitoring and Feedback Adjustment, Wiley Series in Probability and Statistics, Wiley. Pan, R. (2009). Statistical Process Adjustment in Short-Run Manufacturing Process, VDM Verlag Dr. Müller. 8
LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave.
LOESS curve fitted to a population sampled from a sine wave with uniform noise added. The LOESS curve approximates the original sine wave. http://en.wikipedia.org/wiki/local_regression Local regression
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use
More informationMonitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression
Proceedings of the 10th International Conference on Frontiers of Design and Manufacturing June 10~1, 01 Monitoring of Manufacturing Process Conditions with Baseline Changes Using Generalized Regression
More informationUsing Excel for Graphical Analysis of Data
Using Excel for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters. Graphs are
More informationSpatial Interpolation & Geostatistics
(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 1 Tobler s Law All places are related, but nearby places are related more than distant places Corollary:
More informationApplied Regression Modeling: A Business Approach
i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming
More informationSpatial Interpolation - Geostatistics 4/3/2018
Spatial Interpolation - Geostatistics 4/3/201 (Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Distance between pairs of points Lag Mean Tobler s Law All places are related, but nearby places
More information1. Assumptions. 1. Introduction. 2. Terminology
4. Process Modeling 4. Process Modeling The goal for this chapter is to present the background and specific analysis techniques needed to construct a statistical model that describes a particular scientific
More informationHere is Kellogg s custom menu for their core statistics class, which can be loaded by typing the do statement shown in the command window at the very
Here is Kellogg s custom menu for their core statistics class, which can be loaded by typing the do statement shown in the command window at the very bottom of the screen: 4 The univariate statistics command
More informationIntro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019
Intro to ARMA models FISH 507 Applied Time Series Analysis Mark Scheuerell 15 Jan 2019 Topics for today Review White noise Random walks Autoregressive (AR) models Moving average (MA) models Autoregressive
More informationGeneralized Additive Model
Generalized Additive Model by Huimin Liu Department of Mathematics and Statistics University of Minnesota Duluth, Duluth, MN 55812 December 2008 Table of Contents Abstract... 2 Chapter 1 Introduction 1.1
More informationCHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY
23 CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 3.1 DESIGN OF EXPERIMENTS Design of experiments is a systematic approach for investigation of a system or process. A series
More informationSpatial Interpolation & Geostatistics
(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 11/3/2016 GEO327G/386G, UT Austin 1 Tobler s Law All places are related, but nearby places are related
More informationGraphical Analysis of Data using Microsoft Excel [2016 Version]
Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.
More informationThe Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC
The Time Series Forecasting System Charles Hallahan, Economic Research Service/USDA, Washington, DC INTRODUCTION The Time Series Forecasting System (TSFS) is a component of SAS/ETS that provides a menu-based
More informationFall 2016 CS130 - Regression Analysis 1 7. REGRESSION. Fall 2016
Fall 2016 CS130 - Regression Analysis 1 7. REGRESSION Fall 2016 Fall 2016 CS130 - Regression Analysis 2 Regression Analysis Regression analysis: usually falls under statistics and mathematical modeling
More informationLinear Methods for Regression and Shrinkage Methods
Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors
More informationEffects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex
Effects of PROC EXPAND Data Interpolation on Time Series Modeling When the Data are Volatile or Complex Keiko I. Powers, Ph.D., J. D. Power and Associates, Westlake Village, CA ABSTRACT Discrete time series
More informationDigital Image Processing. Prof. P. K. Biswas. Department of Electronic & Electrical Communication Engineering
Digital Image Processing Prof. P. K. Biswas Department of Electronic & Electrical Communication Engineering Indian Institute of Technology, Kharagpur Lecture - 21 Image Enhancement Frequency Domain Processing
More informationNonparametric Approaches to Regression
Nonparametric Approaches to Regression In traditional nonparametric regression, we assume very little about the functional form of the mean response function. In particular, we assume the model where m(xi)
More informationFMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu
FMA901F: Machine Learning Lecture 3: Linear Models for Regression Cristian Sminchisescu Machine Learning: Frequentist vs. Bayesian In the frequentist setting, we seek a fixed parameter (vector), with value(s)
More informationMultiple Regression White paper
+44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms
More informationApplications of the k-nearest neighbor method for regression and resampling
Applications of the k-nearest neighbor method for regression and resampling Objectives Provide a structured approach to exploring a regression data set. Introduce and demonstrate the k-nearest neighbor
More informationHow to use FSBforecast Excel add in for regression analysis
How to use FSBforecast Excel add in for regression analysis FSBforecast is an Excel add in for data analysis and regression that was developed here at the Fuqua School of Business over the last 3 years
More informationLiving with Collinearity in Local Regression Models
Living with Collinearity in Local Regression Models Chris Brunsdon 1, Martin Charlton 2, Paul Harris 2 1 People Space and Place, Roxby Building, University of Liverpool,L69 7ZT, UK Tel. +44 151 794 2837
More informationShort Note. Non-stationary PEFs and large gaps. William Curry 1 INTRODUCTION
Stanford Exploration Project, Report 120, May 3, 2005, pages 247 257 Short Note Non-stationary PEFs and large gaps William Curry 1 INTRODUCTION Prediction-error filters (PEFs) may be used to interpolate
More informationBIO 360: Vertebrate Physiology Lab 9: Graphing in Excel. Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26
Lab 9: Graphing: how, why, when, and what does it mean? Due 3/26 INTRODUCTION Graphs are one of the most important aspects of data analysis and presentation of your of data. They are visual representations
More informationUsing Excel for Graphical Analysis of Data
EXERCISE Using Excel for Graphical Analysis of Data Introduction In several upcoming experiments, a primary goal will be to determine the mathematical relationship between two variable physical parameters.
More informationNonparametric Regression
Nonparametric Regression John Fox Department of Sociology McMaster University 1280 Main Street West Hamilton, Ontario Canada L8S 4M4 jfox@mcmaster.ca February 2004 Abstract Nonparametric regression analysis
More informationFurther Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables
Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables
More informationYour Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression
Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression Objectives: 1. To learn how to interpret scatterplots. Specifically you will investigate, using
More informationEconomics Nonparametric Econometrics
Economics 217 - Nonparametric Econometrics Topics covered in this lecture Introduction to the nonparametric model The role of bandwidth Choice of smoothing function R commands for nonparametric models
More informationChapter 4: Analyzing Bivariate Data with Fathom
Chapter 4: Analyzing Bivariate Data with Fathom Summary: Building from ideas introduced in Chapter 3, teachers continue to analyze automobile data using Fathom to look for relationships between two quantitative
More informationTips and Guidance for Analyzing Data. Executive Summary
Tips and Guidance for Analyzing Data Executive Summary This document has information and suggestions about three things: 1) how to quickly do a preliminary analysis of time-series data; 2) key things to
More informationLong Term Analysis for the BAM device Donata Bonino and Daniele Gardiol INAF Osservatorio Astronomico di Torino
Long Term Analysis for the BAM device Donata Bonino and Daniele Gardiol INAF Osservatorio Astronomico di Torino 1 Overview What is BAM Analysis in the time domain Analysis in the frequency domain Example
More informationSTATISTICS (STAT) Statistics (STAT) 1
Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).
More informationLesson 5 overview. Concepts. Interpolators. Assessing accuracy Exercise 5
Interpolation Tools Lesson 5 overview Concepts Sampling methods Creating continuous surfaces Interpolation Density surfaces in GIS Interpolators IDW, Spline,Trend, Kriging,Natural neighbors TopoToRaster
More informationModelling and simulation of seismic reflectivity
Modelling reflectivity Modelling and simulation of seismic reflectivity Rita Aggarwala, Michael P. Lamoureux, and Gary F. Margrave ABSTRACT We decompose the reflectivity series obtained from a seismic
More information3 Nonlinear Regression
3 Linear models are often insufficient to capture the real-world phenomena. That is, the relation between the inputs and the outputs we want to be able to predict are not linear. As a consequence, nonlinear
More informationEstimating Noise and Dimensionality in BCI Data Sets: Towards Illiteracy Comprehension
Estimating Noise and Dimensionality in BCI Data Sets: Towards Illiteracy Comprehension Claudia Sannelli, Mikio Braun, Michael Tangermann, Klaus-Robert Müller, Machine Learning Laboratory, Dept. Computer
More informationChristoHouston Energy Inc. (CHE INC.) Pipeline Anomaly Analysis By Liquid Green Technologies Corporation
ChristoHouston Energy Inc. () Pipeline Anomaly Analysis By Liquid Green Technologies Corporation CHE INC. Overview: Review of Scope of Work Wall thickness analysis - Pipeline and sectional statistics Feature
More informationMachine Learning. Topic 4: Linear Regression Models
Machine Learning Topic 4: Linear Regression Models (contains ideas and a few images from wikipedia and books by Alpaydin, Duda/Hart/ Stork, and Bishop. Updated Fall 205) Regression Learning Task There
More informationAdaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited
Adaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited Summary We present a new method for performing full-waveform inversion that appears
More informationChapter 13 Multivariate Techniques. Chapter Table of Contents
Chapter 13 Multivariate Techniques Chapter Table of Contents Introduction...279 Principal Components Analysis...280 Canonical Correlation...289 References...298 278 Chapter 13. Multivariate Techniques
More informationAn Intuitive Explanation of Fourier Theory
An Intuitive Explanation of Fourier Theory Steven Lehar slehar@cns.bu.edu Fourier theory is pretty complicated mathematically. But there are some beautifully simple holistic concepts behind Fourier theory
More informationA time-varying weights tuning method of the double EWMA controller
Available online at www.sciencedirect.com Omega 3 (4) 473 48 www.elsevier.com/locate/dsw A time-varying weights tuning method of the double EWMA controller Chao-Ton Su, Chun-Chin Hsu Department of Industrial
More informationSTAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression
STAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression analysis. Analysis of Variance: one way classification,
More informationGeology Geomath Estimating the coefficients of various Mathematical relationships in Geology
Geology 351 - Geomath Estimating the coefficients of various Mathematical relationships in Geology Throughout the semester you ve encountered a variety of mathematical relationships between various geologic
More informationTime Series Analysis by State Space Methods
Time Series Analysis by State Space Methods Second Edition J. Durbin London School of Economics and Political Science and University College London S. J. Koopman Vrije Universiteit Amsterdam OXFORD UNIVERSITY
More informationRelease Notes for April StatCrunch Updates
Release Notes for April 2018 - StatCrunch Updates Major additions Introducing accessibility features that support full keyboard functionality including new keyboard shortcuts. [Go to page 2] Measures to
More information( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value.
Calibration OVERVIEW... 2 INTRODUCTION... 2 CALIBRATION... 3 ANOTHER REASON FOR CALIBRATION... 4 CHECKING THE CALIBRATION OF A REGRESSION... 5 CALIBRATION IN SIMPLE REGRESSION (DISPLAY.JMP)... 5 TESTING
More informationAM205: lecture 2. 1 These have been shifted to MD 323 for the rest of the semester.
AM205: lecture 2 Luna and Gary will hold a Python tutorial on Wednesday in 60 Oxford Street, Room 330 Assignment 1 will be posted this week Chris will hold office hours on Thursday (1:30pm 3:30pm, Pierce
More informationWatershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS
HOUSEKEEPING Watershed Sciences 4930 & 6920 GEOGRAPHIC INFORMATION SYSTEMS Quizzes Lab 8? WEEK EIGHT Lecture INTERPOLATION & SPATIAL ESTIMATION Joe Wheaton READING FOR TODAY WHAT CAN WE COLLECT AT POINTS?
More informationCourse on Microarray Gene Expression Analysis
Course on Microarray Gene Expression Analysis ::: Normalization methods and data preprocessing Madrid, April 27th, 2011. Gonzalo Gómez ggomez@cnio.es Bioinformatics Unit CNIO ::: Introduction. The probe-level
More information2.3. Quality Assurance: The activities that have to do with making sure that the quality of a product is what it should be.
5.2. QUALITY CONTROL /QUALITY ASSURANCE 5.2.1. STATISTICS 1. ACKNOWLEDGEMENT This paper has been copied directly from the HMA Manual with a few modifications from the original version. The original version
More informationA Comparative Study of LOWESS and RBF Approximations for Visualization
A Comparative Study of LOWESS and RBF Approximations for Visualization Michal Smolik, Vaclav Skala and Ondrej Nedved Faculty of Applied Sciences, University of West Bohemia, Univerzitni 8, CZ 364 Plzen,
More informationAnalytical Techniques for Anomaly Detection Through Features, Signal-Noise Separation and Partial-Value Association
Proceedings of Machine Learning Research 77:20 32, 2017 KDD 2017: Workshop on Anomaly Detection in Finance Analytical Techniques for Anomaly Detection Through Features, Signal-Noise Separation and Partial-Value
More informationRecommender Systems New Approaches with Netflix Dataset
Recommender Systems New Approaches with Netflix Dataset Robert Bell Yehuda Koren AT&T Labs ICDM 2007 Presented by Matt Rodriguez Outline Overview of Recommender System Approaches which are Content based
More informationWhat can we represent as a Surface?
Geography 38/42:376 GIS II Topic 7: Surface Representation and Analysis (Chang: Chapters 13 & 15) DeMers: Chapter 10 What can we represent as a Surface? Surfaces can be used to represent: Continuously
More informationTime Series Analysis DM 2 / A.A
DM 2 / A.A. 2010-2011 Time Series Analysis Several slides are borrowed from: Han and Kamber, Data Mining: Concepts and Techniques Mining time-series data Lei Chen, Similarity Search Over Time-Series Data
More informationContrast Optimization: A faster and better technique for optimizing on MTF ABSTRACT Keywords: INTRODUCTION THEORY
Contrast Optimization: A faster and better technique for optimizing on MTF Ken Moore, Erin Elliott, Mark Nicholson, Chris Normanshire, Shawn Gay, Jade Aiona Zemax, LLC ABSTRACT Our new Contrast Optimization
More informationChronology development, statistical analysis
A R S T A N Guide for computer program ARSTAN, by Edward R. Cook and Richard L. Holmes Adapted from Users Manual for Program ARSTAN, in Tree-Ring Chronologies of Western North America: California, eastern
More informationMachine Learning / Jan 27, 2010
Revisiting Logistic Regression & Naïve Bayes Aarti Singh Machine Learning 10-701/15-781 Jan 27, 2010 Generative and Discriminative Classifiers Training classifiers involves learning a mapping f: X -> Y,
More informationComputer Experiments: Space Filling Design and Gaussian Process Modeling
Computer Experiments: Space Filling Design and Gaussian Process Modeling Best Practice Authored by: Cory Natoli Sarah Burke, Ph.D. 30 March 2018 The goal of the STAT COE is to assist in developing rigorous,
More informationHow to use FSBForecast Excel add-in for regression analysis (July 2012 version)
How to use FSBForecast Excel add-in for regression analysis (July 2012 version) FSBForecast is an Excel add-in for data analysis and regression that was developed at the Fuqua School of Business over the
More informationNeuro-fuzzy admission control in mobile communications systems
University of Wollongong Thesis Collections University of Wollongong Thesis Collection University of Wollongong Year 2005 Neuro-fuzzy admission control in mobile communications systems Raad Raad University
More informationInformation Criteria Methods in SAS for Multiple Linear Regression Models
Paper SA5 Information Criteria Methods in SAS for Multiple Linear Regression Models Dennis J. Beal, Science Applications International Corporation, Oak Ridge, TN ABSTRACT SAS 9.1 calculates Akaike s Information
More informationAn Optimal Regression Algorithm for Piecewise Functions Expressed as Object-Oriented Programs
2010 Ninth International Conference on Machine Learning and Applications An Optimal Regression Algorithm for Piecewise Functions Expressed as Object-Oriented Programs Juan Luo Department of Computer Science
More informationIntroduction. About this Document. What is SPSS. ohow to get SPSS. oopening Data
Introduction About this Document This manual was written by members of the Statistical Consulting Program as an introduction to SPSS 12.0. It is designed to assist new users in familiarizing themselves
More informationProject 11 Graphs (Using MS Excel Version )
Project 11 Graphs (Using MS Excel Version 2007-10) Purpose: To review the types of graphs, and use MS Excel 2010 to create them from a dataset. Outline: You will be provided with several datasets and will
More informationCS130 Regression. Winter Winter 2014 CS130 - Regression Analysis 1
CS130 Regression Winter 2014 Winter 2014 CS130 - Regression Analysis 1 Regression Analysis Regression analysis: usually falls under statistics and mathematical modeling is a form of statistical analysis
More informationParameter Estimation in Differential Equations: A Numerical Study of Shooting Methods
Parameter Estimation in Differential Equations: A Numerical Study of Shooting Methods Franz Hamilton Faculty Advisor: Dr Timothy Sauer January 5, 2011 Abstract Differential equation modeling is central
More informationAdaptive Filtering using Steepest Descent and LMS Algorithm
IJSTE - International Journal of Science Technology & Engineering Volume 2 Issue 4 October 2015 ISSN (online): 2349-784X Adaptive Filtering using Steepest Descent and LMS Algorithm Akash Sawant Mukesh
More informationImage Compression for Mobile Devices using Prediction and Direct Coding Approach
Image Compression for Mobile Devices using Prediction and Direct Coding Approach Joshua Rajah Devadason M.E. scholar, CIT Coimbatore, India Mr. T. Ramraj Assistant Professor, CIT Coimbatore, India Abstract
More informationReference
Leaning diary: research methodology 30.11.2017 Name: Juriaan Zandvliet Student number: 291380 (1) a short description of each topic of the course, (2) desciption of possible examples or exercises done
More informationSTANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I. 4 th Nine Weeks,
STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA I 4 th Nine Weeks, 2016-2017 1 OVERVIEW Algebra I Content Review Notes are designed by the High School Mathematics Steering Committee as a resource for
More informationTopics in Machine Learning
Topics in Machine Learning Gilad Lerman School of Mathematics University of Minnesota Text/slides stolen from G. James, D. Witten, T. Hastie, R. Tibshirani and A. Ng Machine Learning - Motivation Arthur
More information3 Nonlinear Regression
CSC 4 / CSC D / CSC C 3 Sometimes linear models are not sufficient to capture the real-world phenomena, and thus nonlinear models are necessary. In regression, all such models will have the same basic
More informationUvA-DARE (Digital Academic Repository) Memory-type control charts in statistical process control Abbas, N. Link to publication
UvA-DARE (Digital Academic Repository) Memory-type control charts in statistical process control Abbas, N. Link to publication Citation for published version (APA): Abbas, N. (2012). Memory-type control
More informationLC-1: Interference and Diffraction
Your TA will use this sheet to score your lab. It is to be turned in at the end of lab. You must use complete sentences and clearly explain your reasoning to receive full credit. The lab setup has been
More informationSYS 6021 Linear Statistical Models
SYS 6021 Linear Statistical Models Project 2 Spam Filters Jinghe Zhang Summary The spambase data and time indexed counts of spams and hams are studied to develop accurate spam filters. Static models are
More informationAUTOMATED 4 AXIS ADAYfIVE SCANNING WITH THE DIGIBOTICS LASER DIGITIZER
AUTOMATED 4 AXIS ADAYfIVE SCANNING WITH THE DIGIBOTICS LASER DIGITIZER INTRODUCTION The DIGIBOT 3D Laser Digitizer is a high performance 3D input device which combines laser ranging technology, personal
More informationBuilding Better Parametric Cost Models
Building Better Parametric Cost Models Based on the PMI PMBOK Guide Fourth Edition 37 IPDI has been reviewed and approved as a provider of project management training by the Project Management Institute
More informationAN APPROXIMATE INVENTORY MODEL BASED ON DIMENSIONAL ANALYSIS. Victoria University, Wellington, New Zealand
AN APPROXIMATE INVENTORY MODEL BASED ON DIMENSIONAL ANALYSIS by G. A. VIGNAUX and Sudha JAIN Victoria University, Wellington, New Zealand Published in Asia-Pacific Journal of Operational Research, Vol
More information2014 Stat-Ease, Inc. All Rights Reserved.
What s New in Design-Expert version 9 Factorial split plots (Two-Level, Multilevel, Optimal) Definitive Screening and Single Factor designs Journal Feature Design layout Graph Columns Design Evaluation
More information7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability
7 Fractions GRADE 7 FRACTIONS continue to develop proficiency by using fractions in mental strategies and in selecting and justifying use; develop proficiency in adding and subtracting simple fractions;
More informationConditional Volatility Estimation by. Conditional Quantile Autoregression
International Journal of Mathematical Analysis Vol. 8, 2014, no. 41, 2033-2046 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ijma.2014.47210 Conditional Volatility Estimation by Conditional Quantile
More informationExploratory Data Analysis EDA
Exploratory Data Analysis EDA Luc Anselin http://spatial.uchicago.edu 1 from EDA to ESDA dynamic graphics primer on multivariate EDA interpretation and limitations 2 From EDA to ESDA 3 Exploratory Data
More informationBasis Functions. Volker Tresp Summer 2017
Basis Functions Volker Tresp Summer 2017 1 Nonlinear Mappings and Nonlinear Classifiers Regression: Linearity is often a good assumption when many inputs influence the output Some natural laws are (approximately)
More informationThe problem we have now is called variable selection or perhaps model selection. There are several objectives.
STAT-UB.0103 NOTES for Wednesday 01.APR.04 One of the clues on the library data comes through the VIF values. These VIFs tell you to what extent a predictor is linearly dependent on other predictors. We
More informationInvestigation of Shape Parameter for Exponential Weight Function in Moving Least Squares Method
2 (2017) 17-24 Progress in Energy and Environment Journal homepage: http://www.akademiabaru.com/progee.html ISSN: 2600-7662 Investigation of Shape Parameter for Exponential Weight Function in Moving Least
More informationAaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling
Aaron Daniel Chia Huang Licai Huang Medhavi Sikaria Signal Processing: Forecasting and Modeling Abstract Forecasting future events and statistics is problematic because the data set is a stochastic, rather
More informationA Monotonic Sequence and Subsequence Approach in Missing Data Statistical Analysis
Global Journal of Pure and Applied Mathematics. ISSN 0973-1768 Volume 12, Number 1 (2016), pp. 1131-1140 Research India Publications http://www.ripublication.com A Monotonic Sequence and Subsequence Approach
More informationRobotics. Lecture 5: Monte Carlo Localisation. See course website for up to date information.
Robotics Lecture 5: Monte Carlo Localisation See course website http://www.doc.ic.ac.uk/~ajd/robotics/ for up to date information. Andrew Davison Department of Computing Imperial College London Review:
More informationHigh Dynamic Range Imaging.
High Dynamic Range Imaging High Dynamic Range [3] In photography, dynamic range (DR) is measured in exposure value (EV) differences or stops, between the brightest and darkest parts of the image that show
More informationCHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT
CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT 2.1 BRIEF OUTLINE The classification of digital imagery is to extract useful thematic information which is one
More informationPerformance Estimation and Regularization. Kasthuri Kannan, PhD. Machine Learning, Spring 2018
Performance Estimation and Regularization Kasthuri Kannan, PhD. Machine Learning, Spring 2018 Bias- Variance Tradeoff Fundamental to machine learning approaches Bias- Variance Tradeoff Error due to Bias:
More informationNon-Linear Regression. Business Analytics Practice Winter Term 2015/16 Stefan Feuerriegel
Non-Linear Regression Business Analytics Practice Winter Term 2015/16 Stefan Feuerriegel Today s Lecture Objectives 1 Understanding the need for non-parametric regressions 2 Familiarizing with two common
More informationEmerge Workflow CE8 SAMPLE IMAGE. Simon Voisey July 2008
Emerge Workflow SAMPLE IMAGE CE8 Simon Voisey July 2008 Introduction The document provides a step-by-step guide for producing Emerge predicted petrophysical volumes based on log data of the same type.
More informationRecent advances in Metamodel of Optimal Prognosis. Lectures. Thomas Most & Johannes Will
Lectures Recent advances in Metamodel of Optimal Prognosis Thomas Most & Johannes Will presented at the Weimar Optimization and Stochastic Days 2010 Source: www.dynardo.de/en/library Recent advances in
More information