Pre-processing method minimizing the need for reference analyses

Size: px
Start display at page:

Download "Pre-processing method minimizing the need for reference analyses"

Transcription

1 JOURNAL OF CHEMOMETRICS J. Chemometrics 2001; 15: Pre-processing method minimizing the need for reference analyses Per Waaben Hansen* Foss Electric A/S, Slangerupgade 69, DK-3400 Hillerød, Denmark SUMMARY A new pre-processing method called independent interference reduction (IIR) is proposed. It is based on modelling of the interferences by use of principal component analysis (PCA) using samples containing no variation in the parameter of interest. This is followed by subtraction of the modelled effects from the calibration matrix. It is useful in cases where the parameter of interest gives only minor contributions to the calibration matrix and where, at the same time, samples not containing variation in this parameter are easy to obtain. In such a case the method reduces the number of reference analyses required for establishing a reliable calibration. When applied to acetone determination in milk samples by use of Fourier transform infrared (FT-IR) spectroscopy, the method provides a treatment of the data which yields more stable models in the subsequent calibration step. Copyright 2000 John Wiley & Sons, Ltd. KEY WORDS: pre-processing method; independent interference reduction; infrared spectroscopy; reference analysis 1. INTRODUCTION When performing a calibration, the case where the data set has a high signal-to-noise ratio (S/N) is sometimes encountered. This is actually a good case, as it makes extraction of minor features from the data set possible. In spectroscopy, such examples are seen occasionally with mid-infrared (mid-ir) data, especially when the spectra are generated by Fourier transform infrared (FT-IR) spectroscopy. Dimension-reducing methods such as principal component regression (PCR) or partial least squares (PLS) regression [1] are necessary for making the information useful if the component of interest only gives rise to minor spectral features, e.g. because it is present in very low concentrations. In such cases, models of very high complexity result. FT-IR examples include urea [2] and acetone [3] in milk, where PLS models based on factors are common. Such models require a high number of calibration samples (i.e. hundreds) to ensure a stable predictive model. As a result, calculation of a reliable predictive model is very resource-demanding because of the amount of work and reagents required for obtaining reference analyses. * Correspondence to: Per Waaben Hansen, Foss Electric A/S, Slangerupgade 69, DK-3400 Hillerød, Denmark. PWH@foss-electric.dk The work was carried out partly at The Royal Veterinary and Agricultural University, Department of Dairy and Food Science, Rolighedsvej 30, DK-1958 Frederiksberg C, Denmark. Contract/grant sponsor: The Danish Academy of Technical Sciences (Lyngby, Denmark) Contract/grant sponsor: Foss Electric A/S (Hillerød, Denmark) Copyright 2000 John Wiley & Sons, Ltd.

2 124 P. W. HANSEN In the case of a component occurring in very low concentrations, this may seem a waste of good data, as the first PLS factors (or principal components if PCR is used) usually describe the major constituents of the samples. This is because the weak spectral features from a low-concentration component may not be visible to the algorithm before the major interferences have been removed. Thus expensive reference-analysed data are wasted on the determination of scores and loadings for interfering compounds. Therefore a method of removing all (or a major part of) the interferences before commencing regression on the reference-analysed samples would be useful. Usually, spectral data from a sample are easy and inexpensive to obtain the reference analysis is the resource-demanding step. This is the basis for the method proposed in this paper, independent interference reduction (IIR). It is useful in cases where a low-concentration component is determined in a sample matrix with many interferences at high concentrations. The method is applied to an example from FT-IR spectroscopy where the above-mentioned requirements are met. 2. DESCRIPTION OF THE METHOD IIR is a pre-processing method with similarities to two methods proposed recently, orthogonal signal correction (OSC) [4] and direct orthogonalization (DO) [5]. They both model the interferences and scatter effects by building a model of the data matrix with the information on the component of interest removed. This requires that (1) all samples should be reference analysed and that (2) the reference results are of a certain quality. If the latter requirement is not met, both methods might remove potentially useful information on the component of interest. Acting as pre-processing methods, they remove scatter effects more efficiently than other schemes, e.g. multiplicative signal correction (MSC) [1], since they make a detailed model of the background. Furthermore, they provide models that are more interpretable, as interferences are removed prior to the calibration step. This is because the loadings are divided into those relating to the interferences and those related to the useful parts of the data. IIR also removes interferences and scatter effects, and it does so by using two data sets: one set (matrix X simple ) containing a large number of samples showing variation in all parameters except the one of interest, used for pre-processing; and another set (matrices X special and Y special ) containing reference-analysed samples with variation in all parameters, the one of interest in particular, used for calibration. No reference results are needed for the samples contained in X simple. The method thus requires that samples without variation in the parameter of interest are easily obtained, which is often the case when spectroscopic methods are used for predictive purposes. This is the major advantage of IIR when compared to DO and OSC, since the latter two methods require all samples to be associated with their reference results. On the other hand, IIR will not be useful when the component of interest accounts for the majority of the variation in X simple, as IIR pre-processing would remove useful information in such a case. Constituents in milk and dairy products, such as compounds added during milk processing or components occurring only as a result of a disease, may fulfil the requirement that samples (almost) without variation in the parameter of interest are easy to obtain. In the case where a compound not present in ordinary milk samples is added, e.g. during a dairy process, the samples for X simple can be chosen among all samples measured before addition of the compound of interest. The basic steps in IIR are as follows. 1. A data matrix X simple showing typical sample variation in all parameters apart from the component of interest is obtained. This matrix is modelled using principal component analysis (PCA) with m principal components. If X simple is large enough, this results in a number of welldefined and accurate loadings.

3 PRE-PROCESSING METHOD Another matrix X special containing a wide variation in the component of interest is projected onto the loadings obtained in step 1, giving m sample scores. This model (based on m scores and loadings) of the interference part of X special is subtracted from X special, yielding a new matrix X special7m. 3. This interference-reduced X special7m is used with its associated reference matrix Y special to generate a calibration model for predictive purposes, e.g. using PLS, PCR or multiple linear regression (MLR). In step 1 the interferences contained in X simple are modelled. It is therefore necessary that the component of interest is not present (or is constant) in X simple. Deviations from this constraint are not likely to be serious if the component is present in low concentrations (or shows weak spectral features), as the first principal components will not describe it anyway. In such a case, if the prediction error (e.g. root mean square error of prediction (RMSEP)) is used for determining the optimal model in step 3, then problems originating from inclusion of spectral information stemming from the component of interest will be revealed, as this will result in an increased prediction error. The data contained in X special and Y special should fulfil the usual requirements that apply to calibration sets (i.e. that the calibration data must represent future samples). The steps are similar to the ones performed in PCA data pre-treatment (PCA-DP) [6], but where PCA-DP only models instrumental (baseline) effects, IIR is extended to model interferences as well. A detailed description of IIR is given below IIR during calibration The matrices X simple, X special and Y special are required to perform the pre-processing and calibration steps. For the method to be useful, X simple should contain more samples than X special. 1. X simple is centred to give X c simple, the mean spectrum being x m1. 2. PCA is performed to give X c simple ˆ T simplep T simple 1 using m principal components. 3. x m1 is subtracted from X special to give X s special. 4. The scores of X s special on P simple are calculated: T special ˆ X s special P simple 2 5. X s special is reduced according to this model: X s special m ˆ Xs special T special P T simple 3 6. X s special m is mean centred (the mean spectrum being x m2) and regression against Y special is performed. A PLS regression using n factors could be used in this step. The mean centring in step 6 is performed since X s special m is the residual after the projection of X s special onto the PCA model of X simple, which does not necessarily have the same mean as X special IIR during prediction The vectors x pred (a new spectrum), x m1, P simple and x m2 as well as a regression model (calculated during calibration) are required in the prediction step.

4 126 P. W. HANSEN 1. x m1 is subtracted from x pred to give x s pred. 2. The scores of x s pred on P simple are calculated: t pred ˆ x s pred P simple 4 3. x s pred is reduced according to this model: x s pred m ˆ xs pred t predp T simple 5 4. x m2 is subtracted from x s pred m and prediction is performed using the regression model calculated in calibration step 6 The dimensions of the initial PCA model (m) and the final PLS calibration (n) both have influence on the results in terms of the prediction error (RMSEP). Therefore validation should be performed by calculating the prediction error using various values of m and n. From experience a closure similar to the one observed with DO [5] and PCA-DP [6] seems to exist, i.e. the sum of m and n (the overall dimensionality of the model) is constant. The application of IIR presented in the next section shows an example of this observation. It could suggest that nothing is gained from use of IIR, but this is not the case: the first m loadings are generated from more samples (in X simple ) than if only the samples from X special had been used. Thus they are expected to be better defined and less noisy. Furthermore, the calibration is generated from a data set (X special ) which is not diluted by samples not including variation from the component of interest. This may avoid the problem of underestimation of high samples and overestimation of low samples which is common in inverse calibration methods such as PLS [1]. 3. APPLICATION TO FT-IR DATA The application presented here relates to acetone determination in milk from individual cows using FT-IR spectroscopy. This application fulfils the requirements of IIR, namely that (1) acetone is found in milk in low concentrations (e.g mm) and that (2) it is easy to obtain samples known not to contain acetone. The latter requirement is met, since acetone is only present when a cow suffers from ketosis (a metabolic disease), so the milk samples for X simple can be obtained from healthy cows. A detailed description of the data set can be found elsewhere [3]. Only the information necessary for understanding the present application of IIR will be given here Experimental The calibration set consisted of 310 samples from individual cows collected in Norway, Sweden and Denmark. Eight outliers were removed from the set by use of PCA. The remaining samples were divided into two sets: 198 samples containing less than 0 1 mm acetone (i.e. from cows not suffering from ketosis) made up the matrix X simple, while the remaining samples (104 samples, still containing some samples below 0 1 mm) made up X special. The latter sample set was used for building a predictive model (i.e. steps 5 and 6 during calibration). The test set consisted of 58 individual cow samples collected and measured in New Zealand. They were collected in a way that ensured a high proportion of acetone-containing samples. This set was used for testing the models. The rationale for using this set for testing purposes was that if IIR leads to more stable predictive models, then using a test set consisting of samples obtained in another part of the world and measured on another instrument (of the same type) would prove it. All samples were measured on MilkoScan FT 120 FT-IR transmission instruments (Foss Electric A/S, Hillerød, Denmark) located in either Denmark or New Zealand. The instruments were

5 PRE-PROCESSING METHOD 127 standardized [7] before use. The samples were analysed for their acetone content using the reference method [3] Calculations The data analysis and calibration work was performed on a PC using Matlab 1 software (Version 5 2 1; The MathWorks, Inc., Natick, MA). The calibration routines were either programmed by the author or taken from the PLS_Toolbox (Version 1 5 1; Eigenvector Technologies, Manson, WA). Repeatability is expressed as the mean standard deviation (S r ) of multiple determinations performed under identical conditions and is calculated as v u 1 X q X n S r ˆ t x j;i x j 2 q n 1 where q is the number of samples, n is the number of replicates, x j,i is the result of the ith replicate of the jth sample and x j is the average result of the jth sample. Accuracy is expressed as the root mean square error of prediction (RMSEP) and is calculated as jˆ1 iˆ1 v u 1 X N RMSEP ˆ t x i;reference x i;predicted 2 N iˆ1 where N is the number of determinations (number of samples (q) times number of replicates (n) from above) and x i,reference and x i,predicted are the reference and predicted values corresponding to the ith determination respectively. When a bias (mean difference between reference results and predictions) is observed, the standard error of prediction (SEP) is used. It is calculated as v u 1 X N SEP ˆ t x i;reference x i;predicted bias 2 N 1 iˆ RESULTS AND DISCUSSION Calibrations were carried out for all values of m (PCA dimension) from 0 (ordinary PLS) to 25 and with overall model complexities (i.e. m n, n being the number of PLS factors in the final regression step) from m 1 to 30. Cross-validation was used for selecting the optimal model complexity, using two, four, six, eight and 10 cross-validation segments. The average model complexity from these five trials was used when the final model was calculated. Note that the 198 samples containing less than 0 1 mm acetone were used for IIR pre-processing only, i.e. the PLS model was calculated using the 104 calibration samples irrespective the value of m. The cross-validated prediction errors using 10 cross-validation segments for various values of m are apparent in Figure 1. From these plots it is clear that the overall model complexity stabilizes at a value of approximately 25, irrespective of the value of m. This does not apply when m = 25. In that case the same RMSEP is reached when m n = 30, so when m becomes high, the optimal overall model complexity is pushed ahead. Thus there seems to exist a lower limit for n, and this value must

6 128 P. W. HANSEN Figure 1. Cross-validation error for acetone in milk (calculated using 10 cross-validation segments) against model complexity for various dimensions (m) of the initial PCA in IIR. Figure 2. First PLS loadings for models without (m = 0) and with (m = 20) IIR preprocessing, overlaid with the FT-IR spectrum of pure acetone in water. The PLS loading with m = 0 is shifted by 0 3 units for the sake of clarity. Only the range from 1000 to 1800 cm 71 is shown. The range from 1581 to 1697 cm 71 was not used in the calculations owing to a strong water band in the area.

7 PRE-PROCESSING METHOD 129 Table I. Results on the independent test set using various combinations of dimensions m (principal components used for pre-processing) and n (PLS factors) m m n RMSEP SEP S r be n = 5 in the present case. Regarding m, using too high a value for this parameter does not seem to affect the predictive ability of the model. The prediction error does not seem to improve when IIR is used, but it does not become poorer either. The main advantage of IIR is that it results in a smoother decay of the prediction error, which facilitates the location of the optimal number of PLS factors. Furthermore, the PLS loadings (Figure 2) show a closer relation to the component of interest when IIR is used. With an appropriate choice of m (e.g. m = 20) the first PLS loading contains features similar to the spectrum of pure acetone. This is not the case when m = 0 (i.e. without IIR pre-processing), where the first PLS loading mainly contains information about the milk fat in the samples. Thus the application of IIR pre-processing uncovers the relevant acetone information from a complex spectral matrix containing numerous interferences. When the final models are tested using the test set (with m n determined by cross-validation), there seems to be a benefit from using IIR. The results for various values of m are presented in Table I. Both RMSEP and SEP are stated, as SEP is independent of a possible bias, while RMSEP is not. In terms of the SEP the result is independent of the value of m, i.e. the use of IIR. The overall dimensionality of the models ends up at 24 or 25, apart from the case when m = 25 itself. In this light, nothing is gained by using IIR. When looking at the RMSEP, on the other hand, a significant improvement is seen when m is increased from 0 to 20. When m = 20, the RMSEP is not significantly different from the SEP, and the bias obviously present at lower values of m has disappeared. The bias could be a result of not all sample matrices being represented in X special, which is a common problem when the calibration set is too small. The application of IIR using the larger matrix X simple for preprocessing removes this problem by adding a wider range of sample types, i.e. also milk samples collected in another geographical region than the calibration samples, so the predictive model becomes more stable. The repeatability s r, which is a measure of the noise in the experiment, is not affected by the use of IIR: it is stable at a value of mm. When m becomes greater than the apparent overall dimension, the RMSEP increases while the SEP remains the same. The overall result is that IIR tends to remove a bias, but it requires careful selection of the value of the parameter m, as the bias increases when m becomes too high. In the present example the optimal model is obtained when m = 20 and n = 5. The predicted versus measured plot for the test set for this case is shown in Figure 3. The fact that the prediction error for the New Zealand samples in terms of the RMSEP exhibits a minimum at a certain combination of m and n is particularly promising, as it means that with IIR preprocessing, the calibration set does not necessarily need to contain local samples from the geographical region where the calibration model is to be used. It does, however, require that the samples contained in X simple are representative for the variations observed in the local samples. Another perspective of the method is that only the PCA loadings calculated from X simple need to be stored. Thus a set of e.g. 20 loadings (calculated from a large number of samples) could be saved and

8 130 P. W. HANSEN Figure 3. Reference versus predicted acetone in milk for test samples using a model with IIR pre-processing, where m = 20 and n = 5. RMSEP = 0 31 mm and s r = 0 17 mm. used for pre-processing in the future even when predictive models for parameters other than acetone are sought. 5. CONCLUSIONS IIR is a new pre-processing method which can be useful for specific applications where the following requirements are met: (1) the parameter of interest only gives minor contributions to the X matrix (e.g. a component in low concentrations); and (2) samples in which the parameter of interest is constant should be easy to obtain. The benefits of the method are that (1) it reduces the need for resource-demanding experiments to obtain Y data for calibration (i.e. reference analyses) and that (2) the resulting calibration model is more interpretable owing to the splitting of the loadings into those related to the interferences and those related to the parameter of interest. The main disadvantage is that two parameters, m and n, must be optimized when PLS regression is used for calibration. In the specific case where acetone is determined by use of FT-IR, an improved predictive model could be obtained by use of IIR. On this data set the major improvement over the ordinary PLS calibration was that future predictions calculated using the model became more stable (i.e. had a lower bias when used on samples from another geographical region measured on different instruments at different locations). ACKNOWLEDGEMENTS The author would like to thank The Danish Academy of Technical Sciences (Lyngby, Denmark) and Foss Electric A/S (Hillerød, Denmark) for providing the funding for the present work. It was carried

9 PRE-PROCESSING METHOD 131 out in co-operation with Food Technology at the Department of Dairy and Food Science (The Royal Veterinary and Agricultural University, Frederiksberg, Denmark). Dairying Research Corporation Ltd (Hamilton, New Zealand) and New Zealand Dairy Research Institute (Palmerston North, New Zealand) are thanked for providing samples and equipment for the practical work. Claus A. Andersson, Rasmus Bro, Henrik V. Juhl, Lars Nørgaard and Carsten Ridder are thanked for useful discussions. REFERENCES 1. Martens H, Næs T. Multivariate Calibration (2nd edn). Wiley: Chichester, Hansen PW. Milchwissenschaft 1998; 53: Hansen PW. J. Dairy Sci. 1999; 82: Wold S, Antti H, Lindgren F, Öhman J. Chemometrics Intell. Lab. Syst. 1998; 44: Andersson CA. Chemometrics Intell. Lab. Syst. 1999; 47: Sun J. J. Chemometrics 1997; 11: Andersen HV, Kjær L, Hansen PW, Ridder C. US Patent , 1999.

SELECTION OF A MULTIVARIATE CALIBRATION METHOD

SELECTION OF A MULTIVARIATE CALIBRATION METHOD SELECTION OF A MULTIVARIATE CALIBRATION METHOD 0. Aim of this document Different types of multivariate calibration methods are available. The aim of this document is to help the user select the proper

More information

Data Preprocessing. D.N. Rutledge, AgroParisTech

Data Preprocessing. D.N. Rutledge, AgroParisTech Data Preprocessing D.N. Rutledge, AgroParisTech 12 Rencontres Héliospir / 30 septembre 2011 Outline Zone selection Examining raw data The importance of pre-treatment of data Common pre-treatment methods

More information

Basics of Multivariate Modelling and Data Analysis

Basics of Multivariate Modelling and Data Analysis Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 9. Linear regression with latent variables 9.1 Principal component regression (PCR) 9.2 Partial least-squares regression (PLS) [ mostly

More information

Pre-processing in vibrational spectroscopy, a when, why and how

Pre-processing in vibrational spectroscopy, a when, why and how Analytical Methods Pre-processing in vibrational spectroscopy, a when, why and how Journal: Analytical Methods Manuscript ID: AY-TRV--0-00.R Article Type: Tutorial Review Date Submitted by the Author:

More information

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract 19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms

More information

Application of genetic algorithm PLS for feature selection in spectral data sets

Application of genetic algorithm PLS for feature selection in spectral data sets JOURNAL OF CHEMOMETRICS J. Chemometrics 2000; 14: 643 655 Application of genetic algorithm PLS for feature selection in spectral data sets Riccardo Leardi* Department of Pharmaceutical and Food Chemistry

More information

2008 Cemagref. Reprinted with permission.

2008 Cemagref. Reprinted with permission. O. Haavisto and H. Hyötyniemi. 8. Partial least squares estimation of mineral flotation slurry contents using optical reflectance spectra. In: Proceedings of the th Conference on Chemometrics in Analytical

More information

Hyperspectral Chemical Imaging: principles and Chemometrics.

Hyperspectral Chemical Imaging: principles and Chemometrics. Hyperspectral Chemical Imaging: principles and Chemometrics aoife.gowen@ucd.ie University College Dublin University College Dublin 1,596 PhD students 6,17 international students 8,54 graduate students

More information

Multivariate Calibration Quick Guide

Multivariate Calibration Quick Guide Last Updated: 06.06.2007 Table Of Contents 1. HOW TO CREATE CALIBRATION MODELS...1 1.1. Introduction into Multivariate Calibration Modelling... 1 1.1.1. Preparing Data... 1 1.2. Step 1: Calibration Wizard

More information

Bread Water Content Measurement Based on Hyperspectral Imaging

Bread Water Content Measurement Based on Hyperspectral Imaging 93 Bread Water Content Measurement Based on Hyperspectral Imaging Zhi Liu 1, Flemming Møller 1.2 1 Department of Informatics and Mathematical Modelling, Technical University of Denmark, Kgs. Lyngby, Denmark

More information

Calibration with absolute shrinkage

Calibration with absolute shrinkage JOURNAL OF CHEMOMETRICS J. Chemometrics 2001; 15: 497 509 DOI: 10.1002/cem.635 Calibration with absolute shrinkage Henrik Öjelund*, Henrik Madsen and Poul Thyregod Department of Mathematical Modelling,

More information

Characterizing dissolved organic matter fluorescence with parallel factor analysis: a tutorial

Characterizing dissolved organic matter fluorescence with parallel factor analysis: a tutorial Limnol. Oceanogr.: Methods 6, 2008 2008, by the American Society of Limnology and Oceanography, Inc. Characterizing dissolved organic matter fluorescence with parallel factor analysis: a tutorial Colin

More information

Agilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426

Agilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426 Agilent MicroLab Quant Calibration Software: Measure Oil in Water using Method IP 426 Application Note Environmental Authors John Seelenbinder and Dipak Mainali Agilent Technologies, Inc. Introduction

More information

Final Report: Kaggle Soil Property Prediction Challenge

Final Report: Kaggle Soil Property Prediction Challenge Final Report: Kaggle Soil Property Prediction Challenge Saurabh Verma (verma076@umn.edu, (612)598-1893) 1 Project Goal Low cost and rapid analysis of soil samples using infrared spectroscopy provide new

More information

Fourier analysis of low-resolution satellite images of cloud

Fourier analysis of low-resolution satellite images of cloud New Zealand Journal of Geology and Geophysics, 1991, Vol. 34: 549-553 0028-8306/91/3404-0549 $2.50/0 Crown copyright 1991 549 Note Fourier analysis of low-resolution satellite images of cloud S. G. BRADLEY

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 2015 MODULE 4 : Modelling experimental data Time allowed: Three hours Candidates should answer FIVE questions. All questions carry equal

More information

Bootstrapping Method for 14 June 2016 R. Russell Rhinehart. Bootstrapping

Bootstrapping Method for  14 June 2016 R. Russell Rhinehart. Bootstrapping Bootstrapping Method for www.r3eda.com 14 June 2016 R. Russell Rhinehart Bootstrapping This is extracted from the book, Nonlinear Regression Modeling for Engineering Applications: Modeling, Model Validation,

More information

Principal Component Analysis

Principal Component Analysis Copyright 2004, Casa Software Ltd. All Rights Reserved. 1 of 16 Principal Component Analysis Introduction XPS is a technique that provides chemical information about a sample that sets it apart from other

More information

Tutorial: Time series hyperspectral image analysis. Dorrepaal, Ronan; Malegori, Cristina; Gowen, Aoife.

Tutorial: Time series hyperspectral image analysis. Dorrepaal, Ronan; Malegori, Cristina; Gowen, Aoife. Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title Tutorial: Time series hyperspectral image

More information

17. SEISMIC ANALYSIS MODELING TO SATISFY BUILDING CODES

17. SEISMIC ANALYSIS MODELING TO SATISFY BUILDING CODES 17. SEISMIC ANALYSIS MODELING TO SATISFY BUILDING CODES The Current Building Codes Use the Terminology: Principal Direction without a Unique Definition 17.1 INTRODUCTION { XE "Building Codes" }Currently

More information

Raman Images. Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch. 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc.

Raman Images. Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch. 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc. Analyzing and Visualizing Large Raman Images Jeremy M. Shaver 1, Eunah Lee 2, Andrew Whitley 2, R. Scott Koch 1 1. Eigenvector Research, Inc. 2. HORIBA Jobin Yvon, Inc. What is a Large Image? Gone from

More information

The latest trend of hybrid instrumentation

The latest trend of hybrid instrumentation Multivariate Data Processing of Spectral Images: The Ugly, the Bad, and the True The results of various multivariate data-processing methods of Raman maps recorded with a dispersive Raman microscope are

More information

Detecting Polytomous Items That Have Drifted: Using Global Versus Step Difficulty 1,2. Xi Wang and Ronald K. Hambleton

Detecting Polytomous Items That Have Drifted: Using Global Versus Step Difficulty 1,2. Xi Wang and Ronald K. Hambleton Detecting Polytomous Items That Have Drifted: Using Global Versus Step Difficulty 1,2 Xi Wang and Ronald K. Hambleton University of Massachusetts Amherst Introduction When test forms are administered to

More information

Principal Component Image Interpretation A Logical and Statistical Approach

Principal Component Image Interpretation A Logical and Statistical Approach Principal Component Image Interpretation A Logical and Statistical Approach Md Shahid Latif M.Tech Student, Department of Remote Sensing, Birla Institute of Technology, Mesra Ranchi, Jharkhand-835215 Abstract

More information

Experiment 5: Exploring Resolution, Signal, and Noise using an FTIR CH3400: Instrumental Analysis, Plymouth State University, Fall 2013

Experiment 5: Exploring Resolution, Signal, and Noise using an FTIR CH3400: Instrumental Analysis, Plymouth State University, Fall 2013 Experiment 5: Exploring Resolution, Signal, and Noise using an FTIR CH3400: Instrumental Analysis, Plymouth State University, Fall 2013 Adapted from JP Blitz and DG Klarup, "Signal-to-Noise Ratio, Signal

More information

Introduction to Multivariate Image Analysis (MIA) Table of Contents

Introduction to Multivariate Image Analysis (MIA) Table of Contents Introduction to Multivariate Image Analysis (MIA) Copyright 1996-2013 Eigenvector Research, Inc. No part of this material may be photocopied or reproduced in any form without prior written consent from

More information

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008. Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008. Introduction There is much that is unknown regarding

More information

A NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS

A NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS A NEW VARIABLES SELECTION AND DIMENSIONALITY REDUCTION TECHNIQUE COUPLED WITH SIMCA METHOD FOR THE CLASSIFICATION OF TEXT DOCUMENTS Ahmed Abdelfattah Saleh University of Brasilia, Brasil ahmdsalh@yahoo.com

More information

NIR Technologies Inc. FT-NIR Technical Note TN 005

NIR Technologies Inc. FT-NIR Technical Note TN 005 NIR Technologies Inc. FT-NIR Technical Note TN 00 IDENTIFICATION OF PURE AND BLENDED FOOD SPICES Fourier Transform Near Infrared Spectroscopy (FT-NIR) (Fast, Accurate, Reliable, and Non-destructive) Identification

More information

Statistics for Engineering

Statistics for Engineering Statistics for Engineering Section 6: Latent variable modelling Kevin Dunn Copyright, and all rights reserved, Kevin Dunn, 2011 kevin.dunn@connectmv.com April 2011 1 What we will cover What is a latent

More information

An Intelligent Clustering Algorithm for High Dimensional and Highly Overlapped Photo-Thermal Infrared Imaging Data

An Intelligent Clustering Algorithm for High Dimensional and Highly Overlapped Photo-Thermal Infrared Imaging Data An Intelligent Clustering Algorithm for High Dimensional and Highly Overlapped Photo-Thermal Infrared Imaging Data Nian Zhang and Lara Thompson Department of Electrical and Computer Engineering, University

More information

Preliminaries: Size Measures and Shape Coordinates

Preliminaries: Size Measures and Shape Coordinates 2 Preliminaries: Size Measures and Shape Coordinates 2.1 Configuration Space Definition 2.1 The configuration is the set of landmarks on a particular object. The configuration matrix X is the k m matrix

More information

Using the DATAMINE Program

Using the DATAMINE Program 6 Using the DATAMINE Program 304 Using the DATAMINE Program This chapter serves as a user s manual for the DATAMINE program, which demonstrates the algorithms presented in this book. Each menu selection

More information

Spectrum analysis by recursively pruned extended auto-associative neural network

Spectrum analysis by recursively pruned extended auto-associative neural network JOURNAL OF CHEMOMETRICS J. Chemometrics 25; 19: 492 499 Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 1.12/cem.955 Spectrum analysis by recursively pruned extended auto-associative

More information

Instrument Specific Signal

Instrument Specific Signal Wavelet transform used to preprocessing of Apple near infrared spectra Wang Xiaoli,, Zhang Xiaoli *, Zhou Guomin. Beijing Forestry University, BeiJing, 00083. Institue of agricultural information, Chinese

More information

MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data

MINI-PAPER A Gentle Introduction to the Analysis of Sequential Data MINI-PAPER by Rong Pan, Ph.D., Assistant Professor of Industrial Engineering, Arizona State University We, applied statisticians and manufacturing engineers, often need to deal with sequential data, which

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

Chapter 5. Track Geometry Data Analysis

Chapter 5. Track Geometry Data Analysis Chapter Track Geometry Data Analysis This chapter explains how and why the data collected for the track geometry was manipulated. The results of these studies in the time and frequency domain are addressed.

More information

COLOR FIDELITY OF CHROMATIC DISTRIBUTIONS BY TRIAD ILLUMINANT COMPARISON. Marcel P. Lucassen, Theo Gevers, Arjan Gijsenij

COLOR FIDELITY OF CHROMATIC DISTRIBUTIONS BY TRIAD ILLUMINANT COMPARISON. Marcel P. Lucassen, Theo Gevers, Arjan Gijsenij COLOR FIDELITY OF CHROMATIC DISTRIBUTIONS BY TRIAD ILLUMINANT COMPARISON Marcel P. Lucassen, Theo Gevers, Arjan Gijsenij Intelligent Systems Lab Amsterdam, University of Amsterdam ABSTRACT Performance

More information

Novel variable influence on projection (VIP) methods in OPLS, O2PLS, and OnPLS models for single- and multiblock variable selection

Novel variable influence on projection (VIP) methods in OPLS, O2PLS, and OnPLS models for single- and multiblock variable selection Novel variable influence on projection (VIP) methods in OPLS, O2PLS, and OnPLS models for single- and multiblock variable selection VIPOPLS, VIPO2PLS, and MB-VIOP methods Beatriz Galindo-Prieto Doctoral

More information

Investigation into the use of PCA with Machine Learning for the Identification of Narcotics based on Raman Spectroscopy

Investigation into the use of PCA with Machine Learning for the Identification of Narcotics based on Raman Spectroscopy Investigation into the use of PCA with Machine Learning for the Identification of Narcotics based on Raman Spectroscopy Tom Howley, Michael G. Madden, Marie-Louise O Connell and Alan G. Ryder National

More information

SPA_GUI. Matlab graphical user interface allowing signal processing and. variable selection for multivariate calibration.

SPA_GUI. Matlab graphical user interface allowing signal processing and. variable selection for multivariate calibration. Matlab graphical user interface allowing signal processing and variable selection for multivariate calibration. www.ele.ita.br/~kawakami/spa January/2012 About SPA_GUI SPA_GUI is a Matlab graphical user

More information

THE DEVELOPMENT OF CALIBRATION MODELS FOR SPECTROSCOPIC DATA USING PRINCIPAL COMPONENT REGRESSION

THE DEVELOPMENT OF CALIBRATION MODELS FOR SPECTROSCOPIC DATA USING PRINCIPAL COMPONENT REGRESSION THE DEVELOPMENT OF CALIBRATION MODELS FOR SPECTROSCOPIC DATA USING PRINCIPAL COMPONENT REGRESSION R. De Maesschalck 1, F. Estienne 1, J. Verdú-Andrés 1, A. Candolfi 1, V. Centner 1, F. Despagne 1, D. Jouan-Rimbaud

More information

Spectrum 10 Spectroscopy Software

Spectrum 10 Spectroscopy Software PRODUCT Note Infrared Spectroscopy Spectrum 10 Spectroscopy Software PerkinElmer Spectrum 10 is the newly released 2010 software for the latest range of PerkinElmer FT-IR spectrometers. Designed for busy

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Package lspls. July 27, 2018

Package lspls. July 27, 2018 Title LS-PLS Models Version 0.2-2 Date 2018-07-26 Author [aut, cre] Package lspls July 27, 2018 Maintainer Encoding UTF-8 Depends pls (>= 2.2.0) Imports grdevices, graphics, methods, stats

More information

Using surface markings to enhance accuracy and stability of object perception in graphic displays

Using surface markings to enhance accuracy and stability of object perception in graphic displays Using surface markings to enhance accuracy and stability of object perception in graphic displays Roger A. Browse a,b, James C. Rodger a, and Robert A. Adderley a a Department of Computing and Information

More information

Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools

Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools Abstract In this project, we study 374 public high schools in New York City. The project seeks to use regression techniques

More information

Clustering and Visualisation of Data

Clustering and Visualisation of Data Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some

More information

What s new in The Unscrambler X version Lars Gidskehaug

What s new in The Unscrambler X version Lars Gidskehaug What s new in The Unscrambler X version 10.4 Lars Gidskehaug Who am I? Born Bergen, Norway 2002: MSc, chemometrics, NTNU Trondheim 2007: PhD, bioinformatics, NTNU Trondheim 2007-2011: genomics, CIGENE,

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data Using Excel for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters. Graphs are

More information

ANNUAL REPORT OF HAIL STUDIES NEIL G, TOWERY AND RAND I OLSON. Report of Research Conducted. 15 May May For. The Country Companies

ANNUAL REPORT OF HAIL STUDIES NEIL G, TOWERY AND RAND I OLSON. Report of Research Conducted. 15 May May For. The Country Companies ISWS CR 182 Loan c.l ANNUAL REPORT OF HAIL STUDIES BY NEIL G, TOWERY AND RAND I OLSON Report of Research Conducted 15 May 1976-14 May 1977 For The Country Companies May 1977 ANNUAL REPORT OF HAIL STUDIES

More information

Cover Page. The handle holds various files of this Leiden University dissertation.

Cover Page. The handle   holds various files of this Leiden University dissertation. Cover Page The handle http://hdl.handle.net/1887/22055 holds various files of this Leiden University dissertation. Author: Koch, Patrick Title: Efficient tuning in supervised machine learning Issue Date:

More information

Applying Supervised Learning

Applying Supervised Learning Applying Supervised Learning When to Consider Supervised Learning A supervised learning algorithm takes a known set of input data (the training set) and known responses to the data (output), and trains

More information

DESIGN OF EXPERIMENTS and ROBUST DESIGN

DESIGN OF EXPERIMENTS and ROBUST DESIGN DESIGN OF EXPERIMENTS and ROBUST DESIGN Problems in design and production environments often require experiments to find a solution. Design of experiments are a collection of statistical methods that,

More information

Analog input to digital output correlation using piecewise regression on a Multi Chip Module

Analog input to digital output correlation using piecewise regression on a Multi Chip Module Analog input digital output correlation using piecewise regression on a Multi Chip Module Farrin Hockett ECE 557 Engineering Data Analysis and Modeling. Fall, 2004 - Portland State University Abstract

More information

Cycle Criteria for Detection of Camouflaged Targets

Cycle Criteria for Detection of Camouflaged Targets Barbara L. O Kane, Ph.D. US Army RDECOM CERDEC NVESD Ft. Belvoir, VA 22060-5806 UNITED STATES OF AMERICA Email: okane@nvl.army.mil Gary L. Page Booz Allen Hamilton Arlington, VA 22203 David L. Wilson,

More information

Non-destructive techniques in seed quality determination

Non-destructive techniques in seed quality determination Non-destructive techniques in seed quality determination Merete Halkjær Olesen Aarhus University Science & Technology Department of Agroecology TATION presen Outline Seed anatomy and the importance of

More information

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016 CPSC 340: Machine Learning and Data Mining Principal Component Analysis Fall 2016 A2/Midterm: Admin Grades/solutions will be posted after class. Assignment 4: Posted, due November 14. Extra office hours:

More information

Comparison of Variational Bayes and Gibbs Sampling in Reconstruction of Missing Values with Probabilistic Principal Component Analysis

Comparison of Variational Bayes and Gibbs Sampling in Reconstruction of Missing Values with Probabilistic Principal Component Analysis Comparison of Variational Bayes and Gibbs Sampling in Reconstruction of Missing Values with Probabilistic Principal Component Analysis Luis Gabriel De Alba Rivera Aalto University School of Science and

More information

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani Feature Selection CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Dimensionality reduction Feature selection vs. feature extraction Filter univariate

More information

Improved Centroid Peak Detection and Mass Accuracy using a Novel, Fast Data Reconstruction Method

Improved Centroid Peak Detection and Mass Accuracy using a Novel, Fast Data Reconstruction Method Overview Improved Centroid Peak Detection and Mass Accuracy using a Novel, Fast Data Reconstruction Method James A. Ferguson 1, William G. Sawyers 1, Keith A. Waddell 1, Anthony G. Ferrige 2, Robert Alecio

More information

Unsupervised learning in Vision

Unsupervised learning in Vision Chapter 7 Unsupervised learning in Vision The fields of Computer Vision and Machine Learning complement each other in a very natural way: the aim of the former is to extract useful information from visual

More information

FEA and Multivariate Statistical Data Analysis of Polypropylene Tube Forming Process

FEA and Multivariate Statistical Data Analysis of Polypropylene Tube Forming Process FEA and Multivariate Statistical Data Analysis of Polypropylene ube Forming Process Galini Gavrilidou, Mukesh Jain Department of Mechanical Engineering, McMaster University, Hamilton, On, LS L7, Canada

More information

I How does the formulation (5) serve the purpose of the composite parameterization

I How does the formulation (5) serve the purpose of the composite parameterization Supplemental Material to Identifying Alzheimer s Disease-Related Brain Regions from Multi-Modality Neuroimaging Data using Sparse Composite Linear Discrimination Analysis I How does the formulation (5)

More information

NMRProcFlow Macro-command Reference Guide

NMRProcFlow Macro-command Reference Guide NMRProcFlow Macro-command Reference Guide This document is the reference guide of the macro-commands Daniel Jacob UMR 1332 BFP, Metabolomics Facility CGFB Bordeaux, MetaboHUB - 2018 1 NMRProcFlow - Macro-command

More information

Performance optimization of a rotary mower using Taguchi method

Performance optimization of a rotary mower using Taguchi method Agronomy Research Biosystem Engineering Special Issue, 49-54, 0 Performance optimization of a rotary mower using Taguchi method S. Saeid Hosseini and Mohsen Shamsi Department of mechanics of Agricultural

More information

Validation of a Direct Analysis in Real Time Mass Spectrometry (DART-MS) Method for the Quantitation of Six Carbon Sugar in Saccharification Matrix

Validation of a Direct Analysis in Real Time Mass Spectrometry (DART-MS) Method for the Quantitation of Six Carbon Sugar in Saccharification Matrix Validation of a Direct Analysis in Real Time Mass Spectrometry (DART-MS) Method for the Quantitation of Six Carbon Sugar in Saccharification Matrix Daudi Saang onyo, a,b Gary Selby b and Darrin L. Smith*

More information

MetaData for Database Mining

MetaData for Database Mining MetaData for Database Mining John Cleary, Geoffrey Holmes, Sally Jo Cunningham, and Ian H. Witten Department of Computer Science University of Waikato Hamilton, New Zealand. Abstract: At present, a machine

More information

Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude

Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude Fuzzy k-nn applied to moulds detection. Kuske Martyna, Rubio, Rubio Rafael, Nicolas Jacques, Marco Santiago, Romain Anne-Claude Communication presented at ISOEN 2003 RIGA- Latvia Introduction Excessive

More information

Refined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA

Refined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA Copyright 3 SPIE Refined Measurement of Digital Image Texture Loss Peter D. Burns Burns Digital Imaging Fairport, NY USA ABSTRACT Image texture is the term given to the information-bearing fluctuations

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

Multivariate Analysis Multivariate Calibration part 2

Multivariate Analysis Multivariate Calibration part 2 Multivariate Analysis Multivariate Calibration part 2 Prof. Dr. Anselmo E de Oliveira anselmo.quimica.ufg.br anselmo.disciplinas@gmail.com Linear Latent Variables An essential concept in multivariate data

More information

Abstract. Heat and Power Technology. Internship Report. Jack Siman

Abstract. Heat and Power Technology. Internship Report. Jack Siman Heat and Power Technology Internship Report Jack Siman john.siman@duke.edu Submission Date 2017-July-28 Supervisor Jens Fridh Abstract The post processing methods for interpreting flow characteristics

More information

Computer Experiments: Space Filling Design and Gaussian Process Modeling

Computer Experiments: Space Filling Design and Gaussian Process Modeling Computer Experiments: Space Filling Design and Gaussian Process Modeling Best Practice Authored by: Cory Natoli Sarah Burke, Ph.D. 30 March 2018 The goal of the STAT COE is to assist in developing rigorous,

More information

Challenges in data compression for current and future imagers and hyperspectral sounders. Nigel Atkinson

Challenges in data compression for current and future imagers and hyperspectral sounders. Nigel Atkinson Challenges in data compression for current and future imagers and hyperspectral sounders Nigel Atkinson Met Office, Fitzroy Road, Exeter, EX1 3PB, UK Abstract For current high-volume datasets such as IASI,

More information

Fitting Fragility Functions to Structural Analysis Data Using Maximum Likelihood Estimation

Fitting Fragility Functions to Structural Analysis Data Using Maximum Likelihood Estimation Fitting Fragility Functions to Structural Analysis Data Using Maximum Likelihood Estimation 1. Introduction This appendix describes a statistical procedure for fitting fragility functions to structural

More information

The Next Generation of Milk Analysis

The Next Generation of Milk Analysis The Next Generation of Milk Analysis N e x G e n S e r i e s DairySpec FT Fat Protein Lactose Solids AND MORE INTRODUCING THE NEWEST IN THE NEXGEN SERIES The Bentley DairySpec FT A highly accurate, extremely

More information

Feature clustering and mutual information for the selection of variables in spectral data

Feature clustering and mutual information for the selection of variables in spectral data Feature clustering and mutual information for the selection of variables in spectral data C. Krier 1, D. François 2, F.Rossi 3, M. Verleysen 1 1,2 Université catholique de Louvain, Machine Learning Group

More information

ECE 285 Class Project Report

ECE 285 Class Project Report ECE 285 Class Project Report Based on Source localization in an ocean waveguide using supervised machine learning Yiwen Gong ( yig122@eng.ucsd.edu), Yu Chai( yuc385@eng.ucsd.edu ), Yifeng Bu( ybu@eng.ucsd.edu

More information

EPS Product Validation Report: IASI L1 PCC PPF

EPS Product Validation Report: IASI L1 PCC PPF EPS Product Validation Report: IASI L1 PCC PPF Doc.No. : Issue : v2 Date : 5 August 2010 EUMETSAT Eumetsat-Allee 1, D-64295 Darmstadt, Germany Tel: +49 6151 807-7 Fax: +49 6151 807 555 http://www.eumetsat.int

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data EXERCISE Using Excel for Graphical Analysis of Data Introduction In several upcoming experiments, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

A Neural Network Model Of Insurance Customer Ratings

A Neural Network Model Of Insurance Customer Ratings A Neural Network Model Of Insurance Customer Ratings Jan Jantzen 1 Abstract Given a set of data on customers the engineering problem in this study is to model the data and classify customers

More information

Evaluation of empirical models for calibration and classification

Evaluation of empirical models for calibration and classification Evaluation of empirical models for calibration and classification Kurt VARMUZA Vienna University of Technology Institute of Chemical Engineering and Department of Statistics and Probability Theory www.lcm.tuwien.ac.at,

More information

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,

More information

Multicomponent f-x seismic random noise attenuation via vector autoregressive operators

Multicomponent f-x seismic random noise attenuation via vector autoregressive operators Multicomponent f-x seismic random noise attenuation via vector autoregressive operators Mostafa Naghizadeh and Mauricio Sacchi ABSTRACT We propose an extension of the traditional frequency-space (f-x)

More information

Multiresponse Sparse Regression with Application to Multidimensional Scaling

Multiresponse Sparse Regression with Application to Multidimensional Scaling Multiresponse Sparse Regression with Application to Multidimensional Scaling Timo Similä and Jarkko Tikka Helsinki University of Technology, Laboratory of Computer and Information Science P.O. Box 54,

More information

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING Christopher Burges, Daniel Plastina, John Platt, Erin Renshaw, and Henrique Malvar March 24 Technical Report MSR-TR-24-19 Audio fingerprinting

More information

INVESTIGATIONS OF CROSS-CORRELATION AND EUCLIDEAN DISTANCE TARGET MATCHING TECHNIQUES IN THE MPEF ENVIRONMENT. Greg And Ken Holmlund # ABSTRACT

INVESTIGATIONS OF CROSS-CORRELATION AND EUCLIDEAN DISTANCE TARGET MATCHING TECHNIQUES IN THE MPEF ENVIRONMENT. Greg And Ken Holmlund # ABSTRACT INVESTIGATIONS OF CROSS-CORRELATION AND EUCLIDEAN DISTANCE TARGET MATCHING TECHNIQUES IN THE MPEF ENVIRONME Greg Dew @ And Ken Holmlund # @ Logica # EUMETSAT ABSTRACT Cross-Correlation and Euclidean Distance

More information

Cluster Analysis for Microarray Data

Cluster Analysis for Microarray Data Cluster Analysis for Microarray Data Seventh International Long Oligonucleotide Microarray Workshop Tucson, Arizona January 7-12, 2007 Dan Nettleton IOWA STATE UNIVERSITY 1 Clustering Group objects that

More information

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 23 CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 3.1 DESIGN OF EXPERIMENTS Design of experiments is a systematic approach for investigation of a system or process. A series

More information

University of Florida CISE department Gator Engineering. Data Preprocessing. Dr. Sanjay Ranka

University of Florida CISE department Gator Engineering. Data Preprocessing. Dr. Sanjay Ranka Data Preprocessing Dr. Sanjay Ranka Professor Computer and Information Science and Engineering University of Florida, Gainesville ranka@cise.ufl.edu Data Preprocessing What preprocessing step can or should

More information

Lecture 8 Object Descriptors

Lecture 8 Object Descriptors Lecture 8 Object Descriptors Azadeh Fakhrzadeh Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University 2 Reading instructions Chapter 11.1 11.4 in G-W Azadeh Fakhrzadeh

More information

Outlier detection using autoencoders

Outlier detection using autoencoders Outlier detection using autoencoders August 19, 2016 Author: Olga Lyudchik Supervisors: Dr. Jean-Roch Vlimant Dr. Maurizio Pierini CERN Non Member State Summer Student Report 2016 Abstract Outlier detection

More information

Position Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function

Position Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function Position Error Reduction of Kinematic Mechanisms Using Tolerance Analysis and Cost Function B.Moetakef-Imani, M.Pour Department of Mechanical Engineering, Faculty of Engineering, Ferdowsi University of

More information

A Method for Edge Detection in Hyperspectral Images Based on Gradient Clustering

A Method for Edge Detection in Hyperspectral Images Based on Gradient Clustering A Method for Edge Detection in Hyperspectral Images Based on Gradient Clustering V.C. Dinh, R. P. W. Duin Raimund Leitner Pavel Paclik Delft University of Technology CTR AG PR Sys Design The Netherlands

More information

Classifying Body and Surface Reflections using Expectation-Maximization

Classifying Body and Surface Reflections using Expectation-Maximization IS&T's 23 PICS Conference Classifying Body and Surface Reflections using Expectation-Maximization Hans J. Andersen and Moritz Störring Computer Vision and Media Technology Laboratory Institute of Health

More information

Using the generalized Radon transform for detection of curves in noisy images

Using the generalized Radon transform for detection of curves in noisy images Downloaded from orbit.dtu.dk on: Jan 13, 2018 Using the generalized Radon transform for detection of curves in noisy images Toft, eter Aundal ublished in: Conference roceedings. 1996 IEEE International

More information

Package ALS. August 3, 2015

Package ALS. August 3, 2015 Type Package Package ALS August 3, 2015 Title Multivariate Curve Resolution Alternating Least Squares (MCR-ALS) Version 0.0.6 Author Katharine M. Mullen Maintainer Katharine Mullen

More information

CREATING THE ANALYSIS

CREATING THE ANALYSIS Chapter 14 Multiple Regression Chapter Table of Contents CREATING THE ANALYSIS...214 ModelInformation...217 SummaryofFit...217 AnalysisofVariance...217 TypeIIITests...218 ParameterEstimates...218 Residuals-by-PredictedPlot...219

More information