Package ALS. August 3, 2015

Similar documents
SOFTWARE INSTRUCTIONS

Analytica Chimica Acta 595 (2007) R. Tauler

The User s Guide to. MCRC software

Package mpm. February 20, 2015

Package feature. R topics documented: October 26, Version Date

Building Agilent GC/MSD Deconvolution Reporting Libraries for Any Application Technical Overview

Package feature. R topics documented: July 8, Version Date 2013/07/08

Package SSLASSO. August 28, 2018

Package kirby21.base

Package HiResTEC. August 7, 2018

Package pairsd3. R topics documented: August 29, Title D3 Scatterplot Matrices Version 0.1.0

Package cellvolumedist

Package munfold. R topics documented: February 8, Type Package. Title Metric Unfolding. Version Date Author Martin Elff

Package StVAR. February 11, 2017

Package CorrectOverloadedPeaks

Package IntNMF. R topics documented: July 19, 2018

Package mixphm. July 23, 2015

Package Metab. September 18, 2018

Package lspls. July 27, 2018

Package corclass. R topics documented: January 20, 2016

Package MultiRR. October 21, 2015

Package OLScurve. August 29, 2016

Package biglars. February 19, 2015

Package ssmn. R topics documented: August 9, Type Package. Title Skew Scale Mixtures of Normal Distributions. Version 1.1.

Package svapls. February 20, 2015

Package readmzxmldata

Package gibbs.met. February 19, 2015

Package assortnet. January 18, 2016

Package GLDreg. February 28, 2017

Package datasets.load

Package crmn. February 19, 2015

Package comphclust. February 15, 2013

Package cwm. R topics documented: February 19, 2015

Package comphclust. May 4, 2017

Package robustreg. R topics documented: April 27, Version Date Title Robust Regression Functions

Package fso. February 19, 2015

SELECTION OF A MULTIVARIATE CALIBRATION METHOD

Package clustering.sc.dp

Package lvm4net. R topics documented: August 29, Title Latent Variable Models for Networks

Package RCA. R topics documented: February 29, 2016

Package Mondrian. R topics documented: March 4, Type Package

Package CALIBERrfimpute

Package areaplot. October 18, 2017

Package JGEE. November 18, 2015

Package apastyle. March 29, 2017

Package MatrixCorrelation

Package DRaWR. February 5, 2016

Package oaxaca. January 3, Index 11

Package restlos. June 18, 2013

Package gee. June 29, 2015

Package hbm. February 20, 2015

Package Numero. November 24, 2018

Package SiZer. February 19, 2015

Package NNLM. June 18, 2018

Package parcor. February 20, 2015

Package JOP. February 19, 2015

STENO Introductory R-Workshop: Loading a Data Set Tommi Suvitaival, Steno Diabetes Center June 11, 2015

Package ParetoPosStable

EXST 7014, Lab 1: Review of R Programming Basics and Simple Linear Regression

Package MixSim. April 29, 2017

Package EnQuireR. R topics documented: February 19, Type Package Title A package dedicated to questionnaires Version 0.

Defining the Global Error of a Multi-Axis Vibration Test: An Application of MIL-STD-810G Method 527 Annex C

Package gwrr. February 20, 2015

Package RSpectra. May 23, 2018

Package gsalib. R topics documented: February 20, Type Package. Title Utility Functions For GATK. Version 2.1.

Package weco. May 4, 2018

Multivariate Calibration Quick Guide

Package SAM. March 12, 2019

Package pampe. R topics documented: November 7, 2015

Package desirability

Package rknn. June 9, 2015

Package TilePlot. February 15, 2013

Package DTRlearn. April 6, 2018

Package biosigner. March 6, 2019

Package Combine. R topics documented: September 4, Type Package Title Game-Theoretic Probability Combination Version 1.

Package mmtsne. July 28, 2017

Package slp. August 29, 2016

Package rpst. June 6, 2017

Package spark. July 21, 2017

Package sfc. August 29, 2016

Package subplex. April 5, 2018

Package gridgraphics

Package ConvergenceClubs

Package nprotreg. October 14, 2018

Package RPMM. August 10, 2010

Package intccr. September 12, 2017

Package mrm. December 27, 2016

Package OptimaRegion

High-throughput Alternating Least Squares (ALS) with the alsace package

Introduction to Multivariate Image Analysis (MIA) Table of Contents

Package ClustVarLV. December 14, 2016

Package MSGLasso. November 9, 2016

Package gplm. August 29, 2016

Package ADMM. May 29, 2018

Package rplotengine. R topics documented: August 8, 2018

Package dissutils. August 29, 2016

Package DBKGrad. R topics documented: December 2, 2018

Package MFDFA. April 18, 2018

Package lga. R topics documented: February 20, 2015

Package MeanShift. R topics documented: August 29, 2016

Transcription:

Type Package Package ALS August 3, 2015 Title Multivariate Curve Resolution Alternating Least Squares (MCR-ALS) Version 0.0.6 Author Katharine M. Mullen Maintainer Katharine Mullen <mullenkate@gmail.com> Depends nnls (>= 1.1), Iso, R (>= 2.10) Alternating least squares is often used to resolve components contributing to data with a bilinear structure; the basic technique may be extended to alternating constrained least squares. Commonly applied constraints include unimodality, non-negativity, and normalization of components. Several data matrices may be decomposed simultaneously by assuming that one of the two matrices in the bilinear decomposition is shared between datasets. License GPL (>= 2) Repository CRAN Date/Publication 2015-08-03 11:44:35 NeedsCompilation no R topics documented: als.............................................. 2 matchfactor......................................... 5 multiex........................................... 6 multiex1........................................... 6 plots............................................. 7 Index 9 1

2 als als alternating least squares multivariate curve resolution (MCR-ALS) This is an implementation of alternating least squares multivariate curve resolution (MCR-ALS). Given a dataset in matrix form d1, the dataset is decomposed as d1=c %*% t(s) where the columns of C and S represent components contributing to the data in each of the 2-ways that the matrix is resolved. In forming the decomposition, the components in each way many be constrained with e.g., non-negativity, uni-modality, selectivity, normalization of S and closure of C. Note that if more than one dataset is to be analyzed simultaneously, then the matrix S is assumed to be the same for every dataset in the bilinear decomposition of each dataset into matrices C and S. als(clist, PsiList, S=matrix(), WList=list(), thresh =.001, maxiter=100, forcemaxiter = FALSE, opts1st=true, x=1:nrow(clist[[1]]), x2=1:nrow(s), baseline=false, fixed=vector("list", length(psilist)), unic=false, unis=false, nonnegc = TRUE, nonnegs = TRUE, norms=0, closurec=list()) Arguments CList PsiList S WList thresh maxiter forcemaxiter opts1st x x2 list with the same length as PsiList where each element is a matrix of dimension m by comp and represents the matrix C for each dataset list of datasets, where each dataset is a matrix of dimension m by n matrix with n rows and comp columns, often representing (mass) spectra An optional list with the same length as PsiList, where each element is a matrix of dimension m by n giving the weight of that datapoint; note that if closure or normalization constraints are applied, then both are applied after the application of weights. numeric value that defaults to.001; if ((oldrss - rss) / oldrss) < thresh then the optimization stops, where oldrss is the residual sum of squares at iteration x-1 and rss is the residual sum of squares at iteration x The maximum number of iterations to perform (where an iteration is optimization of either AList and C) Logical indicating whether maxiter iterations should be performed even if the residual difference drops below thresh. logical indicating whether the first constrained least squares regression should estimate S or CList. optional vector of labels for the rows of C, which are used in the application of unimodality constraints. optional vector of labels for the rows of S, which are used in the application of unimodality constraints.

als 3 baseline fixed nonnegs nonnegc unic unis norms closurec logical indicating whether a baseline component is present; if baseline=true then this component is exempt from constraints unimodality or non-negativity list with the same length as PsiList in which each element is a vector of the indices of the components to fix to zero in each dataset logical indicating whether the components (columns) of the matrix S should be constrained to non-negative values logical indicating whether the components (columns) of the matrix C should be constrained to non-negative values logical indicating whether unimodality constraints should be applied to the columns of C logical indicating whether unimodality constraints should be applied to the columns of S numeric indicating whether the spectra are normalized; if norms>0, the spectra are normalized. If norms==1 the maximum of the spectrum of each component is constrained to be equal to one; if norms > 0 && norms!=1 then the norm of the spectrum of each component is constrained to be equal to one. list; if the length is zero, then no closure constraints are applied. If the length is not zero, it should be equal to the number of datasets in the analysis, and contain numeric vectors consisting of the desired value of the sum of each row of the concentration matrix. Value A list with components: CList S rss resid iter A list with the same length as the number of datasets, containing the optimized matrix C at termination scaled by the optimized amplitudes for that dataset from AList. The matrix S given as input. The residual sum of squares at termination. A list with the same length as the number of datasets, containing the residual matrix for each dataset The number of iterations performed before termination. Note This function was used to solve problems described in van Stokkum IHM, Mullen KM, Mihaleva VV. Global analysis of multiple gas chromatographymass spectrometry (GS/MS) data sets: A method for resolution of co-eluting components with comparison to MCR-ALS. Chemometrics and Intelligent Laboratory Systems 2009; 95(2): 150-163. in conjunction with the package TIMP. For the code to reproduce the examples in this paper, see examples_chemo.zip included in the inst directory of the package source code..

4 als References Garrido M, Rius FX, Larrechi MS. Multivariate curve resolution alternating least squares (MCR- ALS) applied to spectroscopic data from monitoring chemical reactions processes. Journal Analytical and Bioanalytical Chemistry 2008; 390:2059-2066. Jonsson P, Johansson A, Gullberg J, Trygg J, A J, Grung B, Marklund S, Sjostrom M, Antti H, Moritz T. High-throughput data analysis for detecting and identifying differences between samples in GC/MS-based metabolomic analyses. Analytical Chemistry 2005; 77:5635-5642. Tauler R. Multivariate curve resolution applied to second order data. Chemometrics and Intelligent Laboratory Systems 1995; 30:133-146. Tauler R, Smilde A, Kowalski B. Selectivity, local rank, three-way data analysis and ambiguity in multivariate curve resolution. Journal of Chemometrics 1995; 9:31-58. See Also matchfactor,multiex,multiex1, plots Examples ## load 2 matrix datasets into variables d1 and d2 ## load starting values for elution profiles ## into variables Cstart1 and Cstart2 ## load time labels as x, m/z values as x2 data(multiex) ## starting values for elution profiles matplot(x,cstart1,type="l") matplot(x,cstart2,type="l",add=true) ## using MCR-ALS, improve estimates for mass spectra S and the two ## matrices of elution profiles ## apply unimodality constraints to the elution profile estimates ## note that the starting estimates for S just contain a dummy matrix test0 <- als(clist=list(cstart1,cstart2),s=matrix(1,nrow=400,ncol=2), PsiList=list(d1,d2), x=x, x2=x2, unic=true, norms=0) ## plot the estimated mass spectra plots(test0$s,x2) ## the known mass spectra are contained in the variable S ## can compare the matching factor of each estimated spectrum to ## that in S matchfactor(s[,1],test0$s[,1]) matchfactor(s[,2],test0$s[,2]) ## plot the estimated elution profiles ## this shows the relative abundance of the 2nd component is low matplot(x,test0$clist[[1]],type="l") matplot(x,test0$clist[[2]],type="l",add=true)

matchfactor 5 matchfactor Matching factor functions to describe similarity of two vectors Matching factor functions to describe similarity of two vectors. This function may be useful to match an estimated mass spectrum against mass spectra of known compounds, in order to identify the compound represented by the estimated mass spectrum. matchfactor(u, s, type="dot") Arguments u s type numeric vector of length n numeric vector of length n character vector describing the matching factor function to apply; the choices are "dot" for the normalized dot product or "euclid" for the normalized euclidean distance. Value numeric between 0 and 1 representing the matching factor; vectors that are more similar have a larger matching factor. Note that if both u and s are all zero, we let the matching factor be 1; if one and only one of u and s are all zero, we let the matching factor be 0. Author(s) Katharine M. Mullen References Alfassi ZB. On the normalization of a mass spectrum for comparison of two spectra. Journal of the American Society for Mass Spectrometry 2004; 15:385-387. Stein SE, Scott DR. Optimization and testing of mass spectral library search algorithms for compound identication. Journal of the American Society for Mass Spectrometry 1994; 5:859-866. See Also als

6 multiex1 multiex Data inspired by GC mass spectrometry experiments Data inspired by GC mass spectrometry experiments. data("multiex") Format d1 and d2 are matrices of dimension 80 by 400 representing time and m/z resolved data. x and x2 represent the 80 times and 400 m/z values represented by the data, respectively. Cstart1 and Cstart2 are matrices of dimension 80 by 2, representing starting values for elution profiles. S represents mass spectra known to be represented in the data, as a 400 by 2 matrix. Examples data("multiex") ## mass spectra in the data plots(s,x2) ## starting values for elution profiles matplot(x,cstart1,type="l") matplot(x,cstart2,type="l",add=true) multiex1 Data inspired by GC mass spectrometry experiments Data inspired by GC mass spectrometry experiments. data("multiex1")

plots 7 Format PsiList is a list of 15 matrices of dimension 81 by 165, each representing time and m/z resolved data. WList is a list of 15 matrices of dimension 81 by 165, in which each point is a weight to be applied to a given data point. xm and xm2 represent the 81 times and 165 m/z values represented by each dataset in PsiList, respectively. AList is a list of length 15, the elements of which represent estimates for the amplitude of each component in each of the 15 datasets. See Also C1 is a 81 by 2 matrix representing a starting value for the shape of the elution profiles. Sm represents mass spectra known to be represented in the data, as a 165 by 2 matrix. als Examples data("multiex1") ## mass spectra in the data plots(sm,xm2) plots Plots a matrix representing mass spectra For each column in a matrix representing mass spectra, generates a sub-plot plots(s, x2, out="", filename=paste("s.", out, sep = ""), col=vector(),cex=1, lab="",cex.lab=1) Arguments S x2 out filename col matrix representing mass spectra of dimension n by comp where comp is the number of spectra vector of masses that label the rows of S if "", the plot is written to the current device; if "ps" a postscript file is written and if "pdf" then a pdf file is written character vector specifying the name of the file to write if out=true if length is greater than zero, then the color to plot each spectrum

8 plots cex lab cex.lab cex A numerical value giving the amount by which plotting text and symbols should be magnified relative to the default. cex.lab A character vector used as y-axis label. cex.lab A numerical value giving magnification to be used for x and y labels relative to the default. Author(s) See Also Katharine M. Mullen als Examples ## load example mass spectra S and vector of m/z values x2 data(multiex) plots(s,x2)

Index Topic datasets multiex, 6 multiex1, 6 Topic hplot matchfactor, 5 plots, 7 Topic optimize als, 2 AList (multiex1), 6 als, 2, 5, 7, 8 C1 (multiex1), 6 Cstart1 (multiex), 6 Cstart2 (multiex), 6 d1 (multiex), 6 d2 (multiex), 6 matchfactor, 4, 5 multiex, 4, 6 multiex1, 4, 6 plots, 4, 7 PsiList (multiex1), 6 S (multiex), 6 Sm (multiex1), 6 WList (multiex1), 6 x (multiex), 6 x2 (multiex), 6 xm (multiex1), 6 xm2 (multiex1), 6 9