sscore July 13, 2010 vector of data tuning constant (see details) fuzz value to avoid division by zero (see details)

Similar documents
Package sscore. R topics documented: June 27, Version Date

Package affyio. January 18, Index 6. CDF file format function

Package affypdnn. January 10, 2019

Microarray Data Analysis (VI) Preprocessing (ii): High-density Oligonucleotide Arrays

affyqcreport: A Package to Generate QC Reports for Affymetrix Array Data

Package makecdfenv. March 13, 2019

Package yaqcaffy. January 19, 2019

Description of gcrma package

Frozen Robust Multi-Array Analysis and the Gene Expression Barcode

Using oligonucleotide microarray reporter sequence information for preprocessing and quality assessment

Package frma. R topics documented: March 8, Version Date Title Frozen RMA and Barcode

Package rmat. November 1, 2018

From raw data to gene annotations

Package mimager. March 7, 2019

affypara: Parallelized preprocessing algorithms for high-density oligonucleotide array data

Analysis of Genomic and Proteomic Data. Practicals. Benjamin Haibe-Kains. February 17, 2005

Package affyplm. January 6, 2019

MiChip. Jonathon Blake. October 30, Introduction 1. 5 Plotting Functions 3. 6 Normalization 3. 7 Writing Output Files 3

Microarray Technology (Affymetrix ) and Analysis. Practicals

Package RPA. R topics documented: September 13, Type Package

Affymetrix GeneChip DNA Analysis Software

Package AffyExpress. October 3, 2013

Package LMGene. R topics documented: December 23, Version Date

Package RLMM. March 7, 2019

Quick Reference Card. GeneChip Sequence Analysis Software 4.1. I. GSEQ Introduction

Applying Data-Driven Normalization Strategies for qpcr Data Using Bioconductor

Package RobLoxBioC. August 3, 2018

Package pcagopromoter

Preprocessing -- examples in microarrays

Package BAC. R topics documented: November 30, 2018

GSEQ Software User s Guide for AccuID TM

Introduction: microarray quality assessment with arrayqualitymetrics

Using FARMS for summarization Using I/NI-calls for gene filtering. Djork-Arné Clevert. Institute of Bioinformatics, Johannes Kepler University Linz

Affymetrix Microarrays

Package PSEA. R topics documented: November 17, Version Date Title Population-Specific Expression Analysis.

Package DFP. February 2, 2018

Preprocessing Affymetrix Data. Educational Materials 2005 R. Irizarry and R. Gentleman Modified 21 November, 2009, M. Morgan

Package spikeli. August 3, 2013

500K Data Analysis Workflow using BRLMM

Package ffpe. October 1, 2018

How to use the rbsurv Package

Package virtualarray

Bioconductor tutorial

Exploratory data analysis for microarrays

Codelink Legacy: the old Codelink class

multiscan: Combining multiple laser scans of microarrays

Sadanandam, et al., A colorectal cancer classification system that Associates cellular phenotype and responses to therapy.

Course on Microarray Gene Expression Analysis

Package macat. October 16, 2018

Package madsim. December 7, 2016

Package SCAN.UPC. October 9, Type Package. Title Single-channel array normalization (SCAN) and University Probability of expression Codes (UPC)

Release Notes. JMP Genomics. Version 4.0

User Guide. Affymetrix GeneChip Sequence Analysis Software. Version For Research Use Only. Not for use in diagnostic procedures.

Microarray Excel Hands-on Workshop Handout

Package splicegear. R topics documented: December 4, Title splicegear Version Author Laurent Gautier

MATERIALS PLUS Segmentation Measurement

Gene Expression an Overview of Problems & Solutions: 1&2. Utah State University Bioinformatics: Problems and Solutions Summer 2006

Parallelized preprocessing algorithms for high-density oligonucleotide arrays

Contents. CRITERION Vantage 3 Analysis Training Manual. Introduction 1. Basic Functionality of CRITERION Analysis 5. Charts and Reports 17

GS Analysis of Microarray Data

Expander Online Documentation

Contents. 2.5 DBSCAN Pair Auto-Correlation Saving your results References... 18

Package CNTools. November 6, 2018

Package OrderedList. December 31, 2017

Package puma. November 15, Type Package

Package EMDomics. November 11, 2018

Package tspair. July 18, 2013

Package cgh. R topics documented: February 19, 2015

/ Computational Genomics. Normalization

Package BrainStars. July 19, 2018

ReadqPCR: Functions to load RT-qPCR data into R

Package corclass. R topics documented: January 20, 2016

The crosshybdetector Package

Alternative CDF environments

Package affxparser. R topics documented: September 29, Version Depends R (>= )

Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients

Package agilp. R topics documented: January 22, 2018

Package st. July 8, 2015

Package TilePlot. February 15, 2013

Introduction to the xps Package: Comparison to Affymetrix Power Tools (APT)

CARMAweb users guide version Johannes Rainer

All About PlexSet Technology Data Analysis in nsolver Software

Package arrayqualitymetrics

DREM. Dynamic Regulatory Events Miner (v1.0.9b) User Manual

Package ENmix. September 17, 2015

Exploring cdna Data. Achim Tresch, Andreas Buness, Tim Beißbarth, Florian Hahne, Wolfgang Huber. June 17, 2005

Package PGSEA. R topics documented: May 4, Type Package Title Parametric Gene Set Enrichment Analysis Version 1.54.

QuickReferenceCard. Axiom TM Analysis Suite - Analyzing your Samples. Setting Up and Running an Analysis

apt-probeset-genotype Manual

Package geecc. October 9, 2015

Package GSRI. March 31, 2019

Working with Affymetrix data: estrogen, a 2x2 factorial design example

Physics MRI Research Centre UNIFIT VERSION User s Guide

Package twilight. August 3, 2013

DCT-BASED IMAGE QUALITY ASSESSMENT FOR MOBILE SYSTEM. Jeoong Sung Park and Tokunbo Ogunfunmi

Package dualks. April 3, 2019

Affymetrix Data Transfer Tool User s Guide Version 1.1

Working with Raster Graphic Images in Origin

srap: Simplified RNA-Seq Analysis Pipeline

Averages and Variation

Transcription:

sscore July 13, 2010 OneStepBiweightAlgorithm One-step Tukey s biweight Computes one-step Tukey s biweight on a vector. that this implementation follows the Affymetrix code, which is different from the Tukey s biweight computed by the affy package. OneStepBiweightAlgorithm(x, c, epsilon) x c epsilon vector of data tuning constant (see details) fuzz value to avoid division by zero (see details) The details can be found in the given reference. A numeric value Auth/support/developer/stat_sdk/STAT_SDK_source.zip Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA, 1

2 SScore SScore Compute S-Score values Computes the S-Score values for a pair of Affymetrix GeneChips SScore(afbatch = stop("no CEL files specified"), classlabel = c(0,1), SF = NULL, SDT = NULL, rm.outliers = TRUE, rm.mask = TRUE, rm.extra = NULL, digits = NULL, verbose = FALSE, celfile.path = NULL, celfile.names = NULL afbatch classlabel SF SDT rm.outliers rm.mask rm.extra digits verbose An AffyBatch object A vector identifying the class for each column of the AffyBatch object a list of Scale Factor (SF) values for each GeneChip a list of Standard Difference Threshold (SDT) values for each GeneChip should the spots marked as OUTLIERS be excluded from S-Score calculation? should the spots marked as MASKS be excluded from S-Score calculation? if TRUE, overrides what is in rm.mask and rm.outliers number of significant digits for S-Score values logical value. If TRUE it provides more detail of the S-Score calculations. celfile.path character denoting the path for the *.CEL files corresponding to afbatch celfile.names optional character vector containing the names of the *.CEL files Computes S-Score values as described by Zhang et al. (2002). SScore provides a simpler interface for comparing only two classes of GeneChips, while SScoreBatch compares multiple pairs of chips. The classlabel consists of a vector with one entry for each column of the AffyBatch object. Each entry consists of a 0 or a 1 to identify the class to which the chip for the corresponding column belongs. SScore will conduct a two-class test comparing all chips labeled 0 to all chips labeled 1. If classlabel is not specified, it defaults to a two-chip comparison, compatible with previous versions of SScore. The SF and SDT factors are required for all calculations. If NULL, these values will be calculated according to the Affymetrix Statistical Algorithms Document. digits allows the specification of the number of significant digits for the S-Score values; if NULL, the maximum number of significant digits are retained. An ExpressionSet with S-Score values in the exprs slot.

SScoreBatch 3 Based on C++ code by Li Zhang and Delphi code by Robnet Kerns Zhang, L., Wang, L., Ravindranathan, A., Miles, M.F. (2002) A new algorithm for analysis of oligonucleotide arrays: application to expression profiling in mouse brain regions. Journal of Molecular Biology, 317(2), pp. 225 35 Kerns, R.T., Zhang, L., Miles, M.F. (2003) Application of the S-score algorithm for analysis of oligonucleotide microarrays. Methods, 31(4), pp. 274 81 See Also SScoreBatch,computeSFandSDT,computeOutlier Examples } if (length(dir(pattern=".cel$"))!= 0) { ## Read in the *.CEL files abatch <- ReadAffy() ## default calling method SScores <- SScore(abatch) ## specifying SF and SDT (gives same results as above) SfSdt <- computesfandsdt(abatch) SScores <- SScore(abatch,SF=SfSdt$SF,SDT=SfSdt$SDT) ## specifying outlier and masked values should be included in calculations SScores <- SScore(abatch,rm.outliers=FALSE,rm.mask=FALSE) ## round results to 3 significant digits SScores <- SScore(abatch,digits=3) ## show verbose output SScores <- SScore(abatch,verbose=TRUE) SScoreBatch Compute S-Score values Computes the S-Score values for multiple pairs of Affymetrix GeneChips

4 SScoreBatch SScoreBatch(afbatch = stop("no CEL files specified"), compare = stop("no list of SF = NULL, SDT = NULL, rm.outliers = TRUE, rm.mask = TRUE, rm.extra digits = NULL, verbose = FALSE, celfile.path = NULL, celfile.names = afbatch compare SF SDT rm.outliers rm.mask rm.extra digits verbose An AffyBatch object A matrix describing which chips to compare a list of Scale Factor (SF) values for each GeneChip a list of Standard Difference Threshold (SDT) values for each GeneChip should the spots marked as OUTLIERS be excluded from S-Score calculation? should the spots marked as MASKS be excluded from S-Score calculation? if TRUE, overrides what is in rm.mask and rm.outliers number of significant digits for S-Score values logical value. If TRUE it provides more detail of the S-Score calculations. celfile.path character denoting the path for the *.CEL files corresponding to afbatch celfile.names optional character vector containing the names of the *.CEL files Computes S-Score values as described by Zhang et al. (2002). SScoreBatch allows comparison of multiple pairs of chips, while SScore provides a simpler interface when comparing only two GeneChips. compare specifies how the pairwise comparisons are performed. It is an N x 2 matrix, where N is the number of pairwise comparisons; each row of the matrix contains index in the AffyBatch object for the chips to be compared. For example, 1 3 4 2 5 9 10 2 5 7 would do a comparison of chip 1 to chip 3, a comparison of chip 4 to chip 2, a comparison of chip 5 to chip 9, and so on. The columns in ExpressionSet correspond to the rows of compare, so that the results of the first comparison are in column 1, the results of the second comparison are in column 2, and so on. The SF and SDT factors are required for all calculations. If NULL, these values will be calculated according to the Affymetrix Statistical Algorithms Document. probes. digits allows the specification of the number of significant digits for the S-Score values; if NULL, the maximum number of significant digits are retained. An ExpressionSet with S-Score values in the exprs slot.

computeaffxrawq 5 Based on C++ code by Li Zhang and Delphi code by Robnet Kerns Zhang, L., Wang, L., Ravindranathan, A., Miles, M.F. (2002) A new algorithm for analysis of oligonucleotide arrays: application to expression profiling in mouse brain regions. Journal of Molecular Biology, 317(2), pp. 225 35 Kerns, R.T., Zhang, L., Miles, M.F. (2003) Application of the S-score algorithm for analysis of oligonucleotide microarrays. Methods, 31(4), pp. 274 81 See Also SScore,computeSFandSDT,computeOutlier Examples } if (length(dir(pattern=".cel$"))!= 0) { ## Read in the *.CEL files abatch <- ReadAffy() ## default calling method SScores <- SScoreBatch(abatch) ## specifying SF and SDT (gives same results as above) SfSdt <- computesfandsdt(abatch) ## specifying outlier and masked values should be included in calculations SScores <- SScoreBatch(abatch,SF=SfSdt$SF,SDT=SfSdt$SDT) SScores <- SScoreBatch(abatch,rm.outliers=FALSE,rm.mask=FALSE) ## round results to 3 significant digits SScores <- SScoreBatch(abatch,digits=3) ## show verbose output SScores <- SScoreBatch(abatch,verbose=TRUE) computeaffxrawq Computes RawQ using affxparser routines Computes the RawQ value of a single Affymetrix GeneChip

6 computeaffxsfandsdt computeaffxrawq(intensity, stdvs, pixels, probe.index, probe.zoneid, bgcells, Nu intensity vector of intensities for the GeneChip stdvs matrix of standard deviations for probe intensities from *.CEL file pixels matrix of number of pixels for probe intensities from *.CEL file probe.index vector of indices for each probe probe.zoneid vector of zone ID numbers for each probe bgcells number of background cells for the GeneChip NumberZones number of zones on the GeneChip Calculates RawQ using the algorithms described in the Affymetrix Statistical Algorithms Document for a single GeneChip. This is an internal function that will generally not be accessed directly. the RawQ value for the given array Auth/support/developer/stat_sdk/STAT_SDK_source.zip, as well as C++ code by Li Zhang and Delphi code by Robnet Kerns Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA, computeaffxsfandsdt Compute SF and SDT values using affxparser routines Computes the scaling factor (SF) and statistical difference threshold (SDT) values of Affymetrix GeneChips, for use in calculating S-Score values

computeaffxsfandsdt 7 computeaffxsfandsdt(afbatch, stdvs, pixels, TGT = 500, digits = NULL, verbose = afbatch stdvs pixels TGT digits An AffyBatch object matrix of standard deviations for probe intensities from *.CEL file matrix of number of pixels for probe intensities from *.CEL file the target intensity to which the arrays should be scaled number of significant digits for SF and SDT values verbose logical value. If TRUE it provides more detail of the SF and SDT calculations. plot.histogram logical value. if TRUE it plots a histogram of intensities Calculates SF and SDT factors using the algorithms described in the Affymetrix Statistical Algorithms Document. The SF and SDT may be used in the calculation of S-Score values, or may be useful in their own right. One SF and SDT value is calculated for each GeneChip, which are arranged in the same order as the columns in the AffyBatch object. computesfandsdt returns a list containing the following components: SF SDT SF values, one for each GeneChip SDT values, one for each GeneChip Auth/support/developer/stat_sdk/STAT_SDK_source.zip, as well as C++ code by Li Zhang and Delphi code by Robnet Kerns Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA, Examples if (length(dir(pattern=".cel$"))!= 0) { ## Read in the *.CEL files abatch <- ReadAffy() ## compute SF and SDT

8 computeoutlier } SfSdt <- computesfandsdt(abatch) ## show verbose output SfSdt <- computesfandsdt(abatch,verbose=true) ## plot PM and MM histograms for each *.CEL file SfSdt <- computesfandsdt(abatch,plot.histogram=true) computeoutlier Compute outlier probes Computes outlier and/or mask probes for a set of Affymetrix GeneChips that will be excluded from S-Score calculations computeoutlier(afbatch, rm.mask = TRUE, rm.outliers = TRUE, rm.extra = TRUE, cel afbatch rm.mask rm.outliers rm.extra An AffyBatch object should the spots marked as MASKS be excluded from S-Score calculation? should the spots marked as OUTLIERS be excluded from S-Score calculation? if TRUE, overrides what is in rm.mask and rm.outliers celfile.path character denoting the path for the *.CEL files corresponding to afbatch celfile.names optional character vector containing the names of the *.CEL files Computes the outlier and / or mask probes for an AffyBatch object. These are returned in matrix form, with one probe per row and one chip per column. The value of each location in the matrix will be TRUE if the corresponding probe is an outlier / masked value and FALSE if it is not. The options may be set to exclude only outlier values, only mask values, or both. The probes are be arranged in the same row order as the intensity values. that this function assumes the *.CEL files are still available in the directory given by celfile.path (or the current directory if celfile.path is not specified). The *.CEL names are given by celfile.names. If celfile.names is not specified, the sample names from the AffyBatch object will be used. a matrix containing the list of outliers / masked values for the given AffyBatch object. Based on C++ code by Li Zhang and Delphi code by Robnet Kerns

computerawq 9 Examples } if (length(dir(pattern=".cel$"))!= 0) { abatch <- ReadAffy() outlier <- computeoutlier(abatch) computerawq Computes RawQ Computes the RawQ value of a single Affymetrix GeneChip computerawq(fname, intensity, probe.index, probe.zoneid, bgcells, NumberZones, c fname intensity probe.index character string with the filename of the GeneChip vector of intensities for the GeneChip vector of indices for each probe probe.zoneid vector of zone ID numbers for each probe bgcells NumberZones number of background cells for the GeneChip number of zones on the GeneChip celfile.path character denoting the path for the *.CEL files specified in fname Calculates RawQ using the algorithms described in the Affymetrix Statistical Algorithms Document for a single GeneChip. This is an internal function that will generally not be accessed directly. the RawQ value for the given array Auth/support/developer/stat_sdk/STAT_SDK_source.zip, as well as C++ code by Li Zhang and Delphi code by Robnet Kerns

10 computesfandsdt Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA, computesfandsdt Compute SF and SDT values Computes the scaling factor (SF) and statistical difference threshold (SDT) values of Affymetrix GeneChips, for use in calculating S-Score values computesfandsdt(afbatch, TGT = 500, digits = NULL, verbose = FALSE, plot.histogr afbatch TGT digits An AffyBatch object the target intensity to which the arrays should be scaled number of significant digits for SF and SDT values verbose logical value. If TRUE it provides more detail of the SF and SDT calculations. plot.histogram logical value. if TRUE it plots a histogram of intensities celfile.path character denoting the path for the *.CEL files corresponding to afbatch Calculates SF and SDT factors using the algorithms described in the Affymetrix Statistical Algorithms Document. The SF and SDT may be used in the calculation of S-Score values, or may be useful in their own right. One SF and SDT value is calculated for each GeneChip, which are arranged in the same order as the columns in the AffyBatch object. computesfandsdt returns a list containing the following components: SF SDT SF values, one for each GeneChip SDT values, one for each GeneChip Auth/support/developer/stat_sdk/STAT_SDK_source.zip, as well as C++ code by Li Zhang and Delphi code by Robnet Kerns

computezoneiinfo 11 Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA, Examples } if (length(dir(pattern=".cel$"))!= 0) { ## Read in the *.CEL files abatch <- ReadAffy() ## compute SF and SDT SfSdt <- computesfandsdt(abatch) ## show verbose output SfSdt <- computesfandsdt(abatch,verbose=true) ## plot PM and MM histograms for each *.CEL file SfSdt <- computesfandsdt(abatch,plot.histogram=true) computezoneiinfo Compute zone background and noise Computes the background and noise for a given zone of a single Affymetrix GeneChip computezoneiinfo(zoneinfo, NumberBGCells) ZoneInfo vector of intensities in a given zone NumberBGCells number of background cells for the GeneChip Calculates background and noise for a zone using the algorithms described in the Affymetrix Statistical Algorithms Document. This is an internal function that will generally not be accessed directly. computezoneiinfo returns a list containing the following components: background noise background value for the given zone noise value for the given zone

12 computezonenoise Auth/support/developer/stat_sdk/STAT_SDK_source.zip, as well as C++ code by Li Zhang and Delphi code by Robnet Kerns Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA, computezonenoise Computes zone noise Computes the noise (average standard error) of the probe intensities for a single Affymetrix GeneChip computezonenoise(index, intensity, stdv, npixels, bgcells) index intensity stdv npixels bgcells vector of indices for probes in the given zone vector of intensities for the GeneChip vector of standard deviations for the GeneChip vector containing number of pixels for each probe of the GeneChip number of background cells on the GeneChip Calculates the noise (average standard error) of the probes in a given zone, using the algorithms described in the Affymetrix Statistical Algorithms Document, for a single GeneChip. This is an internal function that will generally not be accessed directly. the noise of the probes for the given array Auth/support/developer/stat_sdk/STAT_SDK_source.zip, as well as C++ code by Li Zhang and Delphi code by Robnet Kerns

trimmean 13 Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA, trimmean Compute trimmed mean for a vector Computes the trimmed mean for a vector. that this implementation follows the Affymetrix code, which gives different results than the standard R function mean(). trimmean(vec, p1, p2) vec p1 p2 vector of values lower percentage for trimming upper percentage for trimming The details can be found in the given reference. A numeric value Auth/support/developer/stat_sdk/STAT_SDK_source.zip Affymetrix (2002) Statistical Algorithms Document, Affymetrix Inc., Santa Clara, CA,

Index Topic manip computeaffxrawq, 5 computeaffxsfandsdt, 6 computeoutlier, 8 computerawq, 9 computesfandsdt, 10 computezoneiinfo, 11 computezonenoise, 12 OneStepBiweightAlgorithm, 1 SScore, 2 SScoreBatch, 3 Topic univar trimmean, 13 computeaffxrawq, 5 computeaffxsfandsdt, 6 computeoutlier, 3, 5, 8 computerawq, 9 computesfandsdt, 3, 5, 10 computezoneiinfo, 11 computezonenoise, 12 OneStepBiweightAlgorithm, 1 SScore, 2, 5 SScoreBatch, 3, 3 trimmean, 13 14