Package flowtype. R topics documented: November 27, Type Package Title Phenotyping Flow Cytometry Assays Version 2.16.

Similar documents
Projection-based Gating Strategy Optimization for Flow and Mass Cytometry

Package GateFinder. March 17, 2019

Package SamSPECTRAL. January 23, 2019

Package flowmerge. February 8, 2018

Merging Mixture Components for Cell Population Identification in Flow Cytometry Data The flowmerge package

Package flowdensity. June 28, 2018

Package diffcyt. December 26, 2018

Package flowfp. July 21, Imports Biobase, BiocGenerics (>= 0.1.6), flowcore, flowviz, graphics,grdevices, methods, stats, stats4

flowcore: data structures package for the analysis of flow cytometry data

Package spade. R topics documented:

Robust Model-based Clustering of Flow Cytometry Data The flowclust package

Package ordinalclust

Getting started with flowstats

Package flowqb. October 7, 2014

Package Binarize. February 6, 2017

Package QUALIFIER. March 26, Imports MASS,hwriter,RSVGTipsDevice,lattice,stats4,flowCore,flowViz,methods,flowWorkspace,reshape

flowtrans: A Package for Optimizing Data Transformations for Flow Cytometry

flowmatch: Cell population matching and meta-clustering in Flow Cytometry

Critical Assessment of Automated Flow Cytometry Data Analysis Techniques

Package BiocNeighbors

Package corclass. R topics documented: January 20, 2016

Package flowchic. R topics documented: October 16, Encoding UTF-8 Type Package

Package QUALIFIER. R topics documented: September 26, Type Package

Automated data analysis of Flow Cytometry (FCM) data

Package ggcyto. October 12, 2016

Package Risa. November 28, 2017

Misty Mountain A Parallel Clustering Method. Application to Fast Unsupervised Flow Cytometry Gating

Package BDMMAcorrect

Package ibbig. R topics documented: December 24, 2018

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2.

Package biosigner. March 6, 2019

Package densityclust

Package Numero. November 24, 2018

SPADE: Spanning Tree Progression of Density Normalized Events

Package scmap. March 29, 2019

Package SC3. November 27, 2017

Package flsa. February 19, 2015

Package Modeler. May 18, 2018

Package HTSFilter. November 30, 2017

Package SC3. September 29, 2018

Package EFS. R topics documented:

MERGE FILES TUTORIAL. Version 1.4

Package BigVAR. R topics documented: April 3, Type Package

ncdfflow: Provides netcdf storage based methods and functions for manipulation of flow cytometry data

Package ConsensusClusterPlus

Package ffpe. October 1, 2018

Package CONOR. August 29, 2013

Multivariate analyses in ecology. Cluster (part 2) Ordination (part 1 & 2)

Package gpart. November 19, 2018

Package pcagopromoter

Package ggcyto. April 14, 2017

Package MaxContrastProjection

A guide to use the flowpeaks: a flow cytometry data clustering algorithm via K-means and density peak finding

Package SAM. March 12, 2019

Package DFP. February 2, 2018

Package FunciSNP. November 16, 2018

Package PSEA. R topics documented: November 17, Version Date Title Population-Specific Expression Analysis.

Package rqt. November 21, 2017

Package hpoplot. R topics documented: December 14, 2015

Package twilight. August 3, 2013

Package logicfs. R topics documented:

Package geogrid. August 19, 2018

Package rknn. June 9, 2015

Bioconductor s tspair package

SWIFT: SCALABLE WEIGHTED ITERATIVE FLOW-CLUSTERING TECHNIQUE

Package cgh. R topics documented: February 19, 2015

How to use the SimpleCOMPASS Interface

Package ClusterSignificance

Package maxent. July 2, 2014

Package inversion. R topics documented: July 18, Type Package. Title Inversions in genotype data. Version

Package sigqc. June 13, 2018

Package bnbc. R topics documented: April 1, Version 1.4.0

Package TVsMiss. April 5, 2018

Package fbroc. August 29, 2016

RCASPAR: A package for survival time analysis using high-dimensional data

Cytometry Data Analysis in FlowJo V10. Timothy Quinn Crawford, PhD Application Scientist FlowJo, LLC

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Package CytoML. October 15, 2018

Package FlowSOM. April 14, 2017

Feature-based Comparison of Flow Cytometry Data

Package makecdfenv. March 13, 2019

Package EnQuireR. R topics documented: February 19, Type Package Title A package dedicated to questionnaires Version 0.

Package longclust. March 18, 2018

Package dmrseq. September 14, 2018

Package wskm. July 8, 2015

Package Linnorm. October 12, 2016

Package OutlierDC. R topics documented: February 19, 2015

Package PrivateLR. March 20, 2018

Package strat. November 23, 2016

Package PedCNV. February 19, 2015

Package mirnapath. July 18, 2013

Package cwm. R topics documented: February 19, 2015

Package optimus. March 24, 2017

Package BoolFilter. January 9, 2017

Package clusterseq. R topics documented: June 13, Type Package

Package procoil. R topics documented: April 10, Type Package

Midterm Exam 2B Answer key

Package ipft. January 4, 2018

Package RTNduals. R topics documented: March 7, Type Package

Package FisherEM. February 19, 2015

Transcription:

Type Package Title Phenotyping Flow Cytometry Assays Version 2.16.0 Date 2014-11-26 Package flowtype November 27, 2017 Author Nima Aghaeepour, Kieran O'Neill, Adrin Jalali Maintainer Nima Aghaeepour <naghaeep@gmail.com> Phenotyping Flow Cytometry Assays using multidimentional expansion of single dimentional partitions. Imports Biobase, graphics, grdevices, methods, flowcore, flowmeans, sfsmisc, rrcov, flowclust, flowmerge, stats Depends R (>= 2.10), Rcpp (>= 0.10.4), BH (>= 1.51.0-3) LinkingTo Rcpp, BH Suggests xtable biocviews FlowCytometry License Artistic-2.0 LazyLoad yes NeedsCompilation yes R topics documented: flowtype-package...................................... 2 calcmemuse........................................ 3 calcnumpops........................................ 4 decodephenotype...................................... 5 DLBCLExample...................................... 6 encodephenotype...................................... 6 flowtype.......................................... 7 getlabels.......................................... 9 HIVData........................................... 10 HIVMetaData........................................ 10 Phenotypes-class...................................... 11 plot............................................. 12 summary-methods...................................... 12 Index 14 1

2 flowtype-package flowtype-package flowtype: Phenotyping Flow Cytometry Assays Details flowtype uses a simple threshold, Kmeans, flowmeans, or flowclust to partition every channel to a positive and a negative cell population. These partitions are then combined to generate a set of multi-dimensional phenotypes. Package: flowtype Type: Package Version: 0.0.1 Date: 2011-04-27 License: Artistic-2.0 LazyLoad: yes Depends: methods For a given FCS file, the flowtype function extracts a the phenotypes and reports their cell frequencies (number of cells) and mean fluorescence intensity (MFI)s. Nima Aghaeepour, Kieran O Neill, Adrin Jalali References Please cite the following for the current version of flowtype: O Neill K, Jalali A, Aghaeepour N, Hoos H, Brinkman RR. Enhanced flowtype/rchyoptimyx: a BioConductor pipeline for discovery in high-dimensional cytometry data. Bioinformatics. 2014 May 1;30(9):1329-30. doi: 10.1093/bioinformatics/btt770 The original paper and description can be found at: Nima Aghaeepour, Pratip K. Chattopadhyay, Anuradha Ganesan, Kieran O Neill, Habil Zare, Adrin Jalali, Holger H. Hoos, Mario Roederer, and Ryan R. Brinkman. Early Immunologic Correlates of HIV Protection can be Identified from Computational Analysis of Complex Multivariate T-cell Flow Cytometry Assays. Bioinformatics, 2011. #Load the library library(flowtype) data(dlbclexample) MarkerNames <- c('time', 'FSC-A','FSC-H','SSC-A','IgG','CD38','CD19','CD3','CD27','CD20', 'NA', 'NA') #These markers will be analyzed PropMarkers <- 3:5 MFIMarkers <- PropMarkers MarkerNames <- c('fs', 'SS','CD3','CD5','CD19')

calcmemuse 3 #Run flowtype Res <- flowtype(dlbclexample, PropMarkers, MFIMarkers, 'kmeans', MarkerNames); MFIs=Res@MFIs; Proportions=Res@CellFreqs; Proportions <- Proportions / max(proportions) names(proportions) <- unlist(lapply(res@phenocodes, function(x){return(decodephenotype( x,res@markernames[propmarkers], Res@PartitionsPerMarker))})) #Select the 30 largest phenotypes index=order(proportions,decreasing=true)[1:30] bp=barplot(proportions[index], axes=false, names.arg=false) text(bp+0.2, par("usr")[3]+0.02, srt = 90, adj = 0, labels = names(proportions[index]), xpd = TRUE, cex=0.8) axis(2); axis(1, at=bp, labels=false); title(xlab='phenotype Names', ylab='cell Proportion') #These phenotype can be analyzed using a predictive model (e.g., classification or regression) calcmemuse Function calcmemuse in Package flowtype Estimates the memory usage in bytes for running flowtype with a given set of parameters. calcmemuse(numpops, NumPropMarkers, NumMFIMarkers, NumCells, MaxMarkersPerPop, PartitionsPerChann Arguments NumPops Number of cell types which will be returned. Can be computed using calcnumpops NumPropMarkers Numer of markers to use for combinatorial gating NumMFIMarkers Number of markers to determine the MFIs of for every cell type NumCells Number of cells in the flowframe passed to flowtype MaxMarkersPerPop Maximum number of markers to use at once in combinatorial gating (ie all cell types over 1:MaxMarkersPerPop will be counted) PartitionsPerChannel Number of partitions per channel. Details Value If you use different numbers of partitions for different channels, try providing the highest number as PartitionsPerChannel, and expect an over-estimate. Estimated memory use in bytes.

4 calcnumpops Kieran O Neill See Also calcnumpops, flowtype calcnumpops Function calcnumpops in Package flowtype Compute the number of populations that will be produced by running flowtype with a given set of parameters. This is especially useful for estimating memory but is also used internally to determine the size of return objects to pass down to C++. This may also be useful for determining the cutoff for number of markers to use to make phenotypes (in terms of statistical power for later testing). calcnumpops(partitionspermarker, MaxMarkersPerPop) Arguments PartitionsPerMarker Integer vector specifying the number of partitions for each marker, in order. MaxMarkersPerPop Integer speciying the threshold chosen Value Integer specifying the number of populations the given paramters would produce. Kieran O Neill See Also calcmemuse, flowtype calcnumpops(c(2,2,3,2,2,4), 5)

decodephenotype 5 decodephenotype Method decodephenotype in Package flowtype Method to decode phenotypes back to a human-readable string. Details FlowType s encoding is as follows: 0 marker not considered in phenotype (don t care about its value) 1 marker is negative (e.g. CD4-) 2 marker is positive (e.g. CD4+) 3 marker is positive, but brighter than 2 (CD4++) 4 marker is even brighter (CD4+++) etc Note that this encoding system does not allow for "dim" markers dim positivity is denoted by the first level of positivity. Also note that the encoding is performed from the dimmest to the brightest partition, but the location of thresholds will dictate the interpretation of the code. (e.g. if you only set one threshold, but you place it between the positive and the bright population, then both positive and negative events will be considered negative.) Methods signature(pheno.code = "character", marker.names = "character", partitions.per.marker = "numeric In flowtype, phenotypes themselves are represented by codes (e.g. 012) rather than full strings (CD4+CD8-), in order to save memory when a very large number of phenotypes are considered. decodephenotype serves to translate the codes back to a human-readable string. Kieran O Neill See Also encodephenotype, flowtype decodephenotype('1034',c('cd34','cd3','cd45','cd19'), 4) decodephenotype('20013',c('cd34','cd3','cd45','cd19', 'CD20'), 4)

6 encodephenotype DLBCLExample DLBCLExample A flow cytometry sample from a patient with DLBC lymphoma. The full dataset is available through the FlowCAP project (http://flowcap.flowsite.org). data(dlbclexample) Format A flowframe describing expression values of 3 markers and 3796 cells. Each column represents a marker and each row represents a cell. data(dlbclexample) encodephenotype Function encodephenotype in Package flowtype In flowtype, phenotypes themselves are represented by codes (e.g. 012) rather than full strings (CD4+CD8-), in order to save memory when a very large number of phenotypes are considered. encodephenotype serves to translate a human-readable string down to flowtype s internal coded representation. encodephenotype(pheno.string, marker.names) Arguments pheno.string marker.names character vector containing containing the phenotype string to be encoded vector of character vectors each specifying the name of a channel, in order Details FlowType s encoding is as follows: 0 marker not considered in phenotype (don t care about its value) 1 marker is negative (e.g. CD4-) 2 marker is positive (e.g. CD4+) 3 marker is positive, but brighter than 2 (CD4++) 4 marker is even brighter (CD4+++) etc Note that this encoding system does not allow for "dim" markers dim positivity is denoted by the first level of positivity.

flowtype 7 Value Also note that the encoding is performed from the dimmest to the brightest partition, but the location of thresholds will dictate the interpretation of the code. (e.g. if you only set one threshold, but you place it between the positive and the bright population, then both positive and negative events will be considered negative.) Character vector containing the encoded phenotype, with one character per channel. See Also Kieran O Neill decodephenotype, flowtype encodephenotype('cd34++cd3-cd45+++',c('cd34','cd3','cd45','cd19')) flowtype flowtype: Phenotyping Flow Cytometry Assays flowtype uses a simple threshold, Kmeans, flowmeans or flowclust to partition every channel to a positive and a negative cell population. These partitions are then combined to generate a set of multi-dimensional phenotypes. flowtype(frame, PropMarkers=NULL, MFIMarkers=NULL, Methods='kmeans', MarkerNames=NULL, MaxMarkers #If upgrading from flowtype 1.x to 2.x, please check documentation as some arguments have changed s Arguments Frame PropMarkers MFIMarkers Methods A flowframe (after transformation) that is going to be phenotyped. A vector of the indexes or names of the markers to partition to specify phenotypes. If NULL, all markers in the frame will be used. A vector of the indexes or names of the markers for which MFIs must be measured. If NULL, no markers will be used. A single string specifying the method to use to determine thresholds for partitioning of markers. Values can be "kmeans", "flowmeans", "flowclust", or "Thresholds". If "Thresholds" is specified, user-specified thresholds must be provided via the Thresholds parameter. MarkerNames A vector of names for the channels. If NULL, the names in Frame will be used. MaxMarkersPerPop An integer specifying the maximum number of markers to use to define populations (how "deep" to phenotype). This should be less than or equal to PropMarkers. If NULL, will default to the length of PropMarkers.

8 flowtype Value PartitionsPerMarker An integer or vector of integers specifing the number of partitions per marker. If a single integer, this number will be used for all markers. If a vector, the numbers will be matched with PropMarkers in order. Thresholds MemLimit verbose CellFreqs: MFIs: PropMarkers MFIMarkers MarkerNames A list of vectors specifying per-channel thresholds. Each list item corresponds to one marker, and contains the threshold(s) for that marker. If only one vector is provided in the list, then those thresholds will be used for all markers. Otherwise, the list must be of the same length as PropMarkers. Note: if Methods == 'thresholds', then Thresholds must be specified. If not, it is ignored. Memory limit in GB. flowtype will do a sanity check before executing, and if the total size of counts plus MFI values for all populations would exceed MemLimit, will not run. Boolean variable. If TRUE, information about different processing tasks will be printed into the standard output. Object of class "numeric" containing the cell frequencies measured for each phenotype. Phenotype names are assigned as labels. Object of class "matrix" containing the measured MFIs for each phenotype. Phenotype names are assigned as column labels and marker names as row labels. A vector of the indexes or names of the markers for which cell proportions must be measured. A vector of the indexes or names of the markers for which MFIs must be measured. A vector of names for the channels. If NULL, the names provided in Frame will be used. Partitions A matrix where each column shows the partitioning of the respective channel. 1 and 2 correspond to negative and positive, respectively. PhenoCodes Nima Aghaeepour, Kieran O Neill References A vector of strings of length N (the number of markers) for each phenotype measured. For every phenotype, the character corresponding to a given marker can be 0, 1, 2, etc for neutral, negative, positive, bright, etc. See the provided vigenette for more details and examples. Please cite the following for the current version of flowtype: O Neill K, Jalali A, Aghaeepour N, Hoos H, Brinkman RR. Enhanced flowtype/rchyoptimyx: a BioConductor pipeline for discovery in high-dimensional cytometry data. Bioinformatics. 2014 May 1;30(9):1329-30. doi: 10.1093/bioinformatics/btt770 The original paper and description can be found at: Nima Aghaeepour, Pratip K. Chattopadhyay, Anuradha Ganesan, Kieran O Neill, Habil Zare, Adrin Jalali, Holger H. Hoos, Mario Roederer, and Ryan R. Brinkman. Early Immunologic Correlates of HIV Protection can be Identified from Computational Analysis of Complex Multivariate T-cell Flow Cytometry Assays. Bioinformatics, 2011.

getlabels 9 #Load the library library(flowtype) data(dlbclexample) #These markers will be analyzed PropMarkers <- 3:5 MFIMarkers <- PropMarkers MarkerNames <- c('fs', 'SS','CD3','CD5','CD19') #Run flowtype Res <- flowtype(dlbclexample, PropMarkers, MFIMarkers, 'kmeans', MarkerNames); MFIs=Res@MFIs; Proportions=Res@CellFreqs; Proportions <- Proportions / max(proportions) names(proportions) <- unlist(lapply(res@phenocodes, function(x){return(decodephenotype( x,res@markernames[propmarkers], Res@PartitionsPerMarker))})) #Select the 30 largest phenotypes index=order(proportions,decreasing=true)[1:30] bp=barplot(proportions[index], axes=false, names.arg=false) text(bp+0.2, par("usr")[3]+0.02, srt = 90, adj = 0, labels = names(proportions[index]), xpd = TRUE, cex=0.8) axis(2); axis(1, at=bp, labels=false); title(xlab='phenotype Names', ylab='cell Proportion') #These phenotype can be analyzed using a predictive model (e.g., classification or regression) getlabels getlabels: Returns the labels of the cells in a given phenotype. Returns the labels of the cells in a given phenotype in a Phenotypes object. getlabels(phenotypes, PhenotypeNumber) Arguments Phenotypes An object of class Phenotypes as produced by the flowtype function. PhenotypeNumber A numeric or character value representing the phenotypes number of name, respectively. Value Membership Labels: A vector of length of the number of events. 1 and 2 represent the cells that are not-included and included in the phenotype respectively.

10 HIVMetaData Nima Aghaeepour References Nima Aghaeepour, Pratip K. Chattopadhyay, Anuradha Ganesan, Kieran O Neill, Habil Zare, Adrin Jalali, Holger H. Hoos, Mario Roederer, and Ryan R. Brinkman. Early Immunologic Correlates of HIV Protection can be Identified from Computational Analysis of Complex Multivariate T-cell Flow Cytometry Assays. submitted to Bioinformatics, 2011. #See the vigentte HIVData HIVData Format A flow cytometry dataset from a HIV+ patients PBMC by the Scott lab of the Simon Fraser University and the Spina lab of the University of California San Diego. data(hivdata) A flowset describing expression values of 11 markers and 500 cells (sampled uniformly) for 19 HIV+ and 12 normal subjects. data(hivdata) HIVMetaData HIVMetaData Format The meta-data of a flow cytometry dataset from a HIV+ patients PBMC by the Scott lab of the Simon Fraser University and the Spina lab of the University of California San Diego. data(hivmetadata) A matrix describing the FCS filename, patient label (HIV+ or normal) and tube number of every assay.

Phenotypes-class 11 data(hivmetadata) Phenotypes-class Class "Phenotypes" The return data from running flowtype, containing counts Objects from the Class Objects can be created by calls of the form new("phenotypes",...). Slots CellFreqs: Numeric vector containing counts of the number of cells belonging to each cell type. MFIs: Matrix of MFIs, with rows for cell types and columns for markers. PhenoCodes: Vector of character strings representing the codes of each cell type (phenotype). PropMarkers: Numeric vector specifying which markers were used for combinatorial gating. MFIMarkers: Numeric vector specifying for which markers MFIs were computed for each cell type. MarkerNames: A character vector of the names of all markers in the flowframe given Partitions: The first level partitions that each cell in the flowframe belong to in each channel. MaxPopSize: MaxMarkersPerPop PartitionsPerMarker: Vector of number of partitions used for each marker Thresholds: A list of vectors with the calculated thresholds for each marker (if a clustering algorithm was used) or the thresholds provided by the user. See Also flowtype showclass("phenotypes")

12 summary-methods plot Methods for Function plot Methods for function plot ## S4 method for signature 'Phenotypes,flowFrame' plot(x, y,...) ## S4 method for signature 'Phenotypes,numeric' plot(x, y, Frame,...) ## S4 method for signature 'Phenotypes,character' plot(x, y, Frame,...) Arguments x y Frame An object of class Phenotypes as generated by the flowtype package. A flowframe or a numeric/character value representing the phenotype that needs to be plotted depending on the signature of the function A flowframe (might be optional depending on the signature of the function... Extra parameters that will be passed to the generic plot function See Also Nima Aghaeepour <<naghaeep@gmail.com>> flowtype #See the vigentte summary-methods ~~ Methods for Function summary ~~ ~~ Methods for function summary ~~ Methods signature(object = "Phenotypes") Prints basic characterstics of a Phenotypes object. See Also flowtype

summary-methods 13 #See the vigentte

Index Topic FlowCytData flowtype, 7 flowtype-package, 2 getlabels, 9 Topic HIV flowtype, 7 flowtype-package, 2 getlabels, 9 Topic classes Phenotypes-class, 11 Topic classification flowtype, 7 flowtype-package, 2 getlabels, 9 Topic clustering flowtype, 7 flowtype-package, 2 getlabels, 9 Topic datasets DLBCLExample, 6 HIVData, 10 HIVMetaData, 10 Topic print plot, 12 summary-methods, 12 Topic utilities calcmemuse, 3 calcnumpops, 4 decodephenotype, 5 encodephenotype, 6 encodephenotype, 5, 6 encodephenotype,character,character-method (encodephenotype), 6 encodephenotype-methods (encodephenotype), 6 flowtype, 4, 5, 7, 7, 11, 12 flowtype-package, 2 getlabels, 9 HIVData, 10 HIVMetaData, 10 Phenotypes (Phenotypes-class), 11 Phenotypes-class, 11 plot, 12 plot,phenotypes,character (plot), 12 plot,phenotypes,character-method (plot), 12 plot,phenotypes,flowframe (plot), 12 plot,phenotypes,flowframe-method (plot), 12 plot,phenotypes,numeric (plot), 12 plot,phenotypes,numeric-method (plot), 12 summary,phenotypes-method (summary-methods), 12 summary-methods, 12 calcmemuse, 3, 4 calcnumpops, 3, 4, 4 decodephenotype, 5, 7 decodephenotype,character, character,numeric-method (decodephenotype), 5 decodephenotype,character,character,numeric-method (decodephenotype), 5 decodephenotype-methods (decodephenotype), 5 DLBCLExample, 6 14