From raw data to gene annotations
|
|
- Osborne Baldwin
- 5 years ago
- Views:
Transcription
1 From raw data to gene annotations Laurent Gautier (Modified by C. Friis) 1 Process Affymetrix data First of all, you must download data files listed at and place them in a directory. Convenience, we assume that they are located in the directory from where you started R. > working.dir <- "." Replace it by any alternative location you might have chosen for your files. As usual one needs to load packages for the current R session: > library(affy) Welcome to Bioconductor Vignettes contain introductory material. simply type: openvignette() For details on reading vignettes, see the openvignette help page. To view, An introduction to the affy package can guide you a bit through the package. It is distributed with the package itself, or is available on the website for Bioconductor ( select Vignettes in the left frame). 1.1 Make raw data objects We will need an instance of class phenodata (this object contains covariate information for the samples). The function read.phenodata can be used to build such an object from a file. > my.filenames <- list.files(working.dir, "CEL") > my.pheno <- read.phenodata(filename = file.path(working.dir, + "pheno.tab"), header = TRUE) In its current state this function is not very fool-proof when reading data from a file. It is advised to have the name of the raw data files in the covariate data file in order to check that we get things in the right order. > cbind(my.filenames, pdata(my.pheno)) 1
2 my.filenames Filename Condition 1 starv1.cel starv1.cel Starved 2 starv2.cel starv2.cel Starved 3 starv3.cel starv3.cel Starved 4 stim1.cel stim1.cel Stimulated 5 stim2.cel stim2.cel Stimulated 6 stim3.cel stim3.cel Stimulated > my.names.in.pdatafile <- pdata(my.pheno)["filename"] > any(my.filenames!= my.names.in.pdatafile) [1] FALSE Alternatively, you can enter information about the experiments by hand: my.pheno <- read.phenodata(samplenames=my.filenames, widget=true) We need also to read raw data in CEL files and store the information they contain in an instance of class AffyBatch. > abatch <- ReadAffy(filenames = file.path(working.dir, my.filenames), + phenodata = my.pheno) > class(abatch) [1] "AffyBatch" attr(,"package") [1] "affy" 1.2 Pre-process The easiest way to go from raw probe intensities to expression values uses the function expresso: > eset <- expresso(abatch, bg.correct = FALSE, normalize = TRUE, + normalize.method = "qspline", pmcorrect.method = "pmonly", + summary.method = "medianpolish") normalization: qspline PM/MM correction : pmonly expression values: medianpolish normalizing...[1] "samples= 282 k= 5 first= 999" [1] "sampling(array= 1 iter= 1 off= )" [1] "sampling(array= 1 iter= 2 off= )" [1] "sampling(array= 1 iter= 3 off= )" [1] "sampling(array= 1 iter= 4 off= )" [1] "sampling(array= 1 iter= 5 off= )" [1] "sampling(array= 2 iter= 1 off= )" [1] "sampling(array= 2 iter= 2 off= )" [1] "sampling(array= 2 iter= 3 off= )" [1] "sampling(array= 2 iter= 4 off= )" [1] "sampling(array= 2 iter= 5 off= )" [1] "sampling(array= 3 iter= 1 off= )" 2
3 [1] "sampling(array= 3 iter= 2 off= )" [1] "sampling(array= 3 iter= 3 off= )" [1] "sampling(array= 3 iter= 4 off= )" [1] "sampling(array= 3 iter= 5 off= )" [1] "sampling(array= 4 iter= 1 off= )" [1] "sampling(array= 4 iter= 2 off= )" [1] "sampling(array= 4 iter= 3 off= )" [1] "sampling(array= 4 iter= 4 off= )" [1] "sampling(array= 4 iter= 5 off= )" [1] "sampling(array= 5 iter= 1 off= )" [1] "sampling(array= 5 iter= 2 off= )" [1] "sampling(array= 5 iter= 3 off= )" [1] "sampling(array= 5 iter= 4 off= )" [1] "sampling(array= 5 iter= 5 off= )" [1] "sampling(array= 6 iter= 1 off= )" [1] "sampling(array= 6 iter= 2 off= )" [1] "sampling(array= 6 iter= 3 off= )" [1] "sampling(array= 6 iter= 4 off= )" [1] "sampling(array= 6 iter= 5 off= )" done ids to be processed... This may take several minutes... You can choose different pre-processing methods. A point and click way to do it is: eset <- expresso(abatch, widget=true) Now we have expression values stored in an instance of class exprset. 2 cdna array 2.1 Make raw data objects Raw data from cdna arrays can be found in several different formats. The package marrayinput addresses this issue. We use a pre-loaded dataset for the examples. > library(marrayclasses) > data(swirl) 2.2 Pre-process Normalization comes next: > library(marraynorm) Loading required package: stepfun Loading required package: marrayinput > swirl.n <- manormmain(swirl) 3
4 Note that there exist several methods for normalizing your data. Please refer to the documentation in marraynorm if you have interest in the matter. Often people work with log-ratios. Log ratios are stored in the slot mam of an instance of class marraynorm. An instance of class exprset can be built simply, as show below. First, we define an helpful function: > getphenodata.marraynorm <- function(ma) { + mat <- matargets(ma) + dataf <- mainfo(mat) + dataf <- data.frame(labels = malabels(mat), dataf) + names(dataf) <- make.names(names(dataf)) + pd <- new("phenodata", pdata = dataf, varlabels = c(list("labels"), + as.list(names(mainfo(mat))))) + return(pd) + } Then we build the instance of class exprset: > my.m <- mam(swirl.n) > swirl.gnames <- magnames(swirl.n) > swirl.genelabels <- malabels(swirl.gnames) > rownames(my.m) <- swirl.genelabels > my.pheno <- getphenodata.marraynorm(swirl.n) > eset2 <- new("exprset", exprs = my.m, phenodata = my.pheno) The reader can observe that this is a very artificial example. The swirl dataset contains what is called a dye-swap set of experiments. This means that for each sample, the labelling is performed with alternatively with Cy3 and Cy5. The way to build an object of class exprset will depend on your design. Note that in some spotting patterns, the same gene is spotted several times across the chip. This is done to add robustness to the measurement. If we proceed like above, we might have several times the same gene in different rows. This is the case for the swirl dataset. This is not necessarily bad, but one should be aware of it when analyzing the data. Having several measurements for each gene can be seen as very similar in data structure to Affymetrix chips having several probes per gene. One might wish to summarize all the probes intensities for the same gene into one numerical value. The example below shows how to take the mean of the repeated measurement. > swirl.genelabels <- as.factor(swirl.genelabels) > allrows.pos <- seq(along = swirl.genelabels) > genes.index <- tapply(allrows.pos, swirl.genelabels, I) 3 Expressions values A large percentage of microarray data available on the internet, often as supplementary material for published articles, are not raw data but already preprocessed expression values or in the case of two colors arrays ratios or log-ratios between the green and the red channel. This is not a perfect thing, since once 4
5 findings are made with the data one cannot verify with raw data the possibility of experimental artefacts. A instance of class exprset can be build from scratch, or using the function read.exprset. Note that an instance of class exprset can contain expression values, log-ratio of expression values, of whatever measurement one may like to have. 4 From gene to annotation The package annotate provides utilities to query public databases over the internet. First it needs to be loaded. > library(annotate) We leave how to select interesting genes to the reader (and to the other tutorials). To continue with the examples we take three random genes: > set.seed(123) > my.interestgenes <- sample(genenames(eset2), 3) > print(my.interestgenes) [1] "18-B21" "18-H17" "12-N19" Unfortunately the gene names are not GENBANK entries. A query on GEN- BANK returns: > genbank(my.interestgenes) Loading required package: XML [1] "No XML records available for accession number 18-B21%2c18-H17%2c12-N19" NULL Things are a little more complex. Sometimes the gene names are local identifier for the probes on the chip. One needs to map the identifiers on the chips to identifiers on entries in databases. The package AnnBuilder is a package that can build packages, and more specifically annotation package. The procedure can be a little complex, as it requires to install different databases. Interested readers will refer to the documentation contained in the package. Affymetrix chips are commercial chips produced in large quantities, therefore building annotation packages for the popular chip types sold can benefit a very large number of research teams. This is done by Zhang JianHua, and the packages are available on the Bioconductor website (section meta-data). We had an AffyBatch called abatch. The name of the corresponding chip type is: > abatch@cdfname [1] "Hu6800" The annotation package is: > library(hu6800) 5
6 (Other annotation packages are available on the Bioconductor website, or are already available on the CBS computer you are working with). Each annotation package contains several associative arrays (to map an indentifier to its corresponding identifier in an other database). To convert the Affymetrix identifiers into an identifier of interest, the function multiget is handy: > my.genes <- sample(genenames(eset), 3) > my.pubmedids <- multiget(my.genes, envir = hu6800pmid) > print(my.pubmedids) $"X78121_at" [1] " " " " " " " " " " " " " " $"X99586_s_at" [1] " " " " " " " " " " " " [7] " " " " " " " " " " " " [13] " " " " " " " " " " " " [19] " " " " " " $"D17516_at" [1] " " " " " " " " " " " " " " [8] " " " " > my.genbankids <- multiget(my.genes, envir = hu6800accnum) > print(my.genbankids) $"X78121_at" [1] "X78121" $"X99586_s_at" [1] "X99586" $"D17516_at" [1] "D17516" Note that the association is not one to one. One entry in a databank can be associated with several entries in an other databank. Now that we have identifiers associated with the genes of interest, the corresponding information in databases can be queried. If one wishes to query databases over the internet, functions exist to do so and display the result in a web browser: for (i in seq(along=my.genbankids)) { current.gene <- names(my.genbankids)[i] readline(paste(current.gene, "- press enter to continue.")) genbank(my.genbankids, disp="browser") } or return the results in XML format: 6
Analysis of Genomic and Proteomic Data. Practicals. Benjamin Haibe-Kains. February 17, 2005
Analysis of Genomic and Proteomic Data Affymetrix c Technology and Preprocessing Methods Practicals Benjamin Haibe-Kains February 17, 2005 1 R and Bioconductor You must have installed R (available from
More informationWorking with Affymetrix data: estrogen, a 2x2 factorial design example
Bioconductor exercises 1 Working with Affymetrix data: estrogen, a 2x2 factorial design example Practical Microarray Course, Heidelberg Oct 2003 Robert Gentleman, Wolfgang Huber 1.) Preliminaries. To go
More informationPackage affypdnn. January 10, 2019
Version 1.56.0 Date 2009-09-22 Package affypdnn January 10, 2019 Title Probe Dependent Nearest Neighbours (PDNN) for the affy package Author H. Bjorn Nielsen and Laurent Gautier (Many thanks to Li Zhang
More informationUsing FARMS for summarization Using I/NI-calls for gene filtering. Djork-Arné Clevert. Institute of Bioinformatics, Johannes Kepler University Linz
Software Manual Institute of Bioinformatics, Johannes Kepler University Linz Using FARMS for summarization Using I/NI-calls for gene filtering Djork-Arné Clevert Institute of Bioinformatics, Johannes Kepler
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org kcoombes@mdanderson.org
More informationMicroarray Technology (Affymetrix ) and Analysis. Practicals
Data Analysis and Modeling Methods Microarray Technology (Affymetrix ) and Analysis Practicals B. Haibe-Kains 1,2 and G. Bontempi 2 1 Unité Microarray, Institut Jules Bordet 2 Machine Learning Group, Université
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org
More informationBioconductor tutorial
Bioconductor tutorial Adapted by Alex Sanchez from tutorials by (1) Steffen Durinck, Robert Gentleman and Sandrine Dudoit (2) Laurent Gautier (3) Matt Ritchie (4) Jean Yang Outline The Bioconductor Project
More informationI. Overview of the Bioconductor Project. Bioinformatics and Biostatistics Lab., Seoul National Univ. Seoul, Korea Eun-Kyung Lee
Introduction to Bioconductor I. Overview of the Bioconductor Project Bioinformatics and Biostatistics Lab., Seoul National Univ. Seoul, Korea Eun-Kyung Lee Outline What is R? Overview of the Biocondcutor
More informationPreprocessing Affymetrix Data. Educational Materials 2005 R. Irizarry and R. Gentleman Modified 21 November, 2009, M. Morgan
Preprocessing Affymetrix Data Educational Materials 2005 R. Irizarry and R. Gentleman Modified 21 November, 2009, M. Morgan 1 Input, Quality Assessment, and Pre-processing Fast Track Identify CEL files.
More informationAffymetrix Microarrays
Affymetrix Microarrays Cavan Reilly November 3, 2017 Table of contents Overview The CLL data set Quality Assessment and Remediation Preprocessing Testing for Differential Expression Moderated Tests Volcano
More informationAlternative CDF environments
Alternative CDF environments Laurent Gautier November 1, 2004 Introduction On short oligonuleotide arrays, several probes are designed to match a target transcript, and probes matching the same target
More informationUsing oligonucleotide microarray reporter sequence information for preprocessing and quality assessment
Using oligonucleotide microarray reporter sequence information for preprocessing and quality assessment Wolfgang Huber and Robert Gentleman April 30, 2018 Contents 1 Overview 1 2 Using probe packages 1
More informationPackage makecdfenv. March 13, 2019
Version 1.58.0 Date 2006-03-06 Title CDF Environment Maker Package makecdfenv March 13, 2019 Author Rafael A. Irizarry , Laurent Gautier , Wolfgang Huber ,
More informationsscore July 13, 2010 vector of data tuning constant (see details) fuzz value to avoid division by zero (see details)
sscore July 13, 2010 OneStepBiweightAlgorithm One-step Tukey s biweight Computes one-step Tukey s biweight on a vector. that this implementation follows the Affymetrix code, which is different from the
More informationExploring cdna Data. Achim Tresch, Andreas Buness, Tim Beißbarth, Wolfgang Huber
Exploring cdna Data Achim Tresch, Andreas Buness, Tim Beißbarth, Wolfgang Huber Practical DNA Microarray Analysis http://compdiag.molgen.mpg.de/ngfn/pma0nov.shtml The following exercise will guide you
More informationGene Expression an Overview of Problems & Solutions: 1&2. Utah State University Bioinformatics: Problems and Solutions Summer 2006
Gene Expression an Overview of Problems & Solutions: 1&2 Utah State University Bioinformatics: Problems and Solutions Summer 2006 Review DNA mrna Proteins action! mrna transcript abundance ~ expression
More informationPackage sscore. R topics documented: June 27, Version Date
Version 1.52.0 Date 2009-04-11 Package sscore June 27, 2018 Title S-Score Algorithm for Affymetrix Oligonucleotide Microarrays Author Richard Kennedy , based on C++ code from
More informationExploring cdna Data. Achim Tresch, Andreas Buness, Tim Beißbarth, Florian Hahne, Wolfgang Huber. June 17, 2005
Exploring cdna Data Achim Tresch, Andreas Buness, Tim Beißbarth, Florian Hahne, Wolfgang Huber June 7, 00 The following exercise will guide you through the first steps of a spotted cdna microarray analysis.
More informationBioconductor s stepnorm package
Bioconductor s stepnorm package Yuanyuan Xiao 1 and Yee Hwa Yang 2 October 18, 2004 Departments of 1 Biopharmaceutical Sciences and 2 edicine University of California, San Francisco yxiao@itsa.ucsf.edu
More informationExploring cdna Data. Achim Tresch, Andreas Buness, Tim Beißbarth, Wolfgang Huber
Exploring cdna Data Achim Tresch, Andreas Buness, Tim Beißbarth, Wolfgang Huber Practical DNA Microarray Analysis, Heidelberg, March 2005 http://compdiag.molgen.mpg.de/ngfn/pma2005mar.shtml The following
More informationDescription of gcrma package
Description of gcrma package Zhijin(Jean) Wu, Rafael Irizarry October 30, 2018 Contents 1 Introduction 1 2 What s new in version 2.0.0 3 3 Options for gcrma 3 4 Getting started: the simplest example 4
More informationIntroduction to the Bioconductor marray package : Input component
Introduction to the Bioconductor marray package : Input component Yee Hwa Yang 1 and Sandrine Dudoit 2 April 30, 2018 Contents 1. Department of Medicine, University of California, San Francisco, jean@biostat.berkeley.edu
More informationReadqPCR: Functions to load RT-qPCR data into R
ReadqPCR: Functions to load RT-qPCR data into R James Perkins University College London February 15, 2013 Contents 1 Introduction 1 2 read.qpcr 1 3 read.taqman 3 4 qpcrbatch 4 1 Introduction The package
More informationaffypara: Parallelized preprocessing algorithms for high-density oligonucleotide array data
affypara: Parallelized preprocessing algorithms for high-density oligonucleotide array data Markus Schmidberger Ulrich Mansmann IBE August 12-14, Technische Universität Dortmund, Germany Preprocessing
More informationParallelized preprocessing algorithms for high-density oligonucleotide arrays
Parallelized preprocessing algorithms for high-density oligonucleotide arrays Markus Schmidberger, Ulrich Mansmann Chair of Biometrics and Bioinformatics, IBE University of Munich 81377 Munich, Germany
More informationBioConductor Overviewr
BioConductor Overviewr 2016-09-28 Contents Installing Bioconductor 1 Bioconductor basics 1 ExressionSet 2 assaydata (gene expression)........................................ 2 phenodata (sample annotations).....................................
More informationPackage splicegear. R topics documented: December 4, Title splicegear Version Author Laurent Gautier
Title splicegear Version 1.51.0 Author Laurent Gautier Package splicegear December 4, 2017 A set of tools to work with alternative splicing Maintainer Laurent Gautier
More informationPackage frma. R topics documented: March 8, Version Date Title Frozen RMA and Barcode
Version 1.34.0 Date 2017-11-08 Title Frozen RMA and Barcode Package frma March 8, 2019 Preprocessing and analysis for single microarrays and microarray batches. Author Matthew N. McCall ,
More informationNormalization: Bioconductor s marray package
Normalization: Bioconductor s marray package Yee Hwa Yang 1 and Sandrine Dudoit 2 October 30, 2017 1. Department of edicine, University of California, San Francisco, jean@biostat.berkeley.edu 2. Division
More informationExploring cdna Data. Achim Tresch, Andreas Buness, Wolfgang Huber, Tim Beißbarth
Exploring cdna Data Achim Tresch, Andreas Buness, Wolfgang Huber, Tim Beißbarth Practical DNA Microarray Analysis http://compdiag.molgen.mpg.de/ngfn/pma0nov.shtml The following exercise will guide you
More informationHowTo: Querying online Data
HowTo: Querying online Data Jeff Gentry and Robert Gentleman November 12, 2017 1 Overview This article demonstrates how you can make use of the tools that have been provided for on-line querying of data
More informationBayesian Pathway Analysis (BPA) Tutorial
Bayesian Pathway Analysis (BPA) Tutorial Step by Step to run BPA: 1-) Download latest version of BPAS from BPA website. Unzip it to an appropriate directory. You need to have JAVA Runtime engine and Matlab
More informationBuilding R objects from ArrayExpress datasets
Building R objects from ArrayExpress datasets Audrey Kauffmann October 30, 2017 1 ArrayExpress database ArrayExpress is a public repository for transcriptomics and related data, which is aimed at storing
More informationLab: An introduction to Bioconductor
Lab: An introduction to Bioconductor Martin Morgan 14 May, 2007 Take one of these paths through the lab: 1. Tour 1 visits a few of the major packages in Bioconductor, much like in the accompanying lecture.
More informationFrozen Robust Multi-Array Analysis and the Gene Expression Barcode
Frozen Robust Multi-Array Analysis and the Gene Expression Barcode Matthew N. McCall November 9, 2017 Contents 1 Frozen Robust Multiarray Analysis (frma) 2 1.1 From CEL files to expression estimates...................
More informationIntroduction: microarray quality assessment with arrayqualitymetrics
Introduction: microarray quality assessment with arrayqualitymetrics Audrey Kauffmann, Wolfgang Huber April 4, 2013 Contents 1 Basic use 3 1.1 Affymetrix data - before preprocessing.......................
More informationUsing metama for differential gene expression analysis from multiple studies
Using metama for differential gene expression analysis from multiple studies Guillemette Marot and Rémi Bruyère Modified: January 28, 2015. Compiled: January 28, 2015 Abstract This vignette illustrates
More informationPackage AffyExpress. October 3, 2013
Version 1.26.0 Date 2009-07-22 Package AffyExpress October 3, 2013 Title Affymetrix Quality Assessment and Analysis Tool Author Maintainer Xuejun Arthur Li Depends R (>= 2.10), affy (>=
More informationCodelink Legacy: the old Codelink class
Codelink Legacy: the old Codelink class Diego Diez October 30, 2018 1 Introduction Codelink is a platform for the analysis of gene expression on biological samples property of Applied Microarrays, Inc.
More informationPackage affyplm. January 6, 2019
Version 1.58.0 Title Methods for fitting probe-level models Author Ben Bolstad Maintainer Ben Bolstad Package affyplm January 6, 2019 Depends R (>= 2.6.0), BiocGenerics
More informationMicroarray Data Analysis (V) Preprocessing (i): two-color spotted arrays
Microarray Data Analysis (V) Preprocessing (i): two-color spotted arrays Preprocessing Probe-level data: the intensities read for each of the components. Genomic-level data: the measures being used in
More informationMiChip. Jonathon Blake. October 30, Introduction 1. 5 Plotting Functions 3. 6 Normalization 3. 7 Writing Output Files 3
MiChip Jonathon Blake October 30, 2018 Contents 1 Introduction 1 2 Reading the Hybridization Files 1 3 Removing Unwanted Rows and Correcting for Flags 2 4 Summarizing Intensities 3 5 Plotting Functions
More informationBioconductor exercises 1. Exploring cdna data. June Wolfgang Huber and Andreas Buness
Bioconductor exercises Exploring cdna data June 2004 Wolfgang Huber and Andreas Buness The following exercise will show you some possibilities to load data from spotted cdna microarrays into R, and to
More information/ Computational Genomics. Normalization
10-810 /02-710 Computational Genomics Normalization Genes and Gene Expression Technology Display of Expression Information Yeast cell cycle expression Experiments (over time) baseline expression program
More informationBayesian Robust Inference of Differential Gene Expression The bridge package
Bayesian Robust Inference of Differential Gene Expression The bridge package Raphael Gottardo October 30, 2017 Contents Department Statistics, University of Washington http://www.rglab.org raph@stat.washington.edu
More informationRDBMS in bioinformatics: the Bioconductor experience
DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ RDBMS in bioinformatics: the Bioconductor experience VJ Carey Harvard University stvjc@channing.harvard.edu Abstract.
More informationReadqPCR: Functions to load RT-qPCR data into R
ReadqPCR: Functions to load RT-qPCR data into R James Perkins University College London Matthias Kohl Furtwangen University April 30, 2018 Contents 1 Introduction 1 2 read.lc480 2 3 read.qpcr 7 4 read.taqman
More informationAn Introduction to Bioconductor s ExpressionSet Class
An Introduction to Bioconductor s ExpressionSet Class Seth Falcon, Martin Morgan, and Robert Gentleman 6 October, 2006; revised 9 February, 2007 1 Introduction Biobase is part of the Bioconductor project,
More informationTutorial - Analysis of Microarray Data. Microarray Core E Consortium for Functional Glycomics Funded by the NIGMS
Tutorial - Analysis of Microarray Data Microarray Core E Consortium for Functional Glycomics Funded by the NIGMS Data Analysis introduction Warning: Microarray data analysis is a constantly evolving science.
More informationVector Xpression 3. Speed Tutorial: III. Creating a Script for Automating Normalization of Data
Vector Xpression 3 Speed Tutorial: III. Creating a Script for Automating Normalization of Data Table of Contents Table of Contents...1 Important: Please Read...1 Opening Data in Raw Data Viewer...2 Creating
More informationMicroarray Data Analysis (VI) Preprocessing (ii): High-density Oligonucleotide Arrays
Microarray Data Analysis (VI) Preprocessing (ii): High-density Oligonucleotide Arrays High-density Oligonucleotide Array High-density Oligonucleotide Array PM (Perfect Match): The perfect match probe has
More informationBasic Functions of AnnBuilder
Basic Functions of AnnBuilder Jianhua Zhang June 23, 2003 c 2003 Bioconductor 1 Introduction This vignette is an overview of some of the functions that can be used to build an annotation data package.
More informationaffyqcreport: A Package to Generate QC Reports for Affymetrix Array Data
affyqcreport: A Package to Generate QC Reports for Affymetrix Array Data Craig Parman and Conrad Halling April 30, 2018 Contents 1 Introduction 1 2 Getting Started 2 3 Figure Details 3 3.1 Report page
More informationHow to use the rbsurv Package
How to use the rbsurv Package HyungJun Cho, Sukwoo Kim, Soo-heang Eo, and Jaewoo Kang April 30, 2018 Contents 1 Introduction 1 2 Robust likelihood-based survival modeling 2 3 Algorithm 2 4 Example: Glioma
More informationCreating a New Annotation Package using SQLForge
Creating a New Annotation Package using SQLForge Marc Carlson, Herve Pages, Nianhua Li November 19, 2013 1 Introduction The AnnotationForge package provides a series of functions that can be used to build
More informationHow to use CNTools. Overview. Algorithms. Jianhua Zhang. April 14, 2011
How to use CNTools Jianhua Zhang April 14, 2011 Overview Studies have shown that genomic alterations measured as DNA copy number variations invariably occur across chromosomal regions that span over several
More informationPackage virtualarray
Package virtualarray March 26, 2013 Type Package Title Build virtual array from different microarray platforms Version 1.2.1 Date 2012-03-08 Author Andreas Heider Maintainer Andreas Heider
More informationPackage RPA. R topics documented: September 13, Type Package
Type Package Package RPA September 13, 2018 Title RPA: Robust Probabilistic Averaging for probe-level analysis Encoding UTF-8 Version 1.36.0 Date 2017-11-12 Depends R (>= 3.1.1), affy, BiocGenerics, methods
More informationDrug versus Disease (DrugVsDisease) package
1 Introduction Drug versus Disease (DrugVsDisease) package The Drug versus Disease (DrugVsDisease) package provides a pipeline for the comparison of drug and disease gene expression profiles where negatively
More informationImport GEO Experiment into Partek Genomics Suite
Import GEO Experiment into Partek Genomics Suite This tutorial will illustrate how to: Import a gene expression experiment from GEO SOFT files Specify annotations Import RAW data from GEO for gene expression
More informationMATH3880 Introduction to Statistics and DNA MATH5880 Statistics and DNA Practical Session Monday, 16 November pm BRAGG Cluster
MATH3880 Introduction to Statistics and DNA MATH5880 Statistics and DNA Practical Session Monday, 6 November 2009 3.00 pm BRAGG Cluster This document contains the tasks need to be done and completed by
More informationIntroduction to the Codelink package
Introduction to the Codelink package Diego Diez October 30, 2018 1 Introduction This package implements methods to facilitate the preprocessing and analysis of Codelink microarrays. Codelink is a microarray
More informationGene Set Enrichment Analysis. GSEA User Guide
Gene Set Enrichment Analysis GSEA User Guide 1 Software Copyright The Broad Institute SOFTWARE COPYRIGHT NOTICE AGREEMENT This software and its documentation are copyright 2009, 2010 by the Broad Institute/Massachusetts
More informationRelease Notes. JMP Genomics. Version 4.0
JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive
More informationTextual Description of webbioc
Textual Description of webbioc Colin A. Smith October 13, 2014 Introduction webbioc is a web interface for some of the Bioconductor microarray analysis packages. It is designed to be installed at local
More informationRLMM - Robust Linear Model with Mahalanobis Distance Classifier
RLMM - Robust Linear Model with Mahalanobis Distance Classifier Nusrat Rabbee and Gary Wong June 13, 2018 Contents 1 Introduction 1 2 Instructions for Genotyping Affymetrix Mapping 100K array - Xba set
More informationPackage SCAN.UPC. July 17, 2018
Type Package Package SCAN.UPC July 17, 2018 Title Single-channel array normalization (SCAN) and Universal expression Codes (UPC) Version 2.22.0 Author and Andrea H. Bild and W. Evan Johnson Maintainer
More informationAB1700 Microarray Data Analysis
AB1700 Microarray Data Analysis Yongming Andrew Sun, Applied Biosystems sunya@appliedbiosystems.com October 30, 2017 Contents 1 ABarray Package Introduction 2 1.1 Required Files and Format.........................................
More informationPackage DupChecker. April 11, 2018
Type Package Package DupChecker April 11, 2018 Title a package for checking high-throughput genomic data redundancy in meta-analysis Version 1.16.0 Date 2014-10-07 Author Quanhu Sheng, Yu Shyr, Xi Chen
More informationApplying Data-Driven Normalization Strategies for qpcr Data Using Bioconductor
Applying Data-Driven Normalization Strategies for qpcr Data Using Bioconductor Jessica Mar April 30, 2018 1 Introduction High-throughput real-time quantitative reverse transcriptase polymerase chain reaction
More informationBioconductor: Annotation Package Overview
Bioconductor: Annotation Package Overview April 30, 2018 1 Overview In its current state the basic purpose of annotate is to supply interface routines that support user actions that rely on the different
More informationPackage spikeli. August 3, 2013
Package spikeli August 3, 2013 Type Package Title Affymetrix Spike-in Langmuir Isotherm Data Analysis Tool Version 2.21.0 Date 2009-04-03 Author Delphine Baillon, Paul Leclercq ,
More informationCARMAweb users guide version Johannes Rainer
CARMAweb users guide version 1.0.8 Johannes Rainer July 4, 2006 Contents 1 Introduction 1 2 Preprocessing 5 2.1 Preprocessing of Affymetrix GeneChip data............................. 5 2.2 Preprocessing
More informationComputer Exercise - Microarray Analysis using Bioconductor
Computer Exercise - Microarray Analysis using Bioconductor Introduction The SWIRL dataset The SWIRL dataset comes from an experiment using zebrafish to study early development in vertebrates. SWIRL is
More informationR version has been released on (Linux source code versions)
Installation of R and Bioconductor R is a free software environment for statistical computing and graphics. It is based on the statistical computer language S. It is famous for its wide set of statistical
More informationLab: Using R and Bioconductor
Lab: Using R and Bioconductor Robert Gentleman Florian Hahne Paul Murrell June 19, 2006 Introduction In this lab we will cover some basic uses of R and also begin working with some of the Bioconductor
More informationUser Guide for DNAFORM Clone Search Engine
User Guide for DNAFORM Clone Search Engine Document Version: 3.0 Dated from: 1 October 2010 The document is the property of K.K. DNAFORM and may not be disclosed, distributed, or replicated without the
More informationOrganizing, cleaning, and normalizing (smoothing) cdna microarray data
Organizing, cleaning, and normalizing (smoothing) cdna microarray data All product names are given as examples only and they are not endorsed by the USDA or the University of Illinois. INTRODUCTION The
More informationRecalling Genotypes with BEAGLECALL Tutorial
Recalling Genotypes with BEAGLECALL Tutorial Release 8.1.4 Golden Helix, Inc. June 24, 2014 Contents 1. Format and Confirm Data Quality 2 A. Exclude Non-Autosomal Markers......................................
More informationPackage SCAN.UPC. October 9, Type Package. Title Single-channel array normalization (SCAN) and University Probability of expression Codes (UPC)
Package SCAN.UPC October 9, 2013 Type Package Title Single-channel array normalization (SCAN) and University Probability of expression Codes (UPC) Version 2.0.2 Author Stephen R. Piccolo and W. Evan Johnson
More informationAffymetrix Data Transfer Tool User s Guide Version 1.1
Affymetrix Data Transfer Tool User s Guide Version 1.1 P/N 701747 Rev. 3 For research use only. Not for use in diagnostic procedures. Trademarks Affymetrix,, GeneChip, HuSNP, GenFlex, Flying Objective,
More informationAn introduction to Genomic Data Structures
An introduction to Genomic Data Structures Cavan Reilly October 30, 2017 Table of contents Object Oriented Programming The ALL data set ExpressionSet Objects Environments More on ExpressionSet Objects
More informationPackage simpleaffy. January 11, 2018
Package simpleaffy January 11, 2018 Title Very simple high level analysis of Affymetrix data Version 2.54.0 Author Provides high level functions for reading Affy.CEL files, phenotypic data, and then computing
More informationPackage rmat. November 1, 2018
Type Package Package rmat November 1, 2018 Title R implementation from MAT program to normalize and analyze tiling arrays and ChIP-chip data. Version 3.32.0 Author Charles Cheung and Arnaud Droit and Raphael
More informationPreprocessing -- examples in microarrays
Preprocessing -- examples in microarrays I: cdna arrays Image processing Addressing (gridding) Segmentation (classify a pixel as foreground or background) Intensity extraction (summary statistic) Normalization
More informationPackage pdinfobuilder
Package pdinfobuilder April 10, 2018 Title Platform Design Information Package Builder Builds platform design information packages. These consist of a SQLite database containing feature-level data such
More informationExpander Online Documentation
Expander Online Documentation Table of Contents Introduction...1 Starting EXPANDER...2 Input Data...4 Preprocessing GE Data...8 Viewing Data Plots...12 Clustering GE Data...14 Biclustering GE Data...17
More informationPackage SC3. November 27, 2017
Type Package Title Single-Cell Consensus Clustering Version 1.7.1 Author Vladimir Kiselev Package SC3 November 27, 2017 Maintainer Vladimir Kiselev A tool for unsupervised
More informationIntroduction to GE Microarray data analysis Practical Course MolBio 2012
Introduction to GE Microarray data analysis Practical Course MolBio 2012 Claudia Pommerenke Nov-2012 Transkriptomanalyselabor TAL Microarray and Deep Sequencing Core Facility Göttingen University Medical
More informationLabelled quantitative proteomics with MSnbase
Labelled quantitative proteomics with MSnbase Laurent Gatto lg390@cam.ac.uk Cambridge Centre For Proteomics University of Cambridge European Bioinformatics Institute (EBI) 18 th November 2010 Plan 1 Introduction
More informationPackage RLMM. March 7, 2019
Version 1.44.0 Date 2005-09-02 Package RLMM March 7, 2019 Title A Genotype Calling Algorithm for Affymetrix SNP Arrays Author Nusrat Rabbee , Gary Wong
More informationcrlmm to downstream data analysis
crlmm to downstream data analysis VJ Carey, B Carvalho March, 2012 1 Running CRLMM on a nontrivial set of CEL files To use the crlmm algorithm, the user must load the crlmm package, as described below:
More informationBiobase development and the new eset
Biobase development and the new eset Martin T. Morgan * 7 August, 2006 Revised 4 September, 2006 featuredata slot. Revised 20 April 2007 minor wording changes; verbose and other arguments passed through
More informationIntroduction to the xps Package: Comparison to Affymetrix Power Tools (APT)
Introduction to the xps Package: Comparison to Affymetrix Power Tools (APT) Christian Stratowa April, 2010 Contents 1 Introduction 1 2 Comparison of results for the Human Genome U133 Plus 2.0 Array 2 2.1
More informationPackage stepnorm. R topics documented: April 10, Version Date
Version 1.38.0 Date 2008-10-08 Package stepnorm April 10, 2015 Title Stepwise normalization functions for cdna microarrays Author Yuanyuan Xiao , Yee Hwa (Jean) Yang
More informationChIPXpress: enhanced ChIP-seq and ChIP-chip target gene identification using publicly available gene expression data
ChIPXpress: enhanced ChIP-seq and ChIP-chip target gene identification using publicly available gene expression data George Wu, Hongkai Ji December 22, 2017 1 Introduction ChIPx (i.e., ChIP-seq and ChIP-chip)
More informationmirnet Tutorial Starting with expression data
mirnet Tutorial Starting with expression data Computer and Browser Requirements A modern web browser with Java Script enabled Chrome, Safari, Firefox, and Internet Explorer 9+ For best performance and
More informationAlternative CDF environments
Alternative CDF environments Laurent Gautier April 30, 2018 Contents 1 Introduction 1 2 The class CdfEnvAffy 2 3 Reading sequence information in FASTA connections 2 4 Creating an alternative mapping from
More informationBioMart: a research data management tool for the biomedical sciences
Yale University From the SelectedWorks of Rolando Garcia-Milian 2014 BioMart: a research data management tool for the biomedical sciences Rolando Garcia-Milian, Yale University Available at: https://works.bepress.com/rolando_garciamilian/2/
More information