Package pdinfobuilder

Similar documents
crlmm to downstream data analysis

Package makecdfenv. March 13, 2019

Package affyio. January 18, Index 6. CDF file format function

Package AnnotationForge

Package rmat. November 1, 2018

Creating a New Annotation Package using SQLForge

Package mimager. March 7, 2019

Package AnnotationHub

Creating a New Annotation Package using SQLForge

Package AnnotationHubData

Package DFP. February 2, 2018

Preprocessing and Genotyping Illumina Arrays for Copy Number Analysis

Package RGalaxy. R topics documented: January 19, 2018

Package genomeintervals

HOWTO generate biocviews HTML

Package RLMM. March 7, 2019

Package affxparser. R topics documented: September 29, Version Depends R (>= )

500K Data Analysis Workflow using BRLMM

Package rsffreader. May 14, 2018

Package SCAN.UPC. July 17, 2018

Package MiRaGE. October 16, 2018

Djork-Arné Clevert and Andreas Mitterecker. Institute of Bioinformatics, Johannes Kepler University Linz

Analysis of two-way cell-based assays

Package RTCGAToolbox

Djork-Arné Clevert and Andreas Mitterecker. Institute of Bioinformatics, Johannes Kepler University Linz

Package snplist. December 11, 2017

Package SCAN.UPC. October 9, Type Package. Title Single-channel array normalization (SCAN) and University Probability of expression Codes (UPC)

SimBindProfiles: Similar Binding Profiles, identifies common and unique regions in array genome tiling array data

Package FunciSNP. November 16, 2018

Package AnnotationHub

Package SNPchip. May 3, 2018

Package oligo. April 7, 2018

Package GEOmetadb. October 4, 2013

Package EventPointer

Bioconductor tutorial

Package TilePlot. April 8, 2011

Package frma. R topics documented: March 8, Version Date Title Frozen RMA and Barcode

How to Use pkgdeptools

Package QUALIFIER. March 26, Imports MASS,hwriter,RSVGTipsDevice,lattice,stats4,flowCore,flowViz,methods,flowWorkspace,reshape

Package sscore. R topics documented: June 27, Version Date

Analysis of screens with enhancer and suppressor controls

Package Risa. November 28, 2017

Package demi. February 19, 2015

Package Organism.dplyr

Package seqcat. March 25, 2019

Introduction: microarray quality assessment with arrayqualitymetrics

Package TxRegInfra. March 17, 2019

Package ensemblvep. April 5, 2014

Package yaqcaffy. January 19, 2019

Package AnnotationHub

RLMM - Robust Linear Model with Mahalanobis Distance Classifier

Package BiocStyle. January 26, 2018

How to use CNTools. Overview. Algorithms. Jianhua Zhang. April 14, 2011

Package plethy. April 4, 2019

Building an R Package

Package AffyExpress. October 3, 2013

Axiom Analysis Suite Release Notes (For research use only. Not for use in diagnostic procedures.)

Implementing S4 objects in your package: Exercises

Importing and Merging Data Tutorial

Package saascnv. May 18, 2016

Quick Reference Card. GeneChip Sequence Analysis Software 4.1. I. GSEQ Introduction

Package gmapr. January 22, 2019

Using crlmm for copy number estimation and genotype calling with Illumina platforms

Package splicegear. R topics documented: December 4, Title splicegear Version Author Laurent Gautier

Package GOTHiC. November 22, 2017

apt-probeset-genotype Manual

Package SMAP. R topics documented: June 19, 2018

Package OSAT. February 13, 2018

Release Notes. JMP Genomics. Version 4.0

Package HTSeqGenie. April 16, 2019

Package BiocStyle. December 9, 2018

Package TilePlot. February 15, 2013

Package mgsa. January 13, 2019

Package biomformat. April 11, 2018

Package procoil. R topics documented: April 10, Type Package

From raw data to gene annotations

Affymetrix Genotyping Console 4.2 Release Notes (For research use only. Not for use in diagnostic procedures.)

Package GSRI. March 31, 2019

Package AffyCompatible

puma User Guide R. D. Pearson, X. Liu, M. Rattray, M. Milo, N. D. Lawrence G. Sanguinetti, Li Zhang October 30, Abstract 2 2 Citing puma 2

Affymetrix Data Transfer Tool User s Guide Version 1.1

Package muscle. R topics documented: March 7, Type Package

Package MEAL. August 16, 2018

Package ibbig. R topics documented: December 24, 2018

Package ensemblvep. January 19, 2018

Package BiocInstaller

The SQLiteDF Package

Package customprodb. September 9, 2018

Robert Gentleman! Copyright 2011, all rights reserved!

Package spikeli. August 3, 2013

Bioconductor: Annotation Package Overview

Package mirnapath. July 18, 2013

sscore July 13, 2010 vector of data tuning constant (see details) fuzz value to avoid division by zero (see details)

Package affyplm. January 6, 2019

Package DAVIDQuery. R topics documented: October 4, Type Package. Title Retrieval from the DAVID bioinformatics data resource into R

Wave correction for arrays

Package BiocStyle. April 22, 2016

Affymetrix Genotyping Console 3.0 User Manual

Package OTUbase. R topics documented: January 28, Type Package

Package DASiR. R topics documented: October 4, Type Package. Title Distributed Annotation System in R. Version

Transcription:

Package pdinfobuilder April 10, 2018 Title Platform Design Information Package Builder Builds platform design information packages. These consist of a SQLite database containing feature-level data such as x, y position on chip and featureset ID. The database also incorporates featureset-level annotation data. The products of this packages are used by the oligo pkg. Version 1.43.0 Author Seth Falcon, Vince Carey, Matt Settles, Kristof de Beuf, Benilton Carvalho Maintainer Benilton Carvalho <beniltoncarvalho@gmail.com> LazyLoad yes Depends R (>= 3.2.0), methods, Biobase (>= 2.27.3), RSQLite (>= 1.0.0), affxparser (>= 1.39.4), oligo (>= 1.31.5) Imports Biostrings (>= 2.35.12), BiocGenerics (>= 0.13.11), DBI (>= 0.3.1), IRanges (>= 2.1.43), oligoclasses (>= 1.29.6), S4Vectors (>= 0.5.22) License Artistic-2.0 Collate AllClasses.R AllGenerics.R initialize-methods.r utils.r schema.r initdb.r initdb.snp6.r pmmmblocktomat.r loaders.r loaders.snp6.r makepdinfopackage-methods.r chipname-methods.r getgeometry-methods.r pdbuilderv2tiledregion.r pdbuilderv2exontranscription.r pdbuilderv2gene.r pdbuilderv2hta2.r pdbuilderv2affytiling.r pdbuilderv2ngsexpression.r pdbuilderv2affyexpressionht.r pdbuilderv2affysnp.r pdbuilderv2affysnpcnv.r pdbuilderv2mirna.r pdbuilderv3genericarray.r pdbuilderv2clariom.r biocviews Annotation, Infrastructure R topics documented: AffyClariomSPDInfoPkgSeed............................... 2 AffyExpressionPDInfoPkgSeed-class........................... 3 AffySNPCNVPDInfoPkgSeed-class............................ 4 AffySNPCNVPDInfoPkgSeed2-class........................... 5 AffySNPPDInfoPkgSeed-class............................... 6 AffySNPPDInfoPkgSeed2-class.............................. 7 AffySTPDInfoPkgSeed-class................................ 8 1

2 AffyClariomSPDInfoPkgSeed AffyTilingPDInfoPkgSeed-class.............................. 9 cdf2table.......................................... 10 chipname.......................................... 11 getgeometry........................................ 11 makepdinfopackage..................................... 12 NgsExpressionPDInfoPkgSeed-class............................ 13 NgsTilingPDInfoPkgSeed-class.............................. 14 NimbleGenPDInfoPkgSeed-class.............................. 15 Index 16 AffyClariomSPDInfoPkgSeed Class "AffyClariomSPDInfoPkgSeed" PD Info Package Seed for Affymetrix Clariom S Arrays Objects can be created by calls of the form new("affyclariomspdinfopkgseed", pgffile, clffile, coremps, tra pgffile: PGF filename clffile: CLF filename coremps: MPS filename transfile: Transcript annotation CSV file chipname chipname getgeometry initialize makepdinfopackage package creator showclass("affyclariomspdinfopkgseed")

AffyExpressionPDInfoPkgSeed-class 3 AffyExpressionPDInfoPkgSeed-class Class "AffyExpressionPDInfoPkgSeed" PD Info Package Seed for Affymetrix Expression Arrays Objects can be created by calls of the form new("affyexpressionpdinfopkgseed", cdffile, csvannofile, tabseq cdffile: CDF filename celfile: CEL filename tabseqfile: TAB sequence file chipname chipname getgeometry initialize makepdinfopackage package creator showclass("affyexpressionpdinfopkgseed")

4 AffySNPCNVPDInfoPkgSeed-class AffySNPCNVPDInfoPkgSeed-class Class "AffySNPCNVPDInfoPkgSeed" This class represents Platform Design (PD) packages for Affymetrix genomewide (SNP 5.0 and SNP 6.0) arrays. Objects can be created by calls of the form new("affysnpcnvpdinfopkgseed", cdffile, csvannofile, csvseqfile cdffile: Path to the CDF file for this. csvannofile: Path to the Affymetrix CSV annotation for the SNP probes. csvseqfile: Path to the (SNP) probe sequence file. csvannofilecnv: Path to the Affymetrix CSV annotation for the CNV probes. csvseqfilecnv: Path to the (CNV) probe sequence file. splineparamfile: Path to the spline parameters file used to compute the predicted accuracy of the the genotype calls. Used internally in.predictaccuracy. crlmminfofile: Path to is data file containing regions data used by the crlmm function. referencedistfile: Path to a reference distribution file used in the normalization step. This is the reference used in snprma. chipname signature(object = "AffySNPCNVPDInfoPkgSeed"):... getgeometry signature(object = "AffySNPCNVPDInfoPkgSeed"):... makepdinfopackage signature(object = "AffySNPCNVPDInfoPkgSeed"):...

AffySNPCNVPDInfoPkgSeed2-class 5 Notes *IMPORTANT* Users are strongly advised to download Affymetrix SNP packages from BioConductor. The files used for slots splineparamfile, crlmminfofile, and referencedistfile are generated by the Bioconductor project for each chip/platform and are hosted in our svn data repository at https://hedgehog.fhcrc.org/bioc-data/trunk/annotation/parms_store. When makepdinfopackage is run, these files are simply copied to the inst/extdata directory of the generated package. Author(s) Benilton Carvalho showclass("affysnpcnvpdinfopkgseed") AffySNPCNVPDInfoPkgSeed2-class Class "AffySNPCNVPDInfoPkgSeed2" A generic annotation package builder for Affymetrix SNP/CNV arrays. This is a simplified version of the annotation package and crlmm will *NOT* work for them. Objects can be created by calls of the form new("affysnpcnvpdinfopkgseed2", csvannofilecnv, csvseqfilecnv, cdffile: Path to the CDF file for this. csvannofile: Path to the Affymetrix CSV annotation for the SNP probes. csvseqfile: Path to the (SNP) probe sequence file. csvannofilecnv: Path to the Affymetrix CSV annotation for the CNV probes. csvseqfilecnv: Path to the (CNV) probe sequence file.

6 AffySNPPDInfoPkgSeed-class Note chipname signature(object = "AffySNPCNVPDInfoPkgSeed2"):... makepdinfopackage signature(object = "AffySNPCNVPDInfoPkgSeed2"):... This is a simplified annotation package. CRLMM won t work for these objects. The user may need to rename the columns or even add column names to the annotation and sequence files. In case problems are found, column names are suggested. Author(s) Benilton Carvalho showclass("affysnpcnvpdinfopkgseed2") AffySNPPDInfoPkgSeed-class Class "AffySNPPDInfoPkgSeed" This class represents Platform Design (PD) packages for Affymetrix mapping (SNP chip) arrays. Objects can be created by calls of the form new("affysnppdinfopkgseed", splineparamfile, crlmminfofile, ref splineparamfile: Spline parameters file used to compute the predicted accuracy of the genotype calls. crlmminfofile: Data file containing regions data used by the crlmm function. referencedistfile: Reference distribution file used in the normalization step by snprma. cdffile: CDF file for the design. csvannofile: Affymetrix CSV Annotation file. csvseqfile: Affymetrix Probe Sequence file.

AffySNPPDInfoPkgSeed2-class 7 chipname signature(object = "AffySNPPDInfoPkgSeed"):... getgeometry signature(object = "AffySNPPDInfoPkgSeed"):... makepdinfopackage signature(object = "AffySNPPDInfoPkgSeed"):... Note *IMPORTANT* The user is strongly advised to download Affymetrix SNP packages from BioConductor. The files used for slots splineparamfile, crlmminfofile, and referencedistfile are generated by the Bioconductor project for each chip/platform and are hosted in our svn data repository at https://hedgehog.fhcrc.org/bioc-data/trunk/annotation/parms_store. When makepdinfopackage is run, these files are simply copied to the inst/extdata directory of the generated package. showclass("affysnppdinfopkgseed") cdffile <- "Mapping250K_Nsp.cdf" csvanno <- "Mapping250K_Nsp_annot.csv" csvseq <- "Mapping250K_Nsp_probe_tab" spline <- "pd.mapping250k.nsp.spline.params.rda" refd <- "pd.mapping250k.nspref.rda" crlmminf <- "pd.mapping250k.nspcrlmminfo.rda" pkg <- new("affysnppdinfopkgseed", version="0.1.5", author="a. U. Thor", email="au@thor.net", biocviews="annotationdata", genomebuild="ncbi Build 35, May 2004", cdffile=cdffile, csvannofile=csvanno, csvseqfile=csvseq, splineparamfile=spline, crlmminfofile=crlmminf, referencedistfile=refd) show(classes=class(pkg)) AffySNPPDInfoPkgSeed2-class Class "AffySNPPDInfoPkgSeed2" A generic annotation package builder for Affymetrix SNP arrays. This is a simplified version of the annotation package and crlmm will *not* work for them. Objects can be created by calls of the form new("affysnppdinfopkgseed2", cdffile, csvannofile, csvseqfile,

8 AffySTPDInfoPkgSeed-class axiom: Logical flag for experimental build of annotation packages for Axiom arrays. cdffile: CDF file for the design. csvannofile: Affymetrix CSV Annotation file. csvseqfile: Affymetrix Probe Sequence file. chipname signature(object = "AffySNPPDInfoPkgSeed2"):... Note This is a simplified annotation package. CRLMM won t work for these objects. The user may need to rename the columns or even add column names to the annotation and sequence files. In case problems are found, column names are suggested. showclass("affysnppdinfopkgseed2") AffySTPDInfoPkgSeed-class Class "AffySTPDInfoPkgSeed" for the Sense Target gene-level array container for parameters related to pdmapping package construction for ST type arrays Objects can be created by calls of the form new("affystpdinfopkgseed", pgffile, clffile, probefile, transfi

AffyTilingPDInfoPkgSeed-class 9 pgffile: Object of class "ScalarCharacter" path to pgf clffile: Object of class "ScalarCharacter" path to clf probefile: Object of class "ScalarCharacter", path to probe sequence file (Optional) transfile: Object of class "ScalarCharacter", path to trans file (Optional) chipname signature(object = "AffySTPDInfoPkgSeed"):... getgeometry signature(object = "AffySTPDInfoPkgSeed"):... makepdinfopackage signature(object = "AffySTPDInfoPkgSeed"):... Author(s) B. Carvalho showclass("affystpdinfopkgseed") AffyTilingPDInfoPkgSeed-class Class "AffyTilingPDInfoPkgSeed" PD Info Package Seed for Affymetrix Tiling Arrays Objects can be created by calls of the form new("affytilingpdinfopkgseed",...).

10 cdf2table bpmapfile: BPMAP File - provided by Affymetrix celfile: CEL File - provided by Affymetrix makepdinfopackage signature(object = "AffyTilingPDInfoPkgSeed"):... chipname signature(object = "AffyTilingPDInfoPkgSeed"):... showclass("affytilingpdinfopkgseed") cdf2table Helper functions to assist the creation of an annotation package for a generic array Helper functions to assist the creation of an annotation package for a generic array. This includes converting CDF files into flat tables and parsing probe sequence files. Usage cdf2table(cdffile) sequenceparser(seqfile) Arguments cdffile seqfile name of the CDF file to be used name of the probe sequence file Details cdf2table will convert a CDF to a flat table. seqfile will extract a flat table containing physical location and probe sequences.

chipname 11 chipname Return an Official Chip/Platform Name This generic function returns an official or standard chip/platform name. Usage chipname(object) Arguments object See show("chipname"), but generally object will be a subclass of PkgSeed. Details Value The idea is that the input files can be used to determine a standard name for each platform. For example, the method for AffySNPPDInfoPkgSeed objects reads the header of the CDF file to extract a name. A character vector of length one giving a standard name for the platform. Author(s) Seth Falcon getgeometry Return the Chip/Platform geometry This generic function returns the geometry for a chip/platform. Usage getgeometry(object) Arguments object See show("getgeometry"), but generally object will be a subclass of PkgSeed. Details The idea is that the input files can be used to determine the geometry for each platform. For example, the method for AffySNPPDInfoPkgSeed objects reads the header of the CDF file to extract the geometry.

12 makepdinfopackage Value A list with two elements nrows and ncols Author(s) Matt Settles makepdinfopackage Create a Platform Design Info Package This generic function create a platform design info package based on the parameters contained in object which will generally be an instance of a subclass of PkgSeed. The result is a new directory on the filesystem containing the source for the generated pdinfo package. Usage makepdinfopackage(object, destdir, batch_size = 10000, quiet = FALSE, unlink = FALSE) Arguments object destdir batch_size quiet unlink See show("makepdinfopackage") to see available methods. Path where the resulting pdinfo package source directory will be written. An integer controlling the size of batches processed when reading the flatfiles and loading the DB. In general, larger values of batch_size will use more memory and less time (unless you exceed physical memory, in which case more time will be used as well). A logical value. When TRUE, diagnostic and status messages are not printed. A logical value. If TRUE, and destdir already contains a file or directory with the name pkgname, try to unlink (remove) it. Details In general, creating the SQLite database will be a time and memory intensive task. Value This function is called for its side-effect of producing a pdinfo source package directory. Author(s) Seth Falcon

NgsExpressionPDInfoPkgSeed-class 13 cdffile <- "Mapping250K_Nsp.cdf" csvanno <- "Mapping250K_Nsp_annot.csv" csvseq <- "Mapping250K_Nsp_probe_tab" ## Not run: pkg <- new("affysnppdinfopkgseed", version="0.1.5", author="a.u. Thor", email="au@thor.net", biocviews="annotationdata", genomebuild="ncbi Build 35, May 2004", cdffile=cdffile, csvannofile=csvanno, csvseqfile=csvseq) makepdinfopackage(pkg, destdir=".") ## End(Not run) NgsExpressionPDInfoPkgSeed-class Class "NgsExpressionPDInfoPkgSeed" PDInfo package Seed for NimbleGen Expression arrays Objects can be created by calls of the form new("ngsexpressionpdinfopkgseed", ndffile, pairfile, xysfile, n ndffile: NDF (NimbleGen Design) file xysfile: XYS File - used as template makepdinfopackage signature(.object = "NgsExpressionPDInfoPkgSeed"):... chipname signature(object = "NimbleGenPDInfoPkgSeed"):... getgeometry signature(.object = "NimbleGenPDInfoPkgSeed"):...

14 NgsTilingPDInfoPkgSeed-class showclass("ngsexpressionpdinfopkgseed") NgsTilingPDInfoPkgSeed-class Class "NgsTilingPDInfoPkgSeed" PDInfo package Seed for NimbleGen Tiling arrays Objects can be created by calls of the form new("ngstilingpdinfopkgseed", ndffile, xysfile, pairfile, posfi ndffile: NDF (NimbleGen Design) file xysfile: XYS File - used as template posfile: POS (Positions) file makepdinfopackage signature(.object = "NgsTilingPDInfoPkgSeed"):... chipname signature(object = "NimbleGenPDInfoPkgSeed"):... getgeometry signature(object = "NimbleGenPDInfoPkgSeed"):... showclass("ngstilingpdinfopkgseed")

NimbleGenPDInfoPkgSeed-class 15 NimbleGenPDInfoPkgSeed-class Class "NimbleGenPDInfoPkgSeed" PDInfo package Seed for all NimbleGen arrays Objects can be created by calls of the form new("nimblegenpdinfopkgseed",...). manufacturer: Manufacturer = NimbleGen chipname signature(object = "NimbleGenPDInfoPkgSeed"):... getgeometry signature(object = "NimbleGenPDInfoPkgSeed"):... showclass("nimblegenpdinfopkgseed")

Index Topic classes AffyClariomSPDInfoPkgSeed, 2 AffyExpressionPDInfoPkgSeed-class, 3 AffySNPCNVPDInfoPkgSeed-class, 4 AffySNPCNVPDInfoPkgSeed2-class, 5 AffySNPPDInfoPkgSeed-class, 6 AffySNPPDInfoPkgSeed2-class, 7 AffySTPDInfoPkgSeed-class, 8 AffyTilingPDInfoPkgSeed-class, 9 NgsExpressionPDInfoPkgSeed-class, 13 NgsTilingPDInfoPkgSeed-class, 14 NimbleGenPDInfoPkgSeed-class, 15 Topic manip cdf2table, 10 Topic methods chipname, 11 getgeometry, 11 makepdinfopackage, 12 AffyClariomSPDInfoPkgSeed, 2 AffyClariomSPDInfoPkgSeed-class (AffyClariomSPDInfoPkgSeed), 2 AffyExonPDInfoPkgSeed-class (AffySTPDInfoPkgSeed-class), 8 AffyExpressionPDInfoPkgSeed-class, 3 AffyGenePDInfoPkgSeed-class (AffySTPDInfoPkgSeed-class), 8 AffySNPCNVPDInfoPkgSeed-class, 4 AffySNPCNVPDInfoPkgSeed2-class, 5 AffySNPPDInfoPkgSeed-class, 6 AffySNPPDInfoPkgSeed2-class, 7 AffySTPDInfoPkgSeed-class, 8 AffyTilingPDInfoPkgSeed-class, 9 cdf2table, 10 chipname, 11 chipname,affyexpressionpdinfopkgseed-method chipname,affygeneric1pdinfopkgseed-method chipname,affysnpcnvpdinfopkgseed-method chipname,affysnpcnvpdinfopkgseed2-method chipname,affysnppdinfopkgseed-method chipname,affysnppdinfopkgseed2-method chipname,affystpdinfopkgseed-method chipname,affytilingpdinfopkgseed-method chipname,nimblegenpdinfopkgseed-method getgeometry, 11 getgeometry,affyexpressionpdinfopkgseed-method (getgeometry), 11 getgeometry,affysnpcnvpdinfopkgseed-method (getgeometry), 11 getgeometry,affysnppdinfopkgseed-method (getgeometry), 11 getgeometry,affystpdinfopkgseed-method (getgeometry), 11 getgeometry,affytilingpdinfopkgseed-method (getgeometry), 11 getgeometry,nimblegenpdinfopkgseed-method (getgeometry), 11 makepdinfopackage, 12 makepdinfopackage,affyclariomspdinfopkgseed-method makepdinfopackage,affyexonpdinfopkgseed-method makepdinfopackage,affyexpressionpdinfopkgseed-method makepdinfopackage,affygenepdinfopkgseed-method makepdinfopackage,affyhtapdinfopkgseed-method makepdinfopackage,affymirnapdinfopkgseed-method makepdinfopackage,affysnpcnvpdinfopkgseed-method makepdinfopackage,affysnpcnvpdinfopkgseed2-method 16

INDEX 17 makepdinfopackage,affysnppdinfopkgseed-method makepdinfopackage,affysnppdinfopkgseed2-method makepdinfopackage,affystpdinfopkgseed-method makepdinfopackage,affytilingpdinfopkgseed-method makepdinfopackage,genericpdinfopkgseed-method makepdinfopackage,ngsexpressionpdinfopkgseed-method makepdinfopackage,ngstilingpdinfopkgseed-method NgsExpressionPDInfoPkgSeed-class, 13 NgsTilingPDInfoPkgSeed-class, 14 NimbleGenPDInfoPkgSeed-class, 15 sequenceparser (cdf2table), 10