Package EventPointer

Similar documents
Package metagene. November 1, 2018

Package scmap. March 29, 2019

Package customprodb. September 9, 2018

Package bnbc. R topics documented: April 1, Version 1.4.0

Package HTSeqGenie. April 16, 2019

Package roar. August 31, 2018

Package RTCGAToolbox

Package MEAL. August 16, 2018

Package PGA. July 6, 2018

Package scruff. November 6, 2018

Package omicplotr. March 15, 2019

Package SCAN.UPC. July 17, 2018

Package SIMLR. December 1, 2017

Package dupradar. R topics documented: July 12, Type Package

Package seqcat. March 25, 2019

Package BgeeDB. January 5, 2019

Package SC3. September 29, 2018

Package SCAN.UPC. October 9, Type Package. Title Single-channel array normalization (SCAN) and University Probability of expression Codes (UPC)

Package ChIPseeker. July 12, 2018

Package TFutils. October 11, 2018

Package multihiccompare

Package crossmeta. September 5, 2018

Package mimager. March 7, 2019

Package ccmap. June 17, 2018

Package Linnorm. October 12, 2016

Package transcriptogramer

Package SGSeq. December 31, 2018

Package CAGEr. September 4, 2018

Package RcisTarget. July 17, 2018

Package EMDomics. November 11, 2018

Package Organism.dplyr

Package RNASeqR. January 8, 2019

Package phantasus. March 8, 2019

Package RTNduals. R topics documented: March 7, Type Package

Package TxRegInfra. March 17, 2019

Package SC3. November 27, 2017

Package DRIMSeq. December 19, 2018

Package BDMMAcorrect

Package DMCHMM. January 19, 2019

Package dmrseq. September 14, 2018

Package JunctionSeq. November 1, 2018

Package GeneNetworkBuilder

Package decomptumor2sig

Ensembl RNASeq Practical. Overview

Package AnnotationHub

Package ssizerna. January 9, 2017

Package BiocStyle. December 9, 2018

Working with aligned nucleotides (WORK- IN-PROGRESS!)

Package methyvim. April 5, 2019

Package SingleCellExperiment

Package bumphunter. September 22, 2018

Package MIRA. October 16, 2018

Package IsoCorrectoR. R topics documented:

Package MaxContrastProjection

Package GRmetrics. R topics documented: September 27, Type Package

Package LOLA. December 24, 2018

Package cattonum. R topics documented: May 2, Type Package Version Title Encode Categorical Features

Package M3Drop. August 23, 2018

Package igc. February 10, 2018

Package GREP2. July 25, 2018

Package GreyListChIP

riboseqr Introduction Getting Data Workflow Example Thomas J. Hardcastle, Betty Y.W. Chung October 30, 2017

Our typical RNA quantification pipeline

Package EDASeq. December 30, 2018

Package gpart. November 19, 2018

Package STROMA4. October 22, 2018

Package ECctmc. May 1, 2018

Package SNPediaR. April 9, 2019

Package rsppfp. November 20, 2018

Package ERSSA. November 4, 2018

Package conumee. February 17, 2018

Package robotstxt. November 12, 2017

Package anota2seq. January 30, 2018

Package RNAseqNet. May 16, 2017

Package diffcyt. December 26, 2018

Package methylgsa. March 7, 2019

Package VSE. March 21, 2016

Package wavcluster. January 23, 2019

Package prozor. July 26, 2018

Package OmaDB. R topics documented: December 19, Title R wrapper for the OMA REST API Version Author Klara Kaleb

Package omu. August 2, 2018

Package mfa. R topics documented: July 11, 2018

Package RUVSeq. September 16, 2018

Package AnnotationHub

Package JunctionSeq. November 16, 2015

Package pgca. March 17, 2019

Package MultiDataSet

The software and data for the RNA-Seq exercise are already available on the USB system

Package validara. October 19, 2017

Package semisup. March 10, Version Title Semi-Supervised Mixture Model

Package tximport. November 25, 2017

Package geecc. R topics documented: December 7, Type Package

Package BiocManager. November 13, 2018

Colorado State University Bioinformatics Algorithms Assignment 6: Analysis of High- Throughput Biological Data Hamidreza Chitsaz, Ali Sharifi- Zarchi

Package M3D. March 21, 2019

Sequence Analysis Pipeline

ROTS: Reproducibility Optimized Test Statistic

Package icnv. R topics documented: March 8, Title Integrated Copy Number Variation detection Version Author Zilu Zhou, Nancy Zhang

Package tximport. May 13, 2018

Package CoGAPS. January 24, 2019

Transcription:

Type Package Package EventPointer September 5, 2018 Title An effective identification of alternative splicing events using junction arrays and RNA-Seq data Version 1.4.0 Author Juan Pablo Romero, Ander Muniategui, Fernando Carazo, Ander Aramburu, Angel Rubio Maintainer Juan Pablo Romero <jpromero@ceit.es> EventPointer is an R package to identify alternative splicing events that involve either simple (case-control experiment) or complex experimental designs such as time course experiments and studies including paired-samples. The algorithm can be used to analyze data from either junction arrays (Affymetrix Arrays) or sequencing data (RNA- Seq). The software returns a data.frame with the detected alternative splicing events: gene name, type of event (cassette, alternative 3',...,etc), genomic position, statistical significance and increment of the percent spliced in (Delta PSI) for all the events. The algorithm can generate a series of files to visualize the detected alternative splicing events in IGV. This eases the interpretation of results and the design of primers for standard PCR validation. Depends R (>= 3.4), SGSeq, Matrix, SummarizedExperiment Imports GenomicFeatures, stringr, GenomeInfoDb, igraph, MASS, nnls, limma, matrixstats, RBGL, prodlim, graph, methods, utils, stats, doparallel, foreach, affxparser, GenomicRanges, S4Vectors Suggests knitr, rmarkdown, BiocStyle, RUnit, BiocGenerics License Artistic-2.0 LazyData true RoxygenNote 6.0.1 biocviews AlternativeSplicing, DifferentialSplicing, mrnamicroarray, RNASeq, Transcription, Sequencing VignetteBuilder knitr Url https://github.com/jpromeror/eventpointer BugReports https://github.com/jpromeror/eventpointer/issues git_url https://git.bioconductor.org/packages/eventpointer git_branch RELEASE_3_7 1

2 AllEvents_RNASeq git_last_commit ed5be18 git_last_commit_date 2018-04-30 Date/Publication 2018-09-04 R topics documented: AllEvents_RNASeq..................................... 2 ArraysData......................................... 3 CDFfromGTF........................................ 3 EventDetection....................................... 4 EventPointer........................................ 5 EventPointer_IGV..................................... 6 EventPointer_RNASeq................................... 7 EventPointer_RNASeq_IGV................................ 8 PrepareBam_EP....................................... 9 SG_RNASeq........................................ 10 Index 11 AllEvents_RNASeq Alternative splicing events detected by EventPointer Alternative splicing events detected by EventPointer data(allevents_rnaseq) Format A list object AllEvents_RNASeq[[i]][[j]] displays the jth splicing event for the ith gene. AllEvents_RNASeq object contains all the detected alternative splicing events using EventPointer methodology. The splicing events where detected using the BAM files from the dataset published in Seshagiri et al. 2012 and used in the SGSeq R package vignette.

ArraysData 3 ArraysData Preprocessed arrays data Preprocessed arrays data data(arraysdata) Format A data.frame with preprocessed arrays data. The preprocessing was done using aroma.affymetrix. See the package vignette for the preprocessing pipeline ArraysData object contains preprocessed junction arrays data. The preprocessing was done using aroma.affymetrix R package, refer to EventPointer vignette for the pipeline used for the preprocessing. The data corresponds to 4 samples from the SUM149 Cell line hybridized to the HTA 2.0 Affymetrix array. The first two samples are control and the second ones are treated. CDFfromGTF CDF file creation for EventPointer Generates the CDF file to be used under the aroma.affymetrix framework CDFfromGTF(input = "Ensembl", inputfile = NULL, PSR, Junc, PathCDF, microarray = NULL) input inputfile PSR Junc PathCDF microarray Reference transcriptome used to build the CDF file. Must be one of: "Ensembl", "UCSC", "AffyGTF" or "CustomGTF". If input is "AffyGTF" or "CustomGTF", inputfile should point to the GTF file to be used. Path to the Exon probes txt file Path to the Junction probes txt file Directory where the output will be saved Microarray used to create the CDF file. Must be one of: HTA-2_0, ClariomD, RTA or MTA

4 EventDetection The function displays a progress bar to show the user the progress of the function. However, there is no value returned in R as the function creates three files that are used later by other EventPointer functions. 1) EventsFound.txt : Tab separated file with all the information of all the alternative splcing events found. 2).flat file : Used to build the corresponding CDF file. 3).CDF file: Output required for the aroma.affymetrix preprocessing pipeline. Both the.flat and.cdf file take large ammounts of memory in the hard drive, it is recommended to have at least 1.5 GB of free space. PathFiles<-system.file('extdata',package='EventPointer') DONSON_GTF<-paste(PathFiles,'/DONSON.gtf',sep='') PSRProbes<-paste(PathFiles,'/PSR_Probes.txt',sep='') JunctionProbes<-paste(PathFiles,'/Junction_Probes.txt',sep='') Directory<-tempdir() microarray<-"hta-2_0" # Run the function CDFfromGTF(input='AffyGTF',inputFile=DONSON_GTF,PSR=PSRProbes,Junc=JunctionProbes,PathCDF=Directory,micr EventDetection Detect splicing events using EventPointer methodology Identification of all the alternative splicing events in the splicing graphs EventDetection(Input, cores, Path) Input cores Path Output of the PrepareBam_EP function Number of cores used for parallel processing Directory where to write the EventsFound_RNASeq.txt file list with all the events found for all the genes present in the experiment. It also generates a file called EventsFound_RNASeq.txt with the information for every detected event. # Run EventDetection function data(sg_rnaseq) TxtPath<-tempdir() AllEvents_RNASeq<-EventDetection(SG_RNASeq,cores=1,Path=TxtPath)

EventPointer 5 EventPointer EventPointer Statistical analysis of alternative splcing events EventPointer(Design, Contrast, ExFit, Eventstxt, Filter = TRUE, Qn = 0.25, Statistic = "LogFC", PSI = FALSE) Design Contrast ExFit Eventstxt Filter The design matrix for the experiment. The contrast matrix for the experiment. aroma.affymetrix pre-processed variable after using extractdataframe(affy, addnames=true) Path to the EventsFound.txt file generated by CDFfromGTF function. Boolean variable to indicate if an expression filter is applied Qn Quantile used to filter the events (Bounded between 0-1, Q1 would be 0.25). Statistic PSI Statistical test to identify differential splicing events, must be one of : LogFC, Dif_LogFC or DRS. Boolean variable to indicate if Delta PSI should be calculated for every splicing event. Data.frame ordered by the splicing p.value. The object contains the different information for each splicing event such as Gene name, event type, genomic position, p.value, z.value and delta PSI. data(arraysdata) Dmatrix<-matrix(c(1,1,1,1,0,0,1,1),nrow=4,ncol=2,byrow=FALSE) Cmatrix<-t(t(c(0,1))) EventsFound<-paste(system.file('extdata',package='EventPointer'),'/EventsFound.txt',sep='') Events<-EventPointer(Design=Dmatrix, Contrast=Cmatrix, ExFit=ArraysData, Eventstxt=EventsFound, Filter=TRUE, Qn=0.25, Statistic='LogFC', PSI=TRUE)

6 EventPointer_IGV EventPointer_IGV EventPointer IGV Visualization Generates of files to be loaded in IGV for visualization and interpretation of events EventPointer_IGV(Events, input, inputfile = NULL, PSR, Junc, PathGTF, EventsFile, microarray = NULL) Events input inputfile PSR Junc PathGTF EventsFile microarray Data.frame generated by EventPointer with the events to be included in the GTF file. Reference transcriprome. Must be one of: "Ensembl", "UCSC", "AffyGTF" or "CustomGTF". If input is "AffyGTF" or "CustomGTF", inputfile should point to the GTF file to be used. Path to the Exon probes txt file. Path to the Junction probes txt file. Directory where to write the GTF files. Path to EventsFound.txt file generated with CDFfromGTF function. Microarray used to create the CDF file. Must be one of: HTA-2_0, ClariomD, RTA or MTA The function displays a progress bar to show the user the progress of the function. Once the progress bar reaches 100 files are written to the specified directory in PathGTF. The created files are: 1) paths.gtf : GTF file representing the alternative splicing events and 2) probes.gtf : GTF file representing the probes that measure each event and each path. PathFiles<-system.file('extdata',package='EventPointer') DONSON_GTF<-paste(PathFiles,'/DONSON.gtf',sep='') PSRProbes<-paste(PathFiles,'/PSR_Probes.txt',sep='') JunctionProbes<-paste(PathFiles,'/Junction_Probes.txt',sep='') Directory<-tempdir() data(arraysdata) Dmatrix<-matrix(c(1,1,1,1,0,0,1,1),nrow=4,ncol=2,byrow=FALSE) Cmatrix<-t(t(c(0,1))) EventsFound<-paste(system.file('extdata',package='EventPointer'),'/EventsFound.txt',sep='') Events<-EventPointer(Design=Dmatrix,

EventPointer_RNASeq 7 Contrast=Cmatrix, ExFit=ArraysData, Eventstxt=EventsFound, Filter=TRUE, Qn=0.25, Statistic='LogFC', PSI=TRUE) EventPointer_IGV(Events=Events[1,,drop=FALSE], input='affygtf', inputfile=donson_gtf, PSR=PSRProbes, Junc=JunctionProbes, PathGTF=Directory, EventsFile= EventsFound, microarray="hta-2_0") EventPointer_RNASeq Statistical analysis of alternative splcing events for RNASeq data Statistical analysis of all the alternative splicing events found in the given bam files. EventPointer_RNASeq(Events, Design, Contrast, Statistic = "LogFC", PSI = FALSE) Events Design Contrast Statistic PSI Output from EventDetection function The design matrix for the experiment. The contrast matrix for the experiment. Statistical test to identify differential splicing events, must be one of : LogFC, Dif_LogFC and DRS. Boolean variable to indicate if PSI should be calculated for every splicing event. Data.frame ordered by the splicing p.value. The object contains the different information for each splicing event such as Gene name, event type, genomic position, p.value, z.value and delta PSI. data(allevents_rnaseq) Dmatrix<-matrix(c(1,1,1,1,1,1,1,1,0,0,0,0,1,1,1,1),ncol=2,byrow=FALSE) Cmatrix<-t(t(c(0,1))) Events <- EventPointer_RNASeq(AllEvents_RNASeq,Dmatrix,Cmatrix,Statistic='LogFC',PSI=TRUE)

8 EventPointer_RNASeq_IGV EventPointer_RNASeq_IGV EventPointer RNASeq IGV Visualization Generates of files to be loaded in IGV for visualization and interpretation of events EventPointer_RNASeq_IGV(Events, SG_RNASeq, EventsTxt, PathGTF) Events SG_RNASeq EventsTxt PathGTF Data.frame generated by EventPointer_RNASeq with the events to be included in the GTF file. Output from PrepareBam_EP function. Contains splicing graphs components. Path to EventsFound.txt file generated with EventDetection function Directory where to write the GTF files. The function displays a progress bar to show the user the progress of the function. Once the progress bar reaches 100 file is written to the specified directory in PathGTF. The created file: 1) paths_rnaseq.gtf : GTF file representing the alternative splicing events. data(allevents_rnaseq) data(sg_rnaseq) # Run EventPointer Dmatrix<-matrix(c(1,1,1,1,1,1,1,1,0,0,0,0,1,1,1,1),ncol=2,byrow=FALSE) Cmatrix<-t(t(c(0,1))) Events <- EventPointer_RNASeq(AllEvents_RNASeq,Dmatrix,Cmatrix,Statistic='LogFC',PSI=TRUE) # IGV Visualization EventsTxt<-paste(system.file('extdata',package='EventPointer'),'/EventsFound_RNASeq.txt',sep='') PathGTF<-tempdir() EventPointer_RNASeq_IGV(Events,SG_RNASeq,EventsTxt,PathGTF)

PrepareBam_EP 9 PrepareBam_EP Bam files preparation for EventPointer Prepares the information contained in.bam files to be analyzed by EventPointer PrepareBam_EP(Samples, SamplePath, Ref_Transc = "Ensembl", filetransc = NULL, cores = 1, Alpha = 2) Samples SamplePath Ref_Transc filetransc cores Alpha Name of the.bam files to be analyzed (Sample1.bam,Sample2.bam,...,etc). Path where the bam files are stored. Reference transcriptome used to name the genes found in bam files. Options are: Ensembl, UCSC or GTF. Path to the GTF reference transcriptome ff Ref_Transc is GTF. Number of cores used for parallel processing. Internal SGSeq parameter to include or exclude regions SGFeaturesCounts object. It contains a GRanges object with the corresponding elements to build the different splicing graphs found and the counts related to each of the elements. ## Not run: # Obtain the samples and directory for.bam files BamInfo<-si Samples<-BamInfo[,2] PathToSamples <- system.file('extdata/bams', package = 'SGSeq') PathToGTF<-paste(system.file('extdata',package='EventPointer'),'/FBXO31.gtf',sep='') # Run PrepareBam function SG_RNASeq<-PrepareBam_EP(Samples=Samples, SamplePath=PathToSamples, Ref_Transc='GTF', filetransc=pathtogtf, cores=1) ## End(Not run)

10 SG_RNASeq SG_RNASeq Splicing graph elements predicted from BAM files Format Splicing graph elements predicted from BAM files data(sg_rnaseq) A SGFeatureCounts objects with predicted splicing graph features and counts SG_RNASeq object displays the predicted features found in the BAM files from the dataset published in Seshagiri et al. 2012 and used in the SGSeq R package vignette.

Index AllEvents_RNASeq, 2 ArraysData, 3 CDFfromGTF, 3 EventDetection, 4 EventPointer, 5 EventPointer_IGV, 6 EventPointer_RNASeq, 7 EventPointer_RNASeq_IGV, 8 PrepareBam_EP, 9 SG_RNASeq, 10 11