Package polyphemus. February 15, 2013

Size: px
Start display at page:

Download "Package polyphemus. February 15, 2013"

Transcription

1 Package polyphemus February 15, 2013 Type Package Title Genome wide analysis of RNA Polymerase II-based ChIP Seq data. Version Date Author Martial Sankar, supervised by Marco Mendoza and Hinrich Gronenmeyer Maintainer Martial Sankar Polyphemus is able to combine signal intensity (count) profiles, with significant peak annotations in order to identify coding regions of interest for the comparative analysis or two or more samples. The algorithm (i) identifies significant coding regions, (ii) performs normalization, (iii) compares several profiles and (iv) provides matrix output suitable for further clustering analysis. License GPL (>= 2) LazyLoad yes Depends R (>= ) OS_type unix Imports IRanges, zoo, RUnit, parallel, R.utils, limma URL Repository CRAN Date/Publication :21:15 NeedsCompilation no 1

2 2 polyphemus-package R topics documented: polyphemus-package createfolderhierarchy generatewig getcodreg getincrbyg getint getlowessfitval getmvaplot getnorm getoutput getresult getswv gwcomp lowessnorm medichimerge quantilenorm stretchgene Index 18 polyphemus-package R package for comparative analysis of genome-wide RNA Polymerase II ChIP-seq data using non-linear normalisation approaches. The polyphemus package was designed to extract comparative information about PolII action at different gene loci from ChIP-seq profiling. This permits to (i) integrates TSS annotation with PolII enrichment over coding regions, (ii) normalizes signal intensity profiles using non-linear approaches to correct for technical/experimental variations and iii) generates outputs for gene classification according to differential transcription characteristics. Package: polyphemus Type: Package Version: Date: License: GPL (>=2) LazyLoad: yes

3 createfolderhierarchy 3 Martial Sankar directed by Marco Mendoza and Hinrich Gronemeyer Maintainer: Martial Sankar <martial.sankar@unil.ch> References Reiss, D. J., M. T. Facciotti, et al. (2008). "Model-based deconvolution of genome-wide DNA binding." Bioinformatics 24(3): Rozowsky, J., G. Euskirchen, et al. (2009). "PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls." Nat Biotechnol 27(1): Taslim, C., J. Wu, et al. (2009). "Comparative study on ChIP-seq data: normalization and binding pattern characterization." Bioinformatics 25(18): Examples fpath1<-system.file("extdata", package="polyphemus") fpath2<-file.path(fpath1,"atra") ex <- gwcomp(initdir=fpath2, posprocparam=c(300, 0.1,NA), denomtrack="etoh", database=file.path(fpath1,"mouserefseq-genenames.txt"), normmethod="quantile", stretched =50, TSSLength = 10, window=1000, span=500,prepost=c(0,0), outinttable=true, plotmva=true) createfolderhierarchy Create hierarchy of folder. createfolderhierarchy create a hierarchy of folders. The functions takes in input a folder of pooled files (bed and wig format). One folder is then created for each chromosome. the folder name corresponds to the name of the chromosome (i.e: "chr1", "chrx"... ). The subfolders "pos" and "wig" are then created. They contain respectively the bed and wig files. createfolderhierarchy(initialdir) initialdir character the path of the folder containing the pooled files. If the folders and subfolders are already created nothing is done. Martial Sankar

4 4 generatewig Examples fpath<-system.file("extdata", package="polyphemus") createfolderhierarchy(file.path(fpath,"atra")) generatewig Generate Wig files generatewig Generate Wig files from a count files [deprecated] generatewig(countdf, col, chr, dbori, window, span, prespost = c(1000, 1000), out) countdf col chr dbori window span prespost out data.frame, two columns dataset (Start, End position) numeric, column index of the count column numeric, chromosome index data.frame, intial database (refseq) numeric, size of the sliding window numeric, span of the sliding window vector, the number of pb before and after the gene, c(int, int) character, output path and file name of the generated wwig file deprecated function.

5 getcodreg 5 getcodreg extract intensity (count) values from the wig file. getcodreg permits to define coding regions and extract intensity (count) values from the wig file. getcodreg(wiggfile, prepost, window, dbcurr, chrcurr, span = NULL) wiggfile prepost window dbcurr chrcurr span character, path and filename to a wig file. vector, contains two values, the number of bp (base pairs) before and the number of bp after the coding region. numeric, size of the sliding window (in bp). data.frame, corresponding to a subset of the database file for the processing chromosome. character, name of the processing chromosome. numeric, if NULL take the span of the wig file else take the specified span. Value returns a list. each element corresponds to the vector of intensities for the specific gene. Martial Sankar getincrbyg sub-function of generatewig getincrbyg, sub-function of generatewig. getincrbyg(gcurr, col, countdf, dbori, window, span, prespost)

6 6 getint gcurr col countdf dbori window span prespost String, gene name. numeric, column index. data.frame, two columns dataset (Start, End position). data.frame, intial database (refseq). numeric, size of the sliding window. numeric, span of the sliding window. vector, the number of pb before and after the gene, c(int, int). generatewig getint Extract intensities and start the comparative analysis. getint permits to extract intensities from files in one chromosome folder and start the comparative analysis. getint(folderpath, medichiproc = c(null, NULL, NULL), prepost = c(500, 2000), window = 250, denomtra folderpath medichiproc prepost window denomtrack databasefile puttss span character, path to the wig folder. vector, c(window, pvalue filters, String path and name for output (NULL default)). see the details section. vector, contains two values, the number of bp (base pair) before and the number of bp after the coding region. numeric, size of the sliding window (in bp). character, the denominator track name (string contained in the filename). character, path and name of the database file. logical, if TRUE, detect the putative TSS inside the coding region. (not yet implemented). numeric, if NULL take the span of the wig file else take the specified span.

7 getlowessfitval 7 Value The denomtrack argument is the exact String of the denominator track contained in the filename. Its intensity values will be used as steady denominator when calculating the ratio between tracks. The medichiproc argument is a vector that contains three elements. The first is the window size where the TSS is centered. The second is a filter on the pvalue. The third is the file path and name of the output table. if NULL, the output file is not provided. The getint function is also called by the wrapper gwcomp. The getint function returns an object (list) of, at least, four elements. It can be more depending on the number of tracks in the experiments. gene chr denominator numerator character, the gene annotation. character, the chromosome name. numeric, intensities value for the denominator track. numeric, intensities value for the first numerator track.... numeric, intensities value for the other track. Martial Sankar See also gwcomp for genome wide analysis (in all chromosomes). See also getcodreg. Examples fpath1<-system.file("extdata", package="polyphemus") fpath2<-file.path(file.path(fpath1,"atra"),"chr19") x <- getint(folderpath=fpath2, # path of the parent folder of the wig directory for the chr19 medichiproc = c(300, 0.1, NULL), prepost = c(0, 0), window = 1000, denomtrack="etoh", puttss = FALSE, databasefile=file.path(fpath1,"mouserefseq-genenames.txt"), span = 500) getlowessfitval get fitted values from a lowess regression. getlowessfitval, return a list of table where it was extracted the fit values from a lowess regression, getlowessfitval(id, list, toplot = FALSE)

8 8 getmvaplot id list toplot character, geneid from the final output of polyphemus character, splitted polyphemus output (by ID) Boolean, whether the the fitted values should be added to the mva plot subfunction of getoutput See also getoutput. getmvaplot Display the MVA plot getmvaplot, call by lowessnorm & quantilenorm function, display the mva plot getmvaplot(matraw, matnorm, normmeth = NULL) matraw matnorm normmeth matrix, initial matrix, contains the raw intensity matrix, normalized matrix character, normalization methods subfunction of quantilenorm and lowessnorm. See also quantilenorm and lowessnorm.

9 getnorm 9 getnorm Normalization The getnorm permits the normalization of the intensity (count) values. getnorm(objint, norm = c("linear", "lowess", "quantile"), lowessspan = NULL, mva = NULL) objint norm lowessspan mva list, the intensity object. Output of the getint function. character, the normalization method : "linear" for linear normalization, "lowess" and "quantile" for local normalization. See details section. numeric, the smoother span. This gives the proportion of points in the plot which influence the smooth at each value. Larger values give more smoothness but requires more computing time. if NULL, lowess span = 2/3. logical, if TRUE, the MVA figures are plotted. Polyphemus provides three normalization procedure. Choosing a "linear" normalization, a correction factor is applied such that the sum of the intensities for a track is the same for all the tracks. Choosing a local normalisation, "lowess" or "quantile", a correction factor is locally calculating and applied to the track intensities. This turns to be more reliable than classic "linear" normalisation. Taslim et al. already mentionned it to normalize PolII ChIP-seq data. However when dealing with more than two samples, the "quatile" procedure turns to be more suitable as it requires less computing time. The "lowess" normalization used the lowess implementation of the R base package. The "quantile" procedure used the normalizequantiles of the limma package. getnorm is called in the polyphemus wrapper gwcomp. Value returns an object (list) containing three elements. gene chr matrix character, the gene annotation. chr, the chromosome name. matrix,the matrix of normalized intensity values. Martial Sankar

10 10 getoutput References Taslim, C., J. Wu, et al. (2009). "Comparative study on ChIP-seq data: normalization and binding pattern characterization." Bioinformatics 25(18): gwcomp. getoutput get cleaner output from POLYPHEMUS results data postprocessing function, return a list of table with 3 columns (ID, ratio, approx), used by default in getresult function. getoutput(inputtable = NULL, filename = NULL, norm, stretch, tssl, chr, outputdir, denostring, time, inputtable data.frame, polyphemus output, if NULL filename is used. filename norm stretch tssl chr outputdir denostring time toplot subfunction of getresult. String, the path and name of polyphemus output if NULL the inputtable is used. String, normalization method numeric, if a length value is specified, the ChIP seq ratio profiles will be "stretched" or compacted to reach the length value. if NULL is specified the median length of the overall profiles is used. numeric, estimated length for the TSS region. The TSS part of the coding region will be not "stretched". numeric, the chromosome index. numeric, the chromosome index. String, column name for the pairwise comparison. String, time generated by Sys.time function Boolean, whether to plot lowess fit values

11 getresult 11 as getresult getresult final result output the getresult function permits to get the final table output suitable for further clustering analysis. getresult(objint, iter = "", outinttable = FALSE, out, norm = "quantile", stretchval, tsslength, den objint iter outinttable out norm stretchval tsslength denostring list, is the object output of the getnorm function. character. not used. logical, if TRUE save the intermediate intensity tables. character, is the path of the output folder. character, is the used normalization method. numeric, is the length value to use to uniformize the coding region length. See section (units: number of span). numeric, is the approximate length of the TSS to used to separate TSS and body regions (units : number of span). character, is the name of the denominator track. The stretchval and tsslength arguments takes a numerical value in -number of span- units. The span length (in bp) is an argument of the gwcomp and getint functions. Users can easily determine the number of span length to define the TSSlength. if one assumes the TSS length is 1000 bp and he uses a span of 10 bp. The TSSLength arguments will be 100. The same method can be used to calculate the stretchval argument. Value returns one file by ratio between tracks. These output files can then reload in R or used in external program for clustering analysis. Martial Sankar See also gwcomp, getint, getnorm.

12 12 getswv getswv get the Sliding Windows values by genes getswv, get the Sliding Windows Values by genes. Values are obtained using the median of the tags counts. getswv(gid, wiggtable, commontable, dbcurr, window = 250, span) gid wiggtable commontable dbcurr window span numeric, the Gene ID. data.frame,the entire wiggtable. data.frame, the results matrix of overlap function. data.frame, the entire database. numeric, the sliding window size. numeric, if NULL take the span of the WIGG. subfunction of getcodreg. Value return average inside the sliding window intensity. as getcodreg

13 gwcomp 13 gwcomp Polyphemus Wrapper The gwcomp function is the polyphemus wrapper. This calls the sub-fonctions requires to performs the comparative analysis of the ChIP-seq profiles (multi-states analysis) in a genome wide manner (whole genome analysis). gwcomp(initdir, posprocparam = c(null, NULL, NULL), prepost = c(500, 2000), window = 250, denomtrack InitDir posprocparam prepost window denomtrack database puttss span normmethod stretched TSSLength outinttable plotmva logfile character the path of the folder containing the pooled files. If the folder already contained hierarchized folder. nothing is done. vector, are the post-processing parameters c(window, pvalue filters, String path and name for output (NULL default)). vector, contains two values, the number of bp (base pair) before and the number of bp after the coding region. numeric, size of the sliding window (in bp). character, the denominator track name (string contained in the filename) character, path and name of the database file. logical, if TRUE, detect the putative TSS inside the coding region. (not yet implemented). numeric, if NULL take the span of the wig file else take the specified span. character, the normalization method : "linear" for linear normalization, "lowess" and "quantile" for local normalization. numeric, is the length value to use to uniformize the coding region length. See section (units: number of span). numeric, is the approximate length of the TSS to used to separate TSS and body regions (units : number of span). logical, if TRUE save the intermediate intensity tables. character, path and name to save the MVA plots. if NULL, the MVA figures are not plotted. logical, if TRUE a log file is save in the current folder.

14 14 gwcomp The denomtrack argument is the exact String of the denominator track contained in the filename. Its intensity values will be used as steady denominator when calculating the ratio between tracks. The posprocparam argument is a vector that contains three elements. The first is the window size where the TSS is centered. The second is a filter on the pvalue. The third is the file path and name of the output table. if NULL, the output file is not provided. The Polyphemus package provides three normalization procedure. Choosing a "linear" normalization, a correction factor is applied such that the sum of the intensities for a track is the same for all the tracks. Choosing a local normalisation, "lowess" or "quantile", a correction factor is locally calculating and applied to the track intensities. This turns to be more reliable than classic "linear" normalisation. Taslim et al. already mentionned it to normalize PolII ChIP-seq data. However when dealing with more than two samples, the "quatile" procedure turns to be more suitable as it requires less computing time. The "lowess" normalization used the lowess implementation of the R base package. The "quantile" procedure used the normalizequantiles of the limma package. The stretched and tsslength arguments takes a numerical value in -number of span- units. The span length (in bp) is an argument of the gwcomp and getint functions. Users can easily determine the number of span length to define the TSSlength. if one assumes the TSS length is 1000 bp and he uses a span of 10 bp. The TSSLength arguments will be 100. The same method can be used to calculate the stretched argument. Value returns one file by ratio between tracks. These output files can then reload in R or used in external program for clustering analysis. Martial Sankar References Taslim, C., J. Wu, et al. (2009). "Comparative study on ChIP-seq data: normalization and binding pattern characterization." Bioinformatics 25(18): See also the wrapped functions : getint, getnorm, getresult. Examples fpath1<-system.file("extdata", package="polyphemus") fpath2<-file.path(fpath1,"atra") ex <- gwcomp(initdir=fpath2, posprocparam=c(300, 0.1,NA), denomtrack="etoh", database=file.path(fpath1,"mouserefseq-genenames.txt"), normmethod="quantile", stretched =50, TSSLength = 10, window=1000, span=500,prepost=c(0,0), outinttable=true, plotmva=true)

15 lowessnorm 15 lowessnorm lowess normalization lowessnorm, perform the lowess normalisation. Called by the getnorm function. lowessnorm(listint, ff = 2/3, plotmva = NULL) listint ff plotmva numeric, vector of intensity (SWI) values. numeric, the span value for lowess. see lowess help. Boolean, whether to display the MVA plot (on the X). subfunction of getnorm. lowess, getnorm medichimerge Merge MeDiChi files. medichimerge, merge all medichi files for one chromosome & return the list of peaks and their position. medichimerge(files, windo, cutoff = 0.01, write = NA) files windo cutoff write vector, vector of String containing the path and name of each files. numeric, the length of the window range numeric, filers on the cutoff values of MediChi[1] String, path and file name for the output merged file. if NA, not output.

16 16 quantilenorm subfunction in getint References [1] Reiss, D. J., M. T. Facciotti, et al. (2008). "Model-based deconvolution of genome-wide DNA binding." Bioinformatics 24(3): getint quantilenorm quantile normalization lowessnorm, perform the quantile normalisation. Called by the getnorm function. quantilenorm(listint, plotmva = FALSE) listint plotmva numeric, vector of intensity (SWI) values. Boolean, whether to display the MVA plot (on the X). subfunction of getnorm. getnorm

17 stretchgene 17 stretchgene "stretch" genes. return a table with stretched body of the profile stretchgene(genetab, tsscut, lengthref) genetab tsscut lengthref data.frame, the gene table (id, logratio, fittedval). numeric,number of line considered as part of the TSS region. numeric, the length of the reference gene. subfunction of getoutput. Value data frame of "streched" or "compacted" body region of the gene. getoutput

18 Index Topic Wig generatewig, 4 Topic count getint, 6 Topic hierarchy createfolderhierarchy, 3 Topic intensity getint, 6 Topic normalisation getnorm, 9 Topic normalization getmvaplot, 8 getnorm, 9 lowessnorm, 15 quantilenorm, 16 Topic package polyphemus-package, 2 Topic ratio getoutput, 10 getresult, 11 Topic wig getincrbyg, 5 Topic wrapper gwcomp, 13 lowessnorm, 8, 15 medichimerge, 15 polyphemus (polyphemus-package), 2 polyphemus-package, 2 quantilenorm, 8, 16 stretchgene, 17 createfolderhierarchy, 3 generatewig, 4 getcodreg, 5, 7, 12 getincrbyg, 5 getint, 6, 9, 11, 14, 16 getlowessfitval, 7 getmvaplot, 8 getnorm, 9, 11, getoutput, 8, 10, 17 getresult, 11, 11, 14 getswv, 12 gwcomp, 7, 10, 11, 13, 14 lowess, 9, 14, 15 18

Package cgh. R topics documented: February 19, 2015

Package cgh. R topics documented: February 19, 2015 Package cgh February 19, 2015 Version 1.0-7.1 Date 2009-11-20 Title Microarray CGH analysis using the Smith-Waterman algorithm Author Tom Price Maintainer Tom Price

More information

Package GSCA. October 12, 2016

Package GSCA. October 12, 2016 Type Package Title GSCA: Gene Set Context Analysis Version 2.2.0 Date 2015-12-8 Author Zhicheng Ji, Hongkai Ji Maintainer Zhicheng Ji Package GSCA October 12, 2016 GSCA takes as input several

More information

Package GSCA. April 12, 2018

Package GSCA. April 12, 2018 Type Package Title GSCA: Gene Set Context Analysis Version 2.8.0 Date 2015-12-8 Author Zhicheng Ji, Hongkai Ji Maintainer Zhicheng Ji Package GSCA April 12, 2018 GSCA takes as input several

More information

Package GCAI.bias. February 19, 2015

Package GCAI.bias. February 19, 2015 Type Package Package GCAI.bias February 19, 2015 Title Guided Correction Approach for Inherited bias (GCAI.bias) Version 1.0 Date 2014-07-14 Author Guoshuai Cai Maintainer Guoshuai Cai

More information

Package EMDomics. November 11, 2018

Package EMDomics. November 11, 2018 Package EMDomics November 11, 2018 Title Earth Mover's Distance for Differential Analysis of Genomics Data The EMDomics algorithm is used to perform a supervised multi-class analysis to measure the magnitude

More information

Package MIRA. October 16, 2018

Package MIRA. October 16, 2018 Version 1.2.0 Date 2018-04-17 Package MIRA October 16, 2018 Title Methylation-Based Inference of Regulatory Activity MIRA measures the degree of ``dip'' in methylation level surrounding a regulatory site

More information

Package PlasmaMutationDetector

Package PlasmaMutationDetector Type Package Package PlasmaMutationDetector Title Tumor Mutation Detection in Plasma Version 1.7.2 Date 2018-05-16 June 11, 2018 Author Yves Rozenholc, Nicolas Pécuchet, Pierre Laurent-Puig Maintainer

More information

Easy visualization of the read coverage using the CoverageView package

Easy visualization of the read coverage using the CoverageView package Easy visualization of the read coverage using the CoverageView package Ernesto Lowy European Bioinformatics Institute EMBL June 13, 2018 > options(width=40) > library(coverageview) 1 Introduction This

More information

Package TilePlot. February 15, 2013

Package TilePlot. February 15, 2013 Package TilePlot February 15, 2013 Type Package Title Characterization of functional genes in complex microbial communities using tiling DNA microarrays Version 1.3 Date 2011-05-04 Author Ian Marshall

More information

Package roar. August 31, 2018

Package roar. August 31, 2018 Type Package Package roar August 31, 2018 Title Identify differential APA usage from RNA-seq alignments Version 1.16.0 Date 2016-03-21 Author Elena Grassi Maintainer Elena Grassi Identify

More information

Package BAC. R topics documented: November 30, 2018

Package BAC. R topics documented: November 30, 2018 Type Package Title Bayesian Analysis of Chip-chip experiment Version 1.42.0 Date 2007-11-20 Author Raphael Gottardo Package BAC November 30, 2018 Maintainer Raphael Gottardo Depends

More information

Triform: peak finding in ChIP-Seq enrichment profiles for transcription factors

Triform: peak finding in ChIP-Seq enrichment profiles for transcription factors Triform: peak finding in ChIP-Seq enrichment profiles for transcription factors Karl Kornacker * and Tony Håndstad October 30, 2018 A guide for using the Triform algorithm to predict transcription factor

More information

Package conumee. February 17, 2018

Package conumee. February 17, 2018 Package conumee February 17, 2018 Title Enhanced copy-number variation analysis using Illumina DNA methylation arrays Version 1.13.0 Author Volker Hovestadt, Marc Zapatka Maintainer Address Division of

More information

Package demi. February 19, 2015

Package demi. February 19, 2015 Type Package Package demi February 19, 2015 Title Differential Expression from Multiple Indicators Implementation of the DEMI method for the analysis of high-density microarray data. URL http://biit.cs.ut.ee/demi

More information

How to use the DEGseq Package

How to use the DEGseq Package How to use the DEGseq Package Likun Wang 1,2 and Xi Wang 1. October 30, 2018 1 MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST /Department of Automation, Tsinghua University. 2

More information

Package RobustRankAggreg

Package RobustRankAggreg Type Package Package RobustRankAggreg Title Methods for robust rank aggregation Version 1.1 Date 2010-11-14 Author Raivo Kolde, Sven Laur Maintainer February 19, 2015 Methods for aggregating ranked lists,

More information

Package VSE. March 21, 2016

Package VSE. March 21, 2016 Type Package Title Variant Set Enrichment Version 0.99 Date 2016-03-03 Package VSE March 21, 2016 Author Musaddeque Ahmed Maintainer Hansen He Calculates

More information

Package CONOR. August 29, 2013

Package CONOR. August 29, 2013 Package CONOR August 29, 2013 Type Package Title CONOR Version 1.0.2 Date 2010-11-06 Author Jason Rudy and Faramarz Valafar Maintainer Jason Rudy Description CrOss-platform NOrmalization

More information

Package ChIPseeker. July 12, 2018

Package ChIPseeker. July 12, 2018 Type Package Package ChIPseeker July 12, 2018 Title ChIPseeker for ChIP peak Annotation, Comparison, and Visualization Version 1.16.0 Encoding UTF-8 Maintainer Guangchuang Yu

More information

Package ABSSeq. September 6, 2018

Package ABSSeq. September 6, 2018 Type Package Package ABSSeq September 6, 2018 Title ABSSeq: a new RNA-Seq analysis method based on modelling absolute expression differences Version 1.34.1 Author Wentao Yang Maintainer Wentao Yang

More information

CLC Server. End User USER MANUAL

CLC Server. End User USER MANUAL CLC Server End User USER MANUAL Manual for CLC Server 10.0.1 Windows, macos and Linux March 8, 2018 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej 2 Prismet DK-8000 Aarhus C Denmark

More information

Package RAPIDR. R topics documented: February 19, 2015

Package RAPIDR. R topics documented: February 19, 2015 Package RAPIDR February 19, 2015 Title Reliable Accurate Prenatal non-invasive Diagnosis R package Package to perform non-invasive fetal testing for aneuploidies using sequencing count data from cell-free

More information

Package Tnseq. April 13, 2017

Package Tnseq. April 13, 2017 Version 0.1.2 Date 2017-4-13 Package Tnseq April 13, 2017 Title Identification of Conditionally Essential Genes in Transposon Sequencing Studies Author Lili Zhao Maintainer Lili Zhao

More information

Analysis of ChIP-seq data

Analysis of ChIP-seq data Before we start: 1. Log into tak (step 0 on the exercises) 2. Go to your lab space and create a folder for the class (see separate hand out) 3. Connect to your lab space through the wihtdata network and

More information

Course on Microarray Gene Expression Analysis

Course on Microarray Gene Expression Analysis Course on Microarray Gene Expression Analysis ::: Normalization methods and data preprocessing Madrid, April 27th, 2011. Gonzalo Gómez ggomez@cnio.es Bioinformatics Unit CNIO ::: Introduction. The probe-level

More information

Package les. October 4, 2013

Package les. October 4, 2013 Package les October 4, 2013 Type Package Title Identifying Differential Effects in Tiling Microarray Data Version 1.10.0 Date 2011-10-10 Author Julian Gehring, Clemens Kreutz, Jens Timmer Maintainer Julian

More information

Package geecc. R topics documented: December 7, Type Package

Package geecc. R topics documented: December 7, Type Package Type Package Package geecc December 7, 2018 Title Gene Set Enrichment Analysis Extended to Contingency Cubes Version 1.16.0 Date 2016-09-19 Author Markus Boenn Maintainer Markus Boenn

More information

Package TilePlot. April 8, 2011

Package TilePlot. April 8, 2011 Type Package Package TilePlot April 8, 2011 Title This package analyzes functional gene tiling DNA microarrays for studying complex microbial communities. Version 1.1 Date 2011-04-07 Author Ian Marshall

More information

User's guide to ChIP-Seq applications: command-line usage and option summary

User's guide to ChIP-Seq applications: command-line usage and option summary User's guide to ChIP-Seq applications: command-line usage and option summary 1. Basics about the ChIP-Seq Tools The ChIP-Seq software provides a set of tools performing common genome-wide ChIPseq analysis

More information

ChIP-seq hands-on practical using Galaxy

ChIP-seq hands-on practical using Galaxy ChIP-seq hands-on practical using Galaxy In this exercise we will cover some of the basic NGS analysis steps for ChIP-seq using the Galaxy framework: Quality control Mapping of reads using Bowtie2 Peak-calling

More information

ChIP-Seq Tutorial on Galaxy

ChIP-Seq Tutorial on Galaxy 1 Introduction ChIP-Seq Tutorial on Galaxy 2 December 2010 (modified April 6, 2017) Rory Stark The aim of this practical is to give you some experience handling ChIP-Seq data. We will be working with data

More information

Package imgur. R topics documented: December 20, Type Package. Title Share plots using the imgur.com image hosting service. Version 0.1.

Package imgur. R topics documented: December 20, Type Package. Title Share plots using the imgur.com image hosting service. Version 0.1. Package imgur December 20, 2010 Type Package Title Share plots using the imgur.com image hosting service Version 0.1.4 Date 2010-12-18 Author Aaron Statham Maintainer Aaron Statham

More information

Package detectruns. February 6, 2018

Package detectruns. February 6, 2018 Type Package Package detectruns February 6, 2018 Title Detect Runs of Homozygosity and Runs of Heterozygosity in Diploid Genomes Version 0.9.5 Date 2018-02-05 Detection of runs of homozygosity and of heterozygosity

More information

Package FunciSNP. November 16, 2018

Package FunciSNP. November 16, 2018 Type Package Package FunciSNP November 16, 2018 Title Integrating Functional Non-coding Datasets with Genetic Association Studies to Identify Candidate Regulatory SNPs Version 1.26.0 Date 2013-01-19 Author

More information

Package seqcat. March 25, 2019

Package seqcat. March 25, 2019 Package seqcat March 25, 2019 Title High Throughput Sequencing Cell Authentication Toolkit Version 1.4.1 The seqcat package uses variant calling data (in the form of VCF files) from high throughput sequencing

More information

The ChIP-seq quality Control package ChIC: A short introduction

The ChIP-seq quality Control package ChIC: A short introduction The ChIP-seq quality Control package ChIC: A short introduction April 30, 2018 Abstract The ChIP-seq quality Control package (ChIC) provides functions and data structures to assess the quality of ChIP-seq

More information

Expander 7.2 Online Documentation

Expander 7.2 Online Documentation Expander 7.2 Online Documentation Introduction... 2 Starting EXPANDER... 2 Input Data... 3 Tabular Data File... 4 CEL Files... 6 Working on similarity data no associated expression data... 9 Working on

More information

srap: Simplified RNA-Seq Analysis Pipeline

srap: Simplified RNA-Seq Analysis Pipeline srap: Simplified RNA-Seq Analysis Pipeline Charles Warden October 30, 2017 1 Introduction This package provides a pipeline for gene expression analysis. The normalization function is specific for RNA-Seq

More information

Package gsalib. R topics documented: February 20, Type Package. Title Utility Functions For GATK. Version 2.1.

Package gsalib. R topics documented: February 20, Type Package. Title Utility Functions For GATK. Version 2.1. Package gsalib February 20, 2015 Type Package Title Utility Functions For GATK Version 2.1 Date 2014-12-09 Author Maintainer Geraldine Van der Auwera This package contains

More information

Package geecc. October 9, 2015

Package geecc. October 9, 2015 Type Package Package geecc October 9, 2015 Title Gene set Enrichment analysis Extended to Contingency Cubes Version 1.2.0 Date 2014-12-31 Author Markus Boenn Maintainer Markus Boenn

More information

panda Documentation Release 1.0 Daniel Vera

panda Documentation Release 1.0 Daniel Vera panda Documentation Release 1.0 Daniel Vera February 12, 2014 Contents 1 mat.make 3 1.1 Usage and option summary....................................... 3 1.2 Arguments................................................

More information

Package icnv. R topics documented: March 8, Title Integrated Copy Number Variation detection Version Author Zilu Zhou, Nancy Zhang

Package icnv. R topics documented: March 8, Title Integrated Copy Number Variation detection Version Author Zilu Zhou, Nancy Zhang Title Integrated Copy Number Variation detection Version 1.2.1 Author Zilu Zhou, Nancy Zhang Package icnv March 8, 2019 Maintainer Zilu Zhou Integrative copy number variation

More information

ChIP-seq practical: peak detection and peak annotation. Mali Salmon-Divon Remco Loos Myrto Kostadima

ChIP-seq practical: peak detection and peak annotation. Mali Salmon-Divon Remco Loos Myrto Kostadima ChIP-seq practical: peak detection and peak annotation Mali Salmon-Divon Remco Loos Myrto Kostadima March 2012 Introduction The goal of this hands-on session is to perform some basic tasks in the analysis

More information

Analysis of ChIP-seq Data with mosaics Package

Analysis of ChIP-seq Data with mosaics Package Analysis of ChIP-seq Data with mosaics Package Dongjun Chung 1, Pei Fen Kuan 2 and Sündüz Keleş 1,3 1 Department of Statistics, University of Wisconsin Madison, WI 53706. 2 Department of Biostatistics,

More information

Package saascnv. May 18, 2016

Package saascnv. May 18, 2016 Version 0.3.4 Date 2016-05-10 Package saascnv May 18, 2016 Title Somatic Copy Number Alteration Analysis Using Sequencing and SNP Array Data Author Zhongyang Zhang [aut, cre], Ke Hao [aut], Nancy R. Zhang

More information

Analyzing ChIP- Seq Data in Galaxy

Analyzing ChIP- Seq Data in Galaxy Analyzing ChIP- Seq Data in Galaxy Lauren Mills RISS ABSTRACT Step- by- step guide to basic ChIP- Seq analysis using the Galaxy platform. Table of Contents Introduction... 3 Links to helpful information...

More information

Package snplist. December 11, 2017

Package snplist. December 11, 2017 Type Package Title Tools to Create Gene Sets Version 0.18.1 Date 2017-12-11 Package snplist December 11, 2017 Author Chanhee Yi, Alexander Sibley, and Kouros Owzar Maintainer Alexander Sibley

More information

Package INCATome. October 5, 2017

Package INCATome. October 5, 2017 Type Package Package INCATome October 5, 2017 Title Internal Control Analysis of Translatome Studies by Microarrays Version 1.0 Date 2017-10-03 Author Sbarrato T. [cre,aut], Spriggs R.V. [cre,aut], Wilson

More information

A short Introduction to UCSC Genome Browser

A short Introduction to UCSC Genome Browser A short Introduction to UCSC Genome Browser Elodie Girard, Nicolas Servant Institut Curie/INSERM U900 Bioinformatics, Biostatistics, Epidemiology and computational Systems Biology of Cancer 1 Why using

More information

Package clstutils. R topics documented: December 30, 2018

Package clstutils. R topics documented: December 30, 2018 Type Package Title Tools for performing taxonomic assignment. Version 1.30.0 Depends R (>= 2.10), clst, rjson, ape Imports lattice, RSQLite Suggests RUnit, RSVGTipsDevice LazyLoad yes LazyData yes Author

More information

Package ChIC. R topics documented: May 4, Title Quality Control Pipeline for ChIP-Seq Data Version Author Carmen Maria Livi

Package ChIC. R topics documented: May 4, Title Quality Control Pipeline for ChIP-Seq Data Version Author Carmen Maria Livi Title Quality Control Pipeline for ChIP-Seq Data Version 1.0.0 Author Carmen Maria Livi Package ChIC May 4, 2018 Maintainer Carmen Maria Livi Quality control pipeline for ChIP-seq

More information

Package dmrseq. September 14, 2018

Package dmrseq. September 14, 2018 Type Package Package dmrseq September 14, 2018 Title Detection and inference of differentially methylated regions from Whole Genome Bisulfite Sequencing Version 1.1.15 Author Keegan Korthauer ,

More information

Package missforest. February 15, 2013

Package missforest. February 15, 2013 Type Package Package missforest February 15, 2013 Title Nonparametric Missing Value Imputation using Random Forest Version 1.3 Date 2012-06-26 Author Maintainer Depends randomforest The function missforest

More information

Package pcagopromoter

Package pcagopromoter Version 1.26.0 Date 2012-03-16 Package pcagopromoter November 13, 2018 Title pcagopromoter is used to analyze DNA micro array data Author Morten Hansen, Jorgen Olsen Maintainer Morten Hansen

More information

Differential Expression Analysis at PATRIC

Differential Expression Analysis at PATRIC Differential Expression Analysis at PATRIC The following step- by- step workflow is intended to help users learn how to upload their differential gene expression data to their private workspace using Expression

More information

Microarray Data Analysis (V) Preprocessing (i): two-color spotted arrays

Microarray Data Analysis (V) Preprocessing (i): two-color spotted arrays Microarray Data Analysis (V) Preprocessing (i): two-color spotted arrays Preprocessing Probe-level data: the intensities read for each of the components. Genomic-level data: the measures being used in

More information

m6aviewer Version Documentation

m6aviewer Version Documentation m6aviewer Version 1.6.0 Documentation Contents 1. About 2. Requirements 3. Launching m6aviewer 4. Running Time Estimates 5. Basic Peak Calling 6. Running Modes 7. Multiple Samples/Sample Replicates 8.

More information

Protocol: peak-calling for ChIP-seq data / segmentation analysis for histone modification data

Protocol: peak-calling for ChIP-seq data / segmentation analysis for histone modification data Protocol: peak-calling for ChIP-seq data / segmentation analysis for histone modification data Table of Contents Protocol: peak-calling for ChIP-seq data / segmentation analysis for histone modification

More information

Package clusterseq. R topics documented: June 13, Type Package

Package clusterseq. R topics documented: June 13, Type Package Type Package Package clusterseq June 13, 2018 Title Clustering of high-throughput sequencing data by identifying co-expression patterns Version 1.4.0 Depends R (>= 3.0.0), methods, BiocParallel, bayseq,

More information

ChromHMM: automating chromatin-state discovery and characterization

ChromHMM: automating chromatin-state discovery and characterization Nature Methods ChromHMM: automating chromatin-state discovery and characterization Jason Ernst & Manolis Kellis Supplementary Figure 1 Supplementary Figure 2 Supplementary Figure 3 Supplementary Figure

More information

Package LOLA. December 24, 2018

Package LOLA. December 24, 2018 Package LOLA December 24, 2018 Version 1.13.0 Date 2015-06-26 Title Locus overlap analysis for enrichment of genomic ranges Provides functions for testing overlap of sets of genomic regions with public

More information

Chromatin signature discovery via histone modification profile alignments Jianrong Wang, Victoria V. Lunyak and I. King Jordan

Chromatin signature discovery via histone modification profile alignments Jianrong Wang, Victoria V. Lunyak and I. King Jordan Supplementary Information for: Chromatin signature discovery via histone modification profile alignments Jianrong Wang, Victoria V. Lunyak and I. King Jordan Contents Instructions for installing and running

More information

Package MethylSeekR. January 26, 2019

Package MethylSeekR. January 26, 2019 Type Package Title Segmentation of Bis-seq data Version 1.22.0 Date 2014-7-1 Package MethylSeekR January 26, 2019 Author Lukas Burger, Dimos Gaidatzis, Dirk Schubeler and Michael Stadler Maintainer Lukas

More information

Package snm. July 20, 2018

Package snm. July 20, 2018 Type Package Title Supervised Normalization of Microarrays Version 1.28.0 Package snm July 20, 2018 Author Brig Mecham and John D. Storey SNM is a modeling strategy especially designed

More information

Package HiResTEC. August 7, 2018

Package HiResTEC. August 7, 2018 Type Package Package HiResTEC August 7, 2018 Title Non-Targeted Fluxomics on High-Resolution Mass-Spectrometry Data Version 0.54 Date 2018-08-07 Maintainer Jan Lisec Identifying labeled

More information

PICS: Probabilistic Inference for ChIP-Seq

PICS: Probabilistic Inference for ChIP-Seq PICS: Probabilistic Inference for ChIP-Seq Xuekui Zhang * and Raphael Gottardo, Arnaud Droit and Renan Sauteraud April 30, 2018 A step-by-step guide in the analysis of ChIP-Seq data using the PICS package

More information

Package QCEWAS. R topics documented: February 1, Type Package

Package QCEWAS. R topics documented: February 1, Type Package Type Package Package QCEWAS February 1, 2019 Title Fast and Easy Quality Control of EWAS Results Files Version 1.2-2 Date 2019-02-01 Author Peter J. van der Most, Leanne K. Kupers, Ilja Nolte Maintainer

More information

Package MultiMeta. February 19, 2015

Package MultiMeta. February 19, 2015 Type Package Package MultiMeta February 19, 2015 Title Meta-analysis of Multivariate Genome Wide Association Studies Version 0.1 Date 2014-08-21 Author Dragana Vuckovic Maintainer Dragana Vuckovic

More information

Package cumseg. February 19, 2015

Package cumseg. February 19, 2015 Package cumseg February 19, 2015 Type Package Title Change point detection in genomic sequences Version 1.1 Date 2011-10-14 Author Vito M.R. Muggeo Maintainer Vito M.R. Muggeo Estimation

More information

Identify differential APA usage from RNA-seq alignments

Identify differential APA usage from RNA-seq alignments Identify differential APA usage from RNA-seq alignments Elena Grassi Department of Molecular Biotechnologies and Health Sciences, MBC, University of Turin, Italy roar version 1.16.0 (Last revision 2014-09-24)

More information

An Introduction to VariantTools

An Introduction to VariantTools An Introduction to VariantTools Michael Lawrence, Jeremiah Degenhardt January 25, 2018 Contents 1 Introduction 2 2 Calling single-sample variants 2 2.1 Basic usage..............................................

More information

Package ArrayBin. February 19, 2015

Package ArrayBin. February 19, 2015 Package ArrayBin February 19, 2015 Version 0.2 Date 2013-02-01 Title Binarization of numeric data arrays Author Ed Curry Maintainer Depends R (>= 2.15.0), SAGx Fast adaptive binarization for numeric data

More information

Package scmap. March 29, 2019

Package scmap. March 29, 2019 Type Package Package scmap March 29, 2019 Title A tool for unsupervised projection of single cell RNA-seq data Version 1.4.1 Author Vladimir Kiselev Maintainer Vladimir Kiselev

More information

From genomic regions to biology

From genomic regions to biology Before we start: 1. Log into tak (step 0 on the exercises) 2. Go to your lab space and create a folder for the class (see separate hand out) 3. Connect to your lab space through the wihtdata network and

More information

Getting Started. April Strand Life Sciences, Inc All rights reserved.

Getting Started. April Strand Life Sciences, Inc All rights reserved. Getting Started April 2015 Strand Life Sciences, Inc. 2015. All rights reserved. Contents Aim... 3 Demo Project and User Interface... 3 Downloading Annotations... 4 Project and Experiment Creation... 6

More information

Package kirby21.base

Package kirby21.base Type Package Package kirby21.base October 11, 2017 Title Example Data from the Multi-Modal MRI 'Reproducibility' Resource Version 1.6.0 Date 2017-10-10 Author John Muschelli Maintainer

More information

Package methylmnm. January 14, 2013

Package methylmnm. January 14, 2013 Type Package Title detect different methylation level (DMR) Version 0.99.0 Date 2012-12-01 Package methylmnm January 14, 2013 Author Maintainer Yan Zhou To give the exactly p-value and

More information

Package tatest. July 18, 2018

Package tatest. July 18, 2018 Type Package Title Two-Group Ta-Test Version 1.0 Date 2018-07-10 Author Yuan-De Tan Package tatest July 18, 2018 Maintainer Yuan-De Tan The ta-test is a modified two-sample or two-group

More information

Package EFS. R topics documented:

Package EFS. R topics documented: Package EFS July 24, 2017 Title Tool for Ensemble Feature Selection Description Provides a function to check the importance of a feature based on a dependent classification variable. An ensemble of feature

More information

Package spade. R topics documented:

Package spade. R topics documented: Package spade April 23, 2016 Title SPADE -- An analysis and visualization tool for Flow Cytometry Version 1.18.2 Author M. Linderman, P. Qiu, E. Simonds, Z. Bjornson SPADE, or Spanning tree Progression

More information

Package SVM2CRM. R topics documented: May 1, Type Package

Package SVM2CRM. R topics documented: May 1, Type Package Type Package Package SVM2CRM May 1, 2018 Title SVM2CRM: support vector machine for cis-regulatory elements detections Version 1.12.0 Date 2014-03-30 Description Detection of cis-regulatory elements using

More information

Package MLP. February 12, 2018

Package MLP. February 12, 2018 Package MLP February 12, 2018 Maintainer Tobias Verbeke License GPL-3 Title MLP Type Package Author Nandini Raghavan, Tobias Verbeke, An De Bondt with contributions by

More information

Goldmine: Integrating information to place sets of genomic ranges into biological contexts Jeff Bhasin

Goldmine: Integrating information to place sets of genomic ranges into biological contexts Jeff Bhasin Goldmine: Integrating information to place sets of genomic ranges into biological contexts Jeff Bhasin 2016-04-22 Contents Purpose 1 Prerequisites 1 Installation 2 Example 1: Annotation of Any Set of Genomic

More information

Package mosaics. April 23, 2016

Package mosaics. April 23, 2016 Type Package Package mosaics April 23, 2016 Title MOSAiCS (MOdel-based one and two Sample Analysis and Inference for ChIP-Seq) Version 2.4.1 Depends R (>= 3.0.0), methods, graphics, Rcpp Imports MASS,

More information

Introduction to GE Microarray data analysis Practical Course MolBio 2012

Introduction to GE Microarray data analysis Practical Course MolBio 2012 Introduction to GE Microarray data analysis Practical Course MolBio 2012 Claudia Pommerenke Nov-2012 Transkriptomanalyselabor TAL Microarray and Deep Sequencing Core Facility Göttingen University Medical

More information

Services Performed. The following checklist confirms the steps of the RNA-Seq Service that were performed on your samples.

Services Performed. The following checklist confirms the steps of the RNA-Seq Service that were performed on your samples. Services Performed The following checklist confirms the steps of the RNA-Seq Service that were performed on your samples. SERVICE Sample Received Sample Quality Evaluated Sample Prepared for Sequencing

More information

Relationship Between BED and WIG Formats

Relationship Between BED and WIG Formats Relationship Between BED and WIG Formats Pete E. Pascuzzi July 2, 2015 This example will illustrate the similarities and differences between the various ways to represent ranged data in R. In bioinformatics,

More information

ROTS: Reproducibility Optimized Test Statistic

ROTS: Reproducibility Optimized Test Statistic ROTS: Reproducibility Optimized Test Statistic Fatemeh Seyednasrollah, Tomi Suomi, Laura L. Elo fatsey (at) utu.fi March 3, 2016 Contents 1 Introduction 2 2 Algorithm overview 3 3 Input data 3 4 Preprocessing

More information

Package RNASeqBias. R topics documented: March 7, Type Package. Title Bias Detection and Correction in RNA-Sequencing Data. Version 1.

Package RNASeqBias. R topics documented: March 7, Type Package. Title Bias Detection and Correction in RNA-Sequencing Data. Version 1. Package RNASeqBias March 7, 2011 Type Package Title Bias Detection and Correction in RNA-Sequencing Data Version 1.0 Date 2011-03-01 Author Maintainer Wei Zheng The package contains

More information

Advanced UCSC Browser Functions

Advanced UCSC Browser Functions Advanced UCSC Browser Functions Dr. Thomas Randall tarandal@email.unc.edu bioinformatics.unc.edu UCSC Browser: genome.ucsc.edu Overview Custom Tracks adding your own datasets Utilities custom tools for

More information

Package Repliscope. February 6, 2019

Package Repliscope. February 6, 2019 Type Package Language en-gb Package Repliscope February 6, 2019 Title Replication Timing Profiling using DNA Copy Number Version 1.0.0 Author Dzmitry G Batrakou Maintainer Dzmitry G Batrakou

More information

methylmnm Tutorial Yan Zhou, Bo Zhang, Nan Lin, BaoXue Zhang and Ting Wang January 14, 2013

methylmnm Tutorial Yan Zhou, Bo Zhang, Nan Lin, BaoXue Zhang and Ting Wang January 14, 2013 methylmnm Tutorial Yan Zhou, Bo Zhang, Nan Lin, BaoXue Zhang and Ting Wang January 14, 2013 Contents 1 Introduction 1 2 Preparations 2 3 Data format 2 4 Data Pre-processing 3 4.1 CpG number of each bin.......................

More information

Package phantasus. March 8, 2019

Package phantasus. March 8, 2019 Package phantasus March 8, 2019 Title Visual and interactive gene exprsion analysis Version 1.2.1 Phantasus is a web-application for visual and interactive gene exprsion analysis. Phantasus is based on

More information

Package MetaLonDA. R topics documented: January 22, Type Package

Package MetaLonDA. R topics documented: January 22, Type Package Type Package Package MetaLonDA January 22, 2018 Title Metagenomics Longitudinal Differential Abundance Method Version 1.0.9 Date 2018-1-20 Author Ahmed Metwally, Yang Dai, Patricia Finn, David Perkins

More information

Package GOTHiC. November 22, 2017

Package GOTHiC. November 22, 2017 Title Binomial test for Hi-C data analysis Package GOTHiC November 22, 2017 This is a Hi-C analysis package using a cumulative binomial test to detect interactions between distal genomic loci that have

More information

Package crmn. February 19, 2015

Package crmn. February 19, 2015 Version 0.0.20 Date 2012-06-19 Author Package crmn February 19, 2015 Maintainer Title CCMN and other normalization methods for metabolomics data Depends R (>= 2.10), pcamethods

More information

Package agilp. R topics documented: January 22, 2018

Package agilp. R topics documented: January 22, 2018 Type Package Title Agilent expression array processing package Version 3.10.0 Date 2012-06-10 Author Benny Chain Maintainer Benny Chain Depends R (>= 2.14.0) Package

More information

CNVPanelizer: Reliable CNV detection in target sequencing applications

CNVPanelizer: Reliable CNV detection in target sequencing applications CNVPanelizer: Reliable CNV detection in target sequencing applications Oliveira, Cristiano cristiano.oliveira@med.uni-heidelberg.de Wolf, Thomas thomas_wolf71@gmx.de April 30, 2018 Amplicon based targeted

More information

Package NGScopy. April 10, 2015

Package NGScopy. April 10, 2015 Type Package Version 1.0.0 Date 2014-08-21 23:46:44 EDT Package NGScopy April 10, 2015 Title NGScopy: Detection of copy number variations in next generation sequencing NGScopy provides a quantitative caller

More information

Using generxcluster. Charles C. Berry. April 30, 2018

Using generxcluster. Charles C. Berry. April 30, 2018 Using generxcluster Charles C. Berry April 30, 2018 Contents 1 Overview 1 2 Basic Use 1 2.1 Reading Data from a File...................... 2 2.2 Simulating Data........................... 2 2.3 Invoking

More information