Package CopywriteR. April 20, 2018

Similar documents
Package RAPIDR. R topics documented: February 19, 2015

Package metagene. November 1, 2018

Package dmrseq. September 14, 2018

Package HTSeqGenie. April 16, 2019

Package NGScopy. April 10, 2015

Package ChIC. R topics documented: May 4, Title Quality Control Pipeline for ChIP-Seq Data Version Author Carmen Maria Livi

Package icnv. R topics documented: March 8, Title Integrated Copy Number Variation detection Version Author Zilu Zhou, Nancy Zhang

Package GreyListChIP

Package conumee. February 17, 2018

Practical 4: ChIP-seq Peak calling

R topics documented: 2 R topics documented:

segmentseq: methods for detecting methylation loci and differential methylation

Package GOTHiC. November 22, 2017

Introduction to QDNAseq

Package rgreat. June 19, 2018

Package bnbc. R topics documented: April 1, Version 1.4.0

RAPIDR. Kitty Lo. November 20, Intended use of RAPIDR 1. 2 Create binned counts file from BAMs Masking... 1

segmentseq: methods for detecting methylation loci and differential methylation

Package SVM2CRM. R topics documented: May 1, Type Package

Visualisation, transformations and arithmetic operations for grouped genomic intervals

ExomeDepth. Vincent Plagnol. May 15, What is new? 1. 4 Load an example dataset 4. 6 CNV calling 5. 9 Visual display 9

Package MIRA. October 16, 2018

Representing sequencing data in Bioconductor

Package MethylSeekR. January 26, 2019

Package chromstar. March 19, 2019

Protocol: peak-calling for ChIP-seq data / segmentation analysis for histone modification data

Package roar. August 31, 2018

Package HiCcompare. September 9, 2018

Package multihiccompare

Package ChIPseeker. July 12, 2018

Basic4Cseq: an R/Bioconductor package for the analysis of 4C-seq data

Package seqcat. March 25, 2019

Easy visualization of the read coverage using the CoverageView package

Package RcisTarget. July 17, 2018

Basic4Cseq: an R/Bioconductor package for the analysis of 4C-seq data

Package AnnotationHubData

The ChIP-seq quality Control package ChIC: A short introduction

Package compartmap. November 10, Type Package

Package mosaics. July 3, 2018

Working with aligned nucleotides (WORK- IN-PROGRESS!)

Package scruff. November 6, 2018

Package LOLA. December 24, 2018

ChIP-seq hands-on practical using Galaxy

Package customprodb. September 9, 2018

Package MACPET. January 16, 2019

Agilent Genomic Workbench Lite Edition 6.5

Package saascnv. May 18, 2016

Package CAGEr. September 4, 2018

MMDiff2: Statistical Testing for ChIP-Seq Data Sets

CLC Server. End User USER MANUAL

Package EventPointer

Package BayesPeak. R topics documented: September 21, Version Date Title Bayesian Analysis of ChIP-seq Data

Package GSCA. October 12, 2016

Package GSCA. April 12, 2018

You will be re-directed to the following result page.

Analyzing ChIP- Seq Data in Galaxy

!"#$%&$'()#$*)+,-./).01"0#,23+3,303456"6,&((46,7$+-./&((468,

Tutorial. Identification of Variants in a Tumor Sample. Sample to Insight. November 21, 2017

Package TFutils. October 11, 2018

Package igc. February 10, 2018

BadRegionFinder an R/Bioconductor package for identifying regions with bad coverage

Package GUIDEseq. October 12, 2016

Package FunciSNP. November 16, 2018

PICS: Probabilistic Inference for ChIP-Seq

CNVPanelizer: Reliable CNV detection in target sequencing applications

Triform: peak finding in ChIP-Seq enrichment profiles for transcription factors

seqcna: A Package for Copy Number Analysis of High-Throughput Sequencing Cancer DNA

Package RTCGAToolbox

Package BubbleTree. December 28, 2018

Package TxRegInfra. March 17, 2019

Copy Number Variations Detection - TD. Using Sequenza under Galaxy

Package dupradar. R topics documented: July 12, Type Package

User's guide to ChIP-Seq applications: command-line usage and option summary

Package PlasmaMutationDetector

ChromHMM: automating chromatin-state discovery and characterization

Package AneuFinder. October 24, 2018

ChIP-Seq Tutorial on Galaxy

Package DMCHMM. January 19, 2019

Package SomaticSignatures

Analysis of ChIP-seq data

Package DNaseR. November 1, 2013

Package h5vc. R topics documented: October 2, Type Package

Part 1: How to use IGV to visualize variants

Package karyoploter. September 20, 2018

Package PICS. R topics documented: February 4, Type Package Title Probabilistic inference of ChIP-seq Version

Package PGA. July 6, 2018

An Introduction to the genoset Package

Package GUIDEseq. April 23, 2016

m6aviewer Version Documentation

HIPPIE User Manual. (v0.0.2-beta, 2015/4/26, Yih-Chii Hwang, yihhwang [at] mail.med.upenn.edu)

TEQC: Target Enrichment Quality Control

Working with ChIP-Seq Data in R/Bioconductor

KC-SMART Vignette. Jorma de Ronde, Christiaan Klijn April 30, 2018

ChIP-seq hands-on practical using Galaxy

Package MEAL. August 16, 2018

The software comes with 2 installers: (1) SureCall installer (2) GenAligners (contains BWA, BWA- MEM).

From genomic regions to biology

Package MEDIPS. August 13, 2018

Package polyphemus. February 15, 2013

Package STAN. R topics documented: July 23, Version Date Title The Genomic STate ANnotation Package

Transcription:

Type Package Package CopywriteR April 20, 2018 Title Copy number information from targeted sequencing using off-target reads Version 2.11.0 Date 2016-02-23 Author Thomas Kuilman Maintainer Thomas Kuilman <t.kuilman@nki.nl> Imports matrixstats, gtools, data.table, S4Vectors, chipseq, IRanges, Rsamtools, DNAcopy, GenomicAlignments, GenomicRanges, CopyhelpeR, GenomeInfoDb, futile.logger Depends R(>= 3.2), BiocParallel Suggests BiocStyle, SCLCBam, snow URL https://github.com/peeperlab/copywriter Description CopywriteR extracts DNA copy number information from targeted sequencing by utiizing off-target reads. It allows for extracting uniformly distributed copy number information, can be used without reference, and can be applied to sequencing data obtained from various techniques including chromatin immunoprecipitation and target enrichment on small gene panels. Thereby, CopywriteR constitutes a widely applicable alternative to available copy number detection tools. License GPL-2 biocviews TargetedResequencing, ExomeSeq, CopyNumberVariation, Preprocessing, Visualization, Coverage R topics documented: CopywriteR......................................... 2 plotcna........................................... 3 precopywriter....................................... 5 Index 7 1

2 CopywriteR CopywriteR CopywriteR: ENhanced COpy number Detection from Exome Reads Description Usage Generates DNA copy number profiles from targeted sequencing data using off-target reads CopywriteR(sample.control, destination.folder, reference.folder, bp.param, capture.regions.file, keep.intermediary.files = FALSE) Arguments sample.control a data.frame or matrix that contains the locations of sample and control BAM files, respectively, in columns. The file locations in a row represent a sample and its corresponding control, which is used for calling peaks. destination.folder the path to the folder to which output should be written. The path can be either absolute or relative. reference.folder the path to the folder with the helper files generated by precopywriter(). The helper files include the bin, mappability, GC-content, and blacklist, files in.bed format. bp.param a BiocParallelParam instance (see BiocParallel Bioconductor pacakage) that determines the settings used for parallel computing. Please refer to the vignette for more information. capture.regions.file optional; the path to the capture regions file, which should be in.bed format. Overlapping bait regions should be reduced into single regions. If included, statistics on the overlap of peaks called by MACS and the capture regions will be provided. keep.intermediary.files optional; logical that indicates whether intermediary.bam,.bai and peak regions files should be kept after the analysis is done. Defaults to FALSE. Details CopywriteR uses off-target sequence reads from targeted sequencing to create copy number profiles. First, it removes non-random off-target reads, and it subsequently calculates the depth of coverage for the bins that are provided in the helper files. It then performs GC-content and mappability corrections, and removes blacklisted regions. plotcna() generates a DNA copy number profile from the output of the CopywriteR() function. Helper files required for CopywriteR analysis can be created using precopywriter(). Value BamBaiPeaksFiles a folder with the.bam,.bai and peak regions files that are created during the CopywriteR() run. The folder will only be created when the keep.intermediary.files argument is set to TRUE.

plotcna 3 input.rdata an R object that contains information for plotcna(). log2_read_counts.igv the file that contains the compensated corrected read counts after GC-content and mappability corrections, and after removal of data points in blacklisted regions. Counts are log2-transformed. The file is a tab-separated file formatted to be viewed in the IGV genome browser. CopywriteR.log log file of CopywriteR. qc a folder with quality control files. The.png files contain the plots and the loesses that are used for GC-content and mappability corrections. The fraction_of-bin.pdf files display the empirical cumulative distribution function for the fraction of bin (the bin size after removal of peak regions expressed as a fraction of the original size). read_counts.txt the file that contains the raw and compensated read counts per bin. Author(s) Thomas Kuilman (t.kuilman@nki.nl) References CopywriteR: DNA copy number detection from off-target sequence data. Thomas Kuilman, Arno Velds, Kristel Kemper, Marco Ranzani, Lorenzo Bombardelli, Marlous Hoogstraat, Ekaterina Nevedomskaya, Guotai Xu, Julian de Ruiter, Martijn P. Lolkema, Bauke Ylstra, Jos Jonkers, Sven Rottenberg, Lodewyk F. Wessels, David J. Adams, Daniel S. Peeper, Oscar Krijgsman. Submitted for publication. Examples ## Not run: setwd("/path/to/bamfiles/") samples <- list.files(pattern = ".bam$", full.names = TRUE) ## Use the first.bam file as a control for every sample # controls <- samples[rep(1, length(samples))] ## Use every sample as its own control (i.e., peaks are called on sample itself) controls <- samples sample.control <- data.frame(samples, controls) CopywriteR(sample.control = sample.control, destination.folder = "/PATH/TO/DESTINATIONFOLDER/", reference.folder = "/PATH/TO/REFERENCEFOLDER", ncpu = nrow(sample.control), capture.regions.file <- "/PATH/TO/CAPTUREREGIONSFILE") ## End(Not run) plotcna CopywriteR: ENhanced COpy number Detection from Exome Reads Description plotcna analyses CopywriteR output using the segmentation algorithm CBS and creates both whole-genome and per-chromosome plots.

4 plotcna Usage plotcna(destination.folder, smoothed = TRUE, sample.plot, y.min, y.max,...) Arguments Value destination.folder the path to the output folder of CopywriteR. smoothed sample.plot y.min y.max logical that determines whether DNAcopy smoothens copy number information before segmentation. Defaults to TRUE. optional; a data.frame or matrix that contains the locations of sample and control BAM files. The column names should be "samples" and "controls", respectively. The file locations in a row represent a sample and its corresponding control, which is used for plotting of relative copy number values. Every sample is plotted both without a reference and with a reference. Defaults to sample.controls as used for running CopywriteR. determines lower boundary of the plotting range. determines upper boundary of the plotting range.... takes additional arguments that will be passed on to DNAcopy. segment.rdata plots an R object of the DNAcopy class, which contains the segmentation values for all analysed samples. a folder that contains all copy number profiles in portable document file (pdf) file format. Both profiles per chromosome, as well as a genome-wide profile are provided for every sample. Author(s) Thomas Kuilman (t.kuilman@nki.nl) References CopywriteR: DNA copy number detection from off-target sequence data. Thomas Kuilman, Arno Velds, Kristel Kemper, Marco Ranzani, Lorenzo Bombardelli, Marlous Hoogstraat, Ekaterina Nevedomskaya, Guotai Xu, Julian de Ruiter, Martijn P. Lolkema, Bauke Ylstra, Jos Jonkers, Sven Rottenberg, Lodewyk F. Wessels, David J. Adams, Daniel S. Peeper, Oscar Krijgsman. Submitted for publication. See Also CopywriteR() Examples ## Not run: setwd("/path/to/bamfiles/") samples <- list.files(pattern = ".bam$", full.names = TRUE) ## Use the first.bam file as a reference for plotting every sample controls <- samples[rep(1, length(samples))] sample.plot <- data.frame(samples, controls)

precopywriter 5 plotcna("./path/to/destinationfolder/", sample.plot = sample.plot, y.min = -3, y.max = 3) ## End(Not run) precopywriter CopywriteR: DNA copy number detection from off-target sequence data Description precopywriter is used to generate helper files (blacklist, bin region, GC-content, and mappability.bed files) for the desired bin size from pre-assembled 1 kb helper files. These binary 1 kb helper files are available at https://github.com/peeperlab/copywriter for hg18, hg19, hg38, mm9 and mm10 reference genomes. Usage precopywriter(output.folder, bin.size, ref.genome, prefix = "") Arguments output.folder bin.size ref.genome the path to the output folder. desired bin length (in bp) for which helper files should be generated. the reference genome for which helper files should be generated, e.g. hg18, hg19, hg38, mm9 or mm10. prefix the prefix that is used for chromosome notation. Standard notation is "1", "2",..., "X", "Y". For "chr1",... notation, use prefix = "chr". Details Currently helper files can be generated for human (hg18, hg19 and hg38) and mouse (mm9 and mm10) reference genomes. Helper files can only be generated for a bin.size that is a multiple of 1000 bp. Value blacklist.rda an R data file containing a GRanges object with blacklisted regions of known CNVs. GC_mappability.rda an R data file containing a GRanges object with the mappability and GC-content for bins of the specified size. Author(s) Oscar krijgsman (o.krijgsman@nki.nl) Thomas Kuilman (t.kuilman@nki.nl)

6 precopywriter References CopywriteR: DNA copy number detection from off-target sequence data. Thomas Kuilman, Arno Velds, Kristel Kemper, Marco Ranzani, Lorenzo Bombardelli, Marlous Hoogstraat, Ekaterina Nevedomskaya, Guotai Xu, Julian de Ruiter, Martijn P. Lolkema, Bauke Ylstra, Jos Jonkers, Sven Rottenberg, Lodewyk F. Wessels, David J. Adams, Daniel S. Peeper, Oscar Krijgsman. Submitted for publication. Examples ## Not run: precopywriter("/path/to/hg19_1kb_folder/", "./", 20000, "hg19") ## End(Not run)

Index CopywriteR, 2 list.files (CopywriteR), 2 plotcna, 3 precopywriter, 5 7