Fusion Detection Using QIAseq RNAscan Panels

Similar documents
QIAseq Targeted RNAscan Panel Analysis Plugin USER MANUAL

Modification of an Existing Workflow

Tutorial. Find Very Low Frequency Variants With QIAGEN GeneRead Panels. Sample to Insight. November 21, 2017

Tutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017

Tutorial. Variant Detection. Sample to Insight. November 21, 2017

Tutorial. RNA-Seq Analysis of Breast Cancer Data. Sample to Insight. November 21, 2017

Tutorial. Aligning contigs manually using the Genome Finishing. Sample to Insight. February 6, 2019

Tutorial. De Novo Assembly of Paired Data. Sample to Insight. November 21, 2017

Expression Analysis with the Advanced RNA-Seq Plugin

Tutorial. OTU Clustering Step by Step. Sample to Insight. June 28, 2018

OTU Clustering Using Workflows

Tutorial. Identification of Variants in a Tumor Sample. Sample to Insight. November 21, 2017

Tutorial. Getting Started. Sample to Insight. November 28, 2018

Tutorial: De Novo Assembly of Paired Data

Tutorial. OTU Clustering Step by Step. Sample to Insight. March 2, 2017

QIAseq DNA V3 Panel Analysis Plugin USER MANUAL

Tutorial. Small RNA Analysis using Illumina Data. Sample to Insight. October 5, 2016

Tutorial. Comparative Analysis of Three Bovine Genomes. Sample to Insight. November 21, 2017

Small RNA Analysis using Illumina Data

Tutorial. Batching of Multi-Input Workflows. Sample to Insight. November 21, 2017

Tutorial. Typing and Epidemiological Clustering of Common Pathogens (beta) Sample to Insight. November 21, 2017

CLC Server. End User USER MANUAL

Tutorial: RNA-Seq analysis part I: Getting started

Biomedical Genomics Workbench APPLICATION BASED MANUAL

Getting started: Analysis of Microbial Communities

Tutorial. Identification of somatic variants in a matched tumor-normal pair. Sample to Insight. November 21, 2017

CLC Sequence Viewer USER MANUAL

Tutorial: Resequencing Analysis using Tracks

Tutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures

Tutorial. Phylogenetic Trees and Metadata. Sample to Insight. November 21, 2017

Performing a resequencing assembly

De novo genome assembly

Tour Guide for Windows and Macintosh

Access Groups. Collect and Store. Text Currency Date/Time. Tables Fields Data Type. You Your Friend Your Parent. Unique information

Access Groups. Collect and Store. Text Currency Date/Time. Tables Fields Data Type. You Your Friend Your Parent. Unique information

Tutorial for Windows and Macintosh SNP Hunting

Access Groups. Collect and Store. Text Currency Date/Time. Tables Fields Data Type. You Your Friend Your Parent. Unique information

Analyzing ChIP- Seq Data in Galaxy

Supplementary Figure 1. Fast read-mapping algorithm of BrowserGenome.

CLC Sequence Viewer 6.5 Windows, Mac OS X and Linux

Tutorial for Windows and Macintosh SNP Hunting

Getting Started. April Strand Life Sciences, Inc All rights reserved.

Additional Alignments Plugin USER MANUAL

Differential Expression Analysis at PATRIC

Resequencing Analysis. (Pseudomonas aeruginosa MAPO1 ) Sample to Insight

Astra Scheduling Grids

Provide Feedback year-round for your direct reports, any HSHS colleague, or add Journal Notes for yourself.

Browser Exercises - I. Alignments and Comparative genomics

Helpful Galaxy screencasts are available at:

m6aviewer Version Documentation

Next generation Confirmation (NGC) module

Importing sequence assemblies from BAM and SAM files

Protocol: peak-calling for ChIP-seq data / segmentation analysis for histone modification data

Genome Browsers Guide

Advanced RNA-Seq 1.5. User manual for. Windows, Mac OS X and Linux. November 2, 2016 This software is for research purposes only.

Annotating a single sequence

Tutorial for Windows and Macintosh. Trimming Sequence Gene Codes Corporation

Recalling Genotypes with BEAGLECALL Tutorial

Ensembl RNASeq Practical. Overview

Agilent Genomic Workbench Lite Edition 6.5

Integrating Sintelix and ANB. Learn how to access and explore Sintelix networks in IBM i2 Analyst s Notebook

Omixon HLA Explore User guide

Wilson Leung 01/03/2018 An Introduction to NCBI BLAST. Prerequisites: Detecting and Interpreting Genetic Homology: Lecture Notes on Alignment

GEP Project Management System: Annotation Project Submission

CLC Genomics Server. Administrator Manual

Cisco Unified CM User Options

Agilent Genomic Workbench 7.0

Microsoft Access 2010

User Guide. v Released June Advaita Corporation 2016

Copyright 2014 Regents of the University of Minnesota

TruSight HLA Assign 2.1 RUO Software Guide

Copyright 2014 Regents of the University of Minnesota

v Working with Rasters SMS 12.1 Tutorial Requirements Raster Module Map Module Mesh Module Time minutes Prerequisites Overview Tutorial

Microsoft Access 2013

Microsoft Access 2013

BovineMine Documentation

Creating and Using Genome Assemblies Tutorial

Agilent Genomic Workbench 7.0

The software comes with 2 installers: (1) SureCall installer (2) GenAligners (contains BWA, BWA- MEM).

Access Review. 4. Save the table by clicking the Save icon in the Quick Access Toolbar or by pulling

Import and preprocessing of raw spectrum data

ChIP-seq hands-on practical using Galaxy

CLC Microbial Genomics Module USER MANUAL

MySQL DOWNLOAD & INSTALLATION GUIDE

W ASHU E PI G ENOME B ROWSER

Locate your Advanced Tools and Applications

Intro to NGS Tutorial

Import GEO Experiment into Partek Genomics Suite

Note that you must have the Curriculum Map Admin permission to create curriculum map templates. Please contact your IU to obtain this permission.

InSite Prepress Portal

Instructional Improvement System (IIS) Dashboard District User Guide Statewide Longitudinal Data System (SLDS)

ITMO Ecole de Bioinformatique Hands-on session: smallrna-seq N. Servant 21 rd November 2013

Integrative Genomics Viewer. Prat Thiru

Tutorial for Windows and Macintosh. De Novo Sequence Assembly with Velvet

Assigning an Assessment for Online Delivery in Edusoft

Creating a Subset of Production Data

Getting Started with IBM Cognos 10 Reports

Tutorial 1: Exploring the UCSC Genome Browser

User's Guide to DNASTAR SeqMan NGen For Windows, Macintosh and Linux

ChIP-Seq Tutorial on Galaxy

Transcription:

Fusion Detection Using QIAseq RNAscan Panels June 11, 2018 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com ts-bioinformatics@qiagen.com

2 Fusion Detection Using QIAseq RNAscan Panels This tutorial uses the capacities of the Biomedical Genomics Workbench and the QIAseq Targeted Panel Analysis plugin to detect fusion events in sequences generated using a QIAseq Targeted RNAscan Panel. The plugin includes an Analyze QIAseq Panels guide that facilitates the configuration and the running of QIAseq panels secondary analysis workflows. However, because we are working with a smaller data set here (limited to chromosome 10), we choose to show you in this tutorial how to proceed without using the guide, but using the Detect QIAseq RNAscans Fusions workflow instead. This tutorial covers in just a few steps all the following: Import Illumina paired reads in the workbench. Generate a trim adaptor list to remove the sequencing primers off the reads. Find fusion events with the Detect QIAseq RNAscan Fusions ready-to-use workflow. This tutorial makes use of a subset (reads mapping only to chromosome 10) of the sequencing reads obtained using a QIAseq Targeted RNAscan Panel. Prerequisites For this tutorial, you must be working with the Biomedical Genomics Workbench 5.1 or higher. You must also have installed the QIAseq Targeted Panel Analysis plugin version 1.1 or higher. How to install plugins is described here: http://resources.qiagenbioinformatics.com/ manuals/biomedicalgenomicsworkbench/current/index.php?manual=installing_ plugins.html. Data management configuration In order to do this tutorial, you need to use the Reference Data Set made specifically for this tutorial: 1. Click on the Data Management icon in the top right corner of the workbench (1) (figure 1). 2. In the QIAGEN Reference Data Library (2), open the Reference Data Sets tab (3), select the Fusion Detection Using QIAseq RNAscan Panels data set, and click on the button Download (4). 3. When the data is downloaded (characterized by check icons next to each item included in the dataset), click "Apply" (5). The workflow is now set up to work with the adequate references. In addition, QIAseq Targeted RNAscan Panels specific primer tracks are now available in the CLC_References folder located in the Navigation Area.

3 Figure 1: Downloading the tutorial specific reference data. Download and import data Go through the following steps to download and import the data into the Workbench. 1. Download the sample data from our website: http://resources.qiagenbioinformatics. com/testdata/fusion_gene_detection_tutorial.zip and unzip it in the location of your choice on your computer. 2. Start the Biomedical Genomics Workbench. 3. Import the reads via the toolbar: File Import ( ) Illumina Select the two fastq paired files. Under "General Options" section, ensure the Paired reads and Discard read names checkboxes are checked. In the "Paired read orientation" section, ensure the Paired-end (forward-reverse) option is checked. Set the Minimum distance to 1 and the Maximum distance to 1000 (default values). Click Next. 4. Click on the button labeled Save in the wizard page that appears, choose the folder you wish to save the reads to (you can create a new folder dedicated to this tutorial for example) and click Finish. Run Detect QIAseq RNAscan Fusions The Detect QIAseq RNAscan Fusions ready-to-use workflow can be found in the Toolbox here:

4 Toolbox Ready-to-Use Workflows QIAseq Panel Analysis QIAseq Analysis Workflows Detect QIAseq RNAscan Fusions 1. Double-click on the Detect QIAseq RNAscan Fusions ready-to-use workflow to run the analysis. If the QIAseq Targeted Panel Analysis plugin has been installed on a CLC Server you are connected to via the workbench, you will be asked where you would like to run the analysis. We recommend that you run the analysis on a CLC Server when possible. 2. Select the sequencing reads that should be analyzed (figure 2). Figure 2: Select the sequencing reads by double-clicking on the file name or by clicking once on the file name and then on the arrow pointing to the right hand side. 3. Select the primers track available in the CLC_References folder once the tutorial Reference Data Set has been downloaded (figure 3). Figure 3: Select the Trim adapter list you generated ahead of running the workflow. 4. In the Detect Fusion Genes dialog, check the option "Detect exon skippings" (figure 4) and Next. Figure 4: Choose to detect exon skippings.

5 5. Finally, in the last wizard step, choose to Save the results of the workflow and specify a location in the Navigation Area before clicking Finish. Output from the Detect QIAseq RNAscan Fusions workflow The Detect QIAseq RNAscan Fusions workflow produces a series of output. Before checking the detected fusions, users are usually recommended to review the QC for RNAscan Panels Report, and in particular, to verify that no DNA contamination was found in the sample. Also the reference genes included in the RNAscan panels should have been detected and their expression to that of the target genes should be reviewed (Target gene vs reference gene coverage ratio [%]). In the case of this tutorial, the data was limited to chromosome 10 only and the QC for RNAscan Panels Report is therefore incomplete. The easiest way to review the results is to open the Genome Browser View (Fusions) (figure 5). Figure 5: A Genome Browser View (Fusions) showing the KIF5B-RET fusion.

6 Double-click on the Fusion genes (fusions) track name (top left of the Genome Browser View). The fusion track will open as a table in split view, below the Genome Browser View. You can sort and filter the table using one of the metrics: Fusions that get the Filter annotation PASS have not been caught by any filter and are therefore the most trustworthy fusions. Also, sorting the table using the fusion Number column will move the most trustworthy fusions to the top of the table. Note that one of the three fusions with a "PASS" is an exon skipping fusion, i.e., a same-gene fusion event where the 5 breakpoint is upstream of the 3 breakpoint. This same fusion is found to be not in frame (information available because a CDS track was specified as input), as seen in figure 6. Figure 6: A Genome Browser View (Fusions) showing the MLLT10-MLLT10 exon skiping fusion. Clicking on a fusion event in the table will zoom in to its location in the UMI read mapping, allowing you to review the UMI reads supporting the detected fusion. Note that each fusion event is represented by two rows in the table, allowing you to visualize successively both breakpoints of each fusion. You can estimate the evidence for the fusion by studying the Mapped UMI Reads that aligns to one side of a fusion, and the Mapped Unaligned Ends that, after re-mapping, align

7 to the other side of the fusion. It is also possible to assess in the Genome Browser View (WT) know how well the reads maps to the WT genes at the breakpoints. Besides, the WT tracks that are not included in the Genome Browser View (read mappings of the first iteration, and the breakpoint track from the detect tool) can be dragged from the Navigation Area into the Genome Browser View in order to see the results from the Detect Fusion Genes tool. For example, look at the number of fusion crossing and fusion spanning reads, as well as the estimated Z-scores and p-values in the different Fusion genes tracks. This could be interesting in cases where no reads map to the fusion genes in the fusion tracks. Note that the Mapped UMI track displays the mapping of UMI reads to the transcript sequences. Paired reads still in proper pairs are blue. Forward reads are green, reverse reads are red. Unaligned ends of mapped reads are shown in a lighter shade of the respective read color. Multi-match reads, i.e., reads that could have been mapped to another position within the mapping criteria used, are in yellow.