DRAGEN Bio-IT Platform Enabling the Global Genomic Infrastructure

Size: px
Start display at page:

Download "DRAGEN Bio-IT Platform Enabling the Global Genomic Infrastructure"

Transcription

1 TM DRAGEN Bio-IT Platform Enabling the Global Genomic Infrastructure About DRAGEN Edico Genome s DRAGEN TM (Dynamic Read Analysis for GENomics) Bio-IT Platform provides ultra-rapid secondary analysis of next-generation sequencing (NGS) data. DRAGEN uses highly reconfigurable field-programmable gate array (FPGA) to provide hardware-accelerated implementations of genome pipeline algorithms, such as BCL conversion, compression, mapping, alignment, sorting, duplicate marking, and haplotype variant calling. The highly flexible DRAGEN platform allows Edico Genome to develop custom algorithms and refine and improve existing pipelines. Industry Leading Speeds Whole human genome sequence analysis at 30x coverage in ~20 minutes Highly Accurate Clinical-grade sensitivity and specificity for all major genomic applications and workflows Cost Savings One DRAGEN instance replaces ~ 80 traditional compute instances and reduces storage costs Onsite and Cloud DRAGEN can be utilized onsite, in the cloud, or through a hybrid cloud solution

2 Unleash the Power of DRAGEN Edico Genome s DRAGEN Bio-IT Platform leverages FPGA technology to provide customers with hardware-accelerated implementation of genome pipeline algorithms, achieving quicker turnaround times without compromising accuracy. Ultra-Rapid FPGAs implement algorithms as logic circuits, providing an output almost instantaneously. As a result, DRAGEN can process an entire human genome at 30x coverage in about 20 minutes, as compared to over 20 hours using a traditional CPU-based system. Highly Accurate DRAGEN is engineered to remove biases and other sources of error, ensuring that accuracy is maintained across the board. DRAGEN s suite of pipelines contain checks and balances to ensure reliable results, including GC bias corrections. DRAGEN received the highest score in five of six accuracy metrics in the most recent PrecisionFDA variant calling challenge. Hybrid Cloud Solution DRAGEN onsite and cloud offerings enable customers to use DRAGEN on their preferred platform. Customers can select DRAGEN onsite or in the cloud through our industry-leading partnerships, or use a hybrid of both to securely transfer data analysis and storage between their physical and cloud-based DRAGEN systems. Cost Savings One DRAGEN instance can replace up to 80 traditional compute instances, reducing onsite investments. With G3 compression (see page 7), DRAGEN offers further cost savings by compressing customer data to reduce data storage costs. TM Guinness World Records Holders for Fastest Genetic Diagnosis and Fastest Analysis of 1,000 Human Genomes Fastest Genetic Diagnosis in 26 hours Fastest Analysis of 1,000 Human Genomes in 2:25 hours Applications NICU Diagnostics Clinical Genomics Sequencing Centers Agri Genomics NIPT NIPS Population Sequencing

3 The DRAGEN Platform DRAGEN leverages FPGA technology to provide customers with hardware-accelerated implementation of genome pipeline algorithms onsite and in the cloud. API CLI GUI Compression FPGA-Based DRAGEN Engine The DRAGEN Engine solves two key unmet needs in big data genomics: compute and storage. DRAGEN offers a scalable, accelerated, and cost-efficient secondary analysis solution for all key genomics applications, as well as providing a highly optimized mechanism for the efficient storage of genomic data. The core FPGA technology enables the ultra-rapid compute and optimized storage that the DRAGEN platform provides. As FPGAs are highly reconfigurable, DRAGEN can be reprogrammed as frequently as needed, seamlessly switching or updating algorithms on the fly. This flexibility allows Edico Genome to continuously improve its DRAGEN platform, releasing customer downloads on a regular basis. Powered by Amazon F1 Instances DRAGEN in the cloud is enabled by Amazon Elastic Compute Cloud (Amazon EC2) F1 instance types. Powered by Xilinx Virtex UltraScale + FPGAs, F1 instances are designed to provide capabilities for hardware-accelerated big data applications such as NGS analysis. With the integration of Edico Genome s FPGA-backed DRAGEN Bio-IT platform on Amazon EC2 F1 instances, the accuracy, speed, flexibility, and low cost of DRAGEN can now be accessed through a number of cloud-based platforms.

4 DRAGEN Onsite and Cloud Solutions DRAGEN onsite and cloud offerings enable customers to utilize DRAGEN s ultra-rapid speeds, accuracy, and cost savings through the platform of their choosing. Customers can select between DRAGEN onsite or in the cloud through our industry-leading partnerships, or choose a hybrid of both, allowing them to securely transfer data analysis and storage between their physical and cloud-based DRAGEN systems. Ultra-Rapid NGS Compute 100x faster WGS 30x in ~20 minutes* Accurate Clinical-grade sensitivity and specificity Low Cost Replace up to 80 compute instances Storage Lossless near real-time compression Flexible Onsite, in the cloud or both through hybrid cloud Onsite Hybrid Cloud Solution In the Cloud *Speeds based on DRAGEN onsite Dell EMC server and in the Cloud on an AWS F1 16x large instance Onsite DRAGEN onsite can replace 100 traditional compute instances, reducing hardware and maintenance costs. Utilizing proprietary technology, DRAGEN conducts secondary analysis in a fraction of the time while still maintaining high accuracy. The onsite DRAGEN platform allows customers to analyze and store their data locally. When integrated with DRAGEN s cloud offerings, customers can utilize the DRAGEN hybrid cloud solution to scale up to the cloud during high capacity and return onsite when demand drops. In the Cloud DRAGEN in the cloud enables customers to utilize the same enhanced speed, accuracy, and cost savings that DRAGEN onsite provides, but with the flexibility and scalability of the AWS cloud. Through industry-leading partnerships with Illumina s BaseSpace Sequence Hub, DNAnexus, AWS Marketplace, and Seven Bridges Genomics, customers can run DRAGEN on their preferred platform whenever needed. DRAGEN on the cloud is enabled by Amazon s FPGA instances (EC2 F1), and leverages the storage features of Amazon s S3 for secure storage of large amounts of data in the cloud.

5 DRAGEN in the Cloud DRAGEN in the cloud enables customers to utilize the same enhanced speed, accuracy and cost savings that onsite provides, but with the flexibility and scalability of the AWS cloud. Through industry-leading partnerships, customers can run DRAGEN on their preferred platform whenever needed. DRAGEN on the cloud is enabled by Amazon Elastic Compute Cloud (Amazon EC2) F1 instance type, and leverages the storage features of Amazon s S3 for secure storage of large amounts of data in the cloud. AWS Marketplace is an online platform where customers can sell or buy software that runs on Amazon Web Services (AWS). AWS allows customers to simplify and securely scale genomic analysis, building a bridge from their existing on-premise infrastructure to the cloud. Combined with the power of DRAGEN, the AWS Marketplace enables customers to accelerate their secondary analysis of next-generation sequencing (NGS) data. BaseSpace Sequence Hub is Illumina's cloud-based genomics computing environment for next-generation sequencing (NGS) data management and analysis. Sequencing labs can store and share sequencing data, and researchers can simplify and accelerate NGS data analysis with push-button tools. BaseSpace Sequence Hub coupled with DRAGEN allows for the transition from sequencing to analysis with efficiency and ease, and is compatible with other BaseSpace Sequence Hub applications. DNAnexus provides a global network for sharing and managing genomic data and tools to accelerate genomics. The DNAnexus platform is optimized to address the challenges of security, scalability, and collaboration for organizations that are pursuing translational informatics genomic-based approaches to health, in the clinic and in the research lab. The availability of DRAGEN on DNAnexus enables new and existing DNAnexus customers to reduce turnaround times and costs directly from the online platform. For more information, visit

6 DRAGEN Ultra-Rapid Analysis Pipelines Edico Genome s continuously expanding and improving series of pipelines can be run on a single DRAGEN platform. Germline V2 The DRAGEN Germline V2 Pipeline provides clinical-grade, end-to-end (BCL VCF) NGS analysis, including advanced error model calibration for increased accuracy. Somatic The DRAGEN Somatic Pipeline includes tumor-only and tumor/normal pipelines, designed for use in detecting even rare mutations that may contribute to cancer formation and growth. Transcriptome/RNA Gene Fusion Detection The DRAGEN Transcriptome Pipeline performs transcriptome analysis with splice junction alignment. The pipeline offers multiple operating modes, including reference-only alignment and annotation-assisted. Joint Genotyping/Population Calling The DRAGEN Joint Genotyping Pipeline calls variants from extremely large datasets at a speed 25x faster than competing pipelines with uncompromising accuracy. CNV The DRAGEN Copy Number Variants (CNV) Pipeline performs CNV analysis for germline and somatic exomes and genomes. Various levels of filtering can be applied to mitigate false positives before emitting the final calls. Methylation The DRAGEN M e t hyla t i o n Pipeline offers multiple operating modes, including reference-only alignment and annotation-assisted. VLRD The DRAGEN Virtual Long Read Detection (VLRD) pipeline allows for the detection of variants in segmental duplication regions that are otherwise only detectable by the use of long read sequencing. Map + Align Map + Align The DRAGEN Map + Align pipeline is capable of ultra-rapid mapping and aligning DNA and RNA for both exomes and genomes. GATK BestPractices Pipeline The GATK Best Practices Pipeline is an accelerated implementation of the open-sourced GATK 4.0 Haplotype Variant Caller from the Broad Institute on the DRAGEN platform.

7 DRAGEN G3 Compression Technology Lossless data compression in near-real time Edico Genome s FPGA-based G3 reduces data size by up to 80 percent, significantly reducing customers storage costs. DRAGEN is able to regenerate the original files on the fly with 100% accuracy, enabling a significantly smaller storage footprint and increasing the capacity of existing data storage. G3 compression technology is transparent to the user, who is able to see the attributes of all the original data files, including BAMs, FASTQs, and VCFs, even though some of the data is compressed. Significantly reduces storage costs Seamless compression and storage Available onsite and in the Cloud Hardware accelerated, genomics specific compression Auto compression/decompression via control layer Lossless compression for bit-exact file recovery, or lossy compression for reduced storage footprint DRAGEN Workflow Management System Production-Scale Automation For Complex Genomics Workflows Edico Genome s Workflow Management System (WMS) enables users to easily automate their pipelines and workflows on the DRAGEN Platform. The WMS integrates sequencing instruments, storage resources, compute resources, and workflows with the user s Laboratory Information Management System (LIMS), creating a fully automated, end-to-end process. Users have complete transparency of their samples, lanes, and analyses. The WMS is comprehensive in terms of functional capabilities, and users can effortlessly automate their workflows with data from any number of sequencing instruments (of any type) and any number of samples spread across a vast number of flow cells and lanes.

8 Join the Genomic Revolution DRAGEN provides ultra-rapid secondary analysis in ~20 minutes, enabling customers to focus their time and money on what matters most delivering life-changing answers. DRAGEN is utilized by leading sequencing centers, labs, and hospitals around the world, including About Edico Genome At Edico Genome, we are helping to usher in the new era of personalized medicine by enabling a fundamental shift in healthcare, with customized treatments and data-driven insights tailored to the individual. At the heart of personalized medicine, DNA sequencing technology is advancing at an more rapid rate than cell phone technology. By increasing the speed and accuracy of NGS data analysis, such as whole genome sequencing (WGS) data, our computing platform makes it easier to discover links between DNA sequence variations and diseases. info@edicogenome.com

Mar. Guide. Edico Genome Inc North Torrey Pines Court, Plaza Level, La Jolla, CA 92037

Mar. Guide.  Edico Genome Inc North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Mar 2017 DRAGEN TM Quick Start Guide www.edicogenome.com info@edicogenome.com Edico Genome Inc. 3344 North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Notice Contents of this document and associated

More information

Sep. Guide. Edico Genome Corp North Torrey Pines Court, Plaza Level, La Jolla, CA 92037

Sep. Guide.  Edico Genome Corp North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Sep 2017 DRAGEN TM Quick Start Guide www.edicogenome.com info@edicogenome.com Edico Genome Corp. 3344 North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Notice Contents of this document and associated

More information

Lessons Learned during Illumina s Secure DevOps Transition Kenneth G. Hartman Associate Director Cloud Products Security

Lessons Learned during Illumina s Secure DevOps Transition Kenneth G. Hartman Associate Director Cloud Products Security Lessons Learned during Illumina s Secure DevOps Transition Kenneth G. Hartman Associate Director Cloud Products Security 2018 Illumina, Inc. All rights reserved. QB Document #6608 At a Glance 2 https://www.illumina.com/company/news-center/feature-articles/illumina-at-a-glance.html

More information

Exome sequencing. Jong Kyoung Kim

Exome sequencing. Jong Kyoung Kim Exome sequencing Jong Kyoung Kim Genome Analysis Toolkit The GATK is the industry standard for identifying SNPs and indels in germline DNA and RNAseq data. Its scope is now expanding to include somatic

More information

Genomics on Cisco Metacloud + SwiftStack

Genomics on Cisco Metacloud + SwiftStack Genomics on Cisco Metacloud + SwiftStack Technology is a large component of driving discovery in both research and providing timely answers for clinical treatments. Advances in genomic sequencing have

More information

Falcon Accelerated Genomics Data Analysis Solutions. User Guide

Falcon Accelerated Genomics Data Analysis Solutions. User Guide Falcon Accelerated Genomics Data Analysis Solutions User Guide Falcon Computing Solutions, Inc. Version 1.0 3/30/2018 Table of Contents Introduction... 3 System Requirements and Installation... 4 Software

More information

AWS Marketplace Quick Start Guide

AWS Marketplace Quick Start Guide Sep 2017 AWS Marketplace Quick Start Guide www.edicogenome.com info@edicogenome.com Edico Genome Inc. 3344 North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Contents 1 Getting started... 2 1.1

More information

BaseSpace - MiSeq Reporter Software v2.4 Release Notes

BaseSpace - MiSeq Reporter Software v2.4 Release Notes Page 1 of 5 BaseSpace - MiSeq Reporter Software v2.4 Release Notes For MiSeq Systems Connected to BaseSpace June 2, 2014 Revision Date Description of Change A May 22, 2014 Initial Version Revision History

More information

Mar. EDICO GENOME CORP North Torrey Pines Court, Plaza Level, La Jolla, CA 92037

Mar.  EDICO GENOME CORP North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Mar 2017 DRAGEN TM User Guide www.edicogenome.com EDICO GENOME CORP. 3344 North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Notice The information disclosed in this User Guide and associated software

More information

BaseSpace Variant Interpreter Release Notes

BaseSpace Variant Interpreter Release Notes Document ID: EHAD_RN_010220118_0 Release Notes External v.2.4.1 (KN:v1.2.24) Release Date: Page 1 of 7 BaseSpace Variant Interpreter Release Notes BaseSpace Variant Interpreter v2.4.1 FOR RESEARCH USE

More information

REPORT. NA12878 Platinum Genome. GENALICE MAP Analysis Report. Bas Tolhuis, PhD GENALICE B.V.

REPORT. NA12878 Platinum Genome. GENALICE MAP Analysis Report. Bas Tolhuis, PhD GENALICE B.V. REPORT NA12878 Platinum Genome GENALICE MAP Analysis Report Bas Tolhuis, PhD GENALICE B.V. INDEX EXECUTIVE SUMMARY...4 1. MATERIALS & METHODS...5 1.1 SEQUENCE DATA...5 1.2 WORKFLOWS......5 1.3 ACCURACY

More information

Mapping NGS reads for genomics studies

Mapping NGS reads for genomics studies Mapping NGS reads for genomics studies Valencia, 28-30 Sep 2015 BIER Alejandro Alemán aaleman@cipf.es Genomics Data Analysis CIBERER Where are we? Fastq Sequence preprocessing Fastq Alignment BAM Visualization

More information

Nov. EDICO GENOME CORP North Torrey Pines Court, Plaza Level, La Jolla, CA 92037

Nov.  EDICO GENOME CORP North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Nov 2017 DRAGEN TM User Guide www.edicogenome.com EDICO GENOME CORP. 3344 North Torrey Pines Court, Plaza Level, La Jolla, CA 92037 Notice The information disclosed in this User Guide and associated software

More information

Kelly et al. Genome Biology (2015) 16:6 DOI /s x. * Correspondence:

Kelly et al. Genome Biology (2015) 16:6 DOI /s x. * Correspondence: Kelly et al. Genome Biology (215) 16:6 DOI 1.1186/s1359-14-577-x METHOD Open Access Churchill: an ultra-fast, deterministic, highly scalable and balanced parallelization strategy for the discovery of human

More information

Cloud Confidence: Simple Seamless Secure. Dell EMC Data Protection for VMware Cloud on AWS

Cloud Confidence: Simple Seamless Secure. Dell EMC Data Protection for VMware Cloud on AWS Cloud Confidence: Simple Seamless Secure Dell EMC Data Protection for VMware Cloud on AWS Introduction From the boardroom to the data center, digital transformation has become a business imperative. Whether

More information

Tutorial. Identification of somatic variants in a matched tumor-normal pair. Sample to Insight. November 21, 2017

Tutorial. Identification of somatic variants in a matched tumor-normal pair. Sample to Insight. November 21, 2017 Identification of somatic variants in a matched tumor-normal pair November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com

More information

Tutorial. Identification of Variants in a Tumor Sample. Sample to Insight. November 21, 2017

Tutorial. Identification of Variants in a Tumor Sample. Sample to Insight. November 21, 2017 Identification of Variants in a Tumor Sample November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com

More information

NA12878 Platinum Genome GENALICE MAP Analysis Report

NA12878 Platinum Genome GENALICE MAP Analysis Report NA12878 Platinum Genome GENALICE MAP Analysis Report Bas Tolhuis, PhD Jan-Jaap Wesselink, PhD GENALICE B.V. INDEX EXECUTIVE SUMMARY...4 1. MATERIALS & METHODS...5 1.1 SEQUENCE DATA...5 1.2 WORKFLOWS......5

More information

Intro to NGS Tutorial

Intro to NGS Tutorial Intro to NGS Tutorial Release 8.6.0 Golden Helix, Inc. October 31, 2016 Contents 1. Overview 2 2. Import Variants and Quality Fields 3 3. Quality Filters 10 Generate Alternate Read Ratio.........................................

More information

Digital Enterprise Platform for Live Business. Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU

Digital Enterprise Platform for Live Business. Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU Digital Enterprise Platform for Live Business Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU Rethinking the Future Competing in today s marketplace means leveraging

More information

Sentieon Documentation

Sentieon Documentation Sentieon Documentation Release 201808.03 Sentieon, Inc Dec 21, 2018 Sentieon Manual 1 Introduction 1 1.1 Description.............................................. 1 1.2 Benefits and Value..........................................

More information

Biomedical Genomics Workbench APPLICATION BASED MANUAL

Biomedical Genomics Workbench APPLICATION BASED MANUAL Biomedical Genomics Workbench APPLICATION BASED MANUAL Manual for Biomedical Genomics Workbench 4.0 Windows, Mac OS X and Linux January 23, 2017 This software is for research purposes only. QIAGEN Aarhus

More information

Decrypting your genome data privately in the cloud

Decrypting your genome data privately in the cloud Decrypting your genome data privately in the cloud Marc Sitges Data Manager@Made of Genes @madeofgenes The Human Genome 3.200 M (x2) Base pairs (bp) ~20.000 genes (~30%) (Exons ~1%) The Human Genome Project

More information

Galaxy Platform For NGS Data Analyses

Galaxy Platform For NGS Data Analyses Galaxy Platform For NGS Data Analyses Weihong Yan wyan@chem.ucla.edu Collaboratory Web Site http://qcb.ucla.edu/collaboratory Collaboratory Workshops Workshop Outline ü Day 1 UCLA galaxy and user account

More information

Tutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017

Tutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017 Identification of Variants Using GATK November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com

More information

ITMO Ecole de Bioinformatique Hands-on session: smallrna-seq N. Servant 21 rd November 2013

ITMO Ecole de Bioinformatique Hands-on session: smallrna-seq N. Servant 21 rd November 2013 ITMO Ecole de Bioinformatique Hands-on session: smallrna-seq N. Servant 21 rd November 2013 1. Data and objectives We will use the data from GEO (GSE35368, Toedling, Servant et al. 2011). Two samples were

More information

Turning Data Science into a reality with TIBCO Spotfire

Turning Data Science into a reality with TIBCO Spotfire Turning Data Science into a reality with TIBCO Spotfire Eduardo Gonzalez-Couto, Ph.D. Product Manager, PerkinElmer Informatics Basel, 3 rd November 2016 Safe Harbor Statement This document shows current

More information

TECH NOTE Improving the Sensitivity of Ultra Low Input mrna Seq

TECH NOTE Improving the Sensitivity of Ultra Low Input mrna Seq TECH NOTE Improving the Sensitivity of Ultra Low Input mrna Seq SMART Seq v4 Ultra Low Input RNA Kit for Sequencing Powered by SMART and LNA technologies: Locked nucleic acid technology significantly improves

More information

CORPORATE PERFORMANCE IMPROVEMENT DOES CLOUD MEAN THE PRIVATE DATA CENTER IS DEAD?

CORPORATE PERFORMANCE IMPROVEMENT DOES CLOUD MEAN THE PRIVATE DATA CENTER IS DEAD? CORPORATE PERFORMANCE IMPROVEMENT DOES CLOUD MEAN THE PRIVATE DATA CENTER IS DEAD? DOES CLOUD MEAN THE PRIVATE DATA CENTER IS DEAD? MASS MIGRATION: SHOULD ALL COMPANIES MOVE TO THE CLOUD? Achieving digital

More information

NGS Data Analysis. Roberto Preste

NGS Data Analysis. Roberto Preste NGS Data Analysis Roberto Preste 1 Useful info http://bit.ly/2r1y2dr Contacts: roberto.preste@gmail.com Slides: http://bit.ly/ngs-data 2 NGS data analysis Overview 3 NGS Data Analysis: the basic idea http://bit.ly/2r1y2dr

More information

Tumor-Specific NeoAntigen Detector (TSNAD) v2.0 User s Manual

Tumor-Specific NeoAntigen Detector (TSNAD) v2.0 User s Manual Tumor-Specific NeoAntigen Detector (TSNAD) v2.0 User s Manual Zhan Zhou, Xingzheng Lyu and Jingcheng Wu Zhejiang University, CHINA March, 2016 USER'S MANUAL TABLE OF CONTENTS 1 GETTING STARTED... 1 1.1

More information

The software comes with 2 installers: (1) SureCall installer (2) GenAligners (contains BWA, BWA- MEM).

The software comes with 2 installers: (1) SureCall installer (2) GenAligners (contains BWA, BWA- MEM). Release Notes Agilent SureCall 4.0 Product Number G4980AA SureCall Client 6-month named license supports installation of one client and server (to host the SureCall database) on one machine. For additional

More information

Copy Number Variations Detection - TD. Using Sequenza under Galaxy

Copy Number Variations Detection - TD. Using Sequenza under Galaxy Copy Number Variations Detection - TD Using Sequenza under Galaxy I. Data loading We will analyze the copy number variations of a human tumor (parotid gland carcinoma), limited to the chr17, from a WES

More information

Supplementary Information. Detecting and annotating genetic variations using the HugeSeq pipeline

Supplementary Information. Detecting and annotating genetic variations using the HugeSeq pipeline Supplementary Information Detecting and annotating genetic variations using the HugeSeq pipeline Hugo Y. K. Lam 1,#, Cuiping Pan 1, Michael J. Clark 1, Phil Lacroute 1, Rui Chen 1, Rajini Haraksingh 1,

More information

Cisco Cloud Services Router 1000V and Amazon Web Services CASE STUDY

Cisco Cloud Services Router 1000V and Amazon Web Services CASE STUDY Cisco Cloud Services Router 1000V and Amazon Web Services CASE STUDY CASE STUDY ADOBE 2 About Adobe Adobe Systems provides digital media and marketing solutions to customers around the world including

More information

Analyzing massive genomics datasets using Databricks Frank Austin Nothaft,

Analyzing massive genomics datasets using Databricks Frank Austin Nothaft, Analyzing massive genomics datasets using Databricks Frank Austin Nothaft, PhD frank.nothaft@databricks.com @fnothaft VISION Accelerate innovation by unifying data science, engineering and business PRODUCT

More information

Terabases of long-read sequence data, analysed in real time

Terabases of long-read sequence data, analysed in real time Terabases of long-read sequence data, analysed in real time The PromethION is a real game changer. Combining ultra-long reads with high sequence output for the production of contiguous, highquality reference

More information

Financed by the European Commission 7 th Framework Programme. biobankcloud.eu. Jim Dowling, PhD Assoc. Prof, KTH Project Coordinator

Financed by the European Commission 7 th Framework Programme. biobankcloud.eu. Jim Dowling, PhD Assoc. Prof, KTH Project Coordinator Financed by the European Commission 7 th Framework Programme. biobankcloud.eu Jim Dowling, PhD Assoc. Prof, KTH Project Coordinator The Biobank Bottleneck We will soon be generating massive amounts of

More information

EBOOK: Backup & Recovery on AWS

EBOOK: Backup & Recovery on AWS EBOOK: Backup & Recovery on AWS Contents Backup and Recovery on AWS... 2 AWS Object Storage Services... 3 Featured Backup and Recovery Providers... 5 APN Storage Partner Benefits on AWS... 6 King County,

More information

Transform Health IT with Enterprise Cloud technologies Session 178, Feb 22, 2017, 11:30 am EST

Transform Health IT with Enterprise Cloud technologies Session 178, Feb 22, 2017, 11:30 am EST Transform Health IT with Enterprise Cloud technologies Session 178, Feb 22, 2017, 11:30 am EST Sanjay Maru, Director, Enterprise Architecture Preethy Padman, Head of Healthcare Marketing 1 Speaker Introduction

More information

SoftNAS Cloud Data Management Products for AWS Add Breakthrough NAS Performance, Protection, Flexibility

SoftNAS Cloud Data Management Products for AWS Add Breakthrough NAS Performance, Protection, Flexibility Control Any Data. Any Cloud. Anywhere. SoftNAS Cloud Data Management Products for AWS Add Breakthrough NAS Performance, Protection, Flexibility Understanding SoftNAS Cloud SoftNAS, Inc. is the #1 software-defined

More information

Using Cohesity with Amazon Web Services (AWS)

Using Cohesity with Amazon Web Services (AWS) Using Cohesity with Amazon Web Services (AWS) Achieve your long-term retention and archival objectives for secondary data Cohesity DataPlatform is a hyperconverged secondary data and application solution

More information

The software comes with 2 installers: (1) SureCall installer (2) GenAligners (contains BWA, BWA-MEM).

The software comes with 2 installers: (1) SureCall installer (2) GenAligners (contains BWA, BWA-MEM). Release Notes Agilent SureCall 3.5 Product Number G4980AA SureCall Client 6-month named license supports installation of one client and server (to host the SureCall database) on one machine. For additional

More information

Innovative DSPLL and MultiSynth Clock Architecture Enables High-Density 10/40/100G Line Card Designs

Innovative DSPLL and MultiSynth Clock Architecture Enables High-Density 10/40/100G Line Card Designs Innovative and MultiSynth Clock Architecture Enables High-Density 10/40/100G Line Card Designs Introduction The insatiable demand for bandwidth to support applications such as video streaming and cloud

More information

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS By George Crump Economical, Storage Purpose-Built for the Emerging Data Centers Most small, growing businesses start as a collection of laptops

More information

High-throughout sequencing and using short-read aligners. Simon Anders

High-throughout sequencing and using short-read aligners. Simon Anders High-throughout sequencing and using short-read aligners Simon Anders High-throughput sequencing (HTS) Sequencing millions of short DNA fragments in parallel. a.k.a.: next-generation sequencing (NGS) massively-parallel

More information

Expression Analysis with the Advanced RNA-Seq Plugin

Expression Analysis with the Advanced RNA-Seq Plugin Expression Analysis with the Advanced RNA-Seq Plugin May 24, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com

More information

The ISB Cancer Genomics Cloud

The ISB Cancer Genomics Cloud The ISB Cancer Genomics Cloud www.isb-cgc.org David L Gibbs david.gibbs@systemsbiology.org April 3, 2018 ISB-CGC Mission is to democratize the NCI cancer genomics data sets, with tools and compute-power,

More information

AWS Reference Design Document

AWS Reference Design Document AWS Reference Design Document Contents Overview... 1 Amazon Web Services (AWS), Public Cloud and the New Security Challenges... 1 Security at the Speed of DevOps... 2 Securing East-West and North-South

More information

Accelerate your SAS analytics to take the gold

Accelerate your SAS analytics to take the gold Accelerate your SAS analytics to take the gold A White Paper by Fuzzy Logix Whatever the nature of your business s analytics environment we are sure you are under increasing pressure to deliver more: more

More information

10 QUESTIONS, 10 ANSWERS. Get to know VMware Cloud on AWS The Best-in-Class Hybrid Cloud Service

10 QUESTIONS, 10 ANSWERS. Get to know VMware Cloud on AWS The Best-in-Class Hybrid Cloud Service 10 QUESTIONS, 10 ANSWERS Get to know VMware Cloud on AWS The Best-in-Class Hybrid Cloud Service GET TO KNOW VMWARE CLOUD ON AWS: THE BEST-IN-CLASS HYBRID CLOUD SERVICE IT leaders focused on finding ways

More information

Aeromancer: A Workflow Manager for Large- Scale MapReduce-Based Scientific Workflows

Aeromancer: A Workflow Manager for Large- Scale MapReduce-Based Scientific Workflows Aeromancer: A Workflow Manager for Large- Scale MapReduce-Based Scientific Workflows Presented by Sarunya Pumma Supervisors: Dr. Wu-chun Feng, Dr. Mark Gardner, and Dr. Hao Wang synergy.cs.vt.edu Outline

More information

SMART. Investing in urban innovation

SMART. Investing in urban innovation SMART Investing in urban innovation What Smart Belfast? Belfast has ambitious plans for the future. Building on our economic revival, we want to make our city an outstanding place to live, work and invest.

More information

De-dupe: It s not a question of if, rather where and when! What to Look for and What to Avoid

De-dupe: It s not a question of if, rather where and when! What to Look for and What to Avoid De-dupe: It s not a question of if, rather where and when! What to Look for and What to Avoid By Greg Schulz Founder and Senior Analyst, the StorageIO Group Author The Green and Virtual Data Center (CRC)

More information

NGS Data Visualization and Exploration Using IGV

NGS Data Visualization and Exploration Using IGV 1 What is Galaxy Galaxy for Bioinformaticians Galaxy for Experimental Biologists Using Galaxy for NGS Analysis NGS Data Visualization and Exploration Using IGV 2 What is Galaxy Galaxy for Bioinformaticians

More information

Tutorial. Find Very Low Frequency Variants With QIAGEN GeneRead Panels. Sample to Insight. November 21, 2017

Tutorial. Find Very Low Frequency Variants With QIAGEN GeneRead Panels. Sample to Insight. November 21, 2017 Find Very Low Frequency Variants With QIAGEN GeneRead Panels November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com

More information

NGF0502 AWS Student Slides

NGF0502 AWS Student Slides NextGen Firewall AWS Use Cases Barracuda NextGen Firewall F Implementation Guide Architectures and Deployments Based on four use cases Edge Firewall Secure Remote Access Office to Cloud / Hybrid Cloud

More information

Transforming Management for Modern Scale-Out Infrastructure

Transforming Management for Modern Scale-Out Infrastructure Transforming Management for Modern Scale-Out Infrastructure Software is eating the world. Marc Andreessen, WSJ 2011 That famous quote from venture capitalist and Netscape founder Marc Andreessen was true

More information

Modernize Your Infrastructure

Modernize Your Infrastructure Modernize Your Infrastructure The next industrial revolution 3 Dell - Internal Use - Confidential It s a journey. 4 Dell - Internal Use - Confidential Last 15 years IT-centric Systems of record Traditional

More information

DELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY Unstructured data storage made simple

DELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY Unstructured data storage made simple SCALE-OUT NAS PRODUCT FAMILY Unstructured data storage made simple ESSENTIALS Simple storage management designed for ease of use Massive scalability of capacity and performance Unmatched efficiency to

More information

Cisco Cloud Application Centric Infrastructure

Cisco Cloud Application Centric Infrastructure Cisco Cloud Application Centric Infrastructure About Cisco cloud application centric infrastructure Cisco Cloud Application Centric Infrastructure (Cisco Cloud ACI) is a comprehensive solution for simplified

More information

a measurable difference

a measurable difference Thermo Scientific Cell Analysis Software and Informatics Products a measurable difference for everyone Thermo Scientific HCS Studio Cell Analysis Software Thermo Scientific Store Image and Database Management

More information

Amazon Web Services. Foundational Services for Research Computing. April Mike Kuentz, WWPS Solutions Architect

Amazon Web Services. Foundational Services for Research Computing. April Mike Kuentz, WWPS Solutions Architect Amazon Web Services Foundational Services for Research Computing Mike Kuentz, WWPS Solutions Architect April 2017 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Global Infrastructure

More information

Security Overview of the BGI Online Platform

Security Overview of the BGI Online Platform WHITEPAPER 2015 BGI Online All rights reserved Version: Draft v3, April 2015 Security Overview of the BGI Online Platform Data security is, in general, a very important aspect in computing. We put extra

More information

Accelerate Your Enterprise Private Cloud Initiative

Accelerate Your Enterprise Private Cloud Initiative Cisco Cloud Comprehensive, enterprise cloud enablement services help you realize a secure, agile, and highly automated infrastructure-as-a-service (IaaS) environment for cost-effective, rapid IT service

More information

HiSeq Instrument Software Release Notes

HiSeq Instrument Software Release Notes HiSeq Instrument Software Release Notes HCS v2.0.12 RTA v1.17.21.3 Recipe Fragments v1.3.61 Illumina BaseSpace Broker v2.0.13022.1628 SAV v1.8.20 For HiSeq 2000 and HiSeq 1000 Systems FOR RESEARCH USE

More information

SOLUTION BRIEF Fulfill the promise of the cloud

SOLUTION BRIEF Fulfill the promise of the cloud SOLUTION BRIEF Fulfill the promise of the cloud NetApp Solutions for Amazon Web Services Fulfill the promise of the cloud NetApp Cloud Volumes Service for AWS: Move and manage more workloads faster Many

More information

Analytics in the Cloud Mandate or Option?

Analytics in the Cloud Mandate or Option? Analytics in the Cloud Mandate or Option? Rick Lower Sr. Director of Analytics Alliances Teradata 1 The SAS & Teradata Partnership Overview Partnership began in 2007 to improving analytic performance Teradata

More information

ADNI Sequencing Working Group. Robert C. Green, MD, MPH Andrew J. Saykin, PsyD Arthur Toga, PhD

ADNI Sequencing Working Group. Robert C. Green, MD, MPH Andrew J. Saykin, PsyD Arthur Toga, PhD ADNI Sequencing Working Group Robert C. Green, MD, MPH Andrew J. Saykin, PsyD Arthur Toga, PhD Why sequencing? V V V V V V V V V V V V V A fortuitous relationship TIME s Best Invention of 2008 The initial

More information

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director Hyper-Convergence De-mystified Francis O Haire Group Technology Director The Cloud Era Is Well Underway Rapid Time to Market I deployed my application in five minutes. Fractional IT Consumption I use and

More information

Super-Fast Genome BWA-Bam-Sort on GLAD

Super-Fast Genome BWA-Bam-Sort on GLAD 1 Hututa Technologies Limited Super-Fast Genome BWA-Bam-Sort on GLAD Zhiqiang Ma, Wangjun Lv and Lin Gu May 2016 1 2 Executive Summary Aligning the sequenced reads in FASTQ files and converting the resulted

More information

Accelerate Big Data Insights

Accelerate Big Data Insights Accelerate Big Data Insights Executive Summary An abundance of information isn t always helpful when time is of the essence. In the world of big data, the ability to accelerate time-to-insight can not

More information

Connecting the Healthcare Dots to Enable Precision Medicine

Connecting the Healthcare Dots to Enable Precision Medicine Connecting the Healthcare Dots to Enable Precision Medicine Enabling SAP Health services with Dell EMC Isilon Data Lake in a Hadoop environment ABSTRACT Dell EMC offers an end-to-end solution for the personalized

More information

PRACTICAL SESSION 5 GOTCLOUD ALIGNMENT WITH BWA JAN 7 TH, 2014 STOM 2014 WORKSHOP HYUN MIN KANG UNIVERSITY OF MICHIGAN, ANN ARBOR

PRACTICAL SESSION 5 GOTCLOUD ALIGNMENT WITH BWA JAN 7 TH, 2014 STOM 2014 WORKSHOP HYUN MIN KANG UNIVERSITY OF MICHIGAN, ANN ARBOR PRACTICAL SESSION 5 GOTCLOUD ALIGNMENT WITH BWA JAN 7 TH, 2014 STOM 2014 WORKSHOP HYUN MIN KANG UNIVERSITY OF MICHIGAN, ANN ARBOR GOAL OF THIS SESSION Assuming that The audiences know how to perform GWAS

More information

Semi-Structured Data Management (CSE 511)

Semi-Structured Data Management (CSE 511) Semi-Structured Data Management (CSE 511) Note: Below outline is subject to modifications and updates. About this Course Database systems are used to provide convenient access to disk-resident data through

More information

Data Processing at Scale (CSE 511)

Data Processing at Scale (CSE 511) Data Processing at Scale (CSE 511) Note: Below outline is subject to modifications and updates. About this Course Database systems are used to provide convenient access to disk-resident data through efficient

More information

Making hybrid IT simple with Capgemini and Microsoft Azure Stack

Making hybrid IT simple with Capgemini and Microsoft Azure Stack Making hybrid IT simple with Capgemini and Microsoft Azure Stack The significant evolution of cloud computing in the last few years has encouraged IT leaders to rethink their enterprise cloud strategy.

More information

AUTOMATE THE DEPLOYMENT OF SECURE DEVELOPER VPCs

AUTOMATE THE DEPLOYMENT OF SECURE DEVELOPER VPCs AUTOMATE THE DEPLOYMENT OF SECURE DEVELOPER VPCs WITH PALO ALTO NETWORKS AND REAN CLOUD 1 INTRODUCTION EXECUTIVE SUMMARY Organizations looking to provide developers with a free-range development environment

More information

儲存網路, 與時俱進 JASON LIN SENIOR TECHNICAL CONSULTANT BROCADE, TAIWAN

儲存網路, 與時俱進 JASON LIN SENIOR TECHNICAL CONSULTANT BROCADE, TAIWAN 儲存網路, 與時俱進 JASON LIN SENIOR TECHNICAL CONSULTANT BROCADE, TAIWAN 1 Your Trusted Network for the Modern Data Center HISTORY OF INNOVATION 18+ Year Partnership Joint Engineering, Roadmap, and Go-to-Market

More information

RNA-Seq analysis with Astrocyte Differential expression and transcriptome assembly

RNA-Seq analysis with Astrocyte Differential expression and transcriptome assembly RNA-Seq analysis with Astrocyte Differential expression and transcriptome assembly Beibei Chen Ph.D BICF 9/28/2016 Agenda Launch Workflows using Astrocyte BICF Workflows BICF RNA-seq Workflow Experimental

More information

Data Protection Done Right with Dell EMC and AWS

Data Protection Done Right with Dell EMC and AWS Data Protection Done Right with Dell EMC and AWS There is a growing need for superior data protection Introduction If you re like most businesses, your data is continually growing in speed and scale, and

More information

Introduction to Galaxy

Introduction to Galaxy Introduction to Galaxy Dr Jason Wong Prince of Wales Clinical School Introductory bioinformatics for human genomics workshop, UNSW Day 1 Thurs 28 th January 2016 Overview What is Galaxy? Description of

More information

Sequence Mapping and Assembly

Sequence Mapping and Assembly Practical Introduction Sequence Mapping and Assembly December 8, 2014 Mary Kate Wing University of Michigan Center for Statistical Genetics Goals of This Session Learn basics of sequence data file formats

More information

CBSU/3CPG/CVG Joint Workshop Series Reference genome based sequence variation detection

CBSU/3CPG/CVG Joint Workshop Series Reference genome based sequence variation detection CBSU/3CPG/CVG Joint Workshop Series Reference genome based sequence variation detection Computational Biology Service Unit (CBSU) Cornell Center for Comparative and Population Genomics (3CPG) Center for

More information

An Introduction to VariantTools

An Introduction to VariantTools An Introduction to VariantTools Michael Lawrence, Jeremiah Degenhardt January 25, 2018 Contents 1 Introduction 2 2 Calling single-sample variants 2 2.1 Basic usage..............................................

More information

Tech Brief Wasabi Hot Storage for Video Content

Tech Brief Wasabi Hot Storage for Video Content Wasabi Hot Storage for Video Content Incredibly Cost-Effective Cloud Storage for Production, Analysis, Playback and Backup Executive Overview Wasabi is fundamentally transforming cloud storage with the

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

BGGN-213: FOUNDATIONS OF BIOINFORMATICS (Lecture 14)

BGGN-213: FOUNDATIONS OF BIOINFORMATICS (Lecture 14) BGGN-213: FOUNDATIONS OF BIOINFORMATICS (Lecture 14) Genome Informatics (Part 1) https://bioboot.github.io/bggn213_f17/lectures/#14 Dr. Barry Grant Nov 2017 Overview: The purpose of this lab session is

More information

EBOOK: VMware Cloud on AWS: Optimized for the Next-Generation Hybrid Cloud

EBOOK: VMware Cloud on AWS: Optimized for the Next-Generation Hybrid Cloud EBOOK: VMware Cloud on AWS: Optimized for the Next-Generation Hybrid Cloud Contents Introduction... 3 What is VMware Cloud on AWS?... 5 Customer Benefits of Adopting VMware Cloud on AWS... 6 VMware Cloud

More information

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research

The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research The Cambridge Bio-Medical-Cloud An OpenStack platform for medical analytics and biomedical research Dr Paul Calleja Director of Research Computing University of Cambridge Global leader in science & technology

More information

SDACCEL DEVELOPMENT ENVIRONMENT. The Xilinx SDAccel Development Environment. Bringing The Best Performance/Watt to the Data Center

SDACCEL DEVELOPMENT ENVIRONMENT. The Xilinx SDAccel Development Environment. Bringing The Best Performance/Watt to the Data Center SDAccel Environment The Xilinx SDAccel Development Environment Bringing The Best Performance/Watt to the Data Center Introduction Data center operators constantly seek more server performance. Currently

More information

A Closer Look at SERVER-SIDE RENDERING. Technology Overview

A Closer Look at SERVER-SIDE RENDERING. Technology Overview A Closer Look at SERVER-SIDE RENDERING Technology Overview Driven by server-based rendering, Synapse 5 is the fastest PACS in the medical industry, offering subsecond image delivery and diagnostic quality.

More information

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Unleash Your Data Center s Hidden Power September 16, 2014 Molly Rector CMO, EVP Product Management & WW Marketing

More information

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight ESG Lab Review InterSystems Data Platform: A Unified, Efficient Data Platform for Fast Business Insight Date: April 218 Author: Kerry Dolan, Senior IT Validation Analyst Abstract Enterprise Strategy Group

More information

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team Isilon: Raising The Bar On Performance & Archive Use Cases John Har Solutions Product Manager Unstructured Data Storage Team What we ll cover in this session Isilon Overview Streaming workflows High ops/s

More information

Industry-leading Application PaaS Platform

Industry-leading Application PaaS Platform Industry-leading Application PaaS Platform Solutions Transactional Apps Digital Marketing LoB App Modernization Services Web Apps Web App for Containers API Apps Mobile Apps IDE Enterprise Integration

More information

Farsight Genome Systems

Farsight Genome Systems Customer Success Story Farsight Genome Systems ClearDATA Helps Speed to Market Groundbreaking Genomic Testing Solution Page 2 of 5 Farsight Genome Systems ClearDATA Helps Speed to Market Groundbreaking

More information

DELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY

DELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY DATA SHEET DELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY Unstructured data storage made simple ESSENTIALS Simple storage management designed for ease of use Massive scalability of capacity and performance

More information

BaseSpace User Guide. Supporting the NextSeq, MiSeq, and HiSeq Sequencing Systems FOR RESEARCH USE ONLY

BaseSpace User Guide. Supporting the NextSeq, MiSeq, and HiSeq Sequencing Systems FOR RESEARCH USE ONLY BaseSpace User Guide Supporting the NextSeq, MiSeq, and HiSeq Sequencing Systems FOR RESEARCH USE ONLY Introduction 3 How Do I Start 8 BaseSpace User Interface 13 How To Use BaseSpace 25 Workflow Reference

More information

Big Linked Data ETL Benchmark on Cloud Commodity Hardware

Big Linked Data ETL Benchmark on Cloud Commodity Hardware Big Linked Data ETL Benchmark on Cloud Commodity Hardware iminds Ghent University Dieter De Witte, Laurens De Vocht, Ruben Verborgh, Erik Mannens, Rik Van de Walle Ontoforce Kenny Knecht, Filip Pattyn,

More information