SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data

Size: px
Start display at page:

Download "SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data"

Transcription

1 SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data SYNOPSIS SEQGWAS [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome DESCRIPTION SEQGWAS is a command-line program written in C/C++ for integrative analysis of sequencing and GWAS data. SEQGWAS produces all commonly used gene-level tests, including the burden test, variable threshold (VT) test, and sequence-kernel association test (SKAT), all of which are based on the score statistic for assessing the effects of individual variants on the trait of interest. SEQGWAS calculates the score statistic based on the observed genotypes for sequenced subjects and the imputed genotypes for non-sequenced subjects, and constructs a robust variance estimator that reflects the true variability of the score statistic regardless of the sampling scheme and imputation quality, so that the corresponding association tests always have correct type I error. We are working intensely to improve the capabilities of SEQGWAS, so please check back frequently for updates. INPUT FILES Specification File REGRESSION_MODEL = linear #linear/logistic SUBJECT_FILE =.//subject.dat SUBJECT_FILE_HEADER = TRUE SUBJECT_PHENOTYPE_COLUMN = 4 SUBJECT_COVARIATE_COLUMN = 2 3 SUBJECT_SEQUENCED_INDICATOR_COLUMN = 5 # optional VARIANT_FILE =.//variant_chr.dat 1

2 VARIANT_FILE_HEADER = TRUE VARIANT_ID_COLUMN = 2 VARIANT_POS_COLUMN = 1 VARIANT_FREQ_COLUMN = 5 VARIANT_RSQ_COLUMN = 8 # optional DOSAGE_FILE =.//dosage_chr.dat DOSAGE_FILE_HEADER = FALSE DOSAGE_FILE_SKIP_COLUMNS = 2 ANNOTATION_FILE =.//annotation_chr.dat ANNOTATION_FILE_HEADER = FALSE ANNOTATION_TYPE = SNP # SNP/gene ANNOTATION_POS_COLUMN = 2 ANNOTATION_ACCESSION_COLUMN = 3 ANNOTATION_FUNCTION_COLUMN = 4 ANNOTATION_GENE_COLUMN = 5 ANNOTATION_ID_COLUMN = 6 OUTPUT_FILE = results_chr.out MAF_CUTOFF = 0.05 The file describes the input/output files and the program parameters. The syntax follows KEYWORD = value1 [value2 ] with spaces around =. All the following lines are required unless otherwise stated as optional. REGRESSION_MODEL = linear/logistic Specify the regression model for genotype-phenotype association. SUBJECT_FILE = full_pathname SUBJECT_FILE_HEADER = TRUE/FALSE SUBJECT_PHENOTYPE_COLUMN = num Specify the column (starting with number 1) to be used as the phenotype. SUBJECT_COVARIATE_COLUMN = num_1 [num_2 ] Specify column(s) in the subject file to be used as covariates in the regression model. Optional. SUBJECT_SEQUENCED_INDICATOR_COLUMN = num Specify the column to be used as the indicator of whether the subject is sequenced. DOSAGE_FILE = prefix affix Specify the prefix and affix of the pathname. The program will insert the chromosome number (single digit for 1-9 and two digits for 10-23), specified by -chr, to obtain the 2

3 full pathname. For example, for the two strings in the example specification file, the dosage file for chromosome 1 is accessed through the pathname:.//dosage_chr1.dat DOSAGE_FILE_HEADER = TRUE/FALSE DOSAGE_FILE_SKIP_COLUMNS = num Skip the first num columns. VARIANT_FILE = prefix affix VARIANT_FILE_HEADER = TRUE/FALSE VARIANT_ID_COLUMN = num VARIANT_POS_COLUMN = num VARIANT_FREQ_COLUMN = num VARIANT_RSQ_COLUMN = num Optional. If not specified, the Rsq measure will be calculated internally. ANNOTATION_FILE = prefix affix ANNOTATION_FILE_HEADER = TRUE/FALSE ANNOTATION_TYPE = SNP Specify the format of the annotation file. Currently, only the value SNP is allowed. ANNOTATION_POS_COLUMN = num ANNOTATION_ACCESSION_COLUMN = num ANNOTATION_FUNCTION_COLUMN = num ANNOTATION_GENE_COLUMN = num ANNOTATION_ID_COLUMN = num OUTPUT_FILE = prefix affix MAF_CUTOFF = MAF_cutoff Only variants with MAFs MAF_CUTOFF are considered for analysis. All the data files are space- or tab-delimited and can allow for one header row (or no header row). Subject File GWAS_ID AfrIA age BMI sequenced

4 The file provides information on the phenotype, covariates, and sequencing indicator (indicating whether a subject is sequenced or not) for all subjects in the GWAS cohort. Each row is specific to an individual. The column for the phenotype and the sequencing indicator is required and those for the subject identifier and covariates are optional. In a case-control study, the disease variable should be coded 0/1 to represent unaffected/affected. Missing data are denoted as. or NA. Variant File pos SNP Al1 Al2 Freq1 MAF AvgCall Rsq snp C C snp A A snp C T e-05 The file provides information on the sequencing-identified variants as well as GWAS SNPs on the particular chromosome specified by --chr. Each row is specific to a SNP; the rows must be in genomic order. The columns for the position, SNP identifier, coding-allele frequency are required and the one for the Rsq measurement is optional. If the position of a SNP is missing, it should be denoted as. or NA and that SNP will be excluded from analysis. The SNP position will be used to link the SNPs in the variant and the annotation files, and thus should be comparible. Dosage File The file provides (imputed) genotypic dosages for all the subjects in the GWAS cohort. Each row pertains to a subject; the order of subjects must align with their orders in the subject file. Each column pertains to a SNP; the order of SNPs must align with their orders in the variant file. This file allows arbitraty number of columns in front of the main data body. 4

5 Annotation File NM_ utr-3 CBS snp NM_ utr-3 CBS snp NM_ utr-3 CBS snp NM_ missense CBS snp The file provides annotation information for the SNPs. The current version of SEQGWAS (v1.0) only allows the annotation format for SNPs. Specifically, each row pertains to a SNP; the rows must be grouped by the accession number. OUTPUT Output File chr index gene accession n_var Rsq_gene p_t1 p_t5 p_v p_skat LIPI NM_ e e e e TPTE NM_ e e e e-1 The file contains information on the number of variants included in each gene (n_var), the gene-averged Rsq (Rsq_gene), and the p-values of the burden test with the MAF threshold of 1% (T1) and 5% (T1), the variable threshold test (VT) and SKAT. EXAMPLE Download and unzip the software package. Enter the command $ SEQGWAS -sfile specification.txt -chr 21 to obtain the results given in results_chr21.out. REFERENCE Hu, Y.J., Li, Y., Auer, P,L. and Lin, D.Y. Integrative Analysis of Sequencing and GWAS Data for Rare Variant Associations. Submitted. 5

6 VERSION HISTORY v /03/04 First version released. 6

MAGA: Meta-Analysis of Gene-level Associations

MAGA: Meta-Analysis of Gene-level Associations MAGA: Meta-Analysis of Gene-level Associations SYNOPSIS MAGA [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome DESCRIPTION

More information

SUGEN 8.6 Overview. Misa Graff, July 2017

SUGEN 8.6 Overview. Misa Graff, July 2017 SUGEN 8.6 Overview Misa Graff, July 2017 General Information By Ran Tao, https://sites.google.com/site/dragontaoran/home Website: http://dlin.web.unc.edu/software/sugen/ Standalone command-line software

More information

Step-by-Step Guide to Basic Genetic Analysis

Step-by-Step Guide to Basic Genetic Analysis Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control

More information

Step-by-Step Guide to Advanced Genetic Analysis

Step-by-Step Guide to Advanced Genetic Analysis Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options

More information

Package lodgwas. R topics documented: November 30, Type Package

Package lodgwas. R topics documented: November 30, Type Package Type Package Package lodgwas November 30, 2015 Title Genome-Wide Association Analysis of a Biomarker Accounting for Limit of Detection Version 1.0-7 Date 2015-11-10 Author Ahmad Vaez, Ilja M. Nolte, Peter

More information

Association Analysis of Sequence Data using PLINK/SEQ (PSEQ)

Association Analysis of Sequence Data using PLINK/SEQ (PSEQ) Association Analysis of Sequence Data using PLINK/SEQ (PSEQ) Copyright (c) 2018 Stanley Hooker, Biao Li, Di Zhang and Suzanne M. Leal Purpose PLINK/SEQ (PSEQ) is an open-source C/C++ library for working

More information

MAGMA manual (version 1.06)

MAGMA manual (version 1.06) MAGMA manual (version 1.06) TABLE OF CONTENTS OVERVIEW 3 QUICKSTART 4 ANNOTATION 6 OVERVIEW 6 RUNNING THE ANNOTATION 6 ADDING AN ANNOTATION WINDOW AROUND GENES 7 RESTRICTING THE ANNOTATION TO A SUBSET

More information

Genetic Analysis. Page 1

Genetic Analysis. Page 1 Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced

More information

MAGMA manual (version 1.05)

MAGMA manual (version 1.05) MAGMA manual (version 1.05) TABLE OF CONTENTS OVERVIEW 3 QUICKSTART 4 ANNOTATION 6 OVERVIEW 6 RUNNING THE ANNOTATION 6 ADDING AN ANNOTATION WINDOW AROUND GENES 7 RESTRICTING THE ANNOTATION TO A SUBSET

More information

Intro to NGS Tutorial

Intro to NGS Tutorial Intro to NGS Tutorial Release 8.6.0 Golden Helix, Inc. October 31, 2016 Contents 1. Overview 2 2. Import Variants and Quality Fields 3 3. Quality Filters 10 Generate Alternate Read Ratio.........................................

More information

PreMeta GENERAL INFORMATION SYNOPSIS

PreMeta GENERAL INFORMATION SYNOPSIS PreMeta GENERAL INFORMATION PreMeta is a software program written in C++ that is designed to facilitate the exchange of information between four software packages for meta-analysis of rare-variant associations:

More information

Click on "+" button Select your VCF data files (see #Input Formats->1 above) Remove file from files list:

Click on + button Select your VCF data files (see #Input Formats->1 above) Remove file from files list: CircosVCF: CircosVCF is a web based visualization tool of genome-wide variant data described in VCF files using circos plots. The provided visualization capabilities, gives a broad overview of the genomic

More information

PreMeta GENERAL INFORMATION SYNOPSIS

PreMeta GENERAL INFORMATION SYNOPSIS PreMeta GENERAL INFORMATION PreMeta is a software program written in C++ that is designed to facilitate the exchange of information between four software packages for meta-analysis of rare-variant associations:

More information

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017 BICF Nano Course: GWAS GWAS Workflow Development using PLINK Julia Kozlitina Julia.Kozlitina@UTSouthwestern.edu April 28, 2017 Getting started Open the Terminal (Search -> Applications -> Terminal), and

More information

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS FVGWAS- 3.0 Manual Hongtu Zhu @ UNC BIAS Chao Huang @ UNC BIAS Nov 8, 2015 More and more large- scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical

More information

GMMAT: Generalized linear Mixed Model Association Tests Version 0.7

GMMAT: Generalized linear Mixed Model Association Tests Version 0.7 GMMAT: Generalized linear Mixed Model Association Tests Version 0.7 Han Chen Department of Biostatistics Harvard T.H. Chan School of Public Health Email: hanchen@hsph.harvard.edu Matthew P. Conomos Department

More information

Polymorphism and Variant Analysis Lab

Polymorphism and Variant Analysis Lab Polymorphism and Variant Analysis Lab Arian Avalos PowerPoint by Casey Hanson Polymorphism and Variant Analysis Matt Hudson 2018 1 Exercise In this exercise, we will do the following:. 1. Gain familiarity

More information

Package SMAT. January 29, 2013

Package SMAT. January 29, 2013 Package SMAT January 29, 2013 Type Package Title Scaled Multiple-phenotype Association Test Version 0.98 Date 2013-01-26 Author Lin Li, Ph.D.; Elizabeth D. Schifano, Ph.D. Maintainer Lin Li ;

More information

BOLT-LMM v1.2 User Manual

BOLT-LMM v1.2 User Manual BOLT-LMM v1.2 User Manual Po-Ru Loh November 4, 2014 Contents 1 Overview 2 1.1 Citing BOLT-LMM.................................. 2 2 Installation 2 2.1 Downloading reference LD Scores..........................

More information

PRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP

PRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP PRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP HYUN MIN KANG UNIVERSITY OF MICHIGAN, ANN ARBOR EPACTS ASSOCIATION ANALYSIS

More information

JMP Genomics. Release Notes. Version 6.0

JMP Genomics. Release Notes. Version 6.0 JMP Genomics Version 6.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive

More information

GCTA: a tool for Genome- wide Complex Trait Analysis

GCTA: a tool for Genome- wide Complex Trait Analysis GCTA: a tool for Genome- wide Complex Trait Analysis Version 1.04, 13 Sep 2012 Overview GCTA (Genome- wide Complex Trait Analysis) is designed to estimate the proportion of phenotypic variance explained

More information

GMDR User Manual Version 1.0

GMDR User Manual Version 1.0 GMDR User Manual Version 1.0 Oct 30, 2011 1 GMDR is a free, open-source interaction analysis tool, aimed to perform gene-gene interaction with generalized multifactor dimensionality methods. GMDR is being

More information

Package SimGbyE. July 20, 2009

Package SimGbyE. July 20, 2009 Package SimGbyE July 20, 2009 Type Package Title Simulated case/control or survival data sets with genetic and environmental interactions. Author Melanie Wilson Maintainer Melanie

More information

Importing and Merging Data Tutorial

Importing and Merging Data Tutorial Importing and Merging Data Tutorial Release 1.0 Golden Helix, Inc. February 17, 2012 Contents 1. Overview 2 2. Import Pedigree Data 4 3. Import Phenotypic Data 6 4. Import Genetic Data 8 5. Import and

More information

QUICKTEST user guide

QUICKTEST user guide QUICKTEST user guide Toby Johnson Zoltán Kutalik December 11, 2008 for quicktest version 0.94 Copyright c 2008 Toby Johnson and Zoltán Kutalik Permission is granted to copy, distribute and/or modify this

More information

Package EMLRT. August 7, 2014

Package EMLRT. August 7, 2014 Package EMLRT August 7, 2014 Type Package Title Association Studies with Imputed SNPs Using Expectation-Maximization-Likelihood-Ratio Test LazyData yes Version 1.0 Date 2014-08-01 Author Maintainer

More information

ELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2

ELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2 ELAI user manual Yongtao Guan Baylor College of Medicine Version 1.0 25 June 2015 Contents 1 Copyright 2 2 What ELAI Can Do 2 3 A simple example 2 4 Input file formats 3 4.1 Genotype file format....................................

More information

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual Miao-Xin Li, Jiang Li Department of Psychiatry Centre for Genomic Sciences Department

More information

iloci software is used to calculate the gene-gene interactions from GWAS data. This software was implemented by the OpenCL framework.

iloci software is used to calculate the gene-gene interactions from GWAS data. This software was implemented by the OpenCL framework. iloci software iloci software is used to calculate the gene-gene interactions from GWAS data. This software was implemented by the OpenCL framework. Software requirements : 1. Linux or Mac operating system

More information

Recalling Genotypes with BEAGLECALL Tutorial

Recalling Genotypes with BEAGLECALL Tutorial Recalling Genotypes with BEAGLECALL Tutorial Release 8.1.4 Golden Helix, Inc. June 24, 2014 Contents 1. Format and Confirm Data Quality 2 A. Exclude Non-Autosomal Markers......................................

More information

SKAT Package. Seunggeun (Shawn) Lee. July 21, 2017

SKAT Package. Seunggeun (Shawn) Lee. July 21, 2017 SKAT Package Seunggeun (Shawn) Lee July 21, 2017 1 Overview SKAT package has functions to 1) test for associations between SNP sets and continuous/binary phenotypes with adjusting for covariates and kinships

More information

PRSice: Polygenic Risk Score software v1.22

PRSice: Polygenic Risk Score software v1.22 PRSice: Polygenic Risk Score software v1.22 Jack Euesden jack.euesden@kcl.ac.uk Cathryn M. Lewis April 30, 2015 Paul F. O Reilly Contents 1 Overview 3 2 R packages required 3 3 Quickstart 3 3.1 Input Data...................................

More information

BioBin User Guide Current version: BioBin 2.3

BioBin User Guide Current version: BioBin 2.3 BioBin User Guide Current version: BioBin 2.3 Last modified: April 2017 Ritchie Lab Geisinger Health System URL: http://www.ritchielab.com/software/biobin-download Email: software@ritchielab.psu.edu 1

More information

MIRING: Minimum Information for Reporting Immunogenomic NGS Genotyping. Data Standards Hackathon for NGS HACKATHON 1.0 Bethesda, MD September

MIRING: Minimum Information for Reporting Immunogenomic NGS Genotyping. Data Standards Hackathon for NGS HACKATHON 1.0 Bethesda, MD September MIRING: Minimum Information for Reporting Immunogenomic NGS Genotyping Data Standards Hackathon for NGS HACKATHON 1.0 Bethesda, MD September 27 2014 Static Dynamic Static Minimum Information for Reporting

More information

Step-by-Step Guide to Relatedness and Association Mapping Contents

Step-by-Step Guide to Relatedness and Association Mapping Contents Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...

More information

Package FREGAT. April 21, 2017

Package FREGAT. April 21, 2017 Title Family REGional Association Tests Version 1.0.3 Package FREGAT April 21, 2017 Author Nadezhda M. Belonogova and Gulnara R. Svishcheva , with contributions from:

More information

BIMBAM user manual. Yongtao Guan and Matthew Stephens Baylor College of Medicine and University of Chicago. Version 1.0 Revised on 25 June 2015

BIMBAM user manual. Yongtao Guan and Matthew Stephens Baylor College of Medicine and University of Chicago. Version 1.0 Revised on 25 June 2015 BIMBAM user manual Yongtao Guan and Matthew Stephens Baylor College of Medicine and University of Chicago Version 1.0 Revised on 25 June 2015 Contents 1 Copyright 2 2 Introduction 3 2.1 The model..........................................

More information

MACAU User Manual. Xiang Zhou. March 15, 2017

MACAU User Manual. Xiang Zhou. March 15, 2017 MACAU User Manual Xiang Zhou March 15, 2017 Contents 1 Introduction 2 1.1 What is MACAU...................................... 2 1.2 How to Cite MACAU................................... 2 1.3 The Model.........................................

More information

Estimating Variance Components in MMAP

Estimating Variance Components in MMAP Last update: 6/1/2014 Estimating Variance Components in MMAP MMAP implements routines to estimate variance components within the mixed model. These estimates can be used for likelihood ratio tests to compare

More information

GCTA: a tool for Genome- wide Complex Trait Analysis

GCTA: a tool for Genome- wide Complex Trait Analysis GCTA: a tool for Genome- wide Complex Trait Analysis Version 1.24, 28 July 2014 Overview GCTA (Genome- wide Complex Trait Analysis) was originally designed to estimate the proportion of phenotypic variance

More information

Package GEM. R topics documented: January 31, Type Package

Package GEM. R topics documented: January 31, Type Package Type Package Package GEM January 31, 2018 Title GEM: fast association study for the interplay of Gene, Environment and Methylation Version 1.5.0 Date 2015-12-05 Author Hong Pan, Joanna D Holbrook, Neerja

More information

Package MOJOV. R topics documented: February 19, 2015

Package MOJOV. R topics documented: February 19, 2015 Type Package Title Mojo Variants: Rare Variants analysis Version 1.0.1 Date 2013-02-25 Author Maintainer Package MOJOV February 19, 2015 A package for analysis between rare variants

More information

Package coloc. February 24, 2018

Package coloc. February 24, 2018 Type Package Package coloc February 24, 2018 Imports ggplot2, snpstats, BMA, reshape, methods, flashclust, speedglm Suggests knitr, testthat Title Colocalisation Tests of Two Genetic Traits Version 3.1

More information

Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016

Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016 Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016 The Robust Variance Score (RVS) test is designed for association analysis for next generation

More information

Tutorial on gene-c ancestry es-ma-on: How to use LASER. Chaolong Wang Sequence Analysis Workshop June University of Michigan

Tutorial on gene-c ancestry es-ma-on: How to use LASER. Chaolong Wang Sequence Analysis Workshop June University of Michigan Tutorial on gene-c ancestry es-ma-on: How to use LASER Chaolong Wang Sequence Analysis Workshop June 2014 @ University of Michigan LASER: Loca-ng Ancestry from SEquence Reads Main func:ons of the so

More information

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011 GMDR User Manual GMDR software Beta 0.9 Updated March 2011 1 As an open source project, the source code of GMDR is published and made available to the public, enabling anyone to copy, modify and redistribute

More information

PRSice: Polygenic Risk Score software - Vignette

PRSice: Polygenic Risk Score software - Vignette PRSice: Polygenic Risk Score software - Vignette Jack Euesden, Paul O Reilly March 22, 2016 1 The Polygenic Risk Score process PRSice ( precise ) implements a pipeline that has become standard in Polygenic

More information

CircosVCF workshop, TAU, 9/11/2017

CircosVCF workshop, TAU, 9/11/2017 CircosVCF exercise In this exercise, we will create and design circos plots using CircosVCF. We will use vcf files of a published case "X-linked elliptocytosis with impaired growth is related to mutated

More information

Package MultiMeta. February 19, 2015

Package MultiMeta. February 19, 2015 Type Package Package MultiMeta February 19, 2015 Title Meta-analysis of Multivariate Genome Wide Association Studies Version 0.1 Date 2014-08-21 Author Dragana Vuckovic Maintainer Dragana Vuckovic

More information

Manual code: MSU_pigs.R

Manual code: MSU_pigs.R Manual code: MSU_pigs.R Authors: Jose Luis Gualdrón Duarte 1 and Juan Pedro Steibel,3 1 Departamento de Producción Animal, Facultad de Agronomía, UBA-CONICET, Buenos Aires, ARG Department of Animal Science,

More information

snpqc an R pipeline for quality control of Illumina SNP data

snpqc an R pipeline for quality control of Illumina SNP data snpqc an R pipeline for quality control of Illumina SNP data 1. In a nutshell snpqc is a series of R scripts to perform quality control analysis on Illumina SNP data. The objective of the program is to

More information

BOLT-LMM v2.0 User Manual

BOLT-LMM v2.0 User Manual BOLT-LMM v2.0 User Manual Po-Ru Loh March 13, 2015 Contents 1 Overview 2 1.1 BOLT-LMM mixed model association testing.................... 2 1.2 BOLT-REML variance components analysis.....................

More information

Axiom Analysis Suite Release Notes (For research use only. Not for use in diagnostic procedures.)

Axiom Analysis Suite Release Notes (For research use only. Not for use in diagnostic procedures.) Axiom Analysis Suite 4.0.1 Release Notes (For research use only. Not for use in diagnostic procedures.) Axiom Analysis Suite 4.0.1 includes the following changes/updates: 1. For library packages that support

More information

NAME QUICKTEST Quick association testing, for quantitative traits, allowing genotype uncertainty

NAME QUICKTEST Quick association testing, for quantitative traits, allowing genotype uncertainty NAME QUICKTEST Quick association testing, for quantitative traits, allowing genotype uncertainty SYNOPSIS quicktest pheno file geno file out file method mean method ML quicktest pheno file npheno name

More information

Package GWAF. March 12, 2015

Package GWAF. March 12, 2015 Type Package Package GWAF March 12, 2015 Title Genome-Wide Association/Interaction Analysis and Rare Variant Analysis with Family Data Version 2.2 Date 2015-03-12 Author Ming-Huei Chen

More information

Dealing with heterogeneity: group-specific variances and stratified analyses

Dealing with heterogeneity: group-specific variances and stratified analyses Dealing with heterogeneity: group-specific variances and stratified analyses Tamar Sofer July 2017 1 / 32 The HCHS/SOL population is quite heterogeneous 1. Due to admixture: Hispanics are admixed with

More information

Spotter Documentation Version 0.5, Released 4/12/2010

Spotter Documentation Version 0.5, Released 4/12/2010 Spotter Documentation Version 0.5, Released 4/12/2010 Purpose Spotter is a program for delineating an association signal from a genome wide association study using features such as recombination rates,

More information

Part 1: How to use IGV to visualize variants

Part 1: How to use IGV to visualize variants Using IGV to identify true somatic variants from the false variants http://www.broadinstitute.org/igv A FAQ, sample files and a user guide are available on IGV website If you use IGV in your publication:

More information

Package seqmeta. February 9, 2017

Package seqmeta. February 9, 2017 Type Package Package seqmeta February 9, 2017 Title Meta-Analysis of Region-Based Tests of Rare DNA Variants Version 1.6.7 Date 2016-09-24 Author Arie Voorman, Jennifer Brody, Han Chen, Thomas Lumley,

More information

BEAGLECALL 1.0. Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington. 15 November 2010

BEAGLECALL 1.0. Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington. 15 November 2010 BEAGLECALL 1.0 Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington 15 November 2010 BEAGLECALL 1.0 P a g e i Contents 1 Introduction... 1 1.1 Citing BEAGLECALL...

More information

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili: Introdution WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 What will we offer? Practical: Sorry, no practical sessions today, please refer to the summer school, organized

More information

1. Summary statistics test_gwas. This file contains a set of 50K random SNPs of the Subjective Well-being GWAS of the Netherlands Twin Register

1. Summary statistics test_gwas. This file contains a set of 50K random SNPs of the Subjective Well-being GWAS of the Netherlands Twin Register Quality Control for Genome-Wide Association Studies Bart Baselmans & Meike Bartels Boulder 2017 Setting up files and directories To perform a quality control protocol in a Genome-Wide Association Meta

More information

Agilent Genomic Workbench 7.0

Agilent Genomic Workbench 7.0 Agilent Genomic Workbench 7.0 Workflow User Guide For Research Use Only. Not for use in diagnostic procedures. Agilent Technologies Notices Agilent Technologies, Inc. 2012, 2015 No part of this manual

More information

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 CTL mapping in R Danny Arends, Pjotr Prins, and Ritsert C. Jansen University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 First written: Oct 2011 Last modified: Jan 2018 Abstract: Tutorial

More information

Data Walkthrough: Background

Data Walkthrough: Background Data Walkthrough: Background File Types FASTA Files FASTA files are text-based representations of genetic information. They can contain nucleotide or amino acid sequences. For this activity, students will

More information

A manual for the use of mirvas

A manual for the use of mirvas A manual for the use of mirvas Authors: Sophia Cammaerts, Mojca Strazisar, Jenne Dierckx, Jurgen Del Favero, Peter De Rijk Version: 1.0.2 Date: July 27, 2015 Contact: peter.derijk@gmail.com, mirvas.software@gmail.com

More information

Introduction to GDS. Stephanie Gogarten. July 18, 2018

Introduction to GDS. Stephanie Gogarten. July 18, 2018 Introduction to GDS Stephanie Gogarten July 18, 2018 Genomic Data Structure CoreArray (C++ library) designed for large-scale data management of genome-wide variants data format (GDS) to store multiple

More information

Helpful Galaxy screencasts are available at:

Helpful Galaxy screencasts are available at: This user guide serves as a simplified, graphic version of the CloudMap paper for applicationoriented end-users. For more details, please see the CloudMap paper. Video versions of these user guides and

More information

BOLT-LMM v2.3 User Manual

BOLT-LMM v2.3 User Manual BOLT-LMM v2.3 User Manual Po-Ru Loh August 1, 2017 Contents 1 Overview 2 1.1 BOLT-LMM mixed model association testing.................... 2 1.2 BOLT-REML variance components analysis.....................

More information

Package RobustSNP. January 1, 2011

Package RobustSNP. January 1, 2011 Package RobustSNP January 1, 2011 Type Package Title Robust SNP association tests under different genetic models, allowing for covariates Version 1.0 Depends mvtnorm,car,snpmatrix Date 2010-07-11 Author

More information

User s Guide Release 3.3

User s Guide Release 3.3 [1]Oracle Healthcare Translational Research User s Guide Release 3.3 E91297-01 October 2018 Oracle Healthcare Translational Research User's Guide, Release 3.3 E91297-01 Copyright 2012, 2018, Oracle and/or

More information

INTRODUCTION AUX FORMATS DE FICHIERS

INTRODUCTION AUX FORMATS DE FICHIERS INTRODUCTION AUX FORMATS DE FICHIERS Plan. Formats de séquences brutes.. Format fasta.2. Format fastq 2. Formats d alignements 2.. Format SAM 2.2. Format BAM 4. Format «Variant Calling» 4.. Format Varscan

More information

Analyzing Variant Call results using EuPathDB Galaxy, Part II

Analyzing Variant Call results using EuPathDB Galaxy, Part II Analyzing Variant Call results using EuPathDB Galaxy, Part II In this exercise, we will work in groups to examine the results from the SNP analysis workflow that we started yesterday. The first step is

More information

The fgwas Package. Version 1.0. Pennsylvannia State University

The fgwas Package. Version 1.0. Pennsylvannia State University The fgwas Package Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction The fgwas Package (Functional

More information

v0.2.0 XX:Z:UA - Unassigned XX:Z:G1 - Genome 1-specific XX:Z:G2 - Genome 2-specific XX:Z:CF - Conflicting

v0.2.0 XX:Z:UA - Unassigned XX:Z:G1 - Genome 1-specific XX:Z:G2 - Genome 2-specific XX:Z:CF - Conflicting October 08, 2015 v0.2.0 SNPsplit is an allele-specific alignment sorter which is designed to read alignment files in SAM/ BAM format and determine the allelic origin of reads that cover known SNP positions.

More information

Release Notes. JMP Genomics. Version 4.0

Release Notes. JMP Genomics. Version 4.0 JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive

More information

Data Currently Available (And How to Access It) Chance Hohensee Data Training September 9, 2016

Data Currently Available (And How to Access It) Chance Hohensee Data Training September 9, 2016 Data Currently Available (And How to Access It) Chance Hohensee Data Training September 9, 2016 Introduction The WHI dataset is large and complex There are different cohorts within the 161,808 WHI participants,

More information

Bioinformatics - Homework 1 Q&A style

Bioinformatics - Homework 1 Q&A style Bioinformatics - Homework 1 Q&A style Instructions: in this assignment you will test your understanding of basic GWAS concepts and GenABEL functions. The materials needed for the homework (two datasets

More information

General Help & Instructions to use with Examples

General Help & Instructions to use with Examples General Help & Instructions to use with Examples Contents Types of Searches and their Purposes... 2 Basic Search:... 2 Advance search option... 6 List Search:... 7 Details Page... 8 Results Grid functionalities:...

More information

Package LGRF. September 13, 2015

Package LGRF. September 13, 2015 Type Package Package LGRF September 13, 2015 Title Set-Based Tests for Genetic Association in Longitudinal Studies Version 1.0 Date 2015-08-20 Author Zihuai He Maintainer Zihuai He Functions

More information

GWAS Exercises 3 - GWAS with a Quantiative Trait

GWAS Exercises 3 - GWAS with a Quantiative Trait GWAS Exercises 3 - GWAS with a Quantiative Trait Peter Castaldi January 28, 2013 PLINK can also test for genetic associations with a quantitative trait (i.e. a continuous variable). In this exercise, we

More information

GSCAN GWAS Analysis Plan, v GSCAN GWAS ANALYSIS PLAN, Version 1.0 October 6, 2015

GSCAN GWAS Analysis Plan, v GSCAN GWAS ANALYSIS PLAN, Version 1.0 October 6, 2015 GSCAN GWAS Analysis Plan, v0.5 1 Overview GSCAN GWAS ANALYSIS PLAN, Version 1.0 October 6, 2015 There are three major components to this analysis plan. First, genome-wide genotypes must be on the correct

More information

Introduction to GEMINI

Introduction to GEMINI Introduction to GEMINI Aaron Quinlan University of Utah! quinlanlab.org Please refer to the following Github Gist to find each command for this session. Commands should be copy/pasted from this Gist https://gist.github.com/arq5x/9e1928638397ba45da2e#file-gemini-intro-sh

More information

Applications of admixture models

Applications of admixture models Applications of admixture models CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar, Alkes Price Applications of admixture models 1 / 27

More information

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc.

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Overview This script converts allelic dosage values to genotypes based on user-specified thresholds. The dosage data may be in

More information

PODKAT. An R Package for Association Testing Involving Rare and Private Variants. Ulrich Bodenhofer

PODKAT. An R Package for Association Testing Involving Rare and Private Variants. Ulrich Bodenhofer Software Manual Institute of Bioinformatics, Johannes Kepler University Linz PODKAT An R Package for Association Testing Involving Rare and Private Variants Ulrich Bodenhofer Institute of Bioinformatics,

More information

User Manual for GIGI v1.06.1

User Manual for GIGI v1.06.1 1 User Manual for GIGI v1.06.1 Author: Charles Y K Cheung [cykc@uw.edu] Ellen M Wijsman [wijsman@uw.edu] Department of Biostatistics University of Washington Last Modified on 1/31/2015 2 Contents Introduction...

More information

Package FunciSNP. November 16, 2018

Package FunciSNP. November 16, 2018 Type Package Package FunciSNP November 16, 2018 Title Integrating Functional Non-coding Datasets with Genetic Association Studies to Identify Candidate Regulatory SNPs Version 1.26.0 Date 2013-01-19 Author

More information

Maximizing Public Data Sources for Sequencing and GWAS

Maximizing Public Data Sources for Sequencing and GWAS Maximizing Public Data Sources for Sequencing and GWAS February 4, 2014 G Bryce Christensen Director of Services Questions during the presentation Use the Questions pane in your GoToWebinar window Agenda

More information

Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual. Updated June, 2017

Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual. Updated June, 2017 Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual Updated June, 2017 Table of Contents 1. Introduction... 1 2. Accessing FROG-kb Home Page and Features... 1 3. Home Page and

More information

TCGA Variant Call Format (VCF) 1.0 Specification

TCGA Variant Call Format (VCF) 1.0 Specification TCGA Variant Call Format (VCF) 1.0 Specification Document Information Specification for TCGA Variant Call Format (VCF) Version 1.0 1 About TCGA VCF specification 2 TCGA-specific customizations 3 File format

More information

arxiv: v2 [q-bio.qm] 17 Nov 2013

arxiv: v2 [q-bio.qm] 17 Nov 2013 arxiv:1308.2150v2 [q-bio.qm] 17 Nov 2013 GeneZip: A software package for storage-efficient processing of genotype data Palmer, Cameron 1 and Pe er, Itsik 1 1 Center for Computational Biology and Bioinformatics,

More information

User Manual. Ver. 3.0 March 19, 2012

User Manual. Ver. 3.0 March 19, 2012 User Manual Ver. 3.0 March 19, 2012 Table of Contents 1. Introduction... 2 1.1 Rationale... 2 1.2 Software Work-Flow... 3 1.3 New in GenomeGems 3.0... 4 2. Software Description... 5 2.1 Key Features...

More information

BOLT-LMM v2.3.2 User Manual

BOLT-LMM v2.3.2 User Manual BOLT-LMM v2.3.2 User Manual Po-Ru Loh March 10, 2018 Contents 1 Overview 2 1.1 BOLT-LMM mixed model association testing.................... 3 1.2 BOLT-REML variance components analysis.....................

More information

Data formats in GWASTools

Data formats in GWASTools Data formats in GWASTools Stephanie M. Gogarten January 18, 2018 The central classes of the GWASTools package are GenotypeData and IntensityData. They are designed to link all parts of a GWAS analysis

More information

Tutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017

Tutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017 Identification of Variants Using GATK November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com

More information

bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016

bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016 bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016 1 Introduction bimm is a software package to efficiently estimate variance parameters of a bivariate lineax mixed

More information

User s Guide for R Routines to Perform Reference Marker Normalization

User s Guide for R Routines to Perform Reference Marker Normalization User s Guide for R Routines to Perform Reference Marker Normalization Stan Pounds and Charles Mullighan St. Jude Children s Research Hospital Memphis, TN 38135 USA Version Date: January 29, 2008 Purpose

More information

Package ukbtools. February 5, 2018

Package ukbtools. February 5, 2018 Version 0.10.1 Title Manipulate and Explore UK Biobank Data Package ukbtools February 5, 2018 Maintainer Ken Hanscombe A set of tools to create a UK Biobank

More information