Step-by-Step Guide to Advanced Genetic Analysis
|
|
- Damian Fletcher
- 6 years ago
- Views:
Transcription
1 Step-by-Step Guide to Advanced Genetic Analysis Page 1
2 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options available in the software. These analytical processes are not necessarily more difficult to use; however, they are more specialized and may require a greater understanding of the underlying statistics than do the more commonly used processes. We do not review statistical theory in this document the literature citations available in the online User Guide do this quite well. The goal is rather to provide an explanation of how to use the processes described below and interpret their results. Objectives In this document we will cover the following processes: Recode Genotypes, which recodes allelic/genotypic values to a variety of formats. Relationship Matrix for calculating identity by state. PCA for Population Stratification, which uses the Eigenstrat method to describe and account for population structure in marker data. Multiple SNP-Trait Association, which uses a gene or other organizational level to analyze multiple SNPs. Pleiotropic Association for MANOVA analyses. Rare-Variant Analysis, which accommodates a number of rare variant analytic approaches. Recode Genotypes The Recode Genotypes process changes allele/genotype formats to numeric formats or numeric formats to the A/B-style format. In some instances, having data in the numeric format can speed processing of large data sets and is a required format for a number of processes in this training module. Note that genotypes/alleles can be recoded into the default Numeric Additive format as a part of the Marker Properties process, as described in the Basic Genetic Analysis document. If you need or desire data in other numeric formats for further analysis, we recommend that you recode the data using the Recode Genotypes AP. 1) Select Genomics > Genetics Utilities > Recode Genotypes from the Genomics Starter menu. 1 You should review the Step-by-Step Guide to Basic Genetic Analysis guide before working through the examples described here. Page 2
3 2) Choose the ord1_geno_data_sr.sas7bdat data set, generated as described in the Basic Genetic Analysis module, as the Input SAS Data Set. 3) Select the Marker Variables as done previously. In this example, type rs: seq: in the List-Style Specification of Marker Variables text box. 4) Choose the Output Folder. The completed General tab is shown below: 5) Select the Recode Tab. 6) Complete the Recode Tab as shown below: Page 3
4 Clicking on the? next to the Genotype Recoding section will bring up the recoding scheme for each option. We will be adding r_ to the start of each column name of the recoded genotype. A new, corresponding column will be added to the annotation file. We are optionally naming the output data set, rec_num_data. 7) Select the Annotation tab. 8) Choose the data set, ord1_geno_data_hwe_sr.sas7bdat for the Annotation SAS Data Set. 9) Select ID as the Annotation Label Variable and MajorAllele as the Annotation Major Allele Variable. Identifying the major allele variable, calculated earlier with Marker Properties, will speed up this process. Keep in mind that the selection of a major allele using Marker Properties will be data-set dependent. 10) Type rec_anno for the Output Annotation Data Set. 11) Click Run to start the process. Page 4
5 Two data sets, the rec_num_data.sas7bdat output data set and the rec_anno.sas7bdat annotation data set are generated. We us these data sets in the next process. Relationship Matrix The Relationship Matrix AP allows you to perform Identity by State (IBS), Identity by Descent (IBD) and Allele Sharing calculations. These data are from unrelated individuals, and so we will use the IBS option. 1) Select Genetics > Relatedness Measures > Relationship Matrix from the Genomics Starter menu. 2) Choose the rec_num_data.sas7bdat data set as the Input SAS Data Set. 3) Complete the General tab as shown below: 4) Select the Annotation tab. 5) Choose the rec_anno.sas7bdat data set as the Annotation SAS Data Set as shown below: Page 5
6 6) Select the Analysis tab. 7) Complete the Analysis tab as shown below: Select Identity by State as the Relationship Matrix to Compute. The Compute the Root of the Matrix option generates a population structure dataset for later use in Q-K mixed model association analysis. The Report Sample Pairs options will generate a table and corresponding graph from pairs of individuals with relatedness scores above this threshold. Principal Components Analysis will be performed on the relatedness matrix values. This can be useful for finding patterns or groups of related individuals. 8) Select the Options tab. 9) Check the Plot Relationship Matrix Heat Map check box. 10) Click Run to start the process. The results are shown below: Page 6
7 The heat map shows the hierarchical clustering relationship between all of the samples. It is a matrix of samples with the x- and y-axis comprised of the same samples. The dark line running diagonally from the upper left to the lower right indicates the intersection of identical samples where the IBS values are all equal to 1. Samples will cluster based on the similarity of the IBS or distance score. In this instance, there is a group of samples on the lower right that form a distinct cluster separated from the remainder of the samples. There is no strong relationship in these samples as the IBS value is relatively low. Clustering algorithms will always create clusters, but the significance of observed clusters is dependent on the strength of the relationship and knowledge about the samples included in the experiment. The IBS Pairs tab shows the distribution of the IBS values and the underlying table (found by selecting View Data under the IBS Pairs pull-down menu at the upper left of the dashboard) shows the sample pairs within the distribution. As this data set included a limited number of markers, the interpretation of these results is difficult. Page 7
8 In a GWAS dataset, IBS values >0.98 often indicate twins or repeated measurements from the same sample. 11) Select the PCA 2D Row Scores tab. Each point represents a sample, and the graphs at the intersections at right angles from each distribution plot represent the samples in those two principal components. There are no obvious clusters in this data and the distribution of points in the three principal components does not appear to be heavily biased or bimodal. PCA for Population Stratification In a genetic data set, there is often unknown structure in the population. PCA for Population Stratification (Eigenstrat, Price et al., 2006) attempts to derive and correct for population structure through the use of principal components analysis. 1) Select Genetics > GWAS Testing > PCA for Population Stratification from the Genomics Starter menu. 2) Choose the rec_num_data.sas7bdat data set as the Input SAS Data Set. 3) Select PCT_CHG_APOC3 as the Trait Variable. 4) Complete the General tab as shown below: Page 8
9 5) Select the Annotation tab. 6) Choose the rec_anno.sas7bdat data set as the Annotation SAS Data Set. 7) Select ID as the Annotation Label Variable. 8) Select the Options tab. 9) Complete the Options tab as shown below: Page 9
10 The PCA Data Set field is used when PCA analysis has been previously run on the data. This can also be useful when changing the number of principal components used in the process. The Maximum Number of Principal Components and Cumulative Proportion values are used to limit how many components will be used in the correction. When either of these values is reached, no more principal components are added to the calculation. Create Merged Output PCA Data Set is useful when you may want to use principal components as covariates in a separate analysis. Eigencorr Options can be set to use statistical tests to choose significant principal components to use for adjustment. 12) Click Run to start the process. The results are identical to those previously shown for the SNP-Trait Association AP, with the following exception: a new Action Button, Plot Trait by Genotype appears on the dashboard. Note: If numeric markers are run in the SNP-Trait Association, this will appear in those results. 13) Select a few points from the Volcano Plot then click the Plot Trait by Genotype action button. Each point represents a sample, the numeric genotype is on the x-axis and the continuous trait value is on the y-axis. Multiple SNP-Trait Association The Multiple SNP-trait Association process tests the association between a group of SNPs and a trait. You have great flexibility when specifying sets of SNPs for this process. Some common grouping choices are genes, LD blocks and pathways. The setup is identical to the SNP-Trait Association process with the exception that an additional field has been added to the Annotation tab to designate the grouping variable. Page 10
11 1) Select Genetics > Other Association Testing > Multiple SNP-Trait Association from the Genomics Starter menu. 2) Complete the General tab and the Annotation tab as done for PCA for Population Stratification. Select Gene_Symbol as the Annotation Analysis Group Variable on the Annotation tab. 3) Select the Options tab. 4) Complete the Options tab as shown below: The majority of these options are specific for the test run. Exclude Single-SNP Genes should always be selected. That will save time in the analysis. Note that this pertains to the grouping variable, and if the group specified is different than a gene identifier, it will exclude single-snp groups of the specified type (e.g., single SNP LD blocks or pathways). 14) Click Run to start the process. The output is identical to that of SNP-Trait, with the exception that each point in the Manhattan Plot represents a gene, and there is no Volcano Plot option. Pleiotropic Association Pleiotropic association performs a MANOVA (Multivariate Analysis of Variance) test between two or more continuous traits and genetic marker data. The purpose of this Page 11
12 process is to test if two or more traits are linked to the same region of the genome. For example, it may be in selective breeding that a desirable phenotype (increase in milk production) is correlated with an undesirable effect (susceptibility to infection) and you would like to test if these two correlated events are linked genetically. Bear in mind that the more continuous traits tested, the more difficult it will be, in general, to find significant regions of association. 1) Select Genetics > Other Association Tests > Pleiotropic Association from the Genomics starter menu. 2) Choose the rec_num_data.sas7bdat data set as the Input SAS Data Set. 3) Select PCT_CHG_APOC3 and BMI as the Trait Variables. 4) Complete the General tab as shown below: 5) Make no changes to the default settings on the Model Variables tab. Note that random effects are not allowed in this model. 6) Select the Annotation tab. 7) Choose the rec_anno.sas7bdat data set as the Annotation SAS Data Set. 8) Select ID as the Annotation Label Variable. 9) Complete the Options tab as shown below: Page 12
13 The MANOVA Statistic options perform significance tests with slightly different assumptions. The Roy Greatest Root is the least stringent test. 10) Click Run to start the process. The results are similar in format to the other genetics processes we have reviewed. The Manhattan Plot shows the MANOVA test for all traits for the genotype and trend test, and then the individually tested traits for the two association tests. There is a new Action Button, View Venn Diagram of Significant Markers 11) Click on the Genotype button for the Venn diagram. Page 13
14 Selecting a value with a mouse click (e.g., 10) will highlight those rows in the data table. 12) Select Tables > Subset from the JMP menu to view the markers associated with this value. Rare Variant Analyses Rare variant analyses are generally only used with next-gen sequencing variant call data, as SNP microarrays tend to be composed of markers pre-selected for a relatively high degree of heterozygosity. There are a number of methods available in JMP Genomics for rare-variant analyses, and all of them attempt in one way or another to take into account the gene (or other structure) in which the variant exists, rather than testing for association between each individual variant and the trait of interest. In this way, significant association to a phenotype can be attributed to a gene, even in a population with numerous different variant loci within that gene. We will examine only one type of analysis in this exercise, as the set-up is generally similar for all variations of the test. 1) Select Genetics > Other Association Testing > Rare Variant Tutorial from the Genomics starter menu. A list of the available tests pops up. Selecting any one of these will open the appropriate AP with the settings for that test. 2) Select the VT variant threshold method button. 3) Click on Rare Variant Association in the pop-up window. 4) Choose the rec_num_data.sas7bdat data set as the Input SAS Data Set. 5) Select RESP as the Trait Variable. 6) Complete the General tab as shown below: Page 14
15 7) Select the Annotation tab. 8) Choose the rec_anno.sas7bdat data set as the Annotation SAS Data Set. 9) Select ID as the Annotation Label Variable. 10) Select Gene_Symbol as the Annotation Analysis Group. 11) Fill out the Options tab as shown below: Page 15
16 A weight proportional to the inverse of the standard deviation of allele counts is being used in the calculation. If there were other weights available (e.g., poly-phen scores) they could have be specified in the Annotation tab. Single-SNP Genes are excluded from this analysis. 12) Click Run to start the process. The output is similar to the previous outputs, with the exception that each point on the Manhattan plot represents a gene s p-value as opposed to the individual SNP p-value. This completes the advanced genetics module. Most dialogs and output from other Genetics applications will be variations of those covered in the basic and advanced genetics modules. Page 16
Step-by-Step Guide to Relatedness and Association Mapping Contents
Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...
More informationStep-by-Step Guide to Basic Genetic Analysis
Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control
More informationGenetic Analysis. Page 1
Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced
More informationJMP Genomics. Release Notes. Version 6.0
JMP Genomics Version 6.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive
More informationRelease Notes. JMP Genomics. Version 4.0
JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive
More informationExpression Analysis with the Advanced RNA-Seq Plugin
Expression Analysis with the Advanced RNA-Seq Plugin May 24, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com
More informationMAGA: Meta-Analysis of Gene-level Associations
MAGA: Meta-Analysis of Gene-level Associations SYNOPSIS MAGA [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome DESCRIPTION
More informationImporting and Merging Data Tutorial
Importing and Merging Data Tutorial Release 1.0 Golden Helix, Inc. February 17, 2012 Contents 1. Overview 2 2. Import Pedigree Data 4 3. Import Phenotypic Data 6 4. Import Genetic Data 8 5. Import and
More informationBioinformatics - Homework 1 Q&A style
Bioinformatics - Homework 1 Q&A style Instructions: in this assignment you will test your understanding of basic GWAS concepts and GenABEL functions. The materials needed for the homework (two datasets
More informationKGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li
KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual Miao-Xin Li, Jiang Li Department of Psychiatry Centre for Genomic Sciences Department
More informationSEQGWAS: Integrative Analysis of SEQuencing and GWAS Data
SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data SYNOPSIS SEQGWAS [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome
More informationBICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017
BICF Nano Course: GWAS GWAS Workflow Development using PLINK Julia Kozlitina Julia.Kozlitina@UTSouthwestern.edu April 28, 2017 Getting started Open the Terminal (Search -> Applications -> Terminal), and
More informationFVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS
FVGWAS- 3.0 Manual Hongtu Zhu @ UNC BIAS Chao Huang @ UNC BIAS Nov 8, 2015 More and more large- scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical
More informationPackage MultiMeta. February 19, 2015
Type Package Package MultiMeta February 19, 2015 Title Meta-analysis of Multivariate Genome Wide Association Studies Version 0.1 Date 2014-08-21 Author Dragana Vuckovic Maintainer Dragana Vuckovic
More informationDeltaGen: Quick start manual
1 DeltaGen: Quick start manual Dr. Zulfi Jahufer & Dr. Dongwen Luo CONTENTS Page Main operations tab commands 2 Uploading a data file 3 Matching variable identifiers 4 Data check 5 Univariate analysis
More informationDifferential Expression Analysis at PATRIC
Differential Expression Analysis at PATRIC The following step- by- step workflow is intended to help users learn how to upload their differential gene expression data to their private workspace using Expression
More informationImproved Ancestry Estimation for both Genotyping and Sequencing Data using Projection Procrustes Analysis and Genotype Imputation
The American Journal of Human Genetics Supplemental Data Improved Ancestry Estimation for both Genotyping and Sequencing Data using Projection Procrustes Analysis and Genotype Imputation Chaolong Wang,
More informationQTX. Tutorial for. by Kim M.Chmielewicz Kenneth F. Manly. Software for genetic mapping of Mendelian markers and quantitative trait loci.
Tutorial for QTX by Kim M.Chmielewicz Kenneth F. Manly Software for genetic mapping of Mendelian markers and quantitative trait loci. Available in versions for Mac OS and Microsoft Windows. revised for
More informationCTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1
CTL mapping in R Danny Arends, Pjotr Prins, and Ritsert C. Jansen University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 First written: Oct 2011 Last modified: Jan 2018 Abstract: Tutorial
More informationSPSS INSTRUCTION CHAPTER 9
SPSS INSTRUCTION CHAPTER 9 Chapter 9 does no more than introduce the repeated-measures ANOVA, the MANOVA, and the ANCOVA, and discriminant analysis. But, you can likely envision how complicated it can
More informationSage: Symmetry and Asymmetry in Geometric Data Version 1.21 (compiled 03/1114)
Sage: Symmetry and Asymmetry in Geometric Data Version 1.21 (compiled 03/1114) Eladio Marquez 2012-2014 Mammals Division University of Michigan Museum of Zoology http://www-personal.umich.edu/~emarquez/morph/
More informationGenViewer Tutorial / Manual
GenViewer Tutorial / Manual Table of Contents Importing Data Files... 2 Configuration File... 2 Primary Data... 4 Primary Data Format:... 4 Connectivity Data... 5 Module Declaration File Format... 5 Module
More informationPackage lodgwas. R topics documented: November 30, Type Package
Type Package Package lodgwas November 30, 2015 Title Genome-Wide Association Analysis of a Biomarker Accounting for Limit of Detection Version 1.0-7 Date 2015-11-10 Author Ahmad Vaez, Ilja M. Nolte, Peter
More informationA short manual for LFMM (command-line version)
A short manual for LFMM (command-line version) Eric Frichot efrichot@gmail.com April 16, 2013 Please, print this reference manual only if it is necessary. This short manual aims to help users to run LFMM
More informationApplications of admixture models
Applications of admixture models CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar, Alkes Price Applications of admixture models 1 / 27
More informationBayesian analysis of genetic population structure using BAPS: Exercises
Bayesian analysis of genetic population structure using BAPS: Exercises p S u k S u p u,s S, Jukka Corander Department of Mathematics, Åbo Akademi University, Finland Exercise 1: Clustering of groups of
More informationPolymorphism and Variant Analysis Lab
Polymorphism and Variant Analysis Lab Arian Avalos PowerPoint by Casey Hanson Polymorphism and Variant Analysis Matt Hudson 2018 1 Exercise In this exercise, we will do the following:. 1. Gain familiarity
More informationEstimating Variance Components in MMAP
Last update: 6/1/2014 Estimating Variance Components in MMAP MMAP implements routines to estimate variance components within the mixed model. These estimates can be used for likelihood ratio tests to compare
More informationSOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie
SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing
More informationAxiom Analysis Suite Release Notes (For research use only. Not for use in diagnostic procedures.)
Axiom Analysis Suite 4.0.1 Release Notes (For research use only. Not for use in diagnostic procedures.) Axiom Analysis Suite 4.0.1 includes the following changes/updates: 1. For library packages that support
More informationREAP Software Documentation
REAP Software Documentation Version 1.2 Timothy Thornton 1 Department of Biostatistics 1 The University of Washington 1 REAP A C program for estimating kinship coefficients and IBD sharing probabilities
More informationClick on "+" button Select your VCF data files (see #Input Formats->1 above) Remove file from files list:
CircosVCF: CircosVCF is a web based visualization tool of genome-wide variant data described in VCF files using circos plots. The provided visualization capabilities, gives a broad overview of the genomic
More informationMAGMA manual (version 1.06)
MAGMA manual (version 1.06) TABLE OF CONTENTS OVERVIEW 3 QUICKSTART 4 ANNOTATION 6 OVERVIEW 6 RUNNING THE ANNOTATION 6 ADDING AN ANNOTATION WINDOW AROUND GENES 7 RESTRICTING THE ANNOTATION TO A SUBSET
More informationChapter 13 Multivariate Techniques. Chapter Table of Contents
Chapter 13 Multivariate Techniques Chapter Table of Contents Introduction...279 Principal Components Analysis...280 Canonical Correlation...289 References...298 278 Chapter 13. Multivariate Techniques
More informationJMP Clinical. Getting Started with. JMP Clinical. Version 3.1
JMP Clinical Version 3.1 Getting Started with JMP Clinical Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of
More informationStatistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte
Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,
More informationBreeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd
Breeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd January 2015 1. Introduction The Breeding View is a visual tool
More informationTutorial. RNA-Seq Analysis of Breast Cancer Data. Sample to Insight. November 21, 2017
RNA-Seq Analysis of Breast Cancer Data November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationQuality control of array genotyping data with argyle Andrew P Morgan
Quality control of array genotyping data with argyle Andrew P Morgan 2015-10-08 Introduction Proper quality control of array genotypes is an important prerequisite to further analysis. Genotype quality
More informationCircosVCF workshop, TAU, 9/11/2017
CircosVCF exercise In this exercise, we will create and design circos plots using CircosVCF. We will use vcf files of a published case "X-linked elliptocytosis with impaired growth is related to mutated
More informationCLC Server. End User USER MANUAL
CLC Server End User USER MANUAL Manual for CLC Server 10.0.1 Windows, macos and Linux March 8, 2018 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej 2 Prismet DK-8000 Aarhus C Denmark
More informationGMDR User Manual. GMDR software Beta 0.9. Updated March 2011
GMDR User Manual GMDR software Beta 0.9 Updated March 2011 1 As an open source project, the source code of GMDR is published and made available to the public, enabling anyone to copy, modify and redistribute
More informationClustering analysis of gene expression data
Clustering analysis of gene expression data Chapter 11 in Jonathan Pevsner, Bioinformatics and Functional Genomics, 3 rd edition (Chapter 9 in 2 nd edition) Human T cell expression data The matrix contains
More informationCreating and Using Genome Assemblies Tutorial
Creating and Using Genome Assemblies Tutorial Release 8.1 Golden Helix, Inc. March 18, 2014 Contents 1. Create a Genome Assembly for Danio rerio 2 2. Building Annotation Sources 5 A. Creating a Reference
More informationRicopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015
Ricopili: Introdution WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 What will we offer? Practical: Sorry, no practical sessions today, please refer to the summer school, organized
More informationPractical OmicsFusion
Practical OmicsFusion Introduction In this practical, we will analyse data, from an experiment which aim was to identify the most important metabolites that are related to potato flesh colour, from an
More informationTo finish the current project and start a new project. File Open a text data
GGEbiplot version 5 In addition to being the most complete, most powerful, and most user-friendly software package for biplot analysis, GGEbiplot also has powerful components for on-the-fly data manipulation,
More informationTutorial: Resequencing Analysis using Tracks
: Resequencing Analysis using Tracks September 20, 2013 CLC bio Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 Fax: +45 86 20 12 22 www.clcbio.com support@clcbio.com : Resequencing
More informationMACAU User Manual. Xiang Zhou. March 15, 2017
MACAU User Manual Xiang Zhou March 15, 2017 Contents 1 Introduction 2 1.1 What is MACAU...................................... 2 1.2 How to Cite MACAU................................... 2 1.3 The Model.........................................
More informationCompClustTk Manual & Tutorial
CompClustTk Manual & Tutorial Brandon King Copyright c California Institute of Technology Version 0.1.10 May 13, 2004 Contents 1 Introduction 1 1.1 Purpose.............................................
More informationPackage GWAF. March 12, 2015
Type Package Package GWAF March 12, 2015 Title Genome-Wide Association/Interaction Analysis and Rare Variant Analysis with Family Data Version 2.2 Date 2015-03-12 Author Ming-Huei Chen
More informationGWAS Exercises 3 - GWAS with a Quantiative Trait
GWAS Exercises 3 - GWAS with a Quantiative Trait Peter Castaldi January 28, 2013 PLINK can also test for genetic associations with a quantitative trait (i.e. a continuous variable). In this exercise, we
More informationWindows. RNA-Seq Tutorial
Windows RNA-Seq Tutorial 2017 Gene Codes Corporation Gene Codes Corporation 525 Avis Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074 (fax) www.genecodes.com
More informationMAGMA manual (version 1.05)
MAGMA manual (version 1.05) TABLE OF CONTENTS OVERVIEW 3 QUICKSTART 4 ANNOTATION 6 OVERVIEW 6 RUNNING THE ANNOTATION 6 ADDING AN ANNOTATION WINDOW AROUND GENES 7 RESTRICTING THE ANNOTATION TO A SUBSET
More informationMultivariate Capability Analysis
Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8
More informationLEA: An R Package for Landscape and Ecological Association Studies
LEA: An R Package for Landscape and Ecological Association Studies Eric Frichot and Olivier François Université Grenoble-Alpes, Centre National de la Recherche Scientifique, TIMC-IMAG UMR 5525, Grenoble,
More informationToCatchAThief c ryan campbell & jenn coughlan 7/23/2018
ToCatchAThief c ryan campbell & jenn coughlan 7/23/2018 Welcome to the To Catch a Thief: With Data! walkthrough! https://bioconductor.org/packages/devel/ bioc/vignettes/snprelate/inst/doc/snprelatetutorial.html
More informationClustering and Visualisation of Data
Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some
More informationSAS Enterprise Miner : Tutorials and Examples
SAS Enterprise Miner : Tutorials and Examples SAS Documentation February 13, 2018 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2017. SAS Enterprise Miner : Tutorials
More informationIntro to NGS Tutorial
Intro to NGS Tutorial Release 8.6.0 Golden Helix, Inc. October 31, 2016 Contents 1. Overview 2 2. Import Variants and Quality Fields 3 3. Quality Filters 10 Generate Alternate Read Ratio.........................................
More informationONLINE TUTORIAL T1: TEXT MINING PROJECT
Online Tutorial T1: Text Mining Project T1-1 ONLINE TUTORIAL T1: TEXT MINING PROJECT The sample data file 4Cars.sta, available at this Web site contains car reviews written by automobile owners. Car reviews
More informationComparative Sequencing
Tutorial for Windows and Macintosh Comparative Sequencing 2017 Gene Codes Corporation Gene Codes Corporation 525 Avis Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074
More informationThe fgwas software. Version 1.0. Pennsylvannia State University
The fgwas software Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction Genome-wide association studies
More informationWELCOME! Lecture 3 Thommy Perlinger
Quantitative Methods II WELCOME! Lecture 3 Thommy Perlinger Program Lecture 3 Cleaning and transforming data Graphical examination of the data Missing Values Graphical examination of the data It is important
More informationBreeding Guide. Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel
Breeding Guide Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel www.phenome-netwoks.com Contents PHENOME ONE - INTRODUCTION... 3 THE PHENOME ONE LAYOUT... 4 THE JOBS ICON...
More informationSpotter Documentation Version 0.5, Released 4/12/2010
Spotter Documentation Version 0.5, Released 4/12/2010 Purpose Spotter is a program for delineating an association signal from a genome wide association study using features such as recombination rates,
More informationCalculating a PCA and a MDS on a fingerprint data set
BioNumerics Tutorial: Calculating a PCA and a MDS on a fingerprint data set 1 Aim Principal Components Analysis (PCA) and Multi Dimensional Scaling (MDS) are two alternative grouping techniques that can
More informationIntroduction to Hail. Cotton Seed, Technical Lead Tim Poterba, Software Engineer Hail Team, Neale Lab Broad Institute and MGH
Introduction to Hail Cotton Seed, Technical Lead Tim Poterba, Software Engineer Hail Team, Neale Lab Broad Institute and MGH Why Hail? Genetic data is becoming absolutely massive Broad Genomics, by the
More informationTutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures
: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and February 24, 2014 Sample to Insight : RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and : RNA-Seq Analysis
More informationNCSS Statistical Software
Chapter 245 Introduction This procedure generates R control charts for variables. The format of the control charts is fully customizable. The data for the subgroups can be in a single column or in multiple
More informationPrincipal Components Analysis with Spatial Data
Principal Components Analysis with Spatial Data A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,
More informationPackage REGENT. R topics documented: August 19, 2015
Package REGENT August 19, 2015 Title Risk Estimation for Genetic and Environmental Traits Version 1.0.6 Date 2015-08-18 Author Daniel J.M. Crouch, Graham H.M. Goddard & Cathryn M. Lewis Maintainer Daniel
More informationFast, Easy, and Publication-Quality Ecological Analyses with PC-ORD
Emerging Technologies Fast, Easy, and Publication-Quality Ecological Analyses with PC-ORD JeriLynn E. Peck School of Forest Resources, Pennsylvania State University, University Park, Pennsylvania 16802
More informationSPSS TRAINING SPSS VIEWS
SPSS TRAINING SPSS VIEWS Dataset Data file Data View o Full data set, structured same as excel (variable = column name, row = record) Variable View o Provides details for each variable (column in Data
More informationHierarchical Clustering Tutorial
Hierarchical Clustering Tutorial Ignacio Gonzalez, Sophie Lamarre, Sarah Maman, Luc Jouneau CATI Bios4Biol - Statistical group March 2017 To know about clustering There are two main methods: Classification
More information7.4 Tutorial #4: Profiling LC Segments Using the CHAID Option
7.4 Tutorial #4: Profiling LC Segments Using the CHAID Option DemoData = gss82.sav After an LC model is estimated, it is often desirable to describe (profile) the resulting latent classes in terms of demographic
More informationGegenees genome format...7. Gegenees comparisons...8 Creating a fragmented all-all comparison...9 The alignment The analysis...
User Manual: Gegenees V 1.1.0 What is Gegenees?...1 Version system:...2 What's new...2 Installation:...2 Perspectives...4 The workspace...4 The local database...6 Populate the local database...7 Gegenees
More informationJMP Clinical. Release Notes. Version 5.0
JMP Clinical Version 5.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive
More informationSEEK User Manual. Introduction
SEEK User Manual Introduction SEEK is a computational gene co-expression search engine. It utilizes a vast human gene expression compendium to deliver fast, integrative, cross-platform co-expression analyses.
More informationLFMM version Reference Manual (Graphical User Interface version)
LFMM version 1.2 - Reference Manual (Graphical User Interface version) Eric Frichot 1, Sean Schoville 1, Guillaume Bouchard 2, Olivier François 1 * 1. Université Joseph Fourier Grenoble, Centre National
More informationRight-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table
Q Cheat Sheets What to do when you cannot figure out how to use Q What to do when the data looks wrong Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help
More informationIntroduction to GDS. Stephanie Gogarten. July 18, 2018
Introduction to GDS Stephanie Gogarten July 18, 2018 Genomic Data Structure CoreArray (C++ library) designed for large-scale data management of genome-wide variants data format (GDS) to store multiple
More informationDocumentation for BayesAss 1.3
Documentation for BayesAss 1.3 Program Description BayesAss is a program that estimates recent migration rates between populations using MCMC. It also estimates each individual s immigrant ancestry, the
More informationUser Manual ixora: Exact haplotype inferencing and trait association
User Manual ixora: Exact haplotype inferencing and trait association June 27, 2013 Contents 1 ixora: Exact haplotype inferencing and trait association 2 1.1 Introduction.............................. 2
More informationSTATISTICS (STAT) Statistics (STAT) 1
Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).
More informationGenotype x Environmental Analysis with R for Windows
Genotype x Environmental Analysis with R for Windows Biometrics and Statistics Unit Angela Pacheco CIMMYT,Int. 23-24 Junio 2015 About GEI In agricultural experimentation, a large number of genotypes are
More informationQTL Analysis with QGene Tutorial
QTL Analysis with QGene Tutorial Phillip McClean 1. Getting the software. The first step is to download and install the QGene software. It can be obtained from the following WWW site: http://qgene.org
More informationSupplementary Figure 1. Decoding results broken down for different ROIs
Supplementary Figure 1 Decoding results broken down for different ROIs Decoding results for areas V1, V2, V3, and V1 V3 combined. (a) Decoded and presented orientations are strongly correlated in areas
More informationDownload PLINK from
PLINK tutorial Amended from two tutorials that the PLINK author Shaun Purcell wrote, see http://pngu.mgh.harvard.edu/~purcell/plink/tutorial.shtml and 'Teaching materials and example dataset' at http://pngu.mgh.harvard.edu/~purcell/plink/res.shtml
More informationVersion 2.4 of Idiogrid
Version 2.4 of Idiogrid Structural and Visual Modifications 1. Tab delimited grids in Grid Data window. The most immediately obvious change to this newest version of Idiogrid will be the tab sheets that
More informationStatistical Models for Management. Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon. February 24 26, 2010
Statistical Models for Management Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon February 24 26, 2010 Graeme Hutcheson, University of Manchester Principal Component and Factor Analysis
More informationGemTools Documentation
Literature: GemTools Documentation Bert Klei and Brian P. Kent February 2011 This software is described in GemTools: a fast and efficient approach to estimating genetic ancestry (in preparation) Klei L,
More informationPRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP
PRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP HYUN MIN KANG UNIVERSITY OF MICHIGAN, ANN ARBOR EPACTS ASSOCIATION ANALYSIS
More informationMinitab 17 commands Prepared by Jeffrey S. Simonoff
Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save
More informationABI PRISM GeneMapper Software Version 3.0 SNP Genotyping
ABI PRISM GeneMapper Software Version 3.0 SNP Genotyping Tutorial ABI PRISM GeneMapper Software Version 3.0 SNP Genotyping Tutorial September 25, 2002 1:20 pm, 7x9_Title.fm Copyright 2002, Applied Biosystems.
More informationTutorial #1: Using Latent GOLD choice to Estimate Discrete Choice Models
Tutorial #1: Using Latent GOLD choice to Estimate Discrete Choice Models In this tutorial, we analyze data from a simple choice-based conjoint (CBC) experiment designed to estimate market shares (choice
More informationBayesian Multiple QTL Mapping
Bayesian Multiple QTL Mapping Samprit Banerjee, Brian S. Yandell, Nengjun Yi April 28, 2006 1 Overview Bayesian multiple mapping of QTL library R/bmqtl provides Bayesian analysis of multiple quantitative
More informationTutorial on gene-c ancestry es-ma-on: How to use LASER. Chaolong Wang Sequence Analysis Workshop June University of Michigan
Tutorial on gene-c ancestry es-ma-on: How to use LASER Chaolong Wang Sequence Analysis Workshop June 2014 @ University of Michigan LASER: Loca-ng Ancestry from SEquence Reads Main func:ons of the so
More informationPerforming whole genome SNP analysis with mapping performed locally
BioNumerics Tutorial: Performing whole genome SNP analysis with mapping performed locally 1 Introduction 1.1 An introduction to whole genome SNP analysis A Single Nucleotide Polymorphism (SNP) is a variation
More informationPrincipal Component Image Interpretation A Logical and Statistical Approach
Principal Component Image Interpretation A Logical and Statistical Approach Md Shahid Latif M.Tech Student, Department of Remote Sensing, Birla Institute of Technology, Mesra Ranchi, Jharkhand-835215 Abstract
More information