User Manual ixora: Exact haplotype inferencing and trait association
|
|
- Juniper York
- 5 years ago
- Views:
Transcription
1 User Manual ixora: Exact haplotype inferencing and trait association June 27, 2013
2 Contents 1 ixora: Exact haplotype inferencing and trait association Introduction Requirements and availability Input format without phenotypic information Ped file format ixora file format Input format with phenotypic information Input with completely missing parents Output format Phasing results in.txt files Error measures in.log file Statistics in.stats file ixora graphical user interface Executing haplotype inference and association analysis Specifying input and output files Options and parameters Visualization Haplotype frequency Expected haplotype frequency distribution Haplotype frequencies by phenotype P-values for haplotype-phenotype association Troubleshooting Command line executable for exact haplotype inference Parameters
3 Chapter 1 ixora: Exact haplotype inferencing and trait association 1.1 Introduction ixora is a framework for inferring haplotypes from genotyped population data, and for associating observed phenotypes with the inferred haplotypes, proposed in [1]. The framework is especially applicable in plant breeding where there exist large populations of individuals from the same parents. Given a set of genotypes for progeny of at most two heterozygous parents, ixora efficiently and accurately extracts all the equally-likely haplotypes of the progenies to produce an agglomerate structure. The structure can be conveniently visualized as mosaics of the ancestor haplotypes, and statistics can be computed on the distribution of crossovers in the progeny. Furthermore, the results of the phasing can then be applied as input to statistical tests and visualization methods to find genomic regions and the specific haplotypes in those regions associated with observed phenotypes. ixora has an option to phase the progeny even in the complete absence of genotype information for the parents. ixora performs the following two main steps: 1. Exact haplotype inference: this is done via a rigorous mathematical analysis that examines the space of all the best possible haplotype solutions, as described in detail in [1]. 2. Statistical analysis of association, per genotyped marker, between the haplotypes and phenotypes: ixora identifies genomic regions of interest when i) the phenotype is determined by the haplotypes inherited from both parents or ii) the phenotype is determined by the haplotypes inherited from just one parent. While performing the latter step, ixora computes the appropriate inputs to statistical tests for haplotype-phenotype association. The current implementa- 2
4 tion includes Fisher s exact test for this purpose. ixora outputs the resulting p-values for marker-phenotype association as a text file and visualizes the results. An added value of ixora is the built-in feature to perform randomization tests on the phenotypes, to establish significance thresholds on the p-values. ixora reports these limits and visualizes them along with the p-values on the real data. In addition, ixora outputs error measures on the phasing step, summarizing the ambiguity in the data regarding haplotype inference. 1.2 Requirements and availability The ixora stand-alone framework executed via a graphical user interface has the following dependencies (tested on the versions listed): R (version 2.14), Java (version 1.6), and JFreeChart (version ). R is required for the statistical Fishers test for phenotype association and is not required when performing haplotype inference only. JFreeChart is required for the visualization via the GUI. In addition, the haplotype inference algorithm is provided as a separate command line version in the form of a C++ executable. ixora dependencies include: jfreechart-<version>.jar - This file must be located in the correct Java directory (...\lib\ext\), for example: C:\ProgramFiles\Java\jre7\lib\ext \jfreechart jar. To obtain this file, download the latest jfreechart release from the file is located inside the lib folder jcommon-<version>.jar - This file must be located in the correct Java directory (...\lib\ext\), for example: C:\ProgramFiles\Java\jre7\lib\ext \jcommon jar. To obtain this file, download the latest jfreechart release from the file is located inside the lib folder. Rscript.exe - The user needs to specify the directory where R (including this file) is installed. The Rpath.txt file included in the ixora download will need to be modified to reflect this path, for example: C:\Program\ Files\R\R \bin\Rscript.exe. R can be downloaded from We have tested the system on Windows and Linux, ixora should work on any operating system as long as the dependencies are included. Currently, the ixora implementation is designed to work with biallelic markers on a population of diploid individuals derived from at most two parents. For the phenotype association analysis, the parents and progeny are required to have discrete phenotypes (such as resistant/susceptible, green/yellow/red, etc.). ixora executable files are available for download at The zipped folders contain six files: 3
5 ixora.jar - executable Java file for the entire ixora framework, including graphical user interface ixora.exe - executable C++ program for the exact haplotype inference algorithm Rpath.txt - one line text file specifying the path to where the user has R installed LICENCE.txt - copy of the licence agreement that the user agreed to when downloading the program README.txt - details on the dependencies and limitations of the current implementation ExampleData.txt - an example input file for ixora If ixora is used in published analysis, it should be cited as: Utro, F., Haiminen, N., Livingstone, D., Cornejo, O.E., Royaert, S., Schnell, R.J., Motamayor, J.C., Kuhn, D.N., Parida, L.: ixora: Exact Haplotype Inferencing and Trait Association. Submitted (2012). For further information visit or contact parida@us.ibm.com. 1.3 Input format without phenotypic information ixora takes as input two file formats: the commonly used.ped file format and a custom file format referred as ixora file format. For brevity, in what follows the main features of the.ped file format is presented, due to its large use in the scientific community. Each chromosome should be stored in a separate input file and ixora run independetly on each chromosome Ped file format If the.ped file format is used, the first two lines must correspond to the parental information. The input file should be formatted as follows: 1. The first file columns must contain a string for each of the following field: family, person, father, mother and sex; 2. The following columns must contains the genotype information separated by a space. 3. The first two lines contain the parental information. 4. the following n lines correspond to the n progenies. 4
6 1.3.2 ixora file format For n progenies at m loci, the input file should be formatted as follows, as shown in Table 1.1: 1. The first line contains the m markers name separated by space. 2. The second and third line contain the parental sequences. Each line contains m + 1 fields: the parent identifier followed by the m genotypes separated by space. A missing value is represented by dash. Even if all the parents values are missing, they still need to be included in the input as - -. A separate option for handling the case with completely missing parental genotypes is described in Section The following n lines correspond to the n progeny. Each line contains m + 1 fields: the progeny identifier followed by the m genotypes separated by space. A missing value is represented by dash. 4. There should be no line change at the end of the last line and no space between the two alleles per genotype (i.e. should be AC, not A C or A/C ) marker1 marker2 maker3 marker4 Parent1 CC TT AG GG Parent2 AC TG AG TG Prog1 CC - - GG TG Prog2 AC TG AG GG Prog3 AC TG AG GG Prog4 CC TT GG - - Prog5 CC TT AG TG Table 1.1: An example input file for five progeny sequences 1.4 Input format with phenotypic information When the user wishes to specify phenotypic information for the progeny, this should be included as the first column after the individual s name, as demonstrated in 1.2. The.ped file format is not currently supported in conjunction with phenotype information. The phenotype values must be discrete and start from 0, i.e. the phenotypes can be 0,1,2,... For missing phenotype data the value can be set to -1. The phenotype is then followed by the genotype information as described in the previous section. 5
7 marker1 marker2 maker3 marker4 Parent1-1 CC TT AG GG Parent2-1 AC TG AG TG Prog1 1 CC - - GG TG Prog2 0 AC TG AG GG Prog3-1 AC TG AG GG Prog4 1 CC TT GG - - Prog5 0 CC TT AG TG Table 1.2: An example input file for five progeny sequences including phenotypic information. 1.5 Input with completely missing parents When the parental genotype values are missing for all markers, they should be specified as - - in the input, and the special option missing parents selected in the user interface or in the command line. This will invoke ixora to run in a special mode to determine the parental genotype values. 1.6 Output format ixora outputs, by default, the phasing results in three files per parent, plus an additional log file describing the preciseness of the phasing solution Phasing results in.txt files The ixora executables generate two text files for each parent. They are named <input> <parent name>.txt and <input> <parent name> phased.txt, where <input> is the name of the input file, and <parent name> is the name specified as the name for the parent in the input file, for example Parent1 and Parent2 in Table 1.2. In both files, the first the two lines correspond to the parent s phased haplotypes, denoted <parent name> H1 and <parent name> H2. The following n lines correspond to the progeny. In the file <input> <parent name>.txt each of the n progeny lines satisfy the following format, an example is shown in Table 1.3: The first column contains the name of the progeny as indicated in the input file The second column is a string of m characters, separated by white space (tab). Each character could assume one of the following values: 1: denoting the allele comes from the first parental haplotype H1 2: denoting the allele comes from the second parental haplotype H2, q, Q: representing ambiguous values. They indicate multiple equally likely solutions, possibly with multiple crossovers 6
8 b: denoting that are potential sources of mistakes, such as markers with a high number of missing data, or imputation of parents values from the progeny. The user may choose to replace it with the numerical values in the columns to the left and right of such markers. E: denoting the individual has too many missing values to obtain a phasing result for it The files <input> <parent name> phased.txt follows the same standard but provides a possible haplotype assignment for the, q, Q and b characters, providing a simplified, less precise output format. Parent1 H1 A T T C T T T T T A Parent1 H2 A T T C C T T C C A Prog Prog Prog Prog Prog Table 1.3: An example phasing output file <input> <Parent1>.txt, showing the phasing results for the Parent1 and for five progeny. For added convenience, the file <input> <parent name> phased sequence.txt provides the haplotype sequences corresponding to the simplified phasing results in the file <input> <parent name> phased.txt Error measures in.log file An additional output of ixora is the <input>.log file. This file contains the error measures relating to the preciseness of the phasing solution, as disucssed in [1]. Delta, D, and E denote the distance from the lower bound regarding the number of crossovers, ambiguity in the solution, and errors in the data. If these values are large, it may not possible to obtain a reliable phasing on the input data Statistics in.stats file The phasing step in ixora outputs an optional <input>.stats file containing detailed analysis of the haplotype frequencies and association with phenotype. This file will be used as input to the various visualizations and statistical tests. The <input>.stats file contains the following elements, in the order that is as described below: 1. Expected haplotype count (c) and variance (delta), as disucssed in [1], per marker for each haplotype pair 7
9 2. Haplotype count (frequency) per marker for each parent 3. Expected haplotype count (c) and variance (delta) per marker for each haplotype pair, for individuals with phenotype 0 4. Haplotype count (frequency) per marker for each parent, for individuals with phenotype 0 5. Data 3. and 4. for each remaining phenotype 6. p-value from Fisher s exact test per marker for pair of haplotypes 7. p-value from Fisher s exact test on randomized data for pair of haplotypes 8. p-value from Fisher s exact test per marker for the 1st parent 9. p-value from Fisher s exact test on randomized data per marker for the 1st parent 10. p-value from Fisher s exact test per marker for the 2nd parent 11. p-value from Fisher s exact test on randomized data per marker for the 2nd parent 8
10 Chapter 2 ixora graphical user interface This chapter describes using the ixora program with graphical user interface ixora.jar for haplotype inference and statistical testing of haplotype-phenotype associations. ixora can be started by double-clicking on the ixora.jar executable. 2.1 Executing haplotype inference and association analysis This section describes executing the haplotype inference and association analysis. The haplotype analysis is started by pressing the RUN button after specifying the input file, output folder and desired options as specified below Specifying input and output files To begin an analysis, the user must specify an input file in the ixora (or.ped) format, in the by using the Browse button in the Select Input File section on the main dilog window. Similarly, Select Output Folder must be performed to specify the location of the resulting output files Options and parameters The following options are provided in the ixora main dialog window: If the input genotypes are derived from the selfing of a single parent, this can be indicated in the Self check box. When using this option, the user should specify two parents whose genotypes are identical, but their names different, for example: Parent1AsMother and Parent1AsFather. 9
11 If all the genotype values for the parents are missing, the Missing parents check box should be selected, and otherwise left unselected. When selected, ixora will run in a special mode to infer the parental genotypes. Note that in this case the labeling of the parents (Parent1 / Parent2) in the output is arbitrary, since both parents are completely missing in the input. If the input data file contains phenotype information, the Phenotype included check box should be selected, and otherwise left unselected. Number of randomizations denotes the iterations for estimating a significance threshold via permutation tests. The default value is 20 and minimum value is 1. Note that running many (i.e. several tens or hundreds) of randomizations can be time consuming 2.2 Visualization In this section we desribe the visualizations that ixora provides. The input to visualization is data contained in the phasing output statistics file. To run visualizations, the user must perform File Open from the main dialog menu to open the relevant.stats file. After performing this, the user can choose to perform the various visualizations given as options in the Visualize drop down menu in the main dialog window. The figures produced by ixora can be saved as.png figure files by rightclicking on them with the mouse and selecting Save as. It is possible to zoom in on the window contents by left clicking with the mouse on the left upper corner of the desird region and holding the mouse button down until the desired range is shaded. The range can also be manipulated by right-clicking with the mouse and selecting the desired actions. The visualization options are described in the following subsections Haplotype frequency Haplotype frequency will draw histograms of haplotype counts at each marker for each parent Expected haplotype frequency distribution Frequency distribution will draw the expected counts and frequencies of haplotype pairs at each marker. The shaded areas correspond to ambiguity in the phasing result Haplotype frequencies by phenotype Phenotypic charts will draw Haplotype frequency and Frequency distribution plots, after dividing the individuals into subsets based on their phenotype. 10
12 2.2.4 P-values for haplotype-phenotype association P-value will draw two plots, one for the combined effect of the parents, and one where each parent s effect on the phenotype is shown as a separate subplot. The p-value results from Fisher s exact test on haplotype phenotype association. The most significant values found in randomizations are also shown, as is the background significance level. 2.3 Troubleshooting Here is a list of some possible problems and solutions when running the ixora analysis via the user interface Problem: There is an error in the execution of ixora. Solution: The most likely reason is that the input file is formatted incorrectly, or the appropriate option regarding the presence of phenotype data in the input is not set. Please take a look at the examples in the sections regarding the input format. Problem: The ixora phasing analysis takes a long time to complete. Solution: Reduce the number of randomization tests for the statistical significance. The phasing should run in a short time for, e.g. hundreds of markers and individuals. Problem: There is no reaction when double-clicking on the ixora.jar file, or it takes a long time to react. Solution: Kill any lingering javaw processes associated with ixora (in Windows via the Task Manager). 11
13 Chapter 3 Command line executable for exact haplotype inference This chapter describes the use of the C++ executable ixora.exe for exact haploype inference. The inputs and outputs are defined as described in Chapter Parameters The binary file ixora can be executed with command line parameters: ixora <-i (or -p) input> [-self] [-missparents] [-phen] [-stats] [-r T] [-debug] [-o path] where: - i: assumes the input file in ixora file format; - p: assumes the input file in ped file format; - self: indicates that the two parents are the same; - missparents: indicates the parents genotypes are unknown; - phen: indicates that phenotype is included in the input; - stats: produces a stats file as described in the section Output format; - r T: indicates the number of randomizations for the p-value threshold computation, by default T=20; - debug: provides in the standard output all the information computed by ixora step by step; - o: define the path where the output is stored in the main memory; 12
14 The parameter denoted with the <> bracket must be provided. If any of the remaining parameters are not set, it is assigned the default value. For example, to execute ixora using the input file ExampleData.txt (without phenotype information) or ExampleDataPheno.txt (with phenotype information), possible command lines include: ixora -i ExampleData.txt -stats ixora -i ExampleDataPheno.txt -phen -stats -r 50 13
15 Bibliography [1] F. Utro, N. Haiminen, D. Livingstone, O.E. Cornejo, S. Royaert, R.J. Schnell, J.C. Motamayor, D.N. Kuhn, and L. Parida. ixora: Exact Haplotype Inferencing and Trait Association. BMC Genetics 14:48,
Genetic Analysis. Page 1
Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced
More informationStep-by-Step Guide to Basic Genetic Analysis
Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control
More informationA comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals
A comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals GUI Release v1.0.2: User Manual January 2009 If you find this software useful, please
More informationSOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie
SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing
More informationStatistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual
Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual Department of Epidemiology and Biostatistics Wolstein Research Building 2103 Cornell Rd Case Western
More informationPolymorphism and Variant Analysis Lab
Polymorphism and Variant Analysis Lab Arian Avalos PowerPoint by Casey Hanson Polymorphism and Variant Analysis Matt Hudson 2018 1 Exercise In this exercise, we will do the following:. 1. Gain familiarity
More informationUSER S MANUAL FOR THE AMaCAID PROGRAM
USER S MANUAL FOR THE AMaCAID PROGRAM TABLE OF CONTENTS Introduction How to download and install R Folder Data The three AMaCAID models - Model 1 - Model 2 - Model 3 - Processing times Changing directory
More informationUser s Guide. Version 2.2. Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario
User s Guide Version 2.2 Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario Mehdi Sargolzaei, Jacques Chesnais and Flavio Schenkel Jan 2014 Disclaimer
More informationGenetic type 1 Error Calculator (GEC)
Genetic type 1 Error Calculator (GEC) (Version 0.2) User Manual Miao-Xin Li Department of Psychiatry and State Key Laboratory for Cognitive and Brain Sciences; the Centre for Reproduction, Development
More informationStep-by-Step Guide to Advanced Genetic Analysis
Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options
More informationPopulation Genetics (52642)
Population Genetics (52642) Benny Yakir 1 Introduction In this course we will examine several topics that are related to population genetics. In each topic we will discuss briefly the biological background
More informationRecalling Genotypes with BEAGLECALL Tutorial
Recalling Genotypes with BEAGLECALL Tutorial Release 8.1.4 Golden Helix, Inc. June 24, 2014 Contents 1. Format and Confirm Data Quality 2 A. Exclude Non-Autosomal Markers......................................
More informationStep-by-Step Guide to Relatedness and Association Mapping Contents
Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...
More informationPediHaplotyper Manual
PediHaplotyper Manual Roeland Voorrips, Wageningen UR Plant Breeding, 2015 Introduction PediHaplotyper is software for assigning haploblock alleles to individuals in a pedigree, based on observed marker
More informationBICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017
BICF Nano Course: GWAS GWAS Workflow Development using PLINK Julia Kozlitina Julia.Kozlitina@UTSouthwestern.edu April 28, 2017 Getting started Open the Terminal (Search -> Applications -> Terminal), and
More informationQTX. Tutorial for. by Kim M.Chmielewicz Kenneth F. Manly. Software for genetic mapping of Mendelian markers and quantitative trait loci.
Tutorial for QTX by Kim M.Chmielewicz Kenneth F. Manly Software for genetic mapping of Mendelian markers and quantitative trait loci. Available in versions for Mac OS and Microsoft Windows. revised for
More informationREAP Software Documentation
REAP Software Documentation Version 1.2 Timothy Thornton 1 Department of Biostatistics 1 The University of Washington 1 REAP A C program for estimating kinship coefficients and IBD sharing probabilities
More informationImporting and Merging Data Tutorial
Importing and Merging Data Tutorial Release 1.0 Golden Helix, Inc. February 17, 2012 Contents 1. Overview 2 2. Import Pedigree Data 4 3. Import Phenotypic Data 6 4. Import Genetic Data 8 5. Import and
More informationELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2
ELAI user manual Yongtao Guan Baylor College of Medicine Version 1.0 25 June 2015 Contents 1 Copyright 2 2 What ELAI Can Do 2 3 A simple example 2 4 Input file formats 3 4.1 Genotype file format....................................
More informationFVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS
FVGWAS- 3.0 Manual Hongtu Zhu @ UNC BIAS Chao Huang @ UNC BIAS Nov 8, 2015 More and more large- scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical
More informationE. coli functional genotyping: predicting phenotypic traits from whole genome sequences
BioNumerics Tutorial: E. coli functional genotyping: predicting phenotypic traits from whole genome sequences 1 Aim In this tutorial we will screen genome sequences of Escherichia coli samples for phenotypic
More informationHaplotype Analysis. 02 November 2003 Mendel Short IGES Slide 1
Haplotype Analysis Specifies the genetic information descending through a pedigree Useful visualization of the gene flow through a pedigree A haplotype for a given individual and set of loci is defined
More informationPractical OmicsFusion
Practical OmicsFusion Introduction In this practical, we will analyse data, from an experiment which aim was to identify the most important metabolites that are related to potato flesh colour, from an
More informationData input vignette Reading genotype data in snpstats
Data input vignette Reading genotype data in snpstats David Clayton November 9, 2017 Memory limitations Before we start it is important to emphasise that the SnpMatrix objects that hold genotype data in
More informationHaploHMM - A Hidden Markov Model (HMM) Based Program for Haplotype Inference Using Identified Haplotypes and Haplotype Patterns
HaploHMM - A Hidden Markov Model (HMM) Based Program for Haplotype Inference Using Identified Haplotypes and Haplotype Patterns Jihua Wu, Guo-Bo Chen, Degui Zhi, NianjunLiu, Kui Zhang 1. HaploHMM HaploHMM
More informationBEAGLECALL 1.0. Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington. 15 November 2010
BEAGLECALL 1.0 Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington 15 November 2010 BEAGLECALL 1.0 P a g e i Contents 1 Introduction... 1 1.1 Citing BEAGLECALL...
More informationThe Imprinting Model
The Imprinting Model Version 1.0 Zhong Wang 1 and Chenguang Wang 2 1 Department of Public Health Science, Pennsylvania State University 2 Office of Surveillance and Biometrics, Center for Devices and Radiological
More informationEmile R. Chimusa Division of Human Genetics Department of Pathology University of Cape Town
Advanced Genomic data manipulation and Quality Control with plink Emile R. Chimusa (emile.chimusa@uct.ac.za) Division of Human Genetics Department of Pathology University of Cape Town Outlines: 1.Introduction
More informationTribble Genotypes and Phenotypes
Name(s): Period: Tribble Genetics Instructions Step 1. Determine the genotype and phenotype of your F 1 tribbles based on inheritance of traits from purebred dominant and recessive parent tribbles. One
More informationRandom Forest in Genomic Selection
Random Forest in genomic selection 1 Dpto Mejora Genética Animal, INIA, Madrid; Universidad Politécnica de Valencia, 20-24 September, 2010. Outline 1 Remind 2 Random Forest Introduction Classification
More informationQTL Analysis with QGene Tutorial
QTL Analysis with QGene Tutorial Phillip McClean 1. Getting the software. The first step is to download and install the QGene software. It can be obtained from the following WWW site: http://qgene.org
More informationDevelopment of linkage map using Mapmaker/Exp3.0
Development of linkage map using Mapmaker/Exp3.0 Balram Marathi 1, A. K. Singh 2, Rajender Parsad 3 and V.K. Gupta 3 1 Institute of Biotechnology, Acharya N. G. Ranga Agricultural University, Rajendranagar,
More informationBreeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd
Breeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd January 2015 1. Introduction The Breeding View is a visual tool
More informationJMP Genomics. Release Notes. Version 6.0
JMP Genomics Version 6.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive
More informationOrder Preserving Triclustering Algorithm. (Version1.0)
Order Preserving Triclustering Algorithm User Manual (Version1.0) Alain B. Tchagang alain.tchagang@nrc-cnrc.gc.ca Ziying Liu ziying.liu@nrc-cnrc.gc.ca Sieu Phan sieu.phan@nrc-cnrc.gc.ca Fazel Famili fazel.famili@nrc-cnrc.gc.ca
More informationThe fgwas software. Version 1.0. Pennsylvannia State University
The fgwas software Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction Genome-wide association studies
More informationThe Analysis of RAD-tag Data for Association Studies
EDEN Exchange Participant Name: Layla Freeborn Host Lab: The Kronforst Lab, The University of Chicago Dates of visit: February 15, 2013 - April 15, 2013 Title of Protocol: Rationale and Background: to
More informationCFinder The Community / Cluster Finding Program. Users' Guide
CFinder The Community / Cluster Finding Program Users' Guide Copyright (C) Department of Biological Physics, Eötvös University, Budapest, 2005 Contents 1. General information and license...3 2. Quick start...4
More informationMQLS-XM Software Documentation
MQLS-XM Software Documentation Version 1.0 Timothy Thornton 1 and Mary Sara McPeek 2,3 Department of Biostatistics 1 The University of Washington Departments of Statistics 2 and Human Genetics 3 The University
More informationCreating a custom mappings similarity matrix
BioNumerics Tutorial: Creating a custom mappings similarity matrix 1 Aim In BioNumerics, character values can be mapped to categorical names according to predefined criteria (see tutorial Importing non-numerical
More informationGMDR User Manual Version 1.0
GMDR User Manual Version 1.0 Oct 30, 2011 1 GMDR is a free, open-source interaction analysis tool, aimed to perform gene-gene interaction with generalized multifactor dimensionality methods. GMDR is being
More informationPackage DSPRqtl. R topics documented: June 7, Maintainer Elizabeth King License GPL-2. Title Analysis of DSPR phenotypes
Maintainer Elizabeth King License GPL-2 Title Analysis of DSPR phenotypes LazyData yes Type Package LazyLoad yes Version 2.0-1 Author Elizabeth King Package DSPRqtl June 7, 2013 Package
More informationUser Guide. v Released June Advaita Corporation 2016
User Guide v. 0.9 Released June 2016 Copyright Advaita Corporation 2016 Page 2 Table of Contents Table of Contents... 2 Background and Introduction... 4 Variant Calling Pipeline... 4 Annotation Information
More informationLinkage analysis with paramlink Session I: Introduction and pedigree drawing
Linkage analysis with paramlink Session I: Introduction and pedigree drawing In this session we will introduce R, and in particular the package paramlink. This package provides a complete environment for
More informationPackage allehap. August 19, 2017
Package allehap August 19, 2017 Type Package Title Allele Imputation and Haplotype Reconstruction from Pedigree Databases Version 0.9.9 Date 2017-08-19 Author Nathan Medina-Rodriguez and Angelo Santana
More informationIntroduction to Excel Workshop
Introduction to Excel Workshop Empirical Reasoning Center June 6, 2016 1 Important Terminology 1. Rows are identified by numbers. 2. Columns are identified by letters. 3. Cells are identified by the row-column
More informationKGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li
KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual Miao-Xin Li, Jiang Li Department of Psychiatry Centre for Genomic Sciences Department
More informationFORMAT PED PHENO Software Documentation
FORMAT PED PHENO Software Documentation Version 1.0 Timothy Thornton 1 and Mary Sara McPeek 2,3 Department of Biostatistics 1 University of Washington Departments of Statistics 2 and Human Genetics 3 The
More informationEvolutionary form design: the application of genetic algorithmic techniques to computer-aided product design
Loughborough University Institutional Repository Evolutionary form design: the application of genetic algorithmic techniques to computer-aided product design This item was submitted to Loughborough University's
More informationCHAPTER 1: INTRODUCTION...
Linkage Analysis Package User s Guide to Analysis Programs Version 5.10 for IBM PC/compatibles 10 Oct 1996, updated 2 November 2013 Table of Contents CHAPTER 1: INTRODUCTION... 1 1.0 OVERVIEW... 1 1.1
More informationFamily Based Association Tests Using the fbat package
Family Based Association Tests Using the fbat package Weiliang Qiu email: stwxq@channing.harvard.edu Ross Lazarus email: ross.lazarus@channing.harvard.edu Gregory Warnes email: warnes@bst.rochester.edu
More informationMAGA: Meta-Analysis of Gene-level Associations
MAGA: Meta-Analysis of Gene-level Associations SYNOPSIS MAGA [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome DESCRIPTION
More informationGenetic Algorithm (GA)
Genetic Algorithm (GA) A QSAR model development tool NANOBRIDGES -A Collaborative Project The authors are grateful for the financial support from the European Commission through the Marie Curie IRSES program,
More informationPackage lodgwas. R topics documented: November 30, Type Package
Type Package Package lodgwas November 30, 2015 Title Genome-Wide Association Analysis of a Biomarker Accounting for Limit of Detection Version 1.0-7 Date 2015-11-10 Author Ahmad Vaez, Ilja M. Nolte, Peter
More informationPLNT4610 BIOINFORMATICS FINAL EXAMINATION
PLNT4610 BIOINFORMATICS FINAL EXAMINATION 18:00 to 20:00 Thursday December 13, 2012 Answer any combination of questions totalling to exactly 100 points. The questions on the exam sheet total to 120 points.
More informationQuality control of array genotyping data with argyle Andrew P Morgan
Quality control of array genotyping data with argyle Andrew P Morgan 2015-10-08 Introduction Proper quality control of array genotypes is an important prerequisite to further analysis. Genotype quality
More informationBRAT Manual V1.1. May 6th, Christian Göschl & Wolfgang Busch. for correspondence:
BRAT Manual V1.1 May 6th, 2014 Christian Göschl & Wolfgang Busch Email for correspondence: wolfgang.busch@gmi.oeaw.ac.at Gregor Mendel Institute Dr. Bohr-Gasse 3 1030 Vienna Austria 1 2 Brat Segmentation
More informationNotes on QTL Cartographer
Notes on QTL Cartographer Introduction QTL Cartographer is a suite of programs for mapping quantitative trait loci (QTLs) onto a genetic linkage map. The programs use linear regression, interval mapping
More informationClaNC: The Manual (v1.1)
ClaNC: The Manual (v1.1) Alan R. Dabney June 23, 2008 Contents 1 Installation 3 1.1 The R programming language............................... 3 1.2 X11 with Mac OS X....................................
More informationhaplo.score Score Tests for Association of Traits with Haplotypes when Linkage Phase is Ambiguous
haploscore Score Tests for Association of Traits with Haplotypes when Linkage Phase is Ambiguous Charles M Rowland, David E Tines, and Daniel J Schaid Mayo Clinic Rochester, MN E-mail contact: rowland@mayoedu
More informationThe fgwas Package. Version 1.0. Pennsylvannia State University
The fgwas Package Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction The fgwas Package (Functional
More informationTable of Contents. 2. Files Input File Formats Output Files Export Options Auxiliary Input Files
GEVALT Documentation Table of Contents 1. Using GEVALT Loading a Dataset Saving and Loading Status Data Quality Checks LD Display Blocks and Haplotypes Phased Genotypes Individual Statistics Stampa Tagger
More informationHPC Course Session 3 Running Applications
HPC Course Session 3 Running Applications Checkpointing long jobs on Iceberg 1.1 Checkpointing long jobs to safeguard intermediate results For long running jobs we recommend using checkpointing this allows
More informationMendel and His Peas Investigating Monhybrid Crosses Using the Graphing Calculator
20 Investigating Monhybrid Crosses Using the Graphing Calculator This activity will use the graphing calculator s random number generator to simulate the production of gametes in a monohybrid cross. The
More informationGenetic Algorithms. Kang Zheng Karl Schober
Genetic Algorithms Kang Zheng Karl Schober Genetic algorithm What is Genetic algorithm? A genetic algorithm (or GA) is a search technique used in computing to find true or approximate solutions to optimization
More informationA practical example of tomato QTL mapping using a RIL population. R R/QTL
A practical example of tomato QTL mapping using a RIL population R http://www.r-project.org/ R/QTL http://www.rqtl.org/ Dr. Hamid Ashrafi UC Davis Seed Biotechnology Center Breeding with Molecular Markers
More informationEFFICIENT HAPLOTYPE INFERENCE FROM PEDIGREES WITH MISSING DATA USING LINEAR SYSTEMS WITH DISJOINT-SET DATA STRUCTURES
1 EFFICIENT HAPLOTYPE INFERENCE FROM PEDIGREES WITH MISSING DATA USING LINEAR SYSTEMS WITH DISJOINT-SET DATA STRUCTURES Xin Li and Jing Li Department of Electrical Engineering and Computer Science, Case
More informationGMDR User Manual. GMDR software Beta 0.9. Updated March 2011
GMDR User Manual GMDR software Beta 0.9 Updated March 2011 1 As an open source project, the source code of GMDR is published and made available to the public, enabling anyone to copy, modify and redistribute
More informationWorkflow Guide Slide(s) Topic 2-6 Importing Data and Labeling Samples 7-11 Processing Data Without an Allelic Ladder Processing Data With an
Workflow Guide Slide(s) Topic 2-6 Importing Data and Labeling Samples 7-11 Processing Data Without an Allelic Ladder 12-23 Processing Data With an Allelic Ladder 24-30 Reviewing Size and Allele Calls 31-37
More informationImportant notice regarding accounts used for installation and configuration
System Requirements Operating System Nintex Reporting 2008 can be installed on Microsoft Windows Server 2003 or 2008 (32 and 64 bit supported for both OS versions). Browser Client Microsoft Internet Explorer
More informationAlamut Focus 0.9 User Guide
0.9 User Guide Alamut Focus 0.9 User Guide 1 June 2015 Alamut Focus 0.9 User Guide This document and its contents are proprietary to Interactive Biosoftware. They are intended solely for the contractual
More informationLDsplit: A Java Program for Association Studies of Meiotic Recombination Hotspots Using SNP Data
LDsplit: A Java Program for Association Studies of Meiotic Recombination Hotspots Using SNP Data Peng Yang, Jing Guo and Jie Zheng* Bioinformatics Research Centre (BIRC) School of Computer Engineering
More informationEstimation of haplotypes
Estimation of haplotypes Cavan Reilly October 4, 2013 Table of contents Estimating haplotypes with the EM algorithm Individual level haplotypes Testing for differences in haplotype frequency Using the
More informationChapter 3 -- PedigreeMaster screens. Pedigree Display
Chapter 3 -- PedigreeMaster screens Before entering data and looking for specific items in the software, it is advisable to go through all of the various screens to familiarize yourself with all of the
More informationStat Server Deployment Guide. Java Sections
Stat Server Deployment Guide Java Sections 2/8/2018 Contents 1 Java Sections 1.1 How to Configure a Particular Java Extension 1.2 How to Configure a Particular Java Extension Stat Type 1.3 How to Configure
More informationSEQGWAS: Integrative Analysis of SEQuencing and GWAS Data
SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data SYNOPSIS SEQGWAS [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome
More informationBayesian Multiple QTL Mapping
Bayesian Multiple QTL Mapping Samprit Banerjee, Brian S. Yandell, Nengjun Yi April 28, 2006 1 Overview Bayesian multiple mapping of QTL library R/bmqtl provides Bayesian analysis of multiple quantitative
More informationConvert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc.
Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Overview This script converts allelic dosage values to genotypes based on user-specified thresholds. The dosage data may be in
More informationHigh dimensional data analysis
High dimensional data analysis Cavan Reilly October 24, 2018 Table of contents Data mining Random forests Missing data Logic regression Multivariate adaptive regression splines Data mining Data mining
More informationMultivariate probability distributions
Multivariate probability distributions September, 07 STAT 0 Class Slide Outline of Topics Background Discrete bivariate distribution 3 Continuous bivariate distribution STAT 0 Class Slide Multivariate
More informationGenomeMixer
GenomeMixer 0.54 http://www.nervenet.org/genome_mixer/ rwilliam@nb.utmem.edu 1. What is GenomeMixer? GenomeMixer is a cross-platform program that allows you to simulate complex multigenerational breeding
More informationbimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016
bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016 1 Introduction bimm is a software package to efficiently estimate variance parameters of a bivariate lineax mixed
More informationJGA User Guide. WinNT/2000 Deployment v
JGA User Guide WinNT/2000 Deployment v.2005.06.20 Andrés Medaglia (amedagli@uniandes.edu.co) Centro para la Optimización y Probabilidad Aplicada Departamento de Ingeniería Industrial Universidad de los
More informationThe Lander-Green Algorithm in Practice. Biostatistics 666
The Lander-Green Algorithm in Practice Biostatistics 666 Last Lecture: Lander-Green Algorithm More general definition for I, the "IBD vector" Probability of genotypes given IBD vector Transition probabilities
More informationSTAT STATISTICAL METHODS. Statistics: The science of using data to make decisions and draw conclusions
STAT 515 --- STATISTICAL METHODS Statistics: The science of using data to make decisions and draw conclusions Two branches: Descriptive Statistics: The collection and presentation (through graphical and
More informationER/Studio Enterprise Portal User Guide
ER/Studio Enterprise Portal 1.0.3 User Guide Copyright 1994-2009 Embarcadero Technologies, Inc. Embarcadero Technologies, Inc. 100 California Street, 12th Floor San Francisco, CA 94111 U.S.A. All rights
More informationDocumentation for OptiMAS: a decision support tool for marker-assisted assembly of diverse alleles
Documentation for OptiMAS: a decision support tool for marker-assisted assembly of diverse alleles Version 1.5 F. Valente, F. Gauthier, N. Bardol, G. Blanc, J. Joets, A. Charcosset & L. Moreau Code by
More informationAxiom Analysis Suite Release Notes (For research use only. Not for use in diagnostic procedures.)
Axiom Analysis Suite 4.0.1 Release Notes (For research use only. Not for use in diagnostic procedures.) Axiom Analysis Suite 4.0.1 includes the following changes/updates: 1. For library packages that support
More informationForensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual. Updated June, 2017
Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual Updated June, 2017 Table of Contents 1. Introduction... 1 2. Accessing FROG-kb Home Page and Features... 1 3. Home Page and
More informationPerforming a resequencing assembly
BioNumerics Tutorial: Performing a resequencing assembly 1 Aim In this tutorial, we will discuss the different options to obtain statistics about the sequence read set data and assess the quality, and
More informationINSTALLATION MANUAL. GenoProof Mixture 3. Version /03/2018 qualitype GmbH Dresden. All rights reserved.
INSTALLATION MANUAL GenoProof Mixture 3 Version 3.0.4 12/03/2018 qualitype GmbH Dresden. All rights reserved. qualitype GmbH Moritzburger Weg 67 01109 Dresden Germany Tel +49 (0)351 8838 2800 Fax +49 (0)351
More informationMinimum Recombinant Haplotype Configuration on Tree Pedigrees (Extended Abstract)
Minimum Recombinant Haplotype Configuration on Tree Pedigrees (Extended Abstract) Koichiro Doi 1, Jing Li 2, and Tao Jiang 2 1 Department of Computer Science Graduate School of Information Science and
More informationPackage Eagle. January 31, 2019
Type Package Package Eagle January 31, 2019 Title Multiple Locus Association Mapping on a Genome-Wide Scale Version 1.3.0 Maintainer Andrew George Author Andrew George [aut, cre],
More informationTrait Analysis by association, Evolution and Linkage (TASSEL) User Manual
Trait Analysis by association, Evolution and Linkage (TASSEL) User Manual www.maizegenetics.net/tassel Testing and Validation of TASSEL were performed by the Buckler Lab at North Carolina State University
More informationQUICKTEST user guide
QUICKTEST user guide Toby Johnson Zoltán Kutalik December 11, 2008 for quicktest version 0.94 Copyright c 2008 Toby Johnson and Zoltán Kutalik Permission is granted to copy, distribute and/or modify this
More informationAnnotating a single sequence
BioNumerics Tutorial: Annotating a single sequence 1 Aim The annotation application in BioNumerics has been designed for the annotation of coding regions on sequences. In this tutorial you will learn how
More informationInstallation Guide
In Silico Pharmacokinetic System The In Silico Pharmacokinetic System (ISPKS) is an agent-based, discrete-event, simulation library core designed to be the foundation for running in silico pharmacokinetic
More informationLFMM version Reference Manual (Graphical User Interface version)
LFMM version 1.2 - Reference Manual (Graphical User Interface version) Eric Frichot 1, Sean Schoville 1, Guillaume Bouchard 2, Olivier François 1 * 1. Université Joseph Fourier Grenoble, Centre National
More informationCHAPTER 6 REAL-VALUED GENETIC ALGORITHMS
CHAPTER 6 REAL-VALUED GENETIC ALGORITHMS 6.1 Introduction Gradient-based algorithms have some weaknesses relative to engineering optimization. Specifically, it is difficult to use gradient-based algorithms
More informationiplots extreme Next-generation interactive graphics for analysis of large data Simon Urbanek AT&T Labs Statistics Research
iplots extreme Next-generation interactive graphics for analysis of large data Simon Urbanek AT&T Labs Statistics Research Overview About interactive graphics iplots: next generation - why and how? New
More information