HaploHMM - A Hidden Markov Model (HMM) Based Program for Haplotype Inference Using Identified Haplotypes and Haplotype Patterns

Size: px
Start display at page:

Download "HaploHMM - A Hidden Markov Model (HMM) Based Program for Haplotype Inference Using Identified Haplotypes and Haplotype Patterns"

Transcription

1 HaploHMM - A Hidden Markov Model (HMM) Based Program for Haplotype Inference Using Identified Haplotypes and Haplotype Patterns Jihua Wu, Guo-Bo Chen, Degui Zhi, NianjunLiu, Kui Zhang 1. HaploHMM HaploHMM is a program to infer haplotypes from unrelated individuals using a set of identified haplotypes and haplotype patterns from previous studies. The program is based on an extended Hidden Markov Model. It has been shown that the program is effective for genotypes with a substantial amount of missing data. Please refer to Wu et al. (2014) for the detailed description of methods used in this program. 2. Files Included in the Package HaploHMM.exe: the executive file under windows operating system HaploHMM_Manual.pdf: the manual for HaploHMM HaploHMM_Geno.txt: an example file for genotype data HaploHMM_HapList.txt: an example file for the set of identified haplotypes HaploHMM_HapPattern.txt: an example file for identified haplotype patterns HaploHMM_res.txt: an example file for haplotypes of each individual HaploHMM_res.sample11: an example file for haplotypes of each individual sampled at the HMM iteration How to run HaploHMM HaploHMM.exe runs on the Command Promote window of Microsoft Windows operating system. Here we use Microsoft Windows 7 operating system as an example to show how to run HaploHMM. To run HaploHMM, you first need to open a Command Prompt window. There are two ways to do this. The first way is to click the Start button, click All Programs, click Accessories, and then click Command Prompt. The second way is to click Start button, type Command Prompt in the Search box, then double-click Command Prompt in the list of results.

2 After a Command Promote window is opened, change the folder to the folder that contains your program and data, then type the following command with the options followed by the specified value for each option: HaploHMM --genotype [genotype data file] --haplotype [initial haplotype file] --pattern [haplotype pattern file] --seed [seed] --burnin [number of iterations for burnin] --rounds [total number of iterations] --states [number of reference haplotypes] --sampleinterval [sample interval] --prefix [prefix of output files] The order of options can be arbitrary. However, you must specify the value for each option right after that option identifier with a space between them. Also be aware that each option is preceded by --. If you do not specify a value for an option, the program will use its default value. The brief description and the default value for each option are listed in Table 1. Using the data files provided in the package, we can infer their haplotypes based on the following command: Example 1: Command line example. HaploHMM --genotype HaploHMM_Geno.txt --haplotype HaploHMM_HapList.txt -- pattern HaploIHMM-HapPattern.txt --seed burnin 10 --rounds states errorrate prefix HaploHMM_res Table 1. The description and the default value for each option used in HaploHMM. Option Description Default value --genotype [genotype data file]: A character string for the file name of No default. genotype data. This is required by HaploHMM. --haplotype --pattern --seed --burnin --rounds [initial haplotype file]: A character string for the file name of a set of identified haplotypes. [haplotype pattern file]: A character string for the file name of the sets of identified haplotype patterns. [seed]: A positive integer for the starting seed of random number generator. [number of iterations for burnin]: A non-negative integer for the number of iterations before taking samples in the HMM sampling. [total number of iterations]: A non-negative integer for the total number of iterations in the HMM sampling No default. No default

3 --states --sampleinterval --errorrate --prefix [number of reference haplotypes]: A non-negative integer for the total number of reference haplotypes used in the HMM sampling. The value of 0 means the use of all inferred haplotypes from individuals provided from [genotype data file] and all haplotypes provided from [initial haplotype file]. [sample interval]: A no-native integer for the intermediate results. A value of 0 means no intermediate results. A value of 1 means the haplotype from each HMM iteration after the burin will be output to files. [error rate]: A number in [0, 1) to represent the error rate in the HMM. [prefix of output files]: A character string for the prefix of output file names res_hmm 4. How to prepare input files (options --genotype, --haplotype, --pattern ) HaploHMM can have three different types of input file specified by options --genotype, -- haplotype, and --pattern. These files are [genotype data file], [initial haplotype file], and [haplotype pattern file]. Only [genotype data file] is required by the program and the other two options can be omitted. [genotype data file] : --genotype This is the file for genotypes. Two alleles at a bi-allelic locus are detonated by 0 and 1, respectively. Missing allele is denoted as?. If your genotype is present-absent genotype as those described in our paper (Joo et al., 2007 and Wu, et al., 2014) and is denoted by one digit (0 absence, 1 presence, and? missing) at each locus, you can convert 1 to 1?, 0 to 0 0, and? to?? to comply the file format required by HaploHMM. The detailed format of genotype data file is as follows: First raw: the list of marker names in this genotype data file Second raw : the individual ID and genotype data First column: the individual ID (an arbitrary character string) Second column : Two columns for two alleles at a marker locus. The order of alleles does not matter. The columns should be separated by the space or the tab.

4 Example 2: [genotype data file] 2DL1 2DL2 2DS1 2DL3 3DS1 E ?? 0 1 E002 0? E ? [initial haplotype file]: --haplotype This is the file for a set of identified haplotypes. Each haplotype is listed in one row as a combination of allele symbols of 0 and 1. The missing allele is not allowed in the file. The first column is the list of marker names and must match the names given in [genotype data file]. The detailed format of [initial haplotype file] is as follows: First raw: the marker names which match the names given in [genotype data file] Second raw : haplotype and its name First column: the haplotype name Second column : each column is for an allele at a marker locus for the haplotype. The columns should be separated by the space or the tab. Example 3: [initial haplotype file] 2DL1 2DL2 2DS1 2DL3 3DS1 A A A [haplotype pattern file]: --pattern This is the file for the sets of haplotype patterns. Each set of haplotype patterns consists of a header line to indicate the involved markers and a set of haplotype patterns allowed in the program. For example, if you have a set of haplotype patterns for markers 2DS2 and 2DL2 and have observed that they are in complete negative LD. In other words, you only have two types of haplotypes 1 0 and 0 1 allowed for 2DS2 and 2DL2 but do not have other two haplotypes ( 0 0 and 1 1 ). This set of haplotype patterns can be specified in the file as follows: 2DS2 2DL2 2

5 In the first row, the first two are the names of two markers and the last number is the number of haplotype patterns allowed in the program for these two markers. The subsequent rows specify the haplotype patterns.. The detailed format of [haplotype pattern file] is as follows: First section: the first set of haplotype patterns First row: the header line First column : the name of markers involved Last column: the number of haplotype patterns that are allowed Second row...: each haplotype pattern is specified in each row; Second section..: other sets of haplotype patterns Example 4: [haplotype pattern file] 3DL DS2 2DL DS1 3DL Example 5: [haplotype pattern file] 3DL2 2DL DS2 2DL2 2DL DS1 3DL It is worth noting that all identified haplotypes in initial haplotype file must be compatible

6 with one haplotype pattern in each set of haplotype patterns. For example, the haplotype ( ) for the markers 3DL2, 2DL1, 2DS2, 2DL2, 2DL3, 2DL4, 3DS1, 3DL1 contradicts with the haplotype pattern in example 5 because the haplotype pattern (1 0 1) for the markers 2DS2, 2DL2, and 2DL3 is not allowed. So this haplotype cannot be used together with the haplotype patterns in example 5. The program checks the contradiction between haplotypes in [initial haplotype file] and haplotype patterns in [haplotype pattern file]. The program will stop and reports an error message when a contradiction is found. 5. The Formats and Interpretations of Output Files HaploHMM generates an output file with the haplotype pair of each individual. If the value from the option --sampleinterval is a positive integer, the haplotype pair of each individual from certain HMM iterations after the burin will be output to files. The prefix of the output file names is specified by the option --prefix. All output files have the same format: each row consists of the name of each individual, the name of haplotype (either hap1 or hap2 ), and one of two haplotypes of that individual. Example 6: [output file] ind1 hap ind1 hap ind2 hap ind2 hap ind3 hap ind3 hap ind4 hap ind4 hap Options for HMM sampling HaploHMM provides set of options for the HMM sampling. You can set the starting seed of the random number generator by --seed, the number of iterations before taking samples by --burnin, the number of total iterations by --rounds, the number of reference haplotypes by --states, the error rate by --errorrate, and the intermediate results by -- sampleinterval. In general, the larger values of --rounds and --states are, the more computational time are needed and the more accurate results can be obtained. For the option --rounds, a

7 number greater than 100 is sufficient in most situations. The default value of --states is 0, which means that the program will use all inferred haplotypes from individuals provided from [genotype data file] and all haplotypes provided from [initial haplotype file] as reference haplotypes. Since the computational time is proportional to the square of the number of reference haplotypes, you may want to use a smaller number and a few hundred reference haplotypes are sufficient in most situations. 7. Remarks In this section, we highlight some possible solutions for the problems that may be encountered in running HaploHMM. HaploHMM can only handle bi-allelic markers and the two alleles at each marker locus should be specified as 1 and 0. The missing data is specified as?. You cannot use other allele symbols such as A, G, C, T or other numeric or character symbols. The first lines in three input files only list the marker names in the file and the order of these names is the order that data are entered. These first lines are not the variable headers. If you have a data file with the first line as column names (variable header), you need to modify the first line. Otherwise, the program will report an error message. Both [initial haplotype file] and [haplotype pattern file] can help to reduce the computing time. Without one of these input files, the program will run longer in the presence of a large of portion of missing data. You can monitor the computing progress through the screen output. To reduce the computational time, you can use the smaller number for options --rounds and --states. Please refer to Section 6 for more details. 8. Contact information This program and related materials can be downloaded through following website: Bugs and comments should be addressed to:

8 Kui Zhang Section of Statistical Genetics Department of Biostatistics University of Alabama at Birmingham 1600 University Blvd Change Log The program is first released on June 22, References Yoo Y, Kaslow R, Tang J, Zhang K Haplotype inference for present-absent genotype data for clustered genes using identified haplotypes and haplotype patterns. Bioinformatics, 23: Jihua Wu, Guo-Bo Chen, Degui Zhi, Nianjun Liu, Kui Zhang A hidden Markov model for haplotype inference for present-absent data of clustered genes using identified haplotypes and haplotype patterns. Frontiers in Statistical Genetics. Under Review.

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc.

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Overview This script converts allelic dosage values to genotypes based on user-specified thresholds. The dosage data may be in

More information

ELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2

ELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2 ELAI user manual Yongtao Guan Baylor College of Medicine Version 1.0 25 June 2015 Contents 1 Copyright 2 2 What ELAI Can Do 2 3 A simple example 2 4 Input file formats 3 4.1 Genotype file format....................................

More information

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011 GMDR User Manual GMDR software Beta 0.9 Updated March 2011 1 As an open source project, the source code of GMDR is published and made available to the public, enabling anyone to copy, modify and redistribute

More information

Importing and Merging Data Tutorial

Importing and Merging Data Tutorial Importing and Merging Data Tutorial Release 1.0 Golden Helix, Inc. February 17, 2012 Contents 1. Overview 2 2. Import Pedigree Data 4 3. Import Phenotypic Data 6 4. Import Genetic Data 8 5. Import and

More information

HapBlock A Suite of Dynamic Programming Algorithms for Haplotype Block Partitioning and Tag SNP Selection Based on Haplotype and Genotype Data

HapBlock A Suite of Dynamic Programming Algorithms for Haplotype Block Partitioning and Tag SNP Selection Based on Haplotype and Genotype Data HapBlock A Suite of Dynamic Programming Algorithms for Haplotype Block Partitioning and Tag SNP Selection Based on Haplotype and Genotype Data Introduction The suite of programs, HapBlock, is developed

More information

Genetic Analysis. Page 1

Genetic Analysis. Page 1 Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced

More information

Documentation for BayesAss 1.3

Documentation for BayesAss 1.3 Documentation for BayesAss 1.3 Program Description BayesAss is a program that estimates recent migration rates between populations using MCMC. It also estimates each individual s immigrant ancestry, the

More information

BEAGLECALL 1.0. Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington. 15 November 2010

BEAGLECALL 1.0. Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington. 15 November 2010 BEAGLECALL 1.0 Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington 15 November 2010 BEAGLECALL 1.0 P a g e i Contents 1 Introduction... 1 1.1 Citing BEAGLECALL...

More information

Step-by-Step Guide to Basic Genetic Analysis

Step-by-Step Guide to Basic Genetic Analysis Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control

More information

Procedure for GeneMapper ID. 1.0 Purpose To specify the steps for performing analysis on DNA samples using GeneMapper ID (GMID) software.

Procedure for GeneMapper ID. 1.0 Purpose To specify the steps for performing analysis on DNA samples using GeneMapper ID (GMID) software. Procedure for GeneMapper ID 1.0 Purpose To specify the steps for performing analysis on DNA samples using GeneMapper ID (GMID) software. 2.0 Scope This procedure applies to DNA Database Forensic Scientists

More information

Haplotype Analysis. 02 November 2003 Mendel Short IGES Slide 1

Haplotype Analysis. 02 November 2003 Mendel Short IGES Slide 1 Haplotype Analysis Specifies the genetic information descending through a pedigree Useful visualization of the gene flow through a pedigree A haplotype for a given individual and set of loci is defined

More information

Step-by-Step Guide to Relatedness and Association Mapping Contents

Step-by-Step Guide to Relatedness and Association Mapping Contents Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...

More information

fasta2genotype.py Version 1.10 Written for Python Available on request from the author 2017 Paul Maier

fasta2genotype.py Version 1.10 Written for Python Available on request from the author 2017 Paul Maier 1 fasta2genotype.py Version 1.10 Written for Python 2.7.10 Available on request from the author 2017 Paul Maier This program takes a fasta file listing all sequence haplotypes of all individuals at all

More information

Package Hapi. July 28, 2018

Package Hapi. July 28, 2018 Type Package Package Hapi July 28, 2018 Title Inference of Chromosome-Length Haplotypes Using Genomic Data of Single Gamete Cells Version 0.0.3 Author, Han Qu, Jinfeng Chen, Shibo Wang, Le Zhang, Julong

More information

Step-by-Step Guide to Advanced Genetic Analysis

Step-by-Step Guide to Advanced Genetic Analysis Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options

More information

Operating instructions for MixtureCalc v1.2 (Freeware Version)

Operating instructions for MixtureCalc v1.2 (Freeware Version) 1 Objective To provide instructions for importing data into and for performing mixture calculations using MixtureCalc-v1.2. MixtureCalc-v1.2 is validated for SPSA casework only and no warranty is provided

More information

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual Department of Epidemiology and Biostatistics Wolstein Research Building 2103 Cornell Rd Case Western

More information

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing

More information

QTX. Tutorial for. by Kim M.Chmielewicz Kenneth F. Manly. Software for genetic mapping of Mendelian markers and quantitative trait loci.

QTX. Tutorial for. by Kim M.Chmielewicz Kenneth F. Manly. Software for genetic mapping of Mendelian markers and quantitative trait loci. Tutorial for QTX by Kim M.Chmielewicz Kenneth F. Manly Software for genetic mapping of Mendelian markers and quantitative trait loci. Available in versions for Mac OS and Microsoft Windows. revised for

More information

User Manual ixora: Exact haplotype inferencing and trait association

User Manual ixora: Exact haplotype inferencing and trait association User Manual ixora: Exact haplotype inferencing and trait association June 27, 2013 Contents 1 ixora: Exact haplotype inferencing and trait association 2 1.1 Introduction.............................. 2

More information

Spotter Documentation Version 0.5, Released 4/12/2010

Spotter Documentation Version 0.5, Released 4/12/2010 Spotter Documentation Version 0.5, Released 4/12/2010 Purpose Spotter is a program for delineating an association signal from a genome wide association study using features such as recombination rates,

More information

Recalling Genotypes with BEAGLECALL Tutorial

Recalling Genotypes with BEAGLECALL Tutorial Recalling Genotypes with BEAGLECALL Tutorial Release 8.1.4 Golden Helix, Inc. June 24, 2014 Contents 1. Format and Confirm Data Quality 2 A. Exclude Non-Autosomal Markers......................................

More information

Introduction to GDS. Stephanie Gogarten. July 18, 2018

Introduction to GDS. Stephanie Gogarten. July 18, 2018 Introduction to GDS Stephanie Gogarten July 18, 2018 Genomic Data Structure CoreArray (C++ library) designed for large-scale data management of genome-wide variants data format (GDS) to store multiple

More information

Package REGENT. R topics documented: August 19, 2015

Package REGENT. R topics documented: August 19, 2015 Package REGENT August 19, 2015 Title Risk Estimation for Genetic and Environmental Traits Version 1.0.6 Date 2015-08-18 Author Daniel J.M. Crouch, Graham H.M. Goddard & Cathryn M. Lewis Maintainer Daniel

More information

6.867 Machine learning

6.867 Machine learning 6.867 Machine learning Final eam December 3, 24 Your name and MIT ID: J. D. (Optional) The grade ou would give to ourself + a brief justification. A... wh not? Problem 5 4.5 4 3.5 3 2.5 2.5 + () + (2)

More information

Devyser QF-PCR. Guide to Sample Runs, Data Analysis & Results Interpretation

Devyser QF-PCR. Guide to Sample Runs, Data Analysis & Results Interpretation Devyser QF-PCR Guide to Sample Runs, Data Analysis & Results Interpretation Version 4-2013 Contents 1. Setting up a sample run on an ABI Genetic Analyzer... 3 1.1 Introduction... 3 1.2 Workflow... 3 1.3

More information

6.867 Machine learning

6.867 Machine learning 6.867 Machine learning Final eam December 3, 24 Your name and MIT ID: J. D. (Optional) The grade ou would give to ourself + a brief justification. A... wh not? Cite as: Tommi Jaakkola, course materials

More information

Package rubias. February 2, 2019

Package rubias. February 2, 2019 Type Package Package rubias February 2, 2019 Title Bayesian Inference from the Conditional Genetic Stock Identification Model Version 0.2.0 Maintainer Eric C. Anderson Implements

More information

Estimation of haplotypes

Estimation of haplotypes Estimation of haplotypes Cavan Reilly October 4, 2013 Table of contents Estimating haplotypes with the EM algorithm Individual level haplotypes Testing for differences in haplotype frequency Using the

More information

HPC Course Session 3 Running Applications

HPC Course Session 3 Running Applications HPC Course Session 3 Running Applications Checkpointing long jobs on Iceberg 1.1 Checkpointing long jobs to safeguard intermediate results For long running jobs we recommend using checkpointing this allows

More information

PediHaplotyper Manual

PediHaplotyper Manual PediHaplotyper Manual Roeland Voorrips, Wageningen UR Plant Breeding, 2015 Introduction PediHaplotyper is software for assigning haploblock alleles to individuals in a pedigree, based on observed marker

More information

JMP Genomics. Release Notes. Version 6.0

JMP Genomics. Release Notes. Version 6.0 JMP Genomics Version 6.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive

More information

The Lander-Green Algorithm in Practice. Biostatistics 666

The Lander-Green Algorithm in Practice. Biostatistics 666 The Lander-Green Algorithm in Practice Biostatistics 666 Last Lecture: Lander-Green Algorithm More general definition for I, the "IBD vector" Probability of genotypes given IBD vector Transition probabilities

More information

POLDISP 1.0c. a software package to estimate Pollen Dispersal. by Juan J. Robledo-Arnuncio, Frédéric Austerlitz and Peter E. Smouse USER'S MANUAL

POLDISP 1.0c. a software package to estimate Pollen Dispersal. by Juan J. Robledo-Arnuncio, Frédéric Austerlitz and Peter E. Smouse USER'S MANUAL POLDISP 1.0c a software package to estimate Pollen Dispersal by Juan J. Robledo-Arnuncio, Frédéric Austerlitz and Peter E. Smouse USER'S MANUAL June 009 (see release notes at the end of this document)

More information

Hidden Markov Models in the context of genetic analysis

Hidden Markov Models in the context of genetic analysis Hidden Markov Models in the context of genetic analysis Vincent Plagnol UCL Genetics Institute November 22, 2012 Outline 1 Introduction 2 Two basic problems Forward/backward Baum-Welch algorithm Viterbi

More information

500K Data Analysis Workflow using BRLMM

500K Data Analysis Workflow using BRLMM 500K Data Analysis Workflow using BRLMM I. INTRODUCTION TO BRLMM ANALYSIS TOOL... 2 II. INSTALLATION AND SET-UP... 2 III. HARDWARE REQUIREMENTS... 3 IV. BRLMM ANALYSIS TOOL WORKFLOW... 3 V. RESULTS/OUTPUT

More information

USER S MANUAL FOR THE AMaCAID PROGRAM

USER S MANUAL FOR THE AMaCAID PROGRAM USER S MANUAL FOR THE AMaCAID PROGRAM TABLE OF CONTENTS Introduction How to download and install R Folder Data The three AMaCAID models - Model 1 - Model 2 - Model 3 - Processing times Changing directory

More information

GMDR User Manual Version 1.0

GMDR User Manual Version 1.0 GMDR User Manual Version 1.0 Oct 30, 2011 1 GMDR is a free, open-source interaction analysis tool, aimed to perform gene-gene interaction with generalized multifactor dimensionality methods. GMDR is being

More information

MAGA: Meta-Analysis of Gene-level Associations

MAGA: Meta-Analysis of Gene-level Associations MAGA: Meta-Analysis of Gene-level Associations SYNOPSIS MAGA [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome DESCRIPTION

More information

User Manual for GIGI v1.06.1

User Manual for GIGI v1.06.1 1 User Manual for GIGI v1.06.1 Author: Charles Y K Cheung [cykc@uw.edu] Ellen M Wijsman [wijsman@uw.edu] Department of Biostatistics University of Washington Last Modified on 1/31/2015 2 Contents Introduction...

More information

Flow Cytometry Analysis Software. Developed by scientists, for scientists. User Manual. Version Introduction:

Flow Cytometry Analysis Software. Developed by scientists, for scientists. User Manual. Version Introduction: Flowlogic Flow Cytometry Analysis Software Developed by scientists, for scientists User Manual Version 7.2.1 Introduction: Overview, Preferences, Saving and Opening Analysis Files www.inivai.com TABLE

More information

Release Notes. JMP Genomics. Version 4.0

Release Notes. JMP Genomics. Version 4.0 JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive

More information

haplo.score Score Tests for Association of Traits with Haplotypes when Linkage Phase is Ambiguous

haplo.score Score Tests for Association of Traits with Haplotypes when Linkage Phase is Ambiguous haploscore Score Tests for Association of Traits with Haplotypes when Linkage Phase is Ambiguous Charles M Rowland, David E Tines, and Daniel J Schaid Mayo Clinic Rochester, MN E-mail contact: rowland@mayoedu

More information

FHIR Genomics Use Case: Reporting HLA Genotyping Consensus-Sequence-Block & Observation-genetics-Sequence Cardinality

FHIR Genomics Use Case: Reporting HLA Genotyping Consensus-Sequence-Block & Observation-genetics-Sequence Cardinality FHIR Genomics Use Case: Reporting HLA Genotyping Consensus-Sequence-Block & Observation-genetics-Sequence Cardinality Bob Milius, PhD Bioinformatics NMDP/Be The Match Outline Background MIRING, & HML FHIR

More information

Random Forest in Genomic Selection

Random Forest in Genomic Selection Random Forest in genomic selection 1 Dpto Mejora Genética Animal, INIA, Madrid; Universidad Politécnica de Valencia, 20-24 September, 2010. Outline 1 Remind 2 Random Forest Introduction Classification

More information

msbgs - a program for generating samples under background selection models

msbgs - a program for generating samples under background selection models msbgs - a program for generating samples under background selection models Kai Zeng Department of Animal and Plant Sciences, University of Sheffield, UK k.zeng@sheffield.ac.uk Version 1.01 1 Introduction

More information

PHYC 500: Introduction to LabView. Exercise 8 (v 1.3) M.P. Hasselbeck, University of New Mexico. Arrays, XY Graphs, Disk I/O

PHYC 500: Introduction to LabView. Exercise 8 (v 1.3) M.P. Hasselbeck, University of New Mexico. Arrays, XY Graphs, Disk I/O PHYC 500: Introduction to LabView M.P. Hasselbeck, University of New Mexico Exercise 8 (v 1.3) Arrays, XY Graphs, Disk I/O Place two numeric controls (label them Number of points and Offset ) on the Front

More information

CHAPTER 1: INTRODUCTION...

CHAPTER 1: INTRODUCTION... Linkage Analysis Package User s Guide to Analysis Programs Version 5.10 for IBM PC/compatibles 10 Oct 1996, updated 2 November 2013 Table of Contents CHAPTER 1: INTRODUCTION... 1 1.0 OVERVIEW... 1 1.1

More information

M(ARK)S(IM) Dec. 1, 2009 Payseur Lab University of Wisconsin

M(ARK)S(IM) Dec. 1, 2009 Payseur Lab University of Wisconsin M(ARK)S(IM) Dec. 1, 2009 Payseur Lab University of Wisconsin M(ARK)S(IM) extends MS by enabling the user to simulate microsatellite data sets under a variety of mutational models. Simulated data sets are

More information

LDsplit: A Java Program for Association Studies of Meiotic Recombination Hotspots Using SNP Data

LDsplit: A Java Program for Association Studies of Meiotic Recombination Hotspots Using SNP Data LDsplit: A Java Program for Association Studies of Meiotic Recombination Hotspots Using SNP Data Peng Yang, Jing Guo and Jie Zheng* Bioinformatics Research Centre (BIRC) School of Computer Engineering

More information

Notes on QTL Cartographer

Notes on QTL Cartographer Notes on QTL Cartographer Introduction QTL Cartographer is a suite of programs for mapping quantitative trait loci (QTLs) onto a genetic linkage map. The programs use linear regression, interval mapping

More information

GenViewer Tutorial / Manual

GenViewer Tutorial / Manual GenViewer Tutorial / Manual Table of Contents Importing Data Files... 2 Configuration File... 2 Primary Data... 4 Primary Data Format:... 4 Connectivity Data... 5 Module Declaration File Format... 5 Module

More information

User s Guide. Version 2.2. Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario

User s Guide. Version 2.2. Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario User s Guide Version 2.2 Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario Mehdi Sargolzaei, Jacques Chesnais and Flavio Schenkel Jan 2014 Disclaimer

More information

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual Miao-Xin Li, Jiang Li Department of Psychiatry Centre for Genomic Sciences Department

More information

Package dkdna. June 1, Description Compute diffusion kernels on DNA polymorphisms, including SNP and bi-allelic genotypes.

Package dkdna. June 1, Description Compute diffusion kernels on DNA polymorphisms, including SNP and bi-allelic genotypes. Package dkdna June 1, 2015 Type Package Title Diffusion Kernels on a Set of Genotypes Version 0.1.1 Date 2015-05-31 Author Gota Morota and Masanori Koyama Maintainer Gota Morota Compute

More information

All About PlexSet Technology Data Analysis in nsolver Software

All About PlexSet Technology Data Analysis in nsolver Software All About PlexSet Technology Data Analysis in nsolver Software PlexSet is a multiplexed gene expression technology which allows pooling of up to 8 samples per ncounter cartridge lane, enabling users to

More information

Square roots: We say that the square root of 16 is 4. We write this as 16 4.

Square roots: We say that the square root of 16 is 4. We write this as 16 4. Intermediate algebra Class notes Radicals and Radical Functions (section 10.1) These are square roots, cube roots, etc. Square roots: We say that the square root of 16 is 4. We write this as 16 4. Root

More information

Workflow Guide Slide(s) Topic 2-6 Importing Data and Labeling Samples 7-11 Processing Data Without an Allelic Ladder Processing Data With an

Workflow Guide Slide(s) Topic 2-6 Importing Data and Labeling Samples 7-11 Processing Data Without an Allelic Ladder Processing Data With an Workflow Guide Slide(s) Topic 2-6 Importing Data and Labeling Samples 7-11 Processing Data Without an Allelic Ladder 12-23 Processing Data With an Allelic Ladder 24-30 Reviewing Size and Allele Calls 31-37

More information

E. coli functional genotyping: predicting phenotypic traits from whole genome sequences

E. coli functional genotyping: predicting phenotypic traits from whole genome sequences BioNumerics Tutorial: E. coli functional genotyping: predicting phenotypic traits from whole genome sequences 1 Aim In this tutorial we will screen genome sequences of Escherichia coli samples for phenotypic

More information

C How to Program, 6/e by Pearson Education, Inc. All Rights Reserved.

C How to Program, 6/e by Pearson Education, Inc. All Rights Reserved. C How to Program, 6/e 1992-2010 by Pearson Education, Inc. An important part of the solution to any problem is the presentation of the results. In this chapter, we discuss in depth the formatting features

More information

Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual. Updated June, 2017

Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual. Updated June, 2017 Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual Updated June, 2017 Table of Contents 1. Introduction... 1 2. Accessing FROG-kb Home Page and Features... 1 3. Home Page and

More information

Package lodgwas. R topics documented: November 30, Type Package

Package lodgwas. R topics documented: November 30, Type Package Type Package Package lodgwas November 30, 2015 Title Genome-Wide Association Analysis of a Biomarker Accounting for Limit of Detection Version 1.0-7 Date 2015-11-10 Author Ahmad Vaez, Ilja M. Nolte, Peter

More information

Overview. Experiment Specifications. This tutorial will enable you to

Overview. Experiment Specifications. This tutorial will enable you to Defining a protocol in BioAssay Overview BioAssay provides an interface to store, manipulate, and retrieve biological assay data. The application allows users to define customized protocol tables representing

More information

Predicated Software Pipelining Technique for Loops with Conditions

Predicated Software Pipelining Technique for Loops with Conditions Predicated Software Pipelining Technique for Loops with Conditions Dragan Milicev and Zoran Jovanovic University of Belgrade E-mail: emiliced@ubbg.etf.bg.ac.yu Abstract An effort to formalize the process

More information

Table of Contents TABLE OF CONTENTS...I TABLE OF FIGURES... II D - DATA PLOTTING, REGRAPH AND EXPORT...16 E - CHANGE AND VERSION RECORD...

Table of Contents TABLE OF CONTENTS...I TABLE OF FIGURES... II D - DATA PLOTTING, REGRAPH AND EXPORT...16 E - CHANGE AND VERSION RECORD... Table of Contents TABLE OF CONTENTS...I TABLE OF FIGURES... II A - DISCUSSION...1 B - MAIN SETUP...6 B.1 - Setup Dialog...7 B.2 - Description...8 B.3 - Controls...10 C - PLOT SETUP...12 C.1 - Setup Dialog...13

More information

Notice to U.S. Forensic Laboratories on the status of the U.S. Y-STR Database

Notice to U.S. Forensic Laboratories on the status of the U.S. Y-STR Database Notice to U.S. Forensic Laboratories on the status of the U.S. Y-STR Database Funded by the National Institute of Justice, the U.S. Y-STR Database (http://usystrdatabase.org) has been managed by the National

More information

QTL Analysis with QGene Tutorial

QTL Analysis with QGene Tutorial QTL Analysis with QGene Tutorial Phillip McClean 1. Getting the software. The first step is to download and install the QGene software. It can be obtained from the following WWW site: http://qgene.org

More information

Axiom Analysis Suite Release Notes (For research use only. Not for use in diagnostic procedures.)

Axiom Analysis Suite Release Notes (For research use only. Not for use in diagnostic procedures.) Axiom Analysis Suite 4.0.1 Release Notes (For research use only. Not for use in diagnostic procedures.) Axiom Analysis Suite 4.0.1 includes the following changes/updates: 1. For library packages that support

More information

SNP HiTLink Manual. Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1

SNP HiTLink Manual. Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1 SNP HiTLink Manual Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1 1 Department of Neurology, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan 2 Dynacom Co., Ltd, Kanagawa,

More information

Incompatibility Dimensions and Integration of Atomic Commit Protocols

Incompatibility Dimensions and Integration of Atomic Commit Protocols The International Arab Journal of Information Technology, Vol. 5, No. 4, October 2008 381 Incompatibility Dimensions and Integration of Atomic Commit Protocols Yousef Al-Houmaily Department of Computer

More information

Vignette for the package rehh (version 2+) Mathieu Gautier, Alexander Klassmann and Renaud Vitalis 24/10/2016

Vignette for the package rehh (version 2+) Mathieu Gautier, Alexander Klassmann and Renaud Vitalis 24/10/2016 Vignette for the package rehh (version 2+) Mathieu Gautier, Alexander Klassmann and Renaud Vitalis 24/10/2016 Contents 1 Input Files 2 1.1 Haplotype data file..........................................

More information

User Guide Written By Yasser EL-Manzalawy

User Guide Written By Yasser EL-Manzalawy User Guide Written By Yasser EL-Manzalawy 1 Copyright Gennotate development team Introduction As large amounts of genome sequence data are becoming available nowadays, the development of reliable and efficient

More information

Binary vector quantizer design using soft centroids

Binary vector quantizer design using soft centroids Signal Processing: Image Communication 14 (1999) 677}681 Binary vector quantizer design using soft centroids Pasi FraK nti *, Timo Kaukoranta Department of Computer Science, University of Joensuu, P.O.

More information

A comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals

A comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals A comprehensive modelling framework and a multiple-imputation approach to haplotypic analysis of unrelated individuals GUI Release v1.0.2: User Manual January 2009 If you find this software useful, please

More information

Tutorial on gene-c ancestry es-ma-on: How to use LASER. Chaolong Wang Sequence Analysis Workshop June University of Michigan

Tutorial on gene-c ancestry es-ma-on: How to use LASER. Chaolong Wang Sequence Analysis Workshop June University of Michigan Tutorial on gene-c ancestry es-ma-on: How to use LASER Chaolong Wang Sequence Analysis Workshop June 2014 @ University of Michigan LASER: Loca-ng Ancestry from SEquence Reads Main func:ons of the so

More information

BUCKy Bayesian Untangling of Concordance Knots (applied to yeast and other organisms)

BUCKy Bayesian Untangling of Concordance Knots (applied to yeast and other organisms) Introduction BUCKy Bayesian Untangling of Concordance Knots (applied to yeast and other organisms) Version 1.2, 17 January 2008 Copyright c 2008 by Bret Larget Last updated: 11 November 2008 Departments

More information

arxiv: v2 [q-bio.qm] 17 Nov 2013

arxiv: v2 [q-bio.qm] 17 Nov 2013 arxiv:1308.2150v2 [q-bio.qm] 17 Nov 2013 GeneZip: A software package for storage-efficient processing of genotype data Palmer, Cameron 1 and Pe er, Itsik 1 1 Center for Computational Biology and Bioinformatics,

More information

MultiMap. User's Guide. A Program for Automated Genetic Linkage Mapping. version 1.1. Written by. Tara Matise. Mark Perlin Aravinda Chakravarti

MultiMap. User's Guide. A Program for Automated Genetic Linkage Mapping. version 1.1. Written by. Tara Matise. Mark Perlin Aravinda Chakravarti MultiMap User's Guide A Program for Automated Genetic Linkage Mapping version 1.1 Written by Tara Cox Matise Mark Perlin Aravinda Chakravarti Language: Common Lisp Computer: Sun/Dec/HP... Program for Linkage

More information

Package MsatAllele. February 15, 2013

Package MsatAllele. February 15, 2013 Package MsatAllele February 15, 2013 Type Package Title Visualizes the scoring and binning of microsatellite fragment sizes Version 1.03 Date 2008-09-11 Author Maintainer The package

More information

LASER: Locating Ancestry from SEquence Reads version 2.04

LASER: Locating Ancestry from SEquence Reads version 2.04 LASER: Locating Ancestry from SEquence Reads version 2.04 Chaolong Wang 1 Computational and Systems Biology Genome Institute of Singapore A*STAR, Singapore 138672, Singapore Xiaowei Zhan 2 Department of

More information

round decimals to the nearest decimal place and order negative numbers in context

round decimals to the nearest decimal place and order negative numbers in context 6 Numbers and the number system understand and use proportionality use the equivalence of fractions, decimals and percentages to compare proportions use understanding of place value to multiply and divide

More information

Adobe Marketing Cloud Data Workbench Controlled Experiments

Adobe Marketing Cloud Data Workbench Controlled Experiments Adobe Marketing Cloud Data Workbench Controlled Experiments Contents Data Workbench Controlled Experiments...3 How Does Site Identify Visitors?...3 How Do Controlled Experiments Work?...3 What Should I

More information

Package HMMASE. February 4, HMMASE R package

Package HMMASE. February 4, HMMASE R package Package HMMASE February 4, 2014 Type Package Title HMMASE R package Version 1.0 Date 2014-02-04 Author Juan R. Steibel, Heng Wang, Ping-Shou Zhong Maintainer Heng Wang An R package that

More information

G-PhoCS Generalized Phylogenetic Coalescent Sampler version 1.2.3

G-PhoCS Generalized Phylogenetic Coalescent Sampler version 1.2.3 G-PhoCS Generalized Phylogenetic Coalescent Sampler version 1.2.3 Contents 1. About G-PhoCS 2. Download and Install 3. Overview of G-PhoCS analysis: input and output 4. The sequence file 5. The control

More information

Some notes about Event-B and Rodin

Some notes about Event-B and Rodin Some notes about Event-B and Rodin Résumé This document briefly presents the language event-b and the tool Rodin. For a comprehensive presentation, refer to the event-b page http://www.event-b.org/, the

More information

Importing MLST data from an Excel file

Importing MLST data from an Excel file BioNumerics Tutorial: Importing MLST data from an Excel file 1 Aims This tutorial shows how to import MLST allele information from a MS Excel file as character data in a BioNumerics database. It illustrates

More information

MAN Package for pedigree analysis. Contents.

MAN Package for pedigree analysis. Contents. 1 MAN Package for pedigree analysis. Contents. Introduction 5 1. Operations with pedigree data. 5 1.1. Data input options. 5 1.1.1. Import from file. 5 1.1.2. Manual input. 7 1.2. Drawing of the pedigree

More information

The same can also be achieved by clicking on Format Character and then selecting an option from the Typeface list box.

The same can also be achieved by clicking on Format Character and then selecting an option from the Typeface list box. CHAPTER 2 TEXT FORMATTING A text without any special formatting can have a monotonous appearance. To outline text, to highlight individual words, quotations, or references, or to separate certain parts

More information

E08 Lay Table Automation

E08 Lay Table Automation E08 Lay Table Automation This example demonstrates how the OrcaFlex automation tools can be used to create installation lay tables. A basecase model is analysed to determine the range of top tension values

More information

MACAU User Manual. Xiang Zhou. March 15, 2017

MACAU User Manual. Xiang Zhou. March 15, 2017 MACAU User Manual Xiang Zhou March 15, 2017 Contents 1 Introduction 2 1.1 What is MACAU...................................... 2 1.2 How to Cite MACAU................................... 2 1.3 The Model.........................................

More information

MapReduce Design Patterns

MapReduce Design Patterns MapReduce Design Patterns MapReduce Restrictions Any algorithm that needs to be implemented using MapReduce must be expressed in terms of a small number of rigidly defined components that must fit together

More information

GeneMapper ID-X Software

GeneMapper ID-X Software Quick Reference Guide GeneMapper ID-X Software Version 1.0 Overview GeneMapper ID-X Software Version 1.0 is an automated genotyping software solution for all human identification (HID) data analysis needs,

More information

Data Walkthrough: Background

Data Walkthrough: Background Data Walkthrough: Background File Types FASTA Files FASTA files are text-based representations of genetic information. They can contain nucleotide or amino acid sequences. For this activity, students will

More information

HIDDEN MARKOV MODELS AND SEQUENCE ALIGNMENT

HIDDEN MARKOV MODELS AND SEQUENCE ALIGNMENT HIDDEN MARKOV MODELS AND SEQUENCE ALIGNMENT - Swarbhanu Chatterjee. Hidden Markov models are a sophisticated and flexible statistical tool for the study of protein models. Using HMMs to analyze proteins

More information

MIRING: Minimum Information for Reporting Immunogenomic NGS Genotyping. Data Standards Hackathon for NGS HACKATHON 1.0 Bethesda, MD September

MIRING: Minimum Information for Reporting Immunogenomic NGS Genotyping. Data Standards Hackathon for NGS HACKATHON 1.0 Bethesda, MD September MIRING: Minimum Information for Reporting Immunogenomic NGS Genotyping Data Standards Hackathon for NGS HACKATHON 1.0 Bethesda, MD September 27 2014 Static Dynamic Static Minimum Information for Reporting

More information

BIR pipeline steps and subsequent output files description STEP 1: BLAST search

BIR pipeline steps and subsequent output files description STEP 1: BLAST search Lifeportal (Brief description) The Lifeportal at University of Oslo (https://lifeportal.uio.no) is a Galaxy based life sciences portal lifeportal.uio.no under the UiO tools section for phylogenomic analysis,

More information

Year 9: Long term plan

Year 9: Long term plan Year 9: Long term plan Year 9: Long term plan Unit Hours Powerful procedures 7 Round and round 4 How to become an expert equation solver 6 Why scatter? 6 The construction site 7 Thinking proportionally

More information

Package logicfs. R topics documented:

Package logicfs. R topics documented: Package logicfs November 21, 2017 Title Identification of SNP Interactions Version 1.48.0 Date 2013-09-12 Author Holger Schwender Maintainer Holger Schwender Depends LogicReg, mcbiopi

More information

citius A program to apply Markov chain Monte Carlo method for multilocus analysis of large complex pedigrees.

citius A program to apply Markov chain Monte Carlo method for multilocus analysis of large complex pedigrees. With the support of the Walloon Region of Belgium, Direction Générale de l'agriculture citius May 2008 A program to apply Markov chain Monte Carlo method for multilocus analysis of large complex pedigrees.

More information

1. Below is an example 1D river reach model built in HEC-RAS and displayed in the HEC-RAS user interface:

1. Below is an example 1D river reach model built in HEC-RAS and displayed in the HEC-RAS user interface: How Do I Import HEC-RAS Cross-Section Data? Flood Modeller allows you to read in cross sections defined in HEC-RAS models, automatically converting them to Flood Modeller 1D cross sections. The procedure

More information