The Imprinting Model

Similar documents
The fgwas Package. Version 1.0. Pennsylvannia State University

Version 2.0 Zhong Wang Department of Public Health Sciences Pennsylvannia State University

The fgwas software. Version 1.0. Pennsylvannia State University

The Lander-Green Algorithm in Practice. Biostatistics 666

Overview. Background. Locating quantitative trait loci (QTL)

QTX. Tutorial for. by Kim M.Chmielewicz Kenneth F. Manly. Software for genetic mapping of Mendelian markers and quantitative trait loci.

Development of linkage map using Mapmaker/Exp3.0

Bayesian Multiple QTL Mapping

Estimating Variance Components in MMAP

Notes on QTL Cartographer

Haplotype Analysis. 02 November 2003 Mendel Short IGES Slide 1

Package DSPRqtl. R topics documented: June 7, Maintainer Elizabeth King License GPL-2. Title Analysis of DSPR phenotypes

Package dlmap. February 19, 2015

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1

Lecture 7 QTL Mapping

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie

PROC QTL A SAS PROCEDURE FOR MAPPING QUANTITATIVE TRAIT LOCI (QTLS)

QTL Analysis with QGene Tutorial

Supporting Figures Figure S1. FIGURE S2.

Intersection Tests for Single Marker QTL Analysis can be More Powerful Than Two Marker QTL Analysis

Package PedCNV. February 19, 2015

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual

Package REGENT. R topics documented: August 19, 2015

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011

User Manual for GIGI v1.06.1

Package EBglmnet. January 30, 2016

Effective Recombination in Plant Breeding and Linkage Mapping Populations: Testing Models and Mating Schemes

Simulation Study on the Methods for Mapping Quantitative Trait Loci in Inbred Line Crosses

Genet. Sel. Evol. 36 (2004) c INRA, EDP Sciences, 2004 DOI: /gse: Original article

THERE is a long history of work to map genetic loci (quantitative

Interpretation of Complex STR Results Using the Forensim Package

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc.

User Manual ixora: Exact haplotype inferencing and trait association

Tribble Genotypes and Phenotypes

Step-by-Step Guide to Advanced Genetic Analysis

CHAPTER 1: INTRODUCTION...

Step-by-Step Guide to Basic Genetic Analysis

Package Eagle. January 31, 2019

MAPPING quantitative trait loci (QTL) is important

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li

Tutorial of the Breeding Planner (BP) for Marker Assisted Recurrent Selection (MARS)

Step-by-Step Guide to Relatedness and Association Mapping Contents

Recalling Genotypes with BEAGLECALL Tutorial

Breeding Guide. Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel

simulation with 8 QTL

Breeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd

User s Guide. Version 2.2. Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario

And there s so much work to do, so many puzzles to ignore.

Importing and Merging Data Tutorial

Package MOJOV. R topics documented: February 19, 2015

To finish the current project and start a new project. File Open a text data

Parallel Algorithms and Implementations for Genetic Analysis of Quantitative Traits

GenomeMixer

USER-FRIENDLY SOFTWARE

GeneMarker HID Quick Start

Genome-Wide Association Study Using

PLNT4610 BIOINFORMATICS FINAL EXAMINATION

A practical example of tomato QTL mapping using a RIL population. R R/QTL

Family Based Association Tests Using the fbat package

GMDR User Manual Version 1.0

Network Based Models For Analysis of SNPs Yalta Opt

THE standard approach for mapping the genetic the binary trait, defined by whether or not an individual

MACAU User Manual. Xiang Zhou. March 15, 2017

Tutorial on gene-c ancestry es-ma-on: How to use LASER. Chaolong Wang Sequence Analysis Workshop June University of Michigan

REAP Software Documentation

Mendel and His Peas Investigating Monhybrid Crosses Using the Graphing Calculator

QUANTITATIVE genetics has developed rapidly, methods. Kruglyak and Lander (1995) apply the linear

Precision Mapping of Quantitative Trait Loci

Package GEM. R topics documented: January 31, Type Package

Polymorphism and Variant Analysis Lab

QTL Analysis and Linkage Library (QTL-ALL)

Package allehap. August 19, 2017

Welcome to the Quantitative Trait Loci (QTL) Tutorial

PLNT4610 BIOINFORMATICS FINAL EXAMINATION

MQLS-XM Software Documentation

STATISTICAL PACKAGE FOR AGRICULTURAL RESEARCH (SPAR 2.0)

Documentation for OptiMAS: a decision support tool for marker-assisted assembly of diverse alleles

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2.

citius A program to apply Markov chain Monte Carlo method for multilocus analysis of large complex pedigrees.

Package MultiMeta. February 19, 2015

GenViewer Tutorial / Manual

Package mqtl. May 25, 2018

GMMAT: Generalized linear Mixed Model Association Tests Version 0.7

GWAS Exercises 3 - GWAS with a Quantiative Trait

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017

USER S MANUAL FOR THE AMaCAID PROGRAM

Instructions for Curators: Entering new data into the QTLdb

Genetic Analysis. Page 1

HPC Course Session 3 Running Applications

Package SMAT. January 29, 2013

Biased estimators of quantitative trait locus heritability and location in interval mapping

PediHaplotyper Manual

Computational Biology and Chemistry

EMIM: Estimation of Maternal, Imprinting and interaction effects using Multinomial modelling

GS3. Andrés Legarra. March 5, Genomic Selection Gibbs Sampling Gauss Seidel

Linkage analysis with paramlink Appendix: Running MERLIN from paramlink

Package geneslope. October 26, 2016

Package lodgwas. R topics documented: November 30, Type Package

Package BGData. May 11, 2017

Transcription:

The Imprinting Model Version 1.0 Zhong Wang 1 and Chenguang Wang 2 1 Department of Public Health Science, Pennsylvania State University 2 Office of Surveillance and Biometrics, Center for Devices and Radiological Health, Food and Drug Administration, Washington DC 1. Introduction We develop a new computational model for quantifying the effects of genomic imprinting on complex traits and studying how genomic imprinting transmits from one generation to next. The model was founded on two-way reciprocal backcrosses which allow for separate tracing of allelic transmission from maternal and paternal parents. We use traditional quantitative genetic principles to define genetic parameters that describe genomic imprinting effects and their interactions with other types of genetic effects. A number of genetically meaningful hypothesis tests are generated to ask and address fundamental questions in quantitative inheritance and variation. This guide gives a brief instruction on how to perform the analysis. The outline of this guide is as follows. Section 2: Installation Section 3: Data formats Section 4: Main functions Section 6: Simulation Section 7: Function details Section 8: References We refer to Chenguang Wang el at (2011) for the theoretical foundation of this program. If you appreciate this software then please cite the following paper in your work: 1. Chenguang Wang, Zhong Wang, Li Wang, Junjia Zhu, Daniel R. Prows,Rongling Wu. A computational model for the inheritance pattern of genomic imprinting for complex traits. 2. Installation The program for imprinting model depends on the MASS package. This package is available on CRAN, so it should be installed firstly. It is unnecessary to install the program. After unzipping the program and copying them to a folder, it can be loaded into R as the following command. >source( imp2_model.r ); >source( imp2_plot.r ); 3. Data formats

Statgen.psu.edu The imprinting model can indentify QTL loci and perform hypothesis testing on the basis of data files containing the appropriately formated marker, genotypic and phenotypic data. The appropriate formatting for these two files is described below. 3.1. Marker file MarkId, chromosome, Distance, Name 1, 1, 0, D1Mit3 2, 1, 9.4 D1Mit20 In the marker file, the distance field is a distance (in cm) in one chromosome or linkage group. The header row should be included. 3.2. Data file SubjID, Phenotype, Cross, Marker 1, Marker 2, Marker 3, 1, 182.00, 1, 2, -1, 1, 1, 2, 162.50, 2,-1, 2, 2, 1, 3, 173.40, 8, 2, 2, 0, 0, Three genotypes (aa=0, Aa=1, AA=2) and missing data (coded as -1) are valid marker values. Eight mating types (1..8) are valid cross values. The phenotype column should be filled with numerical values. There are only limited data checks in the current version, and the program requires no missing data in the columns of phenotype and cross. 4. Main functions In general, the procedure flow includes the three steps: 1) Load dataset (marker data, cross data, phenotypic data and genotypic data) into a data object. 2) Scan all markers at every 2 cm step under the null hypothesis to obtain a result object. 3) Summary the result object and plot the LR2 profile for all chromosomes. In order to implement above flow, we need the following codes. dat <- IMP2.load( IMP2_marker.csv, IMP2_data.csv, options=list(log_phenol=1)); ret <- IMP2.hp_test(dat); summay(ret); Above test is performed under the default hypothesis which all genetic components are equal zero. The influences of these imprinting effects on genotypic value of AA, Aa, aa and aa can be tested by the different null hypothesis. For examples, the following snippet implements how to test the null hypothesis as the following: H 0 : i 1 = i 2 = I 1 = 0 ret <- IMP2.hp_test(dat, hp=c(5,6,13)); STATGEN.PSU.EDU - 2 -

Imprinting Model summay(ret); plot(ret) The indexes of genetic components of i 1, i 2 and I 1 are 5, 6 and 13, so a test set should be appended into the IMP2.hp_test() method for a special null hypothesis. The summary() method can find out the significant QTL loci from the results of hypothesis test and summarize the significant QTL information including the genetic components. The user can set the maximum QTL locations for the results. The plot() method can plot LR2 profiles for all chromosomes and some significant QTL loci. It also can export the figures into some PDF files. The following snippet shows how to do it. plot(ret, options=list(boutputpdf=1)); EM algorithm is employed to estimate the genetic components and variance for each backcross. We implements this algorithm in the EM_proc() method. The EM_proc() can be called by the user to estimate specific QTL loci or special null hypothesis. The following is a demonstration to show how to call it alone. pg_data <- cbind( dat$phen, dat$cross, dat$geno[,3], dat$geno[,4] ) d.marker <- ( m.right - m.left )/100; d.qtl <- ( qtl - m.left )/100; r0 <- IMP2.EM_proc( pg_data, d.marker, d.qtl, hp=c(5,6,13) ); r1 <- IMP2.EM_proc( pg_data, d.marker, d.qtl ); lr2 <- 2*(r1$l r0$l); 5. Simulation The simulation method is provided to test and verify the Imprinting Model. There are two methods for data simulation, the first is IMP2.param() to build a parameter object and the second is IMP2.simulate() to build a data object containing marker data, cross data, genotypic data and phenotypic data based on the parameter object. Once a parameter object is created, you can directly change the default values. The following table lists all items included in a parameter objects. These items are originally from the statistical model: Table 1: All Items of parameter object No Items Default Description 1 nsize 100 Sample size 2 h2 0.4 Heritabilities 3 qtl 50 Significant QTL loci 4 marker Vector Marker table for one chromosome. Default value is the following vector. c(0, 20, 40, 60, 80, 100) 5 mupar Vector<16> Genetic components. A vector which includes 16 components of u, a, d, i1, i2, i3, i4, i5, i6, i7, i8, I1, I2, I3, I4. Default value is the following vector. - 3 - STATEGEN.PSU.EDU

Statgen.psu.edu The following snippet demonstrates how to customize the parameter values. c( 4, -0.01, -0.02, 0.05, 0.15, -0.03, 0.10, 0.10, 0.02, -0.03, 0.10, 0.06, 0.01, -0.05, 0.05, 0.04) par <- IMP2.param(); par$nsize <- 200; par$h2 <- 0.1; dat <- IMP2.simulate(par); The program also provides one method to do a full simulation test. IMP2.simu_test() implements three steps in one function, including parameter building, data generating and hypothesis testing. The following is an example to use the IMP2.simu_test(). > ret <- IMP2.simu_test(); > summary(ret); 6. Function details No Function description IMP2.load (marker_file, data_file, options=list()) Load markers definition and phenotype/genotype data stored in CSV file. 1 marker_file: A CSV file which contains markers definition described in the section 3. data_file: A CSV file which contains phenotype, cross and genotype data described in the section 3. options: One option is available log_pheno: take a log operation before the phenotypic data is used. IMP2.hp_test(dat_obj, hp=c()) Scan all markers at every 2 cm step under the null hypothesis. 2 The dat_obj is a data object returned by IMP2.load() or IMP2.simulate() method. This function will return a result object which can be summarized by summary() command. The parameter of hp can describe a null hypothesis. The genetic components are µ, a, d, i, i 1, i 2, i 3, i 4, i 5, i 6, i 7, i 8, I 1, I 2, I 3 and I 4, and their corresponding indexes are 1,2, 16. So if a null hypothesis is the following components, the parameter of hp will be c(5,6,13). i 1 =i 2 =I 1 3 IMP2. EM_proc () STATGEN.PSU.EDU - 4 -

Imprinting Model EM algorithm. It estimates the genetic components and variance and obtains a likelihood value for a specific QTL locus. pg_data: a matrix including the columns of phenotype, cross, marker 1 and marker 2. d.marker: the distance of maker 1 and 2. d.qtl: the distance from the left flank marker to QTL locations hp: the null hypothesis, please refer to IMP2.hp_test(). IMP2.param() 4 Create a parameter object for data simulation. A parameter object is created and returned. IMP2.simulate(par_obj) 5 This is simulation function by a parameter object. Maker data, cross data, genotypic data and phenotypic data will be generated according to the parameter object value. A data object will be returned if this simulation is successful. summary(res_obj) 6 7 8 Summarize the significant QTL loci and plot LR2 profiles for all chromosomes and single significant QTL loci. plot(res _obj) Plot LR2 profiles for all chromosomes and single significant QTL loci. IMP2.simu_test() Test the model by simulation method. 7. References (1) Chenguang Wang, Zhong Wang, Li Wang, Junjia Zhu, Daniel R. Prows,Rongling Wu. A computational model for the inheritance pattern of genomic imprinting for complex traits. 8. Appendix: version history. 3/14/2011: Version 1.0 has been released. - 5 - STATEGEN.PSU.EDU