Manual code: MSU_pigs.R

Size: px
Start display at page:

Download "Manual code: MSU_pigs.R"

Transcription

1 Manual code: MSU_pigs.R Authors: Jose Luis Gualdrón Duarte 1 and Juan Pedro Steibel,3 1 Departamento de Producción Animal, Facultad de Agronomía, UBA-CONICET, Buenos Aires, ARG Department of Animal Science, Michigan State University, East Lansing, Michigan, USA 3 Department of Fisheries and Wildlife, Michigan State University, East Lansing, Michigan, USA Introduction The R code MSU_pigs.R is based on the paper Gualdrón et al This code uses a gpdata object called pigmsu as input file, which contains different files: genotype file, map, pedigree, and phenotype file. For further details of construction and structure of gpdata objects please refer to synbreed package in R (Wimmer et al. [1]).This package was used for assembling of gpdata pigmsu. Description of Input Files Each slot in pigmsu contains the following information: Genotype file (pigmsu$geno) In this file, animals IDs are row names and SNP names are column names. The genotypes are expressed as allelic dosage, having elements equal to 0,1,, i.e. the count of the allele used as reference, or a decimal number in the interval [0, ] for imputed genotypes.: MARC MARC ASGA ALGA01185 MARC M1GA Map file (pigmsu$map) This file contains SNP names as row names and also, chromosome and physical position expressed in Megabases as columns. chr pos MARC ASGA ASGA Pedigree file (pigmsu$pedigree) The pedigree file contains five columns: animal ID - Sire ID - Dam ID - generation - sex (1=male, =female). ID Par1 Par gener sex

2 Phenotype file (pigmsu$pheno) The trait file contains: animal ID in row names and the trait in column name: bf10_13wk Running MSU_pigs.R: Load required packages, functions and input files First of all, code MSU_pigs.R allows to load Regress package (version , R packages []) which is used in model fitting and variance components estimation process, performed by using REML algorithm.; and also, to load Synbreed package for data visualization and analysis of gpdata "pigmsu". library(synbreed) library(regress) Then, input gpdata (pigmsu) and functions (Functions_codes.R)for MSU_pigs.R are uploaded: load("pigmsu") source("functions_codes.r") # Load functions Filter process by Minor Allele Frequency (MAF) Genotypes in the F haves a second editing process considering MAF < 0.01, as well as fixed SNP (MAF= 0.5). As a result, a filtered matrix "M" that contains information for SNP is created. ### ## Filter by Minor allele frequency (MAF) in F genotypes # ## See Section: "Methods-Genotyping and data editing" in the reference paper # ### ## Genotype Matrix # ge<-pigmsu$geno IDrowM<-rownames(ge) IDrowM<-as.numeric(IDrowM) genf<-(idrowm>1000)&(idrowm<6000) Mf<-ge[genf,] # Genotype matrix for F animals m1<-colsums(mf) m<-m1/(*nrow(mf)) # Vector with frequency of second allele in each marker ## Filter by Minor allele frequency (MAF) and SNP fixed in the F genotypes MAF<-0.01 #Note: the addition of the filter "m<0.99" is because some SNPs are codificated as the major allele frequency f1<-(m>maf)&(m!=0.5) &(m<0.99) SNPf<-as.matrix(m[f1]) ## New genotype "M" matrix (SNP 44055) with genotypes filtered by MAF### M1<-colnames(Mf)%in%rownames(SNPf) M<-ge[,M1] # Total SNP for F0-F1-F generations

3 Construction of Z and G matrices Following the approach of VanRaden [3], Z matrix for F animals (Znf) is calculated applying the function "zstandard". Then, matrix G (GTo) for the same animalsis calculated using the Z matrix obtained previously (Znf). ################################ ##1) Computation of G Matrix ## ################################ IDrow<-rownames(M) IDrow<-as.numeric(IDrow) ## Frequency of P="" in the F0 genof0<-(idrow>6000) # logical vector with TRUE in F0 ids gen0<-m[genof0,] # Genotype for F0 (1st column ID - Genotype) all_frq_f0<-colmeans(gen0,na.rm=t)/ ## Computation of Z matrix for F idgf<-as.numeric(rownames(m)) idxg<-(idgf>1000)&(idgf<6000) # index for F animals ID genf<-m[idxg,] # genotype for F animals Znf<-zstandard(genf,alfreq=all_frq_f0,procedure="heterogeneous") ## Computation of G matrix: G=ZZ' GTo<-Znf%*%t(Znf) Fixed effects matrix construction From object "pigmsu" are extracted: 1) the growth trait 13 week tenth rib backfat (mm) (bf10_13wk) and ) sex for animals in the F, to be used as response variable (y) and incidence matrix fixed effects (X) for the model respectively. ####################################### ## ) Input files for funtion "snpe" ## ####################################### ##Phenotype pheno<-pigmsu$pheno[,"bf10_13wk",1] indx<-match(rownames(znf),names(pheno)) pheno<-pheno[indx] ##sex indx<-match(rownames(znf),pigmsu$pedigree$id) sex<-pigmsu$pedigree$sex[indx] #model.matrix to create matrix X sex<-as.factor(sex) x<-model.matrix( ~ sex -1, contrasts.arg=list(sex=contrasts(sex, contrasts=f))) #sex ID psex<-cbind(pigmsu$pedigree$id,pigmsu$pedigree$sex) #Extract ID and sex for all animals (F0-F1 and F) idxsex<-psex[,1]%in%names(pheno) sexid<-as.numeric(pigmsu$pedigree$id[idxsex]) rownames(x)<-sexid colnames(x)<-c("female","male")

4 Estimation of variance components and breeding values Once incidence matrix X is constructed, "snpe" function allows to fit the model y= X + a+ e. This function takes as input the phenotype file (y), X matrix, and also Z and G matrices. Internally, variance components are estimated by REML using regress package (version R []). ## 3) Apply function "snpe" ## # return: ## # g_hat=snpe,a_hat,e_variance,a_variance,heredability,#iteratons,ginv ## trout_snpe<-snpe(pheno,x,znf,gto) As a result, the function "snpe" gives: llik: Estimate of the LogLikelihood for the model a_hat: Genome breeding values (GEBVs). E_variance: A_variance: Error variance Additive variance e ) A) Heritability: Heritability of the trait n_iter: Ginv: The inverse of G matrix Number of iterations to converge. Here, a_hat contains the random breeding values, such that vector such that e N I, and I is the identity matrix. (0, e ) a N(0, G ), and e is the random error A Estimation of SNP effects The marker effect, variance of the markers effects, and p-values are calculated for each marker or SNP. For this purpose, the function "GWA" is applied using elements obtained in function "snpe". ## 4) Apply function "GWA" ## # return: ## # beta=uh ("g_hat"), snp_variance=vsnp, pvalues=pvalue ## gwa_trait<-gwa(trout_snpe,x,znf) As a result, the function "GWA" gives: beta: Estimate of each marker (SNP) effect snp_variance: Variance of each marker (SNP) effect pvalues: p-value for each marker (SNP) Histogram of p-values and Manhattan plot Here, the histogram of p-values and the Manhattan plot are displayed. However, for the Manhattan plot is necessary the absolute marker (SNP) position or consecutive position. Then, the function "abmap" is applied.

5 ############################## ## 5) Map Absolute Position ## ############################## map<-as.matrix(pigmsu$map) # Read map map1<-rownames(map)%in%colnames(znf) map<-map[map1,] # Final SNP #### Final map ##### mapmsu1<-abmap(map) # Apply funtion ab idxc<--map[,1]%% # Color index # histogram of p-values hist(gwa_trait$pvalues) # Manhattan Plot threshold<-0.05/nrow(map) #pdf(file="manhattan_trait.pdf") #option to save the Manhattan plot in format ".pdf" in the current directory plot(mapmsu1,-log(gwa_trait$pvalues,10),pch=16,col=ifelse(idxc==,"red","blue"),abline(h=log(threshold,10),lwd=1.1,col="red"),xlab="absolute position Mb", ylab="-log10(p-value)") #dev.off() #option to save the Manhattan plot in format ".pdf" in the current directory Definition of candidate segments Candidate segments are defined by taking SNPs within one Mb upstream and one Mb downstream of the SNP with smallest p-value in each chromosome (see list "maxlogpv"). ########################## ## 4) Candidate Segment ## ########################## pvalue<-gwa_trait$pvalues logpv<- -log(pvalue,10) assoc<-as.data.frame(cbind(map,mapmsu1,pvalue,logpv)) colnames(assoc)<-c("chr", "pos_mb","abspos_mb","pvalue","logpv") #Extract max -logpvalues per chromosome result <- vector("list",18) for(i in 1:18){ pvch<-assoc[assoc$chr==i,] maxpv<-which.max(pvch$logpv) mpv<-pvch[maxpv,] result[[i]] <-mpv } # List "maxlogpv" with the highest -log(p-value) per chromosome maxlogpv<-do.call(rbind,result) ########################################### ## 4.1) Extract segments per chromosome ## ########################################### #model_result<- vector("list",nrow(maxlogpv)) model_result<- vector("list",) # for(i in 1:nrow(maxlogpv)){ for(i in 1:){ #Example!!! Chromosome 1 and psnp<-maxlogpv[i,3] phigh<-as.numeric(psnp)+1 plow<-as.numeric(psnp)-1 #Extract SNP in this range slxmap<-(assoc[,3]<=phigh)&(assoc[,3]>=plow) slxmap<-assoc[slxmap,]

6 Computation of Z and G matrices for candidate segments Using the markers or SNPs into the segment, a new Z ("Z1") and G ("G1") matrices are calculated, whereas genomic relationship matrix G was built using all remaining SNPs. Next, the model equal to: y = X + a1+ a+ e ("model"), where a 1 is the vector of random effects associated with those SNP a1 N 0, G 1 A and a is the vector of additive random effects located in the segment, such that 1 associated with all SNPs except those involved with a 1, such that N 0, A a G. The model is compared with the reduce model y = X + a+ e ("model1"), from results obtained across the regress R package []. ################################################################### ## 4.) compute matrix Z for snp selected (normalized separtely) ## ################################################################### # Create Gseg for the segement using the columns in Znf belong to SNP segment Zidg<-colnames(Znf)%in%rownames(slxmap) Zs<-Znf[,Zidg] ################## Extrac for the GTo G<-GTo-(Zs%*%t(Zs)) ### compute matrix Z and G for snp selected in the segmwnt (normalized separtely) Zidx<-colnames(genf)%in%rownames(slxmap) genoffil<-genf[,zidx] idxfrq<-names(all_frq_f0)%in%rownames(slxmap) # Select the frequencies for the SNP segement sum(idxfrq==true) fil_frq_f0<-all_frq_f0[idxfrq] # Extract the frequencies for the SNP segment # Matrices Z1 and G1 (use the selected markers) #Apply the function for the SNP segement Z1<-zstandard(genoffil,alfreq=fil_frq_f0,procedure="heterogeneous") G1<-Z1%*%t(Z1) # G1 for SNP selected ########################################## ## 4.3) compute models with "regress" ## ########################################## model1<-regress(pheno~x,~gto) # y=xb+a+e model<-regress(pheno~x,~g1+g) # y=xb+a1+a+e ###################################### ## 4.4) Performed models comparison ## ###################################### llik<-list(result=list(l0=model1,l1=model)) ##################################################### ## 4.5) Results from Regress for model1 and model ## ##################################################### model_result[[i]]<-sapply(llik,summary.ll.fit) # Results refer to Additional files in "Table 1" } comp_model<-do.call(rbind,model_result)

7 The function summary.ll.fit list the results obtained by regress R package [] for "model1" and "model", as follow: Loglikem: LogLikem1: LRT: LRTseg: vare1: vara1: vare: VarA: LogLikehood "model" LogLikehood "model1" Likehood Ratio Test for "model1" and "model" p-value for Likehood Ratio Test for the segment Error variance e ) of "model1" Additive variance Error variance Additive variance of "model1" A) e ) of "model" A) of "model" Varseg: Additive variance segment A ) of "model" 1 Varseg_pr: Proportion in % of the total variance explained by the segment. Results for function summary.ll.fit are stored in the object "comp_model" To cite this code use: Gualdrón Duarte JL, Cantet RJC, Bates RO, Ernst CW, Raney NE, Steibel JP. Rapid screening for phenotype-genotype associations by linear transforming genomic evaluations. BMC Bioinformatics 014. (Submitted) References 1. Wimmer V, Albrecht T, Auinger H-J, Schön C-C: synbreed: a framework for the analysis of genomic prediction data using R. Bioinformatics 01, 8: Clifford D, McCullagh P: The regress function. R News 006, 6: VanRaden PM: Efficient methods to compute genomic predictions. J. Dairy Sci. 008, 91:

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017 BICF Nano Course: GWAS GWAS Workflow Development using PLINK Julia Kozlitina Julia.Kozlitina@UTSouthwestern.edu April 28, 2017 Getting started Open the Terminal (Search -> Applications -> Terminal), and

More information

Step-by-Step Guide to Basic Genetic Analysis

Step-by-Step Guide to Basic Genetic Analysis Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control

More information

Polymorphism and Variant Analysis Lab

Polymorphism and Variant Analysis Lab Polymorphism and Variant Analysis Lab Arian Avalos PowerPoint by Casey Hanson Polymorphism and Variant Analysis Matt Hudson 2018 1 Exercise In this exercise, we will do the following:. 1. Gain familiarity

More information

BAYESIAN STATISTIC COURSE. Valencia, 28 th January - 1 st February 2013 COMPUTER LAB NOTES. Agustín Blasco

BAYESIAN STATISTIC COURSE. Valencia, 28 th January - 1 st February 2013 COMPUTER LAB NOTES. Agustín Blasco BAYESIAN STATISTIC COURSE Valencia, 28 th January - st February 203 COMPUTER LAB NOTES Agustín Blasco 2 AN INTRODUCTION TO BAYESIAN ANALYSIS AND MCMC Computer Lab Notes Agustín Blasco Departamento de Ciencia

More information

MAGA: Meta-Analysis of Gene-level Associations

MAGA: Meta-Analysis of Gene-level Associations MAGA: Meta-Analysis of Gene-level Associations SYNOPSIS MAGA [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome DESCRIPTION

More information

Small example of use of OmicABEL

Small example of use of OmicABEL Small example of use of OmicABEL Yurii Aulchenko for the OmicABEL developers July 1, 2013 Contents 1 Important note on data format for OmicABEL 1 2 Outline of the example 2 3 Prepare the data for analysis

More information

SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data

SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data SEQGWAS: Integrative Analysis of SEQuencing and GWAS Data SYNOPSIS SEQGWAS [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome

More information

The fgwas Package. Version 1.0. Pennsylvannia State University

The fgwas Package. Version 1.0. Pennsylvannia State University The fgwas Package Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction The fgwas Package (Functional

More information

Package MultiMeta. February 19, 2015

Package MultiMeta. February 19, 2015 Type Package Package MultiMeta February 19, 2015 Title Meta-analysis of Multivariate Genome Wide Association Studies Version 0.1 Date 2014-08-21 Author Dragana Vuckovic Maintainer Dragana Vuckovic

More information

Estimating Variance Components in MMAP

Estimating Variance Components in MMAP Last update: 6/1/2014 Estimating Variance Components in MMAP MMAP implements routines to estimate variance components within the mixed model. These estimates can be used for likelihood ratio tests to compare

More information

GWAS Exercises 3 - GWAS with a Quantiative Trait

GWAS Exercises 3 - GWAS with a Quantiative Trait GWAS Exercises 3 - GWAS with a Quantiative Trait Peter Castaldi January 28, 2013 PLINK can also test for genetic associations with a quantitative trait (i.e. a continuous variable). In this exercise, we

More information

Package GHap. R topics documented: February 18, 2017

Package GHap. R topics documented: February 18, 2017 Type Package Title Genome-Wide Haplotyping Version 1.2.2 Date 2017-02-18 Author Yuri Tani Utsunomiya, Marco Milanesi Package GHap February 18, 2017 Maintainer Yuri Tani Utsunomiya

More information

GMDR User Manual Version 1.0

GMDR User Manual Version 1.0 GMDR User Manual Version 1.0 Oct 30, 2011 1 GMDR is a free, open-source interaction analysis tool, aimed to perform gene-gene interaction with generalized multifactor dimensionality methods. GMDR is being

More information

ASSIGNMENT Write a program shown in the textbook at page 10 and see what this program does. 2. Write a program that prints

ASSIGNMENT Write a program shown in the textbook at page 10 and see what this program does. 2. Write a program that prints ASSIGNMENT 1. Suggestion: Always use implicit none as the second line of the program to help the program detect undeclared variables. Program... implicit none... 1. Write a program shown in the textbook

More information

Step-by-Step Guide to Advanced Genetic Analysis

Step-by-Step Guide to Advanced Genetic Analysis Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options

More information

GS3. Andrés Legarra. March 5, Genomic Selection Gibbs Sampling Gauss Seidel

GS3. Andrés Legarra. March 5, Genomic Selection Gibbs Sampling Gauss Seidel GS3 Genomic Selection Gibbs Sampling Gauss Seidel Andrés Legarra March 5, 2008 andres.legarra [at] toulouse.inra.fr INRA, UR 631, F-31326 Auzeville, France 1 Contents 1 Introduction 3 1.1 History...............................

More information

Package lodgwas. R topics documented: November 30, Type Package

Package lodgwas. R topics documented: November 30, Type Package Type Package Package lodgwas November 30, 2015 Title Genome-Wide Association Analysis of a Biomarker Accounting for Limit of Detection Version 1.0-7 Date 2015-11-10 Author Ahmad Vaez, Ilja M. Nolte, Peter

More information

DeltaGen: Quick start manual

DeltaGen: Quick start manual 1 DeltaGen: Quick start manual Dr. Zulfi Jahufer & Dr. Dongwen Luo CONTENTS Page Main operations tab commands 2 Uploading a data file 3 Matching variable identifiers 4 Data check 5 Univariate analysis

More information

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual Miao-Xin Li, Jiang Li Department of Psychiatry Centre for Genomic Sciences Department

More information

User s Guide. Version 2.2. Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario

User s Guide. Version 2.2. Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario User s Guide Version 2.2 Semex Alliance, Ontario and Centre for Genetic Improvement of Livestock University of Guelph, Ontario Mehdi Sargolzaei, Jacques Chesnais and Flavio Schenkel Jan 2014 Disclaimer

More information

Supporting Figures Figure S1. FIGURE S2.

Supporting Figures Figure S1. FIGURE S2. Supporting Figures Figure S1. QTL scans for the 12 environments of the yield data for pepper simulated from the physiological genotype-to-phenotype model with seven physiological parameters (Rodrigues

More information

QTL DETECTION WITH GS3. A Legarra, 19/06/2013

QTL DETECTION WITH GS3. A Legarra, 19/06/2013 QTL DETECTION WITH GS3. A Legarra, 19/06/2013 Hélène Gilbert simulated a data set with 2 chromosomes and 5 QTLs at positions 20 cl, 73.14 cm on chromosome 1 and 20, 33.24 and 59.60 in chromosome 2. Each

More information

Genetic Analysis. Page 1

Genetic Analysis. Page 1 Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced

More information

LEA: An R Package for Landscape and Ecological Association Studies

LEA: An R Package for Landscape and Ecological Association Studies LEA: An R Package for Landscape and Ecological Association Studies Eric Frichot and Olivier François Université Grenoble-Alpes, Centre National de la Recherche Scientifique, TIMC-IMAG UMR 5525, Grenoble,

More information

GMMAT: Generalized linear Mixed Model Association Tests Version 0.7

GMMAT: Generalized linear Mixed Model Association Tests Version 0.7 GMMAT: Generalized linear Mixed Model Association Tests Version 0.7 Han Chen Department of Biostatistics Harvard T.H. Chan School of Public Health Email: hanchen@hsph.harvard.edu Matthew P. Conomos Department

More information

Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016

Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016 Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016 The Robust Variance Score (RVS) test is designed for association analysis for next generation

More information

GSCAN GWAS Analysis Plan, v GSCAN GWAS ANALYSIS PLAN, Version 1.0 October 6, 2015

GSCAN GWAS Analysis Plan, v GSCAN GWAS ANALYSIS PLAN, Version 1.0 October 6, 2015 GSCAN GWAS Analysis Plan, v0.5 1 Overview GSCAN GWAS ANALYSIS PLAN, Version 1.0 October 6, 2015 There are three major components to this analysis plan. First, genome-wide genotypes must be on the correct

More information

Bioinformatics - Homework 1 Q&A style

Bioinformatics - Homework 1 Q&A style Bioinformatics - Homework 1 Q&A style Instructions: in this assignment you will test your understanding of basic GWAS concepts and GenABEL functions. The materials needed for the homework (two datasets

More information

SUGEN 8.6 Overview. Misa Graff, July 2017

SUGEN 8.6 Overview. Misa Graff, July 2017 SUGEN 8.6 Overview Misa Graff, July 2017 General Information By Ran Tao, https://sites.google.com/site/dragontaoran/home Website: http://dlin.web.unc.edu/software/sugen/ Standalone command-line software

More information

Package GEM. R topics documented: January 31, Type Package

Package GEM. R topics documented: January 31, Type Package Type Package Package GEM January 31, 2018 Title GEM: fast association study for the interplay of Gene, Environment and Methylation Version 1.5.0 Date 2015-12-05 Author Hong Pan, Joanna D Holbrook, Neerja

More information

Dealing with heterogeneity: group-specific variances and stratified analyses

Dealing with heterogeneity: group-specific variances and stratified analyses Dealing with heterogeneity: group-specific variances and stratified analyses Tamar Sofer July 2017 1 / 32 The HCHS/SOL population is quite heterogeneous 1. Due to admixture: Hispanics are admixed with

More information

renumf90 Introduction to 5/27/2016 UGA 05/2016 Data File for blupf90 family: a) Only numbers Integer or real

renumf90 Introduction to 5/27/2016 UGA 05/2016 Data File for blupf90 family: a) Only numbers Integer or real Introduction to renumf90 UGA 05/2016 Data File for blupf90 family: a) Only numbers Integer or real a) All effects need to be renumbered from 1 consecutively Controlled by the same parameter file! 1 RENUMF90

More information

Bayesian Multiple QTL Mapping

Bayesian Multiple QTL Mapping Bayesian Multiple QTL Mapping Samprit Banerjee, Brian S. Yandell, Nengjun Yi April 28, 2006 1 Overview Bayesian multiple mapping of QTL library R/bmqtl provides Bayesian analysis of multiple quantitative

More information

CircosVCF workshop, TAU, 9/11/2017

CircosVCF workshop, TAU, 9/11/2017 CircosVCF exercise In this exercise, we will create and design circos plots using CircosVCF. We will use vcf files of a published case "X-linked elliptocytosis with impaired growth is related to mutated

More information

GS3. Andrés Legarra 1 2 Anne Ricard 3 4 Olivier Filangi 5 6. June 17, Genomic Selection Gibbs Sampling Gauss Seidel.

GS3. Andrés Legarra 1 2 Anne Ricard 3 4 Olivier Filangi 5 6. June 17, Genomic Selection Gibbs Sampling Gauss Seidel. GS3 Genomic Selection Gibbs Sampling Gauss Seidel (and BayesCπ) Andrés Legarra 1 2 Anne Ricard 3 4 Olivier Filangi 5 6 June 17, 2011 1 andres.legarra [at] toulouse.inra.fr 2 INRA, UR 631, F-31326 Auzeville,

More information

JMP Genomics. Release Notes. Version 6.0

JMP Genomics. Release Notes. Version 6.0 JMP Genomics Version 6.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP, A Business Unit of SAS SAS Campus Drive

More information

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS FVGWAS- 3.0 Manual Hongtu Zhu @ UNC BIAS Chao Huang @ UNC BIAS Nov 8, 2015 More and more large- scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical

More information

Package HMMASE. February 4, HMMASE R package

Package HMMASE. February 4, HMMASE R package Package HMMASE February 4, 2014 Type Package Title HMMASE R package Version 1.0 Date 2014-02-04 Author Juan R. Steibel, Heng Wang, Ping-Shou Zhong Maintainer Heng Wang An R package that

More information

Package inversion. R topics documented: July 18, Type Package. Title Inversions in genotype data. Version

Package inversion. R topics documented: July 18, Type Package. Title Inversions in genotype data. Version Package inversion July 18, 2013 Type Package Title Inversions in genotype data Version 1.8.0 Date 2011-05-12 Author Alejandro Caceres Maintainer Package to find genetic inversions in genotype (SNP array)

More information

PRSice: Polygenic Risk Score software v1.22

PRSice: Polygenic Risk Score software v1.22 PRSice: Polygenic Risk Score software v1.22 Jack Euesden jack.euesden@kcl.ac.uk Cathryn M. Lewis April 30, 2015 Paul F. O Reilly Contents 1 Overview 3 2 R packages required 3 3 Quickstart 3 3.1 Input Data...................................

More information

ASSIGNMENT Write a program shown in the textbook at page 10 and see what this program does. 2. Write a program that prints

ASSIGNMENT Write a program shown in the textbook at page 10 and see what this program does. 2. Write a program that prints ASSIGNMENT 1. Suggestion: Always use implicit none as the second line of the program to help the program detect undeclared variables. Program... implicit none... 1. Write a program shown in the textbook

More information

MACAU User Manual. Xiang Zhou. March 15, 2017

MACAU User Manual. Xiang Zhou. March 15, 2017 MACAU User Manual Xiang Zhou March 15, 2017 Contents 1 Introduction 2 1.1 What is MACAU...................................... 2 1.2 How to Cite MACAU................................... 2 1.3 The Model.........................................

More information

PRSice: Polygenic Risk Score software - Vignette

PRSice: Polygenic Risk Score software - Vignette PRSice: Polygenic Risk Score software - Vignette Jack Euesden, Paul O Reilly March 22, 2016 1 The Polygenic Risk Score process PRSice ( precise ) implements a pipeline that has become standard in Polygenic

More information

Package EMLRT. August 7, 2014

Package EMLRT. August 7, 2014 Package EMLRT August 7, 2014 Type Package Title Association Studies with Imputed SNPs Using Expectation-Maximization-Likelihood-Ratio Test LazyData yes Version 1.0 Date 2014-08-01 Author Maintainer

More information

ICAR Parentage Analysis Accreditation for DNA Data Interpretation Centres - Applicant's Guide -

ICAR Parentage Analysis Accreditation for DNA Data Interpretation Centres - Applicant's Guide - ICAR Parentage Analysis Accreditation for DNA Data Interpretation Centres - Applicant's Guide - Overview This document serves as a guide for organizations that submit an application to ICAR for its Parentage

More information

BEAGLECALL 1.0. Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington. 15 November 2010

BEAGLECALL 1.0. Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington. 15 November 2010 BEAGLECALL 1.0 Brian L. Browning Department of Medicine Division of Medical Genetics University of Washington 15 November 2010 BEAGLECALL 1.0 P a g e i Contents 1 Introduction... 1 1.1 Citing BEAGLECALL...

More information

GCTA: a tool for Genome- wide Complex Trait Analysis

GCTA: a tool for Genome- wide Complex Trait Analysis GCTA: a tool for Genome- wide Complex Trait Analysis Version 1.04, 13 Sep 2012 Overview GCTA (Genome- wide Complex Trait Analysis) is designed to estimate the proportion of phenotypic variance explained

More information

Quality control of array genotyping data with argyle Andrew P Morgan

Quality control of array genotyping data with argyle Andrew P Morgan Quality control of array genotyping data with argyle Andrew P Morgan 2015-10-08 Introduction Proper quality control of array genotypes is an important prerequisite to further analysis. Genotype quality

More information

Package SimGbyE. July 20, 2009

Package SimGbyE. July 20, 2009 Package SimGbyE July 20, 2009 Type Package Title Simulated case/control or survival data sets with genetic and environmental interactions. Author Melanie Wilson Maintainer Melanie

More information

Click on "+" button Select your VCF data files (see #Input Formats->1 above) Remove file from files list:

Click on + button Select your VCF data files (see #Input Formats->1 above) Remove file from files list: CircosVCF: CircosVCF is a web based visualization tool of genome-wide variant data described in VCF files using circos plots. The provided visualization capabilities, gives a broad overview of the genomic

More information

Step-by-Step Guide to Relatedness and Association Mapping Contents

Step-by-Step Guide to Relatedness and Association Mapping Contents Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...

More information

Vignette for the package rehh (version 2+) Mathieu Gautier, Alexander Klassmann and Renaud Vitalis 24/10/2016

Vignette for the package rehh (version 2+) Mathieu Gautier, Alexander Klassmann and Renaud Vitalis 24/10/2016 Vignette for the package rehh (version 2+) Mathieu Gautier, Alexander Klassmann and Renaud Vitalis 24/10/2016 Contents 1 Input Files 2 1.1 Haplotype data file..........................................

More information

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc.

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Overview This script converts allelic dosage values to genotypes based on user-specified thresholds. The dosage data may be in

More information

Package PedCNV. February 19, 2015

Package PedCNV. February 19, 2015 Type Package Package PedCNV February 19, 2015 Title An implementation for association analysis with CNV data. Version 0.1 Date 2013-08-03 Author, Sungho Won and Weicheng Zhu Maintainer

More information

U pública mais antiga dos EUA: k graduandos 9k pós-graduandos Tuition: 26k 46k

U pública mais antiga dos EUA: k graduandos 9k pós-graduandos Tuition: 26k 46k U pública mais antiga dos EUA: 1785 Top 16 nos EUA 29k graduandos 9k pós-graduandos Tuition: 26k 46k Estádio para 92k pessoas Jogos Olímpicos de 1996 (Atlanta) Futebol Americano Salário do Reitor? Salário

More information

Association Analysis of Sequence Data using PLINK/SEQ (PSEQ)

Association Analysis of Sequence Data using PLINK/SEQ (PSEQ) Association Analysis of Sequence Data using PLINK/SEQ (PSEQ) Copyright (c) 2018 Stanley Hooker, Biao Li, Di Zhang and Suzanne M. Leal Purpose PLINK/SEQ (PSEQ) is an open-source C/C++ library for working

More information

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing

More information

MAGMA manual (version 1.06)

MAGMA manual (version 1.06) MAGMA manual (version 1.06) TABLE OF CONTENTS OVERVIEW 3 QUICKSTART 4 ANNOTATION 6 OVERVIEW 6 RUNNING THE ANNOTATION 6 ADDING AN ANNOTATION WINDOW AROUND GENES 7 RESTRICTING THE ANNOTATION TO A SUBSET

More information

Package GWAF. March 12, 2015

Package GWAF. March 12, 2015 Type Package Package GWAF March 12, 2015 Title Genome-Wide Association/Interaction Analysis and Rare Variant Analysis with Family Data Version 2.2 Date 2015-03-12 Author Ming-Huei Chen

More information

VIII/2015 TECHNICAL REFERENCE GUIDE FOR

VIII/2015 TECHNICAL REFERENCE GUIDE FOR MiX99 Solving Large Mixed Model Equations Release VIII/2015 TECHNICAL REFERENCE GUIDE FOR MiX99 PRE-PROCESSOR Copyright 2015 Last update: Aug 2015 Preface Development of MiX99 was initiated to allow more

More information

The Lander-Green Algorithm in Practice. Biostatistics 666

The Lander-Green Algorithm in Practice. Biostatistics 666 The Lander-Green Algorithm in Practice Biostatistics 666 Last Lecture: Lander-Green Algorithm More general definition for I, the "IBD vector" Probability of genotypes given IBD vector Transition probabilities

More information

Cover Page. The handle holds various files of this Leiden University dissertation.

Cover Page. The handle   holds various files of this Leiden University dissertation. Cover Page The handle http://hdl.handle.net/1887/32015 holds various files of this Leiden University dissertation. Author: Akker, Erik Ben van den Title: Computational biology in human aging : an omics

More information

User Manual ixora: Exact haplotype inferencing and trait association

User Manual ixora: Exact haplotype inferencing and trait association User Manual ixora: Exact haplotype inferencing and trait association June 27, 2013 Contents 1 ixora: Exact haplotype inferencing and trait association 2 1.1 Introduction.............................. 2

More information

MAGMA manual (version 1.05)

MAGMA manual (version 1.05) MAGMA manual (version 1.05) TABLE OF CONTENTS OVERVIEW 3 QUICKSTART 4 ANNOTATION 6 OVERVIEW 6 RUNNING THE ANNOTATION 6 ADDING AN ANNOTATION WINDOW AROUND GENES 7 RESTRICTING THE ANNOTATION TO A SUBSET

More information

GCTA: a tool for Genome- wide Complex Trait Analysis

GCTA: a tool for Genome- wide Complex Trait Analysis GCTA: a tool for Genome- wide Complex Trait Analysis Version 1.24, 28 July 2014 Overview GCTA (Genome- wide Complex Trait Analysis) was originally designed to estimate the proportion of phenotypic variance

More information

Genetic type 1 Error Calculator (GEC)

Genetic type 1 Error Calculator (GEC) Genetic type 1 Error Calculator (GEC) (Version 0.2) User Manual Miao-Xin Li Department of Psychiatry and State Key Laboratory for Cognitive and Brain Sciences; the Centre for Reproduction, Development

More information

bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016

bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016 bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016 1 Introduction bimm is a software package to efficiently estimate variance parameters of a bivariate lineax mixed

More information

QUICKTEST user guide

QUICKTEST user guide QUICKTEST user guide Toby Johnson Zoltán Kutalik December 11, 2008 for quicktest version 0.94 Copyright c 2008 Toby Johnson and Zoltán Kutalik Permission is granted to copy, distribute and/or modify this

More information

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2.

Package ridge. R topics documented: February 15, Title Ridge Regression with automatic selection of the penalty parameter. Version 2. Package ridge February 15, 2013 Title Ridge Regression with automatic selection of the penalty parameter Version 2.1-2 Date 2012-25-09 Author Erika Cule Linear and logistic ridge regression for small data

More information

LD vignette Measures of linkage disequilibrium

LD vignette Measures of linkage disequilibrium LD vignette Measures of linkage disequilibrium David Clayton June 13, 2018 Calculating linkage disequilibrium statistics We shall first load some illustrative data. > data(ld.example) The data are drawn

More information

The Imprinting Model

The Imprinting Model The Imprinting Model Version 1.0 Zhong Wang 1 and Chenguang Wang 2 1 Department of Public Health Science, Pennsylvania State University 2 Office of Surveillance and Biometrics, Center for Devices and Radiological

More information

Package coloc. February 24, 2018

Package coloc. February 24, 2018 Type Package Package coloc February 24, 2018 Imports ggplot2, snpstats, BMA, reshape, methods, flashclust, speedglm Suggests knitr, testthat Title Colocalisation Tests of Two Genetic Traits Version 3.1

More information

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 CTL mapping in R Danny Arends, Pjotr Prins, and Ritsert C. Jansen University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 First written: Oct 2011 Last modified: Jan 2018 Abstract: Tutorial

More information

Importing and Merging Data Tutorial

Importing and Merging Data Tutorial Importing and Merging Data Tutorial Release 1.0 Golden Helix, Inc. February 17, 2012 Contents 1. Overview 2 2. Import Pedigree Data 4 3. Import Phenotypic Data 6 4. Import Genetic Data 8 5. Import and

More information

A short manual for LFMM (command-line version)

A short manual for LFMM (command-line version) A short manual for LFMM (command-line version) Eric Frichot efrichot@gmail.com April 16, 2013 Please, print this reference manual only if it is necessary. This short manual aims to help users to run LFMM

More information

Estimating. Local Ancestry in admixed Populations (LAMP)

Estimating. Local Ancestry in admixed Populations (LAMP) Estimating Local Ancestry in admixed Populations (LAMP) QIAN ZHANG 572 6/05/2014 Outline 1) Sketch Method 2) Algorithm 3) Simulated Data: Accuracy Varying Pop1-Pop2 Ancestries r 2 pruning threshold Number

More information

Package FREGAT. April 21, 2017

Package FREGAT. April 21, 2017 Title Family REGional Association Tests Version 1.0.3 Package FREGAT April 21, 2017 Author Nadezhda M. Belonogova and Gulnara R. Svishcheva , with contributions from:

More information

SNP HiTLink Manual. Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1

SNP HiTLink Manual. Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1 SNP HiTLink Manual Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1 1 Department of Neurology, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan 2 Dynacom Co., Ltd, Kanagawa,

More information

Package EBglmnet. January 30, 2016

Package EBglmnet. January 30, 2016 Type Package Package EBglmnet January 30, 2016 Title Empirical Bayesian Lasso and Elastic Net Methods for Generalized Linear Models Version 4.1 Date 2016-01-15 Author Anhui Huang, Dianting Liu Maintainer

More information

The fgwas software. Version 1.0. Pennsylvannia State University

The fgwas software. Version 1.0. Pennsylvannia State University The fgwas software Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction Genome-wide association studies

More information

Large scale SNP data management and integration in breeding programs

Large scale SNP data management and integration in breeding programs Large scale SNP data management and integration in breeding programs Eildert Groeneveld Institute of Farm Animal Genetics TheSNPpit: Who is it for? Needs to manage many SNPs Uses standardized workflows

More information

Recalling Genotypes with BEAGLECALL Tutorial

Recalling Genotypes with BEAGLECALL Tutorial Recalling Genotypes with BEAGLECALL Tutorial Release 8.1.4 Golden Helix, Inc. June 24, 2014 Contents 1. Format and Confirm Data Quality 2 A. Exclude Non-Autosomal Markers......................................

More information

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili: Introdution WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 What will we offer? Practical: Sorry, no practical sessions today, please refer to the summer school, organized

More information

Computer Users: How to Get Started

Computer Users: How to Get Started UC Davis Procedures Effective July 1 2013, ADCA members may use University of California - Davis (UC Davis) as an approved lab for genotyping/parentage verification. The process is much the same as you

More information

Notes on QTL Cartographer

Notes on QTL Cartographer Notes on QTL Cartographer Introduction QTL Cartographer is a suite of programs for mapping quantitative trait loci (QTLs) onto a genetic linkage map. The programs use linear regression, interval mapping

More information

PRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP

PRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP PRACTICAL SESSION 8 SEQUENCE-BASED ASSOCIATION, INTERPRETATION, VISUALIZATION USING EPACTS JAN 7 TH, 2014 STOM 2014 WORKSHOP HYUN MIN KANG UNIVERSITY OF MICHIGAN, ANN ARBOR EPACTS ASSOCIATION ANALYSIS

More information

A practical example of tomato QTL mapping using a RIL population. R R/QTL

A practical example of tomato QTL mapping using a RIL population. R  R/QTL A practical example of tomato QTL mapping using a RIL population R http://www.r-project.org/ R/QTL http://www.rqtl.org/ Dr. Hamid Ashrafi UC Davis Seed Biotechnology Center Breeding with Molecular Markers

More information

Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual. Updated June, 2017

Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual. Updated June, 2017 Forensic Resource/Reference On Genetics knowledge base: FROG-kb User s Manual Updated June, 2017 Table of Contents 1. Introduction... 1 2. Accessing FROG-kb Home Page and Features... 1 3. Home Page and

More information

Package LEA. April 23, 2016

Package LEA. April 23, 2016 Package LEA April 23, 2016 Title LEA: an R package for Landscape and Ecological Association Studies Version 1.2.0 Date 2014-09-17 Author , Olivier Francois

More information

MQLS-XM Software Documentation

MQLS-XM Software Documentation MQLS-XM Software Documentation Version 1.0 Timothy Thornton 1 and Mary Sara McPeek 2,3 Department of Biostatistics 1 The University of Washington Departments of Statistics 2 and Human Genetics 3 The University

More information

Random Forest in Genomic Selection

Random Forest in Genomic Selection Random Forest in genomic selection 1 Dpto Mejora Genética Animal, INIA, Madrid; Universidad Politécnica de Valencia, 20-24 September, 2010. Outline 1 Remind 2 Random Forest Introduction Classification

More information

USER-FRIENDLY SOFTWARE

USER-FRIENDLY SOFTWARE USER-FRENDLY SOFTWARE for large genetic evaluation systems Manual V2.0 2016 01 MiXBLUP User s Guide MiXBLUP, the Mixed-model Best Linear Unbiased Prediction software for PCs for large genetic evaluation

More information

ELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2

ELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2 ELAI user manual Yongtao Guan Baylor College of Medicine Version 1.0 25 June 2015 Contents 1 Copyright 2 2 What ELAI Can Do 2 3 A simple example 2 4 Input file formats 3 4.1 Genotype file format....................................

More information

Package Eagle. January 31, 2019

Package Eagle. January 31, 2019 Type Package Package Eagle January 31, 2019 Title Multiple Locus Association Mapping on a Genome-Wide Scale Version 1.3.0 Maintainer Andrew George Author Andrew George [aut, cre],

More information

SKAT Package. Seunggeun (Shawn) Lee. July 21, 2017

SKAT Package. Seunggeun (Shawn) Lee. July 21, 2017 SKAT Package Seunggeun (Shawn) Lee July 21, 2017 1 Overview SKAT package has functions to 1) test for associations between SNP sets and continuous/binary phenotypes with adjusting for covariates and kinships

More information

Package MOJOV. R topics documented: February 19, 2015

Package MOJOV. R topics documented: February 19, 2015 Type Package Title Mojo Variants: Rare Variants analysis Version 1.0.1 Date 2013-02-25 Author Maintainer Package MOJOV February 19, 2015 A package for analysis between rare variants

More information

Breeding Guide. Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel

Breeding Guide. Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel Breeding Guide Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel www.phenome-netwoks.com Contents PHENOME ONE - INTRODUCTION... 3 THE PHENOME ONE LAYOUT... 4 THE JOBS ICON...

More information

Affymetrix GeneChip DNA Analysis Software

Affymetrix GeneChip DNA Analysis Software Affymetrix GeneChip DNA Analysis Software User s Guide Version 3.0 For Research Use Only. Not for use in diagnostic procedures. P/N 701454 Rev. 3 Trademarks Affymetrix, GeneChip, EASI,,,, HuSNP, GenFlex,

More information

Documentation for FCgene

Documentation for FCgene Documentation for FCgene Nab Raj Roshyara Email: roshyara@yahoo.com Universitaet Leipzig Leipzig Research Center for Civilization Diseases(LIFE) Institute for Medical Informatics, Statistics and Epidemiology

More information

Package dlmap. February 19, 2015

Package dlmap. February 19, 2015 Type Package Title Detection Localization Mapping for QTL Version 1.13 Date 2012-08-09 Author Emma Huang and Andrew George Package dlmap February 19, 2015 Maintainer Emma Huang Description

More information